automatic speech recognition (ASR)
Projects with this topic
Speech-to-text (STT) transcription project focused on accurate speech recognition.
C# library that provides an easy-to-use abstraction over the Vosk speech recognition toolkit.
This repository archives the code used during the ECPC 2022 performance, which took place at the Paris Fine Art School in collaboration with IRCAM.
LEM speech recognition device, designed for a Signal Processing lecture.
This repository provides resources for a Quick Start guide to connecting Amazon Connect with the Xdroid platform for post-call analytics. The intended audience is system administrators who manage and configure the AWS Amazon Connect instance, as well as system architects and support engineers.
Models trained with Kaldi
TIMIT: famous corpus of American English with phone-level transcriptions (LDC93S1).
LibriSpeech: large ASR data set of read books (SLR12) [Panayotov et al. 2015].
AN4: Alphanumeric or "census" database from CMU [Acero 1993].
Buckeye: Buckeye Speech Corpus (release 2) of interviews from Ohio State.
Proof-of-concept (POC) app for the Aida English app.
IndicTTS: Indian English speech from the IIT TTS Team.
EMIME Bilingual {Finnish,German,Mandarin}/English database (www.emime.org).
UCAM Bilingual database from EMIME (www.emime.org).
CRM (coordinate response measure) corpus [Bolia et al. 2000].
GRID: audiovisual corpus of grid-related commands, from Univ. Sheffield.
Lombard Grid: extension of Grid corpus with Lombard and normal speech.
CTIMIT: TIMIT played through cellphone network (LDC96S30).