automatic speech recognition (ASR)

Projects with this topic

Sébastien Demanou / Oremi Andika

Empowering seamless transcription with cutting-edge STT (Speech-to-Text) technology, revolutionizing interaction through accurate speech recognition

stt automatic sp... speech-to-text

0

Updated Jun 06, 2024

0 0 0 0

Updated Jun 06, 2024
T

projs / asr / en / timit

TIMIT: famous corpus of American English with phone-level transcriptions (LDC93S1).

automatic sp...

0

Updated Sep 22, 2021

0 0 0 0

Updated Sep 22, 2021
L

projs / asr / en / librispeech

LibriSpeech: large ASR data set of read books (SLR12) [Panayotov et al. 2015].

automatic sp...

0

Updated Sep 22, 2021

0 1 0 0

Updated Sep 22, 2021
A

projs / asr / en / an4

AN4: Alphanumeric or "census" database from CMU [Acero 1993].

automatic sp...

0

Updated Jun 29, 2021

0 0 0 0

Updated Jun 29, 2021
B

projs / asr / en / buckeye

Buckeye: Buckeye Speech Corpus (release 2) of interviews from Ohio State.

automatic sp...

0

Updated Jun 28, 2021

0 0 0 0

Updated Jun 28, 2021
P

projs / pearson / poc

Proof-of-concept (POC) app towards Aida English app.

automatic sp...

1

Updated Apr 08, 2021

1

Updated Apr 08, 2021
I

projs / asr / en / indictts

IndicTTS: Indian English speech from the IIT TTS Team.

automatic sp...

0

Updated Dec 03, 2020

0 1 0 0

Updated Dec 03, 2020
E

projs / asr / en / emime

EMIME Bilingual {Finnish,German,Mandarin}/English database (www.emime.org).

automatic sp...

0

Updated Nov 28, 2020

0 1 0 0

Updated Nov 28, 2020
U

projs / asr / en / ucam

UCAM Bilingual database from EMIME (www.emime.org).

automatic sp...

0

Updated Nov 27, 2020

0 0 0 0

Updated Nov 27, 2020
C

projs / asr / en / crm

CRM (coordinate response number) corpus [Bolia et al. 2000].

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
G

projs / asr / en / grid

GRID: audiovisual corpus of grid-related commands, from Univ. Sheffield.

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
L

projs / asr / en / lombard

Lombard Grid: extension of Grid corpus with Lombard and normal speech.

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
C

projs / asr / en / ctimit

CTIMIT: TIMIT played through cellphone network (LDC96S30).

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
N

projs / asr / en / noisy-vctk

Noisy-VCTK: Noisy subset of VCTK (Voice Cloning Toolkit) dataset from CSTR.

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
L

projs / asr / en / libritts

LibriTTS: Librispeech for text-to-speech (TTS) corpus (SLR60).

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
A

projs / asr / en / arctic

CMU_ARCTIC dataset from CMU FestVox project (www.festvox.org/cmu_arctic).

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
C

projs / asr / en / cmu_sin

CMU_SIN (speech-in-noise) dataset of Lombard speech from CMU FestVox project.

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
C

projs / asr / en / commonvoice

commonvoice: Common Voice dataset of crowdsourced speech from Mozilla.

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
D

projs / asr / en / dr-vctk

DR-VCTK: device-recorded Voice Cloning Toolkit (DR-VCTK) dataset from CSTR.

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020
V

projs / asr / en / vctk

VCTK: Voice Cloning Toolkit dataset from CSTR, Edinburgh [Veaux et al. 2013].

automatic sp...

0

Updated Nov 19, 2020

0 0 0 0

Updated Nov 19, 2020