automatic speech recognition (ASR)

Projects with this topic

projs / asr / en / noisy-vctk

Noisy-VCTK: Noisy subset of VCTK (Voice Cloning Toolkit) dataset from CSTR.

Updated Nov 19, 2020

projs / asr / en / libritts

LibriTTS: text-to-speech (TTS) corpus derived from LibriSpeech (SLR60).

Updated Nov 19, 2020

projs / asr / en / arctic

CMU_ARCTIC dataset from CMU FestVox project (www.festvox.org/cmu_arctic).

Updated Nov 19, 2020

projs / asr / en / cmu_sin

CMU_SIN (speech-in-noise) dataset of Lombard speech from CMU FestVox project.

Updated Nov 19, 2020

projs / asr / en / commonvoice

commonvoice: Common Voice dataset of crowdsourced speech from Mozilla.

Updated Nov 19, 2020

projs / asr / en / dr-vctk

DR-VCTK: device-recorded Voice Cloning Toolkit (DR-VCTK) dataset from CSTR.

Updated Nov 19, 2020

projs / asr / en / vctk

VCTK: Voice Cloning Toolkit dataset from CSTR, Edinburgh [Veaux et al. 2013].

Updated Nov 19, 2020

projs / asr / en / voxforge

VoxForge: free, open-source ASR dataset of crowdsourced speech from voxforge.org.

Updated Nov 19, 2020

projs / asr / en / reddots

RedDots: corpus of short-duration utterances from mobile apps (sites.google.com/site/thereddotsproject).

Updated Nov 11, 2020

projs / asr / en / fred

FRED: Freiburg English Dialect Corpus Sampler (FRED-S) of British English interviews.

Updated Oct 28, 2020

projs / asr / en / wsj

WSJ: Wall Street Journal corpus from ARPA in 1992, 1994 (LDC93S6A, LDC94S13A).

Updated Oct 28, 2020

projs / asr / en / aesl

AESL: American English Spoken Lexicon from 1 female speaker (LDC99L23).

Updated Oct 28, 2020

projs / asr / en / tatoeba

Tatoeba: English sentences from the Tatoeba Project (https://tatoeba.org/eng).

Updated Oct 28, 2020

projs / asr / en / fluentcmd

fluentcmd: Fluent Speech Commands Dataset for spoken language understanding (SLU) from fluent.ai.

Updated Oct 28, 2020

projs / asr / en / heysnips2

heysnips2: Hey Snips Dataset 2 for keyword spotting (KWS) from Sonos [Leroy et al. 2019].

Updated Oct 28, 2020

projs / asr / en / lj

LJ: LJ (Linda Johnson) Speech Corpus (v. 1.1), often used for TTS.

Updated Oct 28, 2020

projs / asr / en / heysnips1

heysnips1: Hey Snips Dataset 1 for KWS from Sonos [Coucke et al. 2019].

Updated Oct 28, 2020

projs / asr / en / audiomnist

AudioMNIST: free dataset of spoken digits (0-9) from 60 speakers.

Updated Oct 28, 2020

projs / asr / en / swbd1

Switchboard-1, release 2: widely used ASR dataset of conversational telephone speech (CTS) from the 1990s (LDC97S62).

Updated Oct 28, 2020

projs / asr / en / tedlium1

TED-LIUM 1: Release 1 of TED talk corpus from LIUM (SLR7).

Updated Oct 28, 2020
