automatic speech recognition (ASR)

Projects with this topic

alphaspeech / alphaspeech-python

This project provides a client package and example scripts to access the alphaspeech pro ASR APIs.

automatic sp... speech recog... text-to-speech

Updated Nov 15, 2024

projs / asr / en / wsj

WSJ: Wall Street Journal corpus from ARPA in 1992, 1994 (LDC93S6A, LDC94S13A).

Updated Nov 14, 2024

projs / asr / en / vystadial

Vystadial: English part of Vystadial CTS corpus from Prague [Korvas et al. 2014].

Updated Nov 14, 2024

projs / asr / en / voxforge

VoxForge: free, open-source ASR dataset of crowdsourced speech from voxforge.org.

Updated Nov 14, 2024

projs / asr / en / vctk

VCTK: Voice Cloning Toolkit dataset from CSTR, Edinburgh [Veaux et al. 2013].

Updated Nov 14, 2024

projs / asr / en / timit

TIMIT: famous corpus of American English with phone-level transcriptions (LDC93S1).

Updated Nov 14, 2024

projs / asr / en / tedlium3

TED-LIUM 3: Release 3 of TED talk corpus from LIUM (SLR51).

Updated Nov 14, 2024

projs / asr / en / tedlium2

TED-LIUM 2: Release 2 of TED talk corpus from LIUM (SLR19).

Updated Nov 14, 2024

projs / asr / en / tedlium1

TED-LIUM 1: Release 1 of TED talk corpus from LIUM (SLR7).

Updated Nov 14, 2024

projs / asr / en / tatoeba

Tatoeba: Tatoeba Project of English sentences (https://tatoeba.org/eng).

Updated Nov 14, 2024

projs / asr / en / synthcmd

synthcmd: Synthetic Speech Commands Dataset from Kaggle.

Updated Nov 14, 2024

projs / asr / en / swc

SWC: Spoken Wikipedia Corpus, crowdsourced speech of read Wikipedia articles.

Updated Nov 14, 2024

projs / asr / en / swbd1

Switchboard-1, release 2: famous ASR data set of CTS from the 1990s (LDC97S62).

Updated Nov 14, 2024

projs / asr / en / st-aeds

ST-AEDS: Surfingtech American English Dataset of cellphone speech (SLR45).

Updated Nov 14, 2024

projs / asr / en / speechcmd

speechcmd: Speech Commands Dataset from Google [Warden 2018].

Updated Nov 14, 2024

projs / asr / en / snips

Snips: SLU dataset from Snips (now part of Sonos) [Saade et al. 2019].

Updated Nov 14, 2024

projs / asr / en / rt03

rt03: NIST 2003 Rich Transcription Evaluation Data for CTS (LDC2007S10).

Updated Nov 14, 2024

projs / asr / en / rm

RM: Resource Management v. 2.0 corpus from DARPA in the 1990s (LDC93S3A).

Updated Nov 14, 2024

projs / asr / en / reddots

RedDots: corpus of short-duration utterances from mobile apps (sites.google.com/site/thereddotsproject).

Updated Nov 14, 2024

projs / asr / en / pda

PDA: Personal Digital Assistant speech dataset from CMU.

Updated Nov 14, 2024
