All Projects → robmsmt → ASR-Audio-Data-Links

robmsmt / ASR-Audio-Data-Links

Licence: Apache-2.0 license
A list of publically available audio data that anyone can download for ASR or other speech activities

Programming Languages

shell
77523 projects

Projects that are alternatives of or similar to ASR-Audio-Data-Links

wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+14.53%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-28.49%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+14.53%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-61.45%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-68.16%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
Lingvo
Lingvo
Stars: ✭ 2,361 (+1218.99%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-31.28%)
Mutual labels:  speech, speech-recognition, speech-to-text, asr
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (-7.82%)
Mutual labels:  speech, speech-recognition, speech-to-text
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1071.51%)
Mutual labels:  speech, speech-recognition, asr
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-88.27%)
Mutual labels:  speech-recognition, speech-to-text, asr
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+726.26%)
Mutual labels:  speech, speech-recognition, asr
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-80.45%)
Mutual labels:  speech, speech-recognition, speech-to-text
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+581.01%)
Mutual labels:  speech, speech-recognition, speech-to-text
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (-2.23%)
Mutual labels:  speech, speech-recognition, asr
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+97.77%)
Mutual labels:  speech-recognition, speech-to-text, asr
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+36.87%)
Mutual labels:  speech, speech-to-text, asr
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-70.39%)
Mutual labels:  speech, speech-recognition, speech-to-text
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+3372.63%)
Mutual labels:  speech, speech-recognition, speech-to-text
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+322.35%)
Mutual labels:  speech, speech-recognition, asr
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+6129.61%)
Mutual labels:  speech, speech-recognition, speech-to-text

Audio Data Links

A list of common publically (and privately) available audio data that you can download for ASR or other speech activities. All your WERs are belong to us. Inspired by wer are we who stole someone elses joke.

1. FREE

Source Name & Direct Link Type Size(Hours)
OpenSLR LibriSpeech - Train:100 360 500
Test:Clean Other Dev:Clean Other
Read 960
OpenSLR TED-LIUM Release 2 Read 118
OpenSLR TED-LIUM Release 3 Read 452
Voxforge Voxforge English Read 130
Mozilla Common Voice v1 Read 500
Mozilla Common Voice en_1087h_2019-06-12 Read 1,087
Tatoeba Tatoeba Audio Eng Read ~200
Valentini Noisy Speech Database All Files, DOI Read TBC
VOiCES Complex Environmental Settings All Files Read
LibriSpeech
15
ai4bharat NPTEL2020
en-IN Torrent
Lectures 15,700
Opencollective open_stt
Russian Torrent
Various Read/Presented 20,108
Speechcolab GigaSpeech
Link
Various Read/Presented 33,000 Unlabeled
10,000 Labeled

2. PAID

Source Name Type Size(Hours) Code
LDC Fisher Conversational 2000 Speech LDC2004S13 LDC2005S13
Transcripts LDC2004T19 LDC2005T19
LDC Switchboard Hub 500 Conversational 240 LDC2002S09
LDC Switchboard Release 2 Conversational 300 LDC97S62
LDC TIMIT Read 5 LDC93S1
LDC Wall Street Journal (WSJ) Read 80 LDC93S6A or LDC93S6B

TTS

1. FREE

Source Name & Direct Link Type Size(Hours)
Edinburgh CSTR CSTR VCTK Corpus Read 44
LJ Speech LJ Speech Read 24
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].