robmsmt / Asr_audio_data_links
Licence: apache-2.0
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128
Projects that are alternatives to or similar to Asr_audio_data_links
wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+60.16%)
Mutual labels: speech, speech-recognition, speech-to-text, asr
Lingvo
Lingvo
Stars: ✭ 2,361 (+1744.53%)
Mutual labels: speech-recognition, speech, speech-to-text, asr
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-3.91%)
Mutual labels: speech, speech-recognition, speech-to-text, asr
Edgedict
Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (+60.16%)
Mutual labels: speech-recognition, speech, speech-to-text, asr
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-55.47%)
Mutual labels: speech-recognition, speech, speech-to-text, asr
ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+39.84%)
Mutual labels: speech, speech-recognition, speech-to-text, asr
Openasr
A PyTorch-based end-to-end speech recognition system.
Stars: ✭ 69 (-46.09%)
Mutual labels: speech-recognition, speech, speech-to-text, asr
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+307.81%)
Mutual labels: speech-recognition, speech-to-text, asr
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+385.94%)
Mutual labels: data, speech-recognition, speech
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1055.47%)
Mutual labels: speech-recognition, speech, asr
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+960.16%)
Mutual labels: speech-recognition, speech-to-text, asr
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+282.81%)
Mutual labels: speech-recognition, speech, speech-to-text
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+315.63%)
Mutual labels: speech-recognition, speech, speech-to-text
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+218.75%)
Mutual labels: speech-recognition, speech, asr
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+476.56%)
Mutual labels: speech-recognition, speech-to-text, asr
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+207.03%)
Mutual labels: speech-recognition, speech, speech-to-text
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+4756.25%)
Mutual labels: speech-recognition, speech, speech-to-text
Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (-24.22%)
Mutual labels: speech-recognition, speech-to-text, asr
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-72.66%)
Mutual labels: speech-recognition, speech, speech-to-text
Audio Data Links
A list of common publicly (and privately) available audio data that you can download for ASR or other speech activities. All your WERs are belong to us. Inspired by wer_are_we, who stole someone else's joke.
1. FREE
Source | Name & Direct Link | Type | Size(Hours) |
---|---|---|---|
OpenSLR | LibriSpeech - Train: 100, 360, 500; Test: Clean, Other; Dev: Clean, Other | Read | 960 |
OpenSLR | TED-LIUM Release 2 | Read | 118 |
OpenSLR | TED-LIUM Release 3 | Read | 452 |
Voxforge | Voxforge English | Read | 130 |
Mozilla | Common Voice v1 | Read | 500 |
Mozilla | Common Voice en_1087h_2019-06-12 | Read | 1087 |
Tatoeba | Tatoeba Audio Eng | Read | ~200 |
Valentini | Noisy Speech Database All Files, DOI | Read | TBC |
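
To get a rough sense of how much free ASR audio the table lists, the sizes can be summed with a few lines of Python. This is only a sketch: the rows are copied by hand from the table (dataset names shortened), the helper names `parse_hours` and `total_hours` are made up for illustration, and the result is a nominal figure, since different releases of the same corpus (the two TED-LIUM releases, the two Common Voice versions) may overlap.

```python
# Rows copied by hand from the free ASR table: source | name | type | hours.
FREE_ASR_ROWS = """\
OpenSLR | LibriSpeech | Read | 960
OpenSLR | TED-LIUM Release 2 | Read | 118
OpenSLR | TED-LIUM Release 3 | Read | 452
Voxforge | Voxforge English | Read | 130
Mozilla | Common Voice v1 | Read | 500
Mozilla | Common Voice en_1087h_2019-06-12 | Read | 1087
Tatoeba | Tatoeba Audio Eng | Read | ~200
Valentini | Noisy Speech Database | Read | TBC
"""


def parse_hours(cell):
    """Return the hour count as a float, or None if the cell is not numeric (e.g. 'TBC')."""
    text = cell.strip().lstrip("~")  # '~200' is treated as 200
    try:
        return float(text)
    except ValueError:
        return None


def total_hours(table_text):
    """Sum the last column of a pipe-delimited table.

    Returns (total_hours, names_of_rows_with_unknown_size).
    """
    total = 0.0
    unknown = []
    for line in table_text.strip().splitlines():
        cells = [c.strip() for c in line.split("|")]
        hours = parse_hours(cells[-1])
        if hours is None:
            unknown.append(cells[1])
        else:
            total += hours
    return total, unknown


total, unknown = total_hours(FREE_ASR_ROWS)
print(f"Listed free ASR audio: about {total:.0f} hours (size unknown for: {unknown})")
```

Rows whose size is not yet known (marked TBC) are reported separately rather than counted as zero, so the total is not silently understated.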
2. PAID
Source | Name | Type | Size(Hours) | Code |
---|---|---|---|---|
LDC | Fisher | Conversational | 2000 | Speech: LDC2004S13, LDC2005S13; Transcripts: LDC2004T19, LDC2005T19 |
LDC | Switchboard Hub 500 | Conversational | 240 | LDC2002S09 |
LDC | Switchboard Release 2 | Conversational | 300 | LDC97S62 |
LDC | TIMIT | Read | 5 | LDC93S1 |
LDC | Wall Street Journal (WSJ) | Read | 80 | LDC93S6A or LDC93S6B |
TTS
1. FREE
Source | Name & Direct Link | Type | Size(Hours) |
---|---|---|---|
Edinburgh CSTR | CSTR VCTK Corpus | Read | 44 |
LJ Speech | LJ Speech | Read | 24 |