Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → robmsmt → ASR-Audio-Data-Links

robmsmt / ASR-Audio-Data-Links

Licence: Apache-2.0 license

A list of publically available audio data that anyone can download for ASR or other speech activities

Programming Languages

77523 projects

Labels

data speech speech-recognition audio-data speech-to-text asr speech-activities

Projects that are alternatives of or similar to ASR-Audio-Data-Links

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+14.53%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-28.49%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+14.53%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-61.45%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-68.16%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

Lingvo

Stars: ✭ 2,361 (+1218.99%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-31.28%)

Mutual labels: speech, speech-recognition, speech-to-text, asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-7.82%)

Mutual labels: speech, speech-recognition, speech-to-text

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+1071.51%)

Mutual labels: speech, speech-recognition, asr

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-88.27%)

Mutual labels: speech-recognition, speech-to-text, asr

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+726.26%)

Mutual labels: speech, speech-recognition, asr

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

Stars: ✭ 35 (-80.45%)

Mutual labels: speech, speech-recognition, speech-to-text

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+581.01%)

Mutual labels: speech, speech-recognition, speech-to-text

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-2.23%)

Mutual labels: speech, speech-recognition, asr

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+97.77%)

Mutual labels: speech-recognition, speech-to-text, asr

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+36.87%)

Mutual labels: speech, speech-to-text, asr

Voice control for your websites and applications

Stars: ✭ 53 (-70.39%)

Mutual labels: speech, speech-recognition, speech-to-text

💬 Speech recognition for your site

Stars: ✭ 6,216 (+3372.63%)

Mutual labels: speech, speech-recognition, speech-to-text

A Python wrapper for Kaldi

Stars: ✭ 756 (+322.35%)

Mutual labels: speech, speech-recognition, asr

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+6129.61%)

Mutual labels: speech, speech-recognition, speech-to-text

View All Similar Projects ➔

Audio Data Links

A list of common publically (and privately) available audio data that you can download for ASR or other speech activities. All your WERs are belong to us. Inspired by wer are we who stole someone elses joke.

1. FREE

Source	Name & Direct Link	Type	Size(Hours)
OpenSLR	LibriSpeech - Train:100 360 500 Test:Clean Other Dev:Clean Other	Read	960
OpenSLR	TED-LIUM Release 2	Read	118
OpenSLR	TED-LIUM Release 3	Read	452
Voxforge	Voxforge English	Read	130
Mozilla	Common Voice v1	Read	500
Mozilla	Common Voice en_1087h_2019-06-12	Read	1,087
Tatoeba	Tatoeba Audio Eng	Read	~200
Valentini	Noisy Speech Database All Files, DOI	Read	TBC
VOiCES	Complex Environmental Settings All Files	Read LibriSpeech	15
ai4bharat	NPTEL2020 en-IN Torrent	Lectures	15,700
Opencollective	open_stt Russian Torrent	Various Read/Presented	20,108
Speechcolab	GigaSpeech Link	Various Read/Presented	33,000 Unlabeled 10,000 Labeled

2. PAID

Source	Name	Type	Size(Hours)	Code
LDC	Fisher	Conversational	2000	Speech LDC2004S13 LDC2005S13 Transcripts LDC2004T19 LDC2005T19
LDC	Switchboard Hub 500	Conversational	240	LDC2002S09
LDC	Switchboard Release 2	Conversational	300	LDC97S62
LDC	TIMIT	Read	5	LDC93S1
LDC	Wall Street Journal (WSJ)	Read	80	LDC93S6A or LDC93S6B

TTS

1. FREE

Source	Name & Direct Link	Type	Size(Hours)
Edinburgh CSTR	CSTR VCTK Corpus	Read	44
LJ Speech	LJ Speech	Read	24

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 179

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗