robmsmt / Kerasdeepspeech
Licence: agpl-3.0
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245
Programming Languages
python
Projects that are alternatives of or similar to Kerasdeepspeech
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-1.22%)
Mutual labels: neural-networks, deeplearning, speech, speech-to-text
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-49.8%)
Mutual labels: speech, speech-to-text, asr
torch-asg
Auto Segmentation Criterion (ASG) implemented in PyTorch
Stars: ✭ 42 (-82.86%)
Mutual labels: speech, asr, ctc
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-47.76%)
Mutual labels: speech, speech-to-text, asr
wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (-16.33%)
Mutual labels: speech, speech-to-text, asr
deepspeech.mxnet
An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-66.53%)
Mutual labels: speech, baidu, speech-to-text
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+66.53%)
Mutual labels: speech, asr, ctc
ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-26.94%)
Mutual labels: speech, speech-to-text, asr
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-76.73%)
Mutual labels: speech, speech-to-text, asr
Openasr
A PyTorch-based end-to-end speech recognition system.
Stars: ✭ 69 (-71.84%)
Mutual labels: speech, speech-to-text, asr
Tensorflow end2end speech recognition
End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (+24.49%)
Mutual labels: speech-to-text, asr, ctc
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+201.22%)
Mutual labels: speech-to-text, asr, ctc
Wav2letter
Speech recognition model based on a FAIR research paper, built using PyTorch.
Stars: ✭ 78 (-68.16%)
Mutual labels: neural-networks, speech-to-text, asr
Edgedict
Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (-16.33%)
Mutual labels: speech, speech-to-text, asr
Paddlex
PaddlePaddle End-to-End Development Toolkit(『飞桨』深度学习全流程开发工具)
Stars: ✭ 3,399 (+1287.35%)
Mutual labels: neural-networks, deeplearning
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+4774.69%)
Mutual labels: baidu, deeplearning
Machine Learning Tutorials
machine learning and deep learning tutorials, articles and other resources
Stars: ✭ 11,692 (+4672.24%)
Mutual labels: neural-networks, deeplearning
Speecht
Open-source speech-to-text software written in TensorFlow
Stars: ✭ 152 (-37.96%)
Mutual labels: speech-to-text, asr
Keras DeepSpeech
Repository for experimenting with different CTC-based model designs for ASR. Supports live recording and testing of speech, and lets you quickly create customised datasets with the own-voice dataset creation scripts.
OVERVIEW
SETUP
- Recommended: use a virtualenv with Python 2.7 (3.x is untested and will not work with Core ML)
git clone https://github.com/robmsmt/KerasDeepSpeech
pip install -r requirements.txt
- Get the data using the import/download scripts in the folder; LibriSpeech is a good example.
- Download the language model (a large file) by running
./lm/get_lm.sh
RUN
- To Train, simply run
python run-train.py
To specify training/validation files, use
python run-train.py --train_files <csvfile> --valid_files <csvfile>
(see run-train.py for the complete argument list)
- To Test, run
python run-test.py --test_files <datacsvfile>
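The `<csvfile>` arguments above point at CSV files that list the audio to train or test on. This README does not spell out the column layout, so the following is only a minimal sketch, assuming the Mozilla DeepSpeech-style columns `wav_filename`, `wav_filesize` and `transcript`. The helper names `build_rows` and `write_dataset_csv` are hypothetical and not part of this repository. It is written as a standalone Python 3 helper; check the import scripts for the layout the project actually expects.

```python
import csv
import os

# ASSUMPTION: Mozilla DeepSpeech-style columns; not confirmed by this README.
FIELDNAMES = ["wav_filename", "wav_filesize", "transcript"]


def build_rows(pairs):
    """Turn (wav_path, transcript) pairs into CSV rows.

    Paths that do not exist on disk are skipped.
    """
    rows = []
    for wav_path, transcript in pairs:
        if not os.path.isfile(wav_path):
            continue
        rows.append({
            "wav_filename": os.path.abspath(wav_path),
            "wav_filesize": os.path.getsize(wav_path),  # size in bytes
            "transcript": transcript.strip(),
        })
    return rows


def write_dataset_csv(pairs, csv_path):
    """Write one row per existing audio file; return the number of rows written."""
    rows = build_rows(pairs)
    # newline="" is the Python 3 idiom for csv files (on Python 2.7 use mode "wb").
    with open(csv_path, "w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDNAMES)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

If the assumed layout matches, files written this way (for example `train.csv` and `valid.csv`) could be passed to `python run-train.py --train_files train.csv --valid_files valid.csv`.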
CREDIT
- Mozilla DeepSpeech
- Baidu DS1 & DS2 papers
Licence
The content of this project itself is licensed under the GNU General Public License. Copyright © 2018
Contributing
Have a question? Like the tool? Don't like it? Open an issue and let's talk about it! Pull requests are appreciated!