robmsmt / Kerasdeepspeech

Licence: agpl-3.0

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Programming Languages

python


Labels

deep-learning machine-learning keras neural-network neural-networks deeplearning speech speech-to-text coreml baidu asr ctc

Projects that are alternatives to or similar to Kerasdeepspeech

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (-1.22%)

Mutual labels: neural-networks, deeplearning, speech, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-49.8%)

Mutual labels: speech, speech-to-text, asr

torch-asg

Auto Segmentation Criterion (ASG) implemented in PyTorch

Stars: ✭ 42 (-82.86%)

Mutual labels: speech, asr, ctc

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-47.76%)

Mutual labels: speech, speech-to-text, asr

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (-16.33%)

Mutual labels: speech, speech-to-text, asr

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-66.53%)

Mutual labels: speech, baidu, speech-to-text

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+66.53%)

Mutual labels: speech, asr, ctc

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-26.94%)

Mutual labels: speech, speech-to-text, asr

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (-76.73%)

Mutual labels: speech, speech-to-text, asr

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (-71.84%)

Mutual labels: speech, speech-to-text, asr

Lingvo

Stars: ✭ 2,361 (+863.67%)

Mutual labels: speech, speech-to-text, asr

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-49.39%)

Mutual labels: speech, asr, ctc

Tensorflow end2end speech recognition

End-to-end speech recognition implementation based on TensorFlow (CTC, attention, and MTL training)

Stars: ✭ 305 (+24.49%)

Mutual labels: speech-to-text, asr, ctc

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+201.22%)

Mutual labels: speech-to-text, asr, ctc

Wav2letter

Speech recognition model based on a FAIR research paper, built using PyTorch.

Stars: ✭ 78 (-68.16%)

Mutual labels: neural-networks, speech-to-text, asr

Edgedict

Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)

Stars: ✭ 205 (-16.33%)

Mutual labels: speech, speech-to-text, asr

Paddlex

PaddlePaddle End-to-End Development Toolkit（『飞桨』深度学习全流程开发工具）

Stars: ✭ 3,399 (+1287.35%)

Mutual labels: neural-networks, deeplearning

Horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Stars: ✭ 11,943 (+4774.69%)

Mutual labels: baidu, deeplearning

Machine Learning Tutorials

machine learning and deep learning tutorials, articles and other resources

Stars: ✭ 11,692 (+4672.24%)

Mutual labels: neural-networks, deeplearning

Speecht

Open-source speech-to-text software written in TensorFlow

Stars: ✭ 152 (-37.96%)

Mutual labels: speech-to-text, asr


Keras DeepSpeech

Repository for experimenting with different CTC-based model designs for ASR. It supports live recording and testing of speech, and its own-voice dataset creation scripts let you quickly build customised datasets.

OVERVIEW

SETUP

Recommended: use a virtualenv with Python 2.7 (3.x is untested and will not work with Core ML).
git clone https://github.com/robmsmt/KerasDeepSpeech
pip install -r requirements.txt
Get the data using the import/download scripts in the folder; LibriSpeech is a good example.
Download the language model (a large file) by running ./lm/get_lm.sh

RUN

To train, run python run-train.py. To specify training and validation files, use python run-train.py --train_files <csvfile> --valid_files <csvfile> (see run-train for the complete list of arguments).
To test, run python run-test.py --test_files <datacsvfile>
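
The README does not describe the layout of the CSV files passed to --train_files, --valid_files and --test_files. As a rough sketch only, the helper below writes a manifest with one row per audio clip. The column names (wav_filename, wav_filesize, transcript) are an assumption borrowed from the CSV convention used by Mozilla DeepSpeech, which this project credits; check the repository's import scripts before relying on them. The function name write_manifest is hypothetical, and the sketch targets Python 3 as a standalone utility, separate from the Python 2.7 environment recommended above.

```python
import csv
import os


def write_manifest(wav_dir, transcripts, csv_path):
    """Write a CSV manifest with one row per audio clip.

    transcripts maps a WAV file name (relative to wav_dir) to its text.
    The column names are an ASSUMPTION (Mozilla DeepSpeech convention),
    not something this README confirms.
    """
    fieldnames = ["wav_filename", "wav_filesize", "transcript"]
    with open(csv_path, "w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        # Sort file names so the output order is deterministic.
        for name in sorted(transcripts):
            wav_path = os.path.join(wav_dir, name)
            writer.writerow({
                "wav_filename": wav_path,
                # File size in bytes, read from disk.
                "wav_filesize": os.path.getsize(wav_path),
                # Lower-case and trim the transcript text.
                "transcript": transcripts[name].lower().strip(),
            })
    return csv_path
```

If the assumed columns match what the scripts expect, the resulting file could then be passed as python run-train.py --train_files train.csv --valid_files valid.csv.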

CREDIT

Mozilla DeepSpeech
Baidu DS1 & DS2 papers

Licence

Contributing

Have a question? Like the tool? Don't like it? Open an issue and let's talk about it! Pull requests are appreciated!


Stars: ✭ 245
