The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems, including speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (-80.15%)

Mutual labels: speech-recognition, speech, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-98.28%)

Mutual labels: speech, speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. (A trained model is available in the releases.)

Stars: ✭ 205 (-83.18%)

Mutual labels: speech-recognition, speech, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-98.85%)

Mutual labels: speech, speech-recognition, speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-89.5%)

Mutual labels: speech-recognition, speech, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-86.46%)

Mutual labels: speech-recognition, speech, speech-to-text

deepspeech.mxnet

An MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-93.27%)

Mutual labels: speech, speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-59.8%)

Mutual labels: speech-recognition, speech, speech-to-text

Chinese version

DeepSpeech on PaddlePaddle

DeepSpeech on PaddlePaddle is an open-source implementation of an end-to-end Automatic Speech Recognition (ASR) engine on the PaddlePaddle platform. Our vision is to empower both industrial applications and academic research on speech recognition through an easy-to-use, efficient, and scalable implementation that includes training, inference, and testing modules, as well as demo deployment.

Models

Baidu's Deep Speech2

Setup

python>=3.7
paddlepaddle>=2.0.0
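
The two version requirements above can be checked before running the setup script. The following is a minimal sketch, not part of the project: the helper names `parse_version`, `meets_minimum`, and `check_environment` are hypothetical, and it assumes the installed `paddlepaddle` package exposes its version as `paddle.__version__`.

```python
import sys

# Minimums taken from the Setup list above.
MIN_PYTHON = (3, 7)
MIN_PADDLE = (2, 0, 0)


def parse_version(text):
    """Turn '2.1.0' or '2.0.0rc1' into a tuple of ints, dropping any suffix."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(version_text, minimum):
    """Return True when version_text is at least `minimum` (a tuple of ints)."""
    found = parse_version(version_text)
    # Pad short versions so that '2.0' compares equal to (2, 0, 0).
    padded = found + (0,) * (len(minimum) - len(found))
    return padded >= minimum


def check_environment():
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if sys.version_info[:2] < MIN_PYTHON:
        problems.append("Python >= 3.7 is required")
    try:
        import paddle  # module installed by the paddlepaddle package
    except ImportError:
        problems.append("paddlepaddle is not installed")
    else:
        if not meets_minimum(paddle.__version__, MIN_PADDLE):
            problems.append(
                "paddlepaddle >= 2.0.0 is required, found " + paddle.__version__
            )
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

Note that this sketch treats a pre-release such as `2.0.0rc1` as equal to `2.0.0`; a stricter comparison would need a proper version parser.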

Run the setup script to install the remaining dependencies:

git clone https://github.com/PaddlePaddle/DeepSpeech.git
cd DeepSpeech
pushd tools; make; popd
source tools/venv/bin/activate
bash setup.sh

Activate the virtual environment before running any experiment:

source tools/venv/bin/activate

Getting Started

Please see Getting Started and the tiny example (tiny egs).

More Information

Questions and Help

You are welcome to submit questions and bug reports in GitHub Issues. You are also welcome to contribute to this project.

License

DeepSpeech is provided under the Apache-2.0 License.
