Speech recognition with tensorflow: Implementation of a seq2seq model for speech recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Speechbrain.github.io: The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.
Kerasdeepspeech: A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stt: 🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production
Go Astibob: Golang framework to build an AI that can understand and speak back to you, and everything else you want
Rnn ctc: Recurrent Neural Network and Long Short-Term Memory (LSTM) with Connectionist Temporal Classification, implemented in Theano. Includes a toy training example.
Edgedict: Working online speech recognition based on RNN Transducer. (A trained model is available in the releases.)
Kaldi Active Grammar: Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
K6nele: An Android app that offers speech-to-text services and user interfaces to other apps
Dictate.js: A small JavaScript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Voice Overlay Android: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Vosk: VOSK Speech Recognition Toolkit
Deepspeech Server: A testing server for a speech-to-text service based on Mozilla DeepSpeech
Naomi: The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Hey Jetson: Deep learning based automatic speech recognition with attention for the Nvidia Jetson.
Jiwer: Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Proctoring Ai: Software for automatic monitoring in online proctoring
Speecht: Open-source speech-to-text software written in TensorFlow
Zzz Retired openstt: RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Speechrecognizerbutton: UIButton subclass with push-to-talk recording, speech recognition and Siri-style waveform view.
Asr audio data links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Kaldi: kaldi-asr/kaldi is the official location of the Kaldi project.
Nlp Models Tensorflow: Gathers machine learning and TensorFlow deep learning models for NLP problems, 1.13 < TensorFlow < 2.0
Kalliope: Kalliope is a framework that will help you to create your own personal assistant.
Spokestack Python: Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Speech And Text: Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)
Openseq2seq: Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Vosk Api: Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Dexter: Let your talking do the code
B.e.n.j.i.: B.E.N.J.I. - The Impossible Missions Force's digital assistant
Wav2letter: Speech recognition model based on a FAIR research paper, built using PyTorch.
Casr Demo: A Flask-based web demo of Chinese automatic speech recognition, including speech recognition, speech synthesis, and voiceprint recognition (speaker identification).
Patter: Speech-to-text in PyTorch
Openasr: A PyTorch-based end-to-end speech recognition system.
Watbot: An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Dragonfire: The open-source virtual assistant for Ubuntu-based Linux distributions
Angle: ⦠ Angle: new speakable syntax for Python 💡
Syn Speech: Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and .NET Framework
Voice Synthesis: This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that makes it possible to create a numerical representation of a voice from a few seconds of audio and to use it to condition a text-to-speech model trained to generalize to new voices.
Soloud: Free, easy, portable audio engine for games
Dc tts: A TensorFlow implementation of DC-TTS: yet another text-to-speech model
Artyom.js: A JavaScript library for voice control, voice commands, speech recognition, and speech synthesis. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.