All Categories → Machine Learning → speech-to-text

Top 151 speech-to-text open source projects

Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stt
🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production
Go Astibob
Golang framework to build an AI that can understand and speak back to you, and everything else you want
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
K6nele
An Android app that offers speech-to-text services and user interfaces to other apps
Dictate.js
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Expressive tacotron
Tensorflow Implementation of Expressive Tacotron
Tensorflow Speech Recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Speaker adapted tts
Making a TTS model with 1 minute of speech samples within 10 minutes
Deepspeech Server
A testing server for a speech to text service based on mozilla deepspeech
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Proctoring Ai
Creating a software for automatic monitoring in online proctoring
Speecht
An opensource speech-to-text software written in tensorflow
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Speechrecognizerbutton
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Self Supervised Speech Recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
B.e.n.j.i.
B.E.N.J.I.- The Impossible Missions Force's digital assistant
Deepspeech Websocket Server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Deepspeech
A PaddlePaddle implementation of ASR.
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Casr Demo
基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Nativescript Speech Recognition
💬 Speech to text, using the awesome engines readily available on the device.
Openasr
A pytorch based end2end speech recognition system.
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Angle
⦠ Angle: new speakable syntax for python 💡
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Voice Synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
1-60 of 151 speech-to-text projects