All Categories → Machine Learning → speech-to-text

Top 151 speech-to-text open source projects

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

✭ 253

python jupyter-notebook deep-learning machine-learning tensorflow nlp speech-recognition seq2seq speech-to-text encoder-decoder sequence-to-sequence

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

✭ 242

html deep-learning neural-network neural-networks deeplearning speech-recognition speech speech-to-text speech-processing

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

✭ 245

python deep-learning machine-learning keras neural-network neural-networks deeplearning speech speech-to-text coreml baidu asr ctc

Stt

🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production

✭ 197

deep-learning tensorflow speech-to-text

Go Astibob

Golang framework to build an AI that can understand and speak back to you, and everything else you want

✭ 222

go golang audio text-to-speech speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

✭ 220

python deep-learning neural-network lstm ocr speech-recognition rnn recurrent-neural-networks speech-to-text captcha theano gru ctc

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

✭ 205

python speech-recognition speech speech-to-text asr

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

✭ 196

python speech-recognition speech-to-text kaldi

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

✭ 196

java android speech-recognition speech-to-text

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

✭ 195

javascript speech-recognition speech-to-text

Expressive tacotron

Tensorflow Implementation of Expressive Tacotron

✭ 192

python speech-to-text speech-synthesis tacotron

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

✭ 192

python deep-learning machine-learning tensorflow keras neural-networks speech-recognition language-model speech-to-text tensorflow-models

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

✭ 189

kotlin android search speech-recognition input speech-to-text voice overlay permissions permission chatbots conversation voice-recognition conversational-ui

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

✭ 2,118

python swift deep-learning tensorflow neural-network speech-recognition speech-to-text stt

Speaker adapted tts

Making a TTS model with 1 minute of speech samples within 10 minutes

✭ 183

speech-to-text tts

Vosk

VOSK Speech Recognition Toolkit

✭ 182

python c speech-recognition speech-to-text semi-supervised-learning multilingual voice-recognition

Deepspeech Server

A testing server for a speech to text service based on mozilla deepspeech

✭ 176

python speech-recognition speech-to-text reactivex reactive-extensions

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

✭ 171

python hacktoberfest linux raspberry-pi iot home-automation speech-recognition text-to-speech speech-to-text voice speech-synthesis assistant personal-assistant

Tacotron asr

Speech Recognition Using Tacotron

✭ 165

python speech-recognition speech speech-to-text tacotron

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

✭ 161

python html css jupyter-notebook deep-learning tensorflow keras rest-api flask deep-neural-networks azure speech-recognition sentiment-analysis recurrent-neural-networks attention inference speech-to-text

Jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

✭ 158

python python3 speech-to-text

Proctoring Ai

Creating a software for automatic monitoring in online proctoring

✭ 155

python hacktoberfest automation opencv face-detection yolov3 speech-to-text mobilenet ssd dlib nltk

Speecht

An opensource speech-to-text software written in tensorflow

✭ 152

python python3 tensorflow language-model speech-to-text asr

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

✭ 151

python speech-recognition speech-to-text asr kaldi

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

✭ 146

nlp speech-recognition speech-to-text voice nlp-machine-learning nlu speech-processing

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

✭ 144

swift speech-recognition speech-to-text waveform siri uibutton

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

✭ 137

go golang speech-recognition speech-to-text

Awesome Ai Services

An overview of the AI-as-a-service landscape

✭ 133

javascript java kotlin nodejs machine-learning computer-vision natural-language-processing artificial-intelligence jvm speech-recognition face-recognition sentiment-analysis text-to-speech speech-to-text speech-synthesis machine-translation text-recognition

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

✭ 128

data speech-recognition speech speech-to-text asr

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

✭ 127

python deep-learning machine-learning tensorflow tutorial speech-recognition speech-to-text ctc

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

✭ 11,151

shell C++python perl c TeX cuda speech-recognition speech speech-to-text kaldi speaker-verification speaker-id

Nlp Models Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

✭ 1,603

Jupyter Notebook python deep-learning machine-learning nlp chatbot lstm embedded attention speech-to-text neural-machine-translation summarization pos-tagging language-detection optical-character-recognition lstm-seq2seq-tf dnc-seq2seq luong-api

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

✭ 1,509

python shell Dockerfile HTML linux bot home-automation speech-recognition speech-to-text speech-synthesis raspberry personal-assistant bot-creation jarvis

Self Supervised Speech Recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

✭ 106

python speech-recognition unsupervised-learning speech-to-text semi-supervised-learning

Wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

✭ 104

python deep-learning machine-learning pytorch neural-network convolutional-neural-networks speech-recognition speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

✭ 103

python deep-learning machine-learning tensorflow nlp bot natural-language-processing raspberry-pi neural-networks embedded speech-recognition text-to-speech speech-to-text tts speech-synthesis natural-language-understanding nlu smart-home voice-recognition

Speech And Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字（PocketSphinx、百度 API、科大讯飞 API）和文字转语音（pyttsx3）

✭ 102

python speech-recognition text-to-speech speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

✭ 1,378

python deep-learning tensorflow speech-recognition seq2seq language-model text-to-speech speech-to-text speech-synthesis neural-machine-translation sequence-to-sequence

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

✭ 1,357

python android deep-learning ios raspberry-pi deep-neural-networks privacy speech-recognition offline speech-to-text asr kaldi voice-recognition

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

✭ 97

python deep-learning pytorch convolutional-neural-networks speech-recognition speech-to-text asr

Dexter

Let your talking do the code

✭ 93

javascript python language natural-language-processing accessibility speech-to-text

B.e.n.j.i.

B.E.N.J.I.- The Impossible Missions Force's digital assistant

✭ 83

python python3 speech-recognition speech-to-text

Deepspeech Websocket Server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

✭ 79

python websocket speech-recognition speech-to-text

Deepspeech

A PaddlePaddle implementation of ASR.

✭ 1,219

python speech-recognition speech speech-to-text

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

✭ 78

python deep-learning pytorch convolutional-neural-networks neural-networks speech-recognition speech-to-text asr

Casr Demo

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

✭ 76

css speech-to-text flask-application ctc

Udacity Natural Language Processing Nanodegree

Tutorials and my solutions to the Udacity NLP Nanodegree

✭ 73

python html nlp keras deeplearning speech-to-text machine-translation topic-modeling udacity part-of-speech-tagger

Nativescript Speech Recognition

💬 Speech to text, using the awesome engines readily available on the device.

✭ 72

typescript speech-recognition speech-to-text nativescript voice-recognition siri

Patter

speech-to-text in pytorch

✭ 71

python pytorch ocr speech-recognition rnn speech-to-text

Openasr

A pytorch based end2end speech recognition system.

✭ 69

python speech-recognition transformer speech speech-to-text asr

Watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

✭ 64

java android chatbot dialog android-studio speech text-to-speech speech-to-text assistant entity conversation intent workspace cognitive-services watson

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

✭ 1,120

python machine-learning linux nlp artificial-intelligence ubuntu chatbot speech-recognition text-to-speech speech-to-text spacy kaldi personal-assistant

Angle

⦠ Angle: new speakable syntax for python 💡

✭ 61

python compiler speech-recognition speech-to-text

Audio Pretrained Model

A collection of Audio and Speech pre-trained models.

✭ 61

python3 machine-learning pytorch tensorflow keras neural-network audio speech-recognition caffe mxnet speech-to-text audio-processing keras-tensorflow tensorflow-models keras-models

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

✭ 57

dotnet speech-recognition speech speech-to-text mono asr

Voice Synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

✭ 51

python tensorflow keras speech-to-text

Soloud

Free, easy, portable audio engine for games

✭ 1,048

python c ruby cpp game audio game-development engine mp3 sound portable speech speech-to-text synthesizer flac

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

✭ 1,017

python speech speech-to-text tts

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

✭ 1,011

javascript speech-recognition speech-to-text speech-synthesis recognition voice-commands

1-60 of 151 speech-to-text projects

›