Top 326 speech-recognition open source projects

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

✭ 2,751

python deep-learning tensorflow audio cnn lstm paper speech-recognition rnn evaluation end-to-end automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

✭ 253

python jupyter-notebook deep-learning machine-learning tensorflow nlp speech-recognition seq2seq speech-to-text encoder-decoder sequence-to-sequence

Cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

✭ 249

python speech-recognition pypi asr nlp-library

Zeroth

Kaldi-based Korean ASR (Korean speech recognition) open-source project

✭ 248

shell open-source speech-recognition language-model data-augmentation korean asr kaldi

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

✭ 242

html deep-learning neural-network neural-networks deeplearning speech-recognition speech speech-to-text speech-processing

Chinese text normalization

Chinese text normalization for speech processing

✭ 242

python chinese speech-recognition asr

Nemo

NeMo: a toolkit for conversational AI

✭ 3,685

Jupyter Notebook python Roff shell C++ HTML deep-learning nlp neural-network speech-recognition nlp-machine-learning text-to-speech machine-translation speech-synthesis speech-to-text nmt

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

✭ 220

python deep-learning neural-network lstm ocr speech-recognition rnn recurrent-neural-networks speech-to-text captcha theano gru ctc

Dragonfly

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

✭ 209

python speech-recognition

Edgedict

Working online speech recognition based on RNN Transducer. (A trained model is available in the project's releases.)

✭ 205

python speech-recognition speech speech-to-text asr

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

✭ 196

python speech-recognition speech-to-text kaldi

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

✭ 196

java android speech-recognition speech-to-text

Dictate.js

A small JavaScript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

✭ 195

javascript speech-recognition speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

✭ 192

python deep-learning machine-learning tensorflow keras neural-networks speech-recognition language-model speech-to-text tensorflow-models

Speechtotext Websockets Javascript

SDK and sample for speech recognition using WebSockets in JavaScript

✭ 191

javascript typescript js ts sdk websocket browser websockets microsoft speech-recognition speech recognition cognitive-services

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

✭ 190

python pytorch speech-recognition transformer seq2seq asr end-to-end attention-is-all-you-need

Asr Evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

✭ 190

python speech-recognition evaluation asr

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

✭ 189

kotlin android search speech-recognition input speech-to-text voice overlay permissions permission chatbots conversation voice-recognition conversational-ui

Tensorflow Speech Recognition

🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks

✭ 2,118

python swift deep-learning tensorflow neural-network speech-recognition speech-to-text stt

Kaldi Offline Transcriber

Offline transcription system for Estonian using Kaldi

✭ 182

python speech-recognition

Vosk

VOSK Speech Recognition Toolkit

✭ 182

python c speech-recognition speech-to-text semi-supervised-learning multilingual voice-recognition

Deepspeech German

Automatic Speech Recognition (ASR) - German

✭ 179

python speech-recognition

Cidlib

The CIDLib general purpose C++ development environment

✭ 179

cpp networking xml encryption compression speech-recognition test-framework odbc platform-independent ui-framework

Deepspeech Server

A testing server for a speech-to-text service based on Mozilla DeepSpeech

✭ 176

python speech-recognition speech-to-text reactivex reactive-extensions

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

✭ 175

python pytorch speech-recognition transformer speech asr end-to-end

Kaldi Onnx

Kaldi model converter to ONNX

✭ 174

python android ios speech-recognition onnx kaldi

Cordova Plugin Speechrecognition

🎤 Cordova Plugin for Speech Recognition

✭ 174

java android ios speech-recognition cordova-plugin

Gst Kaldi Nnet2 Online

GStreamer plugin around Kaldi's online neural network decoder

✭ 171

speech-recognition

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

✭ 171

python hacktoberfest linux raspberry-pi iot home-automation speech-recognition text-to-speech speech-to-text voice speech-synthesis assistant personal-assistant

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Tacotron asr

Speech Recognition Using Tacotron

✭ 165

python speech-recognition speech speech-to-text tacotron

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

✭ 161

python html css jupyter-notebook deep-learning tensorflow keras rest-api flask deep-neural-networks azure speech-recognition sentiment-analysis recurrent-neural-networks attention inference speech-to-text

Kaldiio

A pure-Python module for reading and writing Kaldi ark files

✭ 160

python python3 python2 speech-recognition kaldi

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

✭ 160

jupyter-notebook tutorial speech-recognition text-to-speech

Rnnt Speech Recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

✭ 158

python deep-learning machine-learning tensorflow artificial-intelligence speech-recognition rnn

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

✭ 156

python wrapper speech-recognition asr kaldi

Clovacall

ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)

✭ 151

python speech-recognition

Speech To Text Russian

A project for Russian speech recognition based on pykaldi.

✭ 151

python speech-recognition speech-to-text asr kaldi

Swiftspeech

A speech recognition framework designed for SwiftUI.

✭ 149

swift ios audio speech-recognition voice-recognition

Speech Recognition Neural Network

This is an end-to-end speech recognition neural network implemented in Keras. It was my final project for the Artificial Intelligence Nanodegree at Udacity.

✭ 148

html deep-learning speech-recognition recurrent-neural-networks lstm-neural-networks gru

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

✭ 146

nlp speech-recognition speech-to-text voice nlp-machine-learning nlu speech-processing

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

✭ 144

swift speech-recognition speech-to-text waveform siri uibutton

Dla

Deep learning for audio processing