Top 106 asr open source projects

HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.

✭ 187

java deep-learning machine-learning classification ocr face-detection face-recognition text-to-speech image-segmentation document asr language-detection

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

✭ 175

python pytorch speech-recognition transformer speech asr end-to-end

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Mrcp Plugin With Freeswitch

使用FreeSWITCH接受用户手机呼叫，通过UniMRCP Server集成讯飞开放平台（xfyun）插件将用户语音进行语音识别（ASR），并根据自定义业务逻辑调用语音合成（TTS），构建简单的端到端语音呼叫中心。

✭ 168

tts asr

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

✭ 156

python wrapper speech-recognition asr kaldi

Speecht

An opensource speech-to-text software written in tensorflow

✭ 152

python python3 tensorflow language-model speech-to-text asr

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

✭ 151

python speech-recognition speech-to-text asr kaldi

Listen Attend Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

✭ 147

python pytorch asr end-to-end

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

✭ 128

data speech-recognition speech speech-to-text asr

Asr syllable

基于卷积神经网络的语音识别声学模型的研究

✭ 127

python keras cnn attention asr densenet ctc

Pytorch Asr

ASR with PyTorch

✭ 124

python pytorch speech-recognition resnet speech decoder asr densenet kaldi ctc capsule-network

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

✭ 114

python speech-recognition mxnet asr end-to-end

Deepspeechrecognition

A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型

✭ 1,421

python speech-recognition asr

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

✭ 106

python pytorch speech-recognition asr end-to-end

Bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

✭ 99

python speech-recognition asr multilingual pinyin ipa

Delta

DELTA is a deep learning based natural language and speech processing platform.

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

✭ 1,357

python android deep-learning ios raspberry-pi deep-neural-networks privacy speech-recognition offline speech-to-text asr kaldi voice-recognition

Zerospeech Tts Without T

A Pytorch implementation for the ZeroSpeech 2019 challenge.

✭ 100

python gan text-to-speech tts autoencoder asr adversarial-learning

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

✭ 97

python deep-learning pytorch convolutional-neural-networks speech-recognition speech-to-text asr

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

✭ 92

python crawler youtube speech-recognition asr

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

✭ 78

python deep-learning pytorch convolutional-neural-networks neural-networks speech-recognition speech-to-text asr

Voicer

AGI-server voice recognizer for #Asterisk

✭ 73

javascript google voice recognition asr yandex voice-recognition asterisk voice-commands voice-control agi

Asr benchmark

Program to benchmark various speech recognition APIs

✭ 71

python benchmark speech-recognition asr voice-recognition

Openasr

A pytorch based end2end speech recognition system.

✭ 69

python speech-recognition transformer speech speech-to-text asr

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

✭ 57

dotnet speech-recognition speech speech-to-text mono asr

Asr

✭ 54

python transformer seq2seq asr

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

✭ 47

python deep-learning machine-learning tensorflow keras neural-network audio artificial-intelligence convolutional-neural-networks cnn speech-recognition audio-processing asr speech-processing filtering waveform

Asrgen

Attacking Speaker Recognition with Deep Generative Models

✭ 31

jupyter-notebook text-to-speech gans asr

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

✭ 808

python pytorch speech-recognition asr kaldi end-to-end

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

✭ 764

python deep-learning pytorch audio artificial-intelligence convolutional-neural-networks neural-networks cnn speech-recognition audio-processing signal-processing asr speech-processing filtering waveform

Pykaldi

A Python wrapper for Kaldi

✭ 756

python numpy wrapper speech-recognition speech language-model feature-extraction asr kaldi

Eesen

The official repository of the Eesen project

✭ 738

tensorflow speech-recognition speech-to-text asr kaldi ctc

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

✭ 633

python deep-learning pytorch speech-recognition asr

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

✭ 617

python pytorch speech-recognition transformer asr

Open stt

Open STT

✭ 584

python dataset speech-to-text russian asr

Speech Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

✭ 565

python pytorch transformer attention asr end-to-end attention-is-all-you-need

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

✭ 542

python tensorflow deployment speech-recognition transformer unsupervised-learning tts speech-synthesis asr sequence-to-sequence ctc

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

✭ 522

jupyter-notebook pytorch speech-recognition speech-to-text pretrained-models english asr

Neural sp

End-to-end ASR/LM implementation with PyTorch

✭ 408

python pytorch streaming speech-recognition transformer attention-mechanism attention seq2seq speech language-model asr sequence-to-sequence ctc

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

✭ 392

jupyter-notebook deep-learning pytorch cnn speech-recognition seq2seq asr neural-machine-translation nmt

Cheetah

On-device streaming speech-to-text engine powered by deep learning

✭ 383

python c android deep-learning ios machine-learning raspberry-pi iot webassembly arm speech-recognition offline speech-to-text asr voice-recognition

Zamia Speech

Open tools and data for cloudless automatic speech recognition

✭ 374

python speech-recognition language-model asr kaldi

Asr theory

语音识别理论，论文和PPT

✭ 344

tensorflow keras deeplearning papers asr kaldi ppt

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

✭ 305

python tensorflow speech-recognition attention-mechanism speech-to-text asr end-to-end ctc beam-search

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

✭ 277

python websocket webrtc grpc speech-recognition saas asr kaldi

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

✭ 271

java android speech-recognition offline asr kaldi

Docker Kaldi Gstreamer Server

Dockerfile for kaldi-gstreamer-server.

✭ 266

docker dockerfile asr kaldi

spinorama

A library to display and compare spinorama (speakers measurements) graphs.

✭ 29

python Jupyter Notebook javascript HTML TeX shell measurements asr speakers spinorama cea2034 audiosciencereview

UnityASR

Automatic Speech Recognition in Unity.

✭ 14

C#unity3d speech-recognition stt asr unity3d-speech

demo vietasr

Vietnamese Speech Recognition

✭ 22

C++python Makefile Cuda shell Jupyter Notebook speech-recognition automatic-speech-recognition speech-to-text stt asr vietnamese-nlp ctc-loss vietnamese-language ctc-decode vietnamese-speech-recognition

sova-asr

SOVA ASR (Automatic Speech Recognition)

✭ 123

python javascript CSS HTML Dockerfile speech speech-recognition automatic-speech-recognition speech-to-text stt asr wav2letter asr-model

1-60 of 106 asr projects

›