Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+879.49%)
Deepspeech
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
Stars: ✭ 18,680 (+23848.72%)
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+64.1%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+2926.92%)
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+391.03%)
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+214.1%)
Keras Sincnet
Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-39.74%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-65.38%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+846.15%)
ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+129.49%)
wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+162.82%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+32.05%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+57.69%)
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+353.85%)
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+1639.74%)
Speech To Text Russian
A project for Russian speech recognition based on pykaldi.
Stars: ✭ 151 (+93.59%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.
Stars: ✭ 242 (+210.26%)
Wav2letter.pytorch
A fully convolutional network for speech-to-text, built on PyTorch.
Stars: ✭ 104 (+33.33%)
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+939.74%)
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-73.08%)
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-33.33%)
Syn Speech
Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework
Stars: ✭ 57 (-26.92%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+569.23%)
Edgedict
Working online speech recognition based on RNN Transducer. (A trained model is available in the releases.)
Stars: ✭ 205 (+162.82%)
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-73.08%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-71.79%)
Openasr
A PyTorch-based end-to-end speech recognition system.
Stars: ✭ 69 (-11.54%)
Brevitas
Brevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (+339.74%)
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+379.49%)
Artificio
Deep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (+317.95%)
Nmtpytorch
Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+402.56%)
Graph 2d cnn
Code and data for the paper 'Classifying Graphs as Images with Convolutional Neural Networks' (new title: 'Graph Classification with 2D Convolutional Neural Networks')
Stars: ✭ 67 (-14.1%)
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+423.08%)
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords
Stars: ✭ 400 (+412.82%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+6237.18%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for making better use of Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+403.85%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+420.51%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+464.1%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+528.21%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+582.05%)
Athena
An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+594.87%)
Layer
Neural network inference the Unix way
Stars: ✭ 539 (+591.03%)
Open stt
Open STT
Stars: ✭ 584 (+648.72%)
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+7591.03%)
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+711.54%)
Asr benchmark
Program to benchmark various speech recognition APIs
Stars: ✭ 71 (-8.97%)
Adapt
Adapt Intent Parser
Stars: ✭ 690 (+784.62%)
Cs231
Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Stars: ✭ 317 (+306.41%)
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+691.03%)