Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (-1.97%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-99.65%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.96%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (-96.85%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-86.59%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-99.59%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-98.99%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (-24.78%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-98.11%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-98.24%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-92.43%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-99.55%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-96.28%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (-98.82%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-99.39%)
WaveGrad2PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Stars: ✭ 55 (-99.09%)
mongolian-nlpUseful resources for Mongolian NLP
Stars: ✭ 119 (-98.03%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (-98.12%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (-86.04%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-99.52%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-98.74%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-99.4%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-98.47%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (-98.47%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-99.25%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-99.57%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (-96.6%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (-97.69%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-99.55%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-99.6%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (-60.44%)
pytorch audioaudio processing module for pytorch:stft, istft
Stars: ✭ 33 (-99.45%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-99.44%)
pyjsgfJSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.
Stars: ✭ 40 (-99.34%)
sdk-androidTanker client-side encryption SDK for Android
Stars: ✭ 14 (-99.77%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-99.34%)
favorite-research-papersListing my favorite research papers 📝 from different fields as I read them.
Stars: ✭ 12 (-99.8%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (-99.4%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-99.42%)
ASVspoof PANo description or website provided.
Stars: ✭ 22 (-99.63%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-99.77%)
timit-preprocessorExtract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-99.77%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-99.22%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-98.27%)
mixupspeechpro.com/
Stars: ✭ 23 (-99.62%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-99.4%)
postchildren-desktop👨👦👦 A E2E test visualization tool (get along with postman and postwoman)
Stars: ✭ 23 (-99.62%)