PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-86.18%)
LingvoLingvo
Stars: ✭ 2,361 (+1453.29%)
simple diarizerSimplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (-82.89%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+63.16%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+34.87%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+34.87%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+792.76%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+151.97%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-19.08%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-15.79%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-85.53%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-82.24%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+806.58%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+168.42%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+397.37%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-82.24%)
Open sttOpen STT
Stars: ✭ 584 (+284.21%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+385.53%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-48.68%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-54.61%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+132.89%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+61.18%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+243.42%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+17.76%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-65.79%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-86.18%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+146.05%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-62.5%)
DeepspeechrecognitionA Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Stars: ✭ 1,421 (+834.87%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-30.26%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-30.26%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-3.95%)
Tensorflow Ctc Speech RecognitionApplication of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (-16.45%)
BigcidianPronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Stars: ✭ 99 (-34.87%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+873.03%)
Wav2letter.pytorchA fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (-31.58%)
Awd Lstm LmLSTM and QRNN Language Model Toolkit for PyTorch
Stars: ✭ 1,834 (+1106.58%)
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (-5.26%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+7236.18%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-32.24%)
Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-32.89%)
TupeTransformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve existing models like BERT.
Stars: ✭ 143 (-5.92%)
Kogpt2 Finetuning🔥 Korean GPT-2, KoGPT2 FineTuning cased. 한국어 가사 데이터 학습 🔥
Stars: ✭ 124 (-18.42%)
Pytorch gbw lmPyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
Stars: ✭ 101 (-33.55%)
DexterLet your talking do the code
Stars: ✭ 93 (-38.82%)
Electra pytorchPretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
Stars: ✭ 149 (-1.97%)