Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+73.13%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-77.97%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+823.79%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+4812.33%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-94.27%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-93.83%)
Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (+14.1%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+233.04%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-38.77%)
TacotronA TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+673.57%)
Esp8266samSpeech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (-12.33%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (-23.35%)
VocA physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-43.17%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-13.66%)
Chatbot Watson AndroidAn Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (-25.55%)
TtsText-to-Speech for Arduino
Stars: ✭ 118 (-48.02%)
Tts CubeEnd-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (-6.17%)
LingvoLingvo
Stars: ✭ 2,361 (+940.09%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (-27.31%)
Tf Kaldi SpeakerNeural speaker recognition/verification system based on Kaldi and Tensorflow
Stars: ✭ 117 (-48.46%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-49.78%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (-39.21%)
TimitThe DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (-11.01%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-40.53%)
Avpian open source voice command macro software
Stars: ✭ 130 (-42.73%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-43.61%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (-29.52%)
Code Switching PapersA curated list of research papers and resources on code-switching
Stars: ✭ 122 (-46.26%)
Kaldi GopComputes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.
Stars: ✭ 104 (-54.19%)
VoluteRaspberry Pi + Nodejs = Speech Robot
Stars: ✭ 224 (-1.32%)
Ctc pytorchCTC end -to-end ASR for timit and 863 corpus.
Stars: ✭ 161 (-29.07%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-51.1%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+551.54%)
Tts Papers🐸 collection of TTS papers
Stars: ✭ 160 (-29.52%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+497.8%)
Pykaldi2Yet another speech toolkit based on Kaldi and PyTorch
Stars: ✭ 158 (-30.4%)
Elpis🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (-55.51%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (-31.28%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-56.39%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-56.39%)
Depression DetectPredicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-17.62%)
Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+755.51%)
Factorized TdnnPyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-56.83%)