Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+535.02%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+253.99%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-94.36%)
EkhoChinese text-to-speech engine
Stars: ✭ 690 (-60.71%)
AndroidmaryttsAndroid MARY TTS - an open-source, offline HMM-Based text-to-speech synthesis system based on MaryTTS
Stars: ✭ 134 (-92.37%)
PraatPraat: Doing Phonetics By Computer
Stars: ✭ 675 (-61.56%)
JoytanCreative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Stars: ✭ 91 (-94.82%)
Code Switching PapersA curated list of research papers and resources on code-switching
Stars: ✭ 122 (-93.05%)
Ios 10 SamplerCode examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+90.26%)
Transformertts🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
Stars: ✭ 617 (-64.86%)
PitchtronTTS for pitch-accented language. Korean dialect DB.
Stars: ✭ 91 (-94.82%)
Real Time Voice CloningClone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+1727.73%)
Amazon Polly SampleSample application for Amazon Polly. Allows to convert any blog into an audio podcast.
Stars: ✭ 139 (-92.08%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-69.7%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-96.18%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-81.49%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (-5.81%)
Xr3player🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (-73.12%)
TalkifyJavascript Text to speech library
Stars: ✭ 132 (-92.48%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-82.23%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-76.77%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (-30.58%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-77.62%)
Cnn vocoderA fast cnn-based vocoder
Stars: ✭ 74 (-95.79%)
VocA physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-92.65%)
Css10CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (-82.8%)
DlaDeep learning for audio processing
Stars: ✭ 142 (-91.91%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-96.07%)
Multilingual text to speechAn implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (-81.55%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-93.51%)
Facemoji😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS
Stars: ✭ 320 (-81.78%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-96.36%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-83.09%)
Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-97.04%)
Glow TtsA Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: ✭ 284 (-83.83%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-83.03%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-96.75%)
Sednndeep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (-83.6%)
CrystalCrystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (-93.85%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-84.11%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (-40.32%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-92.08%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-92.71%)
Parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-97.27%)
Flutter ttsFlutter Text to Speech package
Stars: ✭ 263 (-85.02%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-97.38%)
Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (-85.25%)
Amazing Python Scripts🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (-86.96%)
WavernnWaveRNN Vocoder + TTS
Stars: ✭ 1,636 (-6.83%)
StlThe ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-97.49%)
Comprehensive-Tacotron2PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (-98.75%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-97.21%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-94.99%)