VocA physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-36.14%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+632.18%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-84.65%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-31.19%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+503.47%)
Chatbot Watson AndroidAn Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (-16.34%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+403.47%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-45.05%)
SeganSpeech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+227.23%)
TacotronA TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+769.31%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-50.99%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+522.77%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-66.83%)
Depression DetectPredicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-7.43%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+418.81%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-82.67%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (-18.32%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+2977.23%)
TtsText-to-Speech for Arduino
Stars: ✭ 118 (-41.58%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-43.56%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+213.37%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-50.99%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (-31.68%)
GttsPython library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (+545.05%)
AudioData manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+524.75%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-33.17%)
TtsTools to convert text to speech 📚💬
Stars: ✭ 84 (-58.42%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-65.84%)
Avpian open source voice command macro software
Stars: ✭ 130 (-35.64%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-68.32%)
LingvoLingvo
Stars: ✭ 2,361 (+1068.81%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-71.78%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-36.63%)
StlThe ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-78.22%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+938.12%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-80.2%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+5420.3%)
WsayWindows "say"
Stars: ✭ 36 (-82.18%)
Vq Vae SpeechPyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-7.43%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+274.26%)
Code Switching PapersA curated list of research papers and resources on code-switching
Stars: ✭ 122 (-39.6%)
PraatPraat: Doing Phonetics By Computer
Stars: ✭ 675 (+234.16%)
Tts Papers🐸 collection of TTS papers
Stars: ✭ 160 (-20.79%)
Esp8266samSpeech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (-1.49%)
Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+861.39%)