SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+502.3%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+3472.41%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-36.21%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-61.49%)
TacotronAudio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+183.33%)
TtsText-to-Speech for Arduino
Stars: ✭ 118 (-32.18%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-79.89%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+257.47%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-43.1%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+600.57%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (+134.48%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-20.11%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+484.48%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-82.18%)
Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+1016.09%)
SeganSpeech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+279.89%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+750%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+205.75%)
VocA physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-25.86%)
Xr3player🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (+171.26%)
TtsTools to convert text to speech 📚💬
Stars: ✭ 84 (-51.72%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+134.48%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+6308.62%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-60.34%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (-20.69%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-63.22%)
Code Switching PapersA curated list of research papers and resources on code-switching
Stars: ✭ 122 (-29.89%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-67.24%)
Tts Papers🐸 collection of TTS papers
Stars: ✭ 160 (-8.05%)
StlThe ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-74.71%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-77.01%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-22.41%)
WsayWindows "say"
Stars: ✭ 36 (-79.31%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-34.48%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+334.48%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1105.17%)
PraatPraat: Doing Phonetics By Computer
Stars: ✭ 675 (+287.93%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+263.79%)
Avpian open source voice command macro software
Stars: ✭ 130 (-25.29%)
Nodejs SpeechNode.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+213.22%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-43.1%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+181.61%)
GttsPython library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (+648.85%)
CboardAAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+151.15%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-26.44%)
AudioData manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+625.29%)
Chatbot Watson AndroidAn Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (-2.87%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (-5.17%)
TacotronA TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+909.2%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+622.99%)