ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+83.72%)
Vq Vae SpeechPyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (+117.44%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (+2.33%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+469.77%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-59.3%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (+374.42%)
Tts🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (+254.65%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-33.72%)
Wavenet SttAn end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-79.07%)
InaspeechsegmenterCNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Stars: ✭ 352 (+309.3%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+518.6%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-53.49%)
FlowavenetA Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Stars: ✭ 471 (+447.67%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+356.98%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-63.95%)
PycadlPython package with source code from the course "Creative Applications of Deep Learning w/ TensorFlow"
Stars: ✭ 356 (+313.95%)
Chainer Vq VaeA Chainer implementation of VQ-VAE.
Stars: ✭ 77 (-10.47%)
Ios 10 SamplerCode examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+3784.88%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+7127.91%)
Css10CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (+251.16%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+245.35%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+1118.6%)
PraatPraat: Doing Phonetics By Computer
Stars: ✭ 675 (+684.88%)
ClarinetA Pytorch Implementation of ClariNet
Stars: ✭ 273 (+217.44%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+1082.56%)
TacotronAudio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+473.26%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-25.58%)
Xr3player🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (+448.84%)
Vq Vae WavenetTensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810
Stars: ✭ 40 (-53.49%)
CboardAAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+408.14%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1317.44%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+374.42%)
WsayWindows "say"
Stars: ✭ 36 (-58.14%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+320.93%)
Time Series PredictionA collection of time series prediction methods: rnn, seq2seq, cnn, wavenet, transformer, unet, n-beats, gan, kalman-filter
Stars: ✭ 351 (+308.14%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+1362.79%)
Tts🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+6210.47%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+779.07%)
Android SpeechAndroid speech recognition and text to speech made easy
Stars: ✭ 310 (+260.47%)
WavenetWaveNet implementation with chainer
Stars: ✭ 53 (-38.37%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+246.51%)
ParallelwaveganUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+693.02%)
Sednndeep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (+234.88%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-19.77%)
SeganSpeech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+668.6%)
Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (+201.16%)
Tacotron2pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf
Stars: ✭ 46 (-46.51%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+636.05%)
Amazing Python Scripts🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (+166.28%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-43.02%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+623.26%)
AudioData manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+1367.44%)