Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+222.74%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-79.6%)
GcommandspytorchConvNets for Audio Recognition using Google Commands Dataset
Stars: ✭ 65 (-96.63%)
Tts🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (-84.16%)
PraatPraat: Doing Phonetics By Computer
Stars: ✭ 675 (-64.95%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-65.06%)
DncDiscriminative Neural Clustering for Speaker Diarisation
Stars: ✭ 60 (-96.88%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-93.35%)
InaspeechsegmenterCNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Stars: ✭ 352 (-81.72%)
EspeakeSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: ✭ 339 (-82.4%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (-21.65%)
Ios 10 SamplerCode examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+73.47%)
Pink TromboneA programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (-97.2%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-83.13%)
Xva SynthMachine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (-92.94%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-83.8%)
Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-97.3%)
Numpy MlMachine learning, in numpy
Stars: ✭ 11,100 (+476.32%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (-45.59%)
SamSoftware Automatic Mouth - Tiny Speech Synthesizer
Stars: ✭ 667 (-65.37%)
Sednndeep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (-85.05%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-85.51%)
Tacotron2pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf
Stars: ✭ 46 (-97.61%)
AudioData manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (-34.48%)
SeganSpeech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (-65.68%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (-86.66%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-97.77%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-67.13%)
SpeechTransProgressTracking the progress in end-to-end speech translation
Stars: ✭ 139 (-92.78%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (-47.2%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (-23.21%)
TtsText-to-Speech for Arduino
Stars: ✭ 118 (-93.87%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (-34.68%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-67.71%)
voice-conversionan tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-98.65%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-97.92%)
EmotionalConversionStarGANThis repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Stars: ✭ 92 (-95.22%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+478.97%)
flite-goGo bindings for Flite (festival-lite)
Stars: ✭ 14 (-99.27%)
Vq Vae WavenetTensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810
Stars: ✭ 40 (-97.92%)
vaka neural network toolbox for animal vocalizations and bioacoustics
Stars: ✭ 21 (-98.91%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-94.65%)
speechportal(1st place at HopHacks) A dynamic webVR memory palace for speech training, utilizing natural language processing and Google Streetview API
Stars: ✭ 14 (-99.27%)
CotatronOfficial code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (-92.89%)
ParrotRNN-based generative models for speech.
Stars: ✭ 601 (-68.8%)
ser-with-w2v2Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
Stars: ✭ 40 (-97.92%)
TtsTools to convert text to speech 📚💬
Stars: ✭ 84 (-95.64%)
FastspeechThe Implementation of FastSpeech based on pytorch.
Stars: ✭ 600 (-68.85%)
Melgan NeuripsGAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (-69.26%)
ZerospeechVQ-VAE for Acoustic Unit Discovery and Voice Conversion
Stars: ✭ 137 (-92.89%)
Avpian open source voice command macro software
Stars: ✭ 130 (-93.25%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-93.87%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (-36.71%)
Nodejs SpeechNode.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (-71.7%)