flite-go
Go bindings for Flite (festival-lite)
Stars: ✭ 14 (-93.83%)
Julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+454.19%)
dropclass speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-91.19%)
Speech256
An FPGA implementation of a classic 1980s speech synthesizer. Done for the Retro Challenge 2017/10.
Stars: ✭ 51 (-77.53%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+437%)
ser-with-w2v2
Official implementation of the INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
Stars: ✭ 40 (-82.38%)
torch-asg
Auto Segmentation Criterion (ASG) implemented in PyTorch
Stars: ✭ 42 (-81.5%)
Ivector Xvector
Extract x-vectors and i-vectors using Kaldi
Stars: ✭ 67 (-70.48%)
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-67.84%)
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (-39.21%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-77.09%)
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-71.81%)
LIUM
Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-87.67%)
Timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (-11.01%)
jackpair
P2P speech-encrypting device with an analog audio interface, suitable for GSM phones
Stars: ✭ 26 (-88.55%)
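Nhyai
AI-powered content moderation supporting pornography detection, violence and terrorism detection, language identification, sensitive-text detection, and video detection, plus a range of OCR capabilities such as recognition of ID cards, driver's licenses, vehicle licenses, business licenses, bank cards, handwriting, license plates, and business cards. The features can be tried on the project website.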
Stars: ✭ 60 (-73.57%)
fade
A Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-94.71%)
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-40.53%)
kaldi-timit-sre-ivector
Develop a speaker recognition model based on i-vectors using the TIMIT database
Stars: ✭ 17 (-92.51%)
Syn Speech
Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework
Stars: ✭ 57 (-74.89%)
nabaztag-php
A simple PHP implementation of a Nabaztag server
Stars: ✭ 14 (-93.83%)
KAREN
KAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-92.07%)
Stl
The ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-80.62%)
Eend
End-to-End Neural Diarization
Stars: ✭ 153 (-32.6%)
Avpi
Open-source voice command macro software
Stars: ✭ 130 (-42.73%)
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-58.15%)
Dialectid e2e
End-to-End Dialect Identification using a Convolutional Neural Network
Stars: ✭ 40 (-82.38%)
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (-70.48%)
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-1.32%)
Voxceleb Ivector
Voxceleb1 i-vector based speaker recognition system
Stars: ✭ 36 (-84.14%)
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-94.71%)
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-43.61%)
Theano Kaldi Rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano code is coupled with the Kaldi decoder.
Stars: ✭ 31 (-86.34%)
speech-transformer
Transformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (-82.38%)
VAD-LTSD
Efficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-83.7%)
Kaldi Io
C++ Kaldi I/O library (static and dynamic).
Stars: ✭ 22 (-90.31%)
JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-91.19%)
Amazing Python Scripts
🚀 Curated collection of amazing Python scripts, from basic to advanced, including task automation scripts.
Stars: ✭ 229 (+0.88%)
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-13.66%)
Plda
An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
Stars: ✭ 85 (-62.56%)
Noise2Noise-audio denoising without clean training data
Source code for the paper "Speech Denoising without Clean Training Data: a Noise2Noise Approach", accepted at the INTERSPEECH 2021 conference. The paper tackles the heavy dependence of deep-learning-based audio denoising methods on clean speech data by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-78.41%)
torchain
WIP: PyTorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-91.19%)
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+2638.33%)
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (-77.97%)
Code Switching Papers
A curated list of research papers and resources on code-switching
Stars: ✭ 122 (-46.26%)
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-61.23%)
Segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+191.19%)
Source separation
Deep-learning-based speech source separation using PyTorch
Stars: ✭ 226 (-0.44%)
Speech Denoiser
A speech denoising LV2 plugin based on the RNNoise library
Stars: ✭ 220 (-3.08%)
Edgedict
Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (-9.69%)
Vq Vae Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-17.62%)
Speech To Text Russian
A project for Russian speech recognition based on pykaldi.
Stars: ✭ 151 (-33.48%)