Ivector Xvector: Extract xvector and ivector under Kaldi
Stars: ✭ 67 (-70.48%)
Wavegrad: A fast, high-quality neural vocoder.
Stars: ✭ 138 (-39.21%)
Watbot: An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-71.81%)
Timit: The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (-11.01%)
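Nhyai: AI-powered content moderation supporting pornography detection, violence and terrorism detection, language identification, sensitive-text detection, and video detection, plus various OCR capabilities such as recognition of ID cards, driver's licenses, vehicle licenses, business licenses, bank cards, handwriting, license plates, and business cards. The features can be tried out on the website.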
Stars: ✭ 60 (-73.57%)
Allosaurus: Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-40.53%)
Syn Speech: Syn.Speech is a flexible speaker-independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-74.89%)
Stl: The ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-80.62%)
Avpi: an open source voice command macro software
Stars: ✭ 130 (-42.73%)
Dialectid e2e: End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-82.38%)
Voxceleb Ivector: Voxceleb1 i-vector based speaker recognition system
Stars: ✭ 36 (-84.14%)
Asr audio data links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-43.61%)
Theano Kaldi Rnn: THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano code is coupled with the Kaldi decoder.
Stars: ✭ 31 (-86.34%)
Kaldi Io: C++ Kaldi IO lib (static and dynamic).
Stars: ✭ 22 (-90.31%)
Audiomate: Python library for handling audio datasets.
Stars: ✭ 99 (-56.39%)
Pocketsphinx Python: Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+31.28%)
Sednn: Deep learning based speech enhancement using Keras or PyTorch, made easy to use
Stars: ✭ 288 (+26.87%)
Annyang: 💬 Speech recognition for your site
Stars: ✭ 6,216 (+2638.33%)
Code Switching Papers: A curated list of research papers and resources on code-switching
Stars: ✭ 122 (-46.26%)
Segan: Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+191.19%)
Aeneas: aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+755.51%)
Factorized Tdnn: PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-56.83%)
Vosk Server: WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+22.03%)
Vad: Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+174.01%)
Sonus: 💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+134.36%)
Volute: Raspberry Pi + Nodejs = Speech Robot
Stars: ✭ 224 (-1.32%)
Java Speech Api: The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+115.86%)
Ctc pytorch: CTC end-to-end ASR for TIMIT and 863 corpus.
Stars: ✭ 161 (-29.07%)
Cboard: AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+92.51%)
Durian: Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-51.1%)
Delta: DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+551.54%)
Tts: 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (+34.36%)
Tts Papers: 🐸 collection of TTS papers
Stars: ✭ 160 (-29.52%)
Voice Builder: An open source text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+59.47%)
Vosk Api: Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+497.8%)
Ios 10 Sampler: Code examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+1371.81%)
Css10: CSS10, a Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (+33.04%)
Py Kaldi Asr: Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (-31.28%)
Pysptk: A Python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+30.84%)
Wikipron: Massively multilingual pronunciation mining
Stars: ✭ 99 (-56.39%)
React Transcript Editor: A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. Work in progress.
Stars: ✭ 285 (+25.55%)
Depression Detect: Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-17.62%)
Vosk Android Demo: Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+19.38%)
Eend: End-to-End Neural Diarization
Stars: ✭ 153 (-32.6%)
Plda: An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
Stars: ✭ 85 (-62.56%)
Gtts: Python library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (+474.01%)
Source separation: Deep learning based speech source separation using PyTorch
Stars: ✭ 226 (-0.44%)