VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+3355.56%)
Listen-Attend-Spell-v2PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (+61.11%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+2155.56%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+34433.33%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (+22.22%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+2122.22%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+2911.11%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (+22.22%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+4188.89%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (+194.44%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+2077.78%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+47455.56%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+2855.56%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (+150%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+1977.78%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (+238.89%)
ParallelwaveganUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+3688.89%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-5.56%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (+1944.44%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (+38.89%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+2800%)
CCAligner🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (+627.78%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+4405.56%)
wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+33377.78%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+1866.67%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (+788.89%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+2622.22%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (+150%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (+1805.56%)
mixupspeechpro.com/
Stars: ✭ 23 (+27.78%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+3638.89%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+100%)
J.a.r.v.i.spython powered Intelligent System
Stars: ✭ 325 (+1705.56%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+672.22%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (+100%)
Alan Sdk FlutterAlan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (+1616.67%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (+16.67%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+4100%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+411.11%)
Speech recognitionA Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (+1577.78%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (+2444.44%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (+50%)
Alan Sdk IonicAlan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (+1494.44%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (+161.11%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+3416.67%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+1438.89%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+4527.78%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+4388.89%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+4000%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+3416.67%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+2344.44%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+1405.56%)