Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+65.75%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+4157.53%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-63.7%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+201.37%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+29.45%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+17.12%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-81.51%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+2423.97%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-29.45%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-87.67%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+476.03%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-64.38%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+7537.67%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+173.97%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+3285.62%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+257.53%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+235.62%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+264.38%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+372.6%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+405.48%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-76.03%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+333.56%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+428.77%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-72.6%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-19.18%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-6.16%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+169.18%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+178.08%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+162.33%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+206.85%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (+152.05%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+360.96%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+423.29%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+4008.9%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+455.48%)
Rasa UiRasa UI is a frontend for the Rasa Framework
Stars: ✭ 796 (+445.21%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+592.47%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+12694.52%)
Angle⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-58.22%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+667.12%)
Patterspeech-to-text in pytorch
Stars: ✭ 71 (-51.37%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-60.96%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-52.74%)
Deepspeech Websocket ServerServer & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-45.89%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+734.93%)
B.e.n.j.i.B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (-43.15%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-67.81%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-46.58%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-12.33%)
Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-30.14%)
Wav2letter.pytorchA fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (-28.77%)