Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+67.32%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-70.16%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-87.3%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (-80.15%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-90.38%)
IresnetImproved Residual Networks (https://arxiv.org/pdf/2004.04989.pdf)
Stars: ✭ 163 (-79.9%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+2203.33%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (-4.81%)
DeepfacelabDeepFaceLab is the leading software for creating deepfakes.
Stars: ✭ 30,308 (+3637.11%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (-5.8%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-95.44%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-94.45%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (-14.92%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+955.49%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-97.66%)
Dlpython courseПримеры для курса "Программирование глубоких нейронных сетей на Python"
Stars: ✭ 266 (-67.2%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-89.89%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-87.18%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-96.67%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-95.68%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-93.59%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-95.81%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-98.27%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-84.83%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-92.48%)
Deeplearning.ai NotesThese are my notes which I prepared during deep learning specialization taught by AI guru Andrew NG. I have used diagrams and code snippets from the code whenever needed but following The Honor Code.
Stars: ✭ 262 (-67.69%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-97.29%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-96.79%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-52.77%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-50.68%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-57.71%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-51.54%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (-9%)
Nn playgroundExperimental keras implementation of novel neural network structures
Stars: ✭ 414 (-48.95%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+509.49%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-45.75%)
SaliencyTensorFlow implementation for SmoothGrad, Grad-CAM, Guided backprop, Integrated Gradients and other saliency techniques
Stars: ✭ 648 (-20.1%)
ArtificioDeep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (-59.8%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-49.94%)
Quickdraw Implementation of Quickdraw - an online game developed by Google
Stars: ✭ 805 (-0.74%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+666.46%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+639.7%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-39.58%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-34.4%)