Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-55.69%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-89.63%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (-84.99%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+36.66%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-51.56%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (-15.31%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (-80.97%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-96.58%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-92.35%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-94.96%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-46.42%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-97.89%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-59.11%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-92.85%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (-64.35%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-61.43%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-96.98%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (-81.67%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (-68.18%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-97.28%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-94.76%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-50.65%)
JarvisJarvis.sh is a simple configurable multi-lang assistant.
Stars: ✭ 701 (-29.41%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (-30.51%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (-53.88%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (-16.11%)
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+494.86%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (-54.88%)
RhasspyRhasspy voice assistant for offline home automation
Stars: ✭ 851 (-14.3%)
KurDescriptive Deep Learning
Stars: ✭ 811 (-18.33%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-32.23%)
Tensorflow Ios ExampleSource code for my blog post "Getting started with TensorFlow on iOS"
Stars: ✭ 432 (-56.5%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+397.78%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+504.13%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-58.91%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-18.63%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-36.25%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-58.91%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-59.72%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-36.25%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-59.92%)
Kaldi Gstreamer ServerReal-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Stars: ✭ 935 (-5.84%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (-22.26%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-37.36%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-60.42%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (-37.87%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-60.52%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-62.34%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (-23.06%)
EddiscoveryCaptains log and 3d star map for Elite Dangerous
Stars: ✭ 541 (-45.52%)
SubsyncSubtitle Speech Synchronizer
Stars: ✭ 379 (-61.83%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (-62.94%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-45.42%)