spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-31.58%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-64.47%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1006.58%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-6.58%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-34.21%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+96.05%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+139.47%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+532.89%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+434.21%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-55.26%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+1206.58%)
FfsubsyncAutomagically synchronize subtitles with video.
Stars: ✭ 5,167 (+6698.68%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+35.53%)
voice gender detection♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (-32.89%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+403.95%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+600%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+478.95%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+365.79%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+148.68%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-72.37%)
android-vadThis VAD library can process audio in real-time utilizing GMM which helps identify presence of human speech in an audio sample that contains a mixture of speech and noise.
Stars: ✭ 64 (-15.79%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+1685.53%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+315.79%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-60.53%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (-52.63%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-59.21%)
scriptySpeech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-81.58%)
apiSpeechly public API definitions and generated code
Stars: ✭ 15 (-80.26%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-19.74%)
2018-dlslUPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-76.32%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-21.05%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+21.05%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-68.42%)
timit-preprocessorExtract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-81.58%)
salutejsSmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (-53.95%)
SmartMirrorMy MagicMirror running on a Raspberry Pi
Stars: ✭ 110 (+44.74%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-76.32%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+3036.84%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (+1.32%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (-35.53%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+36.84%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-65.79%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-67.11%)
QuietVRA Quiet Place in VR: Generate any 3D object with your voice. It's magic!
Stars: ✭ 17 (-77.63%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-53.95%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+47.37%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+500%)
rVADfastThis is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Stars: ✭ 80 (+5.26%)