picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+112.08%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+137.58%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+26.85%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+257.05%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-66.44%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+412.75%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-65.1%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+222.82%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-52.35%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+464.43%)
Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-71.14%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+566.44%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+172.48%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-48.99%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-81.88%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+195.3%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+157.05%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-68.46%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-85.91%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+810.74%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+22.15%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-77.18%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-79.87%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+3926.17%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-33.56%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-30.87%)
MediafileA unified reader of metadata from audio & video files.
Stars: ✭ 138 (-7.38%)
MalgoMini audio library
Stars: ✭ 138 (-7.38%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-8.05%)
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (-0.67%)
MusicplayerA minimal music player built on electron.
Stars: ✭ 145 (-2.68%)
Cordova Plugin AudioinputThis iOS/Android Cordova/PhoneGap plugin enables audio capture from the device microphone, by in near real-time forwarding audio to the web layer of your application. A typical usage scenario for this plugin would be to use the captured audio as source for a web audio node chain, where it then can be analyzed, manipulated and/or played.
Stars: ✭ 137 (-8.05%)
Gydlgydl (Graphical Youtube-dl) is a GUI wrapper around the already existing youtube-dl program.
Stars: ✭ 136 (-8.72%)
TimecatA Magical Web Recorder & Player 🖥
Stars: ✭ 1,955 (+1212.08%)
AvdemoDemo projects for iOS Audio & Video development.
Stars: ✭ 136 (-8.72%)
JamesdspmanagerAudio DSP effects build on Android system framework layer. This is a repository contains a pack of high quality DSP algorithms specialized for audio processing.
Stars: ✭ 136 (-8.72%)
RnnoiseRecurrent neural network for audio noise reduction
Stars: ✭ 2,266 (+1420.81%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-2.01%)
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (-3.36%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-9.4%)
Webrtc CliWebRTC command-line peer.
Stars: ✭ 135 (-9.4%)
YoucastTurn YouTube Channels into Subscribable Podcasts.
Stars: ✭ 142 (-4.7%)
Sbplayer ios基于AVPlayer封装的轻量级播放器,可播放本地及网络视频,易于定制
Stars: ✭ 134 (-10.07%)
UbicousticsAccompanying repository for Ubicoustics: Plug-and-Play Acoustic Activity Recognition
Stars: ✭ 134 (-10.07%)
FsynthWeb-based and pixels-based collaborative synthesizer
Stars: ✭ 146 (-2.01%)
DlaDeep learning for audio processing
Stars: ✭ 142 (-4.7%)
Yt AudioA simple, configurable youtube-dl wrapper to download and manage youtube audio
Stars: ✭ 132 (-11.41%)
Managedbass.Net Wrapper for 'Bass' Audio Library
Stars: ✭ 131 (-12.08%)