DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (-79.36%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-99.48%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-97.53%)
UnityandroidspeechrecognitionThis repository is a Unity plugin for Android Speech Recognition (based on Java implementation)
Stars: ✭ 73 (-98.76%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-99.15%)
Android Speech RecognitionContinuous speech recognition library for Android with options to use GoogleVoiceIme dialog and offline mode.
Stars: ✭ 72 (-98.78%)
Patterspeech-to-text in pytorch
Stars: ✭ 71 (-98.8%)
2018-dlslUPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-99.7%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-98.88%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-89.28%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (-81.04%)
salutejsSmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (-99.41%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-99.12%)
Iflytek awaken asruse iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-99.1%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-94.01%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-99.22%)
VoiceComA Simple Voice Command Application powered by Java and Sphinx4 Speech Recognition Library
Stars: ✭ 17 (-99.71%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-99.27%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (-99.17%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (-92.25%)
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (-97.56%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (-83.19%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-99.58%)
SOLQ"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.
Stars: ✭ 159 (-97.31%)
RhasspyRhasspy voice assistant for offline home automation
Stars: ✭ 851 (-85.59%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-98.1%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+216.23%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (-85.9%)
DlaDeep learning for audio processing
Stars: ✭ 142 (-97.6%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (-87.07%)
titanium-speechUse the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.
Stars: ✭ 21 (-99.64%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (-87.51%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-90.99%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (-88.32%)
KodiSharpUse Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (-99.63%)
CCAligner🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (-97.78%)
EendEnd-to-End Neural Diarization
Stars: ✭ 153 (-97.41%)
SSNM-Coseg[AAAI20] Deep Object Co-segmentation via Spatial-Semantic Network Modulation(Oral paper)
Stars: ✭ 21 (-99.64%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (-72%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-99.64%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-88.61%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-89.28%)
Speech TransformerA PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Stars: ✭ 565 (-90.44%)
Lanedetection end2endEnd-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)
Stars: ✭ 500 (-91.54%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-99.63%)