Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+256.58%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-21.05%)
PocketsphinxPocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+3760.53%)
CidlibThe CIDLib general purpose C++ development environment
Stars: ✭ 179 (+135.53%)
CaptionThis"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories
Stars: ✭ 12 (-84.21%)
Listen-Attend-Spell-v2PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (-61.84%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-81.58%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+21.05%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-80.26%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+61.84%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-47.37%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-21.05%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+125%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-59.21%)
SmartMirrorMy MagicMirror running on a Raspberry Pi
Stars: ✭ 110 (+44.74%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-75%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+117.11%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-81.58%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+110.53%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-77.63%)
todo-listTodoList using Ionic2/3 & Firebase: * PWA * SSO Google plus. * Share list via QRcode. * Upload image from Camera or Storage. * Speech Recognition.
Stars: ✭ 18 (-76.32%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-53.95%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-67.11%)
Rnnt Speech RecognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Stars: ✭ 158 (+107.89%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-30.26%)
CCAligner🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (+72.37%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (+98.68%)
wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+7828.95%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (+1.32%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-40.79%)
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (+94.74%)
mixupspeechpro.com/
Stars: ✭ 23 (-69.74%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-52.63%)
SpeechrecognizerbuttonUIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (+89.47%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+3036.84%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-52.63%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-72.37%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+23.68%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+21.05%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+77.63%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (-35.53%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (-6.58%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+169.74%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+36.84%)
iOSProjectsIt's project that contains different applications developed with Swift 5.7 👨💻👩🏼💻🧑🏿💻
Stars: ✭ 122 (+60.53%)