multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Stars: ✭ 122 (+510%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (+55%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+650%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+160%)
VoiceComA Simple Voice Command Application powered by Java and Sphinx4 Speech Recognition Library
Stars: ✭ 17 (-15%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-5%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (+85%)
YouTube-Tutorials--Italian📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.
Stars: ✭ 28 (+40%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (+35%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+160%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (+75%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (+80%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+1020%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (+15%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+465%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+515%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (+135%)
rosechoTianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
Stars: ✭ 28 (+40%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-30%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+420%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (+700%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (+35%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (+255%)
mixupspeechpro.com/
Stars: ✭ 23 (+15%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (+205%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+80%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+595%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-15%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (+125%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (+5%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (+25%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+360%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (+165%)
cvaecaposrCode for the Paper: "Conditional Variational Capsule Network for Open Set Recognition", Y. Guo, G. Camporese, W. Yang, A. Sperduti, L. Ballan, arXiv:2104.09159, 2021.
Stars: ✭ 29 (+45%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (+10%)
TensorMONKA collection of deep learning models (PyTorch implemtation)
Stars: ✭ 21 (+5%)
CCAligner🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (+555%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+42700%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+310%)
wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+30030%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (+45%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (+10%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+280%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (+70%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (+125%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+360%)
pyjsgfJSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.
Stars: ✭ 40 (+100%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-30%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-25%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (+200%)