speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-98.81%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (-96.15%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-99.15%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-99.08%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-97.92%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-92.37%)
CCAligner🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (-95.54%)
pyjsgfJSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.
Stars: ✭ 40 (-98.64%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (-98.47%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-99.25%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-98.23%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-99.42%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-98.74%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-99.25%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-98.4%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-96.46%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-98.19%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (-97.58%)
wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+105.39%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-98.47%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (-96.86%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-99.52%)
mixupspeechpro.com/
Stars: ✭ 23 (-99.22%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-95.81%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-98.77%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (-95.26%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-99.52%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-98.77%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-98.23%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-99.28%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-97.96%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-96.86%)
VoiceComA Simple Voice Command Application powered by Java and Sphinx4 Speech Recognition Library
Stars: ✭ 17 (-99.42%)
Listen-Attend-Spell-v2PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (-99.01%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-99.08%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (-99.22%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-98.94%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-97.21%)
YouTube-Tutorials--Italian📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.
Stars: ✭ 28 (-99.05%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-99.01%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-99.49%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-97.41%)
rosechoTianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
Stars: ✭ 28 (-99.05%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.35%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (-94.55%)
HotVoiceAdds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-98.6%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+191.75%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-98.84%)