Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+1257.14%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+157.14%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+10464.29%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+5907.14%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+2571.43%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+1014.29%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+1364.29%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+485.71%)
simple diarizerSimplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (+85.71%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+442.86%)
react-clientAn React client library for Speechly API
Stars: ✭ 71 (+407.14%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (+0%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (+1042.86%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (+42.86%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+557.14%)
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+200%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (+142.86%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (+85.71%)
mongolian-nlpUseful resources for Mongolian NLP
Stars: ✭ 119 (+750%)
kaldi-allignerscripts to align a given wave to its transcription using trained models by Kaldi
Stars: ✭ 24 (+71.43%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+1364.29%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (+221.43%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (+221.43%)
pyjsgfJSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.
Stars: ✭ 40 (+185.71%)
AESRC2020a deep accent recognition network
Stars: ✭ 35 (+150%)
klaamArabic speech recognition, classification and text-to-speech.
Stars: ✭ 151 (+978.57%)
SpeechToTextSpeech To Text in Android
Stars: ✭ 53 (+278.57%)
mixupspeechpro.com/
Stars: ✭ 23 (+64.29%)
pytorch audioaudio processing module for pytorch:stft, istft
Stars: ✭ 33 (+135.71%)
favorite-research-papersListing my favorite research papers 📝 from different fields as I read them.
Stars: ✭ 12 (-14.29%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (+335.71%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (+157.14%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (+335.71%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (+121.43%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+892.86%)
spokestack-tray-androidA UI component that makes it easy to add voice interaction to your app.
Stars: ✭ 13 (-7.14%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+257.14%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+407.14%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (+21.43%)
Star TrackerDevelopment of SPEL - Open Star Tracker (SOST) - Will be tested in our new space projects
Stars: ✭ 38 (+171.43%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (+21.43%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (+150%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (+0%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+364.29%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (+7.14%)
leopard-chat-ui-teneoLeopard Chat UI - A Teneo Chat Client based on Vue and Vuetify
Stars: ✭ 65 (+364.29%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (+278.57%)
VoiceComA Simple Voice Command Application powered by Java and Sphinx4 Speech Recognition Library
Stars: ✭ 17 (+21.43%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (+157.14%)