wenet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+2009.73%)
open-speech-corpora: 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+644.25%)
Delta: DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1208.85%)
2018-dlsl: UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-84.07%)
demo vietasr: Vietnamese Speech Recognition
Stars: ✭ 22 (-80.53%)
deep avsr: A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-7.96%)
obvi: A Polymer 3+ web component / button for doing speech recognition
Stars: ✭ 54 (-52.21%)
sova-asr: SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+8.85%)
leopard: On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+213.27%)
Transformer-Transducer: PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-46.02%)
rustfst: Rust re-implementation of OpenFST, a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-7.96%)
speechless: Speech-to-text based on wav2letter, built for transfer learning
Stars: ✭ 92 (-18.58%)
QuantumSpeech-QCNN: IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-37.17%)
VoiceBridge: VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-84.96%)
mongolian-nlp: Useful resources for Mongolian NLP
Stars: ✭ 119 (+5.31%)
PCPM: Presenting a Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-81.42%)
scripty: Speech-to-text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-87.61%)
react-client: A React client library for the Speechly API
Stars: ✭ 71 (-37.17%)
timit-preprocessor: Extract MFCC vectors and phones from the TIMIT dataset
Stars: ✭ 14 (-87.61%)
wavenet-classifier: Keras implementation of DeepMind's WaveNet for supervised learning tasks
Stars: ✭ 54 (-52.21%)
Tensorflow-Keyword-Spotting: Keyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.
Stars: ✭ 27 (-76.11%)
STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits
Stars: ✭ 39 (-65.49%)
DeepSpeech-API: Code that lets users run Mozilla's DeepSpeech model from a web browser.
Stars: ✭ 31 (-72.57%)
fer: Facial Expression Recognition
Stars: ✭ 32 (-71.68%)
converse: Conversational text analysis using various NLP techniques
Stars: ✭ 147 (+30.09%)
Unity live caption: Use the Google Speech-to-Text API for real-time live-stream captioning in Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-76.99%)
kaldi ag training: Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-87.61%)
deepspeech.mxnet: An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-27.43%)
Inimesed: An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-42.48%)
Speech-Backbones: This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+81.42%)
EmotiW2018: No description or website provided.
Stars: ✭ 83 (-26.55%)
syn-speech-samples: An application that demonstrates the usage of the Syn.Speech library for speech recognition
Stars: ✭ 24 (-78.76%)
TinyCog: Small Robot, Toy Robot platform
Stars: ✭ 29 (-74.34%)
AmazonSpeechTranslator: End-to-end solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-55.75%)
api: Speechly public API definitions and generated code
Stars: ✭ 15 (-86.73%)
AGHMN: Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" (AAAI 2020).
Stars: ✭ 25 (-77.88%)
rnnt decoder cuda: An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-46.9%)
A chronology of deep learning: Tracing the main ideas in the field of deep learning in chronological order, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-58.41%)
spokestack-ios: Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-76.11%)
pytorch audio: Audio processing module for PyTorch (STFT, ISTFT)
Stars: ✭ 33 (-70.8%)
salutejs: SmartApp Framework for building skills for the "Salute" family of virtual assistants in JavaScript
Stars: ✭ 35 (-69.03%)
RECCON: This repository contains the dataset and the PyTorch implementations of the models from the paper "Recognizing Emotion Cause in Conversations".
Stars: ✭ 126 (+11.5%)
favorite-research-papers: A list of my favorite research papers 📝 from different fields, added as I read them.
Stars: ✭ 12 (-89.38%)
speechrec: A simple speech recognition app using the Web Speech API interfaces
Stars: ✭ 18 (-84.07%)
emotion-recognition-GAN: This project is a semi-supervised approach to detecting emotions on faces in the wild using a GAN
Stars: ✭ 20 (-82.3%)