Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+871.43%)
LingvoLingvo
Stars: ✭ 2,361 (+5521.43%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+3421.43%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+483.33%)
NeuralconvoNeural conversational model in Torch
Stars: ✭ 773 (+1740.48%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (+35.71%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+19.05%)
speech recognition ctcUse ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
Stars: ✭ 40 (-4.76%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+1657.14%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+352.38%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+833.33%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+1700%)
AESRC2020a deep accent recognition network
Stars: ✭ 35 (-16.67%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+326.19%)
Asr Stars: ✭ 54 (+28.57%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+204.76%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (+64.29%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+4892.86%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+388.1%)
Multimodal-Gesture-Recognition-with-LSTMs-and-CTCAn end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-40.48%)
avsr-tf1Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (+80.95%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1190.48%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-50%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+985.71%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+166.67%)
sentence2vecDeep sentence embedding using Sequence to Sequence learning
Stars: ✭ 23 (-45.24%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+276.19%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+192.86%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+388.1%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-4.76%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+23.81%)
deep-molecular-optimizationMolecular optimization by capturing chemist’s intuition using the Seq2Seq with attention and the Transformer
Stars: ✭ 60 (+42.86%)
torch-lrcnAn implementation of the LRCN in Torch
Stars: ✭ 85 (+102.38%)
vqa-softAccompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 VQA workshop paper.
Stars: ✭ 14 (-66.67%)
tensorsemStructural Equation Modeling using Torch
Stars: ✭ 36 (-14.29%)
transformerNeutron: A pytorch based implementation of Transformer and its variants.
Stars: ✭ 60 (+42.86%)
fadeA Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-71.43%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-33.33%)
MelNet-SpeechGenerationImplementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-54.76%)
kaldi-allignerscripts to align a given wave to its transcription using trained models by Kaldi
Stars: ✭ 24 (-42.86%)
ai-visual-storytelling-seq2seqImplementation of seq2seq model for Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html
Stars: ✭ 50 (+19.05%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-40.48%)
bouncerAn application to cycle (bounce) all nodes in a coordinated fashion in an AWS ASG or set of related ASGs
Stars: ✭ 123 (+192.86%)
nabaztag-phpa simple php implementation of a Nabaztag server
Stars: ✭ 14 (-66.67%)
DLCV2018SPRINGDeep Learning for Computer Vision (CommE 5052) in NTU
Stars: ✭ 38 (-9.52%)
HTKThe Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-45.24%)
SER-datasetsA collection of datasets for the purpose of emotion recognition/detection in speech.
Stars: ✭ 74 (+76.19%)
neuralBlackA Multi-Class Brain Tumor Classifier using Convolutional Neural Network with 99% Accuracy achieved by applying the method of Transfer Learning using Python and Pytorch Deep Learning Framework
Stars: ✭ 36 (-14.29%)
EmbeddingEmbedding模型代码和学习笔记总结
Stars: ✭ 25 (-40.48%)
dtsA Keras library for multi-step time-series forecasting.
Stars: ✭ 130 (+209.52%)