Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-1.22%)
Syn SpeechSyn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-76.73%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-68.16%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+201.22%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-71.84%)
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (-82.86%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-26.94%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-49.8%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-16.33%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-47.76%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-66.53%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (-16.33%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+66.53%)
LingvoLingvo
Stars: ✭ 2,361 (+863.67%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+208.57%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+211.84%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+231.02%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+315.1%)
Basic reinforcement learningAn introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Stars: ✭ 826 (+237.14%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+327.76%)
Deep Kernel GpDeep Kernel Learning. Gaussian Process Regression where the input is a neural network mapping of x that maximizes the marginal likelihood
Stars: ✭ 58 (-76.33%)
Deeplearning4jAll DeepLearning4j projects go here.
Stars: ✭ 68 (-72.24%)
BlinkdlA minimalist deep learning library in Javascript using WebGL + asm.js. Run convolutional neural network in your browser.
Stars: ✭ 69 (-71.84%)
Mit Deep LearningTutorials, assignments, and competitions for MIT Deep Learning related courses.
Stars: ✭ 8,912 (+3537.55%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+2437.14%)
Quickdraw Implementation of Quickdraw - an online game developed by Google
Stars: ✭ 805 (+228.57%)
DeepfacelabDeepFaceLab is the leading software for creating deepfakes.
Stars: ✭ 30,308 (+12270.61%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-85.71%)
Bidaf KerasBidirectional Attention Flow for Machine Comprehension implemented in Keras 2
Stars: ✭ 60 (-75.51%)
Casr Demo基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Stars: ✭ 76 (-68.98%)
Mit Deep Learning Book PdfMIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville
Stars: ✭ 9,859 (+3924.08%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-57.96%)
Open sttOpen STT
Stars: ✭ 584 (+138.37%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-73.88%)
AorunDeep Learning over PyTorch
Stars: ✭ 61 (-75.1%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+397.55%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+453.88%)
Nodejs SpeechNode.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+122.45%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (-10.2%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+4451.43%)
FaceswapDeepfakes Software For All
Stars: ✭ 39,911 (+16190.2%)
Tensorflow Ctc Speech RecognitionApplication of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (-48.16%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+4774.69%)
PaddlexPaddlePaddle End-to-End Development Toolkit(『飞桨』深度学习全流程开发工具)
Stars: ✭ 3,399 (+1287.35%)
Ssd PytorchSSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity
Stars: ✭ 107 (-56.33%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (-38.37%)
SpeechtAn opensource speech-to-text software written in tensorflow
Stars: ✭ 152 (-37.96%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (-32.65%)
FixyAmacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-32.65%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+755.92%)
LibraErgonomic machine learning for everyone.
Stars: ✭ 1,925 (+685.71%)