sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+241.67%)
EdgedictWorking online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (+469.44%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+652.78%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (+216.67%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-41.67%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+188.89%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+97.22%)
react-clientA React client library for the Speechly API
Stars: ✭ 71 (+97.22%)
hydra-hppHydra Hot Potato Player (game)
Stars: ✭ 12 (-66.67%)
ppg-vcPPG-Based Voice Conversion
Stars: ✭ 154 (+327.78%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-52.78%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-61.11%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+80.56%)
Neural-HMMNeural HMMs are all you need (for high-quality attention-free TTS)
Stars: ✭ 69 (+91.67%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+213.89%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-44.44%)
labml🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱
Stars: ✭ 1,213 (+3269.44%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+188.89%)
speechlessSpeech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (+155.56%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-13.89%)
map-floodwater-satellite-imageryThis repository focuses on training semantic segmentation models to predict the presence of floodwater for disaster prevention. Models were trained using SageMaker and Colab.
Stars: ✭ 21 (-41.67%)
Tensorflow-Keyword-SpottingKeyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.
Stars: ✭ 27 (-25%)
MVSNet plMVSNet: Depth Inference for Unstructured Multi-view Stereo using pytorch-lightning
Stars: ✭ 49 (+36.11%)
AutoTabularAutomatic machine learning for tabular data. ⚡🔥⚡
Stars: ✭ 51 (+41.67%)
Unity live captionUse the Google Speech-to-Text API to caption live streams in real time in Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-27.78%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+38.89%)
uetaiCustom ML tracking experiment and debugging tools.
Stars: ✭ 17 (-52.78%)
apiSpeechly public API definitions and generated code
Stars: ✭ 15 (-58.33%)
speech-transformerTransformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (+11.11%)
mongolian-nlpUseful resources for Mongolian NLP
Stars: ✭ 119 (+230.56%)
2018-dlslUPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-50%)
specificationRDF vocabulary and specification
Stars: ✭ 21 (-41.67%)
pytorch multi input exampleMulti-Input Deep Neural Networks with PyTorch-Lightning - Combine Image and Tabular Data
Stars: ✭ 40 (+11.11%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+66.67%)
uvadlc notebooksRepository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2022/Spring 2022
Stars: ✭ 901 (+2402.78%)
hydraA command-line utility for generating language-specific project structure.
Stars: ✭ 18 (-50%)
quickvisionAn Easy To Use PyTorch Computer Vision Library
Stars: ✭ 49 (+36.11%)
KoLMKorean text normalization and language preparation package for LM in Kaldi-based ASR system
Stars: ✭ 46 (+27.78%)
salutejsSmartApp Framework for building skills for the "Салют" (Salute) family of virtual assistants in JavaScript
Stars: ✭ 35 (-2.78%)
bert-squeeze🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (+55.56%)
pytorch-lightning-templateAn easy-to-adapt PyTorch-Lightning template. A simple, easy-to-use wrapper template: slightly modify your original PyTorch code to adapt it to Lightning. You can translate your previous PyTorch code much more easily using this template while keeping the freedom to edit all the functions. Big-project-friendly as well.
Stars: ✭ 555 (+1441.67%)
fastfaceLight Face Detection using PyTorch Lightning
Stars: ✭ 71 (+97.22%)
tidal-looperDifferent looper variants for SuperDirt to provide live sampling in TidalCycles.
Stars: ✭ 55 (+52.78%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+469.44%)
speechrecA simple speech recognition app using the Web Speech API interfaces
Stars: ✭ 18 (-50%)
deepfillv2-pylightningClean minimal implementation of Free-Form Image Inpainting with Gated Convolutions in PyTorch Lightning. Inspired by the PyTorch implementation by @avalonstrel.
Stars: ✭ 13 (-63.89%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2236.11%)
BrainMaGeBrain extraction in presence of abnormalities, using single and multiple MRI modalities
Stars: ✭ 23 (-36.11%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (+30.56%)
Android-TTS-STTOne-line solution for the Android text-to-speech (TTS) and speech-to-text (STT) translation problem
Stars: ✭ 77 (+113.89%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (+36.11%)
AESRC2020A deep accent recognition network
Stars: ✭ 35 (-2.78%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+38.89%)