myG2P - Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (+34.38%)
asr24 - 24-hour Automatic Speech Recognition
Stars: ✭ 27 (-15.62%)
dialectID siam - Dialect identification using Siamese network
Stars: ✭ 15 (-53.12%)
leopard - On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+1006.25%)
AESRC2020 - Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Stars: ✭ 40 (+25%)
wenet - Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+7350%)
ASR-Audio-Data-Links - A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+459.38%)
DeepPhonemizer - Grapheme to phoneme conversion with deep learning.
Stars: ✭ 152 (+375%)
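Wukong Robot - 🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project, and possibly the first open-source smart speaker project to support brain-computer interaction.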
Stars: ✭ 3,110 (+9618.75%)
KoLM - Korean text normalization and language preparation package for LM in Kaldi-based ASR system
Stars: ✭ 46 (+43.75%)
kospeech - Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+1325%)
syn-speech-samples - An application that demonstrates the usage of the Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-25%)
avsr-tf1 - Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (+137.5%)
opensnips - Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+56.25%)
G2P - Grapheme To Phoneme
Stars: ✭ 59 (+84.38%)
rustfst - Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+225%)
wav2vec2-live - Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+540.63%)
kaldi-alligner - Scripts to align a given wave file to its transcription using models trained with Kaldi
Stars: ✭ 24 (-25%)
rasr - The RWTH ASR Toolkit.
Stars: ✭ 43 (+34.38%)
PCPM - Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-34.37%)
Zeroth - Kaldi-based Korean ASR (Korean speech recognition) open-source project
Stars: ✭ 248 (+675%)
spokestack-ios - Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-15.62%)
Lingvo - Lingvo
Stars: ✭ 2,361 (+7278.13%)
TensorVox - Desktop application for neural speech synthesis written in C++
Stars: ✭ 140 (+337.5%)
Asr Evaluation - Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+493.75%)
ctc-asr - End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+250%)
AESRC2020 - A deep accent recognition network
Stars: ✭ 35 (+9.38%)
opensource-voice-tools - A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-34.37%)
simple diarizer - Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (-18.75%)
klaam - Arabic speech recognition, classification and text-to-speech.
Stars: ✭ 151 (+371.88%)
vosk-asterisk - Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+62.5%)
Hms Ml Demo - HMS ML Demo provides an example of integrating the Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, ASR, and TTS.
Stars: ✭ 187 (+484.38%)
spokestack-tray-android - A UI component that makes it easy to add voice interaction to your app.
Stars: ✭ 13 (-59.37%)
pie - Baidu Cloud streaming speech recognition client SDK
Stars: ✭ 62 (+93.75%)
megs - A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-34.37%)
Phomeme - Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-43.75%)
indexed-string-variation - Experimental JavaScript module to generate all possible variations of strings over an alphabet using an n-ary virtual tree
Stars: ✭ 16 (-50%)
speech-transformer - Transformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (+25%)
Cn2an - 📦 Quickly convert between "Chinese numerals" and "Arabic numerals" (latest features: conversion of fractions, dates, temperatures, etc.)
Stars: ✭ 249 (+678.13%)
Kerasdeepspeech - A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+665.63%)
jquery-alphaindex - jQuery plugin to create alphabetical indexes for your lists
Stars: ✭ 12 (-62.5%)
Edgedict - Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (+540.63%)
homoglyphs - Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.
Stars: ✭ 70 (+118.75%)
Kospeech - Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+493.75%)
edit-distance-papers - A curated list of papers dedicated to edit-distance as objective function
Stars: ✭ 49 (+53.13%)
g2pK - g2pK: g2p module for Korean
Stars: ✭ 137 (+328.13%)
lightning-asr - Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+12.5%)
torchain - WIP: PyTorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-37.5%)
speech course - YSDA course in Speech Processing.
Stars: ✭ 93 (+190.63%)