ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+7.69%)
DeepspeechrecognitionA Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Stars: ✭ 1,421 (+1266.35%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+10622.12%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-79.81%)
Kaldi GopComputes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.
Stars: ✭ 104 (+0%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (+67.31%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+97.12%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+401.92%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+82.69%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+82.69%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-80.77%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+2192.31%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+97.12%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-79.81%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-65.38%)
LingvoLingvo
Stars: ✭ 2,361 (+2170.19%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+240.38%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-50%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-78.85%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+18.27%)
Umbrella"A collection of functional programming libraries that can be composed together.
Unlike a framework, thi.ng is a suite of instruments and you (the user) must be
the composer of. Geared towards versatility, not any specific type of music."
— @loganpowell via Twitter
Stars: ✭ 2,186 (+2001.92%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+292.31%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+139.42%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-55.77%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-79.81%)
rocket-pipesPowerful pipes for TypeScript, that chain Promise and ADT for you 🚌 -> ⛰️ -> 🚠 -> 🏂 -> 🚀
Stars: ✭ 18 (-82.69%)
PostmortemA simple debug library for Clojure(Script) that features data-oriented logging and tracing
Stars: ✭ 143 (+37.5%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-14.42%)
visual-automataVisual Automata is a Python 3 library built as a wrapper for the Automata library to add more visualization features.
Stars: ✭ 55 (-47.12%)
geojson-to-wfs-t-2A lightweight javascript module to format WFS-T-2 statements from GeoJSON features
Stars: ✭ 21 (-79.81%)
KodiSharpUse Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (-78.85%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+44.23%)
linderaA morphological analysis library.
Stars: ✭ 226 (+117.31%)
AESRC2020Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Stars: ✭ 40 (-61.54%)
praiseDo stuff with your voice in the browser.
Stars: ✭ 13 (-87.5%)
rspark▁▂▆▇▁▄█▁ Sparklines for Rust apps
Stars: ✭ 50 (-51.92%)
ftorftor enables ML-like type-directed, functional programming with Javascript including reasonable debugging.
Stars: ✭ 44 (-57.69%)
SwiLexA universal lexer library in Swift.
Stars: ✭ 29 (-72.12%)
avsr-tf1Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (-26.92%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-79.81%)
doSimplest way to manage asynchronicity
Stars: ✭ 33 (-68.27%)
React.aiIt recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-63.46%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-71.15%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-70.19%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-82.69%)
iOSProjectsIt's project that contains different applications developed with Swift 5.7 👨💻👩🏼💻🧑🏿💻
Stars: ✭ 122 (+17.31%)
treesNo description or website provided.
Stars: ✭ 54 (-48.08%)
frame transpilerFrame is a markdown language for creating state machines (automata) in 8 programming languages as well as generating UML documentation.
Stars: ✭ 35 (-66.35%)
myG2PMyanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (-58.65%)
algosA collection of algorithms in rust
Stars: ✭ 16 (-84.62%)
elm-collageCreate interactive vector graphics and position them relative to each other
Stars: ✭ 57 (-45.19%)
lexLex is an implementation of lex tool in Ruby.
Stars: ✭ 49 (-52.88%)