lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-70.97%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-8.06%)
DeepspeechrecognitionA Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Stars: ✭ 1,421 (+1045.97%)
gluon2pytorchGluon to PyTorch deep neural network model converter
Stars: ✭ 72 (-41.94%)
pytorch2kerasPyTorch to Keras model convertor
Stars: ✭ 788 (+535.48%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-8.06%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-83.87%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-89.52%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-50.81%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-71.77%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-58.06%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+914.52%)
kaldi-allignerscripts to align a given wave to its transcription using trained models by Kaldi
Stars: ✭ 24 (-80.65%)
speech recognition ctcUse ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
Stars: ✭ 40 (-67.74%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+80.65%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-79.84%)
Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (+108.87%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (-67.74%)
pyro-visionComputer vision library for wildfire detection
Stars: ✭ 33 (-73.39%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-82.26%)
Kaldi GopComputes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.
Stars: ✭ 104 (-16.13%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-88.71%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-78.23%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+140.32%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+3555.65%)
Basic cnns tensorflow2A tensorflow2 implementation of some basic CNNs(MobileNetV1/V2/V3, EfficientNet, ResNeXt, InceptionV4, InceptionResNetV1/V2, SENet, SqueezeNet, DenseNet, ShuffleNetV2, ResNet).
Stars: ✭ 374 (+201.61%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+216.13%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+883.06%)
Segmentation modelsSegmentation models with pretrained backbones. Keras and TensorFlow Keras.
Stars: ✭ 3,575 (+2783.06%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+208.87%)
Pytorch classification利用pytorch实现图像分类的一个完整的代码,训练,预测,TTA,模型融合,模型部署,cnn提取特征,svm或者随机森林等进行分类,模型蒸馏,一个完整的代码
Stars: ✭ 395 (+218.55%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+222.58%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+3886.29%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-37.1%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+295.16%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (+229.03%)
CtcdecodePyTorch CTC Decoder bindings
Stars: ✭ 442 (+256.45%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+320.97%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+329.03%)
MedicalzoopytorchA pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation
Stars: ✭ 546 (+340.32%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+326.61%)
Cifar ZooPyTorch implementation of CNNs for CIFAR benchmark
Stars: ✭ 584 (+370.97%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+397.58%)
Keras Idiomatic ProgrammerBooks, Presentations, Workshops, Notebook Labs, and Model Zoo for Software Engineers and Data Scientists wanting to learn the TF.Keras Machine Learning framework
Stars: ✭ 720 (+480.65%)
Pytorch2kerasPyTorch to Keras model convertor
Stars: ✭ 676 (+445.16%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+4912.9%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+410.48%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-42.74%)