speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

Stars: ✭ 259 (+108.87%)

Mutual labels: speech, kaldi

speech-transformer

Transformer implementation specialized in speech recognition tasks using PyTorch.

Stars: ✭ 40 (-67.74%)

Mutual labels: speech, asr

pyro-vision

Computer vision library for wildfire detection

Stars: ✭ 33 (-73.39%)

Mutual labels: densenet, resnet

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-82.26%)

Mutual labels: speech-recognition, asr

Docker Kaldi Gstreamer Server

Dockerfile for kaldi-gstreamer-server.

Stars: ✭ 266 (+114.52%)

Mutual labels: asr, kaldi

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.

Stars: ✭ 104 (-16.13%)

Mutual labels: speech-recognition, kaldi

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-88.71%)

Mutual labels: speech-recognition, asr

asr24

24-hour Automatic Speech Recognition

Stars: ✭ 27 (-78.23%)

Mutual labels: kaldi, asr

Tianchi Medical Lungtumordetect

Tianchi Medical AI Competition [Season 1]: intelligent lung nodule diagnosis with UNet/VGG/Inception/ResNet/DenseNet

Stars: ✭ 314 (+153.23%)

Mutual labels: resnet, densenet

Asr theory

Speech recognition theory, papers, and slides (PPT)

Stars: ✭ 344 (+177.42%)

Mutual labels: asr, kaldi

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (+140.32%)

Mutual labels: speech-recognition, speech

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+3555.65%)

Mutual labels: speech-recognition, kaldi

Basic cnns tensorflow2

A TensorFlow 2 implementation of some basic CNNs (MobileNetV1/V2/V3, EfficientNet, ResNeXt, InceptionV4, InceptionResNetV1/V2, SENet, SqueezeNet, DenseNet, ShuffleNetV2, ResNet).

Stars: ✭ 374 (+201.61%)

Mutual labels: resnet, densenet

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (+216.13%)

Mutual labels: speech-recognition, asr

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+883.06%)

Mutual labels: speech-recognition, speech

Segmentation models

Segmentation models with pretrained backbones. Keras and TensorFlow Keras.

Stars: ✭ 3,575 (+2783.06%)

Mutual labels: resnet, densenet

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (+208.87%)

Mutual labels: speech-recognition, asr

Pytorch classification

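A complete PyTorch codebase for image classification: training, prediction, TTA, model ensembling, model deployment, CNN feature extraction, classification with SVM or random forest, and model distillation.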

Stars: ✭ 395 (+218.55%)

Mutual labels: resnet, densenet

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords

Stars: ✭ 400 (+222.58%)

Mutual labels: speech-recognition, ctc

Awesome Very Deep Learning

♾ A curated list of papers and code about very deep neural networks

Stars: ✭ 435 (+250.81%)

Mutual labels: resnet, densenet

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System

Stars: ✭ 4,943 (+3886.29%)

Mutual labels: speech-recognition, ctc

Wav2letter

Speech recognition model based on a FAIR research paper, built using PyTorch.

Stars: ✭ 78 (-37.1%)

Mutual labels: speech-recognition, asr

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+295.16%)

Mutual labels: speech-recognition, speech

Specaugment

An implementation of SpecAugment (introduced by Google Brain) with TensorFlow & PyTorch

Stars: ✭ 408 (+229.03%)

Mutual labels: speech-recognition, speech

Ctcdecode

PyTorch CTC Decoder bindings

Stars: ✭ 442 (+256.45%)

Mutual labels: ctc, decoder

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+320.97%)

Mutual labels: speech-recognition, asr

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+329.03%)

Mutual labels: speech-recognition, speech

Medicalzoopytorch

A PyTorch-based deep learning framework for multi-modal 2D/3D medical image segmentation

Stars: ✭ 546 (+340.32%)

Mutual labels: resnet, densenet

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (+326.61%)

Mutual labels: speech-recognition, ctc

Cifar Zoo

PyTorch implementation of CNNs for CIFAR benchmark

Stars: ✭ 584 (+370.97%)

Mutual labels: resnet, densenet

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+397.58%)

Mutual labels: speech-recognition, asr

Keras Idiomatic Programmer

Books, Presentations, Workshops, Notebook Labs, and Model Zoo for Software Engineers and Data Scientists wanting to learn the TF.Keras Machine Learning framework

Stars: ✭ 720 (+480.65%)

Mutual labels: resnet, densenet

Pytorch2keras

PyTorch to Keras model converter

Stars: ✭ 676 (+445.16%)

Mutual labels: resnet, densenet

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+4912.9%)

Mutual labels: speech-recognition, speech

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)