pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+27.79%)

Mutual labels: speech-recognition

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-89.95%)

Mutual labels: speech-recognition

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-90.19%)

Mutual labels: speech-recognition

Kaldiio

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (-90.25%)

Mutual labels: speech-recognition

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (-90.25%)

Mutual labels: speech-recognition

Rnnt Speech Recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

Stars: ✭ 158 (-90.37%)

Mutual labels: speech-recognition

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (-90.49%)

Mutual labels: speech-recognition

Clovacall

ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)

Stars: ✭ 151 (-90.8%)

Mutual labels: speech-recognition

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (-90.8%)

Mutual labels: speech-recognition

Swiftspeech

A speech recognition framework designed for SwiftUI.

Stars: ✭ 149 (-90.92%)

Mutual labels: speech-recognition

Speech Recognition Neural Network

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Stars: ✭ 148 (-90.98%)

Mutual labels: speech-recognition

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-91.1%)

Mutual labels: speech-recognition

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (-91.22%)

Mutual labels: speech-recognition

Dla

Deep learning for audio processing

Stars: ✭ 142 (-91.35%)

Mutual labels: speech-recognition

Aimybox Android Assistant

Embeddable custom voice assistant for Android applications

Stars: ✭ 139 (-91.53%)

Mutual labels: speech-recognition

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (-91.65%)

Mutual labels: speech-recognition

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-91.77%)

Mutual labels: speech-recognition

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-91.9%)

Mutual labels: speech-recognition

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (-91.96%)

Mutual labels: speech-recognition

Persephone

A tool for automatic phoneme transcription

Stars: ✭ 130 (-92.08%)

Mutual labels: speech-recognition

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-92.2%)

Mutual labels: speech-recognition

Alan Sdk Pcf

Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.

Stars: ✭ 128 (-92.2%)

Mutual labels: speech-recognition

Pytorch Speech Commands

Speech commands recognition with PyTorch

Stars: ✭ 128 (-92.2%)

Mutual labels: speech-recognition

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (-92.26%)

Mutual labels: speech-recognition

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+579.52%)

Mutual labels: speech-recognition

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+27.06%)

Mutual labels: speech-recognition

301-332 of 332 similar projects

first

‹