All Projects â†’ CoEDL â†’ kaldi_helpers

CoEDL / kaldi_helpers

Licence: Apache-2.0 license
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to kaldi helpers

kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (+61.54%)
Mutual labels:  automatic-speech-recognition, speech-to-text, kaldi, transcription
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+2623.08%)
Mutual labels:  automatic-speech-recognition, speech-to-text, transcription
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+2923.08%)
Mutual labels:  speech, speech-to-text, kaldi
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+85676.92%)
Mutual labels:  speech, speech-to-text, kaldi
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+846.15%)
Mutual labels:  speech, automatic-speech-recognition, speech-to-text
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (+7.69%)
Mutual labels:  speech, speech-to-text, kaldi
Lhotse
Stars: ✭ 236 (+1715.38%)
Mutual labels:  speech, kaldi
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+1784.62%)
Mutual labels:  speech, speech-to-text
speech-to-text
Python helper for Google and IBM Watson speech-to-text cloud APIs.
Stars: ✭ 14 (+7.69%)
Mutual labels:  speech-to-text, transcription
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+16030.77%)
Mutual labels:  speech, kaldi
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (+307.69%)
Mutual labels:  speech, speech-to-text
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+1476.92%)
Mutual labels:  speech, speech-to-text
Setk
Tools for Speech Enhancement integrated with Kaldi
Stars: ✭ 227 (+1646.15%)
Mutual labels:  speech, kaldi
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+1476.92%)
Mutual labels:  speech, speech-to-text
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+1761.54%)
Mutual labels:  speech, speech-to-text
Lingvo
Lingvo
Stars: ✭ 2,361 (+18061.54%)
Mutual labels:  speech, speech-to-text
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+592.31%)
Mutual labels:  speech, automatic-speech-recognition
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+1276.92%)
Mutual labels:  speech, speech-to-text
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (+61.54%)
Mutual labels:  speech, speech-to-text
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+884.62%)
Mutual labels:  speech, speech-to-text
This repository is now deprecated in favour of [Elpis](https://github.com/CoEDL/elpis).

CoEDL Kaldi Helpers

A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition toolkit.

Requirements

This pipeline relies on Python 3.6 and several open-source Python packages (listed here). It also assumes you have Kaldi, sox and task installed.

Tasks

This library uses the task tool to run the more complex processes automatically. Once you've set up Kaldi Helpers, you can run the various pipeline tasks we've developed. Read the Taskfile for more information about the available tasks.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].