All Projects → gooofy → Py Kaldi Asr

gooofy / Py Kaldi Asr

Licence: apache-2.0
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Py Kaldi Asr

Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+384.62%)
Mutual labels:  wrapper, speech-recognition, asr, kaldi
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-86.54%)
Mutual labels:  speech-recognition, kaldi, asr
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-33.33%)
Mutual labels:  speech-recognition, kaldi, asr
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+769.87%)
Mutual labels:  speech-recognition, asr, kaldi
vosk-model-ru-adaptation
No description or website provided.
Stars: ✭ 19 (-87.82%)
Mutual labels:  speech-recognition, kaldi, asr
Zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+58.97%)
Mutual labels:  speech-recognition, asr, kaldi
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (+73.72%)
Mutual labels:  speech-recognition, asr, kaldi
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1244.23%)
Mutual labels:  speech-recognition, asr, kaldi
Zamia Speech
Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (+139.74%)
Mutual labels:  speech-recognition, asr, kaldi
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+373.08%)
Mutual labels:  speech-recognition, asr, kaldi
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (-3.21%)
Mutual labels:  speech-recognition, asr, kaldi
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+77.56%)
Mutual labels:  speech-recognition, asr, kaldi
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+417.95%)
Mutual labels:  speech-recognition, asr, kaldi
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (-20.51%)
Mutual labels:  speech-recognition, asr, kaldi
Asr benchmark
Program to benchmark various speech recognition APIs
Stars: ✭ 71 (-54.49%)
Mutual labels:  speech-recognition, asr
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-55.77%)
Mutual labels:  speech-recognition, asr
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-50%)
Mutual labels:  speech-recognition, asr
Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (-37.82%)
Mutual labels:  speech-recognition, asr
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+617.95%)
Mutual labels:  speech-recognition, kaldi
Ktspeechcrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-41.03%)
Mutual labels:  speech-recognition, asr

py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's online nnet3-chain decoders as convenient as possible. Kaldi's online GMM decoders are also supported.

Target audience are developers who would like to use kaldi-asr as-is for speech recognition in their application on GNU/Linux operating systems.

Constructive comments, patches and pull-requests are very welcome.

Getting Started

We recommend using pre-trained modules from the zamia-speech project to get started. There you will also find a tutorial complete with links to pre-built binary packages to get you up and running with free and open source speech recognition in a matter of minutes:

Zamia Speech Tutorial

Example Code

Simple wav file decoding:

from kaldiasr.nnet3 import KaldiNNet3OnlineModel, KaldiNNet3OnlineDecoder

MODELDIR    = 'data/models/kaldi-generic-en-tdnn_sp-latest'
WAVFILE     = 'data/dw961.wav'

kaldi_model = KaldiNNet3OnlineModel (MODELDIR)
decoder     = KaldiNNet3OnlineDecoder (kaldi_model)

if decoder.decode_wav_file(WAVFILE):

    s, l = decoder.get_decoded_string()

    print
    print u"*****************************************************************"
    print u"**", WAVFILE
    print u"**", s
    print u"** %s likelihood:" % MODELDIR, l
    print u"*****************************************************************"
    print

else:

    print "***ERROR: decoding of %s failed." % WAVFILE

Please check the examples directory for more example code.

Requirements

Setup Notes

Source

At the time of this writing kaldi-asr does not seem to have an official way to install it on a system.

So, for now we will rely on pkg-config to provide LIBS and CFLAGS for compilation: Create a file called kaldi-asr.pc somewhere in your PKG_CONFIG_PATH that provides this information - here is what such a file could look like (details depend on your OS environment):

kaldi_root=/opt/kaldi

Name: kaldi-asr
Description: kaldi-asr speech recognition toolkit
Version: 5.2
Requires: atlas
Libs: -L${kaldi_root}/tools/openfst/lib -L${kaldi_root}/src/lib -lkaldi-decoder -lkaldi-lat -lkaldi-fstext -lkaldi-hmm -lkaldi-feat -lkaldi-transform -lkaldi-gmm -lkaldi-tree -lkaldi-util -lkaldi-matrix -lkaldi-base -lkaldi-nnet3 -lkaldi-online2 -lkaldi-cudamatrix -lkaldi-ivector -lfst
Cflags: -I${kaldi_root}/src  -I${kaldi_root}/tools/openfst/include

make sure kaldi_root points to wherever your kaldi checkout lives in your filesystem.

ATLAS

You may need to install ATLAS headers even if you didn't need them to compile Kaldi.

$ sudo apt install libatlas-dev

License

My own code is Apache licensed unless otherwise noted in the script's copyright headers.

Some scripts and files are based on works of others, in those cases it is my intention to keep the original license intact. Please make sure to check the copyright headers inside for more information.

Author

Guenter Bartsch [email protected]
Kaldi 5.1 adaptation contributed by mariasmo https://github.com/mariasmo
Kaldi GMM model support contributed by David Zurow https://github.com/daanzu
Python > 3.5 support contributed by Jakob Kruse https://github.com/jakob1111996

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].