t13m / kaldi-readers-for-tensorflow

Licence: other

readers that enable reading kaldi ark in tensorflow

Programming Languages

C++

36643 projects - #6 most used programming language

python

139335 projects - #7 most used programming language

CMake

9771 projects

Projects that are alternatives of or similar to kaldi-readers-for-tensorflow

rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (+550%)

Mutual labels: asr, kaldi-asr

kaldi-alligner

scripts to align a given wave to its transcription using trained models by Kaldi

Stars: ✭ 24 (+50%)

Mutual labels: asr, kaldi-asr

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (+50%)

Mutual labels: asr

kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (+56.25%)

Mutual labels: asr

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (+18.75%)

Mutual labels: asr

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (+31.25%)

Mutual labels: asr

torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

Stars: ✭ 20 (+25%)

Mutual labels: asr

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+14800%)

Mutual labels: asr

torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

Stars: ✭ 42 (+162.5%)

Mutual labels: asr

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (+212.5%)

Mutual labels: asr

commonvoice-utils

Linguistic processing for Common Voice

Stars: ✭ 32 (+100%)

Mutual labels: asr

NTUA-slp-nlp

💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA

Stars: ✭ 19 (+18.75%)

Mutual labels: kaldi-asr

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (+68.75%)

Mutual labels: asr

simple diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Stars: ✭ 26 (+62.5%)

Mutual labels: asr

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (+225%)

Mutual labels: asr

AESRC2020

a deep accent recognition network

Stars: ✭ 35 (+118.75%)

Mutual labels: asr

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (+225%)

Mutual labels: asr

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (+125%)

Mutual labels: asr

Speech-Recognition

End-to-End Speech Recognition using Neural Networks.

Stars: ✭ 31 (+93.75%)

Mutual labels: asr

sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

Stars: ✭ 45 (+181.25%)

Mutual labels: kaldi-asr

View All Similar Projects ➔

Kaldi ark readers for tensorflow

Introduction

This project aims to enable reading kaldi ark files into tensorflow. It adds following operators to tensorflow:

read_kaldi_matrix(rspecific)
decode_kaldi_matrix(data, element-type)
read_kaldi_post_and_ali(rspecific, is_reading_post(bool))
decode_kaldi_ali(data, element-type, is_reading_post(bool))

For kaldi matrix, only float matrix are supported. Please pass "tf.float32" in through the element-type argument.

For compressed kaldi matrix, only compression method 2 (kSpeechFeature) is supported.

For alignment, user need to specify weather is reading posteriors or pdfs by the argument is_reading_post. Operator decode_kaldi_ali produces alignment pdfs output, in format of a one dimension int32 tensor (a int32 tensor). Please pass "tf.int32" in through the element-type argument.

Contributions are welcome. Feel free to fork and send pull request or to create issues.

Build

Prerequisitions:

Linux
GCC version > 5.1 (for use of c++11 regex)
python with tensorflow installed

Steps:

git clone
cd kaldi-reader-standalone
mkdir build && cd build
cmake .. -DPYTHONBIN=/path/to/your/correct/version/of/python
make

Then you are all set.

Usage example

    kaldi_module = kaldi_module = tf.load_op_library("/path/to/this/project/libkaldi_readers.so")
    feats_raw_value = kaldi_module.read_kaldi_matrix("/path/to/somearks/file1.ark:2321")
    feats_value = kaldi_module.decode_kaldi_matrix(feats_raw_value, tf.float32)
    feats_value.set_shape([None, num_dim])

There are some examples under the example directory. To run them, please modify the library path (in contents of the python files) to the correct path.

cd example
python read-compressed-matrix.py
python read-uncompressed-matrix.py
python read-post.py
python read-ali.py

Author

Fan Ziye

Reference

Kaldi: https://github.com/kaldi-asr/kaldi Tensorflow: https://www.tensorflow.org/extend/adding_an_op

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

t13m / kaldi-readers-for-tensorflow

Programming Languages

Labels

Projects that are alternatives of or similar to kaldi-readers-for-tensorflow

Kaldi ark readers for tensorflow

Introduction

Build

Usage example

Author

Reference