
t13m / kaldi-readers-for-tensorflow

Licence: other
readers that enable reading kaldi ark in tensorflow

Programming Languages

  • C++
  • Python
  • CMake

Projects that are alternatives of or similar to kaldi-readers-for-tensorflow

rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+550%)
Mutual labels:  asr, kaldi-asr
kaldi-alligner
scripts to align a given wave to its transcription using trained models by Kaldi
Stars: ✭ 24 (+50%)
Mutual labels:  asr, kaldi-asr
syn-speech-samples
An application that demonstrates the usage of the Syn.Speech library for speech recognition
Stars: ✭ 24 (+50%)
Mutual labels:  asr
kosr
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (+56.25%)
Mutual labels:  asr
vosk-model-ru-adaptation
No description or website provided.
Stars: ✭ 19 (+18.75%)
Mutual labels:  asr
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (+31.25%)
Mutual labels:  asr
torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (+25%)
Mutual labels:  asr
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+14800%)
Mutual labels:  asr
torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+162.5%)
Mutual labels:  asr
opensnips
Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+212.5%)
Mutual labels:  asr
commonvoice-utils
Linguistic processing for Common Voice
Stars: ✭ 32 (+100%)
Mutual labels:  asr
NTUA-slp-nlp
💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (+18.75%)
Mutual labels:  kaldi-asr
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (+68.75%)
Mutual labels:  asr
simple diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (+62.5%)
Mutual labels:  asr
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+225%)
Mutual labels:  asr
AESRC2020
a deep accent recognition network
Stars: ✭ 35 (+118.75%)
Mutual labels:  asr
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+225%)
Mutual labels:  asr
lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (+125%)
Mutual labels:  asr
Speech-Recognition
End-to-End Speech Recognition using Neural Networks.
Stars: ✭ 31 (+93.75%)
Mutual labels:  asr
sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (+181.25%)
Mutual labels:  kaldi-asr

Kaldi ark readers for tensorflow

Introduction

This project enables reading Kaldi ark files into TensorFlow. It adds the following operators to TensorFlow:

  • read_kaldi_matrix(rspecific)
  • decode_kaldi_matrix(data, element-type)
  • read_kaldi_post_and_ali(rspecific, is_reading_post(bool))
  • decode_kaldi_ali(data, element-type, is_reading_post(bool))

For Kaldi matrices, only float matrices are supported. Pass tf.float32 as the element-type argument.

For compressed Kaldi matrices, only compression method 2 (kSpeechFeature) is supported.
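
To make the matrix case more concrete, here is a minimal pure-Python sketch of how an uncompressed binary Kaldi float matrix can be decoded. It is not this project's C++ implementation. It assumes the standard Kaldi binary layout (the marker "\0B", the token "FM ", then rows and cols each written as a size byte followed by a little-endian int32, then row-major float32 data), and the function name decode_float_matrix is hypothetical.

```python
import struct


def decode_float_matrix(payload: bytes):
    """Decode an uncompressed binary Kaldi float matrix into a list of rows.

    Assumed layout (standard Kaldi binary format, not taken from this project):
    binary marker, token "FM ", rows, cols, then row-major float32 values.
    """
    if payload[:2] != b"\x00B":
        raise ValueError("not a binary Kaldi object")
    if payload[2:5] != b"FM ":
        raise ValueError("only uncompressed float matrices (token 'FM') are handled")
    pos = 5
    dims = []
    for _ in range(2):
        # Each integer is stored as a size byte (4) followed by the int32 itself.
        if payload[pos] != 4:
            raise ValueError("expected a 4-byte integer")
        dims.append(struct.unpack_from("<i", payload, pos + 1)[0])
        pos += 5
    rows, cols = dims
    values = struct.unpack_from("<%df" % (rows * cols), payload, pos)
    return [list(values[r * cols:(r + 1) * cols]) for r in range(rows)]


# Build a 2x3 matrix by hand, then decode it again.
sample = (
    b"\x00BFM "
    + b"\x04" + struct.pack("<i", 2)
    + b"\x04" + struct.pack("<i", 3)
    + struct.pack("<6f", 1.0, 2.0, 3.0, 4.0, 5.0, 6.0)
)
matrix = decode_float_matrix(sample)
print(matrix)
```

The sketch rejects anything other than the "FM" token, mirroring the float-only restriction described above. Compressed matrices use a different header and are not handled here.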

For alignments, the user needs to specify whether posteriors or pdfs are being read, using the is_reading_post argument. The decode_kaldi_ali operator outputs the alignment pdfs as a one-dimensional int32 tensor. Pass tf.int32 as the element-type argument.
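
As an illustration of what "a one-dimensional int32 tensor" corresponds to on disk, the following sketch decodes a binary Kaldi integer vector in pure Python. It assumes the standard Kaldi layout for alignments (the marker "\0B", one byte giving the element size, a little-endian int32 element count, then the raw elements). The function name decode_int32_vector is hypothetical, and the project's own operator may handle more cases, such as posteriors.

```python
import struct


def decode_int32_vector(payload: bytes):
    """Decode a binary Kaldi vector<int32> (e.g. an alignment) into a list.

    Assumed layout (standard Kaldi binary format, not taken from this project):
    binary marker, element-size byte, int32 count, then the raw int32 values.
    """
    if payload[:2] != b"\x00B":
        raise ValueError("not a binary Kaldi object")
    if payload[2] != 4:
        raise ValueError("expected 4-byte (int32) elements")
    (count,) = struct.unpack_from("<i", payload, 3)
    return list(struct.unpack_from("<%di" % count, payload, 7))


# Three hypothetical pdf ids, encoded by hand and decoded again.
sample = b"\x00B" + b"\x04" + struct.pack("<i", 3) + struct.pack("<3i", 10, 10, 42)
pdfs = decode_int32_vector(sample)
print(pdfs)
```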

Contributions are welcome. Feel free to fork the project and send a pull request, or to open an issue.

Build

Prerequisites:

  • Linux
  • GCC version > 5.1 (for C++11 regex support)
  • Python with TensorFlow installed

Steps:

  1. git clone
  2. cd kaldi-reader-standalone
  3. mkdir build && cd build
  4. cmake .. -DPYTHONBIN=/path/to/your/correct/version/of/python
  5. make

Then you are all set.
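
If it is unclear which interpreter to pass as PYTHONBIN, a small check like the one below may help. It is only a sketch: it assumes that PYTHONBIN should point at the interpreter that has TensorFlow installed, which is how the prerequisites above read, and it prints the cmake line for the interpreter that runs the script.

```python
import importlib.util
import sys

# Absolute path of the interpreter running this script.
python_bin = sys.executable

# Check that TensorFlow is importable from this interpreter without importing it.
has_tensorflow = importlib.util.find_spec("tensorflow") is not None

if has_tensorflow:
    print("cmake .. -DPYTHONBIN=" + python_bin)
else:
    print("TensorFlow was not found for " + python_bin + "; try another interpreter.")
```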

Usage example

    import tensorflow as tf

    # Load the op library produced by the build steps.
    kaldi_module = tf.load_op_library("/path/to/this/project/libkaldi_readers.so")
    feats_raw_value = kaldi_module.read_kaldi_matrix("/path/to/somearks/file1.ark:2321")
    feats_value = kaldi_module.decode_kaldi_matrix(feats_raw_value, tf.float32)
    # num_dim is the feature dimension of your data; the number of frames stays dynamic.
    feats_value.set_shape([None, num_dim])
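
The string passed to read_kaldi_matrix above follows the Kaldi convention of an ark path followed by a colon and a byte offset, which is also how entries appear in Kaldi scp files. The sketch below shows one way to split such strings in plain Python. The helper names split_rspecifier and parse_scp_line and the utterance id utt001 are hypothetical, and the code assumes every entry carries an explicit offset.

```python
def split_rspecifier(rspecifier: str):
    """Split "path/to/file.ark:offset" into (path, byte_offset)."""
    path, sep, offset = rspecifier.rpartition(":")
    if not sep or not path or not offset.isdigit():
        raise ValueError("expected '<ark path>:<byte offset>', got %r" % rspecifier)
    return path, int(offset)


def parse_scp_line(line: str):
    """Parse one scp line of the form "<utterance id> <ark path>:<offset>"."""
    key, rspecifier = line.strip().split(None, 1)
    path, offset = split_rspecifier(rspecifier)
    return key, path, offset


entry = parse_scp_line("utt001 /path/to/somearks/file1.ark:2321\n")
print(entry)
```

When iterating over an scp file, the second field of each line can be passed unchanged to read_kaldi_matrix; splitting it is only needed for inspection or validation.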

There are some examples in the example directory. Before running them, edit the library path inside each Python file so that it points to your built library.

  1. cd example
  2. python read-compressed-matrix.py
  3. python read-uncompressed-matrix.py
  4. python read-post.py
  5. python read-ali.py

Author

Fan Ziye

Reference

  • Kaldi: https://github.com/kaldi-asr/kaldi
  • Tensorflow: https://www.tensorflow.org/extend/adding_an_op
