All Projects → orbxball → timit-preprocessor

orbxball / timit-preprocessor

Licence: BSD-3-Clause license
Extract mfcc vectors and phones from TIMIT dataset

Programming Languages

shell
77523 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to timit-preprocessor

Automatic speech recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 2,751 (+19550%)
Mutual labels:  speech-recognition, data-preprocessing, timit-dataset
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (+50%)
Mutual labels:  speech-recognition, timit-dataset
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+14792.86%)
Mutual labels:  speech-recognition, timit-dataset
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (+21.43%)
Mutual labels:  speech-recognition, mfcc
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+14878.57%)
Mutual labels:  speech-recognition, timit
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+328.57%)
Mutual labels:  speech-recognition
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+50%)
Mutual labels:  speech-recognition
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (+28.57%)
Mutual labels:  speech-recognition
Android-TTS-STT
One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (+450%)
Mutual labels:  speech-recognition
rustfst
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+642.86%)
Mutual labels:  speech-recognition
VoiceBridge
VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (+21.43%)
Mutual labels:  speech-recognition
ml-with-audio
HF's ML for Audio study group
Stars: ✭ 104 (+642.86%)
Mutual labels:  speech-recognition
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (+28.57%)
Mutual labels:  speech-recognition
modelscript
REPO MOVED TO https://github.com/repetere/jsonstack-data - Data Science and Machine learning in JavaScript
Stars: ✭ 40 (+185.71%)
Mutual labels:  data-preprocessing
salutejs
SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (+150%)
Mutual labels:  speech-recognition
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+407.14%)
Mutual labels:  speech-recognition
google-voice
Ruby interaction with Google Voice
Stars: ✭ 16 (+14.29%)
Mutual labels:  phone
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (+121.43%)
Mutual labels:  speech-recognition
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (+0%)
Mutual labels:  speech-recognition
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (+42.86%)
Mutual labels:  speech-recognition

TIMIT Preprocessor

timit-preprocessor extract mfcc vectors and phones from TIMIT dataset for advanced use on speech recognition.

Overview

The TIMIT corpus of read speech is designed to provide speech data for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems. More information on website or Wiki

Installation

Note that to install Kaldi first by following the instructions in INSTALL.

(1)
go to tools/ and follow INSTALL instructions there.

(2) go to src/ and follow INSTALL instructions there.

After running the scripts instructed by INSTALL in tools/, there will be reminder as followed. Go and run it.

Kaldi Warning: IRSTLM is not installed by default anymore. If you need IRSTLM, use the script extras/install_irstlm.sh

After ensuring kaldi installation, we can start by running

git clone https://github.com/Jy-Liu/timit-preprocessor.git

Preprocessing

Steps

  1. Run ./convert_wav.sh only in the first time after cloning this repo.

  2. python3 parsing.py -h to see instructions parsing timit dataset for phone labels and raw intermediate files in folder data/material/.

  3. ./extract_mfcc.sh to extract mfcc vectors into .scp and .ark files.

Finally, there's a folder called data/ which contains all the outcomes in the belowing directory structure:

data/
|-- material
|   |-- test.lbl
|   `-- train.lbl
`-- processed
    |-- test.39.cmvn.ark
    |-- test.39.cmvn.scp
    |-- test.extract.log
    |-- train.39.cmvn.ark
    |-- train.39.cmvn.scp
    `-- train.extract.log

If you want to do further operations, there's a good repo called kaldi-io-for-python.

Contact

Feel free to contact me if there's any problems.

License

BSD 3-Clause License (2017), Jun-You Liu

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].