All Projects → facebookresearch → Wav2letter

facebookresearch / Wav2letter

Licence: other
Facebook AI Research's Automatic Speech Recognition Toolkit

Programming Languages

C++
36643 projects - #6 most used programming language
python
139335 projects - #7 most used programming language
shell
77523 projects
CMake
9771 projects
perl
6916 projects
c
50402 projects - #5 most used programming language
Dockerfile
14818 projects

Projects that are alternatives of or similar to Wav2letter

wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+2.01%)
Mutual labels:  end-to-end, speech-recognition, wav2letter
Automatic speech recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 2,751 (-53.43%)
Mutual labels:  speech-recognition, end-to-end
Kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (-96.78%)
Mutual labels:  speech-recognition, end-to-end
Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-98.97%)
Mutual labels:  end-to-end, speech-recognition
E2e Asr
PyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-98.21%)
Mutual labels:  speech-recognition, end-to-end
Rnn Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-98.07%)
Mutual labels:  speech-recognition, end-to-end
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (-99.66%)
Mutual labels:  end-to-end, speech-recognition
kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-92.28%)
Mutual labels:  end-to-end, speech-recognition
kosr
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-99.58%)
Mutual labels:  end-to-end, speech-recognition
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (-99.15%)
Mutual labels:  end-to-end, speech-recognition
Speech Transformer Tf2.0
transformer for ASR-systerm (via tensorflow2.0)
Stars: ✭ 90 (-98.48%)
Mutual labels:  speech-recognition, end-to-end
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (-94.84%)
Mutual labels:  speech-recognition, end-to-end
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-86.32%)
Mutual labels:  speech-recognition, end-to-end
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (-97.04%)
Mutual labels:  speech-recognition, end-to-end
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-99.64%)
Mutual labels:  end-to-end, speech-recognition
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.92%)
Mutual labels:  speech-recognition, wav2letter
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (-23.26%)
Mutual labels:  speech-recognition, end-to-end
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-91.7%)
Mutual labels:  speech-recognition
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-90.82%)
Mutual labels:  speech-recognition
Mycroft Precise
A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-91.86%)
Mutual labels:  speech-recognition

wav2letter++

CircleCI Join the chat at https://gitter.im/wav2letter/community

Important Note:

wav2letter has been moved and consolidated into Flashlight in the ASR application.

Future wav2letter development will occur in Flashlight.

To build the old, pre-consolidation version of wav2letter, checkout the wav2letter v0.2 release, which depends on the old Flashlight v0.2 release. The wav2letter-lua project can be found on the wav2letter-lua branch, accordingly.

For more information on wav2letter++, see or cite this arXiv paper.

Recipes

This repository includes recipes to reproduce the following research papers as well as pre-trained models:

Data preparation for training and evaluation can be found in data directory.

Building the Recipes

First, install Flashlight with the ASR application. Then, after cloning the project source:

mkdir build && cd build
cmake .. && make -j8

If Flashlight or ArrayFire are installed in nonstandard paths via a custom CMAKE_INSTALL_PREFIX, they can be found by passing

-Dflashlight_DIR=[PREFIX]/usr/share/flashlight/cmake/ -DArrayFire_DIR=[PREFIX]/usr/share/ArrayFire/cmake

when running cmake.

Join the wav2letter community

License

wav2letter++ is MIT-licensed, as found in the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].