Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → flashlight → wav2letter

flashlight / wav2letter

Licence: other

Facebook AI Research's Automatic Speech Recognition Toolkit

Programming Languages

36643 projects - #6 most used programming language

139335 projects - #7 most used programming language

Jupyter Notebook

11667 projects

77523 projects

9771 projects

6916 projects

Labels

deep-learning end-to-end speech-recognition wav2letter

Projects that are alternatives of or similar to wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 5,907 (-1.97%)

Mutual labels: end-to-end, speech-recognition, wav2letter

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (-86.59%)

Mutual labels: end-to-end, speech-recognition

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (-24.78%)

Mutual labels: end-to-end, speech-recognition

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (-98.11%)

Mutual labels: end-to-end, speech-recognition

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (-99.59%)

Mutual labels: end-to-end, speech-recognition

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (-94.94%)

Mutual labels: end-to-end, speech-recognition

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (-98.24%)

Mutual labels: end-to-end, speech-recognition

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (-92.43%)

Mutual labels: end-to-end, speech-recognition

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (-54.35%)

Mutual labels: end-to-end, speech-recognition

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (-96.85%)

Mutual labels: end-to-end, speech-recognition

Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 21 (-99.65%)

Mutual labels: end-to-end, speech-recognition

Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Stars: ✭ 61 (-98.99%)

Mutual labels: end-to-end, speech-recognition

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-97.96%)

Mutual labels: speech-recognition, wav2letter

Speech Transformer Tf2.0

transformer for ASR-systerm (via tensorflow2.0)

Stars: ✭ 90 (-98.51%)

Mutual labels: end-to-end, speech-recognition

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (-97.1%)

Mutual labels: end-to-end, speech-recognition

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-99.67%)

Mutual labels: end-to-end, speech-recognition

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (-99.17%)

Mutual labels: end-to-end, speech-recognition

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-99.4%)

Mutual labels: speech-recognition

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-96.28%)

Mutual labels: speech-recognition

Library of state-of-the-art models (PyTorch) for NLP tasks

Stars: ✭ 92 (-98.47%)

Mutual labels: speech-recognition

View All Similar Projects ➔

wav2letter++

Important Note:

wav2letter has been moved and consolidated into Flashlight in the ASR application.

Future wav2letter development will occur in Flashlight.

To build the old, pre-consolidation version of wav2letter, checkout the wav2letter v0.2 release, which depends on the old Flashlight v0.2 release. The wav2letter-lua project can be found on the wav2letter-lua branch, accordingly.

For more information on wav2letter++, see or cite this arXiv paper.

Recipes

This repository includes recipes to reproduce the following research papers as well as pre-trained models. All results reproduction must use Flashlight <= 0.3.2 for exact reproducability. Papers contained here include:

Data preparation for training and evaluation can be found in data directory.

Building the Recipes

First, install Flashlight (using the 0.3 branch is required) with the ASR application.

mkdir build && cd build
cmake .. && make -j8

If Flashlight or ArrayFire are installed in nonstandard paths via a custom CMAKE_INSTALL_PREFIX, they can be found by passing

-Dflashlight_DIR=[PREFIX]/usr/share/flashlight/cmake/ -DArrayFire_DIR=[PREFIX]/usr/share/ArrayFire/cmake

when running cmake.

Join the wav2letter community

Facebook page: https://www.facebook.com/groups/717232008481207/
Google group: https://groups.google.com/forum/#!forum/wav2letter-users
Contact: [email protected], [email protected], [email protected], [email protected], [email protected], [email protected], [email protected], [email protected], [email protected]

License

wav2letter++ is MIT-licensed, as found in the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 6,026

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (95) 🔗