kahne / SpeechTransProgress

License: CC0-1.0
Tracking the progress in end-to-end speech translation

Projects that are alternatives to or similar to SpeechTransProgress

Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-15.11%)
Mutual labels:  machine-translation, natural-language-generation, speech-processing
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+3161.15%)
Mutual labels:  machine-translation, speech-translation
rtg
Reader Translator Generator - NMT toolkit based on pytorch
Stars: ✭ 26 (-81.29%)
Mutual labels:  machine-translation, natural-language-generation
Nlg Eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+491.37%)
Mutual labels:  machine-translation, natural-language-generation
mtdata
A tool that locates, downloads, and extracts machine translation corpora
Stars: ✭ 95 (-31.65%)
Mutual labels:  machine-translation, natural-language-generation
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+13.67%)
Mutual labels:  speech-processing
uctf
Unsupervised Controllable Text Generation (Applied to text Formalization)
Stars: ✭ 19 (-86.33%)
Mutual labels:  natural-language-generation
apertium-html-tools
Web application providing a fully localised interface for text/website/document translation, analysis and generation powered by Apertium.
Stars: ✭ 36 (-74.1%)
Mutual labels:  machine-translation
NRC
Natural language generation for discrete data in EHRs
Stars: ✭ 19 (-86.33%)
Mutual labels:  natural-language-generation
banglanmt
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Stars: ✭ 91 (-34.53%)
Mutual labels:  machine-translation
nepali-translator
Neural Machine Translation on the Nepali-English language pair
Stars: ✭ 29 (-79.14%)
Mutual labels:  machine-translation
classy
classy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Stars: ✭ 61 (-56.12%)
Mutual labels:  natural-language-generation
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Stars: ✭ 3,125 (+2148.2%)
Mutual labels:  speech-processing
transformer-pytorch
A PyTorch implementation of Transformer in "Attention is All You Need"
Stars: ✭ 77 (-44.6%)
Mutual labels:  machine-translation
syntaxmaker
The NLG tool for Finnish
Stars: ✭ 19 (-86.33%)
Mutual labels:  natural-language-generation
vak
a neural network toolbox for animal vocalizations and bioacoustics
Stars: ✭ 21 (-84.89%)
Mutual labels:  speech-processing
nlp-notebooks
A collection of natural language processing notebooks.
Stars: ✭ 19 (-86.33%)
Mutual labels:  natural-language-generation
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+35.25%)
Mutual labels:  natural-language-generation
speechportal
(1st place at HopHacks) A dynamic webVR memory palace for speech training, utilizing natural language processing and Google Streetview API
Stars: ✭ 14 (-89.93%)
Mutual labels:  speech-processing
Natural-Language-Processing
Contains various architectures and novel paper implementations for Natural Language Processing tasks like Sequence Modelling and Neural Machine Translation.
Stars: ✭ 48 (-65.47%)
Mutual labels:  machine-translation

End-to-End Speech Translation Progress

Tutorial

Data

| Corpus | Direction | Target | Duration | License |
| --- | --- | --- | --- | --- |
| CoVoST 2 | {Fr, De, Es, Ca, It, Ru, Zh, Pt, Fa, Et, Mn, Nl, Tr, Ar, Sv, Lv, Sl, Ta, Ja, Id, Cy} -> En and En -> {De, Ca, Zh, Fa, Et, Mn, Tr, Ar, Sv, Lv, Sl, Ta, Ja, Id, Cy} | Text | 2880h | CC0 |
| CVSS | {Fr, De, Es, Ca, It, Ru, Zh, Pt, Fa, Et, Mn, Nl, Tr, Ar, Sv, Lv, Sl, Ta, Ja, Id, Cy} -> En | Text & Speech | 1900h | CC BY 4.0 |
| mTEDx | {Es, Fr, Pt, It, Ru, El} -> En, {Fr, Pt, It} -> Es, Es -> {Fr, It}, {Es, Fr} -> Pt | Text | 765h | CC BY-NC-ND 4.0 |
| CoVoST | {Fr, De, Nl, Ru, Es, It, Tr, Fa, Sv, Mn, Zh} -> En | Text | 700h | CC0 |
| MuST-C & MuST-Cinema | En -> {De, Es, Fr, It, Nl, Pt, Ro, Ru, Ar, Cs, Fa, Tr, Vi, Zh} | Text | 504h | CC BY-NC-ND 4.0 |
| How2 | En -> Pt | Text | 300h | YouTube & CC BY-SA 4.0 |
| Augmented LibriSpeech | En -> Fr | Text | 236h | CC BY 4.0 |
| Europarl-ST | {En, Fr, De, Es, It, Pt, Pl, Ro, Nl} -> {En, Fr, De, Es, It, Pt, Pl, Ro, Nl} | Text | 280h | CC BY-NC 4.0 |
| Kosp2e | Ko -> En | Text | 198h | Mixed CC |
| Fisher + Callhome | Es -> En | Text | 160h+20h | LDC |
| MaSS | {En, Es, Eu, Fi, Fr, Hu, Ro, Ru} -> {En, Es, Eu, Fi, Fr, Hu, Ro, Ru} | Text & Speech | 172h | Bible.is |
| LibriVoxDeEn | De -> En | Text | 110h | CC BY-NC-SA 4.0 |
| BSTC | Zh -> En | Text | 68h | |
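
To look up which corpora cover a given language pair, the table can be encoded as simple records. The following is a minimal sketch, assuming a hand-copied subset of the rows above; the `Corpus` class and the `corpora_for` function are hypothetical names chosen for illustration and do not belong to any toolkit. Durations are the corpus totals from the table, not per-direction figures, so the ordering is only a rough guide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Corpus:
    name: str
    sources: frozenset  # source-language codes as written in the table
    targets: frozenset  # target-language codes as written in the table
    hours: int          # total corpus duration, not per language pair


# Hand-copied subset of the table rows (illustrative, not exhaustive).
CORPORA = [
    Corpus(
        "CoVoST",
        frozenset({"Fr", "De", "Nl", "Ru", "Es", "It", "Tr", "Fa", "Sv", "Mn", "Zh"}),
        frozenset({"En"}),
        700,
    ),
    Corpus("How2", frozenset({"En"}), frozenset({"Pt"}), 300),
    Corpus("Augmented LibriSpeech", frozenset({"En"}), frozenset({"Fr"}), 236),
    Corpus("Kosp2e", frozenset({"Ko"}), frozenset({"En"}), 198),
    Corpus("Fisher + Callhome", frozenset({"Es"}), frozenset({"En"}), 160 + 20),
    Corpus("LibriVoxDeEn", frozenset({"De"}), frozenset({"En"}), 110),
    Corpus("BSTC", frozenset({"Zh"}), frozenset({"En"}), 68),
]


def corpora_for(src: str, tgt: str) -> list[str]:
    """Return names of corpora covering src -> tgt, largest total duration first."""
    matches = [c for c in CORPORA if src in c.sources and tgt in c.targets]
    return [c.name for c in sorted(matches, key=lambda c: c.hours, reverse=True)]


if __name__ == "__main__":
    print(corpora_for("De", "En"))
    print(corpora_for("Es", "En"))
```

Extending the list with the remaining rows only requires adding more `Corpus` entries; corpora with several direction groups (such as mTEDx) would need one entry per group under this representation.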

Toolkit

Paper

2021

2020

2019

2018

2017

2016

2013

Contact

Changhan Wang ([email protected])
