Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+6850%)

Mutual labels: automatic-speech-recognition

Listen Attend Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Stars: ✭ 147 (+390%)

Mutual labels: asr

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (+200%)

Mutual labels: automatic-speech-recognition

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (+276.67%)

Mutual labels: automatic-speech-recognition

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+273.33%)

Mutual labels: asr

Speech-Corpus-Collection

A Collection of Speech Corpus for ASR and TTS

Stars: ✭ 113 (+276.67%)

Mutual labels: asr

KoLM

Korean text normalization and language preparation package for LM in Kaldi-based ASR system

Stars: ✭ 46 (+53.33%)

Mutual labels: asr

Cn2an

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (+730%)

Mutual labels: asr

asr24

24-hour Automatic Speech Recognition

Stars: ✭ 27 (-10%)

Mutual labels: asr

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (+1420%)

Mutual labels: asr

Asr syllable

基于卷积神经网络的语音识别声学模型的研究

Stars: ✭ 127 (+323.33%)

Mutual labels: asr

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+583.33%)

Mutual labels: asr

pie

百度云流式语音识别客户端 SDK

Stars: ✭ 62 (+106.67%)

Mutual labels: asr

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-33.33%)

Mutual labels: asr

automatic speech recognition

Vietnamese Automatic Speech Recognition

Stars: ✭ 58 (+93.33%)

Mutual labels: automatic-speech-recognition

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (+280%)

Mutual labels: asr

Hms Ml Demo

HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.

Stars: ✭ 187 (+523.33%)

Mutual labels: asr

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (+80%)

Mutual labels: automatic-speech-recognition

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+6890%)

Mutual labels: asr

AESRC2020

Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).

Stars: ✭ 40 (+33.33%)

Mutual labels: asr

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+420%)

Mutual labels: asr

rasr

The RWTH ASR Toolkit.

Stars: ✭ 43 (+43.33%)

Mutual labels: asr

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.