Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+284.69%)

Mutual labels: tts, speech-synthesis, speech-recognition

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (-87.82%)

Mutual labels: tts, speech-synthesis, unsupervised-learning

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+172.88%)

Mutual labels: speech-recognition, asr, sequence-to-sequence

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-93.91%)

Mutual labels: tts, speech-synthesis, transformer

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+13.84%)

Mutual labels: speech-recognition, asr, transformer

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+36.16%)

Mutual labels: speech-recognition, asr, ctc

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-42.44%)

Mutual labels: speech-synthesis, tts, transformer

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-81%)

Mutual labels: speech-recognition, speech-synthesis, tts

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (-79.34%)

Mutual labels: speech-recognition, asr, ctc

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (-64.94%)

Mutual labels: speech-recognition, asr, transformer

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-80.07%)

Mutual labels: tts, speech-synthesis, transformer

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+339.85%)

Mutual labels: transformer, speech-recognition, asr

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-89.85%)

Mutual labels: tts, speech-synthesis

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-96.13%)

Mutual labels: speech-recognition, asr

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (-92.62%)

Mutual labels: transformer, asr

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-92.44%)

Mutual labels: tts, speech-synthesis

TinyCog

Small Robot, Toy Robot platform

Stars: ✭ 29 (-94.65%)

Mutual labels: speech-synthesis, speech-recognition

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-3.69%)

Mutual labels: speech-recognition, asr

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (-72.51%)

Mutual labels: tts, speech-synthesis

spoken-word

Spoken Word

Stars: ✭ 46 (-91.51%)

Mutual labels: tts, speech-synthesis

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-96.49%)

Mutual labels: speech-recognition, asr

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (-90.77%)

Mutual labels: speech-recognition, ctc

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-62.18%)

Mutual labels: speech-synthesis, speech-recognition

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-90.41%)

Mutual labels: speech-recognition, asr

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-93.36%)

Mutual labels: speech-recognition, asr

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (-87.64%)

Mutual labels: tts, speech-synthesis

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Stars: ✭ 217 (-59.96%)

Mutual labels: tts, speech-synthesis

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (-96.31%)

Mutual labels: speech-recognition, asr

porfir

Голосовой ассистент Порфирьевич

Stars: ✭ 23 (-95.76%)

Mutual labels: speech-synthesis, speech-recognition

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (-92.07%)

Mutual labels: tts, speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-9.59%)

Mutual labels: speech-recognition, speech-synthesis

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+1479.34%)

Mutual labels: speech-synthesis, speech-recognition

torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

Stars: ✭ 42 (-92.25%)

Mutual labels: asr, ctc

Autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Stars: ✭ 485 (-10.52%)

Mutual labels: unsupervised-learning, speech-synthesis

voicekit-examples

Examples on how to use Tinkoff Voicekit

Stars: ✭ 35 (-93.54%)

Mutual labels: speech-synthesis, speech-recognition

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (-70.85%)

Mutual labels: tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-86.53%)

Mutual labels: tts, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-86.35%)

Mutual labels: tts, speech-synthesis

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (-2.4%)

Mutual labels: speech-recognition, ctc

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-97.42%)

Mutual labels: speech-recognition, asr

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-95.94%)

Mutual labels: speech-recognition, asr

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-94.83%)

Mutual labels: tts, speech-synthesis

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-50%)

Mutual labels: speech-recognition, asr

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-77.31%)

Mutual labels: speech-recognition, asr

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-95.94%)

Mutual labels: tts, speech-synthesis

1-60 of 1643 similar projects

›

next*5