6218 open source projects that are alternatives to or similar to Tacotron_pytorch

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+1.24%)

Mutual labels: jupyter-notebook, speech, speech-synthesis

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+2142.56%)

Mutual labels: jupyter-notebook, speech, tacotron

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+26.03%)

Mutual labels: jupyter-notebook, speech, tacotron

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (-77.69%)

Mutual labels: speech, speech-synthesis

tacotron2

PyTorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Stars: ✭ 17 (-92.98%)

Mutual labels: speech-synthesis, tacotron

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-86.36%)

Mutual labels: speech, speech-synthesis

Comprehensive-Tacotron2

PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.

Stars: ✭ 22 (-90.91%)

Mutual labels: speech-synthesis, tacotron

Tacotron pytorch

Tacotron implementation in PyTorch

Stars: ✭ 12 (-95.04%)

Mutual labels: speech-synthesis, tacotron

Voice Builder

An open-source text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+49.59%)

Mutual labels: speech, speech-synthesis

Gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Stars: ✭ 460 (+90.08%)

Mutual labels: jupyter-notebook, speech-synthesis

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-78.51%)

Mutual labels: jupyter-notebook, speech-synthesis

Lingvo

Stars: ✭ 2,361 (+875.62%)

Mutual labels: speech, speech-synthesis

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-42.98%)

Mutual labels: speech, speech-synthesis

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (-33.47%)

Mutual labels: speech, speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-65.29%)

Mutual labels: speech, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-78.51%)

Mutual labels: speech, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-69.83%)

Mutual labels: speech, speech-synthesis

Pysptk

A Python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+22.73%)

Mutual labels: speech, speech-synthesis

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-72.31%)

Mutual labels: speech, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-87.19%)

Mutual labels: speech, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-85.12%)

Mutual labels: speech, speech-synthesis

Tf Wavenet vocoder

Wavenet and its applications with Tensorflow

Stars: ✭ 58 (-76.03%)

Mutual labels: jupyter-notebook, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+576.03%)

Mutual labels: speech-synthesis, tacotron

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-54.13%)

Mutual labels: speech, speech-synthesis

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Stars: ✭ 136 (-43.8%)

Mutual labels: speech-synthesis, tacotron

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-31.82%)

Mutual labels: speech, tacotron

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+695.87%)

Mutual labels: speech, speech-synthesis

Tacotron 2

DeepMind's Tacotron-2 Tensorflow implementation

Stars: ✭ 1,968 (+713.22%)

Mutual labels: speech-synthesis, tacotron

Expressive tacotron

Tensorflow Implementation of Expressive Tacotron

Stars: ✭ 192 (-20.66%)

Mutual labels: speech-synthesis, tacotron

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (-12.81%)

Mutual labels: speech, speech-synthesis

Tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+103.72%)

Mutual labels: speech, tacotron

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-73.14%)

Mutual labels: speech, speech-synthesis

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+21.9%)

Mutual labels: speech, speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-55.37%)

Mutual labels: speech, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (-49.59%)

Mutual labels: jupyter-notebook, speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (-34.71%)

Mutual labels: speech, speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-92.15%)

Mutual labels: speech, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-69.42%)

Mutual labels: speech, speech-synthesis

mimic2

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.

Stars: ✭ 537 (+121.9%)

Mutual labels: speech-synthesis, tacotron

Amazing Python Scripts

🚀 Curated collection of amazing Python scripts, from basic to advanced, with automation task scripts.

Stars: ✭ 229 (-5.37%)

Mutual labels: jupyter-notebook, speech

Source separation

Deep learning based speech source separation using Pytorch

Stars: ✭ 226 (-6.61%)

Mutual labels: jupyter-notebook, speech

Waveflow

A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"

Stars: ✭ 95 (-60.74%)

Mutual labels: jupyter-notebook, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+181.82%)

Mutual labels: jupyter-notebook, speech-synthesis

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+161.57%)

Mutual labels: jupyter-notebook, speech

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-82.23%)

Mutual labels: speech-synthesis, tacotron

Flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Stars: ✭ 546 (+125.62%)

Mutual labels: jupyter-notebook, speech-synthesis

Tacotron Pytorch

A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

Stars: ✭ 104 (-57.02%)

Mutual labels: speech-synthesis, tacotron

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+102.48%)

Mutual labels: speech, speech-synthesis

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-42.56%)

Mutual labels: speech, speech-synthesis

Tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Stars: ✭ 2,581 (+966.53%)

Mutual labels: speech-synthesis, tacotron

Nemo

NeMo: a toolkit for conversational AI