All Projects → BridgetteSong → ExpressiveTacotron

BridgetteSong / ExpressiveTacotron

Licence: other
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to ExpressiveTacotron

Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (+3107.84%)
Mutual labels:  tacotron
Mimic Recording Studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Stars: ✭ 202 (+296.08%)
Mutual labels:  tacotron
tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
Stars: ✭ 102 (+100%)
Mutual labels:  tacotron
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+3758.82%)
Mutual labels:  tacotron
Multi Tacotron Voice Cloning
Phoneme multilingual(Russian-English) voice cloning based on
Stars: ✭ 192 (+276.47%)
Mutual labels:  tacotron
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+374.51%)
Mutual labels:  tacotron
Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: ✭ 43 (-15.69%)
Mutual labels:  tacotron
mimic2
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (+952.94%)
Mutual labels:  tacotron
Expressive tacotron
Tensorflow Implementation of Expressive Tacotron
Stars: ✭ 192 (+276.47%)
Mutual labels:  tacotron
FCH-TTS
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
Stars: ✭ 154 (+201.96%)
Mutual labels:  tacotron
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+223.53%)
Mutual labels:  tacotron
Tacotron Pytorch
Pytorch implementation of Tacotron
Stars: ✭ 189 (+270.59%)
Mutual labels:  tacotron
tacotron2
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Stars: ✭ 17 (-66.67%)
Mutual labels:  tacotron
Xva Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (+166.67%)
Mutual labels:  tacotron
TTS tf
WIP Tensorflow implementation of https://github.com/mozilla/TTS
Stars: ✭ 14 (-72.55%)
Mutual labels:  tacotron
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (+103.92%)
Mutual labels:  tacotron
Tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+4960.78%)
Mutual labels:  tacotron
Text-to-Speech-Landscape
No description or website provided.
Stars: ✭ 31 (-39.22%)
Mutual labels:  tacotron
Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Stars: ✭ 118 (+131.37%)
Mutual labels:  tacotron
vietTTS
Vietnamese Text to Speech library
Stars: ✭ 78 (+52.94%)
Mutual labels:  tacotron

Expressive Tacotron (implementation with Pytorch)

Introduction

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron.

The framework also includes various deep learning architectures such as Global Style Token (GST), Variational Autoencoder (VAE), and Gaussian Mixture Variational Autoencoder (GMVAE), and X-vectors for building prosody encoder.

  • Only provides kernel model files, not including data prepared scripts, training scripts and synthesis scripts
  • You can reference ExpressiveTacotron for more training scripts

Available recipes

Expressive Mode

Framework Mode

Differences

  • Non-attentive Tacotron: duration stacked convolution layers are concatenated with encoder outputs

Acknowledgements

This implementation uses code from the following repos: NVIDIA, ESPNet, ERISHA, ForwardAttention

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].