All Projects → sooftware → tacotron2

sooftware / tacotron2

Licence: Apache-2.0 license
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to tacotron2

Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (+29.41%)
Mutual labels:  speech-synthesis, tacotron, tacotron2
sova-tts-engine
Tacotron2 based engine for the SOVA-TTS project
Stars: ✭ 63 (+270.59%)
Mutual labels:  speech-synthesis, tacotron2
Xva Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (+700%)
Mutual labels:  speech-synthesis, tacotron
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+1323.53%)
Mutual labels:  speech-synthesis, tacotron
Expressive tacotron
Tensorflow Implementation of Expressive Tacotron
Stars: ✭ 192 (+1029.41%)
Mutual labels:  speech-synthesis, tacotron
Tts
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+31823.53%)
Mutual labels:  tacotron, tacotron2
mimic2
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (+3058.82%)
Mutual labels:  speech-synthesis, tacotron
TensorVox
Desktop application for neural speech synthesis written in C++
Stars: ✭ 140 (+723.53%)
Mutual labels:  speech-synthesis, tacotron2
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+11476.47%)
Mutual labels:  speech-synthesis, tacotron
Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: ✭ 43 (+152.94%)
Mutual labels:  speech-synthesis, tacotron
Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Stars: ✭ 118 (+594.12%)
Mutual labels:  tacotron, tacotron2
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+13911.76%)
Mutual labels:  speech-synthesis, tacotron2
tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
Stars: ✭ 102 (+500%)
Mutual labels:  tacotron, tacotron2
Tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+15082.35%)
Mutual labels:  speech-synthesis, tacotron
Tacotron pytorch
Tacotron implementation of pytorch
Stars: ✭ 12 (-29.41%)
Mutual labels:  speech-synthesis, tacotron
Tacotron Pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (+511.76%)
Mutual labels:  speech-synthesis, tacotron
Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (+9523.53%)
Mutual labels:  speech-synthesis, tacotron
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (+711.76%)
Mutual labels:  speech-synthesis
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+717.65%)
Mutual labels:  speech-synthesis
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+905.88%)
Mutual labels:  speech-synthesis

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

Pytorch implementation of Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

This implementation focuses as much as possible on the readability and extensibility of the code and the reproduction as it is in the paper (without Wavenet). I would appreciate it if you could feedback or contribution at any time if there was a mistake or an error.

Usage

import torch
import numpy as np
from tacotron2 import Tacotron2
from test.args import DefaultArgument

batch_size, seq_length = 3, 3

inputs = torch.LongTensor(np.arange(batch_size * seq_length).reshape(batch_size, seq_length))
input_lengths = torch.LongTensor([3, 3, 2])
targets = torch.FloatTensor(batch_size, 100, 80).uniform_(-0.1, 0.1)
args = DefaultArgument()

model = Tacotron2(args)
output = model(inputs, targets, input_lengths)

Installation

Currently we only support installation from source code using setuptools. Checkout the source code and run the following commands:

pip install -e .

Troubleshoots and Contributing

If you have any questions, bug reports, and feature requests, please open an issue on github or
contacts [email protected] please.

I appreciate any kind of feedback or contribution. Feel free to proceed with small issues like bug fixes, documentation improvement. For major contributions and new features, please discuss with the collaborators in corresponding issues.

Code Style

I follow PEP-8 for code style. Especially the style of docstrings is important to generate documentation.

Reference

Author

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].