😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Stars: ✭ 2,382 (+3163.01%)

Mutual labels: text-to-speech, tts, speech-synthesis, vocoder

Durian

Implementation of the paper "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf).

Stars: ✭ 111 (+52.05%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (+1.37%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+304.11%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-57.53%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-50.68%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+235.62%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (-8.22%)

Mutual labels: text-to-speech, tts, speech-synthesis, vocoder

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+2097.26%)

Mutual labels: text-to-speech, tts, speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (+90.41%)

Mutual labels: text-to-speech, tts, speech-synthesis

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-10.96%)

Mutual labels: speech, tts, speech-synthesis

TensorVox

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (+91.78%)

Mutual labels: text-to-speech, tts, speech-synthesis


Fre-GAN Vocoder

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Training:

python train.py --config config.json
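
The training command above passes a JSON file through a `--config` flag. As a rough illustration of that pattern only, the sketch below shows how a script could parse such a flag and load the file with Python's standard library. The function names `parse_args` and `load_config` are hypothetical, and nothing here is taken from the repository's actual `train.py` or reflects the real keys in its `config.json`.

```python
import argparse
import json


def parse_args(argv=None):
    """Parse the command line; --config is the only flag shown in the command above."""
    parser = argparse.ArgumentParser(description="Hypothetical training entry point")
    parser.add_argument("--config", required=True, help="Path to a JSON config file")
    return parser.parse_args(argv)


def load_config(path):
    """Read the JSON file at `path` and return its contents as a dict."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def main(argv=None):
    """Load the config and return it; a real script would start training here."""
    args = parse_args(argv)
    config = load_config(args.config)
    # The real keys depend on the repository's config.json, so none are assumed.
    return config
```

Calling `main(["--config", "config.json"])` returns the parsed dictionary, which makes it easy to inspect the available settings before a long training run.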

Citation:

@misc{kim2021fregan,
      title={Fre-GAN: Adversarial Frequency-consistent Audio Synthesis}, 
      author={Ji-Hoon Kim and Sang-Hoon Lee and Ji-Hyun Lee and Seong-Whan Lee},
      year={2021},
      eprint={2106.02297},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

Note

For a more complete, end-to-end voice cloning or text-to-speech (TTS) toolbox, please visit Deepsync Technologies.

References:


rishikksh20 / Fre-GAN-pytorch


