Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+793.94%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (+236.36%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (+57.58%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+845.45%)

Mutual labels: text-to-speech, tts, speech-synthesis, transformer

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (+124.24%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (+121.21%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-6.06%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+996.97%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (+9.09%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (+227.27%)

Mutual labels: text-to-speech, tts, speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+4760.61%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+4857.58%)

Mutual labels: text-to-speech, tts, speech-synthesis

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (+257.58%)

Mutual labels: text-to-speech, speech, tts

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (+212.12%)

Mutual labels: text-to-speech, tts, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (+269.7%)

Mutual labels: text-to-speech, tts, speech-synthesis

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (+5048.48%)

Mutual labels: text-to-speech, tts, speech-synthesis

View All Similar Projects ➔

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

This repo contains only model Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration paper.

Citation

@misc{tang2021zeroshot,
      title={Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration}, 
      author={Chuanxin Tang and Chong Luo and Zhiyuan Zhao and Dacheng Yin and Yucheng Zhao and Wenjun Zeng},
      year={2021},
      eprint={2109.05426},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}

Note

This repo only contain model implementation, not dataloader and training code, also it is not well tested from my side.
For more complete TTS or Speech Synthesis solution please visit DeepSync .

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

rishikksh20 / Zero-Shot-TTS

Programming Languages

Labels

Projects that are alternatives of or similar to Zero-Shot-TTS

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Citation

Note