739 open source projects that are alternatives to or similar to VAENAR-TTS

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (+125.76%)

Mutual labels: text-to-speech, duration, tts, speech-synthesis, vae, self-attention, neural-tts, non-autoregressive

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (+62.12%)

Mutual labels: text-to-speech, tts, speech-synthesis, neural-tts, non-autoregressive, non-ar

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (-16.67%)

Mutual labels: text-to-speech, duration, tts, speech-synthesis, neural-tts, non-autoregressive

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-37.88%)

Mutual labels: text-to-speech, tts, speech-synthesis, neural-tts, non-autoregressive

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+143.94%)

Mutual labels: text-to-speech, tts, speech-synthesis, neural-tts

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (+110.61%)

Mutual labels: text-to-speech, tts, speech-synthesis, non-autoregressive

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to improve the robustness and efficiency of the model.

Stars: ✭ 22 (-66.67%)

Mutual labels: text-to-speech, tts, speech-synthesis, neural-tts

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-21.21%)

Mutual labels: text-to-speech, tts, speech-synthesis

TensorVox

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (+112.12%)

Mutual labels: text-to-speech, tts, speech-synthesis

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+1174.24%)

Mutual labels: text-to-speech, tts, speech-synthesis

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (+68.18%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-45.45%)

Mutual labels: text-to-speech, tts, speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-50%)

Mutual labels: text-to-speech, tts, speech-synthesis

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java

Stars: ✭ 1,699 (+2474.24%)

Mutual labels: text-to-speech, tts, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-21.21%)

Mutual labels: text-to-speech, tts, speech-synthesis

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+330.3%)

Mutual labels: text-to-speech, tts, speech-synthesis

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+392.42%)

Mutual labels: text-to-speech, tts, speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+63.64%)

Mutual labels: text-to-speech, tts, speech-synthesis

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (+56.06%)

Mutual labels: text-to-speech, tts, speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (+84.85%)

Mutual labels: text-to-speech, tts, speech-synthesis

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (-34.85%)

Mutual labels: text-to-speech, tts, speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Stars: ✭ 2,382 (+3509.09%)

Mutual labels: text-to-speech, tts, speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+2330.3%)

Mutual labels: text-to-speech, tts, speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (+1.52%)

Mutual labels: text-to-speech, tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (+10.61%)

Mutual labels: text-to-speech, tts, speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+139.39%)

Mutual labels: text-to-speech, tts, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (+12.12%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+271.21%)

Mutual labels: text-to-speech, tts, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-53.03%)

Mutual labels: text-to-speech, tts, speech-synthesis

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-57.58%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+322.73%)

Mutual labels: text-to-speech, tts, speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-19.7%)

Mutual labels: text-to-speech, tts, speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+372.73%)

Mutual labels: text-to-speech, tts, speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+390.91%)

Mutual labels: text-to-speech, tts, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+2378.79%)

Mutual labels: text-to-speech, tts, speech-synthesis

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+346.97%)

Mutual labels: text-to-speech, tts, speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (+63.64%)

Mutual labels: text-to-speech, tts, speech-synthesis

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+721.21%)

Mutual labels: tts, speech-synthesis, unsupervised-learning

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-57.58%)

Mutual labels: text-to-speech, tts, speech-synthesis

Voice Builder

An open-source text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+448.48%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

Stars: ✭ 682 (+933.33%)

Mutual labels: text-to-speech, tts, speech-synthesis

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.

Stars: ✭ 799 (+1110.61%)

Mutual labels: text-to-speech, speech-synthesis

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-34.85%)

Mutual labels: text-to-speech, speech-synthesis

Speaker

A PHP library to convert text to speech using various web services

Stars: ✭ 86 (+30.3%)

Mutual labels: text-to-speech, tts

Gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

Stars: ✭ 1,303 (+1874.24%)

Mutual labels: text-to-speech, tts

Zhrtvc

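Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.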