KuangDD / Zhrtvc

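Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.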

Labels

text-to-speech tts

Projects that are alternatives of or similar to Zhrtvc

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-53.05%)

Mutual labels: text-to-speech, tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (-63.16%)

Mutual labels: text-to-speech, tts

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-96.37%)

Mutual labels: text-to-speech, tts

Free and open source text-to-speech software

Stars: ✭ 355 (-53.96%)

Mutual labels: text-to-speech, tts

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (-43.32%)

Mutual labels: text-to-speech, tts

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (-11.54%)

Mutual labels: text-to-speech, tts

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-63.81%)

Mutual labels: text-to-speech, tts

An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak text aloud.

Stars: ✭ 29 (-96.24%)

Mutual labels: text-to-speech, tts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Stars: ✭ 617 (-19.97%)

Mutual labels: text-to-speech, tts

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-57.85%)

Mutual labels: text-to-speech, tts

google-translate-tts

Node library for Google Translate TTS (Text-to-Speech) API

Stars: ✭ 23 (-97.02%)

Mutual labels: text-to-speech, tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+603.89%)

Mutual labels: text-to-speech, tts

Text-to-speech and translation bot for Discord

Stars: ✭ 27 (-96.5%)

Mutual labels: text-to-speech, tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (-60.44%)

Mutual labels: text-to-speech, tts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-90.4%)

Mutual labels: text-to-speech, tts

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-97.15%)

Mutual labels: text-to-speech, tts

SAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM)

Stars: ✭ 33 (-95.72%)

Mutual labels: text-to-speech, tts

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-90.53%)

Mutual labels: text-to-speech, tts

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-59.53%)

Mutual labels: text-to-speech, tts

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-57.98%)

Mutual labels: text-to-speech, tts


zhrtvc

Chinese Real Time Voice Cloning

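Tip: zh is the language code for Chinese.

Follow the 【啊啦嘻哈】 WeChat official account and reply with the single character 听 ("listen"); the little girl has something to tell you ^v^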

Version

v1.2.6

See the README for usage instructions and notes.

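Notes:
- This document describes the new GMW version of the voice cloning framework, built from the ge2e (encoder), mellotron, and waveglow modules (GMW for short). It is simpler to run, gives more stable results, and produces higher-quality synthesized speech.
- For the ESV version, which adapts the Real-Time-Voice-Cloning project to support Chinese, see README-ESV. That version uses encoder, synthesizer, and vocoder modules (ESV for short) and is more complicated to run.
- Run the code from the zhrtvc code subdirectory inside the zhrtvc project.
- The project's default parameters are set for the sample data in the data directory and are meant only for running the whole pipeline end to end.
- The mellotron speech synthesizer and the waveglow vocoder are recommended; mellotron provides several modes to suit different tasks.

Chinese corpus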

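The Chinese speech corpus zhvoice offers clearer, more natural speech. It comprises 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.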

Chinese models

Scan the QR code above, follow the **【啊啦嘻哈】 WeChat official account**, and reply with 中文语音克隆模型走起 to get the model files on Baidu Netdisk.

Synthesis samples

Directory of synthesized speech samples

Directory overview

zhrtvc

See the readme file in the zhrtvc directory for code-related documentation.

models

Download the pretrained models from Baidu Netdisk, extract them, and replace the models folder with the extracted one.
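
The replacement step can be scripted. The following is a minimal sketch, assuming the download is a zip archive that may or may not contain a top-level `models` folder; the function name `replace_models` and the staging folder name are hypothetical and not part of zhrtvc.

```python
import shutil
import zipfile
from pathlib import Path


def replace_models(archive_path, project_root, folder_name="models"):
    """Extract a model archive and swap it in for the existing models folder.

    Assumption: the archive is a .zip file. Other archive formats would
    need a different extraction step.
    """
    root = Path(project_root)
    target = root / folder_name
    staging = root / (folder_name + "_incoming")  # hypothetical staging name

    # Start from a clean staging directory.
    if staging.exists():
        shutil.rmtree(staging)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(staging)

    # Use the inner "models" folder if the archive wraps its content in one.
    inner = staging / folder_name
    source = inner if inner.is_dir() else staging

    # Remove the old folder, then move the new one into place.
    if target.exists():
        shutil.rmtree(target)
    shutil.move(str(source), str(target))
    if staging.exists():
        shutil.rmtree(staging)
    return target
```

Extracting into a staging directory first means the old models folder is removed only after the archive has been unpacked successfully.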

data

Sample corpus, including speech audio and the aligned text.

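Note:

This sample corpus is only for testing that the models run. It contains far too little data for a model to converge, so training on it will not produce a usable model.

Once the test run works, convert your own data to the format of the sample corpus and train the models on it.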
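
Before training on your own data, it can help to check that it matches the expected layout. The sketch below assumes, purely for illustration, a UTF-8 metadata file with one `<audio path><TAB><text>` pair per line; check the files in the data directory for the actual format. The function name `check_metadata` is hypothetical.

```python
from pathlib import Path
from typing import List, Tuple


def check_metadata(metadata_path: str, audio_root: str) -> Tuple[List[Tuple[str, str]], List[str]]:
    """Return (valid entries, problem descriptions) for a metadata file.

    Assumed format (not confirmed by the project): each non-empty line is
    "<relative audio path>\t<transcript>".
    """
    entries: List[Tuple[str, str]] = []
    problems: List[str] = []
    root = Path(audio_root)

    with open(metadata_path, encoding="utf-8") as handle:
        for line_no, line in enumerate(handle, start=1):
            line = line.rstrip("\r\n")
            if not line.strip():
                continue  # skip blank lines
            parts = line.split("\t")
            if len(parts) != 2 or not parts[1].strip():
                problems.append(f"line {line_no}: expected '<audio path><TAB><text>'")
                continue
            audio, text = parts
            if not (root / audio).is_file():
                problems.append(f"line {line_no}: missing audio file {audio}")
                continue
            entries.append((audio, text))
    return entries, problems
```

Collecting problems instead of raising on the first one lets you fix a whole corpus in a single pass.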

Discussion

[AI Solutions Discussion Group] QQ group: 925294583

Click the link to join the group chat: https://jq.qq.com/?_wv=1027&k=wlQzvT0N

Real-Time Voice Cloning

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. Feel free to check my thesis if you're curious or looking for information I haven't documented yet (don't hesitate to open an issue for that too). I mostly recommend a quick look at the figures beyond the introduction.

SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
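
The data flow through the three stages can be sketched as follows. This is a toy illustration only: the function names (`encode_speaker`, `synthesize_mel`, `vocode`, `clone_voice`) and the constants are hypothetical stand-ins, not this repository's API, and simple arithmetic takes the place of the trained networks.

```python
import math
from typing import List

EMBED_DIM = 8  # illustrative embedding size, not the real model's
N_MELS = 4     # illustrative number of mel channels
HOP = 16       # illustrative waveform samples per mel frame


def encode_speaker(reference_audio: List[float]) -> List[float]:
    """Stage 1 stand-in: reduce a reference utterance to a fixed-size, unit-norm vector."""
    chunk = max(1, len(reference_audio) // EMBED_DIM)
    raw = [sum(reference_audio[i * chunk:(i + 1) * chunk]) / chunk for i in range(EMBED_DIM)]
    norm = math.sqrt(sum(x * x for x in raw)) or 1.0
    return [x / norm for x in raw]


def synthesize_mel(text: str, embedding: List[float]) -> List[List[float]]:
    """Stage 2 stand-in: one fake mel frame per character, shifted by the speaker embedding."""
    frames = []
    for ch in text:
        base = (ord(ch) % 32) / 32.0
        frames.append([base + embedding[k % len(embedding)] for k in range(N_MELS)])
    return frames


def vocode(mel: List[List[float]]) -> List[float]:
    """Stage 3 stand-in: expand each mel frame into HOP waveform samples."""
    wav: List[float] = []
    for frame in mel:
        level = sum(frame) / len(frame)
        wav.extend(level * math.sin(2 * math.pi * n / HOP) for n in range(HOP))
    return wav


def clone_voice(reference_audio: List[float], text: str) -> List[float]:
    """Chain the stages: audio -> embedding -> mel spectrogram -> waveform."""
    embedding = encode_speaker(reference_audio)
    mel = synthesize_mel(text, embedding)
    return vocode(mel)


# A synthetic "recording" stands in for a few seconds of reference audio.
reference = [math.sin(0.01 * i) for i in range(16000)]
wav = clone_voice(reference, "你好")
print(len(wav))  # 2 characters x HOP samples = 32
```

The point of the structure is that the speaker embedding is computed once from the reference audio and then passed to the synthesizer as a conditioning input, so the same text can be rendered in different voices by changing only the embedding.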


Stars: ✭ 771
