r9y9 / ttslearn

Licence: MIT License

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Programming Languages

Jupyter Notebook

11667 projects

python

139335 projects - #7 most used programming language

shell

77523 projects

Projects that are alternatives of or similar to ttslearn

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+1118.99%)

Mutual labels: speech, speech-synthesis, wavenet, speech-processing, wavenet-vocoder

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+1219.62%)

Mutual labels: dnn, tts, speech-synthesis, seq2seq, attention-mechanism

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+86.71%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis, speech-processing

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+1.9%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-66.46%)

Mutual labels: text-to-speech, tts, speech-synthesis, speech-processing

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-67.09%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-53.16%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-79.11%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-53.8%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+55.06%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-80.38%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Lingvo

Stars: ✭ 2,361 (+1394.3%)

Mutual labels: speech, tts, speech-synthesis, seq2seq

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+331.65%)

Mutual labels: text-to-speech, tts, speech-synthesis, wavenet

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+432.28%)

Mutual labels: text-to-speech, tts, speech-synthesis, speech-processing

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-29.75%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+129.11%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-77.22%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-31.65%)

Mutual labels: text-to-speech, speech, tts, speech-synthesis

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-58.86%)

Mutual labels: speech, tts, speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+915.19%)

Mutual labels: text-to-speech, tts, speech-synthesis

View All Similar Projects ➔

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

日本語は以下に続きます (Japanese follows)

English: This book is written in Japanese and primarily focuses on Japanese TTS. Some of the functionality (e.g., neural network implementations) in this codebase can be used for other languages. However, we didn't prepare any guide or code for non-Japanese TTS systems. We may extend the codebase for other languages in the future but cannot guarantee if we would work on it.

Installation

pip install ttslearn

リポジトリの構成

ttslearn: 「Pythonで学ぶ音声合成」のために作成された、音声合成のコアライブラリです。 pip install ttslearn としてインストールされるライブラリの実体です。書籍のサンプルコードとしてだけでなく、汎用的な音声合成のライブラリとしてもご利用いただけます。
notebooks: 第4章から第10章までの、Jupyter notebook形式のソースコードです。
hydra: 第6章で解説している hydra のサンプルコードです。
recipes: 第6章、第8章、第10章で解説している、日本語音声合成のレシピです。JSUTコーパスを利用した日本語音声合成システムの実装が含まれています。
extra_recipes: 発展的な音声合成のレシピです。書籍では解説していませんが、ttslearn ライブラリの利用例として、JSUTコーパス、JVSコーパスを用いた音声合成のレシピをリポジトリに含めています。

詳細なドキュメントは、https://r9y9.github.io/ttslearn/ を参照してください。

ライセンス

ソースコードのライセンスはMITです。商用・非商用問わずに、お使いいただけます。詳細は LICENSEファイルを参照してください。

学習済みモデルの利用規約

本リポジトリのリリースページでは、JSUTコーパス・JVSコーパスを用いて学習した、学習済みモデルを配布しています。それらの学習済みモデルは、「非商用目的」でのみ利用可能です。学習済みモデルを利用する際は、各コーパスの利用規約も併せてご確認ください。

また、作者は、学習済みモデルの利用による一切の請求、損害、その他の義務について何らの責任も負わないものとします。

付録

付録として、日本語音声合成のフルコンテキストラベルの仕様をまとめています。詳細は、docs/appendix.pdf を参照してください。

問い合わせ

書籍の内容、ソースコードに関する質問などありましたら、GitHub issue にてお問い合わせをいただければ、可能な限り返答します。

お詫びと訂正

本書の正誤表を以下のリンク先でまとめています。

本書の正誤表

もし、正誤表に記載されていない誤植などの間違いを見つけた場合は、GitHub issue にてご連絡ください。

謝辞

Tacotron 2の一部ソースコードは、ESPnetを元に作られました。(thanks to @kan-bayashi)
発展的なレシピの実装のほとんどにおいて、kan-bayashi/ParallelWaveGANを利用しました。
日本語音声合成のテキスト処理には、Open JTalk およびそのPythonラッパーを利用しました。

リンク

Amazon: https://www.amazon.co.jp/dp/4295012270/
インプレス書籍情報: https://book.impress.co.jp/books/1120101073

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

r9y9 / ttslearn

Programming Languages

Labels

Projects that are alternatives of or similar to ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Installation

リポジトリの構成

ライセンス

学習済みモデルの利用規約

付録

問い合わせ

お詫びと訂正

謝辞

リンク