Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → danklabs → tts_dataset_maker

danklabs / tts_dataset_maker

Licence: MIT license

A gui to help make a text to speech dataset.

Programming Languages

184084 projects - #8 most used programming language

75241 projects

56736 projects

Labels

nodejs text-to-speech tts electron-app javscript

Projects that are alternatives of or similar to tts dataset maker

EMPHASIS-pytorch

EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System

Stars: ✭ 15 (-25%)

Mutual labels: text-to-speech, tts

macOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.

Stars: ✭ 53 (+165%)

Mutual labels: text-to-speech, tts

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (+230%)

Mutual labels: text-to-speech, tts

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Stars: ✭ 118 (+490%)

Mutual labels: text-to-speech, tts

SpeakIt Vietnamese TTS

Vietnamese Text-to-Speech on Windows Project (zalo-speech)

Stars: ✭ 81 (+305%)

Mutual labels: text-to-speech, tts

A Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features

Stars: ✭ 16 (-20%)

Mutual labels: text-to-speech, tts

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+4105%)

Mutual labels: text-to-speech, tts

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (+65%)

Mutual labels: text-to-speech, tts

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (+175%)

Mutual labels: text-to-speech, tts

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (+105%)

Mutual labels: text-to-speech, tts

Ukrainian TTS (text-to-speech) using Coqui TTS

Stars: ✭ 74 (+270%)

Mutual labels: text-to-speech, tts

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Stars: ✭ 163 (+715%)

Mutual labels: text-to-speech, tts

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊

Stars: ✭ 64 (+220%)

Mutual labels: text-to-speech, tts

WIP Tensorflow implementation of https://github.com/mozilla/TTS

Stars: ✭ 14 (-30%)

Mutual labels: text-to-speech, tts

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (+435%)

Mutual labels: text-to-speech, tts

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+440%)

Mutual labels: text-to-speech, tts

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+705%)

Mutual labels: text-to-speech, tts

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (+600%)

Mutual labels: text-to-speech, tts

Text-to-Speach golang package based in Amazon Polly service

Stars: ✭ 19 (-5%)

Mutual labels: text-to-speech, tts

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Stars: ✭ 105 (+425%)

Mutual labels: text-to-speech, tts

View All Similar Projects ➔

TTS Dataset Maker

Make your own text to speech dataset with this tool. Why should you make one? To replicate people's voices kinda like this and much more.
Fair Warning: It's way harder than you think and this will make it a little less harder

Resources(More to come):

Here is a sentdex video about voice cloning.

ScreenShots:

Download

Install the application in your computer. You can find it in the releases section.

The dataset folder will look like(Similar to LJ speech dataset):

Destination folder:
  -wavs   <===== folder containing the clips
  -metadata.csv <===== csv file containing the clip name and corresponding text

Pr's are welcome

Todo:

To Do:

I/O:

Better Responsive UI.
Add some way to begin from where it was left off.
Add timeline to wavesurfer.
Add keyboard shortcuts for the activities.
Add yt and audio link support.
Better Readme

Core additional features:

Add slow mo option for playback.
Remove silent parts from the clip.

Send me your queries @ [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 20

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗