Sharad24 / Neural Voice Cloning With Few Samples

Implementation of the research paper "Neural Voice Cloning with Few Samples" by Baidu

Programming Languages

python

Labels

speech speech-synthesis speech-processing

Projects that are alternatives of or similar to Neural Voice Cloning With Few Samples

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (-25.12%)

Mutual labels: speech, speech-synthesis, speech-processing

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+39.81%)

Mutual labels: speech, speech-synthesis, speech-processing

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+40.76%)

Mutual labels: speech, speech-synthesis, speech-processing

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+812.8%)

Mutual labels: speech, speech-synthesis, speech-processing

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+132.23%)

Mutual labels: speech, speech-synthesis

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+71.56%)

Mutual labels: speech, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-85.31%)

Mutual labels: speech, speech-synthesis

Lingvo

Stars: ✭ 2,361 (+1018.96%)

Mutual labels: speech, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-64.93%)

Mutual labels: speech, speech-synthesis

Wsay

Windows "say"

Stars: ✭ 36 (-82.94%)

Mutual labels: speech, speech-synthesis

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (-13.74%)

Mutual labels: speech, speech-processing

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (+45.97%)

Mutual labels: speech-synthesis, speech-processing

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-34.12%)

Mutual labels: speech, speech-synthesis

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+144.55%)

Mutual labels: speech, speech-processing

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-58.29%)

Mutual labels: speech, speech-processing

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-47.39%)

Mutual labels: speech, speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+683.89%)

Mutual labels: speech-synthesis, speech-processing

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-34.6%)

Mutual labels: speech, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-75.36%)

Mutual labels: speech, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-65.4%)

Mutual labels: speech, speech-synthesis

Neural-Voice-Cloning-with-Few-Samples

Implementation of the paper "Neural Voice Cloning with Few Samples" by Baidu.

A checkpoint for the encoder, trained for 56k epochs with a loss of 0.0810, can be found in the checkpoints directory.
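As a rough illustration of how such a checkpoint might be located and opened, here is a minimal sketch. It assumes the checkpoint is a PyTorch file (the project builds on deepvoice3_pytorch) with one of the common extensions listed below. The function names `find_checkpoints` and `load_checkpoint`, the extension list, and the choice to map tensors to the CPU are illustrative assumptions, not part of the repository.

```python
from pathlib import Path


def find_checkpoints(directory="checkpoints", suffixes=(".pth", ".pt", ".tar")):
    """Return checkpoint-like files in `directory`, newest first.

    The suffix list is an assumption; adjust it to match the actual files.
    """
    root = Path(directory)
    if not root.is_dir():
        return []
    files = [p for p in root.iterdir() if p.is_file() and p.suffix in suffixes]
    return sorted(files, key=lambda p: p.stat().st_mtime, reverse=True)


def load_checkpoint(path):
    """Load a checkpoint onto the CPU. Requires PyTorch to be installed."""
    import torch  # imported lazily so the file listing works without PyTorch

    return torch.load(path, map_location="cpu")


if __name__ == "__main__":
    # Print whatever checkpoint files are present in the default directory.
    for checkpoint_path in find_checkpoints():
        print(checkpoint_path)
```

The structure of the loaded object (for example, a bare state dict versus a dictionary with extra keys) is not documented here, so inspect it before passing it to a model.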

This repository is only partially complete. The repository at https://github.com/SforAiDl/Neural-Voice-Cloning-With-Few-Samples describes how to train and generate voice samples using the speaker adaptation approach, and it also links to generated voice samples. All future commits will be made to that repository.

Acknowledgements

The Deep Voice 3 implementation is taken from the following repository: https://github.com/r9y9/deepvoice3_pytorch

Cite

If you find the code in the repository useful, please cite it using:

@misc{chitlangia2021voicecloning,
  author = {Chitlangia, Sharad and Rastogi, Mehul and Ganguly, Rijul},
  title = {An Open Source Implementation of Neural Voice Cloning With Few Samples},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {Available at \url{https://github.com/SforAiDl/Neural-Voice-Cloning-With-Few-Samples/} or \url{https://github.com/Sharad24/Neural-Voice-Cloning-With-Few-Samples/}},
}

Stars: ✭ 211