All Projects → Sharad24 → Neural Voice Cloning With Few Samples

Sharad24 / Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Neural Voice Cloning With Few Samples

ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-25.12%)
Mutual labels:  speech, speech-synthesis, speech-processing
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+39.81%)
Mutual labels:  speech, speech-synthesis, speech-processing
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+40.76%)
Mutual labels:  speech, speech-synthesis, speech-processing
Wavenet vocoder
WaveNet vocoder
Stars: ✭ 1,926 (+812.8%)
Mutual labels:  speech, speech-synthesis, speech-processing
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+132.23%)
Mutual labels:  speech, speech-synthesis
Voice Builder
An opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+71.56%)
Mutual labels:  speech, speech-synthesis
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-85.31%)
Mutual labels:  speech, speech-synthesis
Lingvo
Lingvo
Stars: ✭ 2,361 (+1018.96%)
Mutual labels:  speech, speech-synthesis
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-64.93%)
Mutual labels:  speech, speech-synthesis
Wsay
Windows "say"
Stars: ✭ 36 (-82.94%)
Mutual labels:  speech, speech-synthesis
React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (-13.74%)
Mutual labels:  speech, speech-processing
Nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (+45.97%)
Mutual labels:  speech-synthesis, speech-processing
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-34.12%)
Mutual labels:  speech, speech-synthesis
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+144.55%)
Mutual labels:  speech, speech-processing
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-58.29%)
Mutual labels:  speech, speech-processing
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-47.39%)
Mutual labels:  speech, speech-synthesis
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+683.89%)
Mutual labels:  speech-synthesis, speech-processing
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (-34.6%)
Mutual labels:  speech, speech-synthesis
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-75.36%)
Mutual labels:  speech, speech-synthesis
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-65.4%)
Mutual labels:  speech, speech-synthesis

Neural-Voice-Cloning-with-Few-Samples

Implementation of the paper titled "Neural Voice Cloning with Few Samples" by Baidu link

A checkpoint for the encoder trained on 56k epochs with a loss of 0.0810 can be found in the checkpoints directory

The repository is only partially complete. https://github.com/SforAiDl/Neural-Voice-Cloning-With-Few-Samples describes how to train and generate voice samples using Speaker Adaptation approach. Link to generated voice samples can be found there as well. All the future commits will happen on https://github.com/SforAiDl/Neural-Voice-Cloning-With-Few-Samples repository.

Acknowledgements

The implementation of deep voice 3 was done from the following repository: https://github.com/r9y9/deepvoice3_pytorch

Cite

If you find the code in the repository useful, please cite it using:

@misc{chitlangia2021voicecloning,
  author = {Chitlangia, Sharad and Rastogi, Mehul and Ganguly, Rijul},
  title = {An Open Source Implementation of Neural Voice Cloning With Few Samples},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {Available at \url{https://github.com/SforAiDl/Neural-Voice-Cloning-With-Few-Samples/} or \url{https://github.com/Sharad24/Neural-Voice-Cloning-With-Few-Samples/}},
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].