azraelkuan / voice-conversion

Licence: other

an tutorial implement of voice conversion using pytorch

Programming Languages

python

139335 projects - #7 most used programming language

shell

77523 projects

Projects that are alternatives of or similar to voice-conversion

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+7919.23%)

Mutual labels: dnn, speech-synthesis

QTextRecognizer

A gui for tesseractOCR with some preprocessing image options (OpenCV) for improve character recognition

Stars: ✭ 27 (+3.85%)

Mutual labels: dnn, lstm

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+7965.38%)

Mutual labels: dnn, lstm

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+17334.62%)

Mutual labels: speech-synthesis, voice-conversion

ppg-vc

PPG-Based Voice Conversion

Stars: ✭ 154 (+492.31%)

Mutual labels: speech-synthesis, voice-conversion

Ailearning

AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP

Stars: ✭ 32,316 (+124192.31%)

Mutual labels: dnn, lstm

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (+76.92%)

Mutual labels: speech-synthesis, voice-conversion

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+2292.31%)

Mutual labels: dnn, lstm

dnn-lstm-word-segment

Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network

Stars: ✭ 24 (-7.69%)

Mutual labels: dnn, lstm

LearningMetersPoems

Official repo of the article: Yousef, W. A., Ibrahime, O. M., Madbouly, T. M., & Mahmoud, M. A. (2019), "Learning meters of arabic and english poems with recurrent neural networks: a step forward for language understanding and synthesis", arXiv preprint arXiv:1905.05700

Stars: ✭ 18 (-30.77%)

Mutual labels: dnn, lstm

SingleVC

Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.

Stars: ✭ 25 (-3.85%)

Mutual labels: speech-synthesis, voice-conversion

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+507.69%)

Mutual labels: dnn, speech-synthesis

MachineLearning

Implementations of machine learning algorithm by Python 3

Stars: ✭ 16 (-38.46%)

Mutual labels: lstm, gmm

myDL

Deep Learning

Stars: ✭ 18 (-30.77%)

Mutual labels: dnn, lstm

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Stars: ✭ 217 (+734.62%)

Mutual labels: speech-synthesis, voice-conversion

SERank

An efficient and effective learning to rank algorithm by mining information across ranking candidates. This repository contains the tensorflow implementation of SERank model. The code is developed based on TF-Ranking.

Stars: ✭ 42 (+61.54%)

Mutual labels: dnn

Chinese-Word-Segmentation-in-NLP

State of the art Chinese Word Segmentation with Bi-LSTMs

Stars: ✭ 23 (-11.54%)

Mutual labels: lstm

tiny-rnn

Lightweight C++11 library for building deep recurrent neural networks

Stars: ✭ 41 (+57.69%)

Mutual labels: lstm

vietnamese word seperate

Seperate vietnamese using lstm

Stars: ✭ 13 (-50%)

Mutual labels: lstm

Stock-Prediction

stock predict by cnn and lstm

Stars: ✭ 25 (-3.85%)

Mutual labels: lstm

View All Similar Projects ➔

pytorch_voice_conversion

use pytorch to implement voice conversion

ToDo List

GMM Baseline
LSTM Result
Dual Result
Gan Result

First

check the data root in config.py and replace it by yours

GMM Baseline

the gmm baseline is copied from the nnmnkwii gitpage

LSTM

Prepare Data

replace the in_dir and out_dir in run_pre.sh and then run it.

the in_dir contains wavs like xx/arctic/cmu_us_bdl_arctic/wav/xx.wav

Train

replace the data_root in run_rnn.sh and choose the source(target) speaker, then run it.

finally the checkpoint will be saved in the dir checkpoints

Test

replace the ssp tsp checkpoint_path in test.py, the result wav will be under the dir wavs

Dual

the train and test process is the same of LSTM except add the args dual

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

azraelkuan / voice-conversion

Programming Languages

Labels

Projects that are alternatives of or similar to voice-conversion

pytorch_voice_conversion

ToDo List

First

GMM Baseline

LSTM

Prepare Data

Train

Test

Dual