azraelkuan / voice-conversion

Licence: other
A tutorial implementation of voice conversion using PyTorch

pytorch_voice_conversion

Voice conversion implemented in PyTorch.

ToDo List

  • GMM Baseline
  • LSTM Result
  • Dual Result
  • GAN Result

First

Check the data root in config.py and replace it with your own path.

GMM Baseline

The GMM baseline is copied from the nnmnkwii GitHub page.

LSTM

Prepare Data

Replace in_dir and out_dir in run_pre.sh, then run the script.

in_dir should contain wav files laid out like xx/arctic/cmu_us_bdl_arctic/wav/xx.wav
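
Before running the preprocessing script, it can help to confirm that the wav files are where the layout above says they should be. The following is a minimal sketch, not part of this repository: the function name list_speaker_wavs is hypothetical, and it assumes in_dir is the directory that holds the per-speaker folders (the xx/arctic part of the example path).

```python
from pathlib import Path
from typing import List


def list_speaker_wavs(in_dir: str, speaker: str) -> List[Path]:
    """Return the sorted .wav paths found under <in_dir>/<speaker>/wav/.

    Hypothetical helper: assumes in_dir holds one folder per speaker,
    each with a "wav" subfolder, as in the example path in the README.
    """
    wav_dir = Path(in_dir) / speaker / "wav"
    if not wav_dir.is_dir():
        raise FileNotFoundError(f"expected a wav directory at {wav_dir}")
    # Only files ending in .wav are returned; other files are ignored.
    return sorted(wav_dir.glob("*.wav"))
```

For the example path above, the call would be list_speaker_wavs("xx/arctic", "cmu_us_bdl_arctic"), with xx standing for your own prefix. An empty list or a FileNotFoundError suggests in_dir points at the wrong level.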

Train

Replace data_root in run_rnn.sh, choose the source and target speakers, then run the script.

When training finishes, the checkpoint is saved in the checkpoints directory.
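
To find the checkpoint to pass on to testing, one option is to pick the most recently written file in the checkpoints directory. This is a minimal sketch under assumptions: the helper name latest_checkpoint is hypothetical, and because the README does not state how checkpoint files are named, the code selects by modification time rather than by file name.

```python
from pathlib import Path
from typing import Optional


def latest_checkpoint(checkpoint_dir: str = "checkpoints") -> Optional[Path]:
    """Return the most recently modified file in checkpoint_dir, or None.

    Hypothetical helper: the checkpoint naming scheme is not documented,
    so files are compared by modification time only.
    """
    directory = Path(checkpoint_dir)
    if not directory.is_dir():
        return None
    files = [p for p in directory.iterdir() if p.is_file()]
    if not files:
        return None
    return max(files, key=lambda p: p.stat().st_mtime)
```

If the directory also holds logs or other files, filter the list by the extension your checkpoints actually use before taking the maximum.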

Test

Replace ssp, tsp, and checkpoint_path in test.py, then run it. The resulting wav files are written to the wavs directory.

Dual

The training and testing process is the same as for LSTM, except that you add the dual argument.
