All Projects → albertaparicio → Tfg Voice Conversion

albertaparicio / Tfg Voice Conversion

Licence: gpl-3.0
Deep Learning-based Voice Conversion system

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Tfg Voice Conversion

LIUM
Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-75.65%)
Mutual labels:  speech, speech-processing
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+158.26%)
Mutual labels:  speech, speech-processing
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-23.48%)
Mutual labels:  speech, speech-processing
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+11960.87%)
Mutual labels:  numpy, speech
Yolo Tf2
yolo(all versions) implementation in keras and tensorflow 2.4
Stars: ✭ 695 (+504.35%)
Mutual labels:  deep-neural-networks, numpy
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+94.78%)
Mutual labels:  speech, speech-processing
Sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (+150.43%)
Mutual labels:  deep-neural-networks, speech
Andrew Ng Notes
This is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (+56.52%)
Mutual labels:  deep-neural-networks, numpy
Segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+474.78%)
Mutual labels:  deep-neural-networks, speech
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+450.43%)
Mutual labels:  deep-neural-networks, speech
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-80.87%)
Mutual labels:  speech, speech-processing
Gcommandspytorch
ConvNets for Audio Recognition using Google Commands Dataset
Stars: ✭ 65 (-43.48%)
Mutual labels:  deep-neural-networks, speech-processing
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+156.52%)
Mutual labels:  speech, speech-processing
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+37.39%)
Mutual labels:  speech, speech-processing
Emotion Classification From Audio Files
Understanding emotions from audio files using neural networks and multiple datasets.
Stars: ✭ 189 (+64.35%)
Mutual labels:  deep-neural-networks, speech
Awesome Speech Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (+123.48%)
Mutual labels:  deep-neural-networks, speech-processing
Tutorial separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Stars: ✭ 151 (+31.3%)
Mutual labels:  deep-neural-networks, speech-processing
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1723.48%)
Mutual labels:  deep-neural-networks, speech
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+348.7%)
Mutual labels:  speech, speech-processing
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+557.39%)
Mutual labels:  speech, numpy

Voice Conversion using Deep Learning

Download PDF

This project will be carried out at the Signal Theory and Communications Department (TSC) of the Polytechnic University of Catalonia (UPC). Specifically, it will be developed at the Speech Processing investigation group (VEU) as a contribution to its research project DeepVoice: Deep Learning Technologies for Speech and Audio Processing.

The purpose of this project is to develop a deep learning-based system able to convert a voice signal from a speaker into another that sounds as if it were uttered by a different one. The result signal must keep the linguistic and prosodic elements of the original signal unmodified.

Deep Learning techniques have shown remarkable results in other areas of speech processing, such as voice recognition and voice synthesis. These techniques are often combined with other, more classic, techniques of voice processing and modeling, such as feature extractions from a vocoder. These techniques are used for pre and post-processing purposes.

Before this system can be developed, there are some previous tasks that must be accomplished. Mainly, these tasks comprise acquiring a thorough knowledge of Neural Networks and how to apply them in Deep Learning, as well as getting familiarized with the tools that will be used in the project. These tools include several Python libraries, such as NumPy, TensorFlow, Theano and Keras.

Regarding the programming tools and libraries, some preparation work has already been done beforehand during summer 2016, working with Python, NumPy and TensorFlow.

The project’s main goals are:

  1. Develop a Deep Learning-based system able to convert recorded speech from a speaker into that of another speaker
    1. Profound understanding of Deep Learning architectures
    2. Solid knowledge in the use of the Keras Deep Learning Python library
    3. Propose an innovative architecture following the state of the art in Deep Learning for Voice Conversion
    4. Evaluate the developed system’s conversion so it performs better than those submitted to Interspeech 2016 Voice Conversion Challenge
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].