JasonSWFu / JD-NMF

Licence: other

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Programming Languages

matlab

3953 projects

Projects that are alternatives of or similar to JD-NMF

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Stars: ✭ 278 (+1290%)

Mutual labels: speech, voice-conversion

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Stars: ✭ 45 (+125%)

Mutual labels: speech, voice-conversion

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-10%)

Mutual labels: speech, voice-conversion

Shifter

Pitch shifter using WSOLA and resampling implemented by Python3

Stars: ✭ 22 (+10%)

Mutual labels: speech, voice-conversion

voice-based-email-for-blind

Emailing System for visually impaired persons

Stars: ✭ 35 (+75%)

Mutual labels: speech

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (+130%)

Mutual labels: voice-conversion

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+705%)

Mutual labels: speech

amazon-reviews

Sentiment Analysis & Topic Modeling with Amazon Reviews

Stars: ✭ 26 (+30%)

Mutual labels: nmf

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+440%)

Mutual labels: speech

Quick-Data-Science-Experiments-2017

Quick-Data-Science-Experiments

Stars: ✭ 19 (-5%)

Mutual labels: nmf

MajorDomo-Scenarios

Сценарии для системы домашней автоматизации Majordomo

Stars: ✭ 12 (-40%)

Mutual labels: speech

M-NMF

An implementation of "Community Preserving Network Embedding" (AAAI 2017)

Stars: ✭ 119 (+495%)

Mutual labels: nmf

Voice-Conversion

No description or website provided.

Stars: ✭ 30 (+50%)

Mutual labels: voice-conversion

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (+65%)

Mutual labels: speech

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (+170%)

Mutual labels: speech

SingleVC

Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.

Stars: ✭ 25 (+25%)

Mutual labels: voice-conversion

data-at-hand-mobile

Mobile application for exploring fitness data using both speech and touch interaction.

Stars: ✭ 50 (+150%)

Mutual labels: speech

aframe-speech-controls-component

alternative form of inputs for in-VR interaction with the content of a scene

Stars: ✭ 13 (-35%)

Mutual labels: speech

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stars: ✭ 60 (+200%)

Mutual labels: speech

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+310%)

Mutual labels: speech

View All Similar Projects ➔

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery (TBME 2016)

IEEE Transactions on Biomedical Engineering, 2016

Introduction

The Joint Dictionary Learning-based Non-Negative Matrix Factorization (JD-NMF) is used for training joint dictionary (source & target) for voice conversion. But this method can also be used in other applications where the two dictionaries have to be aligned. The basic idea is that if two signals are first aligned by some methods (e.g., DTW in speech processing), to reconstruct the coupled training data with shared activation matrix, the learned dictionaries are automatcally forced to couple with each other to minimize the distance (e.g., KL divergence).

For more details and evaluation results, please check out our paper.

Usuage

Gitsource.list is the list of source speech files used for training JD-NMF. Gittarget.list is the list of target speech files used for training JD-NMF. Gitsource_Test.list is the list of source speech files used for testing (conversion).

JDNMF.m: Convert the source speech files listed in Gitsource_Test.list (with spectrogram features) to the Converted_speech folder.

JDNMF_STRAIGHT.m: Convert the source speech files listed in Gitsource_Test.list (with STRAIGHT features) to the Converted_speech_STRAIGHT folder. This may perform better, but you have to ask the STRAIGHT code from here.

Citation

If you find the code and datasets useful in your research, please cite:

@article{fu2016joint,
  title={Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery},
  author={Fu, Szu-Wei and Li, Pei-Chun and Lai, Ying-Hui and Yang, Cheng-Chien and Hsieh, Li-Chun and Tsao, Yu},
  journal={IEEE Transactions on Biomedical Engineering},
  year={2016},
  publisher={IEEE}
}

Contact

e-mail: [email protected] or [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

JasonSWFu / JD-NMF

Programming Languages

Labels

Projects that are alternatives of or similar to JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery (TBME 2016)

Introduction

Usuage

Citation

Contact