All Projects → JasonSWFu → JD-NMF

JasonSWFu / JD-NMF

Licence: other
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Programming Languages

matlab
3953 projects

Projects that are alternatives of or similar to JD-NMF

VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+1290%)
Mutual labels:  speech, voice-conversion
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+125%)
Mutual labels:  speech, voice-conversion
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-10%)
Mutual labels:  speech, voice-conversion
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (+10%)
Mutual labels:  speech, voice-conversion
voice-based-email-for-blind
Emailing System for visually impaired persons
Stars: ✭ 35 (+75%)
Mutual labels:  speech
MediumVC
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (+130%)
Mutual labels:  voice-conversion
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+705%)
Mutual labels:  speech
amazon-reviews
Sentiment Analysis & Topic Modeling with Amazon Reviews
Stars: ✭ 26 (+30%)
Mutual labels:  nmf
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+440%)
Mutual labels:  speech
Quick-Data-Science-Experiments-2017
Quick-Data-Science-Experiments
Stars: ✭ 19 (-5%)
Mutual labels:  nmf
MajorDomo-Scenarios
Сценарии для системы домашней автоматизации Majordomo
Stars: ✭ 12 (-40%)
Mutual labels:  speech
M-NMF
An implementation of "Community Preserving Network Embedding" (AAAI 2017)
Stars: ✭ 119 (+495%)
Mutual labels:  nmf
Voice-Conversion
No description or website provided.
Stars: ✭ 30 (+50%)
Mutual labels:  voice-conversion
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (+65%)
Mutual labels:  speech
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (+170%)
Mutual labels:  speech
SingleVC
Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.
Stars: ✭ 25 (+25%)
Mutual labels:  voice-conversion
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (+150%)
Mutual labels:  speech
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (-35%)
Mutual labels:  speech
D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (+200%)
Mutual labels:  speech
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+310%)
Mutual labels:  speech

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery (TBME 2016)

IEEE Transactions on Biomedical Engineering, 2016

Introduction

The Joint Dictionary Learning-based Non-Negative Matrix Factorization (JD-NMF) is used for training joint dictionary (source & target) for voice conversion. But this method can also be used in other applications where the two dictionaries have to be aligned. The basic idea is that if two signals are first aligned by some methods (e.g., DTW in speech processing), to reconstruct the coupled training data with shared activation matrix, the learned dictionaries are automatcally forced to couple with each other to minimize the distance (e.g., KL divergence).

For more details and evaluation results, please check out our paper.

teaser

Usuage

Gitsource.list is the list of source speech files used for training JD-NMF. Gittarget.list is the list of target speech files used for training JD-NMF. Gitsource_Test.list is the list of source speech files used for testing (conversion).

JDNMF.m: Convert the source speech files listed in Gitsource_Test.list (with spectrogram features) to the Converted_speech folder.

JDNMF_STRAIGHT.m: Convert the source speech files listed in Gitsource_Test.list (with STRAIGHT features) to the Converted_speech_STRAIGHT folder. This may perform better, but you have to ask the STRAIGHT code from here.

Citation

If you find the code and datasets useful in your research, please cite:

@article{fu2016joint,
  title={Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery},
  author={Fu, Szu-Wei and Li, Pei-Chun and Lai, Ying-Hui and Yang, Cheng-Chien and Hsieh, Li-Chun and Tsao, Yu},
  journal={IEEE Transactions on Biomedical Engineering},
  year={2016},
  publisher={IEEE}
}

Contact

e-mail: [email protected] or [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].