xuchenglin28 / speech_separation

Licence: other

Constrained Permutation Invariant Training, Speech Separation


Labels

speech-separation constrained-pit


Constrained Permutation Invariant Training, Speech Separation

Please cite:

1. Chenglin Xu, Wei Rao, Xiong Xiao, Eng Siong Chng and Haizhou Li, "Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM", in Proc. of ICASSP 2018, pp. 6-10.
2. Chenglin Xu, Wei Rao, Eng Siong Chng and Haizhou Li, "A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning", in Proc. of INTERSPEECH 2018, pp. 3479-3483.

Data Generation

If you are using wsj0-2mix to reproduce the work in the two papers listed above, please generate the wsj0-2mix data with the code from http://www.merl.com/demos/deep-clustering.

Speech Separation

Currently, the code implements only two-speaker separation. To separate more speakers, revise the output part and the mask estimation accordingly. The number of speakers must be known in advance, which limits the practical application of speech separation.
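To illustrate why the output and mask estimation depend on the number of speakers, here is a minimal NumPy sketch of an utterance-level permutation invariant training (PIT) loss for an arbitrary number of speakers. It is not taken from this repository: the function name `upit_mse_loss`, the array shapes, and the choice of magnitude masks with a mean-squared error are assumptions made for illustration, and the sketch omits the constrained objective and the grid LSTM described in the papers.

```python
import itertools

import numpy as np


def upit_mse_loss(masks, mixture_mag, sources_mag):
    """Utterance-level PIT loss (illustrative sketch, not the repository code).

    masks:       (S, T, F) estimated masks, one per output stream
    mixture_mag: (T, F) magnitude spectrogram of the mixture
    sources_mag: (S, T, F) reference magnitude spectrograms
    Returns (loss, perm), where perm[i] is the index of the reference
    assigned to output stream i.
    """
    num_spk = masks.shape[0]
    # Apply every mask to the same mixture spectrogram.
    estimates = masks * mixture_mag[np.newaxis]
    best_loss, best_perm = np.inf, None
    # Try every output-to-reference assignment; each assignment is
    # fixed for the whole utterance (S! candidates in total).
    for perm in itertools.permutations(range(num_spk)):
        loss = np.mean((estimates - sources_mag[list(perm)]) ** 2)
        if loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm


# Usage with three synthetic "speakers" and ideal ratio masks that
# are deliberately presented in a shuffled order.
rng = np.random.RandomState(0)
sources = np.abs(rng.randn(3, 50, 129))
mixture = sources.sum(axis=0)
masks = (sources / mixture)[[2, 0, 1]]
loss, perm = upit_mse_loss(masks, mixture, sources)
```

The number of output streams and the number of candidate permutations both grow with the number of speakers, which is why the output and mask estimation have to be revised together when moving beyond two speakers.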

We have also worked on target speaker extraction, which extracts only the target speaker's voice from a mixed or noisy environment. Please refer to https://github.com/xuchenglin28/speaker_extraction.

The run.sh script covers feature extraction, model training, and run-time inference. Run it after you have simulated the data.

run.sh

Environments:

Python: 2.7

TensorFlow: 1.12 (some APIs are from older versions but are compatible with 1.12)

Parts of the code are adapted from https://github.com/snsun/pit-speech-separation

Contact:

email: [email protected]

Licence

The code and models in this repository are licensed under the GNU General Public License Version 3.

Citation

If you would like to cite this work, use:

@inproceedings{xu2018single,
  title={Single channel speech separation with constrained utterance level permutation invariant training using grid lstm},
  author={Xu, Chenglin and Rao, Wei and Xiao, Xiong and Chng, Eng Siong and Li, Haizhou},
  booktitle={IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={6--10},
  year={2018}
}
@inproceedings{xu2018shifted,
  title={A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning},
  author={Xu, Chenglin and Rao, Wei and Chng, Eng Siong and Li, Haizhou},
  booktitle={Proc. of INTERSPEECH},
  pages={3479--3483},
  year={2018}
}
