All Projects → KWTsou1220 → mann-for-speech-separation

KWTsou1220 / mann-for-speech-separation

Licence: other
Neural Turing machine for source separation in Tensorflow

Programming Languages

python
139335 projects - #7 most used programming language
matlab
3953 projects
Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to mann-for-speech-separation

Calculate-SNR-SDR
Script to calculate SNR and SDR using python
Stars: ✭ 76 (+322.22%)
Mutual labels:  speech-separation
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+25083.33%)
Mutual labels:  speech-separation
voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
Stars: ✭ 75 (+316.67%)
Mutual labels:  speech-separation
speech separation
Constrained Permutation Invariant Training, Speech Separation
Stars: ✭ 27 (+50%)
Mutual labels:  speech-separation
UtterancePIT-Speech-Separation
According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
Stars: ✭ 55 (+205.56%)
Mutual labels:  speech-separation
TasNet
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
Stars: ✭ 81 (+350%)
Mutual labels:  speech-separation
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+1144.44%)
Mutual labels:  speech-separation
Voice-Separation-and-Enhancement
A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.
Stars: ✭ 60 (+233.33%)
Mutual labels:  speech-separation
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+4572.22%)
Mutual labels:  speech-separation
awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (+166.67%)
Mutual labels:  speech-separation
Deep-Clustering-for-Speech-Separation
Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation
Stars: ✭ 99 (+450%)
Mutual labels:  speech-separation

Memory Augmented Neural Network for Source Separation

In this project, we implement neural Turing machine (NTM) for sequential signals of speech and noise in presence of different speakers and noise types. NTM is a memory-augmented neural network which is equipped with external memory to learn long sequential data. The information is stored with attention mechanism and read-writing scheme. For more details about NTM, you can refer to Neural Turing Machine. The system architecture and experimental settings are shown in Memory Augmented Neural Network for Source Separation.

Setting

  • Hardware:
    • CPU: Intel Core i7-4930K @3.40 GHz
    • RAM: 64 GB DDR3-1600
    • GPU: NVIDIA Tesla K20c 6 GB RAM
  • Tensorflow 0.12
  • Dataset

Result

  • An example of demixed signal
Mixed signal
Clean signal
Demixed signal
  • STOI measure on other noises
Seen speakers Unseen speakers
  • STOI measure on bus noises
Seen speakers Unseen speakers
  • STOI measure on caf noises
Seen speakers Unseen speakers
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].