fgnt / Pb_bss

License: MIT
Collection of EM algorithms for blind source separation of audio signals

Programming Languages

python

Projects that are alternatives of or similar to Pb bss

Audino
Open source audio annotation tool for humans™
Stars: ✭ 740 (+482.68%)
Mutual labels:  speech-processing
Dnc
Discriminative Neural Clustering for Speaker Diarisation
Stars: ✭ 60 (-52.76%)
Mutual labels:  speech-processing
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (-9.45%)
Mutual labels:  speech-processing
Rte Speech Generator
Natural Language Processing to generate new speeches for the President of Turkey.
Stars: ✭ 22 (-82.68%)
Mutual labels:  speech-processing
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-62.99%)
Mutual labels:  speech-processing
Sptk
A modified version of Speech Signal Processing Toolkit (SPTK)
Stars: ✭ 71 (-44.09%)
Mutual labels:  speech-processing
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+306.3%)
Mutual labels:  speech-processing
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+1202.36%)
Mutual labels:  speech-processing
Fullsubnet
PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Stars: ✭ 51 (-59.84%)
Mutual labels:  speech-processing
Wave U Net For Speech Enhancement
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
Stars: ✭ 106 (-16.54%)
Mutual labels:  speech-processing
Pyannote Audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Stars: ✭ 978 (+670.08%)
Mutual labels:  speech-processing
Formant Analyzer
iOS application for finding formants in spoken sounds
Stars: ✭ 43 (-66.14%)
Mutual labels:  speech-processing
Vokaturiandroid
Emotion recognition by speech in android.
Stars: ✭ 79 (-37.8%)
Mutual labels:  speech-processing
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+501.57%)
Mutual labels:  speech-processing
Tf Kaldi Speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Stars: ✭ 117 (-7.87%)
Mutual labels:  speech-processing
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+429.92%)
Mutual labels:  speech-processing
Gcommandspytorch
ConvNets for Audio Recognition using Google Commands Dataset
Stars: ✭ 65 (-48.82%)
Mutual labels:  speech-processing
A Convolutional Recurrent Neural Network For Real Time Speech Enhancement
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
Stars: ✭ 123 (-3.15%)
Mutual labels:  speech-processing
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-7.09%)
Mutual labels:  speech-processing
Pytorch Kaldi Neural Speaker Embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Stars: ✭ 99 (-22.05%)
Mutual labels:  speech-processing

Blind Source Separation (BSS) algorithms

This repository covers EM algorithms to separate speech sources in multi-channel recordings.

In particular, the repository contains methods to integrate Deep Clustering (a neural network-based source separation algorithm) with a probabilistic spatial mixture model, as proposed in the paper "Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings", presented at Interspeech 2017 in Stockholm.

@InProceedings{Drude2017DeepClusteringIntegration,
  Title                    = {Tight integration of spatial and spectral features for {BSS} with Deep Clustering embeddings},
  Author                   = {Drude, Lukas and Haeb-Umbach, Reinhold},
  Booktitle                = {INTERSPEECH 2017, Stockholm, Sweden},
  Year                     = {2017},
  Month                    = {Aug}
}
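
For readers new to EM-based mixture models, the following toy sketch may help to show the alternation between the E-step (class posteriors) and the M-step (parameter updates). It is only an illustration under simplifying assumptions: it fits a real-valued, one-dimensional Gaussian mixture with plain NumPy, whereas the models in this repository work on multi-channel recordings. The function name `em_gmm_1d` and its parameters are hypothetical and are not part of the pb_bss API.

```python
import numpy as np


def em_gmm_1d(x, num_classes=2, iterations=50):
    """Toy EM for a 1-D Gaussian mixture (illustration only, not pb_bss code).

    Returns (weights, means, variances, posteriors), where posteriors has
    shape (num_classes, N) and each column sums to one.
    """
    x = np.asarray(x, dtype=float)

    # Deterministic initialisation: means at evenly spaced quantiles,
    # a shared variance and uniform mixture weights.
    means = np.quantile(x, (np.arange(num_classes) + 0.5) / num_classes)
    variances = np.full(num_classes, x.var() + 1e-6)
    weights = np.full(num_classes, 1.0 / num_classes)

    for _ in range(iterations):
        # E-step: log(weight * Gaussian density) for every class and sample.
        log_p = (
            np.log(weights)[:, None]
            - 0.5 * np.log(2 * np.pi * variances)[:, None]
            - 0.5 * (x[None, :] - means[:, None]) ** 2 / variances[:, None]
        )
        # Subtract the per-sample maximum for numerical stability,
        # then normalise over the class axis to obtain posteriors.
        log_p -= log_p.max(axis=0, keepdims=True)
        posteriors = np.exp(log_p)
        posteriors /= posteriors.sum(axis=0, keepdims=True)

        # M-step: re-estimate the parameters from the soft assignments.
        counts = posteriors.sum(axis=1) + 1e-10  # guard against empty classes
        weights = counts / counts.sum()
        means = (posteriors @ x) / counts
        squared_dev = (x[None, :] - means[:, None]) ** 2
        variances = (posteriors * squared_dev).sum(axis=1) / counts + 1e-6

    return weights, means, variances, posteriors


# Synthetic example: two well-separated clusters of 500 samples each.
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(-5.0, 1.0, 500), rng.normal(5.0, 1.0, 500)])
weights, means, variances, posteriors = em_gmm_1d(x, num_classes=2)
print("estimated means:", means)
```

In a source separation setting, the posteriors would play the role of soft masks that assign each observation to a source; here they simply assign each sample to one of the two clusters.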

Installation

Install it directly from source:

git clone https://github.com/fgnt/pb_bss.git
cd pb_bss
pip install --editable .

We expect that numpy, scipy and cython are installed (e.g. conda install numpy scipy cython or pip install numpy scipy cython).
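
Before running the installation, it can be convenient to check whether these prerequisites are already present in the active environment. The snippet below is a minimal sketch that uses only the Python standard library; the helper name `missing_packages` is hypothetical, and the check assumes that Cython is importable under the name `Cython`.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names from `names` that cannot be imported."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Import names of the prerequisites mentioned above.
missing = missing_packages(["numpy", "scipy", "Cython"])
if missing:
    print("Please install first:", ", ".join(missing))
else:
    print("All prerequisites found.")
```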

The default option is to install only the necessary dependencies. To run the tests or execute the notebooks, use one of the following commands for the installation:

pip install --editable .[all]  # Without a whitespace between `.` and `[all]`
pip install git+https://github.com/fgnt/pb_bss.git#egg=pb_bss[all]