Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → fgnt → Pb_bss

fgnt / Pb_bss

Licence: mit

Collection of EM algorithms for blind source separation of audio signals

Programming Languages

139335 projects - #7 most used programming language

Labels

speech-processing

Projects that are alternatives of or similar to Pb bss

Open source audio annotation tool for humans™

Stars: ✭ 740 (+482.68%)

Mutual labels: speech-processing

Discriminative Neural Clustering for Speaker Diarisation

Stars: ✭ 60 (-52.76%)

Mutual labels: speech-processing

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-9.45%)

Mutual labels: speech-processing

Rte Speech Generator

Natural Language Processing to generate new speeches for the President of Turkey.

Stars: ✭ 22 (-82.68%)

Mutual labels: speech-processing

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-62.99%)

Mutual labels: speech-processing

A modified version of Speech Signal Processing Toolkit (SPTK)

Stars: ✭ 71 (-44.09%)

Mutual labels: speech-processing

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+306.3%)

Mutual labels: speech-processing

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+1202.36%)

Mutual labels: speech-processing

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Stars: ✭ 51 (-59.84%)

Mutual labels: speech-processing

Wave U Net For Speech Enhancement

Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.

Stars: ✭ 106 (-16.54%)

Mutual labels: speech-processing

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Stars: ✭ 978 (+670.08%)

Mutual labels: speech-processing

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-66.14%)

Mutual labels: speech-processing

Vokaturiandroid

Emotion recognition by speech in android.

Stars: ✭ 79 (-37.8%)

Mutual labels: speech-processing

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+501.57%)

Mutual labels: speech-processing

Tf Kaldi Speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Stars: ✭ 117 (-7.87%)

Mutual labels: speech-processing

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (+429.92%)

Mutual labels: speech-processing

Gcommandspytorch

ConvNets for Audio Recognition using Google Commands Dataset

Stars: ✭ 65 (-48.82%)

Mutual labels: speech-processing

A Convolutional Recurrent Neural Network For Real Time Speech Enhancement

A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch

Stars: ✭ 123 (-3.15%)

Mutual labels: speech-processing

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (-7.09%)

Mutual labels: speech-processing

Pytorch Kaldi Neural Speaker Embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Stars: ✭ 99 (-22.05%)

Mutual labels: speech-processing

View All Similar Projects ➔

Blind Source Separation (BSS) algorithms

This repository covers EM algorithms to separate speech sources in multi-channel recordings.

In particular, the repository contains methods to integrate Deep Clustering (a neural network-based source separation algorithm) with a probabilistic spatial mixture model as proposed in the Interspeech paper "Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings" presented at Interspeech 2017 in Stockholm.

@InProceedings{Drude2017DeepClusteringIntegration,
  Title                    = {Tight integration of spatial and spectral features for {BSS} with Deep Clustering embeddings},
  Author                   = {Drude, Lukas and and Haeb-Umbach, Reinhold},
  Booktitle                = {INTERSPEECH 2017, Stockholm, Sweden},
  Year                     = {2017},
  Month                    = {Aug}
}

Installation

Install it directly from source

git clone https://github.com/fgnt/pb_bss.git
cd pb_bss
pip install --editable .

We expect that numpy, scipy and cython are installed (e.g. conda install numpy scipy cython or pip install numpy scipy cython).

The default option is to install only the necessary dependencies. When you want to run the tests or execute the notebooks, use the one of the following commands for the installation:

pip install --editable .[all]  # Without a whitespace between `.` and `[all]`
pip install git+https://github.com/fgnt/pb_bss.git#egg=pb_bss[all]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 127

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (4) 🔗