auspicious3000 / deepbeam

Licence: other

Deep learning based Speech Beamforming

Programming Languages

Jupyter Notebook

11667 projects

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to deepbeam

Voice-Separation-and-Enhancement

A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.

Stars: ✭ 60 (+3.45%)

Mutual labels: multi-channel, speech-enhancement

Cskefu

🌲 春松客服，智能客服系统，开源客服系统，机器人客服，客服系统开发框架，多渠道

Stars: ✭ 1,970 (+3296.55%)

Mutual labels: multi-channel

fdndlp

A speech dereverberation algorithm, also called wpe

Stars: ✭ 115 (+98.28%)

Mutual labels: speech-enhancement

Dual Channel Beamformer and Postfilter

This project gives an example of dual microphone speech enhancement based on GSC beamformer and multiple channel postfilter.

Stars: ✭ 80 (+37.93%)

Mutual labels: beamforming

semetrics

Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)

Stars: ✭ 39 (-32.76%)

Mutual labels: speech-enhancement

ArraySim

Simulation of atenna array. Beamforming and DOA estimation.

Stars: ✭ 17 (-70.69%)

Mutual labels: beamforming

speech-enhancement-WGAN

speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN

Stars: ✭ 35 (-39.66%)

Mutual labels: speech-enhancement

ConvolutionaNeuralNetworksToEnhanceCodedSpeech

In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…

Stars: ✭ 25 (-56.9%)

Mutual labels: speech-enhancement

kotori

A flexible data historian based on InfluxDB, Grafana, MQTT and more. Free, open, simple.

Stars: ✭ 73 (+25.86%)

Mutual labels: multi-channel

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+7715.52%)

Mutual labels: speech-enhancement

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-15.52%)

Mutual labels: speech-enhancement

EaBNet

This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.

Stars: ✭ 34 (-41.38%)

Mutual labels: speech-enhancement

SimpleWavSplitter

Split multi-channel WAV files into single channel WAV files.

Stars: ✭ 15 (-74.14%)

Mutual labels: multi-channel

Phase-aware-Deep-Complex-UNet

(NOT Official) Implementation DC-UNet (ICLR 2019)

Stars: ✭ 48 (-17.24%)

Mutual labels: speech-enhancement

torchsubband

Pytorch implementation of subband decomposition

Stars: ✭ 63 (+8.62%)

Mutual labels: speech-enhancement

awesome-speech-enhancement

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

Stars: ✭ 48 (-17.24%)

Mutual labels: speech-enhancement

FPGA Ultrasound

CMU 18545 FPGA project -- Multi-channel ultrasound data acquisition and beamforming system.

Stars: ✭ 39 (-32.76%)

Mutual labels: beamforming

NBSS

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (+32.76%)

Mutual labels: multi-channel

Beamforming Examples

some Matlab examples of microphone array beamforming

Stars: ✭ 66 (+13.79%)

Mutual labels: beamforming

Voice-Denoising-AN

A Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.

Stars: ✭ 42 (-27.59%)

Mutual labels: speech-enhancement

View All Similar Projects ➔

Deepbeam

Deep learning based Speech Beamforming

Requirements

tensorflow, scipy, fftw, h5py

Train Wavenet-based enhancement model

Noisy input data filename: noisy_train.mat

Dimension: [24570, NUM_TOKENS]

Content: noisy waveforms

Clean ouput data filename: target_train.mat

Dimension: [16384, NUM_TOKENS]

Content: 256 mu-law quantized bin index of clean waveforms

The above become numpy arrays after loaded into python, you can generate your own traning data and modify the model architecture accordingly.

To train the enhancement model, place the data in the same directory as the training code, then execute the following:

python bawn_sp_multi_gpu_train_v2.py /logdir NUM_GPUS

Demo using pre-trained model

A pre-trained enhancement model using 109 speakers and 100 noises is available in assets.

"demo.ipynb" contains a complete enhancement and beamforming workflow using a short 8-channel noisy speech.

The noisy input is a [length, NUM_CHANNELS] matrix, where each column is a channel. Another input contains only the multi-channel noise itself is required in order to compute signal-to-noise-ratio.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

auspicious3000 / deepbeam

Programming Languages

Labels

Projects that are alternatives of or similar to deepbeam

Deepbeam

Requirements

Train Wavenet-based enhancement model

Demo using pre-trained model