auspicious3000 / deepbeam

Licence: other
Deep learning based Speech Beamforming

Programming Languages

Jupyter Notebook
Python

Projects that are alternatives of or similar to deepbeam

Voice-Separation-and-Enhancement
A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.
Stars: ✭ 60 (+3.45%)
Mutual labels:  multi-channel, speech-enhancement
Cskefu
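🌲 Chunsong Customer Service (春松客服): intelligent customer service system, open-source customer service system, chatbot customer service, customer service development framework, multi-channel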
Stars: ✭ 1,970 (+3296.55%)
Mutual labels:  multi-channel
fdndlp
A speech dereverberation algorithm, also called wpe
Stars: ✭ 115 (+98.28%)
Mutual labels:  speech-enhancement
Dual Channel Beamformer and Postfilter
This project gives an example of dual microphone speech enhancement based on GSC beamformer and multiple channel postfilter.
Stars: ✭ 80 (+37.93%)
Mutual labels:  beamforming
semetrics
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Stars: ✭ 39 (-32.76%)
Mutual labels:  speech-enhancement
ArraySim
Simulation of an antenna array. Beamforming and DOA estimation.
Stars: ✭ 17 (-70.69%)
Mutual labels:  beamforming
speech-enhancement-WGAN
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (-39.66%)
Mutual labels:  speech-enhancement
ConvolutionaNeuralNetworksToEnhanceCodedSpeech
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
Stars: ✭ 25 (-56.9%)
Mutual labels:  speech-enhancement
kotori
A flexible data historian based on InfluxDB, Grafana, MQTT and more. Free, open, simple.
Stars: ✭ 73 (+25.86%)
Mutual labels:  multi-channel
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+7715.52%)
Mutual labels:  speech-enhancement
Noise2Noise-audio denoising without clean training data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-15.52%)
Mutual labels:  speech-enhancement
EaBNet
This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.
Stars: ✭ 34 (-41.38%)
Mutual labels:  speech-enhancement
SimpleWavSplitter
Split multi-channel WAV files into single channel WAV files.
Stars: ✭ 15 (-74.14%)
Mutual labels:  multi-channel
Phase-aware-Deep-Complex-UNet
(NOT Official) Implementation DC-UNet (ICLR 2019)
Stars: ✭ 48 (-17.24%)
Mutual labels:  speech-enhancement
torchsubband
Pytorch implementation of subband decomposition
Stars: ✭ 63 (+8.62%)
Mutual labels:  speech-enhancement
awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (-17.24%)
Mutual labels:  speech-enhancement
FPGA Ultrasound
CMU 18545 FPGA project -- Multi-channel ultrasound data acquisition and beamforming system.
Stars: ✭ 39 (-32.76%)
Mutual labels:  beamforming
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+32.76%)
Mutual labels:  multi-channel
Beamforming Examples
some Matlab examples of microphone array beamforming
Stars: ✭ 66 (+13.79%)
Mutual labels:  beamforming
Voice-Denoising-AN
A Conditional Generative Adversarial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-27.59%)
Mutual labels:  speech-enhancement

Deepbeam

Deep learning based Speech Beamforming

Requirements

tensorflow, scipy, fftw, h5py

Train the WaveNet-based enhancement model

Noisy input data filename: noisy_train.mat

Dimension: [24570, NUM_TOKENS]

Content: noisy waveforms

Clean output data filename: target_train.mat

Dimension: [16384, NUM_TOKENS]

Content: bin indices of the clean waveforms after 256-level mu-law quantization

Both files become NumPy arrays once loaded into Python. You can generate your own training data and modify the model architecture accordingly.
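If you generate your own targets, the clean waveforms need to be turned into 256-level mu-law bin indices. The sketch below uses the standard mu-law companding formula; the function names, the assumption that waveforms are floats scaled to [-1, 1], and the rounding convention are illustrative choices and may differ from how the original target_train.mat was produced.

```python
import numpy as np

MU = 255  # 256 quantization levels correspond to mu = 255


def mu_law_encode(waveform, mu=MU):
    """Map a float waveform in [-1, 1] to integer bin indices in [0, mu].

    Hypothetical helper: the repository may quantize differently.
    """
    x = np.clip(np.asarray(waveform, dtype=np.float64), -1.0, 1.0)
    # Standard mu-law companding: compress the amplitude logarithmically.
    compressed = np.sign(x) * np.log1p(mu * np.abs(x)) / np.log1p(mu)
    # Shift from [-1, 1] to [0, mu] and round to the nearest bin.
    return np.floor((compressed + 1.0) / 2.0 * mu + 0.5).astype(np.int64)


def mu_law_decode(bins, mu=MU):
    """Approximate inverse of mu_law_encode: bin indices back to floats."""
    y = 2.0 * (np.asarray(bins, dtype=np.float64) / mu) - 1.0
    # expm1(|y| * log(1 + mu)) equals (1 + mu) ** |y| - 1.
    return np.sign(y) * np.expm1(np.abs(y) * np.log1p(mu)) / mu


# Example: quantize one synthetic 16384-sample "clean" token.
t = np.arange(16384) / 16000.0
clean = 0.5 * np.sin(2.0 * np.pi * 440.0 * t)
target = mu_law_encode(clean)
print(target.shape, target.min(), target.max())
```

Stacking one such column per token gives a [16384, NUM_TOKENS] array with the layout described above.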

To train the enhancement model, place the data in the same directory as the training code, then execute the following:

python bawn_sp_multi_gpu_train_v2.py /logdir NUM_GPUS
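Before launching training, it can help to confirm that the arrays match the layout described above. The following is a minimal sketch, not part of the repository: check_training_arrays is a hypothetical helper, and it assumes the two .mat files have already been loaded into NumPy arrays by whatever loader suits their format.

```python
import numpy as np

NOISY_LEN = 24570   # rows expected in noisy_train.mat
TARGET_LEN = 16384  # rows expected in target_train.mat
NUM_BINS = 256      # mu-law quantization levels


def check_training_arrays(noisy, target):
    """Validate array shapes and target range; return NUM_TOKENS.

    Hypothetical helper based only on the dimensions listed in this README.
    """
    noisy = np.asarray(noisy)
    target = np.asarray(target)
    if noisy.ndim != 2 or noisy.shape[0] != NOISY_LEN:
        raise ValueError(f"noisy must be [{NOISY_LEN}, NUM_TOKENS], got {noisy.shape}")
    if target.ndim != 2 or target.shape[0] != TARGET_LEN:
        raise ValueError(f"target must be [{TARGET_LEN}, NUM_TOKENS], got {target.shape}")
    if noisy.shape[1] != target.shape[1]:
        raise ValueError("noisy and target must contain the same number of tokens")
    if target.min() < 0 or target.max() >= NUM_BINS:
        raise ValueError(f"target bin indices must lie in [0, {NUM_BINS - 1}]")
    return noisy.shape[1]


# Example with synthetic data standing in for the real files.
rng = np.random.default_rng(0)
demo_noisy = rng.standard_normal((NOISY_LEN, 4))
demo_target = rng.integers(0, NUM_BINS, size=(TARGET_LEN, 4))
print("NUM_TOKENS =", check_training_arrays(demo_noisy, demo_target))
```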

Demo using pre-trained model

A pre-trained enhancement model, trained on 109 speakers and 100 noises, is available in assets.

"demo.ipynb" contains a complete enhancement and beamforming workflow applied to a short 8-channel noisy speech recording.

The noisy input is a [length, NUM_CHANNELS] matrix, where each column is a channel. A second input containing only the multi-channel noise is required in order to compute the signal-to-noise ratio.
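To illustrate why the noise-only input is needed, here is a minimal sketch of a per-channel SNR computation. It assumes the noise is additive, so the speech component can be recovered as the noisy signal minus the noise; the function name is hypothetical and the notebook may compute SNR differently.

```python
import numpy as np


def snr_db_per_channel(noisy, noise):
    """Return the SNR in dB for each channel (column).

    Assumes noisy = speech + noise, with both inputs shaped
    [length, NUM_CHANNELS]. Hypothetical helper, not the notebook's code.
    """
    noisy = np.asarray(noisy, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    if noisy.shape != noise.shape:
        raise ValueError("noisy and noise must have the same shape")
    speech = noisy - noise  # additive-noise assumption
    speech_power = np.sum(speech ** 2, axis=0)
    noise_power = np.sum(noise ** 2, axis=0)
    return 10.0 * np.log10(speech_power / noise_power)


# Example: synthetic 8-channel data with noise at one tenth of the
# speech amplitude, i.e. a power ratio of 100 on every channel.
rng = np.random.default_rng(0)
demo_speech = rng.standard_normal((16000, 8))
demo_noise = 0.1 * demo_speech[::-1]  # same per-channel power, scaled down
demo_snr = snr_db_per_channel(demo_speech + demo_noise, demo_noise)
print(demo_snr)
```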
