Voice-Separation-and-EnhancementA framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.
Stars: ✭ 60 (+3.45%)
NBSSThe official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+32.76%)
ConvolutionaNeuralNetworksToEnhanceCodedSpeechIn this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
Stars: ✭ 25 (-56.9%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (+8.62%)
Cskefu🌲 春松客服,智能客服系统,开源客服系统 ,机器人客服,客服系统开发框架,多渠道
Stars: ✭ 1,970 (+3296.55%)
kotoriA flexible data historian based on InfluxDB, Grafana, MQTT and more. Free, open, simple.
Stars: ✭ 73 (+25.86%)
SimpleWavSplitterSplit multi-channel WAV files into single channel WAV files.
Stars: ✭ 15 (-74.14%)
ArraySimSimulation of atenna array. Beamforming and DOA estimation.
Stars: ✭ 17 (-70.69%)
FPGA UltrasoundCMU 18545 FPGA project -- Multi-channel ultrasound data acquisition and beamforming system.
Stars: ✭ 39 (-32.76%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+7715.52%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-15.52%)
Voice-Denoising-ANA Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-27.59%)
EaBNetThis is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.
Stars: ✭ 34 (-41.38%)
semetricsSpeech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Stars: ✭ 39 (-32.76%)
fdndlpA speech dereverberation algorithm, also called wpe
Stars: ✭ 115 (+98.28%)
awesome-speech-enhancementA curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (-17.24%)
speech-enhancement-WGANspeech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (-39.66%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (+91.38%)