awesome-speech-enhancementA curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (-20%)
deepbeamDeep learning based Speech Beamforming
Stars: ✭ 58 (-3.33%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+7455%)
SimpleWavSplitterSplit multi-channel WAV files into single channel WAV files.
Stars: ✭ 15 (-75%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1301.67%)
fdndlpA speech dereverberation algorithm, also called wpe
Stars: ✭ 115 (+91.67%)
speech-enhancement-WGANspeech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (-41.67%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (+85%)
NBSSThe official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+28.33%)
ConvolutionaNeuralNetworksToEnhanceCodedSpeechIn this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
Stars: ✭ 25 (-58.33%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (+5%)
Cskefu🌲 春松客服,智能客服系统,开源客服系统 ,机器人客服,客服系统开发框架,多渠道
Stars: ✭ 1,970 (+3183.33%)
kotoriA flexible data historian based on InfluxDB, Grafana, MQTT and more. Free, open, simple.
Stars: ✭ 73 (+21.67%)
voice-filterA unofficial Pytorch implementation of Google's VoiceFilter
Stars: ✭ 75 (+25%)
speech separationConstrained Permutation Invariant Training, Speech Separation
Stars: ✭ 27 (-55%)
UtterancePIT-Speech-SeparationAccording to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
Stars: ✭ 55 (-8.33%)
TasNetA PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
Stars: ✭ 81 (+35%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+273.33%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-18.33%)
Voice-Denoising-ANA Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-30%)
EaBNetThis is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.
Stars: ✭ 34 (-43.33%)
semetricsSpeech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Stars: ✭ 39 (-35%)