A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Stars: ✭ 257 (+928%)

Mutual labels: speech-processing

SpeechTransProgress

Tracking the progress in end-to-end speech translation

Stars: ✭ 139 (+456%)

Mutual labels: speech-processing

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (+252%)

Mutual labels: speech-processing

vak

a neural network toolbox for animal vocalizations and bioacoustics

Stars: ✭ 21 (-16%)

Mutual labels: speech-processing

speechportal

(1st place at HopHacks) A dynamic WebVR memory palace for speech training, utilizing natural language processing and the Google Street View API

Stars: ✭ 14 (-44%)

Mutual labels: speech-processing

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (+12%)

Mutual labels: speech-processing

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

Stars: ✭ 3,125 (+12400%)

Mutual labels: speech-processing

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+532%)

Mutual labels: speech-processing

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+796%)

Mutual labels: speech-processing

SpeechEnhancement

Combining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks

Stars: ✭ 49 (+96%)

Mutual labels: speech-processing

pyssp

Python speech signal processing library

Stars: ✭ 18 (-28%)

Mutual labels: speech-processing

BookLibrary

Book Library of P&W Studio

Stars: ✭ 13 (-48%)

Mutual labels: speech-processing

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (+8%)

Mutual labels: speech-processing

CNN-VAD

A Convolutional Neural Network based Voice Activity Detector for Smartphones

Stars: ✭ 60 (+140%)

Mutual labels: speech-processing

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+720%)

Mutual labels: speech-processing

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+3264%)

Mutual labels: speech-processing

Huawei-Challenge-Speaker-Identification

Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.

Stars: ✭ 34 (+36%)

Mutual labels: speech-processing

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Stars: ✭ 71 (+184%)

Mutual labels: speech-processing

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-28%)

Mutual labels: speech-processing

Shifter

Pitch shifter using WSOLA and resampling, implemented in Python 3

Stars: ✭ 22 (-12%)

Mutual labels: speech-processing

DiViMe

ACLEW Diarization Virtual Machine

Stars: ✭ 28 (+12%)

Mutual labels: speech-processing

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

Stars: ✭ 150 (+500%)

Mutual labels: speech-processing

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+112%)

Mutual labels: speech-processing

Gan

Resources and Implementations of Generative Adversarial Nets: GAN, DCGAN, WGAN, CGAN, InfoGAN

Stars: ✭ 2,127 (+8408%)

Mutual labels: wasserstein-gan

TadGAN

Code for the paper "TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks"

Stars: ✭ 67 (+168%)

Mutual labels: wasserstein-gan

skip-thought-gan

Generating Text through Adversarial Training (GAN) using Skip-Thought Vectors

Stars: ✭ 44 (+76%)

Mutual labels: wasserstein-gan

chainer-wasserstein-gan

Chainer implementation of the Wasserstein GAN

Stars: ✭ 20 (-20%)

Mutual labels: wasserstein-gan

wgan-gp

PyTorch implementation of Wasserstein GANs with Gradient Penalty

Stars: ✭ 161 (+544%)

Mutual labels: wasserstein-gan

Improved-Wasserstein-GAN-application-on-MRI-images

Improved Wasserstein GAN (WGAN-GP) application on medical (MRI) images

Stars: ✭ 23 (-8%)

Mutual labels: wasserstein-gan

WassersteinGAN.torch

Torch implementation of Wasserstein GAN https://arxiv.org/abs/1701.07875

Stars: ✭ 48 (+92%)

Mutual labels: wasserstein-gan

progressive growing of GANs

Pure TensorFlow implementation of progressive growing of GANs

Stars: ✭ 31 (+24%)

Mutual labels: wasserstein-gan

progressive-growing-of-gans.pytorch

Unofficial PyTorch implementation of "Progressive Growing of GANs for Improved Quality, Stability, and Variation".

Stars: ✭ 51 (+104%)

Mutual labels: wasserstein-gan

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+18032%)

Mutual labels: speech-enhancement

Noise2Noise-audio denoising without clean training data

Source code for the paper "Speech Denoising without Clean Training Data: a Noise2Noise Approach", accepted at the INTERSPEECH 2021 conference. The paper tackles the heavy dependence of deep-learning-based audio denoising methods on clean speech data by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (+96%)

Mutual labels: speech-enhancement

Voice-Denoising-AN

A Conditional Generative Adversarial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.

Stars: ✭ 42 (+68%)

Mutual labels: speech-enhancement

Enhancement-Coded-Speech

No description or website provided.

Stars: ✭ 17 (-32%)

Mutual labels: speech-enhancement

EaBNet

This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP 2022.

Stars: ✭ 34 (+36%)

Mutual labels: speech-enhancement

semetrics

Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)

Stars: ✭ 39 (+56%)

Mutual labels: speech-enhancement

Voice-Separation-and-Enhancement

A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.

Stars: ✭ 60 (+140%)

Mutual labels: speech-enhancement

Phase-aware-Deep-Complex-UNet

Unofficial implementation of DC-UNet (ICLR 2019)

Stars: ✭ 48 (+92%)

Mutual labels: speech-enhancement

fdndlp

A speech dereverberation algorithm, also called WPE

Stars: ✭ 115 (+360%)

Mutual labels: speech-enhancement

speech-enhancement-WGAN

speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN