PaseProblem Agnostic Speech Encoder
Stars: ✭ 348 (+1292%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (+1172%)
NnmnkwiiLibrary to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (+1132%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+1088%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (+928%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (+252%)
vaka neural network toolbox for animal vocalizations and bioacoustics
Stars: ✭ 21 (-16%)
speechportal(1st place at HopHacks) A dynamic webVR memory palace for speech training, utilizing natural language processing and Google Streetview API
Stars: ✭ 14 (-44%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (+12%)
awesome-multimodal-mlReading list for research topics in multimodal machine learning
Stars: ✭ 3,125 (+12400%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+532%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+796%)
SpeechEnhancementCombining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks
Stars: ✭ 49 (+96%)
pyssppython speech signal processing library
Stars: ✭ 18 (-28%)
BookLibraryBook Library of P&W Studio
Stars: ✭ 13 (-48%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (+8%)
CNN-VADA Convolutional Neural Network based Voice Activity Detector for Smartphones
Stars: ✭ 60 (+140%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+720%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+3264%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+184%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-28%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-12%)
DiViMeACLEW Diarization Virtual Machine
Stars: ✭ 28 (+12%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+500%)
GanResources and Implementations of Generative Adversarial Nets: GAN, DCGAN, WGAN, CGAN, InfoGAN
Stars: ✭ 2,127 (+8408%)
TadGANCode for the paper "TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks"
Stars: ✭ 67 (+168%)
skip-thought-ganGenerating Text through Adversarial Training(GAN) using Skip-Thought Vectors
Stars: ✭ 44 (+76%)
wgan-gpPytorch implementation of Wasserstein GANs with Gradient Penalty
Stars: ✭ 161 (+544%)
WassersteinGAN.torchTorch implementation of Wasserstein GAN https://arxiv.org/abs/1701.07875
Stars: ✭ 48 (+92%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+18032%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (+96%)
Voice-Denoising-ANA Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (+68%)
EaBNetThis is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.
Stars: ✭ 34 (+36%)
semetricsSpeech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Stars: ✭ 39 (+56%)
Voice-Separation-and-EnhancementA framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.
Stars: ✭ 60 (+140%)
fdndlpA speech dereverberation algorithm, also called wpe
Stars: ✭ 115 (+360%)
speech-enhancement-WGANspeech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (+40%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (+344%)
deepbeamDeep learning based Speech Beamforming
Stars: ✭ 58 (+132%)