Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-43.02%)

Mutual labels: speech

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-72.09%)

Mutual labels: speech

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+1317.44%)

Mutual labels: speech

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-77.91%)

Mutual labels: speech

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+374.42%)

Mutual labels: speech

HTK

The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.

Stars: ✭ 23 (-73.26%)

Mutual labels: speech

Wsay

Windows "say"

Stars: ✭ 36 (-58.14%)

Mutual labels: speech

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (-41.86%)

Mutual labels: speech

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+346.51%)

Mutual labels: speech

nlp-class

A Natural Language Processing course taught by Professor Ghassemi

Stars: ✭ 95 (+10.47%)

Mutual labels: speech

Sound Source Localization Algorithm doa estimation

关于语音信号声源定位DOA估计所用的一些传统算法

Stars: ✭ 58 (-32.56%)

Mutual labels: speech

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-22.09%)

Mutual labels: speech

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+320.93%)

Mutual labels: speech

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+160.47%)

Mutual labels: speech

Pytorch Uniwavenet

Stars: ✭ 30 (-65.12%)

Mutual labels: wavenet

gtranscribe

Software for interview transcription

Stars: ✭ 12 (-86.05%)

Mutual labels: speech

Time Series Prediction

A collection of time series prediction methods: rnn, seq2seq, cnn, wavenet, transformer, unet, n-beats, gan, kalman-filter

Stars: ✭ 351 (+308.14%)

Mutual labels: wavenet

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-83.72%)

Mutual labels: speech

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+1362.79%)

Mutual labels: speech

DeepSegmentor

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Stars: ✭ 17 (-80.23%)

Mutual labels: speech

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+6210.47%)

Mutual labels: speech

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-56.98%)

Mutual labels: speech

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+779.07%)

Mutual labels: speech

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-76.74%)

Mutual labels: speech

Android Speech

Android speech recognition and text to speech made easy

Stars: ✭ 310 (+260.47%)

Mutual labels: speech

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stars: ✭ 60 (-30.23%)

Mutual labels: speech

Wavenet

WaveNet implementation with chainer

Stars: ✭ 53 (-38.37%)

Mutual labels: wavenet

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (-37.21%)

Mutual labels: speech

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (+246.51%)

Mutual labels: speech

chainer-ClariNet

A Chainer implementation of ClariNet.

Stars: ✭ 45 (-47.67%)

Mutual labels: wavenet

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+693.02%)

Mutual labels: wavenet

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+25.58%)

Mutual labels: speech

Sednn

deep learning based speech enhancement using keras or pytorch, make it easy to use

Stars: ✭ 288 (+234.88%)

Mutual labels: speech

MajorDomo-Scenarios

Сценарии для системы домашней автоматизации Majordomo

Stars: ✭ 12 (-86.05%)

Mutual labels: speech

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-19.77%)

Mutual labels: speech

aframe-speech-controls-component

alternative form of inputs for in-VR interaction with the content of a scene

Stars: ✭ 13 (-84.88%)

Mutual labels: speech

Pytorchwavenetvocoder

WaveNet-Vocoder implementation with pytorch.

Stars: ✭ 269 (+212.79%)

Mutual labels: wavenet

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-79.07%)

Mutual labels: speech

Segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+668.6%)

Mutual labels: speech

Music-Style-Transfer

Source code for "Transferring the Style of Homophonic Music Using Recurrent Neural Networks and Autoregressive Model"

Stars: ✭ 16 (-81.4%)

Mutual labels: wavenet

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (+201.16%)

Mutual labels: speech

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-61.63%)

Mutual labels: speech

Tacotron2

pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf

Stars: ✭ 46 (-46.51%)

Mutual labels: wavenet

minutes

🔭 Speaker diarization via transfer learning