All Projects → Wavenet Enhancement → Similar Projects or Alternatives

212 Open source projects that are alternatives of or similar to Wavenet Enhancement

ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+83.72%)
Mutual labels:  speech, wavenet
Vq Vae Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (+117.44%)
Mutual labels:  speech, wavenet
Wavenet vocoder
WaveNet vocoder
Stars: ✭ 1,926 (+2139.53%)
Mutual labels:  speech, wavenet
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+500%)
Mutual labels:  speech, wavenet
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (+2.33%)
Mutual labels:  speech, wavenet
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+469.77%)
Mutual labels:  speech
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-59.3%)
Mutual labels:  speech
Specaugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (+374.42%)
Mutual labels:  speech
Tts
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (+254.65%)
Mutual labels:  speech
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-33.72%)
Mutual labels:  speech
Wavenet Stt
An end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-79.07%)
Mutual labels:  wavenet
Inaspeechsegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Stars: ✭ 352 (+309.3%)
Mutual labels:  speech
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+518.6%)
Mutual labels:  speech
Dialectid e2e
End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-53.49%)
Mutual labels:  speech
Flowavenet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Stars: ✭ 471 (+447.67%)
Mutual labels:  wavenet
Tf Wavenet vocoder
Wavenet and its applications with Tensorflow
Stars: ✭ 58 (-32.56%)
Mutual labels:  wavenet
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+356.98%)
Mutual labels:  speech
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-63.95%)
Mutual labels:  speech
Pycadl
Python package with source code from the course "Creative Applications of Deep Learning w/ TensorFlow"
Stars: ✭ 356 (+313.95%)
Mutual labels:  wavenet
Chainer Vq Vae
A Chainer implementation of VQ-VAE.
Stars: ✭ 77 (-10.47%)
Mutual labels:  wavenet
Ios 10 Sampler
Code examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+3784.88%)
Mutual labels:  speech
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+7127.91%)
Mutual labels:  speech
Css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (+251.16%)
Mutual labels:  speech
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+245.35%)
Mutual labels:  speech
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+1118.6%)
Mutual labels:  speech
Praat
Praat: Doing Phonetics By Computer
Stars: ✭ 675 (+684.88%)
Mutual labels:  speech
Clarinet
A Pytorch Implementation of ClariNet
Stars: ✭ 273 (+217.44%)
Mutual labels:  wavenet
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+1082.56%)
Mutual labels:  speech
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+473.26%)
Mutual labels:  speech
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-25.58%)
Mutual labels:  speech
Xr3player
🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (+448.84%)
Mutual labels:  speech
Vq Vae Wavenet
TensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810
Stars: ✭ 40 (-53.49%)
Mutual labels:  wavenet
Cboard
AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+408.14%)
Mutual labels:  speech
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1317.44%)
Mutual labels:  speech
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+374.42%)
Mutual labels:  speech
Wsay
Windows "say"
Stars: ✭ 36 (-58.14%)
Mutual labels:  speech
Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (+346.51%)
Mutual labels:  speech
Sound Source Localization Algorithm doa estimation
关于语音信号声源定位DOA估计所用的一些传统算法
Stars: ✭ 58 (-32.56%)
Mutual labels:  speech
Voice Builder
An opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+320.93%)
Mutual labels:  speech
Pytorch Uniwavenet
Stars: ✭ 30 (-65.12%)
Mutual labels:  wavenet
Time Series Prediction
A collection of time series prediction methods: rnn, seq2seq, cnn, wavenet, transformer, unet, n-beats, gan, kalman-filter
Stars: ✭ 351 (+308.14%)
Mutual labels:  wavenet
Julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+1362.79%)
Mutual labels:  speech
Tts
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+6210.47%)
Mutual labels:  speech
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+779.07%)
Mutual labels:  speech
Android Speech
Android speech recognition and text to speech made easy
Stars: ✭ 310 (+260.47%)
Mutual labels:  speech
Wavenet
WaveNet implementation with chainer
Stars: ✭ 53 (-38.37%)
Mutual labels:  wavenet
Pocketsphinx Python
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+246.51%)
Mutual labels:  speech
Parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+693.02%)
Mutual labels:  wavenet
Sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (+234.88%)
Mutual labels:  speech
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-19.77%)
Mutual labels:  speech
Segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+668.6%)
Mutual labels:  speech
Pytorchwavenetvocoder
WaveNet-Vocoder implementation with pytorch.
Stars: ✭ 269 (+212.79%)
Mutual labels:  wavenet
Speech Vad Demo
集成Webrtc的VAD,用于切分音频文件
Stars: ✭ 259 (+201.16%)
Mutual labels:  speech
Speech Aligner
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (+201.16%)
Mutual labels:  speech
Tacotron2
pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf
Stars: ✭ 46 (-46.51%)
Mutual labels:  wavenet
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+636.05%)
Mutual labels:  speech
Amazing Python Scripts
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (+166.28%)
Mutual labels:  speech
Noise2Noise-audio denoising without clean training data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-43.02%)
Mutual labels:  speech
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+623.26%)
Mutual labels:  speech
Audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+1367.44%)
Mutual labels:  speech
1-60 of 212 similar projects