373 open source projects that are alternatives to or similar to Wavenet_vocoder

ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-91.8%)
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-84.58%)
QPPWG
Quasi-Periodic Parallel WaveGAN Pytorch implementation
Stars: ✭ 41 (-97.87%)
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (-73.21%)
Mutual labels:  speech, speech-processing, wavenet
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-95.43%)
Mutual labels:  speech, wavenet, speech-processing
Vq Vae Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-90.29%)
Mutual labels:  speech, speech-processing, wavenet
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (-84.68%)
Neural Voice Cloning With Few Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Stars: ✭ 211 (-89.04%)
Nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-84.01%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+22.59%)
Mutual labels:  speech, speech-synthesis
Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (-15.06%)
Mutual labels:  speech-synthesis, neural-vocoder
Voice Builder
An open-source text-to-speech (TTS) voice building tool
Stars: ✭ 362 (-81.2%)
Mutual labels:  speech, speech-synthesis
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (-56.33%)
React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (-90.55%)
Mutual labels:  speech, speech-processing
Tacotron 2
DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+2.18%)
Mutual labels:  speech-synthesis, wavenet
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-95.64%)
Mutual labels:  speech, speech-synthesis
Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (-87.28%)
Mutual labels:  speech, speech-synthesis
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-97.25%)
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-98.29%)
Mutual labels:  speech, speech-synthesis
LIUM
Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-98.55%)
Mutual labels:  speech, speech-processing
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-96.63%)
Mutual labels:  speech, speech-synthesis
speechrec
A simple speech recognition app using the Web Speech API interfaces
Stars: ✭ 18 (-99.07%)
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-92.78%)
Mutual labels:  speech, speech-synthesis
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-88.37%)
Mutual labels:  speech, speech-processing
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (-96.52%)
Mutual labels:  speech, speech-synthesis
Wavenet Enhancement
Speech Enhancement using Bayesian WaveNet
Stars: ✭ 86 (-95.53%)
Mutual labels:  speech, wavenet
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-94.24%)
Mutual labels:  speech, speech-synthesis
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-97.3%)
Mutual labels:  speech, speech-synthesis
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-99.01%)
Mutual labels:  speech, speech-synthesis
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-96.21%)
Mutual labels:  speech, speech-synthesis
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-96.16%)
Mutual labels:  speech, speech-synthesis
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (-91.8%)
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (-92.83%)
Mutual labels:  speech, speech-synthesis
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-74.56%)
Mutual labels:  speech, speech-synthesis
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-87.44%)
Mutual labels:  speech, speech-processing
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (-87.44%)
Mutual labels:  speech, speech-synthesis
Parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (-64.59%)
Mutual labels:  speech-synthesis, wavenet
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-98.39%)
Mutual labels:  speech, speech-synthesis
Gcc Nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Stars: ✭ 231 (-88.01%)
Mutual labels:  speech, speech-processing
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (-91.64%)
Mutual labels:  speech, speech-synthesis
Shifter
Pitch shifter using WSOLA and resampling, implemented in Python 3
Stars: ✭ 22 (-98.86%)
Mutual labels:  speech, speech-processing
Tf Wavenet vocoder
Wavenet and its applications with Tensorflow
Stars: ✭ 58 (-96.99%)
Mutual labels:  speech-synthesis, wavenet
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-98.6%)
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-97.2%)
Mutual labels:  speech, speech-synthesis
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (-14.12%)
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (-89.36%)
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (-94.39%)
Mutual labels:  speech, speech-synthesis
Pytorchwavenetvocoder
WaveNet-Vocoder implementation with pytorch.
Stars: ✭ 269 (-86.03%)
Mutual labels:  speech-synthesis, wavenet
Wsay
Windows "say"
Stars: ✭ 36 (-98.13%)
Mutual labels:  speech, speech-synthesis
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (-94.03%)
Mutual labels:  speech, speech-processing
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-93.35%)
Mutual labels:  speech
Pytorch Gan Timeseries
GANs for time series generation in pytorch
Stars: ✭ 109 (-94.34%)
Mutual labels:  wavenet
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (-21.65%)
Mutual labels:  speech-synthesis
Xva Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (-92.94%)
Mutual labels:  speech-synthesis
Pb bss
Collection of EM algorithms for blind source separation of audio signals
Stars: ✭ 127 (-93.41%)
Mutual labels:  speech-processing
Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (-94.39%)
Mutual labels:  speech-synthesis
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (+476.32%)
Mutual labels:  wavenet
Reconstructing faces from voices
An example of the paper "reconstructing faces from voices"
Stars: ✭ 127 (-93.41%)
Mutual labels:  speech
Python Speech recognition
A simple example of using the Baidu speech recognition API with Python.
Stars: ✭ 106 (-94.5%)
Mutual labels:  speech
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-92.99%)
Mutual labels:  speech