All Projects → Neural Voice Cloning With Few Samples → Similar Projects or Alternatives

347 Open source projects that are alternatives of or similar to Neural Voice Cloning With Few Samples

Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+40.76%)
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+39.81%)
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-25.12%)
Wavenet vocoder
WaveNet vocoder
Stars: ✭ 1,926 (+812.8%)
Voice Builder
An opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+71.56%)
Mutual labels:  speech, speech-synthesis
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-34.12%)
Mutual labels:  speech, speech-synthesis
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-85.31%)
Mutual labels:  speech, speech-synthesis
Vq Vae Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-11.37%)
Mutual labels:  speech, speech-processing
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-74.88%)
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (-23.7%)
Mutual labels:  speech, speech-synthesis
React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (-13.74%)
Mutual labels:  speech, speech-processing
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+6.16%)
Mutual labels:  speech, speech-processing
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (-48.82%)
Mutual labels:  speech, speech-synthesis
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-91.47%)
LIUM
Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-86.73%)
Mutual labels:  speech, speech-processing
Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+16.11%)
Mutual labels:  speech, speech-synthesis
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-60.19%)
Mutual labels:  speech, speech-synthesis
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (-25.12%)
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-69.19%)
Mutual labels:  speech, speech-synthesis
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-87.2%)
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-74.41%)
Mutual labels:  speech, speech-synthesis
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (-68.25%)
Mutual labels:  speech, speech-synthesis
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-75.36%)
Mutual labels:  speech, speech-synthesis
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-91%)
Mutual labels:  speech, speech-synthesis
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-65.4%)
Mutual labels:  speech, speech-synthesis
editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-64.93%)
Mutual labels:  speech, speech-synthesis
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+298.58%)
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (-2.84%)
Wavegrad
A fast, high-quality neural vocoder.
Stars: ✭ 138 (-34.6%)
Mutual labels:  speech, speech-synthesis
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+132.23%)
Mutual labels:  speech, speech-synthesis
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+683.89%)
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+144.55%)
Mutual labels:  speech, speech-processing
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (-45.5%)
Mutual labels:  speech, speech-processing
Nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (+45.97%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+1018.96%)
Mutual labels:  speech, speech-synthesis
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+14.69%)
Mutual labels:  speech, speech-processing
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+14.69%)
Mutual labels:  speech, speech-synthesis
Gcc Nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Stars: ✭ 231 (+9.48%)
Mutual labels:  speech, speech-processing
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-89.57%)
Mutual labels:  speech, speech-processing
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-84.36%)
Mutual labels:  speech, speech-synthesis
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-58.29%)
Mutual labels:  speech, speech-processing
Wsay
Windows "say"
Stars: ✭ 36 (-82.94%)
Mutual labels:  speech, speech-synthesis
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-47.39%)
Mutual labels:  speech, speech-synthesis
Tensorflowtts
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+1028.91%)
Mutual labels:  speech-synthesis
Chatbot Watson Android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (-19.91%)
Mutual labels:  speech
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-34.12%)
Mutual labels:  speech-synthesis
Emotion Classification From Audio Files
Understanding emotions from audio files using neural networks and multiple datasets.
Stars: ✭ 189 (-10.43%)
Mutual labels:  speech
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+893.84%)
Mutual labels:  speech
Zerospeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Stars: ✭ 137 (-35.07%)
Mutual labels:  speech-synthesis
Speech Enhancement
Deep neural network based speech enhancement toolkit
Stars: ✭ 167 (-20.85%)
Mutual labels:  speech-processing
Xva Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (-35.55%)
Mutual labels:  speech-synthesis
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-36.02%)
Mutual labels:  speech
Universalvocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
Stars: ✭ 197 (-6.64%)
Mutual labels:  speech-synthesis
Depression Detect
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-11.37%)
Mutual labels:  speech
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (-21.8%)
Mutual labels:  speech
Cotatron
Official code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (-35.07%)
Mutual labels:  speech-synthesis
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-36.97%)
Mutual labels:  speech-synthesis
Tts Papers
🐸 collection of TTS papers
Stars: ✭ 160 (-24.17%)
Mutual labels:  speech
Voice activity detection
Voice Activity Detection based on Deep Learning & TensorFlow
Stars: ✭ 132 (-37.44%)
Mutual labels:  speech
Avpi
an open source voice command macro software
Stars: ✭ 130 (-38.39%)
Mutual labels:  speech
1-60 of 347 similar projects