All Projects → Neural Voice Cloning With Few Samples → Similar Projects or Alternatives

347 Open source projects that are alternatives of or similar to Neural Voice Cloning With Few Samples

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+40.76%)

Mutual labels: speech, speech-synthesis, speech-processing

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+39.81%)

Mutual labels: speech, speech-synthesis, speech-processing

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (-25.12%)

Mutual labels: speech, speech-synthesis, speech-processing

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+812.8%)

Mutual labels: speech, speech-synthesis, speech-processing

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+71.56%)

Mutual labels: speech, speech-synthesis

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-34.12%)

Mutual labels: speech, speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-85.31%)

Mutual labels: speech, speech-synthesis

Vq Vae Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Stars: ✭ 187 (-11.37%)

Mutual labels: speech, speech-processing

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-74.88%)

Mutual labels: speech-synthesis, speech-processing

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (-23.7%)

Mutual labels: speech, speech-synthesis

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (-13.74%)

Mutual labels: speech, speech-processing

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+6.16%)

Mutual labels: speech, speech-processing

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-48.82%)

Mutual labels: speech, speech-synthesis

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-91.47%)

Mutual labels: speech-synthesis, speech-processing

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-86.73%)

Mutual labels: speech, speech-processing

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+16.11%)

Mutual labels: speech, speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-60.19%)

Mutual labels: speech, speech-synthesis

Vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Stars: ✭ 158 (-25.12%)

Mutual labels: speech-synthesis, speech-processing

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-69.19%)

Mutual labels: speech, speech-synthesis

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-87.2%)

Mutual labels: speech-synthesis, speech-processing

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (-74.41%)

Mutual labels: speech, speech-synthesis

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-68.25%)

Mutual labels: speech, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-75.36%)

Mutual labels: speech, speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-91%)

Mutual labels: speech, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-65.4%)

Mutual labels: speech, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-64.93%)

Mutual labels: speech, speech-synthesis

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+298.58%)

Mutual labels: speech-synthesis, speech-processing

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-2.84%)

Mutual labels: speech-synthesis, speech-processing

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-34.6%)

Mutual labels: speech, speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+132.23%)

Mutual labels: speech, speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+683.89%)

Mutual labels: speech-synthesis, speech-processing

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+144.55%)

Mutual labels: speech, speech-processing

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-45.5%)

Mutual labels: speech, speech-processing

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (+45.97%)

Mutual labels: speech-synthesis, speech-processing

Lingvo

Stars: ✭ 2,361 (+1018.96%)

Mutual labels: speech, speech-synthesis

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+14.69%)

Mutual labels: speech, speech-processing

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+14.69%)

Mutual labels: speech, speech-synthesis

Gcc Nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

Stars: ✭ 231 (+9.48%)

Mutual labels: speech, speech-processing

Shifter

Pitch shifter using WSOLA and resampling implemented by Python3

Stars: ✭ 22 (-89.57%)

Mutual labels: speech, speech-processing

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-84.36%)

Mutual labels: speech, speech-synthesis

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-58.29%)

Mutual labels: speech, speech-processing

Wsay

Windows "say"

Stars: ✭ 36 (-82.94%)

Mutual labels: speech, speech-synthesis

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-47.39%)

Mutual labels: speech, speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Stars: ✭ 2,382 (+1028.91%)

Mutual labels: speech-synthesis

Chatbot Watson Android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Stars: ✭ 169 (-19.91%)

Mutual labels: speech

Prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (-34.12%)

Mutual labels: speech-synthesis

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (-10.43%)

Mutual labels: speech

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+893.84%)

Mutual labels: speech

Zerospeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Stars: ✭ 137 (-35.07%)

Mutual labels: speech-synthesis

Speech Enhancement

Deep neural network based speech enhancement toolkit

Stars: ✭ 167 (-20.85%)

Mutual labels: speech-processing

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games