SingleVCAny-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.
Stars: ✭ 25 (-45.65%)
voice-conversionan tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-43.48%)
ppg-vcPPG-Based Voice Conversion
Stars: ✭ 154 (+234.78%)
YourTTSYourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Stars: ✭ 217 (+371.74%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+9754.35%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+271.74%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+541.3%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+202.17%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-52.17%)
DaisyXMusicFree and Open Source Group Voice chat music player for telegram ❤️ with button support youtube playback support
Stars: ✭ 204 (+343.48%)
LingvoLingvo
Stars: ✭ 2,361 (+5032.61%)
sova-tts-engineTacotron2 based engine for the SOVA-TTS project
Stars: ✭ 63 (+36.96%)
VocganVocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (+243.48%)
vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Stars: ✭ 1,604 (+3386.96%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (+202.17%)
VQMIVCOfficial implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+504.35%)
Xva SynthMachine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (+195.65%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-45.65%)
MaryttsMARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: ✭ 1,699 (+3593.48%)
ttsflowtensorflow speech synthesis c++ inference for voicenet
Stars: ✭ 17 (-63.04%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+3495.65%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+3180.43%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+41.3%)
WavegradImplementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+432.61%)
WavernnWaveRNN Vocoder + TTS
Stars: ✭ 1,636 (+3456.52%)
UniversalvocodingA PyTorch implementation of "Robust Universal Neural Vocoding"
Stars: ✭ 197 (+328.26%)
wiki2ssmlWiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Stars: ✭ 31 (-32.61%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+202.17%)
Cyclegan Vc2Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Stars: ✭ 158 (+243.48%)
GlottDNNGlottDNN vocoder and tools for training DNN excitation models
Stars: ✭ 30 (-34.78%)
Tacotron 2DeepMind's Tacotron-2 Tensorflow implementation
Stars: ✭ 1,968 (+4178.26%)
StyleSpeechOfficial implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+250%)
Tensorflowtts😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+5078.26%)
samSoftware Automatic Mouth - Tiny Speech Synthesizer
Stars: ✭ 316 (+586.96%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (+200%)
audioslides.ioUse Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.
Stars: ✭ 19 (-58.7%)
ZerospeechVQ-VAE for Acoustic Unit Discovery and Voice Conversion
Stars: ✭ 137 (+197.83%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+82.61%)
CotatronOfficial code for Cotatron @ INTERSPEECH 2020
Stars: ✭ 137 (+197.83%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-28.26%)
Legacy straightA vocoder framework which had been widely used in research community since 1999.
Stars: ✭ 130 (+182.61%)
voderAn emulation of the Voder Speech Synthesizer.
Stars: ✭ 19 (-58.7%)
Pytorch Dc TtsText to Speech with PyTorch (English and Mongolian)
Stars: ✭ 122 (+165.22%)
Catch-A-WaveformOfficial pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Stars: ✭ 117 (+154.35%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (+141.3%)
tacotron2Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Stars: ✭ 17 (-63.04%)
CrystalCrystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (+134.78%)
Tacotron pytorchPyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+426.09%)
Tacotron PytorchA Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (+126.09%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+123.91%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+2895.65%)
NormitTranslations with speech synthesis in your terminal as a node package
Stars: ✭ 219 (+376.09%)
WaveflowA PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"
Stars: ✭ 95 (+106.52%)
Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (+97.83%)
TacotronA TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+5510.87%)
Cross-Speaker-Emotion-TransferPyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stars: ✭ 107 (+132.61%)