155 open source projects that are alternatives to or similar to MediumVC

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (+13.04%)

Mutual labels: speech-synthesis

wiki2ssml

Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.

Stars: ✭ 31 (-32.61%)

Mutual labels: speech-synthesis

Artyom.js

A voice control, voice commands, speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now or Cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+2097.83%)

Mutual labels: speech-synthesis

Expressive tacotron

TensorFlow implementation of Expressive Tacotron

Stars: ✭ 192 (+317.39%)

Mutual labels: speech-synthesis

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-32.61%)

Mutual labels: speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (+202.17%)

Mutual labels: speech-synthesis

Pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Stars: ✭ 812 (+1665.22%)

Mutual labels: speech-synthesis

Cyclegan Vc2

Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2

Stars: ✭ 158 (+243.48%)

Mutual labels: speech-synthesis

World

A high-quality speech analysis, manipulation and synthesis system

Stars: ✭ 769 (+1571.74%)

Mutual labels: speech-synthesis

GlottDNN

GlottDNN vocoder and tools for training DNN excitation models

Stars: ✭ 30 (-34.78%)

Mutual labels: speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

Stars: ✭ 682 (+1382.61%)

Mutual labels: speech-synthesis

Tacotron 2

DeepMind's Tacotron-2 Tensorflow implementation

Stars: ✭ 1,968 (+4178.26%)

Mutual labels: speech-synthesis

Parrot

RNN-based generative models for speech.

Stars: ✭ 601 (+1206.52%)

Mutual labels: speech-synthesis

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+250%)

Mutual labels: speech-synthesis

Melgan Neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Stars: ✭ 592 (+1186.96%)

Mutual labels: speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese and German, and is easy to adapt for other languages)

Stars: ✭ 2,382 (+5078.26%)

Mutual labels: speech-synthesis

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+1078.26%)

Mutual labels: speech-synthesis

sam

Software Automatic Mouth - Tiny Speech Synthesizer

Stars: ✭ 316 (+586.96%)

Mutual labels: speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+965.22%)

Mutual labels: speech-synthesis

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (+200%)

Mutual labels: speech-synthesis

Gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Stars: ✭ 460 (+900%)

Mutual labels: speech-synthesis

audioslides.io

Use Amazon Polly, Google Slides and FFmpeg to create videos that can be updated at any time by anyone. This project is written in Elixir.

Stars: ✭ 19 (-58.7%)

Mutual labels: speech-synthesis

Zerospeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Stars: ✭ 137 (+197.83%)

Mutual labels: speech-synthesis

Libfaceid

libfaceid is a research framework for prototyping face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models with speech synthesis and speech recognition.

Stars: ✭ 354 (+669.57%)

Mutual labels: speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+82.61%)

Mutual labels: speech-synthesis

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+604.35%)

Mutual labels: speech-synthesis

Cotatron

Official code for Cotatron @ INTERSPEECH 2020

Stars: ✭ 137 (+197.83%)

Mutual labels: speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+578.26%)

Mutual labels: speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-28.26%)

Mutual labels: speech-synthesis

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+545.65%)

Mutual labels: speech-synthesis

Legacy straight

A vocoder framework that has been widely used in the research community since 1999.

Stars: ✭ 130 (+182.61%)

Mutual labels: speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+506.52%)

Mutual labels: speech-synthesis

voder

An emulation of the Voder Speech Synthesizer.

Stars: ✭ 19 (-58.7%)

Mutual labels: speech-synthesis

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+7910.87%)

Mutual labels: speech-synthesis

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (+165.22%)

Mutual labels: speech-synthesis

Catch-A-Waveform

Official PyTorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Stars: ✭ 117 (+154.35%)

Mutual labels: speech-synthesis

mimic2

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.

Stars: ✭ 537 (+1067.39%)

Mutual labels: speech-synthesis

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-39.13%)

Mutual labels: speech-synthesis

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (+141.3%)

Mutual labels: speech-synthesis

Tacotron pytorch

PyTorch implementation of Tacotron

Stars: ✭ 12 (-73.91%)

Mutual labels: speech-synthesis

tacotron2

PyTorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Stars: ✭ 17 (-63.04%)

Mutual labels: speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (+60.87%)

Mutual labels: speech-synthesis

Crystal

Crystal - C++ implementation of a unified framework for a multilingual TTS synthesis engine with the SSML specification as its interface.

Stars: ✭ 108 (+134.78%)

Mutual labels: speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (+58.7%)

Mutual labels: speech-synthesis

Tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Stars: ✭ 2,581 (+5510.87%)

Mutual labels: speech-synthesis

Cnn vocoder

A fast cnn-based vocoder

Stars: ✭ 74 (+60.87%)

Mutual labels: speech-synthesis

Meta-TTS

Official repository of https://arxiv.org/abs/2111.04040v1

Stars: ✭ 69 (+50%)

Mutual labels: speech-synthesis

Tacotron Pytorch

A PyTorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

Stars: ✭ 104 (+126.09%)

Mutual labels: speech-synthesis

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+426.09%)

Mutual labels: speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+243.48%)

Mutual labels: speech-synthesis

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+2895.65%)

Mutual labels: speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-58.7%)

Mutual labels: speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+15.22%)

Mutual labels: speech-synthesis

Merlin

This is now the official location of the Merlin project.

Stars: ✭ 1,168 (+2439.13%)

Mutual labels: speech-synthesis

Cross vc

Cross-lingual Voice Conversion

Stars: ✭ 91 (+97.83%)

Mutual labels: speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (+132.61%)

Mutual labels: speech-synthesis

TensorVox

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (+204.35%)

Mutual labels: speech-synthesis

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-23.91%)

Mutual labels: speech-synthesis

NanoFlow

PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)