End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-81.41%)

Mutual labels: speech-synthesis

EmotionalConversionStarGAN

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

Stars: ✭ 92 (-65.8%)

Mutual labels: speech-synthesis

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (-82.9%)

Mutual labels: speech-synthesis

klatt-syn

Klatt formant synthesizer

Stars: ✭ 18 (-93.31%)

Mutual labels: speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-87.73%)

Mutual labels: speech-synthesis

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (-40.15%)

Mutual labels: speech-synthesis

porfir

Голосовой ассистент Порфирьевич

Stars: ✭ 23 (-91.45%)

Mutual labels: speech-synthesis

ppg-vc

PPG-Based Voice Conversion

Stars: ✭ 154 (-42.75%)

Mutual labels: speech-synthesis

SingleVC

Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.

Stars: ✭ 25 (-90.71%)

Mutual labels: speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (-84.76%)

Mutual labels: speech-synthesis

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-23.79%)

Mutual labels: speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-72.49%)

Mutual labels: speech-synthesis

deep-learning-german-tts

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.

Stars: ✭ 268 (-0.37%)

Mutual labels: speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-92.94%)

Mutual labels: speech-synthesis

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-59.85%)

Mutual labels: speech-synthesis

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-89.59%)

Mutual labels: speech-synthesis

mimic2

Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.

Stars: ✭ 537 (+99.63%)

Mutual labels: speech-synthesis

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (-75.46%)

Mutual labels: speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-72.86%)

Mutual labels: speech-synthesis

Music-Style-Transfer

Source code for "Transferring the Style of Homophonic Music Using Recurrent Neural Networks and Autoregressive Model"

Stars: ✭ 16 (-94.05%)

Mutual labels: wavenet

spoken-word

Spoken Word

Stars: ✭ 46 (-82.9%)

Mutual labels: speech-synthesis

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (-77.7%)

Mutual labels: speech-synthesis

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-67.29%)

Mutual labels: wavenet

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-93.31%)

Mutual labels: speech-synthesis

ExtensibleTTS-PyTorch

An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery

Stars: ✭ 25 (-90.71%)

Mutual labels: speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (-60.22%)

Mutual labels: speech-synthesis

Meta-TTS

Official repository of https://arxiv.org/abs/2111.04040v1

Stars: ✭ 69 (-74.35%)

Mutual labels: speech-synthesis

TensorVox

Desktop application for neural speech synthesis written in C++

Stars: ✭ 140 (-47.96%)

Mutual labels: speech-synthesis

Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG

Stars: ✭ 15 (-94.42%)

Mutual labels: speech-synthesis

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (-90.71%)

Mutual labels: speech-synthesis

Tacotron pytorch

Tacotron implementation of pytorch

Stars: ✭ 12 (-95.54%)

Mutual labels: speech-synthesis

chainer-Fast-WaveNet

A Chainer implementation of Fast WaveNet(mel-spectrogram vocoder).

Stars: ✭ 33 (-87.73%)

Mutual labels: wavenet

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-86.99%)

Mutual labels: speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (-48.33%)

Mutual labels: speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+496.28%)

Mutual labels: speech-synthesis

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Stars: ✭ 217 (-19.33%)

Mutual labels: speech-synthesis

TinyCog

Small Robot, Toy Robot platform

Stars: ✭ 29 (-89.22%)

Mutual labels: speech-synthesis

audioslides.io

Use Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.

Stars: ✭ 19 (-92.94%)

Mutual labels: speech-synthesis

Catch-A-Waveform

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Stars: ✭ 117 (-56.51%)

Mutual labels: speech-synthesis

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-89.96%)

Mutual labels: speech-synthesis

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-91.82%)

Mutual labels: speech-synthesis

1-60 of 168 similar projects

›