End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+177.78%)

Mutual labels: speech-synthesis

lyrebird-slack-integration

Send voicified messages on Slack using your vocal avatar!

Stars: ✭ 31 (+72.22%)

Mutual labels: voice-synthesis

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+4572.22%)

Mutual labels: speech-synthesis

sam

Software Automatic Mouth - Tiny Speech Synthesizer

Stars: ✭ 316 (+1655.56%)

Mutual labels: speech-synthesis

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Stars: ✭ 107 (+494.44%)

Mutual labels: speech-synthesis

tacotron2

Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Stars: ✭ 17 (-5.56%)

Mutual labels: speech-synthesis

WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Stars: ✭ 55 (+205.56%)

Mutual labels: speech-synthesis

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (+38.89%)

Mutual labels: speech-synthesis

Tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Stars: ✭ 2,581 (+14238.89%)

Mutual labels: speech-synthesis

Universalvocoding

A PyTorch implementation of "Robust Universal Neural Vocoding"

Stars: ✭ 197 (+994.44%)

Mutual labels: speech-synthesis

Expressive tacotron

Tensorflow Implementation of Expressive Tacotron

Stars: ✭ 192 (+966.67%)

Mutual labels: speech-synthesis

QPPWG

Quasi-Periodic Parallel WaveGAN Pytorch implementation

Stars: ✭ 41 (+127.78%)

Mutual labels: speech-synthesis

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (+94.44%)

Mutual labels: speech-synthesis

Cyclegan Vc2

Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2

Stars: ✭ 158 (+777.78%)

Mutual labels: speech-synthesis

lessampler

lessampler is a Singing Voice Synthesizer

Stars: ✭ 59 (+227.78%)

Mutual labels: voice-synthesis

JSpeak

A Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features

Stars: ✭ 16 (-11.11%)

Mutual labels: voice-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+194.44%)

Mutual labels: speech-synthesis

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+1038.89%)

Mutual labels: speech-synthesis

wiki2ssml

Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.

Stars: ✭ 31 (+72.22%)

Mutual labels: speech-synthesis

few-shot-transformer-tts

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Stars: ✭ 60 (+233.33%)

Mutual labels: speech-synthesis

sova-tts-engine

Tacotron2 based engine for the SOVA-TTS project

Stars: ✭ 63 (+250%)

Mutual labels: speech-synthesis

Tacotron 2

DeepMind's Tacotron-2 Tensorflow implementation

Stars: ✭ 1,968 (+10833.33%)

Mutual labels: speech-synthesis

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+1538.89%)

Mutual labels: speech-synthesis

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (+155.56%)

Mutual labels: speech-synthesis

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+366.67%)

Mutual labels: speech-synthesis

deep-learning-german-tts

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.

Stars: ✭ 268 (+1388.89%)

Mutual labels: speech-synthesis

ttsflow

tensorflow speech synthesis c++ inference for voicenet

Stars: ✭ 17 (-5.56%)

Mutual labels: speech-synthesis

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (+83.33%)

Mutual labels: speech-synthesis

ppg-vc

PPG-Based Voice Conversion

Stars: ✭ 154 (+755.56%)

Mutual labels: speech-synthesis

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Stars: ✭ 1,604 (+8811.11%)

Mutual labels: speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Stars: ✭ 2,382 (+13133.33%)

Mutual labels: speech-synthesis

Normit

Translations with speech synthesis in your terminal as a node package

Stars: ✭ 219 (+1116.67%)

Mutual labels: speech-synthesis

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+794.44%)

Mutual labels: speech-synthesis

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (+1072.22%)

Mutual labels: speech-synthesis

sova-tts-tps

NLP-preprocessor for the SOVA-TTS project

Stars: ✭ 44 (+144.44%)

Mutual labels: speech-synthesis

Lingvo

Stars: ✭ 2,361 (+13016.67%)

Mutual labels: speech-synthesis

SingleVC

Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.

Stars: ✭ 25 (+38.89%)

Mutual labels: speech-synthesis

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (+850%)

Mutual labels: speech-synthesis

Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Stars: ✭ 41 (+127.78%)

Mutual labels: speech-synthesis

Vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Stars: ✭ 158 (+777.78%)

Mutual labels: speech-synthesis

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Stars: ✭ 139 (+672.22%)

Mutual labels: speech-synthesis

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+10600%)

Mutual labels: speech-synthesis

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (+266.67%)

Mutual labels: speech-synthesis

audioslides.io

Use Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.