Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+14.3%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+116.89%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-89.94%)
VocganVocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (-90.7%)
Tacotron2A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: ✭ 43 (-97.47%)
Tts🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (-82.05%)
GttsPython library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (-23.31%)
ukrainian-ttsUkrainian TTS (text-to-speech) using Coqui TTS
Stars: ✭ 74 (-95.64%)
JSpeakA Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features
Stars: ✭ 16 (-99.06%)
Tts🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+219.42%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-98%)
EMPHASIS-pytorchEMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
Stars: ✭ 15 (-99.12%)
Tacotron PytorchA Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (-93.88%)
MouseTooltipTranslatorchrome extension - When mouse hover on text, it shows translated tooltip using google translate
Stars: ✭ 93 (-94.53%)
XzvoiceFree and open source text-to-speech software
Stars: ✭ 355 (-79.11%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-96.17%)
golang-ttsText-to-Speach golang package based in Amazon Polly service
Stars: ✭ 19 (-98.88%)
SpeakerA PHP library to convert text to speech using various web services
Stars: ✭ 86 (-94.94%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-97.06%)
FastSpeech2Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
Stars: ✭ 64 (-96.23%)
EspeakeSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: ✭ 339 (-80.05%)
NnmnkwiiLibrary to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-81.87%)
voicesmacOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.
Stars: ✭ 53 (-96.88%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-98.41%)
FastSpeech2PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
Stars: ✭ 163 (-90.41%)
Sinsy-NG(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis System: Sinsy-NG
Stars: ✭ 15 (-99.12%)
melganMelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-96.82%)
STYLEROfficial repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
Stars: ✭ 105 (-93.82%)
text-to-speech⚡️ Capacitor plugin for synthesizing speech from text.
Stars: ✭ 50 (-97.06%)
laravel-text-to-speech💬 A wrapper for popular TTS services to create a more simple & uniform API. Currently, only AWS Polly is supported.
Stars: ✭ 26 (-98.47%)
Neural-HMMNeural HMMs are all you need (for high-quality attention-free TTS)
Stars: ✭ 69 (-95.94%)
Espeak NgeSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Stars: ✭ 799 (-52.97%)
Tacotron2-PyTorchYet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Stars: ✭ 118 (-93.05%)
ZhrtvcChinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统,包含语音编码器、语音合成器、声码器和可视化模块。
Stars: ✭ 771 (-54.62%)
TTS tfWIP Tensorflow implementation of https://github.com/mozilla/TTS
Stars: ✭ 14 (-99.18%)
dctts-pytorchThe pytorch implementation of DC-TTS
Stars: ✭ 73 (-95.7%)
persian-tts🔊 A simple human-based text-to-speach synthesiser and ReactNative app for Persian language.
Stars: ✭ 18 (-98.94%)
Cnn vocoderA fast cnn-based vocoder
Stars: ✭ 74 (-95.64%)
google-translate-ttsNode library for Google Translate TTS (Text-to-Speech) API
Stars: ✭ 23 (-98.65%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+403.83%)
talkbotText-to-speech and translation bot for Discord
Stars: ✭ 27 (-98.41%)
YourTTSYourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Stars: ✭ 217 (-87.23%)
Rhvoicea free and open source speech synthesizer for Russian and other languages
Stars: ✭ 750 (-55.86%)
samSAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM)
Stars: ✭ 33 (-98.06%)
deep-learning-german-ttsThorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Stars: ✭ 268 (-84.23%)
speak.awfAn Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak text aloud.
Stars: ✭ 29 (-98.29%)
MerlinThis is now the official location of the Merlin project.
Stars: ✭ 1,168 (-31.25%)
Deepvoice3 pytorchPyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (-2.65%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-18.89%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-96.23%)
ParrotRNN-based generative models for speech.
Stars: ✭ 601 (-64.63%)
FastspeechThe Implementation of FastSpeech based on pytorch.
Stars: ✭ 600 (-64.69%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (-34.08%)