Tts🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Stars: ✭ 5,427 (+7040.79%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-3.95%)
WaveGrad2PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Stars: ✭ 55 (-27.63%)
LVCNetLVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: ✭ 67 (-11.84%)
Tensorflowtts😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+3034.21%)
MouseTooltipTranslatorchrome extension - When mouse hover on text, it shows translated tooltip using google translate
Stars: ✭ 93 (+22.37%)
oddvoicesAn indie singing synthesizer
Stars: ✭ 4 (-94.74%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-55.26%)
TacotronA TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+3296.05%)
FScape-nextAudio rendering software, based on UGen graphs. Issue tracker: https://codeberg.org/sciss/FScape-next/issues
Stars: ✭ 13 (-82.89%)
csound-extendedExtensions for Csound including algorithmic composition, Android app, and WebAssembly.
Stars: ✭ 38 (-50%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+288.16%)
FFTNetFFTNet: a Real-Time Speaker-Dependent Neural Vocoder
Stars: ✭ 63 (-17.11%)
ppt presenterConvert ppt to video with audio track, using text to speech synthesis
Stars: ✭ 38 (-50%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-14.47%)
WavegradImplementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+222.37%)
TensorVoxDesktop application for neural speech synthesis written in C++
Stars: ✭ 140 (+84.21%)
LingvoLingvo
Stars: ✭ 2,361 (+3006.58%)
Google TtsGoogle TTS (Text-To-Speech) for node.js
Stars: ✭ 180 (+136.84%)
speech courseYSDA course in Speech Processing.
Stars: ✭ 93 (+22.37%)
xedaCross EDA Abstraction and Automation
Stars: ✭ 25 (-67.11%)
Mrcp Plugin With Freeswitch使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
Stars: ✭ 168 (+121.05%)
myprosodyA Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Stars: ✭ 162 (+113.16%)
say-itTTS in command line -- Pronounce the Chinese and English words you typed in.
Stars: ✭ 19 (-75%)
synthesis🔥 Synthesis is Meteor + Polymer
Stars: ✭ 28 (-63.16%)
GlottDNNGlottDNN vocoder and tools for training DNN excitation models
Stars: ✭ 30 (-60.53%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+17.11%)
reefAutomatically labeling training data
Stars: ✭ 102 (+34.21%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-56.58%)
lessamplerlessampler is a Singing Voice Synthesizer
Stars: ✭ 59 (-22.37%)
FergunAn utility Discord bot written in C# using Discord.Net
Stars: ✭ 26 (-65.79%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+82.89%)
Wukong Robot🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
Stars: ✭ 3,110 (+3992.11%)
YANGstraight sourceAnalytic signal-based source information analysis for YANGstraight and real-time interactive tools
Stars: ✭ 31 (-59.21%)
MttsA Demo of Mandarin/Chinese TTS frontend
Stars: ✭ 229 (+201.32%)
Cross-Speaker-Emotion-TransferPyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stars: ✭ 107 (+40.79%)
Mimic Recording StudioMimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Stars: ✭ 202 (+165.79%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-72.37%)
pytorch FFTNetA pytorch implementation of FFTNet.
Stars: ✭ 35 (-53.95%)
Speaker adapted ttsMaking a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (+140.79%)
Gst Tacotron A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Stars: ✭ 175 (+130.26%)
vietTTSVietnamese Text to Speech library
Stars: ✭ 78 (+2.63%)
MelnetImplementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Stars: ✭ 161 (+111.84%)
Tts Papers🐸 collection of TTS papers
Stars: ✭ 160 (+110.53%)
vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Stars: ✭ 1,604 (+2010.53%)
lf synLearning-Based View Synthesis for Light Field Cameras - Pytorch
Stars: ✭ 31 (-59.21%)
Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+2455.26%)
hawkingThe retro text-to-speech bot for Discord
Stars: ✭ 24 (-68.42%)