tts dataset maker
A GUI to help make a text-to-speech dataset.
Stars: 20 (-91.84%)
Sinsy-NG
(discontinued) The Formant-Based All Language Singing Voice Synthesis System: Sinsy-NG
Stars: 15 (-93.88%)
web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: 19 (-92.24%)
Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
Stars: 69 (-71.84%)
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: 19 (-92.24%)
YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Stars: 217 (-11.43%)
text-to-speech
Capacitor plugin for synthesizing speech from text.
Stars: 50 (-79.59%)
sam
SAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM)
Stars: 33 (-86.53%)
dctts-pytorch
The PyTorch implementation of DC-TTS
Stars: 73 (-70.2%)
persian-tts
A simple human-based text-to-speech synthesiser and React Native app for the Persian language.
Stars: 18 (-92.65%)
leon
Leon is your open-source personal assistant.
Stars: 8,560 (+3393.88%)
speak.awf
An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak text aloud.
Stars: 29 (-88.16%)
Amazing Python Scripts
Curated collection of Amazing Python scripts, from basic to advanced, with automation task scripts.
Stars: 229 (-6.53%)
Pysptk
A Python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: 297 (+21.22%)
Nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Stars: 308 (+25.71%)
google-translate-tts
Node library for Google Translate TTS (Text-to-Speech) API
Stars: 23 (-90.61%)
Android Speech
Android speech recognition and text to speech made easy
Stars: 310 (+26.53%)
Espeak
eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: 339 (+38.37%)
Gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Stars: 460 (+87.76%)
Transformer Tts
A PyTorch Implementation of "Neural Speech Synthesis with Transformer Network"
Stars: 418 (+70.61%)
Xzvoice
Free and open source text-to-speech software
Stars: 355 (+44.9%)
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: 493 (+101.22%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: 490 (+100%)
Athena
An open-source implementation of a sequence-to-sequence-based speech processing engine
Stars: 542 (+121.22%)
Transformertts
Transformer TTS: Implementation of a non-autoregressive Transformer-based neural network for text to speech.
Stars: 617 (+151.84%)
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: 633 (+158.37%)
golang-tts
Text-to-speech Golang package based on the Amazon Polly service
Stars: 19 (-92.24%)
talkbot
Text-to-speech and translation bot for Discord
Stars: 27 (-88.98%)
Zhrtvc
Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and Chinese speech synthesis system, comprising a speech encoder, a speech synthesizer, a vocoder, and a visualization module.
Stars: 771 (+214.69%)
Flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Stars: 546 (+122.86%)
Rhvoice
A free and open source speech synthesizer for Russian and other languages
Stars: 750 (+206.12%)
Espeak Ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
Stars: 799 (+226.12%)
Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: 43 (-82.45%)
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: 1,017 (+315.1%)
Tacotron2
PyTorch Tacotron2 https://arxiv.org/pdf/1712.05884.pdf
Stars: 46 (-81.22%)
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: 64 (-73.88%)
Tts
Tools to convert text to speech
Stars: 84 (-65.71%)
Cnn vocoder
A fast CNN-based vocoder
Stars: 74 (-69.8%)
Speaker
A PHP library to convert text to speech using various web services
Stars: 86 (-64.9%)
Asrgen
Attacking Speaker Recognition with Deep Generative Models
Stars: 31 (-87.35%)
Merlin
This is now the official location of the Merlin project.
Stars: 1,168 (+376.73%)
Joytan
Creative Audio/Textbook Maker. See our YouTube channel
Stars: 91 (-62.86%)
Waveflow
A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"
Stars: 95 (-61.22%)
Tacotron Pytorch
A PyTorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: 104 (-57.55%)
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: 1,378 (+462.45%)
Cross Lingual Voice Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning.
Stars: 106 (-56.73%)
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: 1,654 (+575.1%)
Talkify
JavaScript text-to-speech library
Stars: 132 (-46.12%)
Amazon Polly Sample
Sample application for Amazon Polly. Converts any blog into an audio podcast.
Stars: 139 (-43.27%)
Dla
Deep learning for audio processing
Stars: 142 (-42.04%)
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: 1,756 (+616.73%)
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: 158 (-35.51%)