All Projects → Tacotron → Similar Projects or Alternatives

322 Open source projects that are alternatives of or similar to Tacotron

Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+535.02%)
Mutual labels:  speech
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+253.99%)
Mutual labels:  speech
Wikipron
Massively multilingual pronunciation mining
Stars: ✭ 99 (-94.36%)
Mutual labels:  speech
Ekho
Chinese text-to-speech engine
Stars: ✭ 690 (-60.71%)
Mutual labels:  tts
Androidmarytts
Android MARY TTS - an open-source, offline HMM-Based text-to-speech synthesis system based on MaryTTS
Stars: ✭ 134 (-92.37%)
Mutual labels:  tts
Praat
Praat: Doing Phonetics By Computer
Stars: ✭ 675 (-61.56%)
Mutual labels:  speech
Joytan
Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Stars: ✭ 91 (-94.82%)
Mutual labels:  tts
Code Switching Papers
A curated list of research papers and resources on code-switching
Stars: ✭ 122 (-93.05%)
Mutual labels:  speech
Ios 10 Sampler
Code examples for new APIs of iOS 10.
Stars: ✭ 3,341 (+90.26%)
Mutual labels:  speech
Transformertts
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
Stars: ✭ 617 (-64.86%)
Mutual labels:  tts
Pitchtron
TTS for pitch-accented language. Korean dialect DB.
Stars: ✭ 91 (-94.82%)
Mutual labels:  tts
Real Time Voice Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+1727.73%)
Mutual labels:  tts
Amazon Polly Sample
Sample application for Amazon Polly. Allows to convert any blog into an audio podcast.
Stars: ✭ 139 (-92.08%)
Mutual labels:  tts
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-69.7%)
Mutual labels:  speech
Wavenet Enhancement
Speech Enhancement using Bayesian WaveNet
Stars: ✭ 86 (-95.1%)
Mutual labels:  speech
Nlp Paper
自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-96.18%)
Mutual labels:  speech
Hifi Gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-81.49%)
Mutual labels:  tts
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (-5.81%)
Mutual labels:  tts
Xr3player
🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (-73.12%)
Mutual labels:  speech
Talkify
Javascript Text to speech library
Stars: ✭ 132 (-92.48%)
Mutual labels:  tts
Cognitive Speech Tts
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-82.23%)
Mutual labels:  tts
Specaugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-76.77%)
Mutual labels:  speech
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (-30.58%)
Mutual labels:  speech
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-77.62%)
Mutual labels:  speech
Speech And Text Unity Ios Android
Speed to text in Unity iOS use Native Speech Recognition
Stars: ✭ 117 (-93.34%)
Mutual labels:  speech
Cnn vocoder
A fast cnn-based vocoder
Stars: ✭ 74 (-95.79%)
Mutual labels:  tts
Voc
A physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-92.65%)
Mutual labels:  speech
Sound Source Localization Algorithm doa estimation
关于语音信号声源定位DOA估计所用的一些传统算法
Stars: ✭ 58 (-96.7%)
Mutual labels:  speech
Css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (-82.8%)
Mutual labels:  speech
Dla
Deep learning for audio processing
Stars: ✭ 142 (-91.91%)
Mutual labels:  tts
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-96.07%)
Mutual labels:  speech
Multilingual text to speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (-81.55%)
Mutual labels:  tts
Holobot
HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-93.51%)
Mutual labels:  speech
Facemoji
😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS
Stars: ✭ 320 (-81.78%)
Mutual labels:  tts
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-96.36%)
Mutual labels:  speech
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-83.09%)
Mutual labels:  speech
Cs224n Gpu That Talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-97.04%)
Mutual labels:  tts
Glow Tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: ✭ 284 (-83.83%)
Mutual labels:  tts
Pocketsphinx Python
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-83.03%)
Mutual labels:  speech
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-96.75%)
Mutual labels:  speech
Sednn
deep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (-83.6%)
Mutual labels:  speech
Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (-93.85%)
Mutual labels:  tts
Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-84.11%)
Mutual labels:  tts
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (-40.32%)
Mutual labels:  speech
Make A Smart Speaker
A collection of resources to make a smart speaker
Stars: ✭ 268 (-84.74%)
Mutual labels:  tts
Neural Voice Cloning With Few Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
Stars: ✭ 262 (-85.08%)
Mutual labels:  tts
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-92.08%)
Mutual labels:  speech
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-92.71%)
Mutual labels:  speech
Python Speech recognition
A simple example for use speech recognition baidu api with python.
Stars: ✭ 106 (-93.96%)
Mutual labels:  speech
Parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-97.27%)
Mutual labels:  tts
Flutter tts
Flutter Text to Speech package
Stars: ✭ 263 (-85.02%)
Mutual labels:  tts
Speech Vad Demo
集成Webrtc的VAD,用于切分音频文件
Stars: ✭ 259 (-85.25%)
Mutual labels:  speech
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-97.38%)
Mutual labels:  tts
Speech Aligner
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (-85.25%)
Mutual labels:  speech
Amazing Python Scripts
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (-86.96%)
Mutual labels:  speech
Wavernn
WaveRNN Vocoder + TTS
Stars: ✭ 1,636 (-6.83%)
Mutual labels:  tts
Stl
The ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-97.49%)
Mutual labels:  speech
Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (-98.75%)
Mutual labels:  tts
Noise2Noise-audio denoising without clean training data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-97.21%)
Mutual labels:  speech
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-94.99%)
Mutual labels:  speech
61-120 of 322 similar projects