322 open source projects that are alternatives to or similar to Tacotron

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+535.02%)

Mutual labels: speech

💬 Speech recognition for your site

Stars: ✭ 6,216 (+253.99%)

Mutual labels: speech

Massively multilingual pronunciation mining

Stars: ✭ 99 (-94.36%)

Mutual labels: speech

Chinese text-to-speech engine

Stars: ✭ 690 (-60.71%)

Mutual labels: tts

Android MARY TTS - an open-source, offline HMM-Based text-to-speech synthesis system based on MaryTTS

Stars: ✭ 134 (-92.37%)

Mutual labels: tts

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (-61.56%)

Mutual labels: speech

Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel

Stars: ✭ 91 (-94.82%)

Mutual labels: tts

Code Switching Papers

A curated list of research papers and resources on code-switching

Stars: ✭ 122 (-93.05%)

Mutual labels: speech

Code examples for new APIs of iOS 10.

Stars: ✭ 3,341 (+90.26%)

Mutual labels: speech

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Stars: ✭ 617 (-64.86%)

Mutual labels: tts

TTS for pitch-accented language. Korean dialect DB.

Stars: ✭ 91 (-94.82%)

Mutual labels: tts

Real Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Stars: ✭ 32,095 (+1727.73%)

Mutual labels: tts

Amazon Polly Sample

Sample application for Amazon Polly. Converts any blog into an audio podcast.

Stars: ✭ 139 (-92.08%)

Mutual labels: tts

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-69.7%)

Mutual labels: speech

Wavenet Enhancement

Speech Enhancement using Bayesian WaveNet

Stars: ✭ 86 (-95.1%)

Mutual labels: speech

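Dialogue and speech within natural language processing: a collection of related papers (with reading notes), model reproductions, data processing, and more (code in both TensorFlow and PyTorch versions)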

Stars: ✭ 67 (-96.18%)

Mutual labels: speech

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-81.49%)

Mutual labels: tts

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (-5.81%)

Mutual labels: tts

🎧 🎼 Advanced JavaFX Media Player

Stars: ✭ 472 (-73.12%)

Mutual labels: speech

Javascript Text to speech library

Stars: ✭ 132 (-92.48%)

Mutual labels: tts

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-82.23%)

Mutual labels: tts

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Stars: ✭ 408 (-76.77%)

Mutual labels: speech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (-30.58%)

Mutual labels: speech

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-77.62%)

Mutual labels: speech

Speech And Text Unity Ios Android

Speech to text in Unity iOS using native speech recognition

Stars: ✭ 117 (-93.34%)

Mutual labels: speech

A fast cnn-based vocoder

Stars: ✭ 74 (-95.79%)

Mutual labels: tts

A physical model of the human vocal tract using literate programming, based on Pink Trombone.

Stars: ✭ 129 (-92.65%)

Mutual labels: speech

Sound Source Localization Algorithm doa estimation

Some traditional algorithms used for sound source localization (DOA estimation) of speech signals

Stars: ✭ 58 (-96.7%)

Mutual labels: speech

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Stars: ✭ 302 (-82.8%)

Mutual labels: speech

Deep learning for audio processing

Stars: ✭ 142 (-91.91%)

Mutual labels: tts

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-96.07%)

Mutual labels: speech

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-81.55%)

Mutual labels: tts

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-93.51%)

Mutual labels: speech

😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS

Stars: ✭ 320 (-81.78%)

Mutual labels: tts

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Stars: ✭ 64 (-96.36%)

Mutual labels: speech

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (-83.09%)

Mutual labels: speech

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-97.04%)

Mutual labels: tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (-83.83%)

Mutual labels: tts

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-83.03%)

Mutual labels: speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-96.75%)

Mutual labels: speech

Deep learning-based speech enhancement using Keras or PyTorch, made easy to use

Stars: ✭ 288 (-83.6%)

Mutual labels: speech

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (-93.85%)

Mutual labels: tts

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-84.11%)

Mutual labels: tts

Free, easy, portable audio engine for games

Stars: ✭ 1,048 (-40.32%)

Mutual labels: speech

Make A Smart Speaker

A collection of resources to make a smart speaker

Stars: ✭ 268 (-84.74%)

Mutual labels: tts

Neural Voice Cloning With Few Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Stars: ✭ 262 (-85.08%)

Mutual labels: tts

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-92.08%)

Mutual labels: speech

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-92.71%)

Mutual labels: speech

Python Speech recognition

A simple example of using the Baidu speech recognition API with Python.

Stars: ✭ 106 (-93.96%)

Mutual labels: speech

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.

Stars: ✭ 48 (-97.27%)

Mutual labels: tts

Flutter Text to Speech package

Stars: ✭ 263 (-85.02%)

Mutual labels: tts

Speech Vad Demo

Integrates WebRTC's VAD to split audio files

Stars: ✭ 259 (-85.25%)

Mutual labels: speech

A collection of basic python modules for spoken natural language processing

Stars: ✭ 46 (-97.38%)

Mutual labels: tts

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription

Stars: ✭ 259 (-85.25%)

Mutual labels: speech

Amazing Python Scripts

🚀 Curated collection of amazing Python scripts, from basic to advanced, with automation task scripts.

Stars: ✭ 229 (-86.96%)

Mutual labels: speech

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (-6.83%)

Mutual labels: tts

The ITU-T Software Tool Library (G.191)

Stars: ✭ 44 (-97.49%)

Mutual labels: speech

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-98.75%)

Mutual labels: tts

Noise2Noise-audio denoising without clean training data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…

Stars: ✭ 49 (-97.21%)

Mutual labels: speech

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-94.99%)

Mutual labels: speech
