PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-45%)

Mutual labels: tts, tacotron

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+3990%)

Mutual labels: tacotron, tts

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+13467.5%)

Mutual labels: tacotron, tts

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Stars: ✭ 118 (+195%)

Mutual labels: tts, tacotron

FCH-TTS

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Stars: ✭ 154 (+285%)

Mutual labels: tts, tacotron

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+5112.5%)

Mutual labels: dnn, tts

Multi Tacotron Voice Cloning

Phoneme multilingual(Russian-English) voice cloning based on

Stars: ✭ 192 (+380%)

Mutual labels: tacotron, tts

Text-to-Speech-Landscape

No description or website provided.

Stars: ✭ 31 (-22.5%)

Mutual labels: tts, tacotron

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (+662.5%)

Mutual labels: tacotron, tts

Php Opencv Examples

Tutorial for computer vision and machine learning in PHP 7/8 by opencv (installation + examples + documentation)

Stars: ✭ 333 (+732.5%)

Mutual labels: dnn

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+1605%)

Mutual labels: tts

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (+710%)

Mutual labels: tts

Facemoji

😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS

Stars: ✭ 320 (+700%)

Mutual labels: tts

Dnn.appinsights

A module to use Visual Studio Application Insights with the DNN Platform (formerly DotNetNuke) CMS

Stars: ✭ 12 (-70%)

Mutual labels: dnn

Gaussian yolov3

Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving (ICCV, 2019)

Stars: ✭ 622 (+1455%)

Mutual labels: dnn

Gst Tacotron

A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"

Stars: ✭ 313 (+682.5%)

Mutual labels: tacotron

Android Speech

Android speech recognition and text to speech made easy

Stars: ✭ 310 (+675%)

Mutual labels: tts

Real Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Stars: ✭ 32,095 (+80137.5%)

Mutual labels: tts

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+597.5%)

Mutual labels: tts

Numpy neural network

仅使用numpy从头开始实现神经网络,包括反向传播公式推导过程; numpy构建全连接层、卷积层、池化层、Flatten层；以及图像分类案例及精调网络案例等,持续更新中... ...

Stars: ✭ 339 (+747.5%)

Mutual labels: dnn

Ekho

Chinese text-to-speech engine

Stars: ✭ 690 (+1625%)

Mutual labels: tts

Dnn.azureadprovider

The DNN Azure Active Directory Provider is an Authentication provider for DNN Platform (formerly DotNetNuke) that uses Azure Active Directory OAuth2 authentication to authenticate users.

Stars: ✭ 21 (-47.5%)

Mutual labels: dnn

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+712.5%)

Mutual labels: tts

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+1455%)

Mutual labels: dnn

Caffe Mobile

Optimized (for size and speed) Caffe lib for iOS and Android with out-of-the-box demo APP.

Stars: ✭ 316 (+690%)

Mutual labels: dnn

Awesome Tts Samples

Awesome list of TTS papers with audio samples

Stars: ✭ 35 (-12.5%)

Mutual labels: tts

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+680%)

Mutual labels: tts

Transformertts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Stars: ✭ 617 (+1442.5%)

Mutual labels: tts

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+610%)

Mutual labels: tts

Dnncommunity.home

This it the home for all DNN Community Projects

Stars: ✭ 9 (-77.5%)

Mutual labels: dnn

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+1255%)

Mutual labels: tts

Caffe Hrt

Heterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure framework to speed up Deep Learning on Arm-based heterogeneous embedded platform. It also retains all the features of the original Caffe architecture which users deploy their applications seamlessly.

Stars: ✭ 271 (+577.5%)

Mutual labels: dnn

Make A Smart Speaker

A collection of resources to make a smart speaker

Stars: ✭ 268 (+570%)

Mutual labels: tts

Nvquicksite

nvQuickSite is a desktop installation app for DNN, the world's most popular ASP.NET-based CMS. This app allows you to easily install DNN onto any environment that meets the minimum system requirements for DNN to be installed.

Stars: ✭ 36 (-10%)

Mutual labels: dnn

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-22.5%)

Mutual labels: tts

Dnn.platform.samples.mvc

DNN Sample MVC and SPA (Single Page Application) Modules

Stars: ✭ 26 (-35%)

Mutual labels: dnn

Neural Voice Cloning With Few Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Stars: ✭ 262 (+555%)

Mutual labels: tts

Flutter tts

Flutter Text to Speech package

Stars: ✭ 263 (+557.5%)

Mutual labels: tts

Facsvatar

An Open Source Modular Framework From Face to FACS Based Avatar Animation (Unity3D / Blender)

Stars: ✭ 260 (+550%)

Mutual labels: dnn

Melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Stars: ✭ 444 (+1010%)

Mutual labels: tts

Chaidnn

HLS based Deep Neural Network Accelerator Library for Xilinx Ultrascale+ MPSoCs

Stars: ✭ 258 (+545%)

Mutual labels: dnn

Dnn.platform

DNN (formerly DotNetNuke) is the leading open source web content management platform (CMS) in the Microsoft ecosystem.

Stars: ✭ 798 (+1895%)

Mutual labels: dnn

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+992.5%)

Mutual labels: tts

Credit-Card-Fraud

No description or website provided.

Stars: ✭ 17 (-57.5%)

Mutual labels: dnn

voice-conversion

an tutorial implement of voice conversion using pytorch

Stars: ✭ 26 (-35%)

Mutual labels: dnn

Transformer Tts

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Stars: ✭ 418 (+945%)

Mutual labels: tts

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-30%)

Mutual labels: tts

crowdsource-video-experiments-on-android

Crowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:

Stars: ✭ 29 (-27.5%)

Mutual labels: dnn

Jsut Lab

HTS-style full-context labels for JSUT v1.1

Stars: ✭ 28 (-30%)

Mutual labels: tts

Ailab

Experience, Learn and Code the latest breakthrough innovations with Microsoft AI

Stars: ✭ 6,896 (+17140%)

Mutual labels: dnn

Gocv

Go package for computer vision using OpenCV 4 and beyond.