Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (-84.68%)

Mutual labels: speech, speech-synthesis, speech-processing

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (-89.04%)

Mutual labels: speech, speech-synthesis, speech-processing

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (-84.01%)

Mutual labels: speech-synthesis, speech-processing

Lingvo

Stars: ✭ 2,361 (+22.59%)

Mutual labels: speech, speech-synthesis

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (-15.06%)

Mutual labels: speech-synthesis, neural-vocoder

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-81.2%)

Mutual labels: speech, speech-synthesis

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (-56.33%)

Mutual labels: speech-synthesis, speech-processing

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (-90.55%)

Mutual labels: speech, speech-processing

Tacotron 2

DeepMind's Tacotron-2 Tensorflow implementation

Stars: ✭ 1,968 (+2.18%)

Mutual labels: speech-synthesis, wavenet

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-95.64%)

Mutual labels: speech, speech-synthesis

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (-87.28%)

Mutual labels: speech, speech-synthesis

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-97.25%)

Mutual labels: speech-synthesis, speech-processing

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-98.29%)

Mutual labels: speech, speech-synthesis

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-98.55%)

Mutual labels: speech, speech-processing

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-96.63%)

Mutual labels: speech, speech-synthesis

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-99.07%)

Mutual labels: speech-synthesis, speech-processing

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-92.78%)

Mutual labels: speech, speech-synthesis

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-88.37%)

Mutual labels: speech, speech-processing

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (-96.52%)

Mutual labels: speech, speech-synthesis

Wavenet Enhancement

Speech Enhancement using Bayesian WaveNet

Stars: ✭ 86 (-95.53%)

Mutual labels: speech, wavenet

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-94.24%)

Mutual labels: speech, speech-synthesis

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-97.3%)

Mutual labels: speech, speech-synthesis

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-99.01%)

Mutual labels: speech, speech-synthesis

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-96.21%)

Mutual labels: speech, speech-synthesis

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-96.16%)

Mutual labels: speech, speech-synthesis

Vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Stars: ✭ 158 (-91.8%)

Mutual labels: speech-synthesis, speech-processing

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-92.83%)

Mutual labels: speech, speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-74.56%)

Mutual labels: speech, speech-synthesis

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (-87.44%)

Mutual labels: speech, speech-processing

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (-87.44%)

Mutual labels: speech, speech-synthesis

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (-64.59%)

Mutual labels: speech-synthesis, wavenet

Lightspeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-98.39%)

Mutual labels: speech, speech-synthesis

Gcc Nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

Stars: ✭ 231 (-88.01%)

Mutual labels: speech, speech-processing

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (-91.64%)

Mutual labels: speech, speech-synthesis

Shifter

Pitch shifter using WSOLA and resampling implemented by Python3

Stars: ✭ 22 (-98.86%)

Mutual labels: speech, speech-processing

Tf Wavenet vocoder

Wavenet and its applications with Tensorflow

Stars: ✭ 58 (-96.99%)

Mutual labels: speech-synthesis, wavenet

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-98.6%)

Mutual labels: speech-synthesis, speech-processing

melgan

MelGAN implementation with Multi-Band and Full Band supports...

Stars: ✭ 54 (-97.2%)

Mutual labels: speech, speech-synthesis

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (-14.12%)

Mutual labels: speech-synthesis, speech-processing

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-89.36%)

Mutual labels: speech-synthesis, speech-processing

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (-94.39%)

Mutual labels: speech, speech-synthesis

Pytorchwavenetvocoder

WaveNet-Vocoder implementation with pytorch.

Stars: ✭ 269 (-86.03%)

Mutual labels: speech-synthesis, wavenet

Wsay

Windows "say"

Stars: ✭ 36 (-98.13%)

Mutual labels: speech, speech-synthesis

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-94.03%)

Mutual labels: speech, speech-processing

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-93.35%)

Mutual labels: speech

Pytorch Gan Timeseries

GANs for time series generation in pytorch

Stars: ✭ 109 (-94.34%)

Mutual labels: wavenet

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (-21.65%)

Mutual labels: speech-synthesis

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Stars: ✭ 136 (-92.94%)

Mutual labels: speech-synthesis

Pb bss

Collection of EM algorithms for blind source separation of audio signals

Stars: ✭ 127 (-93.41%)

Mutual labels: speech-processing

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (-94.39%)

Mutual labels: speech-synthesis

Numpy Ml

Machine learning, in numpy

Stars: ✭ 11,100 (+476.32%)

Mutual labels: wavenet

Reconstructing faces from voices

An example of the paper "reconstructing faces from voices"

Stars: ✭ 127 (-93.41%)

Mutual labels: speech

Python Speech recognition

A simple example for use speech recognition baidu api with python.

Stars: ✭ 106 (-94.5%)

Mutual labels: speech

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-92.99%)

Mutual labels: speech

1-60 of 373 similar projects

›

next*5