Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+43.9%)

Mutual labels: speech-synthesis, speech-processing

torchsubband

Pytorch implementation of subband decomposition

Stars: ✭ 63 (-69.27%)

Mutual labels: speech-recognition, speech-processing

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-82.93%)

Mutual labels: speech-synthesis, speech-recognition

Khronos

The open source intelligent personal assistant

Stars: ✭ 25 (-87.8%)

Mutual labels: speech-synthesis, speech-recognition

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-16.59%)

Mutual labels: speech-synthesis, speech-recognition

Vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Stars: ✭ 158 (-22.93%)

Mutual labels: speech-synthesis, speech-processing

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Stars: ✭ 71 (-65.37%)

Mutual labels: speech-recognition, speech-processing

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+839.51%)

Mutual labels: speech-synthesis, speech-processing

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-35.12%)

Mutual labels: speech-synthesis, speech-recognition

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+636.1%)

Mutual labels: speech-synthesis, speech-recognition

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+706.83%)

Mutual labels: speech-synthesis, speech-processing

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

Stars: ✭ 150 (-26.83%)

Mutual labels: speech-recognition, speech-processing

View All Similar Projects ➔

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Grad-TTS

Official implementation of the Grad-TTS model based on Diffusion Probabilistic Modelling. For all details check out our paper accepted to ICML 2021 via this link.

Authors: Vadim Popov*, Ivan Vovk*, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov.

^{*Equal contribution.}

SPIRAL

Official implementation of SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training. For all details check out our paper accepted to ICLR 2022 via this link.

Authors: Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu.

DiffVC

Official implementation of the paper "Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme" (ICLR 2022, Oral). Link.

Authors: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

huawei-noah / Speech-Backbones

Programming Languages

Labels

Projects that are alternatives of or similar to Speech-Backbones

Speech-Backbones

Grad-TTS

SPIRAL

DiffVC