All Projects → huawei-noah → Speech-Backbones

huawei-noah / Speech-Backbones

Licence: other
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Speech-Backbones

speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-91.22%)
Mutual labels:  speech-synthesis, speech-recognition, speech-processing
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-74.15%)
Mutual labels:  speech-synthesis, speech-recognition, speech-processing
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+310.24%)
Mutual labels:  speech-synthesis, speech-recognition, speech-processing
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-86.83%)
Mutual labels:  speech-synthesis, speech-recognition, speech-processing
Neural Voice Cloning With Few Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Stars: ✭ 211 (+2.93%)
Mutual labels:  speech-synthesis, speech-processing
Lingvo
Lingvo
Stars: ✭ 2,361 (+1051.71%)
Mutual labels:  speech-synthesis, speech-recognition
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-59.02%)
Mutual labels:  speech-synthesis, speech-recognition
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-54.15%)
Mutual labels:  speech-recognition, speech-processing
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+43.9%)
Mutual labels:  speech-synthesis, speech-processing
torchsubband
Pytorch implementation of subband decomposition
Stars: ✭ 63 (-69.27%)
Mutual labels:  speech-recognition, speech-processing
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-82.93%)
Mutual labels:  speech-synthesis, speech-recognition
Khronos
The open source intelligent personal assistant
Stars: ✭ 25 (-87.8%)
Mutual labels:  speech-synthesis, speech-recognition
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-16.59%)
Mutual labels:  speech-synthesis, speech-recognition
Vocgan
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Stars: ✭ 158 (-22.93%)
Mutual labels:  speech-synthesis, speech-processing
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-65.37%)
Mutual labels:  speech-recognition, speech-processing
Wavenet vocoder
WaveNet vocoder
Stars: ✭ 1,926 (+839.51%)
Mutual labels:  speech-synthesis, speech-processing
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-35.12%)
Mutual labels:  speech-synthesis, speech-recognition
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+636.1%)
Mutual labels:  speech-synthesis, speech-recognition
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+706.83%)
Mutual labels:  speech-synthesis, speech-processing
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (-26.83%)
Mutual labels:  speech-recognition, speech-processing

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Grad-TTS

Official implementation of the Grad-TTS model based on Diffusion Probabilistic Modelling. For all details check out our paper accepted to ICML 2021 via this link.

Authors: Vadim Popov*, Ivan Vovk*, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov.

*Equal contribution.

SPIRAL

Official implementation of SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training. For all details check out our paper accepted to ICLR 2022 via this link.

Authors: Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu.

DiffVC

Official implementation of the paper "Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme" (ICLR 2022, Oral). Link.

Authors: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].