VQMIVCOfficial implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+80.52%)
voice-conversionan tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-83.12%)
SingleVCAny-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.
Stars: ✭ 25 (-83.77%)
MediumVCAny-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (-70.13%)
YourTTSYourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Stars: ✭ 217 (+40.91%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+2843.51%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (-9.74%)
QPPWGQuasi-Periodic Parallel WaveGAN Pytorch implementation
Stars: ✭ 41 (-73.38%)
audioslides.ioUse Amazon Polly, Google Slides and FFMpeg to create videos that can be updated at anytime by anyone. This project is written in Elixir.
Stars: ✭ 19 (-87.66%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-57.79%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+446.1%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-67.53%)
wiki2ssmlWiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Stars: ✭ 31 (-79.87%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-83.77%)
sova-tts-tpsNLP-preprocessor for the SOVA-TTS project
Stars: ✭ 44 (-71.43%)
vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Stars: ✭ 1,604 (+941.56%)
melganMelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-64.94%)
MixPathMixPath: A Unified Approach for One-shot Neural Architecture Search
Stars: ✭ 29 (-81.17%)
VAENAR-TTSPyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Stars: ✭ 66 (-57.14%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-81.17%)
GlottDNNGlottDNN vocoder and tools for training DNN excitation models
Stars: ✭ 30 (-80.52%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-88.31%)
samSoftware Automatic Mouth - Tiny Speech Synthesizer
Stars: ✭ 316 (+105.19%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-45.45%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+1448.05%)
systoleSystole: A python package for cardiac signal synchrony and analysis
Stars: ✭ 51 (-66.88%)
ttsflowtensorflow speech synthesis c++ inference for voicenet
Stars: ✭ 17 (-88.96%)
CVCCVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (-70.78%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-77.27%)
Daft-ExprtPyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Stars: ✭ 41 (-73.38%)
CGCF-ConfGen🧪 Learning Neural Generative Dynamics for Molecular Conformation Generation (ICLR 2021)
Stars: ✭ 41 (-73.38%)
AFE4490 OximeterThis pulse oximetry shield from ProtoCentral uses the AFE4490 IC to enable your Arduino to measure heart rate as well as SpO2 values.
Stars: ✭ 39 (-74.68%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+196.1%)
JD-NMFJoint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-87.01%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-85.71%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-88.31%)
Catch-A-WaveformOfficial pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Stars: ✭ 117 (-24.03%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+33.12%)
UpcharikaA unique flutter application aimed at helping people getting their vitals using Photoplethysmography and Computer Vision
Stars: ✭ 37 (-75.97%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (-32.47%)
NanoFlowPyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
Stars: ✭ 63 (-59.09%)
PPGCode to estimate HR from PPG signals using Subspace Decomposition and Kalman filter for the dataset of 22 PPG recordings provided for the 2015 IEEE Signal Processing Cup (SP Cup) competition. The traces are stored in folder 'DATABASE'. Please cite this publication when referencing this material: "Measuring Heart Rate During Physical Exercise by …
Stars: ✭ 43 (-72.08%)
sova-tts-engineTacotron2 based engine for the SOVA-TTS project
Stars: ✭ 63 (-59.09%)
few-shot-transformer-ttsByte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Stars: ✭ 60 (-61.04%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+91.56%)
deep-learning-german-ttsThorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Stars: ✭ 268 (+74.03%)
WavegradImplementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+59.09%)
voderAn emulation of the Voder Speech Synthesizer.
Stars: ✭ 19 (-87.66%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-82.47%)
tacotron2Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
Stars: ✭ 17 (-88.96%)
Cross-Speaker-Emotion-TransferPyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stars: ✭ 107 (-30.52%)
Tacotron pytorchPyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+57.14%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-78.57%)
NormitTranslations with speech synthesis in your terminal as a node package
Stars: ✭ 219 (+42.21%)
TacotronA TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Stars: ✭ 2,581 (+1575.97%)
TensorVoxDesktop application for neural speech synthesis written in C++
Stars: ✭ 140 (-9.09%)