Comprehensive-Tacotron2PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Stars: ✭ 22 (-96.28%)
VSGANVapourSynth Single Image Super-Resolution Generative Adversarial Network (GAN)
Stars: ✭ 124 (-79.05%)
Hifi GanHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Stars: ✭ 325 (-45.1%)
Pytorch GansMy implementation of various GAN (generative adversarial networks) architectures like vanilla GAN (Goodfellow et al.), cGAN (Mirza et al.), DCGAN (Radford et al.), etc.
Stars: ✭ 271 (-54.22%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+1345.95%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-40.2%)
EmotionalConversionStarGANThis repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Stars: ✭ 92 (-84.46%)
GansformerGenerative Adversarial Transformers
Stars: ✭ 421 (-28.89%)
Deep-LearningIt contains the coursework and the practice I have done while learning Deep Learning.🚀 👨💻💥 🚩🌈
Stars: ✭ 21 (-96.45%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-49.83%)
Generative models tutorial with demoGenerative Models Tutorial with Demo: Bayesian Classifier Sampling, Variational Auto Encoder (VAE), Generative Adversial Networks (GANs), Popular GANs Architectures, Auto-Regressive Models, Important Generative Model Papers, Courses, etc..
Stars: ✭ 276 (-53.38%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (-87.67%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+665.71%)
Mimicry[CVPR 2020 Workshop] A PyTorch GAN library that reproduces research results for popular GANs.
Stars: ✭ 458 (-22.64%)
voice-conversionan tutorial implement of voice conversion using pytorch
Stars: ✭ 26 (-95.61%)
EspeakeSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: ✭ 339 (-42.74%)
EigenGAN-TensorflowEigenGAN: Layer-Wise Eigen-Learning for GANs (ICCV 2021)
Stars: ✭ 294 (-50.34%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-17.23%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-47.3%)
skip-thought-ganGenerating Text through Adversarial Training(GAN) using Skip-Thought Vectors
Stars: ✭ 44 (-92.57%)
Fast SrganA Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps
Stars: ✭ 417 (-29.56%)
Text To Image SynthesisPytorch implementation of Generative Adversarial Text-to-Image Synthesis paper
Stars: ✭ 288 (-51.35%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-52.87%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-91.22%)
SdvSynthetic Data Generation for tabular, relational and time series data.
Stars: ✭ 360 (-39.19%)
GanttsPyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Stars: ✭ 460 (-22.3%)
Fewshot gan Unet3dTensorflow implementation of our paper: Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning
Stars: ✭ 272 (-54.05%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (-38.85%)
3d Sdn[NeurIPS 2018] 3D-Aware Scene Manipulation via Inverse Graphics
Stars: ✭ 256 (-56.76%)
Von[NeurIPS 2018] Visual Object Networks: Image Generation with Disentangled 3D Representation.
Stars: ✭ 497 (-16.05%)
few-shot-gan-adaptation[CVPR '21] Official repository for Few-shot Image Generation via Cross-domain Correspondence
Stars: ✭ 198 (-66.55%)
AttentionganAttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation
Stars: ✭ 341 (-42.4%)
esp32-fliteSpeech synthesis running on ESP32 based on Flite engine.
Stars: ✭ 28 (-95.27%)
RewritingRewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
Stars: ✭ 454 (-23.31%)
HistoGANReference code for the paper HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms (CVPR 2021).
Stars: ✭ 158 (-73.31%)
Multilingual text to speechAn implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (-45.27%)
DeepDreamGenerative deep learning: DeepDream
Stars: ✭ 17 (-97.13%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-8.45%)
DLSSDeep Learning Super Sampling with Deep Convolutional Generative Adversarial Networks.
Stars: ✭ 88 (-85.14%)
Shapematchinggan[ICCV 2019, Oral] Controllable Artistic Text Style Transfer via Shape-Matching GAN
Stars: ✭ 315 (-46.79%)
SprocketVoice Conversion Tool Kit
Stars: ✭ 425 (-28.21%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-87.5%)
NnmnkwiiLibrary to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-47.97%)
PaintsTensorFlowline drawing colorization using TensorFlow
Stars: ✭ 47 (-92.06%)
AutovcAutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Stars: ✭ 485 (-18.07%)
gans-collection.torchTorch implementation of various types of GAN (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN, LSGAN)
Stars: ✭ 53 (-91.05%)
Pix2depthDEPRECATED: Depth Map Estimation from Monocular Images
Stars: ✭ 293 (-50.51%)
AvatarGANGenerate Cartoon Images using Generative Adversarial Network
Stars: ✭ 24 (-95.95%)
Anycost Gan[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing
Stars: ✭ 367 (-38.01%)
Glow TtsA Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: ✭ 284 (-52.03%)
FlowtronFlowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Stars: ✭ 546 (-7.77%)
TermitTranslations with speech synthesis in your terminal as a ruby gem
Stars: ✭ 505 (-14.7%)
Tf.gans ComparisonImplementations of (theoretical) generative adversarial networks and comparison without cherry-picking
Stars: ✭ 477 (-19.43%)
Selectiongan[CVPR 2019 Oral] Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation
Stars: ✭ 366 (-38.18%)
Faceswap GanA denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
Stars: ✭ 3,099 (+423.48%)