All Projects → Diffwave → Similar Projects or Alternatives

885 Open source projects that are alternatives of or similar to Diffwave

Conditional Pixelcnn Decoder
Tensorflow implementation of Gated Conditional Pixel Convolutional Neural Network
Stars: ✭ 479 (+244.6%)
Mutual labels:  paper
ScriptBlockPlus
任意のブロックにスクリプトを追加するプラグインです。
Stars: ✭ 25 (-82.01%)
Mutual labels:  paper
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-26.62%)
Mutual labels:  text-to-speech
atari-demo
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
Stars: ✭ 21 (-84.89%)
Mutual labels:  paper
Iaf
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
Stars: ✭ 468 (+236.69%)
Mutual labels:  paper
FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
Stars: ✭ 163 (+17.27%)
Mutual labels:  text-to-speech
Voicenet
Speech synthesis platform based on tensorflow and sonnet
Stars: ✭ 60 (-56.83%)
Mutual labels:  text-to-speech
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+9878.42%)
Mutual labels:  speech
Mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Stars: ✭ 4,713 (+3290.65%)
Mutual labels:  pretrained-models
Deepvoice3 pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Stars: ✭ 1,654 (+1089.93%)
Mutual labels:  speech-synthesis
safety-gear-detector-python
Observe workers as they pass in front of a camera to determine if they have adequate safety protection.
Stars: ✭ 54 (-61.15%)
Mutual labels:  pretrained-models
Read Aloud
An awesome browser extension that reads aloud webpage content with one click
Stars: ✭ 444 (+219.42%)
Mutual labels:  text-to-speech
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-41.01%)
Mutual labels:  speech
Dips
NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation
Stars: ✭ 59 (-57.55%)
Mutual labels:  paper
Cvpr2021 Papers With Code
CVPR 2021 论文和开源项目合集
Stars: ✭ 7,138 (+5035.25%)
Mutual labels:  paper
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-74.82%)
Mutual labels:  speech
Gate
A high performant & paralleled Minecraft proxy server with scalability, flexibility & excellent server version support - ready for the cloud!
Stars: ✭ 102 (-26.62%)
Mutual labels:  paper
Sparsely Grouped Gan
Code for paper "Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation"
Stars: ✭ 68 (-51.08%)
Mutual labels:  paper
Bert paper chinese translation
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 论文的中文翻译 Chinese Translation!
Stars: ✭ 564 (+305.76%)
Mutual labels:  paper
LayeredSceneDecomposition
No description or website provided.
Stars: ✭ 22 (-84.17%)
Mutual labels:  paper
Tensorflow-YOLACT
Implementation of the paper "YOLACT Real-time Instance Segmentation" in Tensorflow 2
Stars: ✭ 97 (-30.22%)
Mutual labels:  paper
Research Method
论文写作与资料分享
Stars: ✭ 436 (+213.67%)
Mutual labels:  paper
Pytorch Classification Uncertainty
This repo contains a PyTorch implementation of the paper: "Evidential Deep Learning to Quantify Classification Uncertainty"
Stars: ✭ 59 (-57.55%)
Mutual labels:  paper
Sprocket
Voice Conversion Tool Kit
Stars: ✭ 425 (+205.76%)
Mutual labels:  speech-synthesis
Vonage Python Sdk
Vonage Server SDK for Python. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 134 (-3.6%)
Mutual labels:  text-to-speech
msla2014
wherein I implement several substructural logics in Agda
Stars: ✭ 24 (-82.73%)
Mutual labels:  paper
voices
macOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.
Stars: ✭ 53 (-61.87%)
Mutual labels:  text-to-speech
Knowledge Distillation Papers
knowledge distillation papers
Stars: ✭ 422 (+203.6%)
Mutual labels:  paper
Pytorch Image Models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Stars: ✭ 15,232 (+10858.27%)
Mutual labels:  pretrained-models
Self Driving Car In Video Games
A deep neural network that learns to drive in video games
Stars: ✭ 559 (+302.16%)
Mutual labels:  pretrained-models
mimic2
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (+286.33%)
Mutual labels:  speech-synthesis
gan-vae-pretrained-pytorch
Pretrained GANs + VAEs + classifiers for MNIST/CIFAR in pytorch.
Stars: ✭ 134 (-3.6%)
Mutual labels:  pretrained-models
Neural Backed Decision Trees
Making decision trees competitive with neural networks on CIFAR10, CIFAR100, TinyImagenet200, Imagenet
Stars: ✭ 411 (+195.68%)
Mutual labels:  pretrained-models
BIRADS classifier
High-resolution breast cancer screening with multi-view deep convolutional neural networks
Stars: ✭ 122 (-12.23%)
Mutual labels:  pretrained-models
Library
Collection of papers in the field of distributed systems, game theory, cryptography, cryptoeconomics, zero knowledge
Stars: ✭ 100 (-28.06%)
Mutual labels:  paper
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (-64.03%)
Mutual labels:  speech
Yatopia
The Most Powerful and Feature Rich Minecraft Server Software!
Stars: ✭ 408 (+193.53%)
Mutual labels:  paper
Hotpur
A fork of Purpur that aims to improve performance and add FabricMC compatibility.
Stars: ✭ 17 (-87.77%)
Mutual labels:  paper
Sound Source Localization Algorithm doa estimation
关于语音信号声源定位DOA估计所用的一些传统算法
Stars: ✭ 58 (-58.27%)
Mutual labels:  speech
AutoInAgda
Proof automation – for Agda, in Agda.
Stars: ✭ 38 (-72.66%)
Mutual labels:  paper
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+182.73%)
Mutual labels:  speech
cerberus research
Research tools for analysing Cerberus banking trojan.
Stars: ✭ 110 (-20.86%)
Mutual labels:  paper
Mdm
A TensorFlow implementation of the Mnemonic Descent Method.
Stars: ✭ 120 (-13.67%)
Mutual labels:  pretrained-models
Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (+176.26%)
Mutual labels:  speech
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (-10.79%)
Mutual labels:  speech
AdversarialAudioSeparation
Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"
Stars: ✭ 70 (-49.64%)
Mutual labels:  paper
Cv paperdaily
CV 论文笔记
Stars: ✭ 555 (+299.28%)
Mutual labels:  paper
pigallery
PiGallery: AI-powered Self-hosted Secure Multi-user Image Gallery and Detailed Image analysis using Machine Learning, EXIF Parsing and Geo Tagging
Stars: ✭ 35 (-74.82%)
Mutual labels:  pretrained-models
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (-67.63%)
Mutual labels:  speech
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-58.99%)
Mutual labels:  speech
paper-terminal
Print Markdown to a paper in your terminal
Stars: ✭ 33 (-76.26%)
Mutual labels:  paper
Ipfs
Peer-to-peer hypermedia protocol
Stars: ✭ 20,128 (+14380.58%)
Mutual labels:  paper
EagerMOT
Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]
Stars: ✭ 249 (+79.14%)
Mutual labels:  paper
Research And Coding
研究资源列表 A curated list of research resources
Stars: ✭ 100 (-28.06%)
Mutual labels:  paper
Imitation
Code for the paper "Generative Adversarial Imitation Learning"
Stars: ✭ 555 (+299.28%)
Mutual labels:  paper
Cross-View-Gait-Based-Human-Identification-with-Deep-CNNs
Code for 2016 TPAMI(IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE) A Comprehensive Study on Cross-View Gait Based Human Identification with Deep CNNs
Stars: ✭ 21 (-84.89%)
Mutual labels:  paper
spoken-word
Spoken Word
Stars: ✭ 46 (-66.91%)
Mutual labels:  speech-synthesis
Distributedsystems
My Distributed Systems references
Stars: ✭ 67 (-51.8%)
Mutual labels:  paper
Nodejs Speech
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+292.09%)
Mutual labels:  speech
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-31.65%)
Mutual labels:  speech
541-600 of 885 similar projects