HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Stars: ✭ 228 (+470%)

Mutual labels: policy-gradient

GANs-Keras

GANs Implementations in Keras

Stars: ✭ 24 (-40%)

Mutual labels: gan

wgan-gp

PyTorch implementation of Wasserstein GANs with Gradient Penalty

Stars: ✭ 161 (+302.5%)

Mutual labels: gan

ZSL-ADA

Code accompanying the paper "A Generative Framework for Zero Shot Learning with Adversarial Domain Adaptation"

Stars: ✭ 18 (-55%)

Mutual labels: gan

scrabble-gan

Adversarial Generation of Handwritten Text Images

Stars: ✭ 49 (+22.5%)

Mutual labels: gan

CoMoGAN

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

Stars: ✭ 139 (+247.5%)

Mutual labels: gan

td-reg

TD-Regularized Actor-Critic Methods

Stars: ✭ 28 (-30%)

Mutual labels: policy-gradient

PerceptualGAN

PyTorch implementation of the paper "Image Manipulation with Perceptual Discriminators"

Stars: ✭ 119 (+197.5%)

Mutual labels: gan

Reinforcement Learning

Deep Reinforcement Learning Algorithms implemented with TensorFlow 2.3

Stars: ✭ 61 (+52.5%)

Mutual labels: policy-gradient

Paddle-RLBooks

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Stars: ✭ 113 (+182.5%)

Mutual labels: policy-gradient

Deep-rl-mxnet

MXNet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, and PPO

Stars: ✭ 26 (-35%)

Mutual labels: policy-gradient

pix2pix-tensorflow

A minimal TensorFlow implementation of pix2pix (Image-to-Image Translation with Conditional Adversarial Nets - https://phillipi.github.io/pix2pix/).

Stars: ✭ 22 (-45%)

Mutual labels: gan

Semantic Pyramid for Image Generation

PyTorch reimplementation of the paper: "Semantic Pyramid for Image Generation" [CVPR 2020].

Stars: ✭ 45 (+12.5%)

Mutual labels: gan

SeqGAN-PyTorch

An implementation of SeqGAN (Paper: SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient) in PyTorch. The code performs the experiment on synthetic data as described in the paper.
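As a rough illustration of the policy-gradient idea named in the paper title, the generator can be updated with a REINFORCE-style loss in which each sampled token's log-probability is weighted by a reward. The sketch below uses plain Python lists rather than PyTorch tensors. The function names `sequence_pg_loss` and `batch_pg_loss` are hypothetical, and the assumption that rewards come from the discriminator is stated in the comments; none of this is taken from main.py.

```python
import math


def sequence_pg_loss(token_probs, rewards):
    """REINFORCE-style loss for one sampled sequence.

    token_probs[t] is the probability the generator assigned to the token it
    sampled at step t; rewards[t] is the reward for that step (assumed here to
    come from the discriminator, e.g. via rollouts).
    """
    if len(token_probs) != len(rewards):
        raise ValueError("token_probs and rewards must have the same length")
    # Minimizing -sum(log p * r) raises the probability of well-rewarded tokens.
    return -sum(math.log(p) * r for p, r in zip(token_probs, rewards))


def batch_pg_loss(batch_probs, batch_rewards):
    """Average the per-sequence loss over a batch of sampled sequences."""
    losses = [sequence_pg_loss(p, r) for p, r in zip(batch_probs, batch_rewards)]
    return sum(losses) / len(losses)
```

With zero rewards the loss is zero, so tokens that the discriminator rejects contribute no gradient signal in this simplified form.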

Usage

$ python main.py

Please refer to main.py for supported arguments. You can also change model parameters there.

Dependency

PyTorch 0.4.0+ (1.0 ready)
Python 3.5+
CUDA 8.0+ & cuDNN (For GPU)
numpy
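
The requirements above can be checked before running main.py. This is a minimal sketch, not part of the repository: the helper names `major_minor` and `check_requirements` are hypothetical, it relies on `importlib.metadata` (so the check itself needs Python 3.8 or newer), and it does not inspect CUDA or cuDNN.

```python
import re
import sys
from importlib import metadata  # standard library from Python 3.8 onward


def major_minor(version):
    """Extract (major, minor) from a version string such as '1.13.1+cu117'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    return (int(match.group(1)), int(match.group(2))) if match else (0, 0)


def check_requirements():
    """Map each listed requirement to True/False for the current interpreter."""
    report = {"Python 3.5+": sys.version_info >= (3, 5)}
    for label, package, minimum in (
        ("PyTorch 0.4.0+", "torch", (0, 4)),
        ("numpy", "numpy", (0, 0)),  # no minimum version is listed for numpy
    ):
        try:
            installed = metadata.version(package)
        except metadata.PackageNotFoundError:
            report[label] = False  # package is not installed
        else:
            report[label] = major_minor(installed) >= minimum
    return report


if __name__ == "__main__":
    for label, ok in check_requirements().items():
        print(label, "ok" if ok else "missing or too old")
```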

Hacks and Observations

Use Adam for the generator and SGD for the discriminator.
The discriminator should be neither too strong (it then fails to provide useful feedback) nor too weak (it guesses randomly and cannot guide generation).
The GAN phase does not always lead to a large drop in NLL; sometimes the drop is minimal, or the NLL even increases.
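
One way to act on the observation about discriminator strength is to monitor the discriminator's accuracy and adjust how many discriminator updates run in the next round. The following is only a sketch under assumed thresholds: `discriminator_accuracy`, `discriminator_updates`, and the 0.55 to 0.90 accuracy band are hypothetical and are not part of this repository.

```python
def discriminator_accuracy(real_probs, fake_probs):
    """Fraction of samples the discriminator classifies correctly.

    real_probs and fake_probs hold the predicted probability of "real" for
    real and generated samples respectively.
    """
    correct = sum(p >= 0.5 for p in real_probs) + sum(p < 0.5 for p in fake_probs)
    return correct / (len(real_probs) + len(fake_probs))


def discriminator_updates(accuracy, base_updates=1, low=0.55, high=0.90):
    """Pick the number of discriminator updates for the next round.

    The thresholds are illustrative assumptions, not tuned values.
    """
    if accuracy > high:
        return 0  # too strong: skip updates so the generator can catch up
    if accuracy < low:
        return 2 * base_updates  # close to random guessing: train it more
    return base_updates
```

For example, `discriminator_updates(discriminator_accuracy(real_probs, fake_probs))` could be evaluated once per adversarial round to decide how long to train the discriminator.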

Sample Learning Curve

Learning curve of the generator, obtained after MLE training for 120 steps (1 epoch per step) followed by adversarial training for 150 rounds (1 epoch per round).

Learning curve of the discriminator, obtained after MLE training for 50 steps (3 epochs per step) followed by adversarial training for 150 rounds (9 epochs per round).

Acknowledgement

This code is based on Zhao Zijian's SeqGAN-PyTorch, Surag Nair's SeqGAN, and Lantao Yu's original implementation in TensorFlow. Many thanks to Zhao Zijian, Surag Nair, and Lantao Yu!


X-czh / SeqGAN-PyTorch
