IL

Imitation learning algorithms (with PPO [1]):

~~ABC [2]~~
AIRL [3]
BC [4]
DRIL [5]
FAIRL [6]
GAIL [7]
GMMIL [8]
~~PWIL [9]~~
RED [10]

python main.py --imitation [AIRL|BC|DRIL|FAIRL|GAIL|GMMIL|RED]
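The bracketed list in the command above means that exactly one algorithm name is passed to --imitation per run. As a minimal sketch (the helper build_command and the list name ALGORITHMS are hypothetical and not part of this repository), the concrete invocations can be generated like this:

```python
import shlex

# Algorithm names copied from the usage line; ALGORITHMS is a hypothetical name.
ALGORITHMS = ["AIRL", "BC", "DRIL", "FAIRL", "GAIL", "GMMIL", "RED"]


def build_command(algorithm: str) -> str:
    """Return the shell command that selects one imitation learning algorithm."""
    if algorithm not in ALGORITHMS:
        raise ValueError(f"unknown algorithm: {algorithm}")
    # shlex.join quotes arguments safely (Python 3.8+).
    return shlex.join(["python", "main.py", "--imitation", algorithm])


# One command per algorithm, in the order listed in the usage line.
commands = [build_command(name) for name in ALGORITHMS]

if __name__ == "__main__":
    for command in commands:
        print(command)
```

Each printed line is a single run; the script only builds the command strings and does not launch any training.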

Options include:

State-only imitation learning: --state-only
Absorbing state indicator [11]: --absorbing
R1 gradient regularisation [12]: --r1-reg-coeff 1 (default)
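To illustrate what the absorbing state indicator [11] refers to, here is a minimal sketch of the general idea, not this repository's implementation: every state gets one extra flag dimension, and episodes that end early are followed by an all-zero state whose flag is set to 1. Padding to a fixed horizon is an assumption made for this example, and the function names are hypothetical.

```python
from typing import List, Sequence


def add_indicator(state: Sequence[float]) -> List[float]:
    """Append a 0 flag to a regular (non-absorbing) environment state."""
    return list(state) + [0.0]


def absorbing_state(state_dim: int) -> List[float]:
    """Return the absorbing state: all zeros with the indicator flag set to 1."""
    return [0.0] * state_dim + [1.0]


def pad_episode(states: Sequence[Sequence[float]], horizon: int) -> List[List[float]]:
    """Augment each state with the flag, then pad with absorbing states up to `horizon`.

    Assumes `states` is non-empty; episodes already at or beyond `horizon` get no padding.
    """
    state_dim = len(states[0])
    augmented = [add_indicator(s) for s in states]
    missing = max(0, horizon - len(augmented))
    augmented.extend(absorbing_state(state_dim) for _ in range(missing))
    return augmented


if __name__ == "__main__":
    # A 2-step episode with 2-dimensional states, padded to a horizon of 4.
    for row in pad_episode([[0.5, -1.0], [0.2, 0.3]], horizon=4):
        print(row)
```

With the flag in place, a reward function or discriminator can assign an explicit value to the absorbing state instead of implicitly treating episode termination as zero reward.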

Results

[Figures: train and test plots for PPO, AIRL, BC, DRIL, FAIRL, GAIL, GMMIL and RED]

Acknowledgements

@ikostrikov for https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail

References

[1] Proximal Policy Optimization Algorithms
[2] Adversarial Behavioral Cloning
[3] Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
[4] Efficient Training of Artificial Neural Networks for Autonomous Navigation
[5] Disagreement-Regularized Imitation Learning
[6] A Divergence Minimization Perspective on Imitation Learning Methods
[7] Generative Adversarial Imitation Learning
[8] Imitation Learning via Kernel Mean Embedding
[9] Primal Wasserstein Imitation Learning
[10] Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
[11] Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
[12] Which Training Methods for GANs do actually Converge?
