Kaixhin / Imitation Learning

Licence: MIT
Imitation learning algorithms

Programming Languages

python

IL

MIT License

Imitation learning algorithms (with PPO [1]):

python main.py --imitation [AIRL|BC|DRIL|FAIRL|GAIL|GMMIL|RED]

Options include:

  • State-only imitation learning: --state-only
  • Absorbing state indicator [11]: --absorbing
  • R1 gradient regularisation [12]: --r1-reg-coeff 1 (default)
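
As a rough illustration of the first two options, the sketch below shows one common way to add an absorbing state indicator to an episode (following the idea in [11]) and to build a state-only discriminator input. It is not the repository's code: the helper names `augment`, `absorbing_state`, `wrap_episode` and `discriminator_input` are hypothetical, and using a zero action for the absorbing self-transition is an assumption.

```python
from typing import List, Sequence, Tuple

# (state, action, next_state), each stored as a flat list of floats
Transition = Tuple[List[float], List[float], List[float]]


def augment(state: Sequence[float]) -> List[float]:
    """Append an indicator of 0 to mark an ordinary (non-absorbing) state."""
    return list(state) + [0.0]


def absorbing_state(state_dim: int) -> List[float]:
    """Absorbing state: all-zero features with the indicator set to 1."""
    return [0.0] * state_dim + [1.0]


def wrap_episode(
    states: Sequence[Sequence[float]],
    actions: Sequence[Sequence[float]],
    terminated: bool,
) -> List[Transition]:
    """Turn an episode with T actions and T + 1 states into augmented transitions.

    If the episode truly terminated (rather than hitting a time limit), the
    last transition leads into the absorbing state, and one absorbing
    self-transition is appended so a reward can be learned for it.
    """
    if not actions or len(states) != len(actions) + 1:
        raise ValueError("expected T >= 1 actions and T + 1 states")
    absorb = absorbing_state(len(states[0]))
    zero_action = [0.0] * len(actions[0])  # assumption: no-op action when absorbing
    transitions: List[Transition] = []
    for t, action in enumerate(actions):
        is_last = t == len(actions) - 1
        next_state = absorb if (is_last and terminated) else augment(states[t + 1])
        transitions.append((augment(states[t]), list(action), next_state))
    if terminated:
        transitions.append((absorb, zero_action, absorb))
    return transitions


def discriminator_input(
    state: Sequence[float], action: Sequence[float], state_only: bool = False
) -> List[float]:
    """State-only imitation drops the action from the discriminator input."""
    return list(state) if state_only else list(state) + list(action)


episode = wrap_episode([[1.0, 2.0], [3.0, 4.0]], [[0.5]], terminated=True)
for transition in episode:
    print(transition)
```

Marking only true terminations as absorbing matters here: an episode cut off by a time limit has not reached a terminal state, so its final next state keeps an indicator of 0.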

Results

Algorithm  Train                Test
PPO        ppo_train_returns    ppo_test_returns
AIRL       airl_train_returns   airl_test_returns
BC         bc_test_returns      bc_test_returns
DRIL       dril_train_returns   dril_test_returns
FAIRL      fairl_train_returns  fairl_test_returns
GAIL       gail_train_returns   gail_test_returns
GMMIL      gmmil_train_returns  gmmil_test_returns
RED        red_train_returns    red_test_returns

Acknowledgements

References

[1] Proximal Policy Optimization Algorithms
[2] Adversarial Behavioral Cloning
[3] Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
[4] Efficient Training of Artificial Neural Networks for Autonomous Navigation
[5] Disagreement-Regularized Imitation Learning
[6] A Divergence Minimization Perspective on Imitation Learning Methods
[7] Generative Adversarial Imitation Learning
[8] Imitation Learning via Kernel Mean Embedding
[9] Primal Wasserstein Imitation Learning
[10] Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
[11] Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
[12] Which Training Methods for GANs do actually Converge?
