All Projects → kristery → Imitation-Learning-from-Imperfect-Demonstration

kristery / Imitation-Learning-from-Imperfect-Demonstration

Licence: other
[ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Imitation-Learning-from-Imperfect-Demonstration

Face Nn
游戏捏脸,基于神经风格迁移框架生成逼真人脸
Stars: ✭ 192 (+433.33%)
Mutual labels:  imitation-learning
rpg
Ranking Policy Gradient
Stars: ✭ 22 (-38.89%)
Mutual labels:  imitation-learning
Comyco
The implementation of "Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning" (ACM MM 2019)
Stars: ✭ 38 (+5.56%)
Mutual labels:  imitation-learning
Imitation
Clean PyTorch implementations of imitation learning algorithms
Stars: ✭ 204 (+466.67%)
Mutual labels:  imitation-learning
Mil
Code for "One-Shot Visual Imitation Learning via Meta-Learning"
Stars: ✭ 254 (+605.56%)
Mutual labels:  imitation-learning
haxball-chameleon
Solving Haxball (www.haxball.com) using Imitation Learning methods.
Stars: ✭ 20 (-44.44%)
Mutual labels:  imitation-learning
Pantheon
Pantheon of Congestion Control
Stars: ✭ 170 (+372.22%)
Mutual labels:  imitation-learning
CoDAIL
Implementation of CoDAIL in the ICLR 2021 paper <Multi-Agent Interactions Modeling with Correlated Policies>
Stars: ✭ 17 (-52.78%)
Mutual labels:  imitation-learning
Pontryagin-Differentiable-Programming
A unified end-to-end learning and control framework that is able to learn a (neural) control objective function, dynamics equation, control policy, or/and optimal trajectory in a control system.
Stars: ✭ 111 (+208.33%)
Mutual labels:  imitation-learning
dril
Disagreement-Regularized Imitation Learning
Stars: ✭ 25 (-30.56%)
Mutual labels:  imitation-learning
Awesome Real World Rl
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
Stars: ✭ 234 (+550%)
Mutual labels:  imitation-learning
Awesome Carla
👉 CARLA resources such as tutorial, blog, code and etc https://github.com/carla-simulator/carla
Stars: ✭ 246 (+583.33%)
Mutual labels:  imitation-learning
vnla
Code accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
Stars: ✭ 60 (+66.67%)
Mutual labels:  imitation-learning
My bibliography for research on autonomous driving
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
Stars: ✭ 197 (+447.22%)
Mutual labels:  imitation-learning
magical
The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)
Stars: ✭ 60 (+66.67%)
Mutual labels:  imitation-learning
Gail Tf
Tensorflow implementation of generative adversarial imitation learning
Stars: ✭ 179 (+397.22%)
Mutual labels:  imitation-learning
end2end-self-driving-car
End-to-end Self-driving Car (Behavioral Cloning)
Stars: ✭ 19 (-47.22%)
Mutual labels:  imitation-learning
robInfLib-matlab
Kernelized Movement Primitives (KMP)
Stars: ✭ 24 (-33.33%)
Mutual labels:  imitation-learning
Coordinated-Multi-Agent-Imitation-Learning
This is an implementation of the paper "Coordinated Multi Agent Imitation Learning", or the Sloan version "Data-Driven Ghosting using Deep Imitation Learning" using Tensorflow
Stars: ✭ 35 (-2.78%)
Mutual labels:  imitation-learning
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+158.33%)
Mutual labels:  imitation-learning

Imitation Learning from Imperfect Demonstration

The TRPO part is hugely based on: https://github.com/ikostrikov/pytorch-trpo

Requirement

  • Python 3.6
  • PyTorch 0.4.1
  • gym
  • mujoco
  • numpy
  • scipy

Execute

The .py files take trajectories and confidence data as inputs (in demonstrations folder) and record accumulated reward at each update in the log folder. Please follow below commands to run our methods and baselines. Traj-size option is the same as specifying $n_c+n_u$ in the paper and num-epochs specifies the maximum number of update iterations.

  • IC_GAIL
python IC_GAIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 
  • 2IWIL
python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 --weight
  • GAIL (U+C)
python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600
  • GAIL (C)
python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 --weight --only --noconf
  • GAIL (Reweight)
python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 --weight --only
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].