kristery / Imitation-Learning-from-Imperfect-Demonstration

Licence: other

[ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"

Labels

pytorch imitation-learning imperfect-demonstrations

Imitation Learning from Imperfect Demonstration

The TRPO part is heavily based on: https://github.com/ikostrikov/pytorch-trpo

Requirements

Python 3.6
PyTorch 0.4.1
gym
mujoco
numpy
scipy
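
Before running anything, it can help to confirm that the packages listed above are importable. The following is a minimal sketch and not part of the repository: the helper name missing_modules is hypothetical, and the import names (torch for PyTorch, mujoco_py for the MuJoCo bindings) are assumptions about how these requirements are installed in your environment.

```python
import importlib.util

# Import names assumed for the requirements listed above.
# "torch" and "mujoco_py" are assumptions; adjust them to your setup.
REQUIRED = ["torch", "gym", "mujoco_py", "numpy", "scipy"]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Not found:", ", ".join(missing))
    else:
        print("All listed modules were found.")
```

This only checks that the modules can be located; it does not verify versions such as PyTorch 0.4.1.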

Execute

The .py files take trajectories and confidence data (in the demonstrations folder) as input and record the accumulated reward at each update in the log folder. Run our methods and the baselines with the commands below. The --traj-size option corresponds to $n_c+n_u$ in the paper, and --num-epochs sets the maximum number of update iterations.

IC_GAIL

python IC_GAIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600

2IWIL

python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 --weight

GAIL (U+C)

python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600

GAIL (C)

python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 --weight --only --noconf

GAIL (Reweight)

python 2IWIL.py --env Ant-v2 --num-epochs 5000 --traj-size 600 --weight --only
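
The five commands above differ only in the script name and the trailing flags, so they can be generated from one table. The sketch below is a convenience wrapper, not part of the repository: the names METHODS and build_command are hypothetical, and the flag combinations are copied verbatim from the commands listed above.

```python
# Script and extra flags for each method, copied from the commands above.
METHODS = {
    "IC_GAIL": ("IC_GAIL.py", []),
    "2IWIL": ("2IWIL.py", ["--weight"]),
    "GAIL (U+C)": ("2IWIL.py", []),
    "GAIL (C)": ("2IWIL.py", ["--weight", "--only", "--noconf"]),
    "GAIL (Reweight)": ("2IWIL.py", ["--weight", "--only"]),
}


def build_command(method, env="Ant-v2", num_epochs=5000, traj_size=600):
    """Return the argument list for one method (hypothetical helper)."""
    script, extra = METHODS[method]
    return [
        "python", script,
        "--env", env,
        "--num-epochs", str(num_epochs),
        "--traj-size", str(traj_size),
        *extra,
    ]


if __name__ == "__main__":
    # Print every command so it can be compared against the list above.
    for name in METHODS:
        print(name + ":", " ".join(build_command(name)))
```

To launch a run, the returned list can be passed to subprocess.run(cmd, check=True) from the repository root, so that the demonstrations and log folders resolve as described above.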
