
xkianteb / dril

Licence: other

Disagreement-Regularized Imitation Learning


Labels

reinforcement-learning imitation-learning sequential-decision-making-problems


Due to a normalization bug, the expert trajectories have lower performance than the experts reported in rl_baseline_zoo. See the following link for where the bug was fixed in the codebase: [link]


Code to train the models described in the paper "Disagreement-Regularized Imitation Learning", by Kianté Brantley, Wen Sun and Mikael Henaff.

Usage:

Install using pip

Install the DRIL package

pip install -e .

Software Dependencies

"stable-baselines", "rl-baselines-zoo", "baselines", "gym", "pytorch", "pybullet"
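A quick way to see which of these dependencies are importable in the current environment is sketched below. The import names in the mapping are assumptions (for example, "pytorch" is normally imported as torch), and "rl-baselines-zoo" is left out because the commands in this README pass it as a directory via --rl_baseline_zoo_dir rather than importing it.

```python
from importlib.util import find_spec

# Assumed import names for the packages listed above; adjust if your
# installation exposes them under different module names.
IMPORT_NAMES = {
    "stable-baselines": "stable_baselines",
    "baselines": "baselines",
    "gym": "gym",
    "pytorch": "torch",
    "pybullet": "pybullet",
}


def missing_dependencies(import_names=IMPORT_NAMES):
    """Return the package names whose module cannot be found."""
    return [
        package
        for package, module in import_names.items()
        if find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```

This only checks that the modules can be located; it does not verify versions.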

Data

We provide a Python script to generate expert data from pre-trained models using the "rl-baselines-zoo" repository. Click "Here" to see all of the pre-trained agents available and their respective performance. Replace <name-of-environment> with the name of the pre-trained agent environment you would like to collect expert data for.

python -u generate_demonstration_data.py --seed <seed-number> --env-name <name-of-environment> --rl_baseline_zoo_dir <location-to-top-level-directory>
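To collect expert data for several seeds, the command above can be assembled programmatically. The following is a minimal sketch that only builds and prints the command lines; the environment name, seed values, and path are placeholder assumptions, and demo_command is a hypothetical helper, not part of the DRIL codebase.

```python
import shlex


def demo_command(env_name, seed, zoo_dir):
    """Build the argument list for generate_demonstration_data.py."""
    return [
        "python", "-u", "generate_demonstration_data.py",
        "--seed", str(seed),
        "--env-name", env_name,
        "--rl_baseline_zoo_dir", zoo_dir,
    ]


if __name__ == "__main__":
    # Placeholder values; substitute your own environment and path.
    env_name = "BreakoutNoFrameskip-v4"
    zoo_dir = "/path/to/rl-baselines-zoo"
    for seed in (0, 1, 2):
        # Print a shell-safe command line; pass the list to
        # subprocess.run(..., check=True) to execute it instead.
        print(shlex.join(demo_command(env_name, seed, zoo_dir)))
```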

Training

DRIL requires a pre-trained ensemble model and a pre-trained behavior-cloning model.

Note that <location-to-rl-baseline-zoo-directory> is the full path to the top-level directory of the rl_baseline_zoo repository.

To train only a behavior-cloning model run:

python -u main.py --env-name <name-of-environment> --num-trajs <number-of-trajectories> --behavior_cloning --rl_baseline_zoo_dir <location-to-rl-baseline-zoo-directory> --seed <seed-number>

To train only an ensemble model run:

python -u main.py --env-name <name-of-environment> --num-trajs <number-of-trajectories> --pretrain_ensemble_only --rl_baseline_zoo_dir <location-to-rl-baseline-zoo-directory> --seed <seed-number>

To train a DRIL model run the command below. The command first checks that both the behavior-cloning model and the ensemble model are trained. If they are not, the script automatically trains both before training DRIL.

python -u main.py --env-name <name-of-environment> --default_experiment_params <type-of-env>  --num-trajs <number-of-trajectories> --rl_baseline_zoo_dir <location-to-rl-baseline-zoo-directory> --seed <seed-number>  --dril

--default_experiment_params selects the default parameters we use in the DRIL experiments. It has two options: atari and continous-control
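The three training commands differ only in their mode flag, so they can be generated from one helper. This is a sketch under assumptions: train_command and the mode names "bc", "ensemble", and "dril" are hypothetical, and only flags that appear in the commands above are used. The option spelling continous-control is kept exactly as written in this README.

```python
MODE_FLAGS = {
    "bc": "--behavior_cloning",
    "ensemble": "--pretrain_ensemble_only",
    "dril": "--dril",
}

# Option values as spelled in this README.
EXPERIMENT_PARAMS = ("atari", "continous-control")


def train_command(mode, env_name, num_trajs, zoo_dir, seed,
                  experiment_params=None):
    """Build the argument list for main.py in the given mode."""
    if mode not in MODE_FLAGS:
        raise ValueError(f"unknown mode: {mode!r}")
    cmd = [
        "python", "-u", "main.py",
        "--env-name", env_name,
        "--num-trajs", str(num_trajs),
        "--rl_baseline_zoo_dir", zoo_dir,
        "--seed", str(seed),
    ]
    # The README only shows this flag together with --dril.
    if experiment_params is not None:
        if experiment_params not in EXPERIMENT_PARAMS:
            raise ValueError(f"unknown experiment params: {experiment_params!r}")
        cmd += ["--default_experiment_params", experiment_params]
    cmd.append(MODE_FLAGS[mode])
    return cmd


if __name__ == "__main__":
    # Placeholder values; substitute your own environment and path.
    print(" ".join(train_command(
        "dril", "BreakoutNoFrameskip-v4", 1,
        "/path/to/rl-baselines-zoo", 0, experiment_params="atari")))
```

Validating the option value before launching avoids starting a long run with a mistyped parameter set.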

Visualization

After training the models, the results are stored in a folder called trained_results. Run the command below to reproduce the plots in our paper. If you change any of the hyperparameters, you will also need to update them in the plot script's file-naming convention.

python -u plot.py -env <name-of-environment>

Empirical evaluation

Atari

Results on Atari environments.

Continuous Control

Results on continuous control tasks.

Acknowledgement:

We would like to thank Ilya Kostrikov for creating this "repo" that our codebase builds on.
