yrlu / Irl Imitation

Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow: Deep MaxEnt, MaxEnt, LPIRL.

irl-imitation

Implementation of selected Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow.

$ python demo.py

Algorithms implemented

  • Linear inverse reinforcement learning (Ng & Russell 2000)
  • Maximum entropy inverse reinforcement learning (Ziebart et al. 2008)
  • Maximum entropy deep inverse reinforcement learning (Wulfmeier et al. 2015)

MDP & solver implemented

  • gridworld 2D
  • gridworld 1D
  • value iteration
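
As a rough illustration of what a value-iteration solver on a 1D gridworld involves, here is a minimal NumPy sketch. It is not taken from this repository: the function names (chain_transitions, value_iteration), the state-based reward R(s), and the deterministic left/right dynamics are all assumptions made for the example.

```python
import numpy as np

def chain_transitions(n_states):
    """Hypothetical 1D gridworld: action 0 = left, action 1 = right.
    Moves are deterministic and walls clamp the agent in place."""
    P = np.zeros((2, n_states, n_states))  # P[a, s, s']
    for s in range(n_states):
        P[0, s, max(s - 1, 0)] = 1.0
        P[1, s, min(s + 1, n_states - 1)] = 1.0
    return P

def value_iteration(P, R, gamma, tol=1e-8, max_iters=10000):
    """Iterate V(s) = R(s) + gamma * max_a sum_s' P[a, s, s'] V(s')
    until the largest change falls below tol."""
    V = np.zeros(P.shape[1])
    for _ in range(max_iters):
        Q = R[None, :] + gamma * P.dot(V)  # shape (n_actions, n_states)
        V_new = Q.max(axis=0)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Recompute Q from the final V and act greedily.
    Q = R[None, :] + gamma * P.dot(V)
    return V, Q.argmax(axis=0)

# Five states, reward only in the right-most one.
P = chain_transitions(5)
R = np.array([0.0, 0.0, 0.0, 0.0, 1.0])
V, policy = value_iteration(P, R, gamma=0.9)
print(V)
print(policy)
```

With the reward placed only in the last state, the greedy policy should move right everywhere, and the value of the last state approaches 1 / (1 - gamma).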

Dependencies

  • Python 2.7
  • cvxopt
  • TensorFlow 0.12.1
  • matplotlib

Linear Inverse Reinforcement Learning

$ python linear_irl_gridworld.py --act_random=0.3 --gamma=0.5 --l1=10 --r_max=10
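
For reference, the finite-state linear program from Ng & Russell (2000) can be written as below. Here N is the number of states, a_1 is the expert's action in every state, P_a is the transition matrix for action a, and P_a(i) is its i-th row. Reading --gamma, --l1 and --r_max as γ, λ and R_max is an assumption based on the flag names, not something this README states.

```latex
\begin{aligned}
\max_{R}\quad
  & \sum_{i=1}^{N} \min_{a \in A \setminus \{a_1\}}
    \bigl(P_{a_1}(i) - P_a(i)\bigr)\,(I - \gamma P_{a_1})^{-1} R
    \;-\; \lambda \lVert R \rVert_1 \\
\text{s.t.}\quad
  & (P_{a_1} - P_a)\,(I - \gamma P_{a_1})^{-1} R \succeq 0
    \quad \forall a \in A \setminus \{a_1\}, \\
  & |R_i| \le R_{\max}, \quad i = 1, \dots, N.
\end{aligned}
```

The constraint says that the expert's action is at least as good as every alternative under R. The objective prefers rewards that make the gap to the next-best action large, and the L1 penalty pushes toward sparse rewards.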

Maximum Entropy Inverse Reinforcement Learning

(This implementation is largely influenced by Matthew Alger's maxent implementation)

$ python maxent_irl_gridworld.py --height=10 --width=10 --gamma=0.8 --n_trajs=100 --l_traj=50 --no-rand_start --learning_rate=0.01 --n_iters=20

$ python maxent_irl_gridworld.py --gamma=0.8 --n_trajs=400 --l_traj=50 --rand_start --learning_rate=0.01 --n_iters=20
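
The core update in maximum entropy IRL (Ziebart et al. 2008) is a gradient step on the reward weights: expert feature counts minus the feature counts expected under the current reward. The sketch below shows that loop on a tiny 1D chain using NumPy. It is not this repository's code; all function names are hypothetical, and computing the policy by soft value iteration with discounting is one common approximation assumed here.

```python
import numpy as np

def chain_transitions(n_states):
    """Hypothetical 1D chain: action 0 = left, 1 = right; walls clamp."""
    P = np.zeros((2, n_states, n_states))  # P[a, s, s']
    for s in range(n_states):
        P[0, s, max(s - 1, 0)] = 1.0
        P[1, s, min(s + 1, n_states - 1)] = 1.0
    return P

def soft_policy(P, r, gamma, n_iters=200):
    """Soft value iteration (log-sum-exp backup); returns pi[s, a]."""
    V = np.zeros(P.shape[1])
    for _ in range(n_iters):
        Q = r[None, :] + gamma * P.dot(V)            # (n_actions, n_states)
        m = Q.max(axis=0)
        V = m + np.log(np.exp(Q - m).sum(axis=0))    # stable log-sum-exp
    Q = r[None, :] + gamma * P.dot(V)
    pi = np.exp(Q - Q.max(axis=0))                   # softmax over actions
    return (pi / pi.sum(axis=0)).T

def expected_svf(P, pi, p0, horizon):
    """Expected state visitation counts over `horizon` time steps."""
    d = p0.copy()
    svf = d.copy()
    for _ in range(horizon - 1):
        # d'[t] = sum_s sum_a d[s] * pi[s, a] * P[a, s, t]
        d = np.einsum("s,sa,ast->t", d, pi, P)
        svf += d
    return svf

def maxent_irl(P, features, trajs, gamma, horizon, lr=0.1, n_iters=100):
    """Gradient ascent on theta; returns the per-state reward estimate."""
    n_states, n_feat = features.shape
    f_expert = np.zeros(n_feat)   # average feature counts per trajectory
    p0 = np.zeros(n_states)       # empirical start-state distribution
    for traj in trajs:
        p0[traj[0]] += 1.0
        for s in traj:
            f_expert += features[s]
    f_expert /= len(trajs)
    p0 /= len(trajs)

    theta = np.zeros(n_feat)
    for _ in range(n_iters):
        r = features.dot(theta)
        pi = soft_policy(P, r, gamma)
        svf = expected_svf(P, pi, p0, horizon)
        theta += lr * (f_expert - features.T.dot(svf))
    return features.dot(theta)

# Expert always walks right from state 0 and then stays at the right end.
P = chain_transitions(5)
features = np.eye(5)  # one-hot state features
expert = [[0, 1, 2, 3, 4, 4, 4, 4]] * 10
reward = maxent_irl(P, features, expert, gamma=0.9, horizon=8)
print(reward)
```

Because the expert spends most of its time in the right-most state, the recovered reward there ends up above the reward of the start state.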

Maximum Entropy Deep Inverse Reinforcement Learning

  • Follows Wulfmeier et al. (2015), "Maximum Entropy Deep Inverse Reinforcement Learning". Only the fully connected (FC) version is implemented. The implementation does not exactly follow the model proposed in the paper; it applies some tweaks, including ELU activations, gradient clipping, and L2 regularization.
  • Run $ python deep_maxent_irl_gridworld.py --help for descriptions of the options.

$ python deep_maxent_irl_gridworld.py --learning_rate=0.02 --n_trajs=200 --n_iters=20
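
To make the three tweaks concrete, here is a small NumPy sketch of an ELU activation, gradient clipping, and an L2 penalty around a two-layer fully connected reward model. It is an illustration only, not the repository's TensorFlow code. The layer sizes, the function names, and the choice to clip by global norm rather than by value are assumptions.

```python
import numpy as np

def elu(x, alpha=1.0):
    """ELU: x where x > 0, alpha * (exp(x) - 1) elsewhere."""
    x = np.asarray(x, dtype=float)
    return np.where(x > 0, x, alpha * (np.exp(np.minimum(x, 0.0)) - 1.0))

def reward_net(features, W1, W2):
    """Hypothetical two-layer FC reward model: one scalar per state."""
    return elu(features.dot(W1)).dot(W2).ravel()

def clip_by_global_norm(grads, max_norm):
    """Rescale all gradients together so their joint L2 norm <= max_norm."""
    total = np.sqrt(sum(np.sum(g ** 2) for g in grads))
    scale = min(1.0, max_norm / (total + 1e-12))
    return [g * scale for g in grads]

def l2_penalty(weights, coeff):
    """coeff * sum of squared weights, added to the training loss."""
    return coeff * sum(np.sum(w ** 2) for w in weights)

# Toy shapes: 25 states with one-hot features, 8 hidden units.
rng = np.random.RandomState(0)
features = np.eye(25)
W1 = rng.normal(scale=0.1, size=(25, 8))
W2 = rng.normal(scale=0.1, size=(8, 1))

rewards = reward_net(features, W1, W2)
clipped = clip_by_global_norm([np.array([3.0, 4.0])], max_norm=1.0)
penalty = l2_penalty([W1, W2], coeff=0.01)
print(rewards.shape, clipped[0], penalty)
```

In deep MaxEnt IRL the network output plays the role of the per-state reward, and the gradient of the MaxEnt objective with respect to that reward (expert visitation counts minus expected counts) is backpropagated through the network. Clipping and the L2 term would be applied to the resulting weight gradients and loss.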

MIT License
