Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → jangirrishabh → Toycarirl

jangirrishabh / Toycarirl

Licence: mit

Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinforcement Learning Abbeel & Ng, 2004)

Programming Languages

python

139335 projects - #7 most used programming language

Labels

artificial-intelligence reinforcement-learning robotics pygame

Projects that are alternatives of or similar to Toycarirl

Ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

Stars: ✭ 133 (+3.91%)

Mutual labels: artificial-intelligence, robotics, reinforcement-learning

Rex Gym

OpenAI Gym environments for an open-source quadruped robot (SpotMicro)

Stars: ✭ 684 (+434.38%)

Mutual labels: artificial-intelligence, robotics, reinforcement-learning

Dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Stars: ✭ 269 (+110.16%)

Mutual labels: artificial-intelligence, robotics, reinforcement-learning

Pygame Learning Environment

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Stars: ✭ 828 (+546.88%)

Mutual labels: artificial-intelligence, pygame, reinforcement-learning

Dreamerv2

Mastering Atari with Discrete World Models

Stars: ✭ 287 (+124.22%)

Mutual labels: artificial-intelligence, robotics, reinforcement-learning

Awesome Decision Making Reinforcement Learning

A selection of state-of-the-art research materials on decision making and motion planning.

Stars: ✭ 68 (-46.87%)

Mutual labels: artificial-intelligence, robotics, reinforcement-learning

Hand dapg

Repository to accompany RSS 2018 paper on dexterous hand manipulation

Stars: ✭ 88 (-31.25%)

Mutual labels: robotics, reinforcement-learning

Mapleai

AI各领域学习资料整理。（A collection of all skills and knowledges should be got command of to obtain an AI relevant job offer. There are online blogs, my personal blogs, electronic books copy.）

Stars: ✭ 89 (-30.47%)

Mutual labels: artificial-intelligence, reinforcement-learning

60 days rl challenge

60_Days_RL_Challenge中文版

Stars: ✭ 92 (-28.12%)

Mutual labels: artificial-intelligence, reinforcement-learning

Rlai Exercises

Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]

Stars: ✭ 97 (-24.22%)

Mutual labels: artificial-intelligence, reinforcement-learning

Ai Reading Materials

Some of the ML and DL related reading materials, research papers that I've read

Stars: ✭ 79 (-38.28%)

Mutual labels: artificial-intelligence, reinforcement-learning

Researchpapernotes

Initiative to read research papers

Stars: ✭ 97 (-24.22%)

Mutual labels: robotics, reinforcement-learning

Chemgan Challenge

Code for the paper: Benhenda, M. 2017. ChemGAN challenge for drug discovery: can AI reproduce natural chemical diversity? arXiv preprint arXiv:1708.08227.

Stars: ✭ 98 (-23.44%)

Mutual labels: artificial-intelligence, reinforcement-learning

Stable Baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Stars: ✭ 1,263 (+886.72%)

Mutual labels: robotics, reinforcement-learning

Simulator

A ROS/ROS2 Multi-robot Simulator for Autonomous Vehicles

Stars: ✭ 1,260 (+884.38%)

Mutual labels: artificial-intelligence, reinforcement-learning

Safeopt

Safe Bayesian Optimization

Stars: ✭ 90 (-29.69%)

Mutual labels: robotics, reinforcement-learning

Snake

Artificial intelligence for the Snake game.

Stars: ✭ 1,241 (+869.53%)

Mutual labels: artificial-intelligence, reinforcement-learning

Papers Literature Ml Dl Rl Ai

Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning

Stars: ✭ 1,341 (+947.66%)

Mutual labels: artificial-intelligence, reinforcement-learning

Reinforcement Learning Cheat Sheet

Stars: ✭ 104 (-18.75%)

Mutual labels: artificial-intelligence, reinforcement-learning

Pose Interpreter Networks

Real-Time Object Pose Estimation with Pose Interpreter Networks (IROS 2018)

Stars: ✭ 104 (-18.75%)

Mutual labels: artificial-intelligence, robotics

View All Similar Projects ➔

Using Inverse Reinforcement Learning to train a toy car in a 2D game to learn given Expert Behaviors

Note: The RL algorithm and the simulation game used is the work of Matthew Harvey. Thank you Matt

To know more please visit my blog https://jangirrishabh.github.io/2016/07/09/virtual-car-IRL/

Apprenticeship learning using Inverse Reinforcement Learning

Reinforcement learning (RL) is is the very basic and most intuitive form of trial and error learning, it is the way by which most of the living organisms with some form of thinking capabilities learn. Often referred to as learning by exploration, it is the way by which a new born human baby learns to take its first steps, that is by taking random actions initially and then slowly figuring out the actions which lead to the forward walking motion.

Now the question that I kept asking myself is, what is the driving force for this kind of learning, what forces the agent to learn a particular behavior in the way it is doing it. Upon learning more about RL I came across the idea of rewards, basically the agent tries to choose its actions in such a way that the rewards that is gets from that particular behavior are maximized. Now to make the agent perform different behaviors, it is the reward structure that one must modify/exploit. But assume we only have the knowledge of the behavior of the expert with us, then how do we estimate the reward structure given a particular behavior in the environment? Well, this is the very problem of Inverse Reinforcement Learning (IRL), where given the optimal expert policy (actually assumed to be optimal), we wish to determine the underlying reward structure.

Again, this is not an Intro to Inverse Reinforcement Learning post, rather it is a tutorial on how to use/code Inverse reinforcement learning framework for your own problem, but IRL lies at the very core of it, and it is quintessential to know about it first. IRL has been extensively studied in the past and algorithms have been developed for it, please go through the papers Ng and Russell,2000, and Abbeel and Ng, 2004 for more information.

Install Pygame

Install Pygame's dependencies with:

sudo apt install mercurial libfreetype6-dev libsdl-dev libsdl-image1.2-dev libsdl-ttf2.0-dev libsmpeg-dev libportmidi-dev libavformat-dev libsdl-mixer1.2-dev libswscale-dev libjpeg-dev

Then install Pygame itself:

pip3 install hg+http://bitbucket.org/pygame/pygame

Install Pymunk

This is the physics engine used by the simulation. It just went through a pretty significant rewrite (v5) so you need to grab the older v4 version. v4 is written for Python 2 so there are a couple extra steps.

Go back to your home or downloads and get Pymunk 4:

wget https://github.com/viblo/pymunk/archive/pymunk-4.0.0.tar.gz

Unpack it:

tar zxvf pymunk-4.0.0.tar.gz

Update from Python 2 to 3:

cd pymunk-pymukn-4.0.0/pymunk

2to3 -w *.py

Install it:

cd .. python3 setup.py install

Now go back to where you cloned reinforcement-learning-car and make sure everything worked with a quick python3 learning.py. If you see a screen come up with a little dot flying around the screen, you're ready to go!

Training

First, you need to train a model. This will save weights to the saved-models folder. You may need to create this folder before running. You can train the model by running:

python3 learning.py

It can take anywhere from an hour to 36 hours to train a model, depending on the complexity of the network and the size of your sample. However, it will spit out weights every 25,000 frames, so you can move on to the next step in much less time.

Playing

Edit the playing.py file to change the path name for the model you want to load. Sorry about this, I know it should be a command line argument.

Then, watch the car drive itself around the obstacles!

python3 playing.py

That's all there is to it.

Resources and credits -

Thank you Matt Harvey for the game and the RL framework, basically for the father blog to this blog - Using RL to teach a virtual car to avoid obstacles
Andrew Ng and Stuart Russell, 2000 - Algorithms for Inverse Reinforcement Learning
Pieter Abbeel and Andrew Ng, 2004 - Apprenticeship Learning via Inverse Reinforcement Learning

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 128

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (4) 🔗