All Projects → nikhilbarhate99 → Deterministic Gail Pytorch

nikhilbarhate99 / Deterministic Gail Pytorch

Licence: mit
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Deterministic Gail Pytorch

Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: ✭ 394 (+795.45%)
Mutual labels:  gym, reinforcement-learning, generative-adversarial-network, deep-reinforcement-learning, openai-gym
Rl Book
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (+954.55%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning, openai-gym
Rl algos
Reinforcement Learning Algorithms
Stars: ✭ 14 (-68.18%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+904.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, imitation-learning
Rl a3c pytorch
A3C LSTM Atari with Pytorch plus A3G design
Stars: ✭ 482 (+995.45%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+9018.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Rl Baselines Zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Stars: ✭ 839 (+1806.82%)
Mutual labels:  gym, reinforcement-learning, openai-gym
Mushroom Rl
Python library for Reinforcement Learning.
Stars: ✭ 442 (+904.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (+509.09%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Drlkit
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-34.09%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Hands On Reinforcement Learning With Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Stars: ✭ 640 (+1354.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Super Mario Bros Ppo Pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Stars: ✭ 649 (+1375%)
Mutual labels:  gym, reinforcement-learning, openai-gym
Paac.pytorch
Pytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/abs/1705.04862
Stars: ✭ 22 (-50%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Gym Duckietown
Self-driving car simulator for the Duckietown universe
Stars: ✭ 379 (+761.36%)
Mutual labels:  reinforcement-learning, openai-gym, imitation-learning
Tf2rl
TensorFlow2 Reinforcement Learning
Stars: ✭ 353 (+702.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, imitation-learning
Rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Stars: ✭ 980 (+2127.27%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning, openai-gym
Ma Gym
A collection of multi agent environments based on OpenAI gym.
Stars: ✭ 226 (+413.64%)
Mutual labels:  gym, reinforcement-learning, openai-gym
Gym Gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Stars: ✭ 257 (+484.09%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
Stars: ✭ 628 (+1327.27%)
Mutual labels:  gym, reinforcement-learning, deep-reinforcement-learning
Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+1395.45%)
Mutual labels:  reinforcement-learning, generative-adversarial-network, deep-reinforcement-learning

Deterministic-GAIL-PyTorch

This is an attempt to implement Generative Adversarial Imitation Learning (GAIL) for deterministic policies with off Policy learning on static data. The policy never interacts with the environment (except for evaluation), instead it is trained on policy state-action pair, where policy only selects actions for states sampled from expert data.

Results

Although it works sometimes (depending on the type of environment), the algorithm has high variance, and the results are inconsistent.

BipedalWalker-v2

Expert Policy Recovered Policy (10 expert episodes)
Epochs vs rewards
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].