All Projects → nosyndicate → pytorchrl

nosyndicate / pytorchrl

Licence: MIT License
Deep Reinforcement Learning algorithms implemented in PyTorch

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to pytorchrl

Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (-6.38%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Carla
Open-source simulator for autonomous driving research.
Stars: ✭ 7,012 (+14819.15%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Tf2rl
TensorFlow2 Reinforcement Learning
Stars: ✭ 353 (+651.06%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+372.34%)
Mutual labels:  deep-reinforcement-learning, inverse-reinforcement-learning
Gym Carla
An OpenAI gym wrapper for CARLA simulator
Stars: ✭ 164 (+248.94%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Imitation Learning
Imitation learning algorithms
Stars: ✭ 85 (+80.85%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+840.43%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Easy Rl
强化学习中文教程,在线阅读地址:https://datawhalechina.github.io/easy-rl/
Stars: ✭ 3,004 (+6291.49%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Pontryagin-Differentiable-Programming
A unified end-to-end learning and control framework that is able to learn a (neural) control objective function, dynamics equation, control policy, or/and optimal trajectory in a control system.
Stars: ✭ 111 (+136.17%)
Mutual labels:  imitation-learning, inverse-reinforcement-learning
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+97.87%)
Mutual labels:  deep-reinforcement-learning, imitation-learning
Deep-Reinforcement-Learning
Introduction to Deep Reinforcement Learning
Stars: ✭ 71 (+51.06%)
Mutual labels:  deep-reinforcement-learning
reinforcement-learning-papers
My notes on reinforcement learning papers
Stars: ✭ 13 (-72.34%)
Mutual labels:  deep-reinforcement-learning
hgail
gail, infogail, hierarchical gail implementations
Stars: ✭ 25 (-46.81%)
Mutual labels:  imitation-learning
neat
[ICCV'21] NEAT: Neural Attention Fields for End-to-End Autonomous Driving
Stars: ✭ 194 (+312.77%)
Mutual labels:  imitation-learning
catalyst-examples
Examples
Stars: ✭ 54 (+14.89%)
Mutual labels:  deep-reinforcement-learning
SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
Stars: ✭ 18 (-61.7%)
Mutual labels:  imitation-learning
wolpertinger ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.
Stars: ✭ 44 (-6.38%)
Mutual labels:  deep-reinforcement-learning
snake-reinforcement-DNN
Developing a deep neural network to play a snake game
Stars: ✭ 12 (-74.47%)
Mutual labels:  deep-reinforcement-learning
Imitation-Learning-from-Imperfect-Demonstration
[ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"
Stars: ✭ 36 (-23.4%)
Mutual labels:  imitation-learning
Deep RL with pytorch
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Stars: ✭ 160 (+240.43%)
Mutual labels:  deep-reinforcement-learning

PyTorchRL

Deep Reinforcement Learning implemented in pytorch.

This project trying to mimic the structure of rllab code and occasionally borrow code from rllab and inverse_rl

  • Implemented Algorithms
    • Online

      • DDPG (Deep Deterministic Policy Gradient)
      • NAF (Normalized Advantage Function)
      • Soft Q-Learning (Deep Energy-Based Policy)
    • Batch

      • TRPO (Trust Region Policy Optimization)
    • Inverse Reinforcement Learning & Imitation Learning

      • Generative Adversarial Imitation Learning
      • Adversarial Inverse Reinforcement Learning
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].