All Projects → jcwleo → random-network-distillation-pytorch

jcwleo / random-network-distillation-pytorch

Licence: MIT license
Random Network Distillation pytorch

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to random-network-distillation-pytorch

RND-Pytorch
Random Network Distillation(RND) algo in Pytorch
Stars: ✭ 39 (-78.92%)
Mutual labels:  rnd, random-network-distillation
play-json-extensions
+22 field case class formatter and more for play-json
Stars: ✭ 193 (+4.32%)
Mutual labels:  rnd
diff
Visually compare Scala data structures with out of the box support for arbitrary case classes.
Stars: ✭ 179 (-3.24%)
Mutual labels:  rnd
typeless
running wild with shapeless
Stars: ✭ 16 (-91.35%)
Mutual labels:  rnd
Street-fighter-A3C-ICM-pytorch
Curiosity-driven Exploration by Self-supervised Prediction for Street Fighter III Third Strike
Stars: ✭ 149 (-19.46%)
Mutual labels:  curiosity-driven
Deep RL with pytorch
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Stars: ✭ 160 (-13.51%)
Mutual labels:  random-network-distillation
rl pytorch
Deep Reinforcement Learning Algorithms Implementation in PyTorch
Stars: ✭ 23 (-87.57%)
Mutual labels:  random-network-distillation
reinforcement learning ppo rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
Stars: ✭ 33 (-82.16%)
Mutual labels:  random-network-distillation

Random Network Distillation

Intrinsic Reward Graph with play

Venture Montezuma's Revenge
Video Label
~ New model for Montezuma
  • Advantage Actor critic [1]
  • Parallel Advantage Actor critic [2]
  • Exploration by Random Network Distillation [3]
  • Proximal Policy Optimization Algorithms [4]

1. Setup

Requirements


2. How to Train

Modify the parameters in config.conf as you like.

python train.py

3. How to Eval

python eval.py

4. Loss/Reward Graph

  • Montezuma's Revenge Env image
  • Venture Env image

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Exploration by Random Network Distillation
[4] Proximal Policy Optimization Algorithms

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].