All Projects → openai → Epg

openai / Epg

Licence: mit
Code for the paper "Evolved Policy Gradients"

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Epg

FSL-Mate
FSL-Mate: A collection of resources for few-shot learning (FSL).
Stars: ✭ 1,346 (+559.8%)
Mutual labels:  paper, meta-learning
Dl Nlp Readings
My Reading Lists of Deep Learning and Natural Language Processing
Stars: ✭ 656 (+221.57%)
Mutual labels:  paper, reinforcement-learning
maml-rl-tf2
Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.
Stars: ✭ 16 (-92.16%)
Mutual labels:  paper, meta-learning
Learning Deep Learning
Paper reading notes on Deep Learning and Machine Learning
Stars: ✭ 388 (+90.2%)
Mutual labels:  paper, reinforcement-learning
Ai Reading Materials
Some of the ML and DL related reading materials, research papers that I've read
Stars: ✭ 79 (-61.27%)
Mutual labels:  paper, reinforcement-learning
Awesome Deeplearning Resources
Deep Learning and deep reinforcement learning research papers and some codes
Stars: ✭ 2,483 (+1117.16%)
Mutual labels:  paper, reinforcement-learning
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+116.67%)
Mutual labels:  meta-learning, reinforcement-learning
Awesome Real World Rl
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
Stars: ✭ 234 (+14.71%)
Mutual labels:  meta-learning, reinforcement-learning
Personae
📈 Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.
Stars: ✭ 1,140 (+458.82%)
Mutual labels:  paper, reinforcement-learning
Neural Architecture Search With Rl
Minimal Tensorflow implementation of the paper "Neural Architecture Search With Reinforcement Learning" presented at ICLR 2017
Stars: ✭ 37 (-81.86%)
Mutual labels:  paper, reinforcement-learning
Drl4recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
Stars: ✭ 196 (-3.92%)
Mutual labels:  paper, reinforcement-learning
Savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
Stars: ✭ 135 (-33.82%)
Mutual labels:  meta-learning, reinforcement-learning
Transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
Stars: ✭ 8,481 (+4057.35%)
Mutual labels:  paper, meta-learning
Papers Literature Ml Dl Rl Ai
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+557.35%)
Mutual labels:  paper, reinforcement-learning
Promp
ProMP: Proximal Meta-Policy Search
Stars: ✭ 181 (-11.27%)
Mutual labels:  meta-learning, reinforcement-learning
Openml Python
Python module to interface with OpenML
Stars: ✭ 202 (-0.98%)
Mutual labels:  meta-learning
Dm control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Stars: ✭ 2,592 (+1170.59%)
Mutual labels:  reinforcement-learning
Reinforcementlearning.jl
A reinforcement learning package for Julia
Stars: ✭ 192 (-5.88%)
Mutual labels:  reinforcement-learning
Papers we read
Summaries of the papers that are discussed by VLG.
Stars: ✭ 203 (-0.49%)
Mutual labels:  reinforcement-learning
Release
Deep Reinforcement Learning for de-novo Drug Design
Stars: ✭ 201 (-1.47%)
Mutual labels:  reinforcement-learning

Status: Archive (code is provided as-is, no updates expected)

Evolved Policy Gradients (EPG)

The paper is located at https://arxiv.org/abs/1802.04821. A demonstration video can be found at https://youtu.be/-Z-ieH6w0LA.

Houthooft, R., Chen, R. Y., Isola, P., Stadie, B. C., Wolski, F., Ho, J., Abbeel, P. (2018). Evolved Policy Gradients. arXiv preprint arXiv:1802.04821.

Installation

Install Anaconda:

curl -o /tmp/miniconda.sh https://repo.continuum.io/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
bash /tmp/miniconda.sh
conda create -n epg python=3.6.1
source activate epg

Install necessary OSX packages for MPI:

brew install open-mpi

Install necessary Python packages:

pip install mpi4py==3.0.0 scipy \
pandas tqdm joblib cloudpickle == 0.5.2 \
progressbar2 opencv-python flask >= 0.11.1 matplotlib pytest cython \
chainer pathos mujoco_py 'gym[all]'

Running

First go to the EPG code folder:

cd <path_to_EPG_folder>

Then launch the entry script:

PYTHONPATH=. python epg/launch_local.py

Experiment data is saved in <home_dir>/EPG_experiments/<month>-<day>/<experiment_name>.

Testing

First, set theta_load_path = '<path_to_theta.npy>/theta.npy' in launch_local.py according to the theta.npy obtained after running the launch_local.py script. This file should be located in /<home_dir>/EPG_experiments/<month>-<day>/<experiment_name>/thetas/.

Then run:

PYTHONPATH=. python epg/launch_local.py --test true

Visualizing experiment data

Assuming the experiment data is saved in <home_dir>/EPG_experiments/<month>-<day>/<experiment_name>, run:

PYTHONPATH=. python epg/viskit/frontend.py <home_dir>/EPG_experiments/<month>-<day>/<experiment_name>

Then go to http://0.0.0.0:5000 in your browser.

Viskit sourced from

Duan, Y., Chen, X., Houthooft, R., Schulman, J., Abbeel, P. "Benchmarking Deep Reinforcement Learning for Continuous Control". Proceedings of the 33rd International Conference on Machine Learning (ICML), 2016.

BibTeX entry

@article{Houthooft18Evolved,
author = {Houthooft, Rein and Chen, Richard Y. and Isola, Phillip and Stadie, Bradly C. and Wolski, Filip and Ho, Jonathan and Abbeel, Pieter},
title = {Evolved Policy Gradients},
journal={arXiv preprint arXiv:1802.04821},
year = {2018}}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].