Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (+116.67%)

Mutual labels: meta-learning, reinforcement-learning

Awesome Real World Rl

Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.

Stars: ✭ 234 (+14.71%)

Mutual labels: meta-learning, reinforcement-learning

Personae

📈 Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.

Stars: ✭ 1,140 (+458.82%)

Mutual labels: paper, reinforcement-learning

Neural Architecture Search With Rl

Minimal Tensorflow implementation of the paper "Neural Architecture Search With Reinforcement Learning" presented at ICLR 2017

Stars: ✭ 37 (-81.86%)

Mutual labels: paper, reinforcement-learning

Drl4recsys

Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems

Stars: ✭ 196 (-3.92%)

Mutual labels: paper, reinforcement-learning

Savn

Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)

Stars: ✭ 135 (-33.82%)

Mutual labels: meta-learning, reinforcement-learning

Transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Stars: ✭ 8,481 (+4057.35%)

Mutual labels: paper, meta-learning

Papers Literature Ml Dl Rl Ai

Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning

Stars: ✭ 1,341 (+557.35%)

Mutual labels: paper, reinforcement-learning

Promp

ProMP: Proximal Meta-Policy Search

Stars: ✭ 181 (-11.27%)

Mutual labels: meta-learning, reinforcement-learning

Openml Python

Python module to interface with OpenML

Stars: ✭ 202 (-0.98%)

Mutual labels: meta-learning

Dm control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Stars: ✭ 2,592 (+1170.59%)

Mutual labels: reinforcement-learning

Reinforcementlearning.jl

A reinforcement learning package for Julia

Stars: ✭ 192 (-5.88%)

Mutual labels: reinforcement-learning

Papers we read

Summaries of the papers that are discussed by VLG.

Stars: ✭ 203 (-0.49%)

Mutual labels: reinforcement-learning

Release

Deep Reinforcement Learning for de-novo Drug Design

Stars: ✭ 201 (-1.47%)

Mutual labels: reinforcement-learning

View All Similar Projects ➔

Status: Archive (code is provided as-is, no updates expected)

Evolved Policy Gradients (EPG)

The paper is located at https://arxiv.org/abs/1802.04821. A demonstration video can be found at https://youtu.be/-Z-ieH6w0LA.

Houthooft, R., Chen, R. Y., Isola, P., Stadie, B. C., Wolski, F., Ho, J., Abbeel, P. (2018). Evolved Policy Gradients. arXiv preprint arXiv:1802.04821.

Installation

Install Anaconda:

curl -o /tmp/miniconda.sh https://repo.continuum.io/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
bash /tmp/miniconda.sh
conda create -n epg python=3.6.1
source activate epg

Install necessary OSX packages for MPI:

brew install open-mpi

Install necessary Python packages:

pip install mpi4py==3.0.0 scipy \
pandas tqdm joblib cloudpickle == 0.5.2 \
progressbar2 opencv-python flask >= 0.11.1 matplotlib pytest cython \
chainer pathos mujoco_py 'gym[all]'

Running

First go to the EPG code folder:

cd <path_to_EPG_folder>

Then launch the entry script:

PYTHONPATH=. python epg/launch_local.py

Experiment data is saved in <home_dir>/EPG_experiments/<month>-<day>/<experiment_name>.

Testing

First, set theta_load_path = '<path_to_theta.npy>/theta.npy' in launch_local.py according to the theta.npy obtained after running the launch_local.py script. This file should be located in /<home_dir>/EPG_experiments/<month>-<day>/<experiment_name>/thetas/.

Then run:

PYTHONPATH=. python epg/launch_local.py --test true

Visualizing experiment data

Assuming the experiment data is saved in <home_dir>/EPG_experiments/<month>-<day>/<experiment_name>, run:

PYTHONPATH=. python epg/viskit/frontend.py <home_dir>/EPG_experiments/<month>-<day>/<experiment_name>

Then go to http://0.0.0.0:5000 in your browser.

Viskit sourced from

Duan, Y., Chen, X., Houthooft, R., Schulman, J., Abbeel, P. "Benchmarking Deep Reinforcement Learning for Continuous Control". Proceedings of the 33rd International Conference on Machine Learning (ICML), 2016.

BibTeX entry

@article{Houthooft18Evolved,
author = {Houthooft, Rein and Chen, Richard Y. and Isola, Phillip and Stadie, Bradly C. and Wolski, Filip and Ho, Jonathan and Abbeel, Pieter},
title = {Evolved Policy Gradients},
journal={arXiv preprint arXiv:1802.04821},
year = {2018}}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 204

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (7) 🔗