All Projects → llan-ml → tesp

llan-ml / tesp

Licence: other
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"

Programming Languages

python
139335 projects - #7 most used programming language
HTML
75241 projects

Projects that are alternatives of or similar to tesp

maml-rl-tf2
Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.
Stars: ✭ 16 (-42.86%)
Mutual labels:  meta-learning, meta-rl, meta-reinforcement
FOCAL-ICLR
Code for FOCAL Paper Published at ICLR 2021
Stars: ✭ 35 (+25%)
Mutual labels:  meta-learning, meta-rl
Keita
My personal toolkit for PyTorch development.
Stars: ✭ 124 (+342.86%)
Mutual labels:  meta-learning
Meta Learning Papers
Meta Learning / Learning to Learn / One Shot Learning / Few Shot Learning
Stars: ✭ 2,420 (+8542.86%)
Mutual labels:  meta-learning
Promp
ProMP: Proximal Meta-Policy Search
Stars: ✭ 181 (+546.43%)
Mutual labels:  meta-learning
Canet
The code for paper "CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning"
Stars: ✭ 135 (+382.14%)
Mutual labels:  meta-learning
Meta Learning Papers
A classified list of meta learning papers based on realm.
Stars: ✭ 193 (+589.29%)
Mutual labels:  meta-learning
Metar Cnn
Meta R-CNN : Towards General Solver for Instance-level Low-shot Learning
Stars: ✭ 120 (+328.57%)
Mutual labels:  meta-learning
Awesome Real World Rl
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
Stars: ✭ 234 (+735.71%)
Mutual labels:  meta-learning
Mzsr
Meta-Transfer Learning for Zero-Shot Super-Resolution (CVPR, 2020)
Stars: ✭ 181 (+546.43%)
Mutual labels:  meta-learning
Epg
Code for the paper "Evolved Policy Gradients"
Stars: ✭ 204 (+628.57%)
Mutual labels:  meta-learning
Metalearning4nlp Papers
A list of recent papers about Meta / few-shot learning methods applied in NLP areas.
Stars: ✭ 163 (+482.14%)
Mutual labels:  meta-learning
Savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
Stars: ✭ 135 (+382.14%)
Mutual labels:  meta-learning
Openml Python
Python module to interface with OpenML
Stars: ✭ 202 (+621.43%)
Mutual labels:  meta-learning
Mfr
Learning Meta Face Recognition in Unseen Domains, CVPR, Oral, 2020
Stars: ✭ 127 (+353.57%)
Mutual labels:  meta-learning
Mini Imagenet Tools
Tools for generating mini-ImageNet dataset and processing batches
Stars: ✭ 209 (+646.43%)
Mutual labels:  meta-learning
Meta Learning Lstm Pytorch
pytorch implementation of Optimization as a Model for Few-shot Learning
Stars: ✭ 121 (+332.14%)
Mutual labels:  meta-learning
Meta Weight Net
NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).
Stars: ✭ 158 (+464.29%)
Mutual labels:  meta-learning
Hyperactive
A hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.
Stars: ✭ 182 (+550%)
Mutual labels:  meta-learning
Mil
Code for "One-Shot Visual Imitation Learning via Meta-Learning"
Stars: ✭ 254 (+807.14%)
Mutual labels:  meta-learning

Meta Reinforcement Learning

This repository contains the TensorFlow implementation of our paper Meta Reinforcement Learning with Task Embedding and Shared Policy (IJCAI 2019).

For detailed settings of the environments and experiments, please refer to the supplementary material.

We also re-implement the following methods:

Requirements

  • python==3.6.5
  • tensorflow>=1.11,<2.0
  • ray>=0.6 (we used this version in the experiments, but any version ray>=0.6 should work)
  • gym>=0.10
  • mujoco-py==1.50.1.68

We recommend installing Anaconda before installing other dependencies.

Usage

Preparation

We provide several samples of bash scripts to ease operations on a Ray cluster:

  • env.sh: Declare the configuration of Ray cluster.
  • update_ray_codes.sh: Replace with our slightly modified RLlib and Tune.
  • exec_commands.sh: Make some directories and (optionally) install ray on each node.
  • ray_start.sh: Start a Ray cluster.
  • ray_stop.sh: Stop a Ray cluster.
  • upload_codes.sh: Upload the training code to each node of a Ray cluster.

Run

Distributed Mode

You need to first launch a Ray cluster declared in env.sh and upload relevant codes to each node.

Then, log into the master (head) node, switch to the working directory (see upload_codes.sh), and type

python main_train.py --env wheeled --alg tesp

Local Mode

You need to uncomment ray.init() in main_train.py.

Cite

Please cite our paper if you use this code in your own work:

@inproceedings{lan2019metarl,
  title={Meta Reinforcement Learning with Task Embedding and Shared Policy},
  author={Lan, Lin and Li, Zhenguo and Guan, Xiaohong and Wang, Pinghui},
  booktitle={IJCAI},
  year={2019}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].