Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → llan-ml → tesp

llan-ml / tesp

Licence: other

Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"

Programming Languages

139335 projects - #7 most used programming language

75241 projects

Labels

reinforcement-learning meta-learning meta-rl tesp meta-reinforcement

Projects that are alternatives of or similar to tesp

Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.

Stars: ✭ 16 (-42.86%)

Mutual labels: meta-learning, meta-rl, meta-reinforcement

Code for FOCAL Paper Published at ICLR 2021

Stars: ✭ 35 (+25%)

Mutual labels: meta-learning, meta-rl

My personal toolkit for PyTorch development.

Stars: ✭ 124 (+342.86%)

Mutual labels: meta-learning

Meta Learning Papers

Meta Learning / Learning to Learn / One Shot Learning / Few Shot Learning

Stars: ✭ 2,420 (+8542.86%)

Mutual labels: meta-learning

ProMP: Proximal Meta-Policy Search

Stars: ✭ 181 (+546.43%)

Mutual labels: meta-learning

The code for paper "CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning"

Stars: ✭ 135 (+382.14%)

Mutual labels: meta-learning

Meta Learning Papers

A classified list of meta learning papers based on realm.

Stars: ✭ 193 (+589.29%)

Mutual labels: meta-learning

Meta R-CNN : Towards General Solver for Instance-level Low-shot Learning

Stars: ✭ 120 (+328.57%)

Mutual labels: meta-learning

Awesome Real World Rl

Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.

Stars: ✭ 234 (+735.71%)

Mutual labels: meta-learning

Meta-Transfer Learning for Zero-Shot Super-Resolution (CVPR, 2020)

Stars: ✭ 181 (+546.43%)

Mutual labels: meta-learning

Code for the paper "Evolved Policy Gradients"

Stars: ✭ 204 (+628.57%)

Mutual labels: meta-learning

Metalearning4nlp Papers

A list of recent papers about Meta / few-shot learning methods applied in NLP areas.

Stars: ✭ 163 (+482.14%)

Mutual labels: meta-learning

Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)

Stars: ✭ 135 (+382.14%)

Mutual labels: meta-learning

Python module to interface with OpenML

Stars: ✭ 202 (+621.43%)

Mutual labels: meta-learning

Learning Meta Face Recognition in Unseen Domains, CVPR, Oral, 2020

Stars: ✭ 127 (+353.57%)

Mutual labels: meta-learning

Mini Imagenet Tools

Tools for generating mini-ImageNet dataset and processing batches

Stars: ✭ 209 (+646.43%)

Mutual labels: meta-learning

Meta Learning Lstm Pytorch

pytorch implementation of Optimization as a Model for Few-shot Learning

Stars: ✭ 121 (+332.14%)

Mutual labels: meta-learning

Meta Weight Net

NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).

Stars: ✭ 158 (+464.29%)

Mutual labels: meta-learning

A hyperparameter optimization and data collection toolbox for convenient and fast prototyping of machine-learning models.

Stars: ✭ 182 (+550%)

Mutual labels: meta-learning

Code for "One-Shot Visual Imitation Learning via Meta-Learning"

Stars: ✭ 254 (+807.14%)

Mutual labels: meta-learning

View All Similar Projects ➔

Meta Reinforcement Learning

This repository contains the TensorFlow implementation of our paper Meta Reinforcement Learning with Task Embedding and Shared Policy (IJCAI 2019).

For detailed settings of the environments and experiments, please refer to the supplementary material.

We also re-implement the following methods:

MAML for RL
- Agent Class
- paper
Meta-SGD for RL
- Agent Class
- paper
MAESN
- Agent Class
- paper
- Note that here we use a latent variable instead of a latent distribution.

Requirements

python==3.6.5
tensorflow>=1.11,<2.0
ray>=0.6 (we used this version in the experiments, but any version ray>=0.6 should work)
gym>=0.10
mujoco-py==1.50.1.68

We recommend installing Anaconda before installing other dependencies.

Usage

Preparation

We provide several samples of bash scripts to ease operations on a Ray cluster:

env.sh: Declare the configuration of Ray cluster.
update_ray_codes.sh: Replace with our slightly modified RLlib and Tune.
exec_commands.sh: Make some directories and (optionally) install ray on each node.
ray_start.sh: Start a Ray cluster.
ray_stop.sh: Stop a Ray cluster.
upload_codes.sh: Upload the training code to each node of a Ray cluster.

Run

Distributed Mode

You need to first launch a Ray cluster declared in env.sh and upload relevant codes to each node.

Then, log into the master (head) node, switch to the working directory (see upload_codes.sh), and type

python main_train.py --env wheeled --alg tesp

Local Mode

You need to uncomment ray.init() in main_train.py.

Cite

Please cite our paper if you use this code in your own work:

@inproceedings{lan2019metarl,
  title={Meta Reinforcement Learning with Task Embedding and Shared Policy},
  author={Lan, Lin and Li, Zhenguo and Guan, Xiaohong and Wang, Pinghui},
  booktitle={IJCAI},
  year={2019}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 28

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗