All Projects → awjuliani → Meta Rl

awjuliani / Meta Rl

Licence: mit
Implementation of Meta-RL A3C algorithm

Projects that are alternatives of or similar to Meta Rl

Popular Rl Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Stars: ✭ 266 (-25.07%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Dinoruntutorial
Accompanying code for Paperspace tutorial "Build an AI to play Dino Run"
Stars: ✭ 285 (-19.72%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-24.51%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Rl learn
我的强化学习笔记和学习材料📖 still updating ... ...
Stars: ✭ 234 (-34.08%)
Mutual labels:  jupyter-notebook, reinforcement-learning
A3c trading
Trading with recurrent actor-critic reinforcement learning
Stars: ✭ 305 (-14.08%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Aleph star
Reinforcement learning with A* and a deep heuristic
Stars: ✭ 235 (-33.8%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Trading Bot
Stock Trading Bot using Deep Q-Learning
Stars: ✭ 273 (-23.1%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Icychesszero
中国象棋alpha zero程序
Stars: ✭ 206 (-41.97%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Gdrl
Grokking Deep Reinforcement Learning
Stars: ✭ 304 (-14.37%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Baby Steps Of Rl Ja
Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード
Stars: ✭ 302 (-14.93%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Nn
🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Stars: ✭ 5,720 (+1511.27%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Youtube Code Repository
Repository for most of the code from my YouTube channel
Stars: ✭ 317 (-10.7%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Applied Reinforcement Learning
Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (-35.49%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Rad
RAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-24.51%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (-37.46%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Rlquant
Applying Reinforcement Learning in Quantitative Trading
Stars: ✭ 271 (-23.66%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-43.1%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Rl Tutorial Jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Stars: ✭ 204 (-42.54%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Tensorwatch
Debugging, monitoring and visualization for Python Machine Learning and Data Science
Stars: ✭ 3,191 (+798.87%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Reinforcement Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Stars: ✭ 3,329 (+837.75%)
Mutual labels:  jupyter-notebook, reinforcement-learning

Meta-RL

Tensorflow implementation of Meta-RL A3C algorithm taken from Learning to Reinforcement Learn. For more information, as well as explainations of each of the experiments, see my corresponding Medium post. A3C is built from previous implementation available here.

Contains iPython notebooks for:

  • A3C-Meta-Bandit - Set of bandit tasks described in paper. Including: Independent, Dependent, and Restless bandits.
  • A3C-Meta-Context - Rainbow bandit task using randomized colors to indicate reward-giving arm in each episode.
  • A3C-Meta-Grid - Rainbow Gridworld task; a variation of gridworld in which goal colors are randomzied each episode and must be learned "on the fly."
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].