awjuliani / Meta Rl

License: MIT

Implementation of Meta-RL A3C algorithm

Labels

jupyter-notebook tensorflow reinforcement-learning

Projects that are alternatives to or similar to Meta Rl

Popular Rl Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Stars: ✭ 266 (-25.07%)

Mutual labels: jupyter-notebook, reinforcement-learning

Dinoruntutorial

Accompanying code for Paperspace tutorial "Build an AI to play Dino Run"

Stars: ✭ 285 (-19.72%)

Mutual labels: jupyter-notebook, reinforcement-learning

DrQ: Data regularized Q

Stars: ✭ 268 (-24.51%)

Mutual labels: jupyter-notebook, reinforcement-learning

My reinforcement learning notes and study materials 📖 (still updating)

Stars: ✭ 234 (-34.08%)

Mutual labels: jupyter-notebook, reinforcement-learning

Trading with recurrent actor-critic reinforcement learning

Stars: ✭ 305 (-14.08%)

Mutual labels: jupyter-notebook, reinforcement-learning

Reinforcement learning with A* and a deep heuristic

Stars: ✭ 235 (-33.8%)

Mutual labels: jupyter-notebook, reinforcement-learning

Stock Trading Bot using Deep Q-Learning

Stars: ✭ 273 (-23.1%)

Mutual labels: jupyter-notebook, reinforcement-learning

AlphaZero program for Chinese chess (Xiangqi)

Stars: ✭ 206 (-41.97%)

Mutual labels: jupyter-notebook, reinforcement-learning

Grokking Deep Reinforcement Learning

Stars: ✭ 304 (-14.37%)

Mutual labels: jupyter-notebook, reinforcement-learning

Baby Steps Of Rl Ja

Sample code for "Reinforcement Learning with Python: From Introduction to Practice"

Stars: ✭ 302 (-14.93%)

Mutual labels: jupyter-notebook, reinforcement-learning

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Stars: ✭ 5,720 (+1511.27%)

Mutual labels: jupyter-notebook, reinforcement-learning

Youtube Code Repository

Repository for most of the code from my YouTube channel

Stars: ✭ 317 (-10.7%)

Mutual labels: jupyter-notebook, reinforcement-learning

Applied Reinforcement Learning

Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks

Stars: ✭ 229 (-35.49%)

Mutual labels: jupyter-notebook, reinforcement-learning

RAD: Reinforcement Learning with Augmented Data

Stars: ✭ 268 (-24.51%)

Mutual labels: jupyter-notebook, reinforcement-learning

Machine Learning Notebooks

Machine Learning notebooks for refreshing concepts.

Stars: ✭ 222 (-37.46%)

Mutual labels: jupyter-notebook, reinforcement-learning

Applying Reinforcement Learning in Quantitative Trading

Stars: ✭ 271 (-23.66%)

Mutual labels: jupyter-notebook, reinforcement-learning

Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout

Stars: ✭ 202 (-43.1%)

Mutual labels: jupyter-notebook, reinforcement-learning

Rl Tutorial Jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Stars: ✭ 204 (-42.54%)

Mutual labels: jupyter-notebook, reinforcement-learning

Debugging, monitoring and visualization for Python Machine Learning and Data Science

Stars: ✭ 3,191 (+798.87%)

Mutual labels: jupyter-notebook, reinforcement-learning

Reinforcement Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Stars: ✭ 3,329 (+837.75%)

Mutual labels: jupyter-notebook, reinforcement-learning

Meta-RL

TensorFlow implementation of the Meta-RL A3C algorithm from the paper "Learning to Reinforcement Learn". For more information, as well as explanations of each of the experiments, see my corresponding Medium post. The A3C code is built from a previous implementation, available here.

Contains IPython notebooks for:

A3C-Meta-Bandit - Set of bandit tasks described in the paper, including independent, dependent, and restless bandits.
A3C-Meta-Context - Rainbow bandit task that uses randomized colors to indicate the reward-giving arm in each episode.
A3C-Meta-Grid - Rainbow Gridworld task; a variation of gridworld in which goal colors are randomized each episode and must be learned "on the fly."
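
To make the bandit setting more concrete, here is a minimal sketch of a dependent two-armed bandit whose hidden reward probabilities are redrawn every episode. It is not taken from the notebooks: the names `DependentBandit`, `run_episode`, and `win_stay_lose_shift` are hypothetical, and the rule that the two arm probabilities sum to 1 is an assumption about what "dependent" means here. The policy is given the previous action, previous reward, and timestep, the kind of signal an agent needs to adapt within an episode. A hand-written baseline stands in for the learned A3C agent.

```python
import random


class DependentBandit:
    """Two-armed bandit whose arm reward probabilities sum to 1 (illustrative sketch)."""

    def __init__(self, seed=None):
        self.rng = random.Random(seed)
        self.probs = (0.5, 0.5)
        self.reset()

    def reset(self):
        # Draw a new hidden reward probability for arm 0; arm 1 gets the complement.
        p = self.rng.random()
        self.probs = (p, 1.0 - p)
        return self.probs

    def pull(self, arm):
        # Return 1 with the chosen arm's hidden probability, otherwise 0.
        return 1 if self.rng.random() < self.probs[arm] else 0


def run_episode(bandit, policy, trials=100):
    """Play one episode; the policy sees the previous action, reward, and timestep."""
    bandit.reset()
    prev_action, prev_reward, total = None, 0, 0
    for t in range(trials):
        action = policy(prev_action, prev_reward, t)
        reward = bandit.pull(action)
        total += reward
        prev_action, prev_reward = action, reward
    return total


def win_stay_lose_shift(prev_action, prev_reward, t):
    # Hand-written baseline (assumption), not the learned A3C agent.
    if prev_action is None:
        return 0
    return prev_action if prev_reward == 1 else 1 - prev_action


if __name__ == "__main__":
    bandit = DependentBandit(seed=0)
    print(run_episode(bandit, win_stay_lose_shift))
```

Because the arm probabilities change at every `reset`, a fixed choice of arm cannot do well across episodes; the policy has to infer the better arm from the rewards it observes within the current episode.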

Stars: ✭ 355
