All Projects → RLOpensource → Relational_Deep_Reinforcement_Learning

RLOpensource / Relational_Deep_Reinforcement_Learning

Licence: other
No description or website provided.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Relational Deep Reinforcement Learning

reinforcement learning ppo rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
Stars: ✭ 33 (-25%)
Mutual labels:  proximal-policy-optimization, ppo
Reinforcement Learning With Tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Stars: ✭ 6,948 (+15690.91%)
Mutual labels:  proximal-policy-optimization, ppo
Awesome-Vision-Transformer-Collection
Variants of Vision Transformer and its downstream tasks
Stars: ✭ 124 (+181.82%)
Mutual labels:  explainable-ai, self-attention
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+5881.82%)
Mutual labels:  proximal-policy-optimization, ppo
ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Stars: ✭ 83 (+88.64%)
Mutual labels:  proximal-policy-optimization, ppo
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (+111.36%)
Mutual labels:  proximal-policy-optimization, ppo
Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
Stars: ✭ 484 (+1000%)
Mutual labels:  explainable-ai
mllp
The code of AAAI 2020 paper "Transparent Classification with Multilayer Logical Perceptrons and Random Binarization".
Stars: ✭ 15 (-65.91%)
Mutual labels:  explainable-ai
rl trading
No description or website provided.
Stars: ✭ 14 (-68.18%)
Mutual labels:  ppo
concept-based-xai
Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI
Stars: ✭ 41 (-6.82%)
Mutual labels:  explainable-ai
CrabNet
Predict materials properties using only the composition information!
Stars: ✭ 57 (+29.55%)
Mutual labels:  self-attention
trading gym
a unified environment for supervised learning and reinforcement learning in the context of quantitative trading
Stars: ✭ 36 (-18.18%)
Mutual labels:  ppo
responsible-ai-toolbox
This project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.
Stars: ✭ 615 (+1297.73%)
Mutual labels:  explainable-ai
dlime experiments
In this work, we propose a deterministic version of Local Interpretable Model Agnostic Explanations (LIME) and the experimental results on three different medical datasets shows the superiority for Deterministic Local Interpretable Model-Agnostic Explanations (DLIME).
Stars: ✭ 21 (-52.27%)
Mutual labels:  explainable-ai
transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
Stars: ✭ 861 (+1856.82%)
Mutual labels:  explainable-ai
VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Stars: ✭ 66 (+50%)
Mutual labels:  self-attention
FUSION
PyTorch code for NeurIPSW 2020 paper (4th Workshop on Meta-Learning) "Few-Shot Unsupervised Continual Learning through Meta-Examples"
Stars: ✭ 18 (-59.09%)
Mutual labels:  self-attention
hierarchical-dnn-interpretations
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
Stars: ✭ 110 (+150%)
Mutual labels:  explainable-ai
LWDRLC
Lightweight deep RL Libraray for continuous control.
Stars: ✭ 14 (-68.18%)
Mutual labels:  ppo
meg
Molecular Explanation Generator
Stars: ✭ 14 (-68.18%)
Mutual labels:  explainable-ai

Implementation of Relational Deep Reinforcement Learning

This Repository is implementation of Relational Deep Reinforcement Learning to Breakout Environment.

The Reinforcement Learning Algorithm is Proximal Policy Optimization

Configuration

  • This paper requires heavy computation power.
  • Left Figure is the map of attention which is produced by self-attention.
  • Though the paper developed 100 environments for experiment, the implementer of this repository created only 16 environments with the limitation of computer resources. So sometimes it's exactly the performance and sometimes it's not.
  • If you want to see more significant attention map, just control CNN function to have less strides and more filters. In this repository, 84, 84 images are processed to have 19, 19 because of my computation limit.

Initial Training status

During Training

Tensorboard

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].