RLOpensource / Relational_Deep_Reinforcement_Learning

Licence: other

No description or website provided.

Programming Languages

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Relational Deep Reinforcement Learning

Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation

Stars: ✭ 33 (-25%)

Mutual labels: proximal-policy-optimization, ppo

Reinforcement Learning With Tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Stars: ✭ 6,948 (+15690.91%)

Mutual labels: proximal-policy-optimization, ppo

Awesome-Vision-Transformer-Collection

Variants of Vision Transformer and its downstream tasks

Stars: ✭ 124 (+181.82%)

Mutual labels: explainable-ai, self-attention

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+5881.82%)

Mutual labels: proximal-policy-optimization, ppo

ppo-pytorch

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

Stars: ✭ 83 (+88.64%)

Mutual labels: proximal-policy-optimization, ppo

imitation learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Stars: ✭ 93 (+111.36%)

Mutual labels: proximal-policy-optimization, ppo

Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Stars: ✭ 484 (+1000%)

Mutual labels: explainable-ai

mllp

The code of AAAI 2020 paper "Transparent Classification with Multilayer Logical Perceptrons and Random Binarization".

Stars: ✭ 15 (-65.91%)

Mutual labels: explainable-ai

rl trading

No description or website provided.

Stars: ✭ 14 (-68.18%)

Mutual labels: ppo

concept-based-xai

Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI

Stars: ✭ 41 (-6.82%)

Mutual labels: explainable-ai

CrabNet

Predict materials properties using only the composition information!

Stars: ✭ 57 (+29.55%)

Mutual labels: self-attention

trading gym

a unified environment for supervised learning and reinforcement learning in the context of quantitative trading

Stars: ✭ 36 (-18.18%)

Mutual labels: ppo

responsible-ai-toolbox

This project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.

Stars: ✭ 615 (+1297.73%)

Mutual labels: explainable-ai

dlime experiments

In this work, we propose a deterministic version of Local Interpretable Model Agnostic Explanations (LIME) and the experimental results on three different medical datasets shows the superiority for Deterministic Local Interpretable Model-Agnostic Explanations (DLIME).

Stars: ✭ 21 (-52.27%)

Mutual labels: explainable-ai

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Stars: ✭ 861 (+1856.82%)

Mutual labels: explainable-ai

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stars: ✭ 66 (+50%)

Mutual labels: self-attention

FUSION

PyTorch code for NeurIPSW 2020 paper (4th Workshop on Meta-Learning) "Few-Shot Unsupervised Continual Learning through Meta-Examples"

Stars: ✭ 18 (-59.09%)

Mutual labels: self-attention

hierarchical-dnn-interpretations

Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)

Stars: ✭ 110 (+150%)

Mutual labels: explainable-ai

LWDRLC

Lightweight deep RL Libraray for continuous control.

Stars: ✭ 14 (-68.18%)

Mutual labels: ppo

meg

Molecular Explanation Generator

Stars: ✭ 14 (-68.18%)

Mutual labels: explainable-ai

View All Similar Projects ➔

Implementation of Relational Deep Reinforcement Learning

This Repository is implementation of Relational Deep Reinforcement Learning to Breakout Environment.

The Reinforcement Learning Algorithm is Proximal Policy Optimization

Configuration

This paper requires heavy computation power.
Left Figure is the map of attention which is produced by self-attention.
Though the paper developed 100 environments for experiment, the implementer of this repository created only 16 environments with the limitation of computer resources. So sometimes it's exactly the performance and sometimes it's not.
If you want to see more significant attention map, just control CNN function to have less strides and more filters. In this repository, 84, 84 images are processed to have 19, 19 because of my computation limit.

Initial Training status

During Training

Tensorboard

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

RLOpensource / Relational_Deep_Reinforcement_Learning

Programming Languages

Labels

Projects that are alternatives of or similar to Relational Deep Reinforcement Learning

Implementation of Relational Deep Reinforcement Learning

Configuration

Initial Training status

During Training

Tensorboard