rtrlPyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)
wax-mlA Python library for machine-learning and feedback loops on streaming data
AtariAlgos.jlArcade Learning Environment (ALE) wrapped as a Reinforce.jl environment
distributed rlPytorch implementation of distributed deep reinforcement learning
XWorldA C++/Python simulator package for reinforcement learning
carla-rlReinforcement Learning Agents Trained in the CARLA Simulator
drl graspingDeep Reinforcement Learning for Robotic Grasping from Octrees
magicalThe MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)
HebbianMetaLearningMeta-Learning through Hebbian Plasticity in Random Networks: https://arxiv.org/abs/2007.02686
recsim ngRecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Deep-Learning-Mahjong---Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game states
flatland-challenge-starter-kit⚠️ NOTICE: This starter kit was used for 2019 challenge and has been deprecated in favour of 2020 Flatland challenge's starter kit present here
muzeroA clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
RLReinforcement Learning Demos
deep-tic-tac-toeUsed deep reinforcement learning to train a deep neural network to play tic-tac-toe and deployed using tensorflow.js.
TensorTradeThis repository hosts all my code related to TensorTrade. It consists of the main program, its old versions, and some extras for more insights.
CARE-GNNCode for CIKM 2020 paper Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
bindsnetSimulation of spiking neural networks (SNNs) using PyTorch.
DI-smartcrossDecision Intelligence platform for Traffic Crossing Signal Control
gdcCode for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation"
rl-lang-groundTensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/abs/1804.08454
DeepLaetitiaDeep Reinforcement Learning that makes you smile
cs294-112 hwsMy solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning
FleetSimEvent-based Simulation for Electric Vehicle Fleets
ml-aiML-AI Community | Open Source | Built in Bharat for the World | Data science problem statements and solutions
covid-xprizeOpen-source repository containing examples and documentation for the Cognizant XPRIZE Pandemic Response Challenge
cpprbFast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)
sutton-barto-rl-exercises📖Learning reinforcement learning by implementing the algorithms from reinforcement learning an introduction
ShinRLShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)
SelSumAbstractive opinion summarization system (SelSum) and the largest dataset of Amazon product summaries (AmaSum). EMNLP 2021 conference paper.
DQN-AtariDeep Q-Learning (DQN) implementation for Atari pong.
DacKGRSource codes and datasets for EMNLP 2020 paper "Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph"
Pytorch-RL-CPPA Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
alpha sigmaA pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
distributedRLA framework for easy prototyping of distributed reinforcement learning algorithms
cogment-verseLibrary of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning & Reinforcement Learning
CorailedUnrailed! simulator using C++ with some reinforcement learning and Unrailed! AI using Python with OpenCV
kuka rlReinforcement Learning Experiments using PyBullet
banditsComparison of bandit algorithms from the Reinforcement Learning bible.
gyxReinforcement Learning environment for Elixir