tmrlTrackMania 2020 through RL
ppo-pytorchProximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
RCNN MDPCode base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.
SARNetCode repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)
CommNetan implementation of CommNet
policy-gradient-pongtensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/
inv rlInverse Reinforcement Learning Argorithms
DOM-Q-NETGraph-based Deep Q Network for Web Navigation
EgoPoseOfficial PyTorch Implementation of "Ego-Pose Estimation and Forecasting as Real-Time PD Control". ICCV 2019.
ReinLifeCreating Artificial Life with Reinforcement Learning
HiLAPCode for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019
geometry-dexPyTorch Code for "Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning"
reinforce-js[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
TiKickLearning-based agent for Google Research Football
SmartTrafficIntersectionAnother AI toy project, of a traffic intersection controlled by a Reinforcement Learning AI agent to optimize traffic flow in an intersection of vehicles or pedestrians
taxiHierarchical Online Planning and Reinforcement Learning on Taxi
POMDPImplementing a RL algorithm based upon a partially observable Markov decision process.
RL-2018Reinforcement Learning at UCLA IPAM RIPS 2018
pytorchrlDeep Reinforcement Learning algorithms implemented in PyTorch
alpha-zeroAlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
tetrisRLA Tetris environment to train machine learning agents
CQLConservative Q Learning on top of SAC
reinforced-raceA model car learns driving along a track using reinforcement learning
2048-GymThis projects aims to use reinforcement learning algorithms to play the game 2048.
pymarl2Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
rlGeneric reinforcement learning codebase in TensorFlow
robotic-warehouseMulti-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
megaverseHigh-throughput simulation platform for Artificial Intelligence reseach
ADL2019Applied Deep Learning (2019 Spring) @ NTU
RelevancyTuningDice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon Hughes Dice.com
Deep-QLearning-Demo-csharpThis demo is a C# port of ConvNetJS RLDemo (https://cs.stanford.edu/people/karpathy/convnetjs/demo/rldemo.html) by Andrej Karpathy
sharpesharpe is a unified, interactive, general-purpose environment for backtesting or applying machine learning(supervised learning and reinforcement learning) in the context of quantitative trading
l2rOpen-source reinforcement learning environment for autonomous racing.
maml-rl-tf2Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.
glossaryhttps://machinelearning.wtf/ - An online glossary of machine learning terms.
qbso-fsPython implementation of QBSO-FS : a Reinforcement Learning based Bee Swarm Optimization metaheuristic for Feature Selection problem.
ML-Papers-TLDRA summary of interesting Machine Learning (mostly Deep Learning) papers that I encounter.
Point-Then-OperateCode for the ACL 2019 paper ``A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer``
maze solverThis project solves self-made maze in a variety of ways: A-star, Q-learning and Deep Q-network.
relearnA Reinforcement Learning Library for C++11/14
gym-advGym environments modified with adversarial agents
braxMassively parallel rigidbody physics simulation on accelerator hardware.
MuJoCo RL UR5A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.