Pytorch-PCGradPytorch reimplementation for "Gradient Surgery for Multi-Task Learning"
SeaPearl.jlJulia hybrid constraint programming solver enhanced by a reinforcement learning driven search.
CloudSimPyCloudSimPy: Datacenter job scheduling simulation framework
CDS[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
MetaGymCollection of Reinforcement Learning / Meta Reinforcement Learning Environments.
RASAISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
Deep-Learning-PytorchA repo containing code covering various aspects of deep learning on Pytorch. Great for beginners and intermediate in the field
mpoPyTorch Implementation of the Maximum a Posteriori Policy Optimisation
MI-MVI 2016Semestral project for the subject Methods of computational inteligence @ fit.cvut.cz
Machine-Learning-ModelsIn This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
cups-rlCustomisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated Attention multi-modal fusion) for Task-Oriented Language Grounding (tasks specified by natural language instructions) e.g. "Pick up the Cup or else"
alphazeroBoard Game Reinforcement Learning using AlphaZero method. including Makhos (Thai Checkers), Reversi, Connect Four, Tic-tac-toe game rules
reinforcement learning ppo rndDeep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
cytonRLreinforcement learning, deep Q-network, double DQN, dueling DQN, prioritized experience replay
offline rlPytorch implementation of state-of-the-art offline reinforcement learning algorithms.
irokoA platform to test reinforcement learning policies in the datacenter setting.
contextualContextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
yz-ai.github.ioDerin Öğrenme, Bilgisayarlı Görü, Doğal Dil İşleme ve Pekiştirmeli Öğrenme alanlarındaki uygulama, akademik yayınlar, eğitici kaynaklar ve blog yazılarını bulacağınız yapay öğrenme platformu.
ml galleryThis is a master project of some experiments with Neural Networks. Every project here is runnable, visualized and explained clearly.
pytorch-gymImplementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch
frozenlakeValue & Policy Iteration for the frozenlake environment of OpenAI
DeepCubeACode for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.
RLNo description or website provided.
Multiagent-RLMultiagent reinforcement learning simulation framework - Undergraduate thesis in Mechatronics Engineering at the University of Brasília
gym-RAn R package providing access to the OpenAI Gym API
LearnSnake🐍 AI that learns to play Snake using Q-Learning (Reinforcement Learning)
AlphaGo.jlAlphaGo Zero implementation using Flux.jl
deep-blueberryIf you've always wanted to learn about deep-learning but don't know where to start, then you might have stumbled upon the right place!
gym-mtsimA general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)
rlqpAccelerating Quadratic Optimization with Reinforcement Learning
FlashRLNo description or website provided.
FOCAL-ICLRCode for FOCAL Paper Published at ICLR 2021
marioSuper Mario Reinforcement Learning from Demonstration
Fruit-APIA Universal Deep Reinforcement Learning Framework
PGMORL[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
poke.AIAn experimental AI that plays the 3rd gen Pokemon games - Winner of Judge's Choice Award for NUS Orbital Project
nips rlCode for NIPS 2017 learning to run challenge
jax-rlJAX implementations of core Deep RL algorithms