A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

✭ 126

Jupyter Notebook python reinforcement-learning deep-learning tensorflow deep-reinforcement-learning tf2 mcts alphazero tensorflow2 muzero

Reinforcement Learning Demos

✭ 66

python reinforcement-learning deep-reinforcement-learning q-learning vrep sarsa

neat-openai-gym

NEAT for Reinforcement Learning on the OpenAI Gym

✭ 19

python reinforcement-learning ai neat openai-gym neuroevolution neat-python

neptune-mlflow

Neptune integration with MLflow

✭ 27

python Makefile platform data-science machine-learning reinforcement-learning deep-learning machine-learning-platform mlflow neptune-platform mlflow2 neptune-mlflow neptune-community-spectrum mlflow-neptune

Stock-Trading-Using-Machine-Learning

A comprehensive approach for stock trading implemented using Neural Network and Reinforcement Learning separately.

✭ 20

python reinforcement-learning neural-network pca-analysis data-preprocessing

deep-tic-tac-toe

Used deep reinforcement learning to train a deep neural network to play tic-tac-toe and deployed using tensorflow.js.

✭ 52

Jupyter Notebook HTML machine-learning reinforcement-learning neural-network keras convolutional-neural-networks tensorflow-js

TensorTrade

This repository hosts all my code related to TensorTrade. It consists of the main program, its old versions, and some extras for more insights.

✭ 16

python benchmark crypto reinforcement-learning tensorflow heatmap cryptocurrency automated-trading binance automated-trading-bot tensortrade

CARE-GNN

Code for CIKM 2020 paper Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters

✭ 121

python security machine-learning reinforcement-learning deep-learning fraud-prevention datamining fraud-detection graphneuralnetwork

alphaFive

alphaGo版本的五子棋(gobang, gomoku)

✭ 51

python reinforcement-learning tensorflow gomoku gobang alphago alphago-zero alphazero

VREP-RL-bot

Reinforcement Learning in Vrep

✭ 14

python OpenEdge ABL reinforcement-learning tensorflow keras q-learning vrep reinforcement-learning-algorithms

bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

✭ 34

python Dockerfile machine-learning reinforcement-learning pytorch spiking-neural-networks

CartPole

Run OpenAI Gym on a Server

✭ 16

python Jupyter Notebook aws reinforcement-learning keras openai-gym openai gym cartpole rl

DI-smartcross

Decision Intelligence platform for Traffic Crossing Signal Control

✭ 114

python shell reinforcement-learning traffic-signal-control traffic-light-control

gdc

Code for the ICLR 2021 paper "A Distributional Approach to Controlled Text Generation"

✭ 94

python nlp machine-learning reinforcement-learning ai nlg language-model exponential-family fairness-ml information-geometry gpt-2 gpt3 controlled-nlg

rl-lang-ground

Tensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/abs/1804.08454

✭ 17

python reinforcement-learning computer-vision deep-learning tensorflow

DeepLaetitia

Deep Reinforcement Learning that makes you smile

✭ 15

Mathematica deep-neural-networks reinforcement-learning computer-vision

pytorch-rl

Pytorch Implementation of RL algorithms

✭ 15

python reinforcement-learning deep-learning openai-gym pytorch rl-agents artificial-intelligence dqn reinforcement-learning-algorithms ddpg

handson-ml

도서 "핸즈온 머신러닝"의 예제와 연습문제를 담은 주피터 노트북입니다.

✭ 285

machine-learning deep-neural-networks reinforcement-learning deep-learning neural-network random-forest tensorflow svm scikit-learn recurrent-neural-networks xgboost autoencoder ensemble-learning gradient-boosting

cs294-112 hws

My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning

✭ 91

python TeX shell reinforcement-learning tensorflow cs294-112

FleetSim

Event-based Simulation for Electric Vehicle Fleets

✭ 21

Jupyter Notebook python reinforcement-learning simulation electric-vehicles simpy event-based

ml-ai

ML-AI Community | Open Source | Built in Bharat for the World | Data science problem statements and solutions

covid-xprize

Open-source repository containing examples and documentation for the Cognizant XPRIZE Pandemic Response Challenge

✭ 36

python Jupyter Notebook Dockerfile challenge machine-learning reinforcement-learning ai optimization artificial-intelligence response epidemics pandemic xprize covid-19 covid covid19 x-prize competition-guidelines

cpprb

Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)

✭ 52

python cython C++CSS HTML Dockerfile machine-learning reinforcement-learning

sutton-barto-rl-exercises

📖Learning reinforcement learning by implementing the algorithms from reinforcement learning an introduction

✭ 77

Jupyter Notebook python machine-learning reinforcement-learning supervised-learning unsupervised-learning sutton barto

ShinRL

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)

✭ 30

Jupyter Notebook python reinforcement-learning jax

SelSum

Abstractive opinion summarization system (SelSum) and the largest dataset of Amazon product summaries (AmaSum). EMNLP 2021 conference paper.

✭ 36

python shell natural-language-processing reinforcement-learning deep-learning amazon summarization opinion-mining variational-inference natural-language-understanding

DQN-Atari

Deep Q-Learning (DQN) implementation for Atari pong.

✭ 53

python machine-learning reinforcement-learning pong pytorch dqn atari dqn-pytorch

Relational Deep Reinforcement Learning

No description or website provided.

✭ 44

python reinforcement-learning tensorflow relational-networks proximal-policy-optimization ppo explainable-ai self-attention

ReinventCommunity

No description or website provided.

✭ 103

Jupyter Notebook python reinforcement-learning cheminformatics jupyter-notebook neural-networks transfer-learning denovo-design astrazeneca

DacKGR

Source codes and datasets for EMNLP 2020 paper "Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph"

✭ 26

Jupyter Notebook python shell reinforcement-learning knowledge-graph-reasoning multi-hop-reasoning

Pytorch-RL-CPP

A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)

✭ 73

C++CMake machine-learning google deep-neural-networks reinforcement-learning deep-learning robotics tensorflow keras openai-gym pytorch gan openai gym vae reinforcement-learning-algorithms atari deepmind mujoco

alpha sigma

A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.

✭ 134

python reinforcement-learning deep-learning deep-reinforcement-learning pytorch gomoku monte-carlo-tree-search gomoku-game pytorch-rl alphazero

sinkhorn-policy-gradient.pytorch

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

✭ 36

python shell reinforcement-learning deep-learning combinatorial-optimization permutation-algorithms contextual-bandits

distributedRL

A framework for easy prototyping of distributed reinforcement learning algorithms

✭ 93

python reinforcement-learning zeromq dqn ray distributed-reinforcement-learning ape-x

vlainic.github.io

My GitHub blog: things you might be interested, and probably not...

✭ 26

blog docker education data-science machine-learning reinforcement-learning ai machine-learning-algorithms prediction datascience aws-ec2 nlp-machine-learning prediction-algorithm prediction-model

cogment-verse

Library of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning & Reinforcement Learning

✭ 26

python javascript shell typescript CSS HTML reinforcement-learning human-in-the-loop-learning cogment

Corailed

Unrailed! simulator using C++ with some reinforcement learning and Unrailed! AI using Python with OpenCV

✭ 15

python reinforcement-learning rl python-api rl-environment simulator-game unrailed

使用深度强化学习解决视觉跟踪和视觉导航问题

✭ 16

matlab python java shell c Batchfile reinforcement-learning computer-vision deep-learning navigation deep-reinforcement-learning autodrive

spore-nest-module

Synaptic Plasticity with Online Reinforcement learning

✭ 24

C++python CMake shell music reinforcement-learning nest spiking-neural-networks nest-module

kuka rl

Reinforcement Learning Experiments using PyBullet

✭ 65

Jupyter Notebook reinforcement-learning deep-learning pytorch kuka controls grasping pybullet

bandits

Comparison of bandit algorithms from the Reinforcement Learning bible.

✭ 16

python machine-learning reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-agent sutton-book

gyx

Reinforcement Learning environment for Elixir

✭ 20

elixir python Dockerfile reinforcement-learning artificial-intelligence deep-q-learning dopamine-rl

721-780 of 962 reinforcement-learning projects

first

‹

›