Alphazero gomokuAn implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (-1.8%)
alpha-zeroAlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Stars: ✭ 68 (-97.4%)
alphaFivealphaGo版本的五子棋(gobang, gomoku)
Stars: ✭ 51 (-98.05%)
alpha sigmaA pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Stars: ✭ 134 (-94.88%)
Chess Alpha ZeroChess reinforcement learning by AlphaGo Zero methods.
Stars: ✭ 1,868 (-28.62%)
alphazeroBoard Game Reinforcement Learning using AlphaZero method. including Makhos (Thai Checkers), Reversi, Connect Four, Tic-tac-toe game rules
Stars: ✭ 24 (-99.08%)
alphastoneUsing self-play, MCTS, and a deep neural network to create a hearthstone ai player
Stars: ✭ 24 (-99.08%)
UCThelloUCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Stars: ✭ 26 (-99.01%)
ElfELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Stars: ✭ 3,240 (+23.81%)
Practical rlA course in reinforcement learning in the wild
Stars: ✭ 4,741 (+81.16%)
Rl BookSource codes for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (-82.27%)
Ml MiptOpen Machine Learning course at MIPT
Stars: ✭ 480 (-81.66%)
Amazon Sagemaker ExamplesExample 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Stars: ✭ 6,346 (+142.49%)
Machine Learning Curriculum💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (-70.92%)
ChainerrlChainerRL is a deep reinforcement learning library built on top of Chainer.
Stars: ✭ 931 (-64.42%)
NotebooksSome notebooks
Stars: ✭ 53 (-97.97%)
World Models Sonic PytorchAttempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more work needed
Stars: ✭ 27 (-98.97%)
Reinforcement LearningImplementation of Reinforcement Learning algorithms in Python, based on Sutton's & Barto's Book (Ed. 2)
Stars: ✭ 55 (-97.9%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-82.65%)
Tensor HouseA collection of reference machine learning and optimization models for enterprise operations: marketing, pricing, supply chain
Stars: ✭ 449 (-82.84%)
Tensorflow BookAccompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
Stars: ✭ 4,448 (+69.97%)
Reinforcement learning tutorial with demoReinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-83.11%)
Deeprl TutorialsContains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Stars: ✭ 748 (-71.42%)
Qlearning tradingLearning to trade under the reinforcement learning framework
Stars: ✭ 431 (-83.53%)
Basic reinforcement learningAn introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Stars: ✭ 826 (-68.44%)
Awesome Ai BooksSome awesome AI related books and pdfs for learning and downloading, also apply some playground models for learning
Stars: ✭ 855 (-67.33%)
CourseraQuiz & Assignment of Coursera
Stars: ✭ 774 (-70.42%)
Policy Gradient MethodsImplementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Stars: ✭ 54 (-97.94%)
Machine Learning From ScratchSuccinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-98.4%)
Rl WorkshopReinforcement Learning Workshop for Data Science BKK
Stars: ✭ 73 (-97.21%)
Spacenet building detectionProject to train/test convolutional neural networks to extract buildings from SpaceNet satellite imageries.
Stars: ✭ 83 (-96.83%)
Chainer HandsonCAUTION: This is not maintained anymore. Visit https://github.com/chainer-community/chainer-colab-notebook/
Stars: ✭ 84 (-96.79%)
MagnetMAGNet: Multi-agents control using Graph Neural Networks
Stars: ✭ 88 (-96.64%)
Rl Movie RecommenderThe purpose of our research is to study reinforcement learning approaches to building a movie recommender system. We formulate the problem of interactive recommendation as a contextual multi-armed bandit.
Stars: ✭ 93 (-96.45%)
Ngsim envLearning human driver models from NGSIM data with imitation learning.
Stars: ✭ 96 (-96.33%)
Ctc ExecutionerMaster Thesis: Limit order placement with Reinforcement Learning
Stars: ✭ 112 (-95.72%)
Coursera reinforcement learningCoursera Reinforcement Learning Specialization by University of Alberta & Alberta Machine Intelligence Institute
Stars: ✭ 114 (-95.64%)
Pytorch RlTutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (-95.38%)
MathyTools for using computer algebra systems to solve math problems step-by-step with reinforcement learning
Stars: ✭ 79 (-96.98%)
Rlai ExercisesExercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
Stars: ✭ 97 (-96.29%)
Reinforcementlearning AtarigamePytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
Stars: ✭ 118 (-95.49%)