vaibhawvipul / tildy-mdp

License: GPL-3.0
Reinforcement Learning Fun Game

Programming Languages

python

tildy-mdp

This is a fun project inspired by Richard Sutton's talk "Tutorial: Introduction to Reinforcement Learning with Function Approximation".

Play with this repo

python3 learn_mdp.py

About the project

Here the user plays the role of a reinforcement learning agent and tries to find the optimal policy, the one that maximizes total reward. The environment has two states, A and B. In each state the user can take one of two actions, 1 or 2. Depending on the action taken in a given state, the user receives a positive or negative reward as feedback.
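As a rough illustration of the setup described above, the environment could be sketched as a small lookup table. The transitions and rewards in MODEL, the class name TwoStateMDP and the helper play are hypothetical and chosen only for illustration; they are not taken from learn_mdp.py.

```python
# Hypothetical model: (state, action) -> (next_state, reward).
# These numbers are illustrative assumptions, not the values used by the repo.
MODEL = {
    ("A", 1): ("A", -1),
    ("A", 2): ("B", +1),
    ("B", 1): ("A", +2),
    ("B", 2): ("B", -1),
}


class TwoStateMDP:
    """A two-state, two-action environment driven by the MODEL table."""

    def __init__(self, start="A"):
        self.state = start

    def step(self, action):
        """Apply an action (1 or 2) and return (next_state, reward)."""
        next_state, reward = MODEL[(self.state, action)]
        self.state = next_state
        return next_state, reward


def play(env, actions):
    """Run a fixed sequence of actions and return the total reward."""
    total = 0
    for action in actions:
        _, reward = env.step(action)
        total += reward
    return total
```

In this sketch the agent only sees the state and the reward returned by step; the MODEL table stays hidden from it, which is what makes the game a learning problem.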

True model of the world

If you decide to play this game, the optimal policy is as follows:

State Action
A 2
B 1
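When the model is known, an optimal policy like the one in the table can be computed rather than discovered by trial and error. The sketch below uses value iteration, which is one standard way to do this and is not necessarily what the repo does. The MODEL table and the discount factor GAMMA are assumptions: the numbers were picked so that the resulting greedy policy matches the table above, not read from the project.

```python
GAMMA = 0.9  # assumed discount factor
STATES = ("A", "B")
ACTIONS = (1, 2)

# Hypothetical deterministic model: (state, action) -> (next_state, reward).
MODEL = {
    ("A", 1): ("A", -1),
    ("A", 2): ("B", +1),
    ("B", 1): ("A", +2),
    ("B", 2): ("B", -1),
}


def q_value(values, state, action):
    """One-step lookahead: immediate reward plus discounted next-state value."""
    next_state, reward = MODEL[(state, action)]
    return reward + GAMMA * values[next_state]


def value_iteration(tolerance=1e-9):
    """Repeat the Bellman optimality update until the values stop changing."""
    values = {s: 0.0 for s in STATES}
    while True:
        new_values = {
            s: max(q_value(values, s, a) for a in ACTIONS) for s in STATES
        }
        delta = max(abs(new_values[s] - values[s]) for s in STATES)
        values = new_values
        if delta < tolerance:
            return values


def greedy_policy(values):
    """Pick, in each state, the action with the highest one-step lookahead."""
    return {
        s: max(ACTIONS, key=lambda a: q_value(values, s, a)) for s in STATES
    }
```

Calling greedy_policy(value_iteration()) on this hypothetical model selects action 2 in state A and action 1 in state B.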

This repository is intended for educational use and can help explain the following reinforcement learning concepts:

  • Markov decision processes (MDPs)
  • The exploration vs. exploitation dilemma
  • Introduction to RL
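The exploration vs. exploitation dilemma listed above is often illustrated with epsilon-greedy action selection. The sketch below shows that common technique under stated assumptions; the function name epsilon_greedy and its parameters are hypothetical, and the repo itself may handle exploration differently (in the game, the human player makes this choice).

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Choose an action from a dict mapping action -> estimated value.

    With probability epsilon, explore by picking a random action.
    Otherwise, exploit by picking the action with the highest estimate.
    """
    if rng.random() < epsilon:
        return rng.choice(list(q_values))
    return max(q_values, key=q_values.get)
```

With epsilon set to 0 the agent always exploits its current estimates and may never discover a better action. With epsilon set to 1 it always explores and never uses what it has learned. Values in between trade one off against the other.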

Feel free to improve this project. Pull Requests are welcome.
