All Projects → zlpure → Cs234

zlpure / Cs234

Licence: mit
My Solution to Assignments of CS234

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Cs234

Ml In Tf
Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!
Stars: ✭ 45 (-50.55%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
1 Year Machinelearning Journey
An advanced program in Machine Learning and Deep Learning
Stars: ✭ 69 (-24.18%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Mujocounity
Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).
Stars: ✭ 47 (-48.35%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Deep Q Learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Stars: ✭ 1,013 (+1013.19%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Treeqn
Stars: ✭ 77 (-15.38%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (-51.65%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Drl papernotes
Notes and comments about Deep Reinforcement Learning papers
Stars: ✭ 65 (-28.57%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Stars: ✭ 980 (+976.92%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Muzero General
MuZero
Stars: ✭ 1,187 (+1204.4%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Rl Course Experiments
Stars: ✭ 73 (-19.78%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Deepbootcamp
Solved lab problems, slides and notes of the Deep Reinforcement Learning bootcamp 2017 held at UCBerkeley
Stars: ✭ 39 (-57.14%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Snake
Artificial intelligence for the Snake game.
Stars: ✭ 1,241 (+1263.74%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Deepqlearning.jl
Implementation of the Deep Q-learning algorithm to solve MDPs
Stars: ✭ 38 (-58.24%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Async Deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
Stars: ✭ 44 (-51.65%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Awesome Deep Rl
For deep RL and the future of AI.
Stars: ✭ 985 (+982.42%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Max
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
Stars: ✭ 61 (-32.97%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Drlkit
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-68.13%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Left Shift
Using deep reinforcement learning to tackle the game 2048.
Stars: ✭ 35 (-61.54%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-23.08%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning
Rlenv.directory
Explore and find reinforcement learning environments in a list of 150+ open source environments.
Stars: ✭ 79 (-13.19%)
Mutual labels:  reinforcement-learning, deep-reinforcement-learning

My Solution to Assignments of CS234

This is my solution to three assignments of CS234.
CS234: Deep Reinforcement Learning is an interesting class, which teaches you what is the reinforcement learning: Learn to make good sequences of decisions. This class provides some basic knowledge and insights of cutting-edge research in reinforcement learning. More details are as follows:

  • Define the key features of RL vs AI & other ML
  • Define MDP, POMDP, bandit, batch offline RL, online RL
  • Describe the exploration vs exploitation challenge and compare and contrast 2 or more approaches
  • Given an application problem (e.g. from computer vision, robotics, etc) decide if it should be formulated as a RL problem, if yes how to formulate, what algorithm (from class) is best suited to address, and justify an answer
  • Implement several RL algorithms incl. a deep RL approach
  • Describe multiple criteria for analyzing RL algorithms and evaluate algorithms on these metrics: e.g. regret, sample complexity, computational complexity, convergence, etc.
  • List at least two open challenges or hot topics in RL

Note: If you consult my source codes that you may want to incorporate into your algorithm or system, you should clearly cite references in your codes.


Table of Contents

  • Assignment 1
    • Bellman Operator Properties
    • Value Iteration
    • Grid Policies
    • Frozen Lake MDP
    • Frozen Lake Reinforcement Learning
  • Assignment 2
    • Q-learning
    • Linear Approximation
    • Deepmind's DQN
    • (Bonus) Double DQN
    • (Bonus) Dueling DQN
  • Assignment 3
    • R-max algorithm
    • epsilon-greedy q-learning
    • Expected Regret Bounds

Dependencies

  • Anaconda
  • tensorflow>=0.12
  • matplotlib
  • scipy
  • numpy
  • sklearn
  • six

Author

@zlpure

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].