zlpure / Cs234
Licence: MIT
My Solution to Assignments of CS234
Stars: ✭ 91
Projects that are alternatives of or similar to Cs234
Ml In Tf
Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!
Stars: ✭ 45 (-50.55%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
1 Year Machinelearning Journey
An advanced program in Machine Learning and Deep Learning
Stars: ✭ 69 (-24.18%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Mujocounity
Reproducing MuJoCo benchmarks in a modern, commercial game/physics engine (Unity + PhysX).
Stars: ✭ 47 (-48.35%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Deep Q Learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Stars: ✭ 1,013 (+1013.19%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (-51.65%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Drl papernotes
Notes and comments about Deep Reinforcement Learning papers
Stars: ✭ 65 (-28.57%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Stars: ✭ 980 (+976.92%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Muzero General
MuZero
Stars: ✭ 1,187 (+1204.4%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Rl Course Experiments
Stars: ✭ 73 (-19.78%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Deepbootcamp
Solved lab problems, slides and notes of the Deep Reinforcement Learning bootcamp 2017 held at UCBerkeley
Stars: ✭ 39 (-57.14%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Snake
Artificial intelligence for the Snake game.
Stars: ✭ 1,241 (+1263.74%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Deepqlearning.jl
Implementation of the Deep Q-learning algorithm to solve MDPs
Stars: ✭ 38 (-58.24%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Async Deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
Stars: ✭ 44 (-51.65%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Awesome Deep Rl
For deep RL and the future of AI.
Stars: ✭ 985 (+982.42%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Max
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
Stars: ✭ 61 (-32.97%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Drlkit
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-68.13%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Left Shift
Using deep reinforcement learning to tackle the game 2048.
Stars: ✭ 35 (-61.54%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Torch Ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Stars: ✭ 70 (-23.08%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
Rlenv.directory
Explore and find reinforcement learning environments in a list of 150+ open source environments.
Stars: ✭ 79 (-13.19%)
Mutual labels: reinforcement-learning, deep-reinforcement-learning
My Solution to Assignments of CS234
These are my solutions to the three assignments of CS234.
CS234: Deep Reinforcement Learning is an interesting class that teaches what reinforcement learning is: learning to make good sequences of decisions. The class covers the fundamentals and offers insight into cutting-edge research in reinforcement learning. Its learning goals are as follows:
- Define the key features of RL that distinguish it from AI and other ML approaches
- Define MDP, POMDP, bandit, batch offline RL, online RL
- Describe the exploration vs exploitation challenge and compare and contrast two or more approaches
- Given an application problem (e.g. from computer vision, robotics, etc.), decide whether it should be formulated as an RL problem; if so, state how to formulate it, choose the algorithm from class best suited to it, and justify the answer
- Implement several RL algorithms, including a deep RL approach
- Describe multiple criteria for analyzing RL algorithms and evaluate algorithms on these metrics, e.g. regret, sample complexity, computational complexity, convergence
- List at least two open challenges or hot topics in RL
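To make the exploration vs exploitation trade-off and the regret metric from the goals above concrete, here is a minimal sketch of epsilon-greedy action selection on a Bernoulli bandit. It is not taken from the assignment code: the function name `run_epsilon_greedy`, the arm probabilities, and the seed are all hypothetical choices for illustration.

```python
import random


def run_epsilon_greedy(arm_probs, epsilon, steps, seed=0):
    """Play a Bernoulli bandit with epsilon-greedy action selection.

    arm_probs: true success probability of each arm (hypothetical values).
    Returns the estimated arm values and the cumulative expected regret.
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    best = max(arm_probs)
    regret = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: uniformly random arm
        else:
            # exploit: arm with the highest current estimate
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        counts[arm] += 1
        # incremental update of the sample mean
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        # expected regret: gap between the best arm and the chosen arm
        regret += best - arm_probs[arm]

    return estimates, regret


if __name__ == "__main__":
    est, reg = run_epsilon_greedy([0.2, 0.8], epsilon=0.1, steps=1000)
    print("estimates:", est)
    print("cumulative expected regret:", reg)
```

A larger `epsilon` explores more and keeps paying regret on the weaker arm; `epsilon=0` never explores and can lock onto a poor arm early.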
Note: If you incorporate any of my source code into your own algorithm or system, please cite it clearly in your code.
Table of Contents
- Assignment 1
  - Bellman Operator Properties
  - Value Iteration
  - Grid Policies
  - Frozen Lake MDP
  - Frozen Lake Reinforcement Learning
- Assignment 2
  - Q-learning
  - Linear Approximation
  - DeepMind's DQN
  - (Bonus) Double DQN
  - (Bonus) Dueling DQN
- Assignment 3
  - R-max algorithm
  - epsilon-greedy q-learning
  - Expected Regret Bounds
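For readers unfamiliar with the Value Iteration topic listed under Assignment 1, the following is a minimal sketch on a hypothetical three-state chain MDP. It is not the assignment solution. The transition table `P[s][a]` is assumed to be a list of `(prob, next_state, reward, done)` tuples, similar to the format Gym's Frozen Lake environment exposes; the function names and the toy MDP are illustrative assumptions.

```python
def q_values(P, V, s, n_actions, gamma):
    """One-step lookahead: expected return of each action in state s."""
    return [
        sum(p * (r + gamma * (0.0 if done else V[s2]))
            for p, s2, r, done in P[s][a])
        for a in range(n_actions)
    ]


def value_iteration(P, n_states, n_actions, gamma=0.9, tol=1e-8):
    """Apply the Bellman optimality backup until values change by < tol.

    Values are updated in place during each sweep (Gauss-Seidel style).
    Returns the value function and a greedy policy.
    """
    V = [0.0] * n_states
    while True:
        delta = 0.0
        for s in range(n_states):
            best = max(q_values(P, V, s, n_actions, gamma))
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break
    policy = []
    for s in range(n_states):
        q = q_values(P, V, s, n_actions, gamma)
        policy.append(max(range(n_actions), key=lambda a: q[a]))
    return V, policy


# Hypothetical chain MDP: states 0, 1, 2 (state 2 is terminal).
# Action 0 moves left, action 1 moves right; reaching state 2 pays 1.
P = {
    0: {0: [(1.0, 0, 0.0, False)], 1: [(1.0, 1, 0.0, False)]},
    1: {0: [(1.0, 0, 0.0, False)], 1: [(1.0, 2, 1.0, True)]},
    2: {0: [(1.0, 2, 0.0, True)], 1: [(1.0, 2, 0.0, True)]},
}

if __name__ == "__main__":
    V, policy = value_iteration(P, n_states=3, n_actions=2)
    print("V:", V)
    print("policy:", policy)
```

On this toy chain the values converge to roughly 0.9, 1.0, and 0.0, and the greedy policy moves right from states 0 and 1.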
Dependencies
- Anaconda
- tensorflow>=0.12
- matplotlib
- scipy
- numpy
- sklearn
- six
Author