Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → zlpure → Cs234

zlpure / Cs234

Licence: mit

My Solution to Assignments of CS234

Programming Languages

139335 projects - #7 most used programming language

Labels

reinforcement-learning deep-reinforcement-learning

Projects that are alternatives of or similar to Cs234

Get started with Machine Learning in TensorFlow with a selection of good reads and implemented examples!

Stars: ✭ 45 (-50.55%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

1 Year Machinelearning Journey

An advanced program in Machine Learning and Deep Learning

Stars: ✭ 69 (-24.18%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).

Stars: ✭ 47 (-48.35%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Deep Q Learning

Minimal Deep Q Learning (DQN & DDQN) implementations in Keras

Stars: ✭ 1,013 (+1013.19%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Stars: ✭ 77 (-15.38%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Deterministic Gail Pytorch

PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning

Stars: ✭ 44 (-51.65%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Notes and comments about Deep Reinforcement Learning papers

Stars: ✭ 65 (-28.57%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Stars: ✭ 980 (+976.92%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

MuZero

Stars: ✭ 1,187 (+1204.4%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Rl Course Experiments

Stars: ✭ 73 (-19.78%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Solved lab problems, slides and notes of the Deep Reinforcement Learning bootcamp 2017 held at UCBerkeley

Stars: ✭ 39 (-57.14%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Artificial intelligence for the Snake game.

Stars: ✭ 1,241 (+1263.74%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Deepqlearning.jl

Implementation of the Deep Q-learning algorithm to solve MDPs

Stars: ✭ 38 (-58.24%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning

Stars: ✭ 44 (-51.65%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Awesome Deep Rl

For deep RL and the future of AI.

Stars: ✭ 985 (+982.42%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Code for reproducing experiments in Model-Based Active Exploration, ICML 2019

Stars: ✭ 61 (-32.97%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms

Stars: ✭ 29 (-68.13%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Using deep reinforcement learning to tackle the game 2048.

Stars: ✭ 35 (-61.54%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Stars: ✭ 70 (-23.08%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

Rlenv.directory

Explore and find reinforcement learning environments in a list of 150+ open source environments.

Stars: ✭ 79 (-13.19%)

Mutual labels: reinforcement-learning, deep-reinforcement-learning

View All Similar Projects ➔

My Solution to Assignments of CS234

This is my solution to three assignments of CS234.
CS234: Deep Reinforcement Learning is an interesting class, which teaches you what is the reinforcement learning: Learn to make good sequences of decisions. This class provides some basic knowledge and insights of cutting-edge research in reinforcement learning. More details are as follows:

Define the key features of RL vs AI & other ML
Define MDP, POMDP, bandit, batch offline RL, online RL
Describe the exploration vs exploitation challenge and compare and contrast 2 or more approaches
Given an application problem (e.g. from computer vision, robotics, etc) decide if it should be formulated as a RL problem, if yes how to formulate, what algorithm (from class) is best suited to address, and justify an answer
Implement several RL algorithms incl. a deep RL approach
Describe multiple criteria for analyzing RL algorithms and evaluate algorithms on these metrics: e.g. regret, sample complexity, computational complexity, convergence, etc.
List at least two open challenges or hot topics in RL

Note: If you consult my source codes that you may want to incorporate into your algorithm or system, you should clearly cite references in your codes.

Table of Contents

Assignment 1
- Bellman Operator Properties
- Value Iteration
- Grid Policies
- Frozen Lake MDP
- Frozen Lake Reinforcement Learning
Assignment 2
- Q-learning
- Linear Approximation
- Deepmind's DQN
- (Bonus) Double DQN
- (Bonus) Dueling DQN
Assignment 3
- R-max algorithm
- epsilon-greedy q-learning
- Expected Regret Bounds

Dependencies

Anaconda
tensorflow>=0.12
matplotlib
scipy
numpy
sklearn
six

Author

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 91

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗