All Projects → dalmia → David Silver Reinforcement Learning

dalmia / David Silver Reinforcement Learning

Licence: mit
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to David Silver Reinforcement Learning

Awesome Ai Books
Some awesome AI related books and pdfs for learning and downloading, also apply some playground models for learning
Stars: ✭ 855 (+37.24%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Notebooks
Some notebooks
Stars: ✭ 53 (-91.49%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Basic reinforcement learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Stars: ✭ 826 (+32.58%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
60 days rl challenge
60_Days_RL_Challenge中文版
Stars: ✭ 92 (-85.23%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Gdrl
Grokking Deep Reinforcement Learning
Stars: ✭ 304 (-51.2%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Text summurization abstractive methods
Multiple implementations for abstractive text summurization , using google colab
Stars: ✭ 359 (-42.38%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Machine Learning From Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-93.26%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Rlai Exercises
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
Stars: ✭ 97 (-84.43%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Applied Reinforcement Learning
Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (-63.24%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Reinforcement Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Stars: ✭ 3,329 (+434.35%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-41.57%)
Mutual labels:  artificial-intelligence, jupyter-notebook, reinforcement-learning
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (-29.05%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Tensor House
A collection of reference machine learning and optimization models for enterprise operations: marketing, pricing, supply chain
Stars: ✭ 449 (-27.93%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-27.13%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Pba
Efficient Learning of Augmentation Policy Schedules
Stars: ✭ 461 (-26%)
Mutual labels:  artificial-intelligence, jupyter-notebook
Practical Deep Learning Book
Official code repo for the O'Reilly Book - Practical Deep Learning for Cloud, Mobile & Edge
Stars: ✭ 441 (-29.21%)
Mutual labels:  artificial-intelligence, jupyter-notebook
Arnold
Arnold - DOOM Agent
Stars: ✭ 457 (-26.65%)
Mutual labels:  artificial-intelligence, reinforcement-learning
Recsim
A Configurable Recommender Systems Simulation Platform
Stars: ✭ 461 (-26%)
Mutual labels:  artificial-intelligence, reinforcement-learning
Practical rl
A course in reinforcement learning in the wild
Stars: ✭ 4,741 (+661%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Tensorflow Book
Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
Stars: ✭ 4,448 (+613.96%)
Mutual labels:  jupyter-notebook, reinforcement-learning

David-Silver-Reinforcement-learning

Tweet

apm Build Status PRs Welcome

This repository contains the notes for the Reinforcement Learning course by David Silver along with the implementation of the various algorithms discussed, both in Keras (with TensorFlow backend) and OpenAI's gym framework.

Syllabus:

  • Week 1: Introduction to Reinforcement Learning [slide][video]

  • Week 2: Markov Decision Processes [slide][video]

  • Week 3: Planning by Dynamic Programming [slide][video]

  • Week 4: Model-Free Prediction [slide][video]

  • Week 5: Model-Free Control [slide][video]

  • Week 6: Value Function Approximation [slide][video]

  • Week 7: Policy Gradient Methods [slide][video]

  • Week 8: Integrating Learning and Planning [slide][video]

  • Week 9: Exploration and Exploitation [slide][video]

  • Week 10: Case Study: RL in Classic Games [slide][video]

Dependencies

  • TensorFlow
  • Keras
  • Gym
  • Numpy

Install them using pip.

Contributing

Please feel free to create a Pull Request for adding implementations of the algorithms discussed in different frameworks like PyTorch, Caffe, etc. or improving the existing implementations. If you are a beginner, you can refer this for getting started.

Support

If you found this useful, please consider starring(★) the repo so that it can reach a broader audience.

License

This project is licensed under the MIT License - see the LICENSE file for details.

References

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].