applenob / Rl_learn
我的强化学习笔记和学习材料📖 still updating ... ...
Stars: ✭ 234
Projects that are alternatives of or similar to Rl learn
Deep Algotrading
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (-26.07%)
Mutual labels: jupyter-notebook, reinforcement-learning
Alpha Zero General
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+1018.38%)
Mutual labels: jupyter-notebook, reinforcement-learning
Andrew Ng Notes
This is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-23.08%)
Mutual labels: jupyter-notebook, reinforcement-learning
2048 Deep Reinforcement Learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
Stars: ✭ 169 (-27.78%)
Mutual labels: jupyter-notebook, reinforcement-learning
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (-5.13%)
Mutual labels: jupyter-notebook, reinforcement-learning
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (-25.64%)
Mutual labels: jupyter-notebook, reinforcement-learning
Release
Deep Reinforcement Learning for de-novo Drug Design
Stars: ✭ 201 (-14.1%)
Mutual labels: jupyter-notebook, reinforcement-learning
Modular Rl
[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"
Stars: ✭ 126 (-46.15%)
Mutual labels: jupyter-notebook, reinforcement-learning
Icychesszero
中国象棋alpha zero程序
Stars: ✭ 206 (-11.97%)
Mutual labels: jupyter-notebook, reinforcement-learning
Rl Tutorial Jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Stars: ✭ 204 (-12.82%)
Mutual labels: jupyter-notebook, reinforcement-learning
Chess Alpha Zero
Chess reinforcement learning by AlphaGo Zero methods.
Stars: ✭ 1,868 (+698.29%)
Mutual labels: jupyter-notebook, reinforcement-learning
Applied Reinforcement Learning
Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (-2.14%)
Mutual labels: jupyter-notebook, reinforcement-learning
Data Science Question Answer
A repo for data science related questions and answers
Stars: ✭ 2,000 (+754.7%)
Mutual labels: jupyter-notebook, reinforcement-learning
Machine Learning And Reinforcement Learning In Finance
Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering
Stars: ✭ 173 (-26.07%)
Mutual labels: jupyter-notebook, reinforcement-learning
Monthly Challenges
Repository containing monthly challenges about quantum computing.
Stars: ✭ 126 (-46.15%)
Mutual labels: jupyter-notebook, learning-by-doing
Deeprl Agents
A set of Deep Reinforcement Learning Agents implemented in Tensorflow.
Stars: ✭ 2,149 (+818.38%)
Mutual labels: jupyter-notebook, reinforcement-learning
Rl Quadcopter
Teach a Quadcopter How to Fly!
Stars: ✭ 124 (-47.01%)
Mutual labels: jupyter-notebook, reinforcement-learning
Dive Into Machine Learning
Dive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+4519.66%)
Mutual labels: jupyter-notebook, learning-by-doing
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-13.68%)
Mutual labels: jupyter-notebook, reinforcement-learning
Functional intro to python
[tutorial]A functional, Data Science focused introduction to Python
Stars: ✭ 228 (-2.56%)
Mutual labels: jupyter-notebook, learning-by-doing
[WIP]强化学习的学习仓库
这是我个人学习强化学习的时候收集的比较经典的学习资料、笔记和代码,分享给所有人。
为了直接在GitHub上用markdown文件看公式,推荐安装chrome插件:MathJax Plugin for Github
入门指南
课程笔记
-
Sutton 的 Reinforcement Learning: An Introduction书本学习笔记
- 1. Introduction
- 2. Multi-armed Bandits
- 3. Finite Markov DecisionProcesses
- 4. Dynamic Programming
- 5. Monte Carlo Methods
- 6. Temporal-Difference Learning
- 7. n-step Bootstrapping
- 8. Planning and Learning with Tabular Methods
- 9. On-policy Prediction with Approximation
- 10. On-policy Control with Approximation
- 11. Off-policy Methods with Approximation
- 12. Eligibility Traces
- 13. Policy Gradient Methods
- 14. Psychology
- 15. Neuroscience
- 16. Applications and Case Studies
- 17. Frontiers
实验目录
所有的实验源代码都在lib
目录下,来自dennybritz。在原先代码的基础上,增加了对实验背景的具体介绍、代码和公式的对照。
- Gridworld:对应MDP的Dynamic Programming
- Blackjack:对应Model Free的Monte Carlo的Planning和Controlling
- Windy Gridworld:对应Model Free的Temporal Difference的On-Policy Controlling:SARSA。
- Cliff Walking:对应Model Free的Temporal Difference的Off-Policy Controlling:Q-learning。
- Mountain Car:对应Q表格很大无法处理(state空间连续)的Q-Learning with Linear Function Approximation。
- Atari:对应Deep-Q Learning。
其他重要学习资料:
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].