All Projects → applenob → Rl_learn

applenob / Rl_learn

我的强化学习笔记和学习材料📖 still updating ... ...

Projects that are alternatives of or similar to Rl learn

Deep Algotrading
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Stars: ✭ 173 (-26.07%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Alpha Zero General
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+1018.38%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Andrew Ng Notes
This is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-23.08%)
Mutual labels:  jupyter-notebook, reinforcement-learning
2048 Deep Reinforcement Learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
Stars: ✭ 169 (-27.78%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (-5.13%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Pytorch sac
PyTorch implementation of Soft Actor-Critic (SAC)
Stars: ✭ 174 (-25.64%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Release
Deep Reinforcement Learning for de-novo Drug Design
Stars: ✭ 201 (-14.1%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Modular Rl
[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"
Stars: ✭ 126 (-46.15%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Icychesszero
中国象棋alpha zero程序
Stars: ✭ 206 (-11.97%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Rl Tutorial Jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Stars: ✭ 204 (-12.82%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Chess Alpha Zero
Chess reinforcement learning by AlphaGo Zero methods.
Stars: ✭ 1,868 (+698.29%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Applied Reinforcement Learning
Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (-2.14%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Data Science Question Answer
A repo for data science related questions and answers
Stars: ✭ 2,000 (+754.7%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Machine Learning And Reinforcement Learning In Finance
Machine Learning and Reinforcement Learning in Finance New York University Tandon School of Engineering
Stars: ✭ 173 (-26.07%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Monthly Challenges
Repository containing monthly challenges about quantum computing.
Stars: ✭ 126 (-46.15%)
Mutual labels:  jupyter-notebook, learning-by-doing
Deeprl Agents
A set of Deep Reinforcement Learning Agents implemented in Tensorflow.
Stars: ✭ 2,149 (+818.38%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Rl Quadcopter
Teach a Quadcopter How to Fly!
Stars: ✭ 124 (-47.01%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Dive Into Machine Learning
Dive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+4519.66%)
Mutual labels:  jupyter-notebook, learning-by-doing
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-13.68%)
Mutual labels:  jupyter-notebook, reinforcement-learning
Functional intro to python
[tutorial]A functional, Data Science focused introduction to Python
Stars: ✭ 228 (-2.56%)
Mutual labels:  jupyter-notebook, learning-by-doing

[WIP]强化学习的学习仓库

这是我个人学习强化学习的时候收集的比较经典的学习资料、笔记和代码,分享给所有人。

为了直接在GitHub上用markdown文件看公式,推荐安装chrome插件:MathJax Plugin for Github

入门指南

课程笔记

实验目录

所有的实验源代码都在lib目录下,来自dennybritz。在原先代码的基础上,增加了对实验背景的具体介绍、代码和公式的对照。

  • Gridworld:对应MDPDynamic Programming
  • Blackjack:对应Model FreeMonte Carlo的Planning和Controlling
  • Windy Gridworld:对应Model FreeTemporal DifferenceOn-Policy ControllingSARSA
  • Cliff Walking:对应Model FreeTemporal DifferenceOff-Policy ControllingQ-learning
  • Mountain Car:对应Q表格很大无法处理(state空间连续)的Q-Learning with Linear Function Approximation
  • Atari:对应Deep-Q Learning

其他重要学习资料:

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].