All Projects → maciejjaskowski → Deep Q Learning

maciejjaskowski / Deep Q Learning

Projects that are alternatives of or similar to Deep Q Learning

Binderhub
Run your code in the cloud, with technology so advanced, it feels like magic!
Stars: ✭ 2,050 (+1222.58%)
Mutual labels:  jupyter-notebook
Pytorchmedicalai
This is the hands-on deep learning tutorial series for the 2018/2019 Medical AI course by DeepOncology AI.
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Tencent social ads2017 mobile app pcvr
Tencent Social Ads 2017 contest rank 20
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Deepreinforcementlearning
A replica of the AlphaZero methodology for deep reinforcement learning in Python
Stars: ✭ 1,898 (+1124.52%)
Mutual labels:  jupyter-notebook
Your First Kaggle Submission
How to perform an exploratory data analysis on the Kaggle Titanic dataset and make a submission to the leaderboard.
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Surgery Robot Detection Segmentation
Object detection and segmentation for a surgery robot using Mask-RCNN on Python 3, Keras, and TensorFlow..
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Stock Market Prediction Challenge
Following repo is the solution to Stock Market Prediction using Neural Networks and Sentiment Analysis
Stars: ✭ 154 (-0.65%)
Mutual labels:  jupyter-notebook
Py Quantmod
Powerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Stocks
Programs for stock prediction and evaluation
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Fcn For Semantic Segmentation
Implemention of FCN-8 and FCN-16 with Keras and uses CRF as post processing
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Pyportfolioopt
Financial portfolio optimisation in python, including classical efficient frontier, Black-Litterman, Hierarchical Risk Parity
Stars: ✭ 2,502 (+1514.19%)
Mutual labels:  jupyter-notebook
Davsod
Shifting More Attention to Video Salient Objection Detection, CVPR 2019 (Best paper finalist & Oral)
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Mgwr
Multiscale Geographically Weighted Regression (MGWR)
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Neural Style Transfer
Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style" (http://arxiv.org/abs/1508.06576) in Keras 2.0+
Stars: ✭ 2,000 (+1190.32%)
Mutual labels:  jupyter-notebook
Learningtosee
Stars: ✭ 154 (-0.65%)
Mutual labels:  jupyter-notebook
Jupyter Server Proxy
Jupyter notebook server extension to proxy web services.
Stars: ✭ 153 (-1.29%)
Mutual labels:  jupyter-notebook
Spiking Neural Network Snn With Pytorch Where Backpropagation Engenders Stdp
What about coding a Spiking Neural Network using an automatic differentiation framework? In SNNs, there is a time axis and the neural network sees data throughout time, and activation functions are instead spikes that are raised past a certain pre-activation threshold. Pre-activation values constantly fades if neurons aren't excited enough.
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Carnd Camera Calibration
Images and notebook for camera calibration
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Ml Training Advanced
Materials for the "Advanced Scikit-learn" class in the afternoon
Stars: ✭ 155 (+0%)
Mutual labels:  jupyter-notebook
Copulas
A library to model multivariate data using copulas.
Stars: ✭ 149 (-3.87%)
Mutual labels:  jupyter-notebook

Deep Q-Learning

Overview

Our version of the deep q-learning algorithm from The DQN paper. This algorithm reads the screen and the integer score of the Atari 2600 game Space Invaders. The output is the same control commands as a human would have with a controller (albeit, without the physical controller).

Installation Dependencies:

Amazon Instance Installation

Look at /provision/aws_installation.sh for a concise shell history to install the environment.

External References

The DQN paper

Human-level control through deep reinforcement learning

Deep Reinforcement Learning with Double Q-learning - more stable learning through double q-learning

Action-Conditional Video Prediction using Deep Networks in Atari Games - predicting future frames

Dueling Network Architectures for Deep Q-learning

Arcade Learning Environment

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation

Reccurent Model of Visual Attention - applying q-learning to figure out what part of the image to look at.

Prioritized Experience Replay - drawing from the memory should be more likely if the memory is more shocking

Deep Recurrent Q-Learning For Partially Observable MDPs - by using LSTM you can get rid of preprocessing done in DQN paper. "The recurrent net can better adapt at evaluation time if the quality of observations changes"

A fast learning algorithm for deep belief nets - Training one layer at a time

Reinforcement Learning and Automated Planning: A Survey

Autoregressive Neural Networks - Neural Networks applied to Time Series.

Deep Autoregressive Neural Networks - predicting future frames of an Atari Game.

Reinforcement Learning: An introduction - very thorough introduction to Reinforcement Learning.

A survey of robot learning by demonstration Learning by|from demonstration = Learning by watching = Learning from observation = Programming by demonstration = Behaviour cloning|imitation|mimicry

DynaQ

Deep Reinforcement Learning Nice summary of recent advances in Deep Q-learning.

Concurrent Q-learning for Autonomous Mapping and Navigation One-trial learning???

Using Reinforcement Learning to Adapt an Imitation Task Overcoming new obstacles ???

On the importance of initialization and momentum in deep learning - Nesterov Momentum vs Nesterov Accelerated Gradient

CNN Features off-the-shelf: an Astounding Baseline for Recognition NN generated features are better then manually-made

Prioritized Experience Replay - on Atari games

Network in Network - MaxPooling looses information, let's keep some more information.

Concurrent Reinforcement Learning - RL in time dependent environments

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].