All Projects → higgsfield → Np Hard Deep Reinforcement Learning

higgsfield / Np Hard Deep Reinforcement Learning

pytorch neural combinatorial optimization

Projects that are alternatives of or similar to Np Hard Deep Reinforcement Learning

Unintended Ml Bias Analysis
Stars: ✭ 271 (-1.09%)
Mutual labels:  jupyter-notebook
Aws Deepcomposer Samples
Stars: ✭ 273 (-0.36%)
Mutual labels:  jupyter-notebook
Machine Learning For Algorithmic Trading Second Edition original
Machine Learning for Algorithmic Trading, Second Edition - published by Packt
Stars: ✭ 273 (-0.36%)
Mutual labels:  jupyter-notebook
Fba matting
Official repository for the paper F, B, Alpha Matting
Stars: ✭ 272 (-0.73%)
Mutual labels:  jupyter-notebook
Data Science Is Software
Stars: ✭ 272 (-0.73%)
Mutual labels:  jupyter-notebook
Islr Python
An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code
Stars: ✭ 3,344 (+1120.44%)
Mutual labels:  jupyter-notebook
Streamingphish
Python-based utility that uses supervised machine learning to detect phishing domains from the Certificate Transparency log network.
Stars: ✭ 271 (-1.09%)
Mutual labels:  jupyter-notebook
Kubeflow Labs
👩‍🔬 Train and Serve TensorFlow Models at Scale with Kubernetes and Kubeflow on Azure
Stars: ✭ 275 (+0.36%)
Mutual labels:  jupyter-notebook
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-2.19%)
Mutual labels:  jupyter-notebook
Altair Tutorial
Notebooks for the Altair tutorial
Stars: ✭ 273 (-0.36%)
Mutual labels:  jupyter-notebook
Cryptocurrency Price Prediction
Cryptocurrency Price Prediction Using LSTM neural network
Stars: ✭ 271 (-1.09%)
Mutual labels:  jupyter-notebook
Numerai
Code from my experiments on Numerai
Stars: ✭ 272 (-0.73%)
Mutual labels:  jupyter-notebook
Python reference
Useful functions, tutorials, and other Python-related things
Stars: ✭ 3,125 (+1040.51%)
Mutual labels:  jupyter-notebook
Rad
RAD: Reinforcement Learning with Augmented Data
Stars: ✭ 268 (-2.19%)
Mutual labels:  jupyter-notebook
Mathematical Python
Introduction to Mathematical Computing with Python and Jupyter
Stars: ✭ 273 (-0.36%)
Mutual labels:  jupyter-notebook
Introduction To Python For Computational Science And Engineering
Book: Introduction to Python for Computational Science and Engineering
Stars: ✭ 271 (-1.09%)
Mutual labels:  jupyter-notebook
Computation Thru Dynamics
Understanding computation in artificial and biological recurrent networks through the lens of dynamical systems.
Stars: ✭ 273 (-0.36%)
Mutual labels:  jupyter-notebook
Rlquant
Applying Reinforcement Learning in Quantitative Trading
Stars: ✭ 271 (-1.09%)
Mutual labels:  jupyter-notebook
Pytorch Image Classification
Tutorials on how to implement a few key architectures for image classification using PyTorch and TorchVision.
Stars: ✭ 272 (-0.73%)
Mutual labels:  jupyter-notebook
Enigma Simulator
An educational demonstration of breaking the Enigma machine
Stars: ✭ 273 (-0.36%)
Mutual labels:  jupyter-notebook

combinatorial optimization with DL/RL: IPython tutorials

This tutorial demonstrates technique to solve combinatorial optimization problems such as the well-known travelling salesman problem. The method was presented in the paper Neural Combinatorial Optimization with Reinforcement Learning.

The Algorithm applies the pointer network architecture wherein an attention mechanism is fashioned to point to elements of an input sequence, allowing a decoder to output said elements. The network is trained by reinforcement learning using an actor-critic method.

Note! This model does not beat existing baselines for TSP, moreover local search method LK-H solves these tsp tasks to optimality in seconds on a CPU, compared to suboptimal results by this model in several hours on a GPU.

The algorithm consists of two parts:

Pointer Network

Intro to PN for simple sorting task: Intro to Pointer Network.ipynb.

Paper: Pointer Networks.

Blog post by fast ml: Introduction to pointer networks.

Neural Combinatorial Optimization

Neural Combinatorial Optimization.ipynb

Paper: Neural Combinatorial Optimization with Reinforcement Learning

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].