BerkeleyAutomation / rlqp

Licence: other

Accelerating Quadratic Optimization with Reinforcement Learning

Projects that are alternatives of or similar to rlqp

osqp-cpp

A C++ interface for the OSQP quadratic programming solver.

Stars: ✭ 160 (+125.35%)

Mutual labels: quadratic-programming, osqp

node-quadprog

Module for solving quadratic programming problems with constraints

Stars: ✭ 28 (-60.56%)

Mutual labels: quadratic-programming

Paddle-RLBooks

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Stars: ✭ 113 (+59.15%)

Mutual labels: td3

portfolio allocation js

A JavaScript library to allocate and optimize financial portfolios.

Stars: ✭ 145 (+104.23%)

Mutual labels: quadratic-programming

ReinforcementLearningZoo.jl

juliareinforcementlearning.org/

Stars: ✭ 46 (-35.21%)

Mutual labels: td3

NMPC-DCLF-DCBF

A collection of work using nonlinear model predictive control (NMPC) with discrete-time control Lyapunov functions (CLFs) and control barrier functions (CBFs)

Stars: ✭ 38 (-46.48%)

Mutual labels: quadratic-programming

model-free-algorithms

TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x

Stars: ✭ 56 (-21.13%)

Mutual labels: td3

Rainy

☔ Deep RL agents with PyTorch☔

Stars: ✭ 39 (-45.07%)

Mutual labels: td3

osqp

The Operator Splitting QP Solver

Stars: ✭ 929 (+1208.45%)

Mutual labels: quadratic-programming

pydata-london-2018

Slides and notebooks for my tutorial at PyData London 2018

Stars: ✭ 22 (-69.01%)

Mutual labels: quadratic-programming

rmpk

Mixed Integer Linear and Quadratic Programming in R

Stars: ✭ 37 (-47.89%)

Mutual labels: quadratic-programming

Deeprl

Modularized Implementation of Deep RL Algorithms in PyTorch

Stars: ✭ 2,640 (+3618.31%)

Mutual labels: td3

gibbous

Convex optimization for java and scala, built on Apache Commons Math

Stars: ✭ 17 (-76.06%)

Mutual labels: quadratic-programming

Deep-rl-mxnet

Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO

Stars: ✭ 26 (-63.38%)

Mutual labels: td3

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (+212.68%)

Mutual labels: td3

ElegantRL

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

Stars: ✭ 2,074 (+2821.13%)

Mutual labels: td3

GALAHAD

A library of modern Fortran modules for nonlinear optimization

Stars: ✭ 60 (-15.49%)

Mutual labels: quadratic-programming

jax-rl

JAX implementations of core Deep RL algorithms

Stars: ✭ 61 (-14.08%)

Mutual labels: td3

quadruped control

Quadruped control architecture

Stars: ✭ 46 (-35.21%)

Mutual labels: quadratic-programming

qpmad

ROS-compatible Eigen-based Goldfarb-Idnani quadratic programming solver

Stars: ✭ 41 (-42.25%)

Mutual labels: quadratic-programming

View All Similar Projects ➔

RLQP: Accelerating Quadratic Optimization with RL

We demonstrate reinforcement learning can significantly accelerate first-order optimization, outperforming state-of-the-art solvers by up to 3x. RLQP avoids suboptimal heuristics within solvers by tuning the internal parameters of the ADMM algorithm. By decomposing the policy as a multi-agent partially observed problem, RLQP adapts to unseen problem classes and to larger problems than seen during training.

Getting Started

RLQP is composed of a few submodules, namely to (a) train the RL policy on a specific class of problems (source in rlqp_train/) and (b) evaluate the policy on a test problem. Most users will want to start by using RLQP's policy to accelerate optimization of their problems.

Prerequisites

Installation (evaluation)

To install the Python package to evaluate a pre-trained policy, run:

pip install git+https://github.com/berkeleyautomation/rlqp-python.git@55f378e496979bd00e84cea4583ac37bfaa571a9

This package contains a pre-trained model which should improve convergence beyond OSQP. The interface is identical to OSQP.

Installation (training)

Please follow the instructions in the rlqp_train/ directory to setup and run training code. This code is still in preview mode as we work to release features like our TD3 policy.

Citation

@article{ichnowski2021rlqp,
  title={Accelerating Quadratic Optimization with Reinforcement Learning},
  author={Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato,
    and Goran Banjac, Michael Luo, Francesco Borrelli
    and Joseph E. Gonzalez, Ion Stoica, Ken Goldberg},
  year={2021},
  journal={arXiv preprint}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

BerkeleyAutomation / rlqp

Labels

Projects that are alternatives of or similar to rlqp

RLQP: Accelerating Quadratic Optimization with RL

Getting Started

Prerequisites

Installation (evaluation)

Installation (training)

Citation