BerkeleyAutomation / rlqp

Licence: other
Accelerating Quadratic Optimization with Reinforcement Learning

Projects that are alternatives of or similar to rlqp

osqp-cpp
A C++ interface for the OSQP quadratic programming solver.
Stars: ✭ 160 (+125.35%)
Mutual labels:  quadratic-programming, osqp
node-quadprog
Module for solving quadratic programming problems with constraints
Stars: ✭ 28 (-60.56%)
Mutual labels:  quadratic-programming
Paddle-RLBooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (+59.15%)
Mutual labels:  td3
portfolio allocation js
A JavaScript library to allocate and optimize financial portfolios.
Stars: ✭ 145 (+104.23%)
Mutual labels:  quadratic-programming
ReinforcementLearningZoo.jl
juliareinforcementlearning.org/
Stars: ✭ 46 (-35.21%)
Mutual labels:  td3
NMPC-DCLF-DCBF
A collection of work using nonlinear model predictive control (NMPC) with discrete-time control Lyapunov functions (CLFs) and control barrier functions (CBFs)
Stars: ✭ 38 (-46.48%)
Mutual labels:  quadratic-programming
model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
Stars: ✭ 56 (-21.13%)
Mutual labels:  td3
Rainy
☔ Deep RL agents with PyTorch☔
Stars: ✭ 39 (-45.07%)
Mutual labels:  td3
osqp
The Operator Splitting QP Solver
Stars: ✭ 929 (+1208.45%)
Mutual labels:  quadratic-programming
pydata-london-2018
Slides and notebooks for my tutorial at PyData London 2018
Stars: ✭ 22 (-69.01%)
Mutual labels:  quadratic-programming
rmpk
Mixed Integer Linear and Quadratic Programming in R
Stars: ✭ 37 (-47.89%)
Mutual labels:  quadratic-programming
Deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
Stars: ✭ 2,640 (+3618.31%)
Mutual labels:  td3
gibbous
Convex optimization for java and scala, built on Apache Commons Math
Stars: ✭ 17 (-76.06%)
Mutual labels:  quadratic-programming
Deep-rl-mxnet
Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Stars: ✭ 26 (-63.38%)
Mutual labels:  td3
Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Stars: ✭ 222 (+212.68%)
Mutual labels:  td3
ElegantRL
Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥
Stars: ✭ 2,074 (+2821.13%)
Mutual labels:  td3
GALAHAD
A library of modern Fortran modules for nonlinear optimization
Stars: ✭ 60 (-15.49%)
Mutual labels:  quadratic-programming
jax-rl
JAX implementations of core Deep RL algorithms
Stars: ✭ 61 (-14.08%)
Mutual labels:  td3
quadruped control
Quadruped control architecture
Stars: ✭ 46 (-35.21%)
Mutual labels:  quadratic-programming
qpmad
ROS-compatible Eigen-based Goldfarb-Idnani quadratic programming solver
Stars: ✭ 41 (-42.25%)
Mutual labels:  quadratic-programming

RLQP: Accelerating Quadratic Optimization with RL

We demonstrate reinforcement learning can significantly accelerate first-order optimization, outperforming state-of-the-art solvers by up to 3x. RLQP avoids suboptimal heuristics within solvers by tuning the internal parameters of the ADMM algorithm. By decomposing the policy as a multi-agent partially observed problem, RLQP adapts to unseen problem classes and to larger problems than seen during training.

Getting Started

RLQP is composed of a few submodules, namely to (a) train the RL policy on a specific class of problems (source in rlqp_train/) and (b) evaluate the policy on a test problem. Most users will want to start by using RLQP's policy to accelerate optimization of their problems.

Prerequisites

Installation (evaluation)

To install the Python package to evaluate a pre-trained policy, run:

pip install git+https://github.com/berkeleyautomation/rlqp-python.git@55f378e496979bd00e84cea4583ac37bfaa571a9

This package contains a pre-trained model which should improve convergence beyond OSQP. The interface is identical to OSQP.

Installation (training)

Please follow the instructions in the rlqp_train/ directory to setup and run training code. This code is still in preview mode as we work to release features like our TD3 policy.

Citation

@article{ichnowski2021rlqp,
  title={Accelerating Quadratic Optimization with Reinforcement Learning},
  author={Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato,
    and Goran Banjac, Michael Luo, Francesco Borrelli
    and Joseph E. Gonzalez, Ion Stoica, Ken Goldberg},
  year={2021},
  journal={arXiv preprint}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].