All Projects → befelix → Safe_learning

befelix / Safe_learning

Licence: mit
Safe reinforcement learning with stability guarantees

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Safe learning

Move37
Coding Demos from the School of AI's Move37 Course
Stars: ✭ 130 (-7.14%)
Mutual labels:  dynamic-programming, reinforcement-learning
Reinforcement learning tutorial with demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+215.71%)
Mutual labels:  dynamic-programming, reinforcement-learning
Pilco
Bayesian Reinforcement Learning in Tensorflow
Stars: ✭ 222 (+58.57%)
Mutual labels:  reinforcement-learning, gaussian-processes
Safeopt
Safe Bayesian Optimization
Stars: ✭ 90 (-35.71%)
Mutual labels:  reinforcement-learning, gaussian-processes
Deep Reinforcement Learning
Repo for the Deep Reinforcement Learning Nanodegree program
Stars: ✭ 4,012 (+2765.71%)
Mutual labels:  dynamic-programming, reinforcement-learning
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (+7828.57%)
Mutual labels:  reinforcement-learning, gaussian-processes
Vizdoom Keras Rl
Reinforcement Learning in Keras on VizDoom
Stars: ✭ 130 (-7.14%)
Mutual labels:  reinforcement-learning
Ml Agents
Unity Machine Learning Agents Toolkit
Stars: ✭ 12,134 (+8567.14%)
Mutual labels:  reinforcement-learning
Aixijs
AIXIjs - General Reinforcement Learning in the Browser
Stars: ✭ 128 (-8.57%)
Mutual labels:  reinforcement-learning
Aboleth
A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation
Stars: ✭ 127 (-9.29%)
Mutual labels:  gaussian-processes
Survival Analysis Using Deep Learning
This repository contains morden baysian statistics and deep learning based research articles , software for survival analysis
Stars: ✭ 139 (-0.71%)
Mutual labels:  gaussian-processes
Dsa Geeksclasses
DSA-Self Paced With Doubt Assistance Course Solutions in Python (Python 3)
Stars: ✭ 137 (-2.14%)
Mutual labels:  dynamic-programming
Hindsight Experience Replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Stars: ✭ 134 (-4.29%)
Mutual labels:  reinforcement-learning
Reinforcement learning in python
Implementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa
Stars: ✭ 134 (-4.29%)
Mutual labels:  reinforcement-learning
Automata
A comprehensive autonomous decentralized systems framework for AI control architects.
Stars: ✭ 130 (-7.14%)
Mutual labels:  reinforcement-learning
Savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
Stars: ✭ 135 (-3.57%)
Mutual labels:  reinforcement-learning
Toycarirl
Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinforcement Learning Abbeel & Ng, 2004)
Stars: ✭ 128 (-8.57%)
Mutual labels:  reinforcement-learning
Ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Stars: ✭ 133 (-5%)
Mutual labels:  reinforcement-learning
Pysot
Surrogate Optimization Toolbox for Python
Stars: ✭ 136 (-2.86%)
Mutual labels:  gaussian-processes
Saltie
🚗 Rocket League Distributed Deep Reinforcement Learning Bot
Stars: ✭ 134 (-4.29%)
Mutual labels:  reinforcement-learning

===================================================== Safe Reinforcement Learning with Stability Guarantees

.. image:: https://travis-ci.org/befelix/safe_learning.svg?branch=master :target: https://travis-ci.org/befelix/safe_learning :alt: Build status .. image:: https://readthedocs.org/projects/safe-learning/badge/?version=latest :target: http://safe-learning.readthedocs.io/en/latest/?badge=latest :alt: Documentation Status

This code accompanies the paper [1]_ and implements the code for estimating the region of attraction for a policy and optimizing the policy subject to stability constraints. For the old numpy-based code to estimate the region of attraction in [2]_ see the lyapunov-learning <https://github.com/befelix/lyapunov-learning>_ repository. The code for learning Lyapunov functions from [3]_ can be found in the examples <./examples>_ folder.

.. [1] F. Berkenkamp, M. Turchetta, A. P. Schoellig, A. Krause, Safe Model-based Reinforcement Learning with Stability Guarantees <http://arxiv.org/abs/1509.01066>_ in Proc. of the Conference on Neural Information Processing Systems (NIPS), 2017.

.. [2] F. Berkenkamp, R. Moriconi, A. P. Schoellig, A. Krause, Safe Learning of Regions of Attraction in Uncertain, Nonlinear Systems with Gaussian Processes <http://arxiv.org/abs/1603.04915>_ in Proc. of the Conference on Decision and Control (CDC), 2016.

.. [3] S. M. Richards, F. Berkenkamp, A. Krause, The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems <https://arxiv.org/abs/1808.00924>_. Conference on Robot Learning (CoRL), 2018.

Getting started

This library is tested based on both python 2.7 and 3.5, together with the following dependencies, since pip>=19 does not support --process-dependency-links (see below)

::

pip install pip==18.1 pip install numpy==1.14.5

Based on this, you can install the library by cloning the repository and installing it with

pip install . --process-dependency-links

To run the tests with the bash script in scripts/test_code.sh, you need to install additional dependencies with

pip install ".[test]" --process-dependency-links

The --process-dependency-links flag is needed to install gpflow==0.4.0, which is not on pypi. You can skip it if that particular version of the library is already installed.

You can the find example jupyter notebooks and the experiments in the paper in the examples <./examples>_ folder.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].