Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → befelix → Safe_learning

befelix / Safe_learning

Licence: mit

Safe reinforcement learning with stability guarantees

Programming Languages

python

139335 projects - #7 most used programming language

Labels

reinforcement-learning gaussian-processes dynamic-programming

Projects that are alternatives of or similar to Safe learning

Move37

Coding Demos from the School of AI's Move37 Course

Stars: ✭ 130 (-7.14%)

Mutual labels: dynamic-programming, reinforcement-learning

Reinforcement learning tutorial with demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Stars: ✭ 442 (+215.71%)

Mutual labels: dynamic-programming, reinforcement-learning

Pilco

Bayesian Reinforcement Learning in Tensorflow

Stars: ✭ 222 (+58.57%)

Mutual labels: reinforcement-learning, gaussian-processes

Safeopt

Safe Bayesian Optimization

Stars: ✭ 90 (-35.71%)

Mutual labels: reinforcement-learning, gaussian-processes

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+2765.71%)

Mutual labels: dynamic-programming, reinforcement-learning

Numpy Ml

Machine learning, in numpy

Stars: ✭ 11,100 (+7828.57%)

Mutual labels: reinforcement-learning, gaussian-processes

Vizdoom Keras Rl

Reinforcement Learning in Keras on VizDoom

Stars: ✭ 130 (-7.14%)

Mutual labels: reinforcement-learning

Ml Agents

Unity Machine Learning Agents Toolkit

Stars: ✭ 12,134 (+8567.14%)

Mutual labels: reinforcement-learning

Aixijs

AIXIjs - General Reinforcement Learning in the Browser

Stars: ✭ 128 (-8.57%)

Mutual labels: reinforcement-learning

Aboleth

A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation

Stars: ✭ 127 (-9.29%)

Mutual labels: gaussian-processes

Survival Analysis Using Deep Learning

This repository contains morden baysian statistics and deep learning based research articles , software for survival analysis

Stars: ✭ 139 (-0.71%)

Mutual labels: gaussian-processes

Dsa Geeksclasses

DSA-Self Paced With Doubt Assistance Course Solutions in Python (Python 3)

Stars: ✭ 137 (-2.14%)

Mutual labels: dynamic-programming

Hindsight Experience Replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Stars: ✭ 134 (-4.29%)

Mutual labels: reinforcement-learning

Reinforcement learning in python

Implementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa

Stars: ✭ 134 (-4.29%)

Mutual labels: reinforcement-learning

Automata

A comprehensive autonomous decentralized systems framework for AI control architects.

Stars: ✭ 130 (-7.14%)

Mutual labels: reinforcement-learning

Savn

Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)

Stars: ✭ 135 (-3.57%)

Mutual labels: reinforcement-learning

Toycarirl

Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinforcement Learning Abbeel & Ng, 2004)

Stars: ✭ 128 (-8.57%)

Mutual labels: reinforcement-learning

Ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

Stars: ✭ 133 (-5%)

Mutual labels: reinforcement-learning

Pysot

Surrogate Optimization Toolbox for Python

Stars: ✭ 136 (-2.86%)

Mutual labels: gaussian-processes

Saltie

🚗 Rocket League Distributed Deep Reinforcement Learning Bot

Stars: ✭ 134 (-4.29%)

Mutual labels: reinforcement-learning

View All Similar Projects ➔

===================================================== Safe Reinforcement Learning with Stability Guarantees

.. image:: https://travis-ci.org/befelix/safe_learning.svg?branch=master :target: https://travis-ci.org/befelix/safe_learning :alt: Build status .. image:: https://readthedocs.org/projects/safe-learning/badge/?version=latest :target: http://safe-learning.readthedocs.io/en/latest/?badge=latest :alt: Documentation Status

This code accompanies the paper [1]_ and implements the code for estimating the region of attraction for a policy and optimizing the policy subject to stability constraints. For the old numpy-based code to estimate the region of attraction in [2]_ see the lyapunov-learning <https://github.com/befelix/lyapunov-learning>_ repository. The code for learning Lyapunov functions from [3]_ can be found in the examples <./examples>_ folder.

.. [1] F. Berkenkamp, M. Turchetta, A. P. Schoellig, A. Krause, Safe Model-based Reinforcement Learning with Stability Guarantees <http://arxiv.org/abs/1509.01066>_ in Proc. of the Conference on Neural Information Processing Systems (NIPS), 2017.

.. [2] F. Berkenkamp, R. Moriconi, A. P. Schoellig, A. Krause, Safe Learning of Regions of Attraction in Uncertain, Nonlinear Systems with Gaussian Processes <http://arxiv.org/abs/1603.04915>_ in Proc. of the Conference on Decision and Control (CDC), 2016.

.. [3] S. M. Richards, F. Berkenkamp, A. Krause, The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems <https://arxiv.org/abs/1808.00924>_. Conference on Robot Learning (CoRL), 2018.

Getting started

This library is tested based on both python 2.7 and 3.5, together with the following dependencies, since pip>=19 does not support --process-dependency-links (see below)

pip install pip==18.1 pip install numpy==1.14.5

Based on this, you can install the library by cloning the repository and installing it with

pip install . --process-dependency-links

To run the tests with the bash script in scripts/test_code.sh, you need to install additional dependencies with

pip install ".[test]" --process-dependency-links

The --process-dependency-links flag is needed to install gpflow==0.4.0, which is not on pypi. You can skip it if that particular version of the library is already installed.

You can the find example jupyter notebooks and the experiments in the paper in the examples <./examples>_ folder.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 140

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (3) 🔗