All Projects → BY571 → Upside-Down-Reinforcement-Learning

BY571 / Upside-Down-Reinforcement-Learning

Licence: MIT license
Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to Upside-Down-Reinforcement-Learning

ml-ai
ML-AI Community | Open Source | Built in Bharat for the World | Data science problem statements and solutions
Stars: ✭ 32 (-50%)
Mutual labels:  machine-learning-algorithms, reinforcement-learning-algorithms
zoofs
zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.
Stars: ✭ 142 (+121.88%)
Mutual labels:  machine-learning-algorithms
extra-model
Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on English texts.
Stars: ✭ 43 (-32.81%)
Mutual labels:  machine-learning-algorithms
ml-from-scratch
All content related to machine learning from my blog
Stars: ✭ 110 (+71.88%)
Mutual labels:  machine-learning-algorithms
normalizing-flows
Implementations of normalizing flows using python and tensorflow
Stars: ✭ 15 (-76.56%)
Mutual labels:  machine-learning-algorithms
NeuroEvolution-Flappy-Bird
A comparison between humans, neuroevolution and multilayer perceptrons playing Flapy Bird implemented in Python
Stars: ✭ 17 (-73.44%)
Mutual labels:  machine-learning-algorithms
PROSAC
PROSAC algorithm in python
Stars: ✭ 19 (-70.31%)
Mutual labels:  machine-learning-algorithms
NoML
A notebook for machine learning interview
Stars: ✭ 54 (-15.62%)
Mutual labels:  machine-learning-algorithms
spectral-clustering.matlab
An intuitive implementation of spectral clustering on matlab
Stars: ✭ 12 (-81.25%)
Mutual labels:  machine-learning-algorithms
darwin
Darwin C++ and Python Machine Learning Framework for Cyber Security
Stars: ✭ 20 (-68.75%)
Mutual labels:  machine-learning-algorithms
AIML-Human-Attributes-Detection-with-Facial-Feature-Extraction
This is a Human Attributes Detection program with facial features extraction. It detects facial coordinates using FaceNet model and uses MXNet facial attribute extraction model for extracting 40 types of facial attributes. This solution also detects Emotion, Age and Gender along with facial attributes.
Stars: ✭ 48 (-25%)
Mutual labels:  machine-learning-algorithms
subsemble
subsemble R package for ensemble learning on subsets of data
Stars: ✭ 40 (-37.5%)
Mutual labels:  machine-learning-algorithms
claire
Continuously Learning Artificial Intelligence Rules Engine (Claire) for Smart Homes
Stars: ✭ 18 (-71.87%)
Mutual labels:  machine-learning-algorithms
calcuMLator
An intelligently dumb calculator that uses machine learning
Stars: ✭ 30 (-53.12%)
Mutual labels:  machine-learning-algorithms
taller SparkR
Taller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-81.25%)
Mutual labels:  machine-learning-algorithms
Reinforcement Learning
Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
Stars: ✭ 61 (-4.69%)
Mutual labels:  reinforcement-learning-algorithms
Anomaly Detection
anomaly detection with anomalize and Google Trends data
Stars: ✭ 38 (-40.62%)
Mutual labels:  machine-learning-algorithms
RL-code-resources
A collection of Reinforcement Learning GitHub code resources divided by frameworks and environments
Stars: ✭ 51 (-20.31%)
Mutual labels:  reinforcement-learning-algorithms
influence boosting
Supporting code for the paper "Finding Influential Training Samples for Gradient Boosted Decision Trees"
Stars: ✭ 57 (-10.94%)
Mutual labels:  machine-learning-algorithms
ML-ProjectKart
🙌Kart of 210+ projects based on machine learning, deep learning, computer vision, natural language processing and all. Show your support by ✨ this repository.
Stars: ✭ 162 (+153.13%)
Mutual labels:  machine-learning-algorithms

Upside-Down-Reinforcement-Learning DOI

Upside-Down Reinforcement Learning (⅂ꓤ) implementation in Pytorch.
Based on the paper published by Jürgen Schmidhuber: ⅂ꓤ-Paper

This repository contains a discrete action space as well as a continuous action space implementation for the OpenAI gym CartPole environment (continuous version of the environment).

The notebooks include the training of a behavior function as well as an evaluation part, where you can test the trained behavior function. Feed it with an desired reward that the agent shall achieve in a desired time horizon.

Plots for the discrete CartPole Environment:

plot

Plots for the continuous CartPole Environment:

plot

Plots for the LunarLander Environment:

plot

TODO:

  • test some possible improvements mentioned in the paper (6. Future Research Directions).

Author

  • Sebastian Dittert

Feel free to use this code for your own projects or research. For citation check DOI or cite as:

@misc{Upside-Down,
  author = {Dittert, Sebastian},
  title = {PyTorch Implementation of Upside-Down RL},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/BY571/Upside-Down-Reinforcement-Learning}},
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].