kashif / Firedup

Licence: MIT

Clone of OpenAI's Spinning Up in PyTorch

Labels

deep-learning pytorch reinforcement-learning

Welcome to Fired Up in Deep RL!

This is a clone of OpenAI's Spinning Up in PyTorch. Spinning Up is an awesome educational resource produced by Josh Achiam, a research scientist at OpenAI, that makes it easier to learn about deep reinforcement learning (deep RL).

Installation

Fired Up requires Python 3, PyTorch, OpenAI Gym, and OpenMPI.

Fired Up is currently supported only on Linux and macOS. It may be possible to install it on Windows, but this has not been tested.

Installing Python

We recommend installing Python through Anaconda. Anaconda is a Python distribution that includes many useful packages, especially for scientific computing, as well as an environment manager called conda that simplifies package management.

Download and install the Python 3.7 version of Anaconda 2018.x (2018.12 at the time of writing). Then create a conda environment to hold the packages used by Fired Up:

conda create -n firedup python=3.7

To use Python from the environment you just created, activate the environment with:

source activate firedup

Alternatively, you can use virtualenv with your existing Python 3 installation. Install it via pip3, then create an environment:

virtualenv firedup

To activate this virtual environment, run:

source /path/to/firedup/bin/activate

Installing OpenMPI

Ubuntu

sudo apt update && sudo apt install libopenmpi-dev

Mac OS X

Installation of system packages on Mac requires Homebrew. With Homebrew installed, run the following:

brew install openmpi

Installing Fired Up

git clone https://github.com/kashif/firedup.git
cd firedup
pip install -e .

Fired Up defaults to installing everything in Gym except the MuJoCo environments.
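
Before running anything, it can help to confirm that the Python-side requirements listed above are importable from the active environment. The following is a minimal sketch, not part of Fired Up: the import names `torch` and `gym` are the usual ones for PyTorch and OpenAI Gym, while `mpi4py` is an assumption about how OpenMPI is reached from Python.

```python
import importlib.util
import sys

# Import names assumed for the requirements listed above; `mpi4py` in
# particular is an assumption about how OpenMPI is accessed from Python.
REQUIRED_MODULES = ["torch", "gym", "mpi4py"]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    if sys.version_info < (3,):
        sys.exit("Fired Up requires Python 3")
    missing = missing_modules(REQUIRED_MODULES)
    print("missing:", missing if missing else "none")
```

If a module is reported as missing, check that the conda or virtualenv environment created earlier is the one currently activated.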

Check Your Install

To see if you've successfully installed Fired Up, try running PPO in the LunarLander-v2 environment with:

python -m fireup.run ppo --hid "[32,32]" --env LunarLander-v2 --exp_name installtest --gamma 0.999

After it finishes training, watch a video of the trained policy with:

python -m fireup.run test_policy data/installtest/installtest_s0

And plot the results with:

python -m fireup.run plot data/installtest/installtest_s0
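
The install test above passes `--gamma 0.999`, the discount factor. As a rough illustration of what a discount factor does, the sketch below computes the standard discounted return for each step of one episode. It is a generic textbook formulation, not code taken from Fired Up, and the function name `discounted_returns` is hypothetical.

```python
def discounted_returns(rewards, gamma):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk the episode backwards so each step reuses the return after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


if __name__ == "__main__":
    # A small gamma is used here only to make the arithmetic easy to follow.
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))
```

A value close to 1, such as 0.999, makes rewards far in the future count almost as much as immediate ones.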

Algorithms

The following algorithms are implemented in the Fired Up package:

Vanilla Policy Gradient (VPG)
Trust Region Policy Optimization (TRPO)
Proximal Policy Optimization (PPO)
Deep Q-Network (DQN)
Deep Deterministic Policy Gradient (DDPG)
Twin Delayed DDPG (TD3)
Soft Actor-Critic (SAC)

They are all implemented with MLP (non-recurrent) actor-critics, making them suitable for fully observed, non-image-based RL environments, e.g. the Gym MuJoCo environments.
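
To give a feel for how small such MLPs are, the sketch below lists the weight shapes and counts the parameters of a fully connected network with given hidden sizes, such as the `[32,32]` passed to `--hid` in the install test. The helper names and the observation and output dimensions (8 and 4) are illustrative assumptions, and the actual networks in Fired Up may add or arrange layers differently.

```python
def mlp_layer_shapes(obs_dim, hidden_sizes, out_dim):
    """Return (inputs, outputs) for each fully connected layer in order."""
    sizes = [obs_dim, *hidden_sizes, out_dim]
    return list(zip(sizes[:-1], sizes[1:]))


def mlp_param_count(obs_dim, hidden_sizes, out_dim):
    """Count weights plus biases over all layers of the MLP."""
    shapes = mlp_layer_shapes(obs_dim, hidden_sizes, out_dim)
    return sum(n_in * n_out + n_out for n_in, n_out in shapes)


if __name__ == "__main__":
    # obs_dim=8 and out_dim=4 are illustrative values, not read from Gym.
    print(mlp_layer_shapes(8, [32, 32], 4))
    print(mlp_param_count(8, [32, 32], 4))
```

Because the input is a flat observation vector, this kind of network has no convolutional or recurrent state, which is why it fits fully observed, non-image tasks.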

Citation

If you use Fired Up in your research, please cite it with the following BibTeX entry:

@misc{rasulfiredup,
  author =       {Kashif Rasul and Joshua Achiam},
  title =        {Fired Up},
  howpublished = {\url{https://github.com/kashif/firedup/}},
  year =         {2019}
}
