31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Stars: ✭ 167 (+1092.86%)

Mutual labels: deep-reinforcement-learning, ddpg

Minimalrl

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Stars: ✭ 2,051 (+14550%)

Mutual labels: deep-reinforcement-learning, ddpg

Fruit-API

A Universal Deep Reinforcement Learning Framework

Stars: ✭ 61 (+335.71%)

Mutual labels: deep-reinforcement-learning, reinforcement-learning-algorithms

Pytorch Drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Stars: ✭ 233 (+1564.29%)

Mutual labels: deep-reinforcement-learning, ddpg

Elegantrl

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch.

Stars: ✭ 575 (+4007.14%)

Mutual labels: deep-reinforcement-learning, ddpg

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+18700%)

Mutual labels: deep-reinforcement-learning, continuous-control

yarll

Combining deep learning and reinforcement learning.

Stars: ✭ 84 (+500%)

Mutual labels: deep-reinforcement-learning, reinforcement-learning-algorithms

Deep Reinforcement Learning In Large Discrete Action Spaces

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym

Stars: ✭ 132 (+842.86%)

Mutual labels: deep-reinforcement-learning, ddpg

Easy Rl

强化学习中文教程，在线阅读地址：https://datawhalechina.github.io/easy-rl/

Stars: ✭ 3,004 (+21357.14%)

Mutual labels: deep-reinforcement-learning, ddpg

Machine Learning Is All You Need

🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!

Stars: ✭ 173 (+1135.71%)

Mutual labels: deep-reinforcement-learning, ddpg

Reinforcement Learning

🤖 Implements of Reinforcement Learning algorithms.

Stars: ✭ 104 (+642.86%)

Mutual labels: deep-reinforcement-learning, ddpg

Deeprl

Modularized Implementation of Deep RL Algorithms in PyTorch

Stars: ✭ 2,640 (+18757.14%)

Mutual labels: deep-reinforcement-learning, ddpg

deep rl acrobot

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

Stars: ✭ 32 (+128.57%)

Mutual labels: deep-reinforcement-learning, ddpg

Reinforcement-Learning-on-google-colab

Reinforcement Learning algorithm's using google-colab

Stars: ✭ 33 (+135.71%)

Mutual labels: deep-reinforcement-learning, reinforcement-learning-algorithms

pytorch-distributed

Ape-X DQN & DDPG with pytorch & tensorboard

Stars: ✭ 98 (+600%)

Mutual labels: deep-reinforcement-learning, ddpg

UAV-DDPG

Code for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"

Stars: ✭ 133 (+850%)

Mutual labels: reinforcement-learning-algorithms, ddpg

View All Similar Projects ➔

Continuous Control using Deep Reinforcement Learning on Unity ML Agent

Introduction

This repository contains a Deep Deterministic Policy Gradients (DDPG) agent running in the Unity ML Agent Reacher(https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Learning-Environment-Examples.md#reacher) environment. It can be used to train and evaluate the result of the training.

I use it for the purpose of learning DDPG agent in the context of continuous control of a agent.

The DDPG is implemented in Python 3 using PyTorch.

The full report can be found here. (https://github.com/kinwo/deeprl-continuous-control/blob/master/Report.pdf)

Environment

The 3D environment contains 20 double joined arms agents who can move freely to reach the target locations.

Goal

The goal is to control the 20 arms to move to their individual target locations and keep them there as many time steps as possible.

Environment Solved Criteria

The environment is considered solved when the average mean score of all agents reach 30+ in the last 100 epsisodes.

Rewards

A reward of +0.1 is provided for each step that each agent's hand is in the goal location independently.

Actions

Vector Action space: (Continuous) Size of 4, corresponding to torque applicable to two joints.

Spaces

The observation space is composed of 33 variables:
position, rotation, velocity, and angular velocities of the arm

Getting Started

Install Unity ML https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Installation.md
Download the Unity ML environment from one of the links below based on your OS:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here

Then unzip the file and place the file in this project folder.

Create Conda Environment

Install conda from conda.io. Create a new Conda environment with Python 3.6.

conda create --name deeprl python=3.6
source activate deeprl

Install Dependencies

cd python
pip install .

How to run the agent

To start training, simply open Continuous_Control.ipynb in Jupyter Notebook and follow the instructions there:

Start Jupyter Notebook

jupyter notebook

Trained model weights is included for quickly running the agent and seeing the result in Unity ML Agent. Simply skip the training step and run the last step of the Continuous_Control.ipynb

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

kinwo / deeprl-continuous-control

Programming Languages

Labels

Projects that are alternatives of or similar to deeprl-continuous-control

Continuous Control using Deep Reinforcement Learning on Unity ML Agent

Introduction

Environment

Goal

Environment Solved Criteria

Rewards

Actions

Spaces

Getting Started

How to run the agent