DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Stars: ✭ 97 (-6.73%)

Mutual labels: deep-reinforcement-learning, policy-gradient, mujoco

Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Stars: ✭ 54 (-48.08%)

Mutual labels: deep-reinforcement-learning, policy-gradient

SharkStock

Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …

Stars: ✭ 63 (-39.42%)

Mutual labels: deep-reinforcement-learning, policy-gradient

omd

JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"

Stars: ✭ 43 (-58.65%)

Mutual labels: deep-reinforcement-learning, model-based-rl

jax-rl

JAX implementations of core Deep RL algorithms

Stars: ✭ 61 (-41.35%)

Mutual labels: deep-reinforcement-learning, mujoco

Reinforcement Learning

Minimal and Clean Reinforcement Learning Examples

Stars: ✭ 2,863 (+2652.88%)

Mutual labels: deep-reinforcement-learning, policy-gradient

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+2430.77%)

Mutual labels: deep-reinforcement-learning, mujoco

a3c-super-mario-pytorch

Reinforcement Learning for Super Mario Bros using A3C on GPU

Stars: ✭ 35 (-66.35%)

Mutual labels: deep-reinforcement-learning, openai-gym

Fruit-API

A Universal Deep Reinforcement Learning Framework

Stars: ✭ 61 (-41.35%)

Mutual labels: deep-reinforcement-learning, actor-critic-algorithm

drl grasping

Deep Reinforcement Learning for Robotic Grasping from Octrees

Stars: ✭ 160 (+53.85%)

Mutual labels: deep-reinforcement-learning, openai-gym

View All Similar Projects ➔

CS285

Pytorch Version of homework assignments of Deep Reinforcement Learning Course
Presented by Dr. Sergey Levin at University of California, Berkeley

Report Bug

About the Project
- Main Goals
- Completed So Far
Getting Started
- Prerequisites
Usage
Roadmap
Contributing
License
Contact

About The Project

In this project, we aim to create a Pytorch version of CS285 course whose Tensorflow 1 version is already available at here.

Main Goals

Converting all the Tensorflow 1 code to the newest version of Pytorch
The current version Mujoco environment that has been used in this project is old which requires only using Python < 3.6 version. Therefore, we seek to make this project compatible with the newer version of this library and, consequently, Python >= 3.6.

Completed So Far

Homework 1, 2, 3, and 4 Tensorflow codes have been fully replaced by Pytorch.

Getting Started

Currently, this project is under development, and the same libraries that have been employed in the Tensorflow version of these assignments plus Pytorch are required for running the assignments of this project. However, we are eager to use the versions of these libraries that are presented in the prerequisites section for the future release of this project.

Prerequisites

The libraries that we want to use in the future are as follows.

Python >= 3.6
Gym >= 0.17
Mujoco-py >= 2.0
Pytorch >= 1.5.1
TensorboardX
Matplotlib
Ipython
Moviepy
OpenCV
Box2d-py

Usage

The instructions for execution of all of these assignments are given in the Readme documents that are located in each of the homework directories.

Roadmap

See the open issues for a list of known issues.

Contributing

Unfortunately, the current version of this repository is not compatible with the latest versions of libraries, such as Tensorflow and Mojocu-py. As a result, installing the proper versions of these libraries, which can enable you to contribute to this repo, could be a hard challenge. However, since I have been faced with this problem before, I designed a certain number of steps that you can take to install the right versions of these libraries.

Create a new Conda environment based on Python 3.5 and install matplotlib, ipython, and pytorch. Then, activate it.

conda create -n cs285_env python=3.5 matplotlib ipython pytorch=1.5.0
source activate cs285_env

Clone this repository
Install mujoco-py
1. Get mujoco license key file from its website
2. Create a .mujoco folder in the home directory and copy the given mjpro150 directory and your license key into it
```
mkdir ~/.mujoco/
cd <location_of_your_license_key>
cp mjkey.txt ~/.mujoco/
cd <this_repo>/mujoco
cp -r mjpro150 ~/.mujoco/
```
1. Add the following line to bottom of your .bashrc file:
```
export LD_LIBRARY_PATH=~/.mujoco/mjpro150/bin/
```
1. Build and install mujoco-py 1.50.1.1. It can be downloaded from this link.
```
tar -xzf mujoco-py-1.50.1.1.tar.gz 
cd mujoco-py-1.50.1.1
python setup.py install
```
Install rest of the libraries given in contribution_requirements.txt file using pip

pip install --user --requirement contribution_requirements.txt

At last, it should be considered that before executing scripts of each homework folder (e.g., hw1), you should allow your code to be able to see 'cs285' by executing the following lines:

cd <path_to_hw>
pip install -e .

License

Distributed under the MIT License. See LICENSE file for more information.

Contact

Erfan Miahi - @erfan_mhi - [email protected]

Project Link: https://github.com/erfanMhi/Deep-Reinforcement-Learning-CS285-Pytorch

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

erfanMhi / Deep-Reinforcement-Learning-CS285-Pytorch

Programming Languages

Labels

Projects that are alternatives of or similar to Deep-Reinforcement-Learning-CS285-Pytorch

CS285

Table of Contents

About The Project

Main Goals

Completed So Far

Getting Started

Prerequisites

Usage

Roadmap

Contributing

License

Contact