megvii-research / pytorch-gym

Licence: other
Implementation of Deep Deterministic Policy Gradient (DDPG) in bullet Gym using PyTorch

DDPG in bullet Gym using PyTorch

Overview

This is an implementation of Deep Deterministic Policy Gradient (DDPG) in bullet Gym using PyTorch.
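
For orientation, the two update rules at the heart of DDPG can be sketched in plain Python. This is a minimal illustration of the algorithm, not code from this repository: the function names `td_target` and `soft_update` are hypothetical, and the `gamma` and `tau` values are illustrative assumptions rather than the project's defaults.

```python
def td_target(reward, done, next_q, gamma=0.99):
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    next_q stands for the target critic's value of the target actor's action
    in the next state; here it is passed in as a plain number.
    """
    return reward + gamma * (1.0 - float(done)) * next_q


def soft_update(target_params, online_params, tau=0.001):
    """Move each target parameter a fraction tau toward its online counterpart."""
    return [tau * w + (1.0 - tau) * w_t
            for w_t, w in zip(target_params, online_params)]


# Illustrative values only (assumptions, not the repository's settings).
y = td_target(reward=1.0, done=False, next_q=2.0, gamma=0.9)
new_target = soft_update([0.0, 1.0], [1.0, 1.0], tau=0.1)
```

In a PyTorch implementation the same two steps would typically operate on tensors and network parameters; the scalar version above only shows the arithmetic.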

Dependencies

  • Python 3.6.2
  • PyTorch 0.2.0
  • gym
  • tensorboardX 1.0
  • pybullet (only needed to train agents in the bullet environments)

Run

  • A simple example that trains CartPole quickly:

$ cd base

$ python main.py --debug --discrete --env=CartPole-v0 --vis

  • To see the usage of every argument:

$ python main.py --help

  • The most important arguments:

--debug: print the reward and some other information

--discrete: use this if the actions are discrete rather than continuous

--vis: render each action (this slows training down)

--cuda: train on the GPU

--test: run in testing mode

--resume: load a model from the given path
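
As an illustration of how flags like these are commonly wired up, here is a hedged `argparse` sketch. It is not the repository's `main.py`: the flag names come from the list above, but the helper name `build_parser`, the argument types, and the defaults are assumptions, so check `python main.py --help` for the real definitions.

```python
import argparse


def build_parser():
    # Hypothetical parser: flag names mirror the README, while the types
    # and defaults are assumptions made for this example.
    parser = argparse.ArgumentParser(description="DDPG training (illustrative sketch)")
    parser.add_argument("--env", type=str, required=True,
                        help="Gym environment id, e.g. CartPole-v0")
    parser.add_argument("--debug", action="store_true",
                        help="print the reward and some other information")
    parser.add_argument("--discrete", action="store_true",
                        help="treat the action space as discrete")
    parser.add_argument("--vis", action="store_true",
                        help="render each action (slows training down)")
    parser.add_argument("--cuda", action="store_true",
                        help="train on the GPU")
    parser.add_argument("--test", action="store_true",
                        help="run in testing mode")
    parser.add_argument("--resume", type=str, default=None,
                        help="path to load a model from")
    return parser


# Parse the same flags used in the CartPole example command.
args = build_parser().parse_args(
    ["--debug", "--discrete", "--env=CartPole-v0", "--vis"]
)
```

Boolean switches are modelled with `store_true`, so omitting a flag leaves it `False`; `--resume` stays `None` unless a path is supplied.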

DDPG from baselines

# run HalfCheetah-v1 with default parameters
cd ./baselines
python main.py 

Contributors
