Kaixhin / Acer

Licence: MIT
Actor-critic with experience replay

ACER

MIT License

An implementation of actor-critic with experience replay (ACER) [1]. It uses batch off-policy updates to improve stability. Trust region updates can be enabled with --trust-region. The implementation currently uses the full trust region rather than the "efficient" trust region described in [1] (see issue #1).
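
As a rough illustration of what a trust region update does, the sketch below implements the closed-form gradient projection given in [1]: a gradient g is shrunk along k (the gradient of the KL divergence from an average policy) until their inner product is at most delta. The function name trust_region_step, the plain-list vectors, and the value of delta are assumptions made for this example; it is not taken from this repository's code.

```python
def trust_region_step(g, k, delta):
    """Return z = g - max(0, (k.g - delta) / ||k||^2) * k.

    g: gradient of the policy objective (list of floats)
    k: gradient of the KL divergence term (list of floats)
    delta: trust region threshold (hypothetical value chosen by the caller)
    """
    k_dot_g = sum(ki * gi for ki, gi in zip(k, g))
    k_norm_sq = sum(ki * ki for ki in k)
    if k_norm_sq == 0.0:
        # No KL gradient, so there is no direction to constrain.
        return list(g)
    # The scale is zero when the constraint k.g <= delta already holds.
    scale = max(0.0, (k_dot_g - delta) / k_norm_sq)
    return [gi - scale * ki for gi, ki in zip(g, k)]


# Constraint active: k.g = 1.0 exceeds delta = 0.5, so g is shrunk along k.
adjusted = trust_region_step([1.0, 0.0], [1.0, 0.0], delta=0.5)  # [0.5, 0.0]

# Constraint inactive: k.g = 0.0 is below delta, so g is returned unchanged.
unchanged = trust_region_step([0.0, 1.0], [1.0, 0.0], delta=0.5)  # [0.0, 1.0]
```

After the projection, the inner product of k and the adjusted gradient never exceeds delta, which is what bounds the size of the policy change per update.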

Run with python main.py <options>. To run asynchronous advantage actor-critic (A3C) [2] (but with a Q-value head), use the --on-policy option.
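
To show what a Q-value head makes possible, the sketch below derives a state value and per-action advantages from a discrete policy and its Q-values, using V(s) = sum over a of pi(a|s) * Q(s, a) as in [1]. The helper names softmax, state_value and advantages are hypothetical, and whether the --on-policy mode computes its values exactly this way is an assumption.

```python
import math


def softmax(logits):
    """Turn unnormalised policy logits into action probabilities."""
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def state_value(policy_probs, q_values):
    """Expected Q-value under the policy: V(s) = sum_a pi(a|s) * Q(s, a)."""
    return sum(p * q for p, q in zip(policy_probs, q_values))


def advantages(policy_probs, q_values):
    """Advantage of each action: A(s, a) = Q(s, a) - V(s)."""
    v = state_value(policy_probs, q_values)
    return [q - v for q in q_values]


probs = softmax([0.0, 0.0])          # uniform policy over two actions
value = state_value(probs, [1.0, 3.0])   # 2.0
adv = advantages(probs, [1.0, 3.0])      # [-1.0, 1.0]
```

With a Q-value head the network needs no separate value output, since the value follows from the policy and the Q-values.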

Requirements

To install all dependencies with Anaconda, run conda env create -f environment.yml, then activate the environment with source activate acer.

Results

[Image: ACER]

Acknowledgements

References

[1] Sample Efficient Actor-Critic with Experience Replay
[2] Asynchronous Methods for Deep Reinforcement Learning
