Kaixhin / Acer

Licence: MIT

Actor-critic with experience replay

Labels

deep-learning deep-reinforcement-learning

Projects that are alternatives of or similar to Acer

2048 Deep Reinforcement Learning

Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning

Stars: ✭ 169 (-21.4%)

Mutual labels: deep-reinforcement-learning

Awesome Deep Rl

A curated list of awesome Deep Reinforcement Learning resources.

Stars: ✭ 193 (-10.23%)

Mutual labels: deep-reinforcement-learning

Tensorflow2 Deep Reinforcement Learning

Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"

Stars: ✭ 204 (-5.12%)

Mutual labels: deep-reinforcement-learning

Repository for the code of 'Deep Reinforcement Learning'

Stars: ✭ 172 (-20%)

Mutual labels: deep-reinforcement-learning

Hands On Intelligent Agents With Openai Gym

Code for the book Hands-On Intelligent Agents with OpenAI Gym, for getting started with building deep reinforcement learning agents using PyTorch

Stars: ✭ 189 (-12.09%)

Mutual labels: deep-reinforcement-learning

Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems

Stars: ✭ 196 (-8.84%)

Mutual labels: deep-reinforcement-learning

Accel Brain Code

The purpose of this repository is to build prototypes as case studies for the proofs of concept (PoC) and research and development (R&D) that I have written about on my website. The main research topics are auto-encoders in relation to representation learning, statistical machine learning for energy-based models, generative adversarial networks (GANs), deep reinforcement learning such as Deep Q-Networks, semi-supervised learning, and neural network language models for natural language processing.

Stars: ✭ 166 (-22.79%)

Mutual labels: deep-reinforcement-learning

Manage your machine learning experiments with trixi - modular, reproducible, high fashion. An experiment infrastructure optimized for PyTorch, but flexible enough to work for your framework and your tastes.

Stars: ✭ 211 (-1.86%)

Mutual labels: deep-reinforcement-learning

"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow

Stars: ✭ 192 (-10.7%)

Mutual labels: deep-reinforcement-learning

Summaries of machine learning papers

Stars: ✭ 2,362 (+998.6%)

Mutual labels: deep-reinforcement-learning

Machine Learning Is All You Need

🔥🌟《Machine Learning 格物志》 (Machine Learning Investigation Notes): ML + DL + RL basic code and notes using sklearn, PyTorch, TensorFlow, and Keras and, most importantly, from scratch!💪 This repository is ALL You Need!

Stars: ✭ 173 (-19.53%)

Mutual labels: deep-reinforcement-learning

Deep Reinforcement Learning For Dialogue Generation In Tensorflow

Deep-Reinforcement-Learning-for-Dialogue-Generation-in-tensorflow

Stars: ✭ 178 (-17.21%)

Mutual labels: deep-reinforcement-learning

Atari Model Zoo

A binary release of deep reinforcement learning models trained on the Atari machine learning benchmark, and a software release that enables easy visualization and analysis of models, and comparison across training algorithms.

Stars: ✭ 198 (-7.91%)

Mutual labels: deep-reinforcement-learning

Noisy Networks for Exploration

Stars: ✭ 171 (-20.47%)

Mutual labels: deep-reinforcement-learning

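The file 笔和线段的一种划分.py: feed in the high and low data of candlestick (K-line) bars and it automatically segments strokes, line segments, pivots, buy/sell points, and trend types. sh.csv can be used as the input file. For my résumé, see the .pdf. The power of time. Some say market timing is hard, some say stock picking is easy, and some say the IT infrastructure needed for statistical arbitrage is very important. Others say the system has an uncertainty principle. Opinions vary. In a distributed system, only when your influence can be ignored can you achieve what Chairman Jiang called "making a fortune quietly".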

Stars: ✭ 206 (-4.19%)

Mutual labels: deep-reinforcement-learning

Deep Reinforcement Learning Algorithms

31 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Stars: ✭ 167 (-22.33%)

Mutual labels: deep-reinforcement-learning

Reinforcementlearning.jl

A reinforcement learning package for Julia

Stars: ✭ 192 (-10.7%)

Mutual labels: deep-reinforcement-learning

Modularized Implementation of Deep RL Algorithms in PyTorch

Stars: ✭ 2,640 (+1127.91%)

Mutual labels: deep-reinforcement-learning

Pytorch A2c Ppo Acktr Gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Stars: ✭ 2,632 (+1124.19%)

Mutual labels: deep-reinforcement-learning

Deep Reinforcement Learning Gym

Deep reinforcement learning model implementation in Tensorflow + OpenAI gym

Stars: ✭ 200 (-6.98%)

Mutual labels: deep-reinforcement-learning

ACER

Actor-critic with experience replay (ACER) [1]. Uses batch off-policy updates to improve stability. Trust region updates can be enabled with --trust-region. The implementation currently uses the full trust region instead of the "efficient" trust region (see issue #1).
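For orientation, the trust region update described in [1] projects a gradient g so that a linearised KL constraint holds: z* = g - max(0, (k·g - delta) / ||k||^2) k, where k is the gradient of the KL divergence from the average policy. The following is a minimal sketch of that projection on plain Python lists. It is not the code in this repository; the function name trust_region_step is hypothetical, and the sketch says nothing about whether the projection is applied in the "full" or the "efficient" form.

```python
from typing import List


def trust_region_step(g: List[float], k: List[float], delta: float) -> List[float]:
    """Project gradient g so that k . z <= delta (linearised KL constraint).

    Implements z* = g - max(0, (k . g - delta) / ||k||^2) * k.
    g, k and delta follow the notation of [1]; this helper is illustrative only.
    """
    k_dot_g = sum(ki * gi for ki, gi in zip(k, g))
    k_norm_sq = sum(ki * ki for ki in k)
    if k_norm_sq == 0.0:
        # No KL gradient, so the constraint places no restriction on g.
        return list(g)
    # The scale is zero whenever g already satisfies the constraint.
    scale = max(0.0, (k_dot_g - delta) / k_norm_sq)
    return [gi - scale * ki for gi, ki in zip(g, k)]


# g violates the constraint (k . g = 1.0 > 0.5), so it is pulled back.
clipped = trust_region_step([1.0, 0.0], [1.0, 0.0], delta=0.5)
# g is orthogonal to k (k . g = 0.0 <= 0.5), so it is returned unchanged.
unchanged = trust_region_step([0.0, 1.0], [1.0, 0.0], delta=0.5)
```

After the projection, the first result satisfies k·z = delta exactly, which is the boundary of the trust region.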

Run with python main.py <options>. To run asynchronous advantage actor-critic (A3C) [2] (but with a Q-value head), use the --on-policy option.
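As an illustration of how the two options named above might be declared, here is a small argparse sketch. Only the flag names --trust-region and --on-policy come from this README; treating them as boolean switches, the help strings, and the helper name build_parser are assumptions, and the actual main.py may define them differently and accepts further options not shown here.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build an illustrative parser for the two flags mentioned in the README."""
    parser = argparse.ArgumentParser(description="ACER (illustrative option parser)")
    # Assumed to be a boolean switch: off unless passed on the command line.
    parser.add_argument("--trust-region", action="store_true",
                        help="enable trust region updates")
    # Assumed to be a boolean switch: selects A3C-style on-policy training.
    parser.add_argument("--on-policy", action="store_true",
                        help="train on-policy (A3C with a Q-value head)")
    return parser


# Equivalent to: python main.py --trust-region
args = build_parser().parse_args(["--trust-region"])
print(args.trust_region, args.on_policy)  # prints: True False
```

argparse converts the hyphen in each flag name to an underscore, so the values are read as args.trust_region and args.on_policy.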

Requirements

To install all dependencies with Anaconda, run conda env create -f environment.yml, then use source activate acer to activate the environment.

Results

Acknowledgements

@ikostrikov for pytorch-a3c
@apaszke for Reinforcement Learning (DQN) tutorial
@pfnet for ChainerRL

References

[1] Sample Efficient Actor-Critic with Experience Replay
[2] Asynchronous Methods for Deep Reinforcement Learning

Stars: ✭ 215
