All Projects → aunum → Gold

aunum / Gold

Licence: apache-2.0
Reinforcement Learning in Go

Programming Languages

go
31211 projects - #10 most used programming language
golang
3204 projects

Projects that are alternatives of or similar to Gold

Alpha Zero General
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+1117.21%)
Mutual labels:  reinforcement-learning
Icychesszero
中国象棋alpha zero程序
Stars: ✭ 206 (-4.19%)
Mutual labels:  reinforcement-learning
Awesome Deeplearning Resources
Deep Learning and deep reinforcement learning research papers and some codes
Stars: ✭ 2,483 (+1054.88%)
Mutual labels:  reinforcement-learning
Gym Unrealcv
Unreal environments for reinforcement learning
Stars: ✭ 202 (-6.05%)
Mutual labels:  reinforcement-learning
Rl trading
An environment to high-frequency trading agents under reinforcement learning
Stars: ✭ 205 (-4.65%)
Mutual labels:  reinforcement-learning
Alphazero gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (+1095.35%)
Mutual labels:  reinforcement-learning
Release
Deep Reinforcement Learning for de-novo Drug Design
Stars: ✭ 201 (-6.51%)
Mutual labels:  reinforcement-learning
Autodrome
Framework and OpenAI Gym Environment for Autonomous Vehicle Development
Stars: ✭ 214 (-0.47%)
Mutual labels:  reinforcement-learning
Rl Tutorial Jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Stars: ✭ 204 (-5.12%)
Mutual labels:  reinforcement-learning
Pytorch Reinforce
PyTorch Implementation of REINFORCE for both discrete & continuous control
Stars: ✭ 212 (-1.4%)
Mutual labels:  reinforcement-learning
Multihopkg
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-6.05%)
Mutual labels:  reinforcement-learning
Minerva
Meandering In Networks of Entities to Reach Verisimilar Answers
Stars: ✭ 205 (-4.65%)
Mutual labels:  reinforcement-learning
Gymfc
A universal flight control tuning framework
Stars: ✭ 210 (-2.33%)
Mutual labels:  reinforcement-learning
Papers we read
Summaries of the papers that are discussed by VLG.
Stars: ✭ 203 (-5.58%)
Mutual labels:  reinforcement-learning
Reco Papers
Classic papers and resources on recommendation
Stars: ✭ 2,804 (+1204.19%)
Mutual labels:  reinforcement-learning
Reinforcement Learning Stanford
🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉
Stars: ✭ 201 (-6.51%)
Mutual labels:  reinforcement-learning
Pytorch A2c Ppo Acktr Gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+1124.19%)
Mutual labels:  reinforcement-learning
Icnn
Input Convex Neural Networks
Stars: ✭ 214 (-0.47%)
Mutual labels:  reinforcement-learning
Pokerrl
Framework for Multi-Agent Deep Reinforcement Learning in Poker
Stars: ✭ 214 (-0.47%)
Mutual labels:  reinforcement-learning
Reinforcement Learning An Introduction Chinese
《Reinforcement Learning: An Introduction》(第二版)中文翻译
Stars: ✭ 210 (-2.33%)
Mutual labels:  reinforcement-learning

logo
cartpole
GoDoc Go Report Card

Overview

Gold is a reinforcement learning library for Go. It provides a set of agents that can be used to solve challenges in various environments. The library further contains a composable tooling for creating agents and visualizing their performance.

Quick Start

go run ./pkg/v1/agent/deepq/experiments/cartpole/main.go

Requirements

  • Go >= v13.0
  • Docker
  • A browser that isn't IE

Agents

All of the agent implementations can be found in pkg/v1/agent each agent has an experiments folder providing demos across various environments.

agent algorithm
pkg/v1/agent/q Q Learning
pkg/v1/agent/deepq Deep Q learning with Double Q
pkg/v1/agent/reinforce REINFORCE aka Monte Carlo Policy Gradients
pkg/v1/agent/nes Natural Evolution Strategies
pkg/v1/agent/her Hindsight Experience Replay
pkg/v1/agent/ppo Proximal Policy Optimization ⚠️ in progress

Docs

Each package contains a README explaining the usage, also see GoDoc.

Contributing

Please open an MR for any issues or feature requests.

The project is fairly simple to hack on, environments are provided through Sphere, models are provided through Goro.

Feel free to ping @pbarker on Gopher slack.

Roadmap

  • [ ] More agents, more environments; see Future Thoughts
  • [ ] Accelerated compute support
  • [ ] Tuning libraries
  • [ ] Deployments

Inspiration

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].