Zeta36 / muzero Licence: GPL-3.0 license
A simple implementation of MuZero algorithm for connect4 game
Programming Languages python 139335 projects - #7 most used programming language
Projects that are alternatives of or similar to muzero Pathnet Tensorflow Implementation of PathNet: Evolution Channels Gradient Descent in Super Neural Networks
Stars : ✭ 96 (+10.34%)
Mutual labels: deepmind
Pytorch-RL-CPP A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
Stars : ✭ 73 (-16.09%)
Mutual labels: deepmind
Startcraft pysc2 minigames Startcraft II Machine Learning research with DeepMind pysc2 python library .mini-games and agents.
Stars : ✭ 113 (+29.89%)
Mutual labels: deepmind
Learning To Communicate Pytorch Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Stars : ✭ 236 (+171.26%)
Mutual labels: deepmind
Nfnets pytorch Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".
Stars : ✭ 85 (-2.3%)
Mutual labels: deepmind
pysc2 StarCraft II Learning Environment
Stars : ✭ 7,529 (+8554.02%)
Mutual labels: deepmind
Pbt Population Based Training (in PyTorch with sqlite3). Status: Unsupported
Stars : ✭ 138 (+58.62%)
Mutual labels: deepmind
jax-rl JAX implementations of core Deep RL algorithms
Stars : ✭ 61 (-29.89%)
Mutual labels: deepmind
Torch Gqn PyTorch Implementation of Generative Query Network
Stars : ✭ 118 (+35.63%)
Mutual labels: deepmind
Relational Rnn Pytorch An implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.
Stars : ✭ 236 (+171.26%)
Mutual labels: deepmind
Nalu Basic pytorch implementation of NAC/NALU from Neural Arithmetic Logic Units paper by trask et.al
Stars : ✭ 110 (+26.44%)
Mutual labels: deepmind
wavenet-classifier Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars : ✭ 54 (-37.93%)
Mutual labels: deepmind
Allie Allie: A UCI compliant chess engine
Stars : ✭ 89 (+2.3%)
Mutual labels: deepmind
Nfnets Pytorch NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch
Stars : ✭ 215 (+147.13%)
Mutual labels: deepmind
muzero A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Stars : ✭ 126 (+44.83%)
Mutual labels: muzero
ActiveRagdollControllers Research into controllers for 2d and 3d Active Ragdolls (using MujocoUnity+ml_agents)
Stars : ✭ 30 (-65.52%)
Mutual labels: deepmind
MuZero
A simple implementation of MuZero algorithm for Connect4 game (following the pseudocode offered by DeepMind in their paper)
https://arxiv.org/abs/1911.08265
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at
[email protected] .