All Projects → muzero → Similar Projects or Alternatives

481 Open source projects that are alternatives of or similar to muzero

godpaper
🐵 An AI chess-board-game framework(by many programming languages) implementations.
Stars: ✭ 40 (-68.25%)
pcdarts-tf2
PC-DARTS (PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search, published in ICLR 2020) implemented in Tensorflow 2.0+. This is an unofficial implementation.
Stars: ✭ 25 (-80.16%)
Mutual labels:  tf2, tensorflow2
TF2-GAN
🐳 GAN implemented as Tensorflow 2.X
Stars: ✭ 61 (-51.59%)
Mutual labels:  tf2, tensorflow2
alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Stars: ✭ 68 (-46.03%)
Mutual labels:  mcts, alphazero
Awesome-Tensorflow2
基于Tensorflow2开发的优秀扩展包及项目
Stars: ✭ 45 (-64.29%)
Mutual labels:  tf2, tensorflow2
AlphaZero Gobang
Deep Learning big homework of UCAS
Stars: ✭ 29 (-76.98%)
Mutual labels:  mcts, alphazero
computer-go-dataset
datasets for computer go
Stars: ✭ 133 (+5.56%)
Mutual labels:  alphazero, muzero
Alphazero gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (+1939.68%)
Mutual labels:  mcts, alphazero
deep reinforcement learning gallery
Deep reinforcement learning with tensorflow2
Stars: ✭ 35 (-72.22%)
Alpha Zero General
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+1976.98%)
Mutual labels:  mcts, alphazero
tf-faster-rcnn
Tensorflow 2 Faster-RCNN implementation from scratch supporting to the batch processing with MobileNetV2 and VGG16 backbones
Stars: ✭ 88 (-30.16%)
Mutual labels:  tf2, tensorflow2
FinRL
FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥
Stars: ✭ 3,497 (+2675.4%)
spectral normalization-tf2
🌈 Spectral Normalization implemented as Tensorflow 2
Stars: ✭ 36 (-71.43%)
Mutual labels:  tf2, tensorflow2
keras efficientnet v2
self defined efficientnetV2 according to official version. Including converted ImageNet/21K/21k-ft1k weights.
Stars: ✭ 56 (-55.56%)
Mutual labels:  tf2, tensorflow2
Finrl Library
FinRL: Financial Reinforcement Learning Framework. Please star. 🔥
Stars: ✭ 3,037 (+2310.32%)
alpha sigma
A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
Stars: ✭ 134 (+6.35%)
CRNN.tf2
Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
Stars: ✭ 131 (+3.97%)
Mutual labels:  tf2, tensorflow2
transformer-tensorflow2.0
transformer in tensorflow 2.0
Stars: ✭ 53 (-57.94%)
Mutual labels:  tf2, tensorflow2
Deep RL with pytorch
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Stars: ✭ 160 (+26.98%)
manning tf2 in action
The official code repository for "TensorFlow in Action" by Manning.
Stars: ✭ 61 (-51.59%)
Mutual labels:  tf2, tensorflow2
GLOM-TensorFlow
An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data
Stars: ✭ 32 (-74.6%)
Mutual labels:  tensorflow2
minerva
An out-of-the-box GUI tool for offline deep reinforcement learning
Stars: ✭ 80 (-36.51%)
deep-rts
A Real-Time-Strategy game for Deep Learning research
Stars: ✭ 152 (+20.63%)
TF RL
Eagerly Experimentable!!!
Stars: ✭ 22 (-82.54%)
alphaFive
alphaGo版本的五子棋(gobang, gomoku)
Stars: ✭ 51 (-59.52%)
Mutual labels:  alphazero
AI
使用深度强化学习解决视觉跟踪和视觉导航问题
Stars: ✭ 16 (-87.3%)
mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
Stars: ✭ 39 (-69.05%)
FinRL Podracer
Cloud-native Financial Reinforcement Learning
Stars: ✭ 179 (+42.06%)
chi
A high-level framework for advanced deep learning with TensorFlow
Stars: ✭ 55 (-56.35%)
code summarization public
source code for 'Improving automatic source code summarization via deep reinforcement learning'
Stars: ✭ 71 (-43.65%)
tensorflow-ml-nlp-tf2
텐서플로2와 머신러닝으로 시작하는 자연어처리 (로지스틱회귀부터 BERT와 GPT3까지) 실습자료
Stars: ✭ 245 (+94.44%)
Mutual labels:  tf2
minGPT-TF
A minimal TF2 re-implementation of the OpenAI GPT training
Stars: ✭ 36 (-71.43%)
Mutual labels:  tf2
DolboNet
Русскоязычный чат-бот для Discord на архитектуре Transformer
Stars: ✭ 53 (-57.94%)
Mutual labels:  tensorflow2
VSH-Rewrite
Popular Versus Saxton Hale gamemode remade from scratch
Stars: ✭ 30 (-76.19%)
Mutual labels:  tf2
steam community market
Get item prices and volumes from the Steam Community Market using Python 3
Stars: ✭ 24 (-80.95%)
Mutual labels:  tf2
tf retrieval baseline
A Tensorflow retrieval (space embedding) baseline. Metric learning baseline on CUB and Stanford Online Products.
Stars: ✭ 39 (-69.05%)
Mutual labels:  tensorflow2
decentralized-rl
Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)
Stars: ✭ 40 (-68.25%)
FreakFortressBat
No longer supported.
Stars: ✭ 32 (-74.6%)
Mutual labels:  tf2
datascienv
datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (-57.94%)
Mutual labels:  tensorflow2
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-43.65%)
Mutual labels:  tensorflow2
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
Stars: ✭ 162 (+28.57%)
pku-Artificial-intelligence-practice-homework
2019北京大学软件与微电子学院曹健老师的《人工智能实践》作业,有完整的注释,欢迎提出issue以及request
Stars: ✭ 45 (-64.29%)
Mutual labels:  tensorflow2
Meta-Learning-for-StarCraft-II-Minigames
We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.
Stars: ✭ 26 (-79.37%)
fortress-royale
Team Fortress 2 battle royale gamemode
Stars: ✭ 48 (-61.9%)
Mutual labels:  tf2
CrowdNav DSRNN
[ICRA 2021] Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning
Stars: ✭ 43 (-65.87%)
mae-scalable-vision-learners
A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners
Stars: ✭ 54 (-57.14%)
Mutual labels:  tensorflow2
face-mask-detection-tf2
A face mask detection using ssd with simplified Mobilenet and RFB or Pelee in Tensorflow 2.1. Training on your own dataset. Can be converted to kmodel and run on the edge device of k210
Stars: ✭ 72 (-42.86%)
Mutual labels:  tensorflow2
CallAdmin
CallAdmin is a multilingual sourcemod plugin which provides in-game report functionality
Stars: ✭ 52 (-58.73%)
Mutual labels:  tf2
imitation learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Stars: ✭ 93 (-26.19%)
Master-Thesis
Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
Stars: ✭ 33 (-73.81%)
tf-image
TensorFlow2+ graph image augmentation library optimized for tf.data.Dataset.
Stars: ✭ 24 (-80.95%)
Mutual labels:  tensorflow2
Carla-ppo
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Stars: ✭ 122 (-3.17%)
drift drl
High-speed Autonomous Drifting with Deep Reinforcement Learning
Stars: ✭ 82 (-34.92%)
motion-planner-reinforcement-learning
End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo
Stars: ✭ 99 (-21.43%)
jupiter
A Monte-Carlo based AI to beat 2048
Stars: ✭ 47 (-62.7%)
Mutual labels:  mcts
tensorflow-tabnet
Improved TabNet for TensorFlow
Stars: ✭ 49 (-61.11%)
Mutual labels:  tensorflow2
pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
Stars: ✭ 79 (-37.3%)
MP-DQN
Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"
Stars: ✭ 99 (-21.43%)
A-Barebones-Image-Retrieval-System
This project presents a simple framework to retrieve images similar to a query image.
Stars: ✭ 25 (-80.16%)
Mutual labels:  tensorflow2
1-60 of 481 similar projects