All Projects → muzero → Similar Projects or Alternatives

481 Open source projects that are alternatives of or similar to muzero

🐵 An AI chess-board-game framework(by many programming languages) implementations.

Stars: ✭ 40 (-68.25%)

Mutual labels: deep-reinforcement-learning, mcts

PC-DARTS (PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search, published in ICLR 2020) implemented in Tensorflow 2.0+. This is an unofficial implementation.

Stars: ✭ 25 (-80.16%)

Mutual labels: tf2, tensorflow2

TF2-GAN

🐳 GAN implemented as Tensorflow 2.X

Stars: ✭ 61 (-51.59%)

Mutual labels: tf2, tensorflow2

alpha-zero

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

Stars: ✭ 68 (-46.03%)

Mutual labels: mcts, alphazero

Awesome-Tensorflow2

基于Tensorflow2开发的优秀扩展包及项目

Stars: ✭ 45 (-64.29%)

Mutual labels: tf2, tensorflow2

AlphaZero Gobang

Deep Learning big homework of UCAS

Stars: ✭ 29 (-76.98%)

Mutual labels: mcts, alphazero

computer-go-dataset

datasets for computer go

Stars: ✭ 133 (+5.56%)

Mutual labels: alphazero, muzero

Alphazero gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Stars: ✭ 2,570 (+1939.68%)

Mutual labels: mcts, alphazero

deep reinforcement learning gallery

Deep reinforcement learning with tensorflow2

Stars: ✭ 35 (-72.22%)

Mutual labels: deep-reinforcement-learning, tensorflow2

Alpha Zero General

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Stars: ✭ 2,617 (+1976.98%)

Mutual labels: mcts, alphazero

tf-faster-rcnn

Tensorflow 2 Faster-RCNN implementation from scratch supporting to the batch processing with MobileNetV2 and VGG16 backbones

Stars: ✭ 88 (-30.16%)

Mutual labels: tf2, tensorflow2

FinRL

FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥

Stars: ✭ 3,497 (+2675.4%)

Mutual labels: deep-reinforcement-learning, tensorflow2

spectral normalization-tf2

🌈 Spectral Normalization implemented as Tensorflow 2

Stars: ✭ 36 (-71.43%)

Mutual labels: tf2, tensorflow2

keras efficientnet v2

self defined efficientnetV2 according to official version. Including converted ImageNet/21K/21k-ft1k weights.

Stars: ✭ 56 (-55.56%)

Mutual labels: tf2, tensorflow2

Finrl Library

FinRL: Financial Reinforcement Learning Framework. Please star. 🔥

Stars: ✭ 3,037 (+2310.32%)

Mutual labels: deep-reinforcement-learning, tensorflow2

alpha sigma

A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.

Stars: ✭ 134 (+6.35%)

Mutual labels: deep-reinforcement-learning, alphazero

CRNN.tf2

Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2

Stars: ✭ 131 (+3.97%)

Mutual labels: tf2, tensorflow2

transformer-tensorflow2.0

transformer in tensorflow 2.0

Stars: ✭ 53 (-57.94%)

Mutual labels: tf2, tensorflow2

Deep RL with pytorch

A pytorch tutorial for DRL(Deep Reinforcement Learning)

Stars: ✭ 160 (+26.98%)

Mutual labels: deep-reinforcement-learning, mcts

manning tf2 in action

The official code repository for "TensorFlow in Action" by Manning.

Stars: ✭ 61 (-51.59%)

Mutual labels: tf2, tensorflow2

GLOM-TensorFlow

An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data

Stars: ✭ 32 (-74.6%)

Mutual labels: tensorflow2

minerva

An out-of-the-box GUI tool for offline deep reinforcement learning

Stars: ✭ 80 (-36.51%)

Mutual labels: deep-reinforcement-learning

deep-rts

A Real-Time-Strategy game for Deep Learning research

Stars: ✭ 152 (+20.63%)

Mutual labels: deep-reinforcement-learning

TF RL

Eagerly Experimentable!!!

Stars: ✭ 22 (-82.54%)

Mutual labels: deep-reinforcement-learning

alphaFive

alphaGo版本的五子棋(gobang, gomoku)

Stars: ✭ 51 (-59.52%)

Mutual labels: alphazero

使用深度强化学习解决视觉跟踪和视觉导航问题

Stars: ✭ 16 (-87.3%)

Mutual labels: deep-reinforcement-learning

mmn

Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks

Stars: ✭ 39 (-69.05%)

Mutual labels: deep-reinforcement-learning

FinRL Podracer

Cloud-native Financial Reinforcement Learning

Stars: ✭ 179 (+42.06%)

Mutual labels: deep-reinforcement-learning

chi

A high-level framework for advanced deep learning with TensorFlow

Stars: ✭ 55 (-56.35%)

Mutual labels: deep-reinforcement-learning

code summarization public

source code for 'Improving automatic source code summarization via deep reinforcement learning'

Stars: ✭ 71 (-43.65%)

Mutual labels: deep-reinforcement-learning

tensorflow-ml-nlp-tf2

텐서플로2와 머신러닝으로 시작하는 자연어처리 (로지스틱회귀부터 BERT와 GPT3까지) 실습자료

Stars: ✭ 245 (+94.44%)

Mutual labels: tf2

minGPT-TF

A minimal TF2 re-implementation of the OpenAI GPT training

Stars: ✭ 36 (-71.43%)

Mutual labels: tf2

DolboNet

Русскоязычный чат-бот для Discord на архитектуре Transformer

Stars: ✭ 53 (-57.94%)

Mutual labels: tensorflow2

VSH-Rewrite

Popular Versus Saxton Hale gamemode remade from scratch

Stars: ✭ 30 (-76.19%)

Mutual labels: tf2

steam community market

Get item prices and volumes from the Steam Community Market using Python 3

Stars: ✭ 24 (-80.95%)

Mutual labels: tf2

tf retrieval baseline

A Tensorflow retrieval (space embedding) baseline. Metric learning baseline on CUB and Stanford Online Products.

Stars: ✭ 39 (-69.05%)

Mutual labels: tensorflow2

decentralized-rl

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

Stars: ✭ 40 (-68.25%)

Mutual labels: deep-reinforcement-learning

FreakFortressBat

No longer supported.

Stars: ✭ 32 (-74.6%)

Mutual labels: tf2

datascienv

datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries

Stars: ✭ 53 (-57.94%)

Mutual labels: tensorflow2

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Stars: ✭ 71 (-43.65%)

Mutual labels: tensorflow2

pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022

Stars: ✭ 162 (+28.57%)

Mutual labels: deep-reinforcement-learning

pku-Artificial-intelligence-practice-homework

2019北京大学软件与微电子学院曹健老师的《人工智能实践》作业，有完整的注释,欢迎提出issue以及request

Stars: ✭ 45 (-64.29%)

Mutual labels: tensorflow2

Meta-Learning-for-StarCraft-II-Minigames

We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.

Stars: ✭ 26 (-79.37%)

Mutual labels: deep-reinforcement-learning

fortress-royale

Team Fortress 2 battle royale gamemode

Stars: ✭ 48 (-61.9%)

Mutual labels: tf2

CrowdNav DSRNN

[ICRA 2021] Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning

Stars: ✭ 43 (-65.87%)

Mutual labels: deep-reinforcement-learning

mae-scalable-vision-learners

A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners

Stars: ✭ 54 (-57.14%)

Mutual labels: tensorflow2

face-mask-detection-tf2

A face mask detection using ssd with simplified Mobilenet and RFB or Pelee in Tensorflow 2.1. Training on your own dataset. Can be converted to kmodel and run on the edge device of k210

Stars: ✭ 72 (-42.86%)

Mutual labels: tensorflow2

Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Ensemble-Strategy-ICAIF-2020

Live Trading. Please star.

Stars: ✭ 1,251 (+892.86%)

Mutual labels: deep-reinforcement-learning

CallAdmin

CallAdmin is a multilingual sourcemod plugin which provides in-game report functionality

Stars: ✭ 52 (-58.73%)

Mutual labels: tf2

imitation learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Stars: ✭ 93 (-26.19%)

Mutual labels: deep-reinforcement-learning

Master-Thesis

Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex

Stars: ✭ 33 (-73.81%)

Mutual labels: deep-reinforcement-learning

tf-image

TensorFlow2+ graph image augmentation library optimized for tf.data.Dataset.

Stars: ✭ 24 (-80.95%)

Mutual labels: tensorflow2

Carla-ppo

This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.

Stars: ✭ 122 (-3.17%)

Mutual labels: deep-reinforcement-learning

drift drl

High-speed Autonomous Drifting with Deep Reinforcement Learning

Stars: ✭ 82 (-34.92%)

Mutual labels: deep-reinforcement-learning

motion-planner-reinforcement-learning

End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo

Stars: ✭ 99 (-21.43%)

Mutual labels: deep-reinforcement-learning

jupiter

A Monte-Carlo based AI to beat 2048