Coac / CommNet-BiCnet

Licence: other

CommNet and BiCnet implementation in tensorflow

Programming Languages

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to CommNet-BiCnet

malib deprecated

A Multi-agent Learning Framework

Stars: ✭ 63 (+26%)

Mutual labels: multi-agent-reinforcement-learning

Fruit-API

A Universal Deep Reinforcement Learning Framework

Stars: ✭ 61 (+22%)

Mutual labels: multi-agent-reinforcement-learning

TiKick

Learning-based agent for Google Research Football

Stars: ✭ 60 (+20%)

Mutual labels: multi-agent-reinforcement-learning

robotic-warehouse

Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment

Stars: ✭ 62 (+24%)

Mutual labels: multi-agent-reinforcement-learning

SMAC

StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX

Stars: ✭ 40 (-20%)

Mutual labels: multi-agent-reinforcement-learning

Mava

A library of multi-agent reinforcement learning components and systems

Stars: ✭ 355 (+610%)

Mutual labels: multi-agent-reinforcement-learning

CoDAIL

Implementation of CoDAIL in the ICLR 2021 paper <Multi-Agent Interactions Modeling with Correlated Policies>

Stars: ✭ 17 (-66%)

Mutual labels: multi-agent-reinforcement-learning

gym-battlesnake

Multi-agent reinforcement learning environment

Stars: ✭ 29 (-42%)

Mutual labels: multi-agent-reinforcement-learning

CommNet-BiCnet

CommNet and BiCnet implementation in tensorflow

Training

Train CommNet using DDPG algorithm

python train_comm_net.py

Hypersearch

To find the optimal hyperparameters such as actor_lr or critic_lr, a simple grid search has been implemented. It launches multiple instances of the trainer in parallel based on the number of CPU cores.

python hypersearch.py

Guessing sum environment

It is a simple game described in the BiCnet paper for testing if the communication works. The environment implements the crucial methods of the core gym interface from OpenAI

Each agent receives a scalar sampled between [−10, 10] under a truncated Gaussian. Each agent needs to output the sum of all inputs received among the agents. An agent gets a normalized reward between [0, 1] based on the absolute difference between the sum and its output.

Results

Training CommNet in the Guessing sum env with 2 agents

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Coac / CommNet-BiCnet

Programming Languages

Labels

Projects that are alternatives of or similar to CommNet-BiCnet

CommNet-BiCnet

Training

Hypersearch

Guessing sum environment

Results

Training CommNet in the Guessing sum env with 2 agents