All Projects → Coac → CommNet-BiCnet

Coac / CommNet-BiCnet

Licence: other
CommNet and BiCnet implementation in tensorflow

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to CommNet-BiCnet

malib deprecated
A Multi-agent Learning Framework
Stars: ✭ 63 (+26%)
Mutual labels:  multi-agent-reinforcement-learning
Fruit-API
A Universal Deep Reinforcement Learning Framework
Stars: ✭ 61 (+22%)
Mutual labels:  multi-agent-reinforcement-learning
TiKick
Learning-based agent for Google Research Football
Stars: ✭ 60 (+20%)
Mutual labels:  multi-agent-reinforcement-learning
robotic-warehouse
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
Stars: ✭ 62 (+24%)
Mutual labels:  multi-agent-reinforcement-learning
SMAC
StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX
Stars: ✭ 40 (-20%)
Mutual labels:  multi-agent-reinforcement-learning
Mava
A library of multi-agent reinforcement learning components and systems
Stars: ✭ 355 (+610%)
Mutual labels:  multi-agent-reinforcement-learning
CoDAIL
Implementation of CoDAIL in the ICLR 2021 paper <Multi-Agent Interactions Modeling with Correlated Policies>
Stars: ✭ 17 (-66%)
Mutual labels:  multi-agent-reinforcement-learning
gym-battlesnake
Multi-agent reinforcement learning environment
Stars: ✭ 29 (-42%)
Mutual labels:  multi-agent-reinforcement-learning

CommNet-BiCnet

CommNet and BiCnet implementation in tensorflow

Training

Train CommNet using DDPG algorithm

python train_comm_net.py

Hypersearch

To find the optimal hyperparameters such as actor_lr or critic_lr, a simple grid search has been implemented. It launches multiple instances of the trainer in parallel based on the number of CPU cores.

python hypersearch.py

Guessing sum environment

It is a simple game described in the BiCnet paper for testing if the communication works. The environment implements the crucial methods of the core gym interface from OpenAI

Each agent receives a scalar sampled between [−10, 10] under a truncated Gaussian. Each agent needs to output the sum of all inputs received among the agents. An agent gets a normalized reward between [0, 1] based on the absolute difference between the sum and its output.

Results

Training CommNet in the Guessing sum env with 2 agents

2_agents_commnet_training_reward

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].