openai / Robosumo

Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"


Status: Archive (code is provided as-is, no updates expected)

RoboSumo

This repository contains a set of competitive multi-agent environments used in the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments".

Installation

RoboSumo depends on numpy, gym, and mujoco_py>=1.5 (if you haven't used MuJoCo before, please refer to the installation guide). Running demos with pre-trained policies additionally requires tensorflow>=1.1.0 and click.

The requirements can be installed via pip as follows:

$ pip install -r requirements.txt

To install RoboSumo, clone the repository and run pip install:

$ git clone https://github.com/openai/robosumo
$ cd robosumo
$ pip install -e .
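
After installing, it can be useful to confirm that the dependencies listed above can be located by the Python interpreter. The following is a minimal sketch, not part of the repository: the helper name `find_missing` and the two lists are hypothetical, and the check only looks for the modules by import name without verifying the version constraints (`mujoco_py>=1.5`, `tensorflow>=1.1.0`).

```python
import importlib.util

# Import names taken from the dependency list above.
# Splitting them into "core" and "demo" groups is an illustrative choice.
CORE_DEPS = ["numpy", "gym", "mujoco_py"]
DEMO_DEPS = ["tensorflow", "click"]


def find_missing(modules):
    """Return the names in `modules` that cannot be located for import.

    Only top-level module names are supported; versions are not checked.
    """
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    for label, deps in (("core", CORE_DEPS), ("demo", DEMO_DEPS)):
        missing = find_missing(deps)
        if missing:
            print(f"Missing {label} dependencies: {', '.join(missing)}")
        else:
            print(f"All {label} dependencies found.")
```

A module that is reported as found may still fail at import time (for example, `mujoco_py` without a working MuJoCo installation), so this is only a first-pass check.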

Demos

You can run demos of the environments using the demos/play.py script:

$ python demos/play.py

The script allows you to select different opponents as well as different policy architectures and versions for the agents. For details, please refer to the help:

$ python demos/play.py --help

Usage: play.py [OPTIONS]

Options:
  --env TEXT                    Name of the environment.  [default: RoboSumo-Ant-vs-Ant-v0]
  --policy-names [mlp|lstm]...  Policy names.  [default: mlp, mlp]
  --param-versions INTEGER...   Policy parameter versions.  [default: 1, 1]
  --max_episodes INTEGER        Number of episodes.  [default: 20]
  --help                        Show this message and exit.
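
The options above can also be assembled programmatically, for example when launching several demo configurations in a row. The sketch below is an illustration rather than part of the repository: `build_play_command` is a hypothetical helper, and it assumes that the multi-value options (`--policy-names`, `--param-versions`) take their values as space-separated arguments after the flag, as the `...` in the help text suggests.

```python
import sys

# Policy choices as shown in the help text above.
POLICY_CHOICES = ("mlp", "lstm")


def build_play_command(env="RoboSumo-Ant-vs-Ant-v0",
                       policy_names=("mlp", "mlp"),
                       param_versions=(1, 1),
                       max_episodes=20):
    """Build an argument list for demos/play.py (hypothetical helper).

    Defaults mirror the defaults printed by `--help`.
    """
    for name in policy_names:
        if name not in POLICY_CHOICES:
            raise ValueError(f"unknown policy name: {name!r}")
    return [
        sys.executable, "demos/play.py",
        "--env", env,
        # Assumption: multi-value options are passed as separate arguments.
        "--policy-names", *policy_names,
        "--param-versions", *(str(v) for v in param_versions),
        "--max_episodes", str(max_episodes),
    ]


if __name__ == "__main__":
    cmd = build_play_command(policy_names=("mlp", "lstm"), max_episodes=5)
    print(" ".join(cmd))
    # To actually launch the demo from the repository root:
    #   import subprocess
    #   subprocess.run(cmd, check=True)
```

Returning a list instead of a single string avoids shell quoting issues when the command is passed to `subprocess.run`.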