
tkn-tub / Ns3 Gym

License: GPL-2.0
ns3-gym - The Playground for Reinforcement Learning in Networking Research

Projects that are alternatives to or similar to ns3-gym

Gymfc
A universal flight control tuning framework
Stars: ✭ 210 (-4.98%)
Mutual labels:  reinforcement-learning, openai-gym
Hierarchical Actor Critic Hac Pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
Stars: ✭ 116 (-47.51%)
Mutual labels:  reinforcement-learning, openai-gym
Cartpole
OpenAI's cartpole env solver.
Stars: ✭ 107 (-51.58%)
Mutual labels:  reinforcement-learning, openai-gym
Gym Electric Motor
Gym Electric Motor (GEM): An OpenAI Gym Environment for Electric Motors
Stars: ✭ 95 (-57.01%)
Mutual labels:  reinforcement-learning, openai-gym
Holdem
🃏 OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning
Stars: ✭ 135 (-38.91%)
Mutual labels:  reinforcement-learning, openai-gym
Openaigym
Solving OpenAI Gym problems.
Stars: ✭ 98 (-55.66%)
Mutual labels:  reinforcement-learning, openai-gym
Stable Baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-47.96%)
Mutual labels:  reinforcement-learning, openai-gym
Gym Minigrid
Minimalistic gridworld package for OpenAI Gym
Stars: ✭ 1,047 (+373.76%)
Mutual labels:  reinforcement-learning, openai-gym
Ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Stars: ✭ 133 (-39.82%)
Mutual labels:  reinforcement-learning, openai-gym
Reinforcement learning
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
Stars: ✭ 132 (-40.27%)
Mutual labels:  reinforcement-learning, openai-gym
Cs234 Reinforcement Learning Winter 2019
My Solutions of Assignments of CS234: Reinforcement Learning Winter 2019
Stars: ✭ 93 (-57.92%)
Mutual labels:  reinforcement-learning, openai-gym
Coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Stars: ✭ 2,085 (+843.44%)
Mutual labels:  reinforcement-learning, openai-gym
Treeqn
Stars: ✭ 77 (-65.16%)
Mutual labels:  reinforcement-learning, openai-gym
Gym Ignition
Framework for developing OpenAI Gym robotics environments simulated with Ignition Gazebo
Stars: ✭ 97 (-56.11%)
Mutual labels:  reinforcement-learning, openai-gym
Dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Stars: ✭ 75 (-66.06%)
Mutual labels:  reinforcement-learning, openai-gym
Ctc Executioner
Master Thesis: Limit order placement with Reinforcement Learning
Stars: ✭ 112 (-49.32%)
Mutual labels:  reinforcement-learning, openai-gym
Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (-80.09%)
Mutual labels:  reinforcement-learning, openai-gym
Async Deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
Stars: ✭ 44 (-80.09%)
Mutual labels:  reinforcement-learning, openai-gym
Reinforcementlearning Atarigame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. Also uses Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm, which is more efficient than DQN and supersedes it. Can play many games.
Stars: ✭ 118 (-46.61%)
Mutual labels:  reinforcement-learning, openai-gym
Gym Fx
Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (-31.67%)
Mutual labels:  reinforcement-learning, openai-gym

ns3-gym

OpenAI Gym is a toolkit for reinforcement learning (RL) that is widely used in research. The network simulator ns-3 is the de facto standard for academic and industry studies of networking protocols and communication technologies. ns3-gym is a framework that integrates OpenAI Gym and ns-3 to encourage the use of RL in networking research.

Installation

  1. Install all dependencies required by ns-3:
# minimal requirements for C++:
apt-get install gcc g++ python

See https://www.nsnam.org/wiki/Installation for details.
  2. Install the ZMQ and Protocol Buffers libraries:
# to install protobuf-3.6 on ubuntu 16.04:
sudo add-apt-repository ppa:maarten-fonville/protobuf
sudo apt-get update

apt-get install libzmq5 libzmq5-dev
apt-get install libprotobuf-dev
apt-get install protobuf-compiler
  3. Configure and build the ns-3 project (if you are going to use a Python virtual environment, please execute these commands inside it):
# Opengym Protocol Buffer messages (C++ and Python) are built during configure
./waf configure
./waf build
  4. Install the ns3gym package located in src/opengym/model/ns3gym (Python 3 required):
pip3 install ./src/opengym/model/ns3gym
  5. (Optional) Install all libraries required by your agent (e.g. TensorFlow, Keras).

  6. Run the example:

cd ./scratch/opengym
./simple_test.py
  7. (Optional) Start the ns-3 simulation script and the Gym agent separately in two terminals (useful for debugging). Passing --start=0 tells the Python script not to launch the ns-3 process itself:
# Terminal 1
./waf --run "opengym"

# Terminal 2
cd ./scratch/opengym
./test.py --start=0
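
When the two processes are started separately, the Python side attaches to the already-running simulation instead of launching it. With the Ns3Env class used by the example scripts this corresponds to something like the following sketch (the constructor arguments mirror those used by test.py; the default ZMQ port 5555 is an assumption):

from ns3gym import ns3env

# attach to an ns-3 simulation started manually in another terminal
env = ns3env.Ns3Env(port=5555, startSim=False)
obs = env.reset()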

Examples

All examples can be found here.

Basic Interface

  1. Example Python script. Note that gym.make('ns3-v0') starts the ns-3 simulation script located in the current working directory. A runnable random-agent variant is sketched after this list.
import gym
import ns3gym  # registers the 'ns3-v0' environment
import MyAgent  # placeholder for your own agent module

env = gym.make('ns3-v0')  # starts the ns-3 script in the current directory
obs = env.reset()
agent = MyAgent.Agent()

while True:
  action = agent.get_action(obs)
  obs, reward, done, info = env.step(action)

  if done:
    break
env.close()
  2. Any ns-3 simulation script can be used as a Gym environment. This requires only instantiating OpenGymInterface and implementing the ns3-gym C++ interface, which consists of the following functions:
Ptr<OpenGymSpace> GetObservationSpace();
Ptr<OpenGymSpace> GetActionSpace();
Ptr<OpenGymDataContainer> GetObservation();
float GetReward();
bool GetGameOver();
std::string GetExtraInfo();
bool ExecuteActions(Ptr<OpenGymDataContainer> action);

Note that the generic ns3-gym interface allows you to observe any variable or parameter in a simulation.
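
If you do not have an agent implementation yet, the same loop can be driven with random actions sampled from the environment's action space. The following is a minimal sketch using only the standard Gym API (the episode count is arbitrary):

import gym
import ns3gym  # registers the 'ns3-v0' environment

env = gym.make('ns3-v0')

for episode in range(3):
  obs = env.reset()
  done = False
  total_reward = 0
  while not done:
    action = env.action_space.sample()  # uniformly random action
    obs, reward, done, info = env.step(action)
    total_reward += reward
  print("episode {}: total reward = {}".format(episode, total_reward))
env.close()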

A more detailed description can be found in our paper.

Cognitive Radio

We consider the problem of radio channel selection in a wireless multi-channel environment, e.g. 802.11 networks with external interference. The objective of the agent is to select, for the next time slot, a channel that is free of interference. We consider a simple illustrative example where the external interference follows a periodic pattern, i.e. it sweeps over channels one to four in order, as shown in the table.

[Figure: periodic interference pattern sweeping over channels one to four]

We created such a scenario in ns-3 using existing functionality: the interference is created with the WaveformGenerator class and the sensing is performed with the SpectrumAnalyzer class.

Such a periodic interferer can easily be learned by an RL agent: based on the current observation of the occupation of each channel in a given time slot, the agent can determine the correct channel for the next time slot and thereby avoid any collision with the interferer.

Our proposed RL mapping (illustrated by the sketch after this list) is:

  • observation - occupation of each channel in the current time slot, i.e. wideband sensing,
  • actions - set the channel to be used for the next time slot,
  • reward - +1 in case of no collision with the interferer; otherwise -1,
  • gameover - more than three collisions happened during the last ten time slots
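
To illustrate the mapping, here is a minimal rule-based agent that exploits the periodicity of the interferer. It assumes the observation is a vector with one occupancy flag per channel (non-zero = interfered) and that the action is the index of the channel to use in the next time slot; these encoding details are assumptions, see the full example for the actual layout.

import gym
import ns3gym  # registers the 'ns3-v0' environment

env = gym.make('ns3-v0')
obs = env.reset()

while True:
  # channels currently occupied by the interferer (assumed encoding)
  busy = [i for i, o in enumerate(obs) if o > 0]
  if busy:
    # the interferer sweeps the channels in order, so in the next slot
    # it will occupy the channel after the currently busy one
    next_busy = (busy[0] + 1) % len(obs)
    action = (next_busy + 1) % len(obs)  # any channel except next_busy
  else:
    action = 0
  obs, reward, done, info = env.step(action)
  if done:
    break
env.close()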

The figure below shows the learning performance when using a simple neural network with a fully connected input layer and an output layer. We see that after around 80 episodes the agent is able to perfectly predict the next channel state from the current observation and hence avoids any collision with the interference.

The full source code of the example can be found here.

[Figure: learning performance of the agent over training episodes]
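
For orientation, a network of the shape described above could be defined as in the sketch below (a minimal Keras definition; the layer sizes, activations and the use of four channels are assumptions based on the scenario, not the exact architecture of the released example):

import tensorflow as tf

NUM_CHANNELS = 4  # assumption: one input and one output per channel

model = tf.keras.Sequential([
  # fully connected input layer over the per-channel occupancy vector
  tf.keras.layers.Dense(NUM_CHANNELS, activation='relu',
                        input_shape=(NUM_CHANNELS,)),
  # output layer: one score per selectable channel
  tf.keras.layers.Dense(NUM_CHANNELS, activation='softmax'),
])
model.compile(optimizer='adam', loss='categorical_crossentropy')
model.summary()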

Note that in a more realistic scenario the simple waveform generator of this example can be replaced by a real wireless technology like unlicensed LTE (LTE-U).

RL-TCP

The proper RL-TCP agent example is still under development. However, we have already implemented and released two versions (time-based and event-based) of an interface that allows you to monitor the parameters of a TCP instance and control its congestion window and slow start threshold -- see details here. Note that both versions inherit from TcpCongestionOps and hence can be used as an argument for ns3::TcpL4Protocol::SocketType.

Moreover, using the event-based interface, we already provide an example Python Gym agent that implements TCP NewReno and communicates with the ns-3 simulation process using ns3gym -- see here. The example can be used as a starting point to implement RL-based TCP congestion control algorithms.

In order to run it, please execute:

cd ./scratch/rl-tcp
./test_tcp.py 

Or in two terminals:

# Terminal 1:
./waf --run "rl-tcp --transport_prot=TcpRl"

# Terminal 2:
cd ./scratch/rl-tcp/
./test_tcp.py --start=0

Note that our Python TCP NewReno implementation achieves the same number of transmitted packets as the one implemented in ns-3 (see the output of the ns-3 simulation, i.e. RxPkts: 5367 in both cases). Please execute the following command to cross-check:

./waf --run "rl-tcp --transport_prot=TcpNewReno"
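
For orientation, the core of such a NewReno-style agent boils down to a decision function like the sketch below. The observation indices and the [new_ssThresh, new_cWnd] action layout are assumptions about the event-based interface; consult the released agent code for the actual layout.

def newreno_action(obs):
  # assumed observation layout of the event-based interface:
  ssThresh = obs[4]
  cWnd = obs[5]
  segmentSize = obs[6]
  segmentsAcked = obs[7]

  if cWnd < ssThresh:
    # slow start: grow cwnd by one segment per acknowledged segment
    new_cWnd = cWnd + segmentSize if segmentsAcked > 0 else cWnd
  else:
    # congestion avoidance: grow cwnd by about one segment per RTT
    new_cWnd = cWnd + max(1, int(segmentSize * segmentSize / cWnd))

  # keep ssThresh unchanged outside of loss events
  return [ssThresh, new_cWnd]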

Contact

How to reference ns3-gym?

Please use the following BibTeX entry:

@inproceedings{ns3gym,
  Title = {{ns-3 meets OpenAI Gym: The Playground for Machine Learning in Networking Research}},
  Author = {Gaw{\l}owicz, Piotr and Zubow, Anatolij},
  Booktitle = {{ACM International Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems (MSWiM)}},
  Year = {2019},
  Location = {Miami Beach, USA},
  Month = {November},
  Url = {http://www.tkn.tu-berlin.de/fileadmin/fg112/Papers/2019/gawlowicz19_mswim.pdf}
}