Repo Info
Spin up a multi-node Vagrant environment for learning and testing monitoring tools in a microservices world. All provisioning is automated using Ansible.
Cloning Repo
All Ansible roles are included as Git submodules, so to clone the repo properly you must do the following:
git clone https://github.com/mrlesmithjr/vagrant-vault-consul-docker-monitoring.git --recursive
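If you have already cloned the repo without --recursive, you can pull in the role submodules afterwards:
git submodule update --init --recursive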
Requirements
- Vagrant (with a provider such as VirtualBox)
- Ansible
Environment
- node[0:2] (Consul Servers)
- node0
  - Grafana: http://192.168.250.10:3000
  - Netdata: http://192.168.250.10:19999
  - Prometheus: http://192.168.250.10:9090
- node3 (Vault)
- node[4:6] (Docker Swarm Managers)
- node[7:8] (Docker Swarm Workers/Consul Clients)
IP address assignments
- node0 (192.168.250.10)
- node1 (192.168.250.11)
- node2 (192.168.250.12)
- node3 (192.168.250.13)
- node4 (192.168.250.14)
- node5 (192.168.250.15)
- node6 (192.168.250.16)
- node7 (192.168.250.17)
- node8 (192.168.250.18)
Usage
Spin up Vagrant environment
vagrant up
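Provisioning all nine nodes takes a while. Once vagrant up completes, you can confirm that every node is running:
vagrant status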
cAdvisor
cAdvisor runs on each Docker host and exposes container metrics for Prometheus to scrape.
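The provisioning script below publishes cAdvisor on port 8080 of the Swarm nodes, so (assuming that port mapping) you can eyeball the Prometheus-format metrics with, for example:
curl -s http://192.168.250.17:8080/metrics | head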
Consul
Checking Consul member status:
vagrant ssh node0
vagrant@node0:~$ sudo consul members
Node   Address              Status  Type    Build  Protocol  DC
node0  192.168.250.10:8301  alive   server  0.8.1  2         dc1
node1  192.168.250.11:8301  alive   server  0.8.1  2         dc1
node2  192.168.250.12:8301  alive   server  0.8.1  2         dc1
node7  192.168.250.17:8301  alive   client  0.8.1  2         dc1
node8  192.168.250.18:8301  alive   client  0.8.1  2         dc1
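The same information is available from Consul's HTTP API (assuming the default API port 8500):
curl http://192.168.250.10:8500/v1/catalog/nodes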
Docker
Checking Docker swarm node status:
vagrant ssh node5
vagrant@node5:~$ sudo docker node ls
ID                          HOSTNAME  STATUS  AVAILABILITY  MANAGER STATUS
41oybdyk9njn7trhplpohk4tn * node5     Ready   Active        Leader
4zc9ndv7rurfbgrhfzxs68sux   node4     Ready   Active        Reachable
8c3y3ta5ad56hlhmfzx2wmdgr   node8     Ready   Active
vmmpixn2i401cyhgd5g4l3cfd   node7     Ready   Active
x50d9z0zkloixvijxht1l36we   node6     Ready   Active        Reachable
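From the same manager node you can also list the monitoring services deployed by the provisioning script (cadvisor, escluster, es-lb, and kibana):
vagrant@node5:~$ sudo docker service ls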
Elasticsearch
Elasticsearch runs as a Docker Swarm service and stores the Docker container logs.
To validate cluster functionality:
curl 'http://192.168.250.14:9200/_cluster/health?pretty=true'
{
  "cluster_name" : "elasticsearch",
  "status" : "green",
  "timed_out" : false,
  "number_of_nodes" : 5,
  "number_of_data_nodes" : 5,
  "active_primary_shards" : 5,
  "active_shards" : 10,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 0,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 100.0
}
You can run the same check against any of the other Docker Swarm hosts.
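You can also list the individual cluster members and see which node is the elected master using Elasticsearch's _cat API:
curl 'http://192.168.250.14:9200/_cat/nodes?v'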
Filebeat
Filebeat ships each host's Docker container logs to Elasticsearch.
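A quick way to confirm logs are arriving is to list the indices in Elasticsearch; assuming Filebeat's default index naming, you should see filebeat-* indices:
curl 'http://192.168.250.14:9200/_cat/indices?v'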
Grafana
Log into the Grafana web UI at http://192.168.250.10:3000 (username/password: admin/admin).
Add the Prometheus data source:
- Click Add data source
- Name: prometheus
- Type: Prometheus
- Url: http://192.168.250.10:9090
- Click Add
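Alternatively, the same data source can be created through Grafana's HTTP API; a minimal sketch, assuming the default admin/admin credentials and proxy access mode:
curl -u admin:admin -H 'Content-Type: application/json' \
  -d '{"name":"prometheus","type":"prometheus","url":"http://192.168.250.10:9090","access":"proxy"}' \
  http://192.168.250.10:3000/api/datasources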
Kibana
Kibana runs as a Docker Swarm service and provides a dashboard for viewing the Docker container logs stored in Elasticsearch. It is published on port 5601 of the swarm nodes (e.g. http://192.168.250.14:5601).
Netdata
node0 is configured as a Netdata registry; all other nodes, which also run Netdata, announce themselves to it.
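To verify that a node's Netdata instance is up, you can hit its API (assuming the default port 19999), e.g.:
curl -s http://192.168.250.10:19999/api/v1/charts | head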
Prometheus
Prometheus runs on node0 (http://192.168.250.10:9090) and scrapes the metrics exposed by the Docker hosts (e.g. cAdvisor).
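A quick way to check that scraping is working is to query the up metric through Prometheus's HTTP API:
curl 'http://192.168.250.10:9090/api/v1/query?query=up'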
Vault
Vault runs on node3 (192.168.250.13).
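To check the Vault server's status from the node itself (assuming the vault binary is on the PATH and VAULT_ADDR points at the local server):
vagrant ssh node3
vagrant@node3:~$ vault status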
Monitoring Docker Services
As part of the provisioning of this environment, the following are deployed as Docker Swarm services (see the script below):
- cAdvisor
- Elasticsearch cluster: http://192.168.250.14:9200 (fronted by an nginx TCP load balancer)
- Kibana: http://192.168.250.14:5601
Because these ports are published through the swarm routing mesh, you may also reach the services via any of the other Docker Swarm hosts.
#!/usr/bin/env bash
# Larry Smith Jr.
# @mrlesmithjr
# http://everythingshouldbevirtual.com

# Turn on verbose execution
set -x

BACKEND_NET="monitoring"
CADVISOR_IMAGE="google/cadvisor:v0.24.1"
ELASTICSEARCH_IMAGE="elasticsearch:2.4"
ELK_ES_SERVER_PORT="9200"
ELK_ES_SERVER="escluster"
ELK_REDIS_SERVER="redis"
FRONTEND_NET="elasticsearch-frontend"
KIBANA_IMAGE="kibana:4.6.3"
LABEL_GROUP="monitoring"

# Create the backend overlay network if it is missing
docker network ls | grep $BACKEND_NET
RC=$?
if [ $RC != 0 ]; then
  docker network create -d overlay $BACKEND_NET
fi

# Check for a running cAdvisor service and spin one up if not running
docker service ls | grep cadvisor
RC=$?
if [ $RC != 0 ]; then
  docker service create --name cadvisor \
    --mount type=bind,source=/var/lib/docker/,destination=/var/lib/docker:ro \
    --mount type=bind,source=/var/run,destination=/var/run:rw \
    --mount type=bind,source=/sys,destination=/sys:ro \
    --mount type=bind,source=/,destination=/rootfs:ro \
    --label org.label-schema.group="$LABEL_GROUP" \
    --network $BACKEND_NET \
    --mode global \
    --publish 8080:8080 \
    $CADVISOR_IMAGE
fi

# Spin up the official Elasticsearch Docker image
docker service ls | grep $ELK_ES_SERVER
RC=$?
if [ $RC != 0 ]; then
  docker service create \
    --endpoint-mode dnsrr \
    --mode global \
    --name $ELK_ES_SERVER \
    --network $BACKEND_NET \
    --update-delay 60s \
    --update-parallelism 1 \
    $ELASTICSEARCH_IMAGE \
    elasticsearch \
    -Des.discovery.zen.ping.multicast.enabled=false \
    -Des.discovery.zen.ping.unicast.hosts=$ELK_ES_SERVER \
    -Des.gateway.expected_nodes=3 \
    -Des.discovery.zen.minimum_master_nodes=2 \
    -Des.gateway.recover_after_nodes=2 \
    -Des.network.bind=_eth0:ipv4_
fi

# Spin up an nginx TCP load balancer in front of the Elasticsearch cluster
docker service ls | grep "es-lb"
RC=$?
if [ $RC != 0 ]; then
  # Give ES time to come up and create the cluster
  sleep 5m
  docker service create \
    --name "es-lb" \
    --network $BACKEND_NET \
    --publish 9200:9200 \
    -e BACKEND_SERVICE_NAME=$ELK_ES_SERVER \
    -e BACKEND_SERVICE_PORT="9200" \
    -e FRONTEND_SERVICE_PORT="9200" \
    mrlesmithjr/nginx-lb:ubuntu-tcp-lb
fi

# Spin up the official Kibana Docker image
docker service ls | grep kibana
RC=$?
if [ $RC != 0 ]; then
  docker service create \
    --mode global \
    --name kibana \
    --network $BACKEND_NET \
    --publish 5601:5601 \
    -e ELASTICSEARCH_URL=http://$ELK_ES_SERVER:$ELK_ES_SERVER_PORT \
    $KIBANA_IMAGE
fi
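Note that docker service commands must be run against a Swarm manager, so this script is intended to be executed on one of the manager nodes (node[4:6]).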
License
MIT
Author Information
Larry Smith Jr.