All Projects → etsy → Skyline

etsy / Skyline

Licence: other
It'll detect your anomalies! Part of the Kale stack.

Programming Languages

python
139335 projects - #7 most used programming language
javascript
184084 projects - #8 most used programming language
CSS
56736 projects
shell
77523 projects
HTML
75241 projects

Labels

Projects that are alternatives of or similar to Skyline

Logster
Parse log files, generate metrics for Graphite and Ganglia
Stars: ✭ 1,967 (-7.61%)
Mutual labels:  non-sox
Deployinator
Deployinate!
Stars: ✭ 1,889 (-11.27%)
Mutual labels:  non-sox
Androidstaggeredgrid
An Android staggered grid view which supports multiple columns with rows of varying sizes.
Stars: ✭ 4,794 (+125.18%)
Mutual labels:  non-sox
geonames
Scripts for using Geonames
Stars: ✭ 24 (-98.87%)
Mutual labels:  non-sox
PushBot
An IRC Bot for organizing code pushes
Stars: ✭ 80 (-96.24%)
Mutual labels:  non-sox
incpath
PHP extension to support atomic deploys
Stars: ✭ 50 (-97.65%)
Mutual labels:  non-sox
cdncontrol ui
A web UI for Etsy's cdncontrol tool
Stars: ✭ 13 (-99.39%)
Mutual labels:  non-sox
Apotheosis
No description or website provided.
Stars: ✭ 27 (-98.73%)
Mutual labels:  non-sox
DebriefingFacilitationGuide
Leading Groups at Etsy to Learn From Accidents
Stars: ✭ 237 (-88.87%)
Mutual labels:  non-sox
etsy.github.com
Etsy! on Github!
Stars: ✭ 15 (-99.3%)
Mutual labels:  non-sox
ELK-utils
Utilities for working with the ELK (Elasticsearch, Logstash, Kibana) stack
Stars: ✭ 58 (-97.28%)
Mutual labels:  non-sox
ab
Etsy's little framework for A/B testing, feature ramp up, and more.
Stars: ✭ 129 (-93.94%)
Mutual labels:  non-sox

Skyline is an Archived Project

Skyline is no longer actively maintained. Your mileage with patches may vary.

Skyline

Build Status

x

Skyline is a real-time* anomaly detection* system*, built to enable passive monitoring of hundreds of thousands of metrics, without the need to configure a model/thresholds for each one, as you might do with Nagios. It is designed to be used wherever there are a large quantity of high-resolution timeseries which need constant monitoring. Once a metrics stream is set up (from StatsD or Graphite or other source), additional metrics are automatically added to Skyline for analysis. Skyline's easily extendible algorithms automatically detect what it means for each metric to be anomalous. After Skyline detects an anomalous metric, it surfaces the entire timeseries to the webapp, where the anomaly can be viewed and acted upon.

Read the details in the wiki.

Install

  1. sudo pip install -r requirements.txt for the easy bits

  2. Install numpy, scipy, pandas, patsy, statsmodels, msgpack_python in that order.

  3. You may have trouble with SciPy. If you're on a Mac, try:

  • sudo port install gcc48
  • sudo ln -s /opt/local/bin/gfortran-mp-4.8 /opt/local/bin/gfortran
  • sudo pip install scipy

On Debian, apt-get works well for Numpy and SciPy. On Centos, yum should do the trick. If not, hit the Googles, yo.

  1. cp src/settings.py.example src/settings.py

  2. Add directories:

sudo mkdir /var/log/skyline
sudo mkdir /var/run/skyline
sudo mkdir /var/log/redis
sudo mkdir /var/dump/
  1. Download and install the latest Redis release

  2. Start 'er up

  • cd skyline/bin
  • sudo redis-server redis.conf
  • sudo ./horizon.d start
  • sudo ./analyzer.d start
  • sudo ./webapp.d start

By default, the webapp is served on port 1500.

  1. Check the log files to ensure things are running.

Debian + Vagrant specific, if you prefer

Gotchas

  • If you already have a Redis instance running, it's recommended to kill it and restart using the configuration settings provided in bin/redis.conf

  • Be sure to create the log directories.

Hey! Nothing's happening!

Of course not. You've got no data! For a quick and easy test of what you've got, run this:

cd utils
python seed_data.py

This will ensure that the Horizon service is properly set up and can receive data. For real data, you have some options - see wiki

Once you get real data flowing through your system, the Analyzer will be able start analyzing for anomalies!

Alerts

Skyline can alert you! In your settings.py, add any alerts you want to the ALERTS list, according to the schema (metric keyword, strategy, expiration seconds) where strategy is one of smtp, hipchat, or pagerduty. You can also add your own alerting strategies. For every anomalous metric, Skyline will search for the given keyword and trigger the corresponding alert(s). To prevent alert fatigue, Skyline will only alert once every for any given metric/strategy combination. To enable Hipchat integration, uncomment the python-simple-hipchat line in the requirements.txt file.

How do you actually detect anomalies?

An ensemble of algorithms vote. Majority rules. Batteries kind of included. See wiki

Architecture

See the rest of the wiki

Contributions

  1. Clone your fork
  2. Hack away
  3. If you are adding new functionality, document it in the README or wiki
  4. If necessary, rebase your commits into logical chunks, without errors
  5. Verfiy your code by running the test suite and pep8, adding additional tests if able.
  6. Push the branch up to GitHub
  7. Send a pull request to the etsy/skyline project.

We actively welcome contributions. If you don't know where to start, try checking out the issue list and fixing up the place. Or, you can add an algorithm - a goal of this project is to have a very robust set of algorithms to choose from.

Also, feel free to join the skyline-dev mailing list for support and discussions of new features.

(*depending on your data throughput, *you might need to write your own algorithms to handle your exact data, *it runs on one box)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].