All Projects → logpai → Log3c

logpai / Log3c

Log-based Impactful Problem Identification using Machine Learning [FSE'18]

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Log3c

Loghub
A large collection of system log datasets for AI-powered log analytics
Stars: ✭ 551 (+320.61%)
Mutual labels:  log-analysis, anomaly-detection
Logdeep
log anomaly detection toolkit including DeepLog
Stars: ✭ 125 (-4.58%)
Mutual labels:  log-analysis, anomaly-detection
Logparser
A toolkit for automated log parsing [ICSE'19, TDSC'18, DSN'16]
Stars: ✭ 620 (+373.28%)
Mutual labels:  log-analysis, anomaly-detection
Loglizer
A log analysis toolkit for automated anomaly detection [ISSRE'16]
Stars: ✭ 785 (+499.24%)
Mutual labels:  log-analysis, anomaly-detection
Skip Ganomaly
Source code for Skip-GANomaly paper
Stars: ✭ 107 (-18.32%)
Mutual labels:  anomaly-detection
Wazuh Documentation
Wazuh - Project documentation
Stars: ✭ 82 (-37.4%)
Mutual labels:  log-analysis
Surpriver
Find big moving stocks before they move using machine learning and anomaly detection
Stars: ✭ 1,152 (+779.39%)
Mutual labels:  anomaly-detection
Coursera Ml Py
Python programming assignments for Machine Learning by Prof. Andrew Ng in Coursera
Stars: ✭ 1,140 (+770.23%)
Mutual labels:  anomaly-detection
Foremast
Foremast adds application resiliency to Kubernetes by leveraging machine learnt patterns of application health to keep applications healthy and stable
Stars: ✭ 115 (-12.21%)
Mutual labels:  anomaly-detection
Keras Oneclassanomalydetection
[5 FPS - 150 FPS] Learning Deep Features for One-Class Classification (AnomalyDetection). Corresponds RaspberryPi3. Convert to Tensorflow, ONNX, Caffe, PyTorch. Implementation by Python + OpenVINO/Tensorflow Lite.
Stars: ✭ 102 (-22.14%)
Mutual labels:  anomaly-detection
Awesome Anomaly Detection
A curated list of awesome anomaly detection resources
Stars: ✭ 1,378 (+951.91%)
Mutual labels:  anomaly-detection
Ml
A high-level machine learning and deep learning library for the PHP language.
Stars: ✭ 1,270 (+869.47%)
Mutual labels:  anomaly-detection
Octopussy
Octopussy - Open Source Log Management Solution
Stars: ✭ 111 (-15.27%)
Mutual labels:  log-analysis
Sentinl
Kibana Alert & Report App for Elasticsearch
Stars: ✭ 1,233 (+841.22%)
Mutual labels:  anomaly-detection
Ft Tree
IWQoS 2017: A toolkit for log template extraction
Stars: ✭ 123 (-6.11%)
Mutual labels:  log-analysis
Timecop
Time series based anomaly detector
Stars: ✭ 65 (-50.38%)
Mutual labels:  anomaly-detection
Pytorch cpp
Deep Learning sample programs using PyTorch in C++
Stars: ✭ 114 (-12.98%)
Mutual labels:  anomaly-detection
Visual Feature Attribution Using Wasserstein Gans Pytorch
Implementation of Visual Feature Attribution using Wasserstein GANs (VAGANs, https://arxiv.org/abs/1711.08998) in PyTorch
Stars: ✭ 88 (-32.82%)
Mutual labels:  anomaly-detection
Pysad
Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)
Stars: ✭ 87 (-33.59%)
Mutual labels:  anomaly-detection
Deepai
Detection of Accounting Anomalies using Deep Autoencoder Neural Networks - A lab we prepared for NVIDIA's GPU Technology Conference 2018 that will walk you through the detection of accounting anomalies using deep autoencoder neural networks. The majority of the lab content is based on Jupyter Notebook, Python and PyTorch.
Stars: ✭ 104 (-20.61%)
Mutual labels:  anomaly-detection

Log3C

Log3C is a general framework that identifies service system problems from system logs. It utilizes both system logs and system KPI metrics to promptly and precisely identify impactful system problems. Log3C consists of four steps: Log parsing, Sequence vectorization, Cascading Clustering and Correlation analysis. This is a joint work by CUHK and Microsoft Research.

The repository contains the source code of Log3C, including data loading, sequence vectorization, cascading clustering, data saving, etc. The core part is the cascading clustering algorithm, which groups a large number of sequence vectors into clusters by iteratively sampling, clustering, matching. For more details, please refer to our paper:

Prerequisites:

  • Python version 3.5 or above
  • All required packages are installed
  • Windows, Linux or macOS platform

Note: Anaconda (Python 3.5 or above) is highly recommended, all required packages are already installed in Anaconda. You can also install the required packages with the "requirements.txt" by using command:

pip install -r requirements.txt

Installing:

  1. Download the project code files with:

    git clone https://github.com/logpai/Log3C.git

  2. Go to the project directory

    cd Log3C

Usage:

To use the model, open a terminal, change directory to this project code, run the command:

python run.py

Project Structure:

  1. run.py: main entry function, which defines all the required hyper-parameters.
  2. cascading_clustering.py: implementation of the cascading clustering algorithm.
  3. dataloader.py: load the input data files into memory
  4. save_results.py: save the clustering result into files.

For details, please refer to the code comments.

Data Format:

  • Multiple log sequence matrix files: each file consists of log sequence vectors within a time interval.
  • The KPI data: each KPI value corresponds to the system status of a time interval.
  1. A log sequence matrix at time interval T0:
Log Seq Event1 Event2 Event3 Event4 ...
 1   2     1     0     2     ...
2 3 2 4 1 ...
3 2 1 3 3 ...
... ... ... ... ... ...

Assume that there are N time intervals in total, then we have N such matrixes as well as N KPI values. These K KPI values are stored in one file, which is shown as below.

  1. KPI data
Time Interval KPI
T0 0.05
T1 0.10
T2 0.07
.. ...

Only one KPI file, which contains N KPI values.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].