alan-turing-institute / TCPD

Licence: MIT license
The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms

Programming Languages

python
139335 projects - #7 most used programming language
Makefile
30231 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to TCPD

CDLab
Yet another repository for developing and benchmarking deep learning-based change detection methods.
Stars: ✭ 59 (-35.16%)
Mutual labels:  change-detection
pywebsitechanges
Change detection with a simple Python script to email you whenever a website changes.
Stars: ✭ 44 (-51.65%)
Mutual labels:  change-detection
massive-change-detection
QGIS 2 plugin for applying change detection algorithms on high resolution satellite imagery
Stars: ✭ 18 (-80.22%)
Mutual labels:  change-detection
verify-changed-files
Github action to verify file changes that occur during the workflow execution.
Stars: ✭ 62 (-31.87%)
Mutual labels:  change-detection
Detached-Mapper
An ORM friendly mapper. Allows saving entire entity graphs. Heavily inspired in GraphDiff and AutoMapper.
Stars: ✭ 89 (-2.2%)
Mutual labels:  change-detection
DSMSCN
[MultiTemp 2019] Official Tensorflow implementation for Change Detection in Multi-temporal VHR Images Based on Deep Siamese Multi-scale Convolutional Neural Networks.
Stars: ✭ 63 (-30.77%)
Mutual labels:  change-detection
changed-files
Github action to retrieve all (added, copied, modified, deleted, renamed, type changed, unmerged, unknown) files and directories.
Stars: ✭ 733 (+705.49%)
Mutual labels:  change-detection
singular-spectrum-transformation
fast implementation of singular spectrum transformation (change point detection algorithm)
Stars: ✭ 41 (-54.95%)
Mutual labels:  changepoint
fabric
Urban change model designed to identify changes across 2 timestamps
Stars: ✭ 53 (-41.76%)
Mutual labels:  change-detection
ChangeDetectionRepository
This repository contains some python code of some traditional change detection methods or provides their original websites, such as SFA, MAD, and some deep learning-based change detection methods, such as SiamCRNN, DSFA, and some FCN-based methods.
Stars: ✭ 311 (+241.76%)
Mutual labels:  change-detection
websitechanges
Alerts you via email about a website change.
Stars: ✭ 43 (-52.75%)
Mutual labels:  change-detection
ChangeOS
ChangeOS: Building damage assessment via Deep Object-based Semantic Change Detection - (RSE 2021)
Stars: ✭ 33 (-63.74%)
Mutual labels:  change-detection
CRC4Docker
Python scripts for the textbook "Image Analysis, Classification and Change Detection in Remote Sensing, Fourth Revised Edition"
Stars: ✭ 84 (-7.69%)
Mutual labels:  change-detection
prophesee ros wrapper
ROS driver for Prophesee event-based sensors
Stars: ✭ 63 (-30.77%)
Mutual labels:  change-detection
FDCNN
The implementation of FDCNN in paper - A Feature Difference Convolutional Neural Network-Based Change Detection Method
Stars: ✭ 54 (-40.66%)
Mutual labels:  change-detection
vue-observable
IntersectionObserver, MutationObserver and PerformanceObserver in Vue.js
Stars: ✭ 24 (-73.63%)
Mutual labels:  change-detection
ng-profiler
Angular Profiler for Ivy, debugging and visualization tools
Stars: ✭ 31 (-65.93%)
Mutual labels:  change-detection
gochanges
**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-86.81%)
Mutual labels:  change-detection
On Change
Watch an object or array for changes
Stars: ✭ 1,709 (+1778.02%)
Mutual labels:  change-detection
SenseEarth2020-ChangeDetection
1st place solution to the Satellite Remote Sensing Image Change Detection Challenge hosted by SenseTime
Stars: ✭ 156 (+71.43%)
Mutual labels:  change-detection

Turing Change Point Dataset

Build Status DOI

Welcome to the host repository of the Turing Change Point Dataset, a set of time series specifically collected for the evaluation of change point detection algorithms on real-world data. This dataset was introduced in this paper. For the repository containing the code used for the experiments, see TCPDBench.

Useful links:

Introduction

Change point detection focuses on accurately detecting moments of abrupt change in the behavior of a time series. While many methods for change point detection exists, past research has paid little attention to the evaluation of existing algorithms on real-world data. This work introduces a benchmark study and a dataset (TCPD) that are explicitly designed for the evaluation of change point detection algorithms. We hope that our work becomes a proving ground for the evaluation and development of change point detection algorithms that work well in practice.

This repository contains the code needed to obtain the time series in the dataset. For the benchmark study, see TCPDBench. Note that work based on the dataset should cite our paper:

@article{vandenburg2020evaluation,
        title={An Evaluation of Change Point Detection Algorithms},
        author={{Van den Burg}, G. J. J. and Williams, C. K. I.},
        journal={arXiv preprint arXiv:2003.06222},
        year={2020}
}

The annotations are stored in the annotations.json file, which is the same as that used in the experiments (see here). Annotations are organised in a JSON object by dataset name and annotator id, and use 0-based indexing. See the TCPDBench repository for more information on extending the benchmark with your own methods or datasets.

Getting Started

Many of the time series in the dataset are included in this repository. However, due to licensing restrictions, some series can not be redistributed and need to be downloaded locally. We've added a Python script and a Makefile to make this process as easy as possible. There is also a Dockerfile to facilitate reproducibility.

Using Docker

To build the dataset using Docker, first build the docker image:

$ docker build -t tcpd https://github.com/alan-turing-institute/TCPD.git

then build the dataset:

$ docker run -i -t -v /path/to/where/you/want/the/dataset:/TCPD/export tcpd

Using the command line

To obtain the dataset, please run the following steps:

  1. Clone the GitHub repository and change to the new directory:

    $ git clone https://github.com/alan-turing-institute/TCPD
    $ cd TCPD
    
  2. Make sure you have Python (v3.2 or newer) installed, as well as virtualenv:

    $ pip install virtualenv
    
  3. Next, use either of these steps:

    • To obtain the dataset using Make, simply run:

      $ make
      

      This command will download all remaining datasets and verify that they match the expected checksums.

    • If you don't have Make, you can obtain the dataset by manually executing the following commands:

      $ virtualenv ./venv
      $ source ./venv/bin/activate
      $ pip install -r requirements.txt
      $ python build_tcpd.py -v collect
      

      If you wish to verify the downloaded datasets you can run:

      $ python ./utils/check_checksums.py -v -c ./checksums.json -d ./datasets
      
  4. It may be convenient to export all dataset files to a single directory. This can be done using Make as follows:

    $ make export
    

All datasets are stored in individual directories inside the datasets directory and each has its own README file with additional metadata and sources. The data format used is JSON and each file follows the JSON Schema provided in schema.json.

Using the data

For your convenience, example code to load a dataset from the JSON format to a data frame is provided in the examples directory in the following languages:

Implementations of various change point detection algorithms that use these datasets are available in TCPDBench. A script to plot the datasets and detection results from TCPDBench is also provided in utils/plot_dataset.py.

The annotations are included in the annotations.json file. They are in the format:

{
  "<dataset>": {
      "annotator_id": [
          <change point index>
          <change point index>
          ...
          ],
      ...
  },
  ...
}

where the annotator_id is a unique ID for the annotator and the change point indices are 0-based. Please also see the documentation in TCPDBench for more information about using the dataset and benchmark in your own work.

License

The code in this repository is licensed under the MIT license. See the LICENSE file for more details. Individual data files are often distributed under different terms, see the relevant README files for more details. Work that uses this dataset should cite our paper.

Notes

If you find any problems or have a suggestion for improvement of this repository, please let us know as it will help us make this resource better for everyone. You can open an issue on GitHub or send an email to gertjanvandenburg at gmail dot com.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].