Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → holgerteichgraeber → Timeseriesclustering.jl

holgerteichgraeber / Timeseriesclustering.jl

Licence: mit

Julia implementation of unsupervised learning methods for time series datasets. It provides functionality for clustering and aggregating, detecting motifs, and quantifying similarity between time series datasets.

Programming Languages

julia

2034 projects

Labels

optimization clustering

Projects that are alternatives of or similar to Timeseriesclustering.jl

K Means Constrained

K-Means clustering - constrained with minimum and maximum cluster size

Stars: ✭ 33 (-32.65%)

Mutual labels: clustering, optimization

Moosefs

MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)

Stars: ✭ 1,025 (+1991.84%)

Mutual labels: clustering

Cytometry Clustering Comparison

R scripts to reproduce analyses in our paper comparing clustering methods for high-dimensional cytometry data

Stars: ✭ 30 (-38.78%)

Mutual labels: clustering

Model Optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Stars: ✭ 992 (+1924.49%)

Mutual labels: optimization

Online Relationship Learning

Unsupervised ML algorithm for predictive modeling and time-series analysis

Stars: ✭ 34 (-30.61%)

Mutual labels: optimization

Swarm

Easy clustering, registration, and distribution of worker processes for Erlang/Elixir

Stars: ✭ 1,004 (+1948.98%)

Mutual labels: clustering

Awesome Seo

Google SEO研究及流量变现

Stars: ✭ 942 (+1822.45%)

Mutual labels: optimization

Tinystr

A small ASCII-only bounded length string representation.

Stars: ✭ 48 (-2.04%)

Mutual labels: optimization

Gasmodels.jl

A Julia/JuMP Package for Gas Network Optimization

Stars: ✭ 43 (-12.24%)

Mutual labels: optimization

Clusteredbigcache

golang bigcache with clustering as a library.

Stars: ✭ 37 (-24.49%)

Mutual labels: clustering

Flexible Clustering

Clustering for arbitrary data and dissimilarity function

Stars: ✭ 37 (-24.49%)

Mutual labels: clustering

Compress

Compressing Representations for Self-Supervised Learning

Stars: ✭ 43 (-12.24%)

Mutual labels: clustering

Mlj.jl

A Julia machine learning framework

Stars: ✭ 982 (+1904.08%)

Mutual labels: clustering

Mob Suite

MOB-suite: Software tools for clustering, reconstruction and typing of plasmids from draft assemblies

Stars: ✭ 32 (-34.69%)

Mutual labels: clustering

Kdd2019 k Multiple Means

Implementation for the paper "K-Multiple-Means: A Multiple-Means Clustering Method with Specified K Clusters,", which has been accepted by KDD'2019 as an ORAL paper, in the Research Track.

Stars: ✭ 45 (-8.16%)

Mutual labels: clustering

Tsp solver

Solving tsp (travel sales problem) using ruin & recreate method.

Stars: ✭ 29 (-40.82%)

Mutual labels: optimization

Fuzzy C Means

A simple python implementation of Fuzzy C-means algorithm.

Stars: ✭ 40 (-18.37%)

Mutual labels: clustering

Scrattch.hicat

Hierarchical, iterative clustering for analysis of transcriptomics data in R

Stars: ✭ 47 (-4.08%)

Mutual labels: clustering

Smallfunction

Stack allocated and type-erased functors 🐜

Stars: ✭ 47 (-4.08%)

Mutual labels: optimization

Tris Webpack Boilerplate

A Webpack boilerplate for static websites that has all the necessary modern tools and optimizations built-in. Score a perfect 10/10 on performance.

Stars: ✭ 1,016 (+1973.47%)

Mutual labels: optimization

View All Similar Projects ➔

TimeSeriesClustering is a Julia implementation of unsupervised learning methods for time series datasets. It provides functionality for clustering and aggregating, detecting motifs, and quantifying similarity between time series datasets. The software provides a type system for temporal data, and provides an implementation of the most commonly used clustering methods and extreme value selection methods for temporal data. It provides simple integration of multi-dimensional time-series data (e.g. multiple attributes such as wind availability, solar availability, and electricity demand) in a single aggregation process. The software is applicable to general time series datasets and lends itself well to a multitude of application areas within the field of time series data mining.

The TimeSeriesClustering package was originally developed to perform time series aggregation for energy systems optimization problems. By reducing the number of time steps used in the optimization model, using representative periods leads to significant reductions in computational complexity of these problems. The package was previously known as ClustForOpt.jl.

The package has three main purposes:

Provide a simple process of finding representative periods (reducing the number of observations) for time-series input data, with implementations of the most commonly used clustering methods and extreme value selection methods.
Provide an interface between representative period data and application (e.g. optimization problem) by having representative period data stored in a generalized type system.
Provide a generalized import feature for time series, where variable names, attributes, and node names are automatically stored and can then be used later when the reduced time series is used in the application at hand (e.g. in the definition of sets of the optimization problem).

In the domain of energy systems optimization, an example problem that uses TimeSeriesClustering for its input data is the package CapacityExpansion, which implements a scalable generation and transmission capacity expansion problem.

The TimeSeriesClustering package follows the clustering framework presented in Teichgraeber and Brandt, 2019. The package is actively developed, and new features are continuously added. For a reproducible version of the methods and data of the original paper by Teichgraeber and Brandt, 2019, please refer to v0.1 (including shape based methods such as k-shape and dynamic time warping barycenter averaging).

This package is developed by Holger Teichgraeber @holgerteichgraeber and Elias Kuepper @YoungFaithful.

Installation

This package runs under julia v1.0 and higher. Install using:

import Pkg
Pkg.add("TimeSeriesClustering")

Documentation

Documentation (Stable): Please refer to this documentation for details on how to use TimeSeriesClustering the current version of TimeSeriesClustering. This is the documentation of the default version of the package. The default version is on the master branch.

Documentation (Development): If you like to try the development version of TimeSeriesClustering, please refer to this documentation. The development version is on the dev branch.

See NEWS for significant breaking changes when updating from one version of TimeSeriesClustering to another.

Citing TimeSeriesClustering

If you find TimeSeriesClustering useful in your work, we kindly request that you cite the following paper (link):

  @article{Teichgraeber2019joss,
  author = {Teichgraeber, Holger and Kuepper, Lucas Elias and Brandt, Adam R},
  doi = {https://doi.org/10.21105/joss.01573},
  journal = {Journal of Open Source Software},
  number = {41},
  pages = {1573},
  title = {TimeSeriesClustering : An extensible framework in Julia},
  volume = {4},
  year = {2019}
  }

If you find this package useful, our paper on comparing clustering methods for energy systems optimization problems may additionally be of interest.

Quick Start Guide

This quick start guide introduces the main concepts of using TimeSeriesClustering. The examples are taken from problems in the domain of scenario reduction for energy systems optimization. For more detail on the different functionalities that TimeSeriesClustering provides, please refer to the subsequent chapters of the documentation or the examples in the examples folder, specifically workflow_introduction.jl.

Generally, the workflow consists of three steps:

load data
find representative periods (clustering + extreme period selection)
optimization

Example Workflow

After TimeSeriesClustering is installed, you can use it by saying:

using TimeSeriesClustering

The first step is to load the data. The following example loads hourly wind, solar, and demand data for Germany (1 region) for one year.

ts_input_data = load_timeseries_data(:CEP_GER1)

The output ts_input_data is a ClustData data struct that contains the data and additional information about the data.

ts_input_data.data # a dictionary with the data.
ts_input_data.data["wind-germany"] # the wind data (choose solar, el_demand as other options in this example)
ts_input_data.K # number of periods

The second step is to cluster the data into representative periods. Here, we use k-means clustering and get 5 representative periods.

clust_res = run_clust(ts_input_data;method="kmeans",n_clust=5)
ts_clust_data = clust_res.clust_data

The ts_clust_data is a ClustData data struct, this time with clustered data (i.e. less representative periods).

ts_clust_data.data # the clustered data
ts_clust_data.data["wind-germany"] # the wind data. Note the dimensions compared to ts_input_data
ts_clust_data.K # number of periods

If this package is used in the domain of energy systems optimization, the clustered input data can be used as input to an optimization problem. The optimization problem formulated in the package CapacityExpansion can be used with the data clustered in this example.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 49

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (15) 🔗