Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → EthanRosenthal → Skits

EthanRosenthal / Skits

Licence: mit

scikit-learn-inspired time series

Programming Languages

python

139335 projects - #7 most used programming language

Labels

machine-learning time-series

Projects that are alternatives of or similar to Skits

Pyodds

An End-to-end Outlier Detection System

Stars: ✭ 141 (-10.76%)

Mutual labels: time-series

Gorilla Tsc

Implementation of time series compression method from the Facebook's Gorilla paper

Stars: ✭ 147 (-6.96%)

Mutual labels: time-series

Pyfts

An open source library for Fuzzy Time Series in Python

Stars: ✭ 154 (-2.53%)

Mutual labels: time-series

Friartuck

Live Quant Trading Framework for Robinhood, using IEX Trading and AlphaVantage for Free Prices.

Stars: ✭ 142 (-10.13%)

Mutual labels: time-series

Anomaly detection tuto

Anomaly detection tutorial on univariate time series with an auto-encoder

Stars: ✭ 144 (-8.86%)

Mutual labels: time-series

Forecasting

Time Series Forecasting Best Practices & Examples

Stars: ✭ 2,123 (+1243.67%)

Mutual labels: time-series

Survival Analysis Using Deep Learning

This repository contains morden baysian statistics and deep learning based research articles , software for survival analysis

Stars: ✭ 139 (-12.03%)

Mutual labels: time-series

Pyflux

Open source time series library for Python

Stars: ✭ 1,932 (+1122.78%)

Mutual labels: time-series

Hurst

Hurst exponent evaluation and R/S-analysis in Python

Stars: ✭ 148 (-6.33%)

Mutual labels: time-series

Adaptive Alerting

Anomaly detection for streaming time series, featuring automated model selection.

Stars: ✭ 152 (-3.8%)

Mutual labels: time-series

Sweep

Extending broom for time series forecasting

Stars: ✭ 143 (-9.49%)

Mutual labels: time-series

Tscv

Time Series Cross-Validation -- an extension for scikit-learn

Stars: ✭ 145 (-8.23%)

Mutual labels: time-series

Gluon Ts

Probabilistic time series modeling in Python

Stars: ✭ 2,373 (+1401.9%)

Mutual labels: time-series

Matrixprofile

A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.

Stars: ✭ 141 (-10.76%)

Mutual labels: time-series

Java Timeseries

Time series analysis in Java

Stars: ✭ 155 (-1.9%)

Mutual labels: time-series

Data science blogs

A repository to keep track of all the code that I end up writing for my blog posts.

Stars: ✭ 139 (-12.03%)

Mutual labels: time-series

Vde

Variational Autoencoder for Dimensionality Reduction of Time-Series

Stars: ✭ 148 (-6.33%)

Mutual labels: time-series

Celerite

Scalable 1D Gaussian Processes in C++, Python, and Julia

Stars: ✭ 155 (-1.9%)

Mutual labels: time-series

Java Deep Learning Cookbook

Code for Java Deep Learning Cookbook

Stars: ✭ 156 (-1.27%)

Mutual labels: time-series

Asap

ASAP: Prioritizing Attention via Time Series Smoothing

Stars: ✭ 151 (-4.43%)

Mutual labels: time-series

View All Similar Projects ➔

skits

A library for SciKit-learn-Inspired Time Series models.

The primary goal of this library is to allow one to train time series prediction models using a similar API to scikit-learn. Consequently, similar to scikit-learn, this library consists of preprocessors, feature_extractors, and pipelines.

Installation

Install with pip:

pip install skits

Preprocessors

The preprocessors expect to receive time series data, and then end up storing some data about the time series such that they can fully invert a transform. The following example shows how to create a DifferenceTransformer transform data, and then invert it back to its original form. The DifferenceTransformer subtracts the point shifted by period away from each point.

import numpy as np
from skits.preprocessing import DifferenceTransformer

y = np.random.random(10)
# scikit-learn expects 2D design matrices,
# so we duplicate the time series.
X = y[:, np.newaxis] 

dt = DifferenceTransformer(period=2)

Xt = dt.fit_transform(X,y)
X_inv = dt.inverse_transform(Xt)

assert np.allclose(X, X_inv)

Feature Extractors

After all preprocessing transformations are completed, multiple features may be built out of the time series. These can be built via feature extractors, which one should combine together into a large FeatureUnion. Current features include autoregressive, seasonal, and integrated features (covering the AR and I of ARIMA models).

Pipelines

There are two types of pipelines. The ForecasterPipeline is for forecasting time series (duh). Specifically, one should build this pipeline with a regressor as the final step such that one can make appropriate predictions. The functionality is similar to a regular scikit-learn pipeline. Differences include the addition of a forecast() method along with a to_scale keyword argument to predict() such that one can make sure that their prediction is on the same scale as the original data.

These classes are likely subject to change as they are fairly hacky right now. For example, one must transform both X and y for all transformations before the introduction of a DifferenceTransformer. While the pipeline handles this, one must prefix all of these transformations with pre_ in the step names.

Anywho, here's an example:

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import FeatureUnion

from skits.pipeline import ForecasterPipeline
from skits.preprocessing import ReversibleImputer
from skits.feature_extraction import (AutoregressiveTransformer, 
                                      SeasonalTransformer)
                               
steps = [
    ('pre_scaling', StandardScaler()),
    ('features', FeatureUnion([
        ('ar_transformer', AutoregressiveTransformer(num_lags=3)),
        ('seasonal_transformer', SeasonalTransformer(seasonal_period=20)
    )])),
    ('post_features_imputer', ReversibleImputer()),
    ('regressor', LinearRegression(fit_intercept=False))
]
                               
l = np.linspace(0, 1, 101)
y = 5*np.sin(2 * np.pi * 5 * l) + np.random.normal(0, 1, size=101)
X = y[:, np.newaxis]

pipeline = ForecasterPipeline(steps)

pipeline.fit(X, y)
y_pred = pipeline.predict(X, to_scale=True, refit=True)

And this ends up looking like:

import matplotlib.pyplot as plt

plt.plot(y, lw=2)
plt.plot(y_pred, lw=2)
plt.legend(['y_true', 'y_pred'], bbox_to_anchor=(1, 1));

And forecasting looks like

start_idx = 70
plt.plot(y, lw=2);
plt.plot(pipeline.forecast(y[:, np.newaxis], start_idx=start_idx), lw=2);
ax = plt.gca();
ylim = ax.get_ylim();
plt.plot((start_idx, start_idx), ylim, lw=4);
plt.ylim(ylim);
plt.legend(['y_true', 'y_pred', 'forecast start'], bbox_to_anchor=(1, 1));

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 158

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (4) 🔗