Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

IBCNServices / Gendis

Licence: other

Contains an implementation (sklearn API) of the algorithm proposed in "GENDIS: GEnetic DIscovery of Shapelets" and code to reproduce all experiments.

Labels

jupyter-notebook data-mining time-series-analysis evolutionary-algorithms

Projects that are alternatives of or similar to Gendis

Tsai

Time series Timeseries Deep Learning Pytorch fastai - State-of-the-art Deep Learning with Time Series and Sequences in Pytorch / fastai

Stars: ✭ 407 (+589.83%)

Mutual labels: jupyter-notebook, time-series-analysis

Feature Engineering And Feature Selection

A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.

Stars: ✭ 526 (+791.53%)

Mutual labels: jupyter-notebook, data-mining

Mli Resources

H2O.ai Machine Learning Interpretability Resources

Stars: ✭ 428 (+625.42%)

Mutual labels: jupyter-notebook, data-mining

Lasio

Python library for reading and writing well data using Log ASCII Standard (LAS) files

Stars: ✭ 234 (+296.61%)

Mutual labels: jupyter-notebook, data-mining

Spring2017 proffosterprovost

Introduction to Data Science

Stars: ✭ 18 (-69.49%)

Mutual labels: jupyter-notebook, data-mining

Pydataroad

open source for wechat-official-account (ID: PyDataLab)

Stars: ✭ 302 (+411.86%)

Mutual labels: jupyter-notebook, data-mining

Rong360

用户贷款风险预测

Stars: ✭ 489 (+728.81%)

Mutual labels: jupyter-notebook, data-mining

2017 Ccf Bdci Aijudge

2017-CCF-BDCI-让AI当法官(初赛)：7th/415 (Top 1.68%)

Stars: ✭ 177 (+200%)

Mutual labels: jupyter-notebook, data-mining

Cookbook 2nd

IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018

Stars: ✭ 704 (+1093.22%)

Mutual labels: jupyter-notebook, data-mining

Cookbook 2nd Code

Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]

Stars: ✭ 541 (+816.95%)

Mutual labels: jupyter-notebook, data-mining

Amazing Feature Engineering

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

Stars: ✭ 218 (+269.49%)

Mutual labels: jupyter-notebook, data-mining

Drugs Recommendation Using Reviews

Analyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.

Stars: ✭ 35 (-40.68%)

Mutual labels: jupyter-notebook, data-mining

Gwu data mining

Materials for GWU DNSC 6279 and DNSC 6290.

Stars: ✭ 217 (+267.8%)

Mutual labels: jupyter-notebook, data-mining

Sktime

A unified framework for machine learning with time series

Stars: ✭ 4,741 (+7935.59%)

Mutual labels: data-mining, time-series-analysis

Auto ts

Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Now updated with Dask to handle millions of rows.

Stars: ✭ 195 (+230.51%)

Mutual labels: jupyter-notebook, time-series-analysis

Anomaly Detection Resources

Anomaly detection related books, papers, videos, and toolboxes

Stars: ✭ 5,306 (+8893.22%)

Mutual labels: data-mining, time-series-analysis

Data Science Resources

👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋

Stars: ✭ 171 (+189.83%)

Mutual labels: jupyter-notebook, data-mining

Evolutionary Computation Course

Jupyter/IPython notebooks about evolutionary computation.

Stars: ✭ 173 (+193.22%)

Mutual labels: jupyter-notebook, evolutionary-algorithms

Interpretable machine learning with python

Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.

Stars: ✭ 530 (+798.31%)

Mutual labels: jupyter-notebook, data-mining

Awesome Ai Books

Some awesome AI related books and pdfs for learning and downloading, also apply some playground models for learning

Stars: ✭ 855 (+1349.15%)

Mutual labels: jupyter-notebook, data-mining

View All Similar Projects ➔

GENDIS

GENetic DIscovery of Shapelets

In the time series classification domain, shapelets are small subseries that are discriminative for a certain class. It has been shown that by projecting the original dataset to a distance space, where each axis corresponds to the distance to a certain shapelet, classifiers are able to achieve state-of-the-art results on a plethora of datasets.

This repository contains an implementation of GENDIS, an algorithm that searches for a set of shapelets in a genetic fashion. The algorithm is insensitive to its parameters (such as population size, crossover and mutation probability, ...) and can quickly extract a small set of shapelets that is able to achieve predictive performances similar (or better) to that of other shapelet techniques.

Installation

We currently support Python 3.5 & Python 3.6. For installation, there are two alternatives:

Clone the repository https://github.com/IBCNServices/GENDIS.git and run (python3 -m) pip -r install requirements.txt
GENDIS is hosted on PyPi. You can just run (python3 -m) pip install gendis to add gendis to your dist-packages (you can use it from everywhere).

Make sure NumPy and Cython is already installed (pip install numpy and pip install Cython), since that is required for the setup script.

Tutorial & Example

1. Loading & preprocessing the datasets

In a first step, we need to construct at least a matrix with timeseries (X_train) and a vector with labels (y_train). Additionally, test data can be loaded as well in order to evaluate the pipeline in the end.

import pandas as pd
# Read in the datafiles
train_df = pd.read_csv(<DATA_FILE>)
test_df = pd.read_csv(<DATA_FILE>)
# Split into feature matrices and label vectors
X_train = train_df.drop('target', axis=1)
y_train = train_df['target']
X_test = test_df.drop('target', axis=1)
y_test = test_df['target']

2. Creating a `GeneticExtractor` object

Construct the object. For a list of all possible parameters, and a description, please refer to the documentation in the code

from gendis.genetic import GeneticExtractor
genetic_extractor = GeneticExtractor(population_size=50, iterations=25, verbose=True, 
                                     mutation_prob=0.3, crossover_prob=0.3, 
                                     wait=10, max_len=len(X_train) // 2)

3. Fit the `GeneticExtractor` and construct distance matrix

shapelets = genetic_extractor.fit(X_train, y_train)
distances_train = genetic_extractor.transform(X_train)
distances_test = genetic_extractor.transform(X_test)

4. Fit ML classifier on constructed distance matrix

from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
lr = LogisticRegression()
lr.fit(distances_train, y_train)

print('Accuracy = {}'.format(accuracy_score(y_test, lr.predict(distances_test))))

Example notebook

A simple example is provided in this notebook

Data

All datasets in this repository are downloaded from timeseriesclassification. Please refer to them appropriately when using any dataset.

Paper experiments

In order to reproduce the results from the corresponding paper, please check out this directory.

Tests

We provide a few doctests and unit tests. To run the doctests: python3 -m doctest -v <FILE>, where <FILE> is the Python file you want to run the doctests from. To run unit tests: nose2 -v

Contributing, Citing and Contact

If you have any questions, are experiencing bugs in the GENDIS implementation, or would like to contribute, please feel free to create an issue/pull request in this repository or take contact with me at gilles(dot)vandewiele(at)ugent(dot)be

If you use GENDIS in your work, please use the following citation:

@article{vandewiele2021gendis,
  title={GENDIS: Genetic Discovery of Shapelets},
  author={Vandewiele, Gilles and Ongenae, Femke and Turck, Filip De},
  journal={Sensors},
  volume={21},
  number={4},
  pages={1059},
  year={2021},
  publisher={Multidisciplinary Digital Publishing Institute}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 59

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (6) 🔗

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

IBCNServices / Gendis

Labels

Projects that are alternatives of or similar to Gendis

GENDIS

GENetic DIscovery of Shapelets

Installation

Tutorial & Example

1. Loading & preprocessing the datasets

2. Creating a GeneticExtractor object

3. Fit the GeneticExtractor and construct distance matrix

4. Fit ML classifier on constructed distance matrix

Example notebook

Data

Paper experiments

Tests

Contributing, Citing and Contact

2. Creating a `GeneticExtractor` object

3. Fit the `GeneticExtractor` and construct distance matrix