
zhouhaoyi / ETDataset

Licence: other
The Electricity Transformer dataset is collected to support the further investigation on the long sequence forecasting problem.

Projects that are alternatives of or similar to ETDataset

Introduction To Time Series Forecasting Python
Introduction to time series preprocessing and forecasting in Python using AR, MA, ARMA, ARIMA, SARIMA and Prophet model with forecast evaluation.
Stars: ✭ 173 (-40.34%)
Mutual labels:  forecasting
query-selector
LONG-TERM SERIES FORECASTING WITH QUERYSELECTOR – EFFICIENT MODEL OF SPARSEATTENTION
Stars: ✭ 63 (-78.28%)
Mutual labels:  forecasting
AutoTS
Automated Time Series Forecasting
Stars: ✭ 665 (+129.31%)
Mutual labels:  forecasting
Modeltime
Modeltime unlocks time series forecast models and machine learning in one framework
Stars: ✭ 189 (-34.83%)
Mutual labels:  forecasting
dbnR
Gaussian dynamic Bayesian networks structure learning and inference based on the bnlearn package
Stars: ✭ 33 (-88.62%)
Mutual labels:  forecasting
wattnet-fx-trading
WATTNet: Learning to Trade FX with Hierarchical Spatio-Temporal Representations of Highly Multivariate Time Series
Stars: ✭ 70 (-75.86%)
Mutual labels:  forecasting
Covid19 Severity Prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
Stars: ✭ 170 (-41.38%)
Mutual labels:  forecasting
pssa
Singular Spectrum Analysis for time series forecasting in Python
Stars: ✭ 119 (-58.97%)
Mutual labels:  forecasting
SARS-CoV-2-Nowcasting und -R-Schaetzung
Nowcasting produces an estimate of the course of the number of SARS-CoV-2 cases that have already occurred in Germany, taking into account delays in diagnosis, reporting, and transmission.
Stars: ✭ 80 (-72.41%)
Mutual labels:  forecasting
forecastVeg
A Machine Learning Approach to Forecasting Remotely Sensed Vegetation Health in Python
Stars: ✭ 44 (-84.83%)
Mutual labels:  forecasting
Tcdf
Temporal Causal Discovery Framework (PyTorch): discovering causal relationships between time series
Stars: ✭ 217 (-25.17%)
Mutual labels:  forecasting
anompy
A Python library for anomaly detection
Stars: ✭ 13 (-95.52%)
Mutual labels:  forecasting
McMasterPandemic
SEIR+ model
Stars: ✭ 18 (-93.79%)
Mutual labels:  forecasting
Supplychainpy
Supplychainpy is a Python library for supply chain analysis, modelling and simulation. The library assists a workflow that is reliant on Excel and VBA.
Stars: ✭ 184 (-36.55%)
Mutual labels:  forecasting
transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
Stars: ✭ 201 (-30.69%)
Mutual labels:  long-sequence
Prophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Stars: ✭ 13,832 (+4669.66%)
Mutual labels:  forecasting
ForestCoverChange
Detecting and Predicting Forest Cover Change in Pakistani Areas Using Remote Sensing Imagery
Stars: ✭ 23 (-92.07%)
Mutual labels:  forecasting
hydrotools
Suite of tools for retrieving USGS NWIS observations and evaluating National Water Model (NWM) data.
Stars: ✭ 36 (-87.59%)
Mutual labels:  forecasting
stock-market-forecasting
Stock market forecasting using the ARIMA model.
Stars: ✭ 32 (-88.97%)
Mutual labels:  forecasting
PyForecast
PyForecast is a statistical modeling tool used by Reclamation water managers and reservoir operators to train and build predictive models for seasonal inflows and streamflows. PyForecast allows users to make current water-year forecasts using models developed with the program.
Stars: ✭ 24 (-91.72%)
Mutual labels:  forecasting

Electricity Transformer Dataset (ETDataset)

This GitHub repo provides several datasets that can be used for the long sequence time-series forecasting problem. All datasets have been preprocessed and stored as .csv files. The data ranges from 2016/07 to 2018/07. Chinese version

Dataset list (updating)

  • ETT-small: The data of 2 electricity transformers at 2 stations, including load and oil temperature.
  • ETT-large: The data of 39 electricity transformers at 39 stations, including load and oil temperature.
  • ETT-full: The data of 69 transformer stations at 39 stations, including load, oil temperature, location, climate, and demand.

If you use this dataset, please cite the Informer paper (AAAI 2021 Best Paper Award) [paper] [code] [video]:

@inproceedings{haoyietal-informer-2021,
  author    = {Haoyi Zhou and
               Shanghang Zhang and
               Jieqi Peng and
               Shuai Zhang and
               Jianxin Li and
               Hui Xiong and
               Wancai Zhang},
  title     = {Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting},
  booktitle = {The Thirty-Fifth {AAAI} Conference on Artificial Intelligence, {AAAI} 2021, Virtual Conference},
  volume    = {35},
  number    = {12},
  pages     = {11106--11115},
  publisher = {{AAAI} Press},
  year      = {2021},
}

Why is oil temperature included in this dataset?

The electric power distribution problem concerns how electricity is distributed to different areas according to their sequential usage. Predicting the future demand of a specific area is difficult because it varies with weekdays, holidays, seasons, weather, temperature, and other factors. No existing method can make long-term predictions with high precision from very long real-world series, and an inaccurate prediction may damage the electrical transformer. Without an efficient way to predict future electricity usage, managers currently have to make decisions based on an empirical number that is much higher than real-world demand. This causes unnecessary waste of electricity and equipment depreciation. On the other hand, the oil temperature reflects the condition of the electricity transformer. One of the most efficient strategies is therefore to predict whether the transformer's oil temperature will stay within a safe range, which avoids unnecessary waste. To address this problem, our team and Beijing Guowang Fuda Science & Technology Development Company built a real-world platform and collected 2 years of data. We use it to predict the electrical transformers' oil temperature and to investigate their extreme load capacity.

ETT-small:

We donated two years of data, in which each data point is recorded every 15 minutes (marked by m). The data come from two regions of a province of China and are named ETT-small-m1 and ETT-small-m2, respectively. Each dataset contains 2 years * 365 days * 24 hours * 4 times per hour = 70,080 data points. We also provide hourly-level variants for fast development (marked by h), i.e. ETT-small-h1 and ETT-small-h2. Each data point consists of 8 features: the date of the point, the prediction target "oil temperature", and 6 different types of external power load features.
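The nominal record count above follows directly from the sampling rate. A minimal sketch of that arithmetic, assuming 365-day years as in the text; the function name `expected_points` is illustrative and not part of the dataset tooling, and the actual files may differ in length from this nominal figure:

```python
def expected_points(years: int, samples_per_hour: int) -> int:
    """Nominal number of records for a regularly sampled series (365-day years)."""
    return years * 365 * 24 * samples_per_hour


# "m" variants: 4 records per hour; "h" variants: 1 record per hour.
minute_level = expected_points(2, 4)
hourly_level = expected_points(2, 1)
print(minute_level, hourly_level)  # prints: 70080 17520
```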



Figure 1. The overall view of "OT" in ETT-small.
Figure 2. The autocorrelation graph of all variables.

Specifically, the dataset combines short-term periodic patterns, long-term periodic patterns, long-term trends, and many irregular patterns. We first give an overall view in Figure 1, which shows evident seasonal trends. To better examine the existence of long-term and short-term repetitive patterns, we plot the autocorrelation graph for all the variables of the ETT-small-h1 dataset in Figure 2. The blue line in Figure 2 is the target "oil temperature", and it maintains some short-term local continuity. The other variables (power load), however, show a short-term daily pattern (every 24 hours) and a long-term weekly pattern (every 7 days).
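To make the autocorrelation analysis concrete, the following sketch computes the sample autocorrelation of a series at a given lag. It is an illustration only: the `load` series is a synthetic 24-hour sine cycle standing in for an hourly load feature, not real ETT data, and the helper name `autocorrelation` is an assumption rather than the code used to produce Figure 2.

```python
import math


def autocorrelation(series, lag):
    """Sample autocorrelation at `lag` (biased estimator, normalized by lag-0 variance)."""
    n = len(series)
    mean = sum(series) / n
    var = sum((x - mean) ** 2 for x in series)
    cov = sum((series[t] - mean) * (series[t + lag] - mean) for t in range(n - lag))
    return cov / var


# Synthetic stand-in for an hourly load feature: a pure daily cycle over 14 days.
hours = 24 * 14
load = [math.sin(2 * math.pi * t / 24) for t in range(hours)]

for lag in (12, 24, 168):
    print(lag, round(autocorrelation(load, lag), 3))
```

For this synthetic series the autocorrelation is strongly positive at a lag of 24 hours and strongly negative at a lag of 12 hours, which is the kind of daily signature described for the power load variables. With the biased estimator the values shrink as the lag grows, because fewer overlapping points contribute to the sum.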

We use the .csv file format to save the data. A demo of the ETT-small data is illustrated in Figure 3. The first line (8 columns) is the horizontal header and includes "date", "HUFL", "HULL", "MUFL", "MULL", "LUFL", "LULL" and "OT". The detailed meaning of each column name is shown in Table 1.



Figure 3. A demo of the ETT data.

Field    Description
date     The recorded date
HUFL     High UseFul Load
HULL     High UseLess Load
MUFL     Middle UseFul Load
MULL     Middle UseLess Load
LUFL     Low UseFul Load
LULL     Low UseLess Load
OT       Oil Temperature (target)

Table 1. Description of each column.
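As a rough illustration of the column layout, the sketch below parses a small in-memory CSV with the eight column names listed above and separates the six load features from the "OT" target. The sample values are made up for illustration, the timestamp format `%Y-%m-%d %H:%M:%S` is an assumption about how the date column is written, and `load_ett` is a hypothetical helper, not a function shipped with the dataset.

```python
import csv
import io
from datetime import datetime

EXPECTED_COLUMNS = ["date", "HUFL", "HULL", "MUFL", "MULL", "LUFL", "LULL", "OT"]

# Made-up sample rows for illustration only; these are not real ETT measurements.
SAMPLE_CSV = """date,HUFL,HULL,MUFL,MULL,LUFL,LULL,OT
2016-07-01 00:00:00,5.0,2.0,1.5,0.5,4.0,1.0,30.0
2016-07-01 01:00:00,5.5,2.1,1.6,0.4,4.1,1.2,28.0
"""


def load_ett(text, date_format="%Y-%m-%d %H:%M:%S"):
    """Parse ETT-style CSV text into a list of dicts (date_format is an assumption)."""
    reader = csv.DictReader(io.StringIO(text))
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    rows = []
    for record in reader:
        row = {"date": datetime.strptime(record["date"], date_format)}
        for name in EXPECTED_COLUMNS[1:]:
            row[name] = float(record[name])
        rows.append(row)
    return rows


rows = load_ett(SAMPLE_CSV)
# The six load columns are the inputs; "OT" is the prediction target.
features = [[r[c] for c in EXPECTED_COLUMNS[1:-1]] for r in rows]
targets = [r["OT"] for r in rows]
print(len(rows), features[0], targets)
```

To read an actual file, the same function could be given the file's contents, for example `load_ett(open(path).read())`, where `path` points to a downloaded .csv file.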
