All Projects → buds-lab → building-data-genome-project-2

buds-lab / building-data-genome-project-2

Licence: MIT license
Whole building non-residential hourly energy meter data from the Great Energy Predictor III competition

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to building-data-genome-project-2

open-energy-view
View resource consumption trends, history, analysis, and insights.
Stars: ✭ 32 (-71.43%)
Mutual labels:  energy-consumption, electricity-consumption, electricity-meter
scout
A tool for estimating the future energy use, carbon emissions, and capital and operating cost impacts of energy efficiency and demand flexibility technologies in the U.S. residential and commercial building sectors.
Stars: ✭ 34 (-69.64%)
Mutual labels:  energy-consumption, energy-efficiency, building-energy
ashrae-great-energy-predictor-3-solution-analysis
Analysis of top give winning solutions of the ASHRAE Great Energy Predictor III competition
Stars: ✭ 44 (-60.71%)
Mutual labels:  energy-consumption, building-energy
evergy
A simple utility that you can use to access your Evergy account and retrieve you meter readings.
Stars: ✭ 22 (-80.36%)
Mutual labels:  electricity-consumption, electricity-meter
Node-Linky
A simple node to connect to Enedis Linky smart-meter to fetch your datas
Stars: ✭ 29 (-74.11%)
Mutual labels:  smart-meter, electricity-consumption
Covid 19 Repo Data
Data archive of identifiable COVID-19 related public projects on GitHub
Stars: ✭ 236 (+110.71%)
Mutual labels:  open-data
MADBike
This is the public repository of the MADBike app for iOS. Public bike rental service for BiciMAD.
Stars: ✭ 23 (-79.46%)
Mutual labels:  open-data
Awesome Portugal Data
🇵🇹 Lista de repositórios de dados abertos em Portugal
Stars: ✭ 209 (+86.61%)
Mutual labels:  open-data
Graphql Camara Deputados
API GraphQL com os dados da câmara de deputados do Brasil
Stars: ✭ 204 (+82.14%)
Mutual labels:  open-data
osm-extracts
Each day, OSM Extracts by Interline mirrors the entire OpenStreetMap planet and creates city and region sized extracts
Stars: ✭ 34 (-69.64%)
Mutual labels:  open-data
CityScoreToolkit
Open-source version of Boston's CityScore performance dashboard
Stars: ✭ 42 (-62.5%)
Mutual labels:  open-data
api sof
Tutorial para acessar a API do Sistema de Orçamento e Finanças _SOF da cidade de São Paulo, utilizando Python e a biblioteca Pandas, realizar análises e salvar arquivo CSV/Excel
Stars: ✭ 31 (-72.32%)
Mutual labels:  open-data
Common Voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Stars: ✭ 2,891 (+2481.25%)
Mutual labels:  open-data
patzilla
PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (-36.61%)
Mutual labels:  open-data
City Scrapers
Scrape, standardize and share public meetings from local government websites
Stars: ✭ 220 (+96.43%)
Mutual labels:  open-data
wbstats
wbstats: An R package for searching and downloading data from the World Bank API
Stars: ✭ 106 (-5.36%)
Mutual labels:  open-data
Scihub
Source code and data analyses for the Sci-Hub Coverage Study
Stars: ✭ 205 (+83.04%)
Mutual labels:  open-data
LDWizard
A generic framework for simplifying the creation of linked data.
Stars: ✭ 17 (-84.82%)
Mutual labels:  open-data
berlin-open-source-portal
Showcase of Open Source Software that is built, maintained and/or funded by Berlin state governmental agencies
Stars: ✭ 21 (-81.25%)
Mutual labels:  open-data
data.world-r
R library for data.world
Stars: ✭ 59 (-47.32%)
Mutual labels:  open-data

logo

DOI

The Building Data Genome 2 (BDG2) Data-Set

Data-set description

BDG2 is an open data set made up of 3,053 energy meters from 1,636 buildings. The time range of the times-series data is the two full years (2016 and 2017) and the frequency is hourly measurements of electricity, heating and cooling water, steam, and irrigation meters. A subset of the data was used in the Great Energy Predictor III (GEPIII) competition hosted by the ASHRAE organization in late 2019. A full overview of the GEPIII competition can be found in a Science and Technology for the Built Environment Journal - Preprint found on arXiv

The GEPIII sub-set includes hourly data from 2,380 meters from 1,449 buildings that were used in a machine learning competition for long-term prediction with an application to measurement and verification in the building energy analysis domain. This data set can be used to benchmark various statistical learning algorithms and other data science techniques. It can also be used simply as a teaching or learning tool to practice dealing with measured performance data from large numbers of non-residential buildings. The charts below illustrate the breakdown of the buildings according to primary use category and subcategory, industry and subindustry, timezone and meter type.

cat_features

Getting Started

We recommend you download the Anaconda Python Distribution and use Jupyter to get an understanding of the data.

  • Temporal meters data are found in /data/meters/
  • Metadata is found in data/metadata/
  • To join all meters raw data into one dataset follow this notebook

Example notebooks are found in /notebooks/ -- a few good overview examples:

Detailed Documentation

The detailed documentation of how this data set was created can be found in the repository's wiki and in the following publication:

Citation of BDG2 Data-Set

Miller, C., Kathirgamanathan, A., Picchetti, B. et al. The Building Data Genome Project 2, energy meter data from the ASHRAE Great Energy Predictor III competition. Sci Data 7, 368 (2020). https://doi.org/10.1038/s41597-020-00712-x



@ARTICLE{Miller2020-yc,
  title     = "The Building Data Genome Project 2, energy meter data from the
               {ASHRAE} Great Energy Predictor {III} competition",
  author    = "Miller, Clayton and Kathirgamanathan, Anjukan and Picchetti,
               Bianca and Arjunan, Pandarasamy and Park, June Young and Nagy,
               Zoltan and Raftery, Paul and Hobson, Brodie W and Shi, Zixiao
               and Meggers, Forrest",
  abstract  = "This paper describes an open data set of 3,053 energy meters
               from 1,636 non-residential buildings with a range of two full
               years (2016 and 2017) at an hourly frequency (17,544
               measurements per meter resulting in approximately 53.6 million
               measurements). These meters were collected from 19 sites across
               North America and Europe, with one or more meters per building
               measuring whole building electrical, heating and cooling water,
               steam, and solar energy as well as water and irrigation meters.
               Part of these data was used in the Great Energy Predictor III
               (GEPIII) competition hosted by the American Society of Heating,
               Refrigeration, and Air-Conditioning Engineers (ASHRAE) in
               October-December 2019. GEPIII was a machine learning competition
               for long-term prediction with an application to measurement and
               verification. This paper describes the process of data
               collection, cleaning, and convergence of time-series meter data,
               the meta-data about the buildings, and complementary weather
               data. This data set can be used for further prediction
               benchmarking and prototyping as well as anomaly detection,
               energy analysis, and building type classification.
               Machine-accessible metadata file describing the reported data:
               https://doi.org/10.6084/m9.figshare.13033847",
  journal   = "Scientific Data",
  publisher = "Nature Publishing Group",
  volume    =  7,
  pages     = "368",
  month     =  oct,
  year      =  2020,
  language  = "en"
}


Preprints

Publications or Projects that use BDG2 data-set

Please update this list if you add notebooks or R-Markdown files to the notebook folder. Naming convention is a number (for ordering), the creator's initials, and a short - delimited description, e.g. 1.0-jqp-initial-data-exploration.

  • (publication here)

Repository structure

building-data-genome-project-2
├─ README.md              <- BDG2 README for developers using this data-set
└─ data
|   ├─metadata            <- buildings metadata
|   ├─ weather            <- weather data
|   └─ meters
|       └─ raw            <- all meter reading datasets
|       └─ cleaned        <- cleaned meter data based on several filtering steps
|       └─ kaggle         <- the 2017 meter data that aligns with the Kaggle competition
├─ notebooks              <- Jupyter notebooks, named after the naming convention
└─ figures                <- figures created during exploration of BDG 2.0 Data-set
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].