Automated pipeline framework for processing of large-scale building energy time series data

Arash Khalilnejad, Ahmad M. Karimi, Shreyas Kamath, Rojiar Haddadian, Roger H. French, Alexis R. Abramson

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for “virtual” energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings’ time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building’s daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale.

Original languageEnglish
Article numbere0240461
JournalPLoS ONE
Volume15
Issue number12 December
DOIs
StatePublished - Dec 2020
Externally publishedYes

Funding

This work was supported by the U.S. Department of Energy, Advanced Research Projects Agency-Energy (ARPA-E), under award DE-AR-0000668. All the authors were funded or partially funded by this award. The detailed information is given in: https://arpa-e.energy.gov/ technologies/projects/virtual-building-energy-audits. This research was performed in the SDLE Research Center, established with Ohio Third Frontier funding under award Tech 11-060, Tech 12-004, and the Great Lakes Energy Institute, both at Case Western Reserve University. This work made use of the Rider High Performance Computing Resource in the Core Facility for Advanced Research Computing at Case Western Reserve University. The authors acknowledge useful discussions with Tian Wang.

FundersFunder number
Advanced Research Projects Agency-EnergyTech 12-004, Tech 11-060, DE-AR-0000668
Great Lakes Energy Institute
U.S. Department of Energy
Case Western Reserve University

    Fingerprint

    Dive into the research topics of 'Automated pipeline framework for processing of large-scale building energy time series data'. Together they form a unique fingerprint.

    Cite this