Waveform signal entropy and compression study of whole-building energy datasets

Thomas Kriechbaumer, Daniel Jorde, Hans Arno Jacobsen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

Electrical energy consumption has been an ongoing research area since the coming of smart homes and Internet of Things. Consumption characteristics and usages profiles are directly influenced by building occupants and their interaction with electrical appliances. Data analysis together with machine learning models can be utilized to extract valuable information for the benefit of occupants themselves (conserve energy and increase comfort levels), power plants (maintenance), and grid operators (stability). Public energy datasets provide a scientific foundation to develop and benchmark these algorithms and techniques. With datasets exceeding tens of terabytes, we present a novel study of five whole-building energy datasets with high sampling rates, their signal entropy, and how a well-calibrated measurement can have a significant effect on the overall storage requirements. We show that some datasets do not fully utilize the available measurement precision, therefore leaving potential accuracy and space savings untapped. We benchmark a comprehensive list of 365 file formats, transparent data transformations, and lossless compression algorithms. The primary goal is to reduce the overall dataset size while maintaining an easy-to-use file format and access API. We show that with careful selection of file format and encoding scheme,we can reduce the size of some datasets by up to 73%.

Original languageEnglish
Title of host publicatione-Energy 2019 - Proceedings of the 10th ACM International Conference on Future Energy Systems
PublisherAssociation for Computing Machinery, Inc
Pages58-67
Number of pages10
ISBN (Electronic)9781450366717
DOIs
StatePublished - 15 Jun 2019
Event10th ACM International Conference on Future Energy Systems, e-Energy 2019 - Phoenix, United States
Duration: 25 Jun 201928 Jun 2019

Publication series

Namee-Energy 2019 - Proceedings of the 10th ACM International Conference on Future Energy Systems

Conference

Conference10th ACM International Conference on Future Energy Systems, e-Energy 2019
Country/TerritoryUnited States
CityPhoenix
Period25/06/1928/06/19

Keywords

  • Electricity aggregate
  • Energy dataset
  • File format
  • High sampling rate
  • Non-intrusive load monitoring
  • Waveform compression

Fingerprint

Dive into the research topics of 'Waveform signal entropy and compression study of whole-building energy datasets'. Together they form a unique fingerprint.

Cite this