Flexible Data Aggregation for Performance Profiling

David Boehme, David Beckingsale, Martin Schulz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Almost all performance analysis tools in the HPC space perform some form of aggregation to compute summary information of a series of performance measurements, from summations to more complex operations like histograms. Aggregation not only reduces data volumes and consequently storage space requirements and overheads, but is also crucial to extract insights from recorded measurement data. In current tools, however, most aspects that control the aggregation, such as the data dimensions to be reduced, are hard-coded in the tool for a set of particular use cases identified by the tool developer and cannot be extended or modified by the user. This limits their flexibility and often results in users having to learn and use multiple tools with different aggregation options for their performance analysis needs.We present a novel approach for performance data aggregation based on a flexible key:value data model with user-defined attributes, where users can define custom aggregation schemes in a simple description language. This not only gives users the control to deploy the particular data aggregation they need, but also opens the door for aggregations along application-specific data dimensions that cannot be achieved with traditional profiling tools. We show how our approach can be applied for performance profiling at runtime, cross-process data aggregation, and interactive data analysis and demonstrate its functionality with several case studies driven by real world codes.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages419-428
Number of pages10
ISBN (Electronic)9781538623268
DOIs
StatePublished - 22 Sep 2017
Externally publishedYes
Event2017 IEEE International Conference on Cluster Computing, CLUSTER 2017 - Honolulu, United States
Duration: 5 Sep 20178 Sep 2017

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
Volume2017-September
ISSN (Print)1552-5244

Conference

Conference2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
Country/TerritoryUnited States
CityHonolulu
Period5/09/178/09/17

Keywords

  • Data aggregation
  • Parallel processing
  • Performance analysis
  • Performance profiling
  • Software tools

Fingerprint

Dive into the research topics of 'Flexible Data Aggregation for Performance Profiling'. Together they form a unique fingerprint.

Cite this