Caliper: Performance Introspection for HPC Software Stacks

David Boehme, Todd Gamblin, David Beckingsale, Peer Timo Bremer, Alfredo Gimenez, Matthew Legendre, Olga Pearce, Martin Schulz

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

89 Scopus citations

Abstract

Many performance engineering tasks, from long-term performance monitoring to post-mortem analysis and online tuning, require efficient runtime methods for introspection and performance data collection. To understand interactions between components in increasingly modular HPC software, performance introspection hooks must be integrated into runtime systems, libraries, and application codes across the software stack. This requires an interoperable, cross-stack, general-purpose approach to performance data collection, which neither application-specific performance measurement nor traditional profile or trace analysis tools provide. With Caliper, we have developed a general abstraction layer to provide performance data collection as a service to applications, runtime systems, libraries, and tools. Individual software components connect to Caliper in independent data producer, data consumer, and measurement control roles, which allows them to share performance data across software stack boundaries. We demonstrate Caliper's performance analysis capabilities with two case studies of production scenarios.

Original language: English
Title of host publication: Proceedings of SC 2016
Subtitle of host publication: The International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society
Pages: 550-560
Number of pages: 11
ISBN (Electronic): 9781467388153
State: Published - 2 Jul 2016
Externally published: Yes
Event: 2016 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016 - Salt Lake City, United States
Duration: 13 Nov 2016 to 18 Nov 2016

Publication series

Name: International Conference for High Performance Computing, Networking, Storage and Analysis, SC
Volume: 0
ISSN (Print): 2167-4329
ISSN (Electronic): 2167-4337

Conference

Conference: 2016 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016
Country/Territory: United States
City: Salt Lake City
Period: 13/11/16 to 18/11/16

Keywords

  • Computer performance
  • High performance computing
  • Parallel processing
  • Performance analysis
  • Software performance
  • Software reusability
  • Software tools
