Preserving time in large-scale communication traces

Prasun Ratn, Frank Mueller, Bronis R. De Supinski, Martin Schulz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

36 Scopus citations

Abstract

Analyzing the performance of large-scale scientific applications is becoming increasingly difficult due to the sheer size of performance data gathered. Recent work on scalable communication tracing applies online interprocess compression to address this problem. Yet, analysis of communication traces requires knowledge about time progression that cannot trivially be encoded in a scalable manner during compression. We develop scalable time stamp encoding schemes for communication traces. At the same time, our work contributes novel insights into the scalable representation of time stamped data. We show that our representations capture sufficient information to enable what-if explorations of architectural variations and analysis for path-based timing irregularities while not requiring excessive disk space. We evaluate the ability of several time-stamped compressed MPI trace approaches to enable accurate timed replay of communication events. Our lossless traces are orders of magnitude smaller, if not near constant size, regardless of the number of nodes while preserving timing information suitable for application tuning or assessing requirements of future procurements. Our results prove timepreserving tracing without loss of communication information can scale in the number of nodes and time steps, which is a result without precedent.

Original languageEnglish
Title of host publicationICS'08 - Proceedings of the 2008 ACM International Conference on Supercomputing
Pages46-55
Number of pages10
DOIs
StatePublished - 2008
Externally publishedYes
Event22nd ACM International Conference on Supercomputing, ICS'08 - Island of Kos, Greece
Duration: 7 Jun 200812 Jun 2008

Publication series

NameProceedings of the International Conference on Supercomputing

Conference

Conference22nd ACM International Conference on Supercomputing, ICS'08
Country/TerritoryGreece
CityIsland of Kos
Period7/06/0812/06/08

Keywords

  • High-Performance Computing
  • Message Passing
  • Tracing

Fingerprint

Dive into the research topics of 'Preserving time in large-scale communication traces'. Together they form a unique fingerprint.

Cite this