TY - GEN
T1 - ScalaTrace
T2 - 10th International Conference on Applied Parallel and Scientific Computing, PARA 2010
AU - Mueller, Frank
AU - Wu, Xing
AU - Schulz, Martin
AU - De Supinski, Bronis R.
AU - Gamblin, Todd
PY - 2012
Y1 - 2012
N2 - Characterizing the communication behavior of large-scale applications is a difficult and costly task due to code/system complexity and their long execution times. An alternative to running actual codes is to gather their communication traces and then replay them, which facilitates application tuning and future procurements. While past approaches lacked lossless scalable trace collection, we contribute an approach that provides orders of magnitude smaller, if not near constant-size, communication traces regardless of the number of nodes while preserving structural information. We introduce intra- and inter-node compression techniques of MPI events, we develop a scheme to preserve time and causality of communication events, and we present results of our implementation for BlueGene/L. Given this novel capability, we discuss its impact on communication tuning and on trace extrapolation. To the best of our knowledge, such a concise representation of MPI traces in a scalable manner combined with time-preserving deterministic MPI call replay are without any precedence.
AB - Characterizing the communication behavior of large-scale applications is a difficult and costly task due to code/system complexity and their long execution times. An alternative to running actual codes is to gather their communication traces and then replay them, which facilitates application tuning and future procurements. While past approaches lacked lossless scalable trace collection, we contribute an approach that provides orders of magnitude smaller, if not near constant-size, communication traces regardless of the number of nodes while preserving structural information. We introduce intra- and inter-node compression techniques of MPI events, we develop a scheme to preserve time and causality of communication events, and we present results of our implementation for BlueGene/L. Given this novel capability, we discuss its impact on communication tuning and on trace extrapolation. To the best of our knowledge, such a concise representation of MPI traces in a scalable manner combined with time-preserving deterministic MPI call replay are without any precedence.
KW - High-Performance Computing
KW - Message Passing
KW - Tracing
UR - http://www.scopus.com/inward/record.url?scp=84863124036&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-28145-7_40
DO - 10.1007/978-3-642-28145-7_40
M3 - Conference contribution
AN - SCOPUS:84863124036
SN - 9783642281440
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 410
EP - 418
BT - Applied Parallel and Scientific Computing - 10th International Conference, PARA 2010, Revised Selected Papers
Y2 - 6 June 2010 through 9 June 2010
ER -