TY - GEN
T1 - Detecting patterns in MPI communication traces
AU - Preissl, Robert
AU - Köckerbauer, Thomas
AU - Schulz, Martin
AU - Kranzlmüller, Dieter
AU - De Supinski, Bronis R.
AU - Quinlan, Daniel J.
PY - 2008
Y1 - 2008
AB - Since processor counts in supercomputers are increasing dramatically, efficient interprocessor communication is becoming even more important for the applications that run on them. A high-level, abstract understanding of an application's communication behavior would not only simplify debugging of that communication but would also support more directed performance optimization. We explore automated identification of communication patterns to provide that high-level abstraction. We introduce an algorithm to extract communication patterns from MPI traces automatically. Our algorithm first finds locally repeating sequences and then iteratively grows them into global patterns. We demonstrate our technique on three realistic codes using traces from up to 128 processors. Our results show that our approach detects the underlying communication pattern within reasonable time and memory constraints, even for large trace sizes.
UR - http://www.scopus.com/inward/record.url?scp=55849125865&partnerID=8YFLogxK
U2 - 10.1109/ICPP.2008.71
DO - 10.1109/ICPP.2008.71
M3 - Conference contribution
AN - SCOPUS:55849125865
SN - 9780769533742
T3 - Proceedings of the International Conference on Parallel Processing
SP - 230
EP - 237
BT - Proceedings - 37th International Conference on Parallel Processing, ICPP 2008
T2 - 37th International Conference on Parallel Processing, ICPP 2008
Y2 - 9 September 2008 through 12 September 2008
ER -