TY - GEN
T1 - Creating a tool set for optimizing topology-aware node mappings
AU - Schulz, Martin
AU - Bhatele, Abhinav
AU - Bremer, Peer Timo
AU - Gamblin, Todd
AU - Isaacs, Katherine
AU - Levine, Joshua A.
AU - Pascucci, Valerio
PY - 2012
Y1 - 2012
N2 - Modern HPC systems, such as Cray's XE and IBM's Blue Gene line, feature sophisticated network architectures, often in the form of high dimensional tori. In order to fully exploit the performance of these systems, it is necessary to carefully map an application's communication structure to the underlying network topology. In this step, both latency (i.e., physical distance between nodes) and bandwidth (i.e., number of concurrently used links) have to be taken into account, leading to mappings that are often non-intuitive. To help developers with this complex problem, we are developing a set of tools that aim at helping users understand the communication behavior of their codes, map them onto the network architecture, and create better-performing topology-aware node mappings. In this paper, we present initial steps towards this goal, including a measurement environment capturing both communication patterns and network metrics within the same run, a methodology to compare these measurements, and a visualization tool that helps users understand the impact of their application's characteristics on the network behavior.
AB - Modern HPC systems, such as Cray's XE and IBM's Blue Gene line, feature sophisticated network architectures, often in the form of high dimensional tori. In order to fully exploit the performance of these systems, it is necessary to carefully map an application's communication structure to the underlying network topology. In this step, both latency (i.e., physical distance between nodes) and bandwidth (i.e., number of concurrently used links) have to be taken into account, leading to mappings that are often non-intuitive. To help developers with this complex problem, we are developing a set of tools that aim at helping users understand the communication behavior of their codes, map them onto the network architecture, and create better-performing topology-aware node mappings. In this paper, we present initial steps towards this goal, including a measurement environment capturing both communication patterns and network metrics within the same run, a methodology to compare these measurements, and a visualization tool that helps users understand the impact of their application's characteristics on the network behavior.
UR - http://www.scopus.com/inward/record.url?scp=84885237902&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-31476-6_1
DO - 10.1007/978-3-642-31476-6_1
M3 - Conference contribution
AN - SCOPUS:84885237902
SN - 9783642314759
T3 - Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing 2011
SP - 1
EP - 12
BT - Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing 2011
T2 - 5th International Workshop on Parallel Tools for High Performance Computing, HPC 2011
Y2 - 26 September 2011 through 27 September 2011
ER -