TY - JOUR
T1 - Dynamic binary instrumentation and data aggregation on large scale systems
AU - Lee, Gregory L.
AU - Schulz, Martin
AU - Ahn, Dong H.
AU - Bernat, Andrew
AU - de Supinski, Bronis R.
AU - Ko, Steven Y.
AU - Rountree, Barry
PY - 2007/6
Y1 - 2007/6
N2 - Dynamic binary instrumentation for performance analysis on large scale architectures such as the IBM Blue Gene/L system (BG/L) poses unique challenges. Their unprecedented scale and often limited OS support require new mechanisms to organize binary instrumentation, to interact with the target application, and to collect the resulting data. We describe the design and current status of a new implementation of the Dynamic Probe Class Library (DPCL) API for large scale systems. DPCL provides an easy-to-use layer for dynamic instrumentation on parallel MPI applications based on the DynInst dynamic instrumentation library for sequential platforms. Our work includes modifying DynInst to control instrumentation from remote I/O nodes and porting DPCL's communication for performance data collection to use MRNet, a tree-based overlay network (TBON) that supports scalable multicast and data reduction. We describe extensions to the DPCL API that support instrumentation of task subsets and aggregation of collected performance data.
AB - Dynamic binary instrumentation for performance analysis on large scale architectures such as the IBM Blue Gene/L system (BG/L) poses unique challenges. Their unprecedented scale and often limited OS support require new mechanisms to organize binary instrumentation, to interact with the target application, and to collect the resulting data. We describe the design and current status of a new implementation of the Dynamic Probe Class Library (DPCL) API for large scale systems. DPCL provides an easy-to-use layer for dynamic instrumentation on parallel MPI applications based on the DynInst dynamic instrumentation library for sequential platforms. Our work includes modifying DynInst to control instrumentation from remote I/O nodes and porting DPCL's communication for performance data collection to use MRNet, a tree-based overlay network (TBON) that supports scalable multicast and data reduction. We describe extensions to the DPCL API that support instrumentation of task subsets and aggregation of collected performance data.
KW - Binary instrumentation
KW - Massively parallel architectures
KW - Performance analysis tools
KW - Scalable data collection
UR - http://www.scopus.com/inward/record.url?scp=34250163556&partnerID=8YFLogxK
U2 - 10.1007/s10766-007-0036-3
DO - 10.1007/s10766-007-0036-3
M3 - Article
AN - SCOPUS:34250163556
SN - 0885-7458
VL - 35
SP - 207
EP - 232
JO - International Journal of Parallel Programming
JF - International Journal of Parallel Programming
IS - 3
ER -