Event-action mappings for parallel tools infrastructures

Tobias Hilbrich, Martin Schulz, Holger Brunst, Joachim Protze, Bronis R. de Supinski, Matthias S. Müller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The development of applications for High Performance Computing (HPC) systems is a challenging task. Development steps such as optimization, tuning, porting, and debugging often motivate the use of tools, many of which operate at application runtime. Current trends in the HPC community, such as increasing compute core counts and the advent of new programming paradigms challenge the development of applications, as well as the development of runtime tools. Parallel tools infrastructures can help to simplify the development and adaption of runtime tools by reducing development time and increasing applicability. They can provide reusable tool components, communication services, and abstractions for scalable tools, which preserve lessons learned from existing tools projects. This paper defines an abstraction for a highly integrated infrastructure, which we implement in a prototype that targets MPI applications. Our abstraction enables an incorporation of common tasks such as instrumentation, i.e., observing application behavior, with existing concepts for tool communication, while at the same time enabling scalability. A formal description of our abstraction allows us to highlight its design and to differentiate it from alternatives, so tool developers have a clear understanding of the high-level approach that our infrastructure follows. Existing prototype tools that are based on this infrastructure demonstrate applicability at 1,024 and 16,384 processes respectively.

Original languageEnglish
Title of host publicationEuro-Par 2015
Subtitle of host publicationParallel Processing - 21st International Conference on Parallel and Distributed Computing, Proceedings
EditorsJesper Larsson Traff, Sascha Hunold, Francesco Versaci
PublisherSpringer Verlag
Pages43-54
Number of pages12
ISBN (Print)9783662480953
DOIs
StatePublished - 2015
Externally publishedYes
Event21st International Conference on Parallel and Distributed Computing, Euro-Par 2015 - Vienna, Austria
Duration: 24 Aug 201528 Aug 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9233
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Parallel and Distributed Computing, Euro-Par 2015
Country/TerritoryAustria
CityVienna
Period24/08/1528/08/15

Fingerprint

Dive into the research topics of 'Event-action mappings for parallel tools infrastructures'. Together they form a unique fingerprint.

Cite this