Memory usage optimizations for online event analysis

Tobias Hilbrich, Joachim Protze, Michael Wagner, Matthias S. Müller, Martin Schulz, Bronis R. de Supinski, Wolfgang E. Nagel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Tools are essential for application developers and system support personnel during tasks such as performance optimization and debugging of massively parallel applications. An important class are event-based tools that analyze relevant events during the runtime of an application, e.g., function invocations or communication operations. We develop a parallel tools infrastructure that supports both the observation and analysis of application events at runtime. Some analyses—e.g., deadlock detection algorithms—require complex processing and apply to many types of frequently occurring events. For situations where the rate at which an application generates new events exceeds the processing rate of the analysis, we experience tool instability or even failures, e.g., memory exhaustion. Tool infrastructures must provide means to avoid or mitigate such situations. This paper explores two such techniques: first, a heuristic that selects events to receive and process next; second, a pause mechanism that temporarily suspends the execution of an application. An application study with applications from the SPEC MPI2007 benchmark suite and the NAS parallel benchmarks evaluates these techniques at up to 16,384 processes and illustrates how they avoid memory exhaustion problems that limited the applicability of a runtime correctness tool in the past.

Original languageEnglish
Title of host publicationSolving Software Challenges for Exascale - International Conference on Exascale Applications and Software, EASC 2014, Revised Selected Papers
EditorsStefano Markidis, Erwin Laure
PublisherSpringer Verlag
Pages110-121
Number of pages12
ISBN (Electronic)9783319159751
DOIs
StatePublished - 2015
Externally publishedYes
EventInternational Conference on Exascale Applications and Software, EASC 2014 - Stockholm, Sweden
Duration: 2 Apr 20143 Apr 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8759
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Exascale Applications and Software, EASC 2014
Country/TerritorySweden
CityStockholm
Period2/04/143/04/14

Fingerprint

Dive into the research topics of 'Memory usage optimizations for online event analysis'. Together they form a unique fingerprint.

Cite this