Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions

Yi Ju, Dominik Huber, Adalberto Perez, Philipp Ulbl, Stefano Markidis, Philipp Schlatter, Martin Schulz, Martin Schreiber, Erwin Laure

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The computational power of High-Performance Computing (HPC) systems increases continuously and rapidly. Data-intensive applications are designed to leverage the high computational capacity of HPC resources and typically generate a large amount of data for traditional post-processing data analytics. However, the HPC systems’ in-/output (IO) subsystem develops relatively slowly, and the storage capacity is limited. This could lead to limited actual performance and scientific discovery. In-situ techniques are a partial remedy to these problems by reducing or avoiding the data flow through the IO subsystem to/from the storage. However, in current practice, asynchronous in-situ techniques with static resource management often allocate separate computing resources for executing in-situ task(s), which remain idle if no in-situ work is at hand. In the present work, we target improving the efficiency of computing resource usage by launching and releasing necessary additional computing resources for in-situ task(s). Our approach is based on extensions for MPI Sessions that enable the required dynamic resource management. In this paper, we propose a basic and an advanced in-situ techniques with dynamic resource management enabled by MPI Sessions, their implementations on two real-world use cases, and a critical analysis of the experimental results.

Original languageEnglish
Title of host publicationRecent Advances in the Message Passing Interface - 31st European MPI Users’ Group Meeting, EuroMPI 2024, Proceedings
EditorsClaudia Blaas-Schenner, Christoph Niethammer, Tobias Haas
PublisherSpringer Science and Business Media Deutschland GmbH
Pages105-120
Number of pages16
ISBN (Print)9783031733697
DOIs
StatePublished - 2025
Event31st European MPI Users’ Group Meeting, EuroMPI 2024 - Perth, Australia
Duration: 25 Sep 202427 Sep 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15267 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference31st European MPI Users’ Group Meeting, EuroMPI 2024
Country/TerritoryAustralia
CityPerth
Period25/09/2427/09/24

Keywords

  • Dynamic resource management
  • HPC
  • In-situ
  • MPI Session

Fingerprint

Dive into the research topics of 'Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions'. Together they form a unique fingerprint.

Cite this