A Case Study on PMIx-Usage for Dynamic Resource Management

Dominik Huber, Martin Schreiber, Martin Schulz

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

Abstract

With the increasing scale of HPC supercomputers efficient resource utilization on such systems becomes even more important. In this context, dynamic resource management is a very active research field, as it is expected to improve several metrics of resource utilization on HPC systems, such as job throughput and energy efficiency. However, dynamic resource management is complex and requires significant changes to various layers of the software stack including resource- and process management, programming models and applications. So far, approaches for resource management are often specific to a particular implementation of the resource management and process management software, thus hindering interoperability, composability and comparability of such approaches. In this paper, we discuss the usage of the Process Management Interface - Exascale (PMIx) Standard for interactions between the process manager and the resource manager. We describe an architecture that allows the resource manager to connect to the process manager as PMIx Tool to have access to a set of PMIx services useful for resource management. In a concrete case-study we connect a python- and PMIx-based resource manager to PRRTE and assess the applicability of this architecture for debugging and exploration of dynamic resource management techniques. We conclude that a PMIx-based architecture can simplify the process of exploring new dynamic and disruptive resource management mechanisms while improving composability.

OriginalspracheEnglisch
TitelHigh Performance Computing - ISC High Performance 2023 International Workshops, Revised Selected Papers
Redakteure/-innenAmanda Bienz, Michèle Weiland, Marc Baboulin, Carola Kruse
Herausgeber (Verlag)Springer Science and Business Media Deutschland GmbH
Seiten42-55
Seitenumfang14
ISBN (Print)9783031408427
DOIs
PublikationsstatusVeröffentlicht - 2023
Veranstaltung38th International Conference on High Performance Computing, ISC High Performance 2023 - Hamburg, Deutschland
Dauer: 21 Mai 202325 Mai 2023

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band13999 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz38th International Conference on High Performance Computing, ISC High Performance 2023
Land/GebietDeutschland
OrtHamburg
Zeitraum21/05/2325/05/23

Fingerprint

Untersuchen Sie die Forschungsthemen von „A Case Study on PMIx-Usage for Dynamic Resource Management“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren