On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC Systems

Eishi Arima, A. Isaías Comprés, Martin Schulz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Recent High-Performance Computing (HPC) systems are facing important challenges, such as massive power consumption, while at the same time significantly under-utilized system resources. Given the power consumption trends, future systems will be deployed in an over-provisioned manner where more resources are installed than they can afford to power simultaneously. In such a scenario, maximizing resource utilization and energy efficiency, while keeping a given power constraint, is pivotal. Driven by this observation, in this position paper we first highlight the recent trends of resource management techniques, with a particular focus on malleability support (i.e., dynamically scaling resource allocations/requirements for a job), co-scheduling (i.e., co-locating multiple jobs within a node), and power management. Second, we consider putting them together, assess their relationships/synergies, and discuss the functionality requirements in each software component for future over-provisioned and power-constrained HPC systems. Third, we briefly introduce our ongoing efforts on the integration of software tools, which will ultimately lead to the convergence of malleability and power management, as it is designed in the HPC PowerStack initiative.

Original languageEnglish
Title of host publicationHigh Performance Computing. ISC High Performance 2022 International Workshops - Revised Selected Papers
EditorsHartwig Anzt, Amanda Bienz, Piotr Luszczek, Marc Baboulin
PublisherSpringer Science and Business Media Deutschland GmbH
Pages206-217
Number of pages12
ISBN (Print)9783031232190
DOIs
StatePublished - 2022
Event37th International Conference on High Performance Computing , ISC High Performance 2022 - Hamburg, Germany
Duration: 29 May 20222 Jun 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13387 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference37th International Conference on High Performance Computing , ISC High Performance 2022
Country/TerritoryGermany
CityHamburg
Period29/05/222/06/22

Keywords

  • Co-scheduling
  • Dynamic resource management
  • Heterogeneity
  • Malleability
  • Over-provisioning
  • Power management

Fingerprint

Dive into the research topics of 'On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC Systems'. Together they form a unique fingerprint.

Cite this