Enabling fair pricing on HPC systems with node sharing

Alex D. Breslow, Ananta Tiwari, Martin Schulz, Laura Carrington, Lingjia Tang, Jason Mars

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

20 Scopus citations

Abstract

Co-location, where multiple jobs share compute nodes in large-scale HPC systems, has been shown to increase aggregate throughput and energy efficiency by 10 to 20%. However, system operators disallow co-location due to fair-pricing concerns, i.e., a pricing mechanism that considers performance interference from co-running jobs. In the current pricing model, application execution time determines the price, which results in unfair prices paid by the minority of users whose jobs suffer from co-location. This paper presents POPPA, a runtime system that enables fair pricing by delivering precise online interference detection and facilitates the adoption of supercomputers with co-locations. POPPA leverages a novel shutter mechanism - a cyclic, fine-grained interference sampling mechanism to accurately deduce the interference between co-runners - to provide unbiased pricing of jobs that share nodes. POPPA is able to quantify inter-application interference within 4% mean absolute error on a variety of co-located benchmark and real scientific workloads.
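
The abstract does not spell out how sampled interference turns into a price, so the following is only a minimal sketch of the general idea, not POPPA's actual algorithm. It assumes a job's progress rate can be sampled both while co-runners execute and during brief windows in which they are paused, and that the price is then discounted by the estimated slowdown. The function names, the progress-rate metric, and the discount rule are all hypothetical.

```python
from statistics import mean


def estimate_slowdown(co_run_rates, solo_rates):
    """Estimate the fraction of performance lost to co-runner interference.

    co_run_rates: progress rates (e.g. instructions per second) sampled
                  while co-runners are executing.
    solo_rates:   progress rates sampled during short windows in which
                  co-runners are paused (an assumed sampling scheme).
    """
    if not co_run_rates or not solo_rates:
        raise ValueError("both sample lists must be non-empty")
    solo = mean(solo_rates)
    if solo <= 0:
        raise ValueError("solo progress rate must be positive")
    shared = mean(co_run_rates)
    # Clamp at zero so a job that happens to run faster is not surcharged.
    return max(0.0, 1.0 - shared / solo)


def fair_price(elapsed_seconds, rate_per_second, slowdown):
    """Charge only for the time the job would have needed without interference.

    Hypothetical discount rule: a job running at (1 - slowdown) of its solo
    speed would have finished in elapsed_seconds * (1 - slowdown).
    """
    return elapsed_seconds * (1.0 - slowdown) * rate_per_second


if __name__ == "__main__":
    slowdown = estimate_slowdown([80.0, 82.0, 78.0], [100.0, 101.0, 99.0])
    print(f"estimated slowdown: {slowdown:.2%}")
    print(f"price: {fair_price(1000.0, 0.01, slowdown):.2f}")
```

Under this rule a user whose job is slowed by co-location pays roughly what the job would have cost on a dedicated node, which is the fairness property the abstract argues for.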

Original language: English
Title of host publication: Proceedings of SC 2013
Subtitle of host publication: The International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: IEEE Computer Society
ISBN (Print): 9781450323789
State: Published - 2013
Externally published: Yes
Event: 2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013 - Denver, CO, United States
Duration: 17 Nov 2013 to 22 Nov 2013

Publication series

Name: International Conference for High Performance Computing, Networking, Storage and Analysis, SC
ISSN (Print): 2167-4329
ISSN (Electronic): 2167-4337

Conference

Conference: 2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013
Country/Territory: United States
City: Denver, CO
Period: 17/11/13 to 22/11/13

Keywords

  • Chip Multiprocessor
  • Contention
  • Online Pricing
  • Resource Sharing
  • Supercomputer Accounting
