TY - GEN
T1 - Enabling fair pricing on HPC systems with node sharing
AU - Breslow, Alex D.
AU - Tiwari, Ananta
AU - Schulz, Martin
AU - Carrington, Laura
AU - Tang, Lingjia
AU - Mars, Jason
PY - 2013
Y1 - 2013
N2 - Co-location, where multiple jobs share compute nodes in large-scale HPC systems, has been shown to increase aggre-gate throughput and energy efficiency by 10 to 20%. How-ever, system operators disallow co-location due to fair-pricing concerns, i.e., a pricing mechanism that considers perfor-mance interference from co-running jobs. In the current pricing model, application execution time determines the price, which results in unfair prices paid by the minority of users whose jobs suffer from co-location. This paper presents POPPA, a runtime system that en-ables fair pricing by delivering precise online interference de-tection and facilitates the adoption of supercomputers with co-locations. POPPA leverages a novel shutter mechanism - a cyclic, fine-grained interference sampling mechanism to accurately deduce the interference between co-runners - to provide unbiased pricing of jobs that share nodes. POPPA is able to quantify inter-application interference within 4% mean absolute error on a variety of co-located benchmark and real scientific workloads.
AB - Co-location, where multiple jobs share compute nodes in large-scale HPC systems, has been shown to increase aggre-gate throughput and energy efficiency by 10 to 20%. How-ever, system operators disallow co-location due to fair-pricing concerns, i.e., a pricing mechanism that considers perfor-mance interference from co-running jobs. In the current pricing model, application execution time determines the price, which results in unfair prices paid by the minority of users whose jobs suffer from co-location. This paper presents POPPA, a runtime system that en-ables fair pricing by delivering precise online interference de-tection and facilitates the adoption of supercomputers with co-locations. POPPA leverages a novel shutter mechanism - a cyclic, fine-grained interference sampling mechanism to accurately deduce the interference between co-runners - to provide unbiased pricing of jobs that share nodes. POPPA is able to quantify inter-application interference within 4% mean absolute error on a variety of co-located benchmark and real scientific workloads.
KW - Chip Multiprocessor
KW - Contention
KW - Online Pricing
KW - Resource Sharing
KW - Supercomputer Accounting
UR - http://www.scopus.com/inward/record.url?scp=84899688706&partnerID=8YFLogxK
U2 - 10.1145/2503210.2503256
DO - 10.1145/2503210.2503256
M3 - Conference contribution
AN - SCOPUS:84899688706
SN - 9781450323789
T3 - International Conference for High Performance Computing, Networking, Storage and Analysis, SC
BT - Proceedings of SC 2013
PB - IEEE Computer Society
T2 - 2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2013
Y2 - 17 November 2013 through 22 November 2013
ER -