Using machine learning for data center cooling infrastructure efficiency prediction

Hayk Shoukourian, Torsten Wilde, Detlef Labrenz, Arndt Bode

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

20 Scopus citations

Abstract

Power consumption continues to remain a critical aspect for High Performance Computing (HPC) data centers. It becomes even more crucial for Exascale computing since scaling today's fastest system to an Exaflop level would consume more than 168 MW power which is 8 times higher than the 20 MW power consumption goal set, at the time of this publication, by the US Department of Energy. This naturally leads to a necessity for energy efficiency improvement that will encompass the full chain of the power consumers, starting from the data center infrastructure, including cooling overheads and electrical losses, up to compute resource scheduling and application scaling. In this paper a machine learning approach is proposed to model the Coefficient of Performance (COP) of HPC data center's hot water cooling loop. The suggested model is validated on operational data obtained at Leibniz Supercomputing Centre (LRZ). The paper shows how this COP model can help to improve the energy efficiency of modern HPC data centers.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages954-963
Number of pages10
ISBN (Electronic)9781538634080
DOIs
StatePublished - 30 Jun 2017
Externally publishedYes
Event31st IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017 - Orlando, United States
Duration: 29 May 20172 Jun 2017

Publication series

NameProceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017

Conference

Conference31st IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017
Country/TerritoryUnited States
CityOrlando
Period29/05/172/06/17

Keywords

  • COP
  • data center
  • energy efficiency
  • high performance computing
  • machine learning

Fingerprint

Dive into the research topics of 'Using machine learning for data center cooling infrastructure efficiency prediction'. Together they form a unique fingerprint.

Cite this