autopin - Automated optimization of thread-to-core pinning on multicore systems

Tobias Klug, Michael Ott, Josef Weidendorfer, Carsten Trinitis

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

26 Scopus citations

Abstract

In this paper we present a framework for automatic detection and application of the best binding between threads of a running parallel application and processor cores in a shared memory system, by making use of hardware performance counters. This is especially important within the scope of multicore architectures with shared cache levels. We demonstrate that many applications from the SPEC OMP benchmark show quite sensitive runtime behavior depending on the thread/core binding used. In our tests, the proposed framework is able to find the best binding in nearly all cases. The proposed framework is intended to supplement job scheduling systems for better automatic exploitation of systems with multicore processors, as well as making programmers aware of this issue by providing measurement logs.
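The search the abstract describes (try candidate thread/core bindings, measure each, keep the best) can be illustrated with a minimal sketch. This is not the authors' implementation. The names `pin_current_thread`, `best_pinning`, and `toy_measure` are hypothetical. The scoring function stands in for real hardware performance counter readings and simply assumes that cores 0/1 and cores 2/3 each share a cache.

```python
import os
from typing import Callable, Iterable, Optional, Tuple

Pinning = Tuple[int, ...]  # pinning[i] is the core assigned to thread i


def pin_current_thread(core: int) -> bool:
    """Bind the caller to one core; return False where the API is missing."""
    if not hasattr(os, "sched_setaffinity"):  # only available on some Unix systems
        return False
    os.sched_setaffinity(0, {core})  # pid 0 refers to the caller
    return True


def best_pinning(candidates: Iterable[Pinning],
                 measure: Callable[[Pinning], float]) -> Pinning:
    """Return the candidate pinning with the highest measured score."""
    best: Optional[Pinning] = None
    best_score = float("-inf")
    for pinning in candidates:
        score = measure(pinning)  # assumed: higher score means better performance
        if score > best_score:
            best, best_score = pinning, score
    if best is None:
        raise ValueError("no candidate pinnings given")
    return best


def toy_measure(pinning: Pinning) -> float:
    """Hypothetical stand-in for a counter-based measurement.

    Assumes two threads that share data, and that cores 0/1 and 2/3
    each share a cache level, so co-located threads score higher.
    """
    same_cache = pinning[0] // 2 == pinning[1] // 2
    return 2.0 if same_cache else 1.0


candidates = [(0, 2), (0, 1), (1, 3)]
chosen = best_pinning(candidates, toy_measure)
```

In a real setting, `measure` would apply the pinning to the running threads, sample a performance counter over an interval, and return the observed rate. The exhaustive loop is kept only because the candidate list here is tiny.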

Original language: English
Title of host publication: Transactions on High-Performance Embedded Architectures and Compilers III
Publisher: Springer Verlag
Pages: 219-235
Number of pages: 17
ISBN (Print): 9783642194474
State: Published - 2011

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 6590
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Keywords

  • CMP
  • CPU binding
  • Multicore
  • automatic performance optimization
  • hardware performance counters
  • thread placement
