Automatic performance analysis of OpenMP codes on a scalable shared memory system using periscope

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

OpenMP is a successful interface for programming parallel applications on shared memory systems. It is widely applied on small scale shared memory systems such as multicore processors, but also in hybrid programming on large supercomputers. This paper presents performance properties for OpenMP and their automatic detection by Periscope. We evaluate Periscope's OpenMP analysis strategy in the context of the Altix 4700 supercomputer at Leibniz Computing Center (LRZ) in Garching. On this unique machine OpenMP scales up to 500 cores, one partition of in total 19 partitions. We present results for the NAS parallel benchmarks and for a large hybrid scientific application.

Original languageEnglish
Title of host publicationApplied Parallel and Scientific Computing - 10th International Conference, PARA 2010, Revised Selected Papers
Pages452-462
Number of pages11
EditionPART 2
DOIs
StatePublished - 2012
Event10th International Conference on Applied Parallel and Scientific Computing, PARA 2010 - Reykjavik, Iceland
Duration: 6 Jun 20109 Jun 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7134 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Conference on Applied Parallel and Scientific Computing, PARA 2010
Country/TerritoryIceland
CityReykjavik
Period6/06/109/06/10

Keywords

  • Memory accesses analysis
  • OpenMP
  • Performance analysis
  • Supercomputers

Fingerprint

Dive into the research topics of 'Automatic performance analysis of OpenMP codes on a scalable shared memory system using periscope'. Together they form a unique fingerprint.

Cite this