Automatic performance analysis of OpenMP codes on a scalable shared memory system using periscope

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

8 Zitate (Scopus)

Abstract

OpenMP is a successful interface for programming parallel applications on shared memory systems. It is widely applied on small scale shared memory systems such as multicore processors, but also in hybrid programming on large supercomputers. This paper presents performance properties for OpenMP and their automatic detection by Periscope. We evaluate Periscope's OpenMP analysis strategy in the context of the Altix 4700 supercomputer at Leibniz Computing Center (LRZ) in Garching. On this unique machine OpenMP scales up to 500 cores, one partition of in total 19 partitions. We present results for the NAS parallel benchmarks and for a large hybrid scientific application.

OriginalspracheEnglisch
TitelApplied Parallel and Scientific Computing - 10th International Conference, PARA 2010, Revised Selected Papers
Seiten452-462
Seitenumfang11
AuflagePART 2
DOIs
PublikationsstatusVeröffentlicht - 2012
Veranstaltung10th International Conference on Applied Parallel and Scientific Computing, PARA 2010 - Reykjavik, Island
Dauer: 6 Juni 20109 Juni 2010

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NummerPART 2
Band7134 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz10th International Conference on Applied Parallel and Scientific Computing, PARA 2010
Land/GebietIsland
OrtReykjavik
Zeitraum6/06/109/06/10

Fingerprint

Untersuchen Sie die Forschungsthemen von „Automatic performance analysis of OpenMP codes on a scalable shared memory system using periscope“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren