Zur Hauptnavigation wechseln Zur Suche wechseln Zum Hauptinhalt wechseln

A hybrid framework for fast and accurate GPU performance estimation through source-level analysis and trace-based simulation

  • Technische Universität München
  • Sun Yat-Sen University
  • University of Southern California

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

22 Zitate (Scopus)

Abstract

This paper proposes a hybrid framework for fast and accurate performance estimation of OpenCL kernels running on GPUs. The kernel execution flow is statically analyzed and thereupon the execution trace is generated via a loop-based bidirectional branch search. Then the trace is dynamically simulated to perform a dummy execution of the kernel to obtain the estimated time. The framework does not rely on profiling or measurement results which are used in conventional performance estimation techniques. Moreover, the lightweight trace-based simulation consumes much less time than a fine-grained GPU simulator. Our framework can accurately grasp the variation trend of the execution time in the design space and robustly predict the performance of the kernels across two generations of recent Nvidia GPU architectures. Experiments on four Commercial Off-The-Shelf (COTS) GPUs show that our framework can predict the runtime performance with average Mean Absolute Percentage Error (MAPE) of 17.04% and time consumption of a few seconds. We also demonstrate the practicability of our framework with a realworld application.

OriginalspracheEnglisch
TitelProceedings - 25th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten506-518
Seitenumfang13
ISBN (elektronisch)9781728114446
DOIs
PublikationsstatusVeröffentlicht - 26 März 2019
Veranstaltung25th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019 - Washington, USA/Vereinigte Staaten
Dauer: 16 Feb. 201920 Feb. 2019

Publikationsreihe

NameProceedings - 25th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019

Konferenz

Konferenz25th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019
Land/GebietUSA/Vereinigte Staaten
OrtWashington
Zeitraum16/02/1920/02/19

Fingerprint

Untersuchen Sie die Forschungsthemen von „A hybrid framework for fast and accurate GPU performance estimation through source-level analysis and trace-based simulation“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren