Non-Blocking GPU-CPU Notifications to Enable More GPU-CPU Parallelism

Bengisu Elis, Olga Pearce, David Boehme, Jason Burmark, Martin Schulz

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

Abstract

GPUs are increasingly popular in HPC systems, and more applications are adopting GPUs each day. However, the control synchronization of GPUs with CPUs is suboptimal and only possible after GPU kernel termination points, resulting in serialized host and device tasks. In this paper, we propose a novel CPU-GPU notification method that enables non-blocking in-kernel control synchronization of device and host tasks in combination with persistent GPU kernels. Using this notification method, we increase the overlap of CPU and GPU execution and with that parallelism. We present the concept and structure of the proposed notification mechanism together with in-kernel GPU-CPU control synchronization, using halo-exchange as an example. We analyze the performance of the halo-exchange pattern using our new notification method, as well as the interference between CPU and GPU operations due to the execution overlap. Finally, we verify our results using a performance model covering the halo-exchange pattern with the new notification method.

OriginalspracheEnglisch
TitelBDSIC2023 - 2023 5th International Conference on Big-data Service and Intelligent Computation
Herausgeber (Verlag)Association for Computing Machinery
Seiten1-11
Seitenumfang11
ISBN (elektronisch)9798400708923
DOIs
PublikationsstatusVeröffentlicht - 20 Okt. 2023
Veranstaltung5th International Conference on Big-data Service and Intelligent Computation - Singapore, Singapur
Dauer: 20 Okt. 202322 Okt. 2023

Publikationsreihe

NameACM International Conference Proceeding Series

Konferenz

Konferenz5th International Conference on Big-data Service and Intelligent Computation
Land/GebietSingapur
OrtSingapore
Zeitraum20/10/2322/10/23

Fingerprint

Untersuchen Sie die Forschungsthemen von „Non-Blocking GPU-CPU Notifications to Enable More GPU-CPU Parallelism“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren