A Stencil Framework to Realize Large-Scale Computations beyond Device Memory Capacity on GPU Supercomputers

Takashi Shimokawabe, Toshio Endo, Naoyuki Onodera, Takayuki Aoki

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

23 Scopus citations

Abstract

Stencil-based applications such as CFD have succeeded in obtaining high performance on GPU supercomputers. The problem sizes of these applications are limited by the GPU device memory capacity, which is typically smaller than the host memory. On GPU supercomputers, a locality improvement technique using temporal blocking method with memory swapping between host and device enables large computation beyond the device memory capacity. However, because the loop management of temporal blocking with data movement across these memories increase programming difficulty, the applying this methodology to the real stencil applications demands substantially higher programming cost. Our high-productivity stencil framework automatically applies temporal blocking to boundary exchange required for stencil computation and supports automatic memory swapping provided by a MPI/CUDA wrapper library. The framework-based application for the airflow in an urban city maintains 80% performance even with the twice larger than the GPU memory capacity and have demonstrated good weak scalability on the TSUBAME 2.5 supercomputer.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages525-529
Number of pages5
ISBN (Electronic)9781538623268
DOIs
StatePublished - 22 Sep 2017
Externally publishedYes
Event2017 IEEE International Conference on Cluster Computing, CLUSTER 2017 - Honolulu, United States
Duration: 5 Sep 20178 Sep 2017

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
Volume2017-September
ISSN (Print)1552-5244

Conference

Conference2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
Country/TerritoryUnited States
CityHonolulu
Period5/09/178/09/17

Fingerprint

Dive into the research topics of 'A Stencil Framework to Realize Large-Scale Computations beyond Device Memory Capacity on GPU Supercomputers'. Together they form a unique fingerprint.

Cite this