145 TFlops performance on 3990 GPUs of TSUBAME 2.0 supercomputer for an operational weather prediction

Takashi Shimokawabe, Takayuki Aoki, Junichi Ishida, Kohei Kawano, Chiashi Muroi

Research output: Contribution to journalConference articlepeer-review

32 Scopus citations

Abstract

Numerical weather prediction is one of the major applications in high performance computing and demands fast and high-precision simulation over fine-grained grids. While utilizing hundreds of CPUs is certainly the most common way to get high performance for large scale simulations, we have another solution to use GPUs as massively parallel computing platform. In order to drastically shorten the runtime of a weather prediction code, we rewrite its huge entire code for GPU computing from scratch in CUDA. The code ASUCA is a high resolution meso-scale atmosphere model that is being developed by the Japan Meteorological Agency for the purpose of the next-generation weather forecasting service. The TSUBAME 2.0 supercomputer, which is equipped with 4224 NVIDIA Tesla M2050 GPUs, has started operating in November 2010 at the Tokyo Institute of Technology. A benchmark on the 3990 GPUs on TSUBAME 2.0 achieves extremely high performance of 145 TFlops in single precision for 14368×14284×48 mesh. This paper also describes the multi-GPU optimizations introduced into the ASUCA porting on TSUBAME 2.0.

Original languageEnglish
Pages (from-to)1535-1544
Number of pages10
JournalProcedia Computer Science
Volume4
DOIs
StatePublished - 2011
Externally publishedYes
Event11th International Conference on Computational Science, ICCS 2011 - Singapore, Singapore
Duration: 1 Jun 20113 Jun 2011

Keywords

  • GPGPU
  • High performance computing
  • Numerical weather prediction

Fingerprint

Dive into the research topics of '145 TFlops performance on 3990 GPUs of TSUBAME 2.0 supercomputer for an operational weather prediction'. Together they form a unique fingerprint.

Cite this