TY - JOUR
T1 - Bluegene/L applications
T2 - Parallelism on a massive scale
AU - de Supinski, Bronis R.
AU - Schulz, Martin
AU - Bulatov, Vasily V.
AU - Cabot, William
AU - Chan, Bor
AU - Cook, Andrew W.
AU - Draeger, Erik W.
AU - Glosli, James N.
AU - Greenough, Jeffrey A.
AU - Henderson, Keith
AU - Kubota, Alison
AU - Louis, Steve
AU - Miller, Brian J.
AU - Patel, Mehul V.
AU - Spelce, Thomas E.
AU - Streitz, Frederick H.
AU - Williams, Peter L.
AU - Yates, Robert K.
AU - Yoo, Andy
AU - Almasi, George
AU - Bhanot, Gyan
AU - Gara, Alan
AU - Gunnels, John A.
AU - Gupta, Manish
AU - Moreira, Jose
AU - Sexton, James
AU - Walkup, Bob
AU - Archer, Charles
AU - Gygi, Francois
AU - Germann, Timothy C.
AU - Kadau, Kai
AU - Lomdahl, Peter S.
AU - Rendleman, Charles
AU - Welcome, Michael L.
AU - McLendon, William
AU - Hendrickson, Bruce
AU - Franchetti, Franz
AU - Kral, Stefan
AU - Lorenz, Jürgen
AU - Überhuber, Christoph W.
AU - Chow, Edmond
AU - Çatalyürek, Ümit
PY - 2008/3
Y1 - 2008/3
N2 - BlueGene/L (BG/L), developed through a partnership between IBM and Lawrence Livermore National Laboratory (LLNL), is currently the world's largest system both in terms of scale, with 131,072 processors, and absolute performance, with a peak rate of 367 Tflop/s. BG/L has led the last four Top500 lists with a Linpack rate of 280.6 Tflop/s for the full machine installed at LLNL and is expected to remain the fastest computer in the next few editions. However, the real value of a machine such as BG/L derives from the scientific breakthroughs that real applications can produce by successfully using its unprecedented scale and computational power. In this paper, we describe our experiences with eight large scale applications on BG/L from several application domains, ranging from molecular dynamics to dislocation dynamics and turbulence simulations to searches in semantic graphs. We also discuss the challenges we faced when scaling these codes and present several successful optimization techniques. All applications show excellent scaling behavior, even at very large processor counts, with one code even achieving a sustained performance of more than 100 Tflop/s, clearly demonstrating the real success of the BG/L design.
AB - BlueGene/L (BG/L), developed through a partnership between IBM and Lawrence Livermore National Laboratory (LLNL), is currently the world's largest system both in terms of scale, with 131,072 processors, and absolute performance, with a peak rate of 367 Tflop/s. BG/L has led the last four Top500 lists with a Linpack rate of 280.6 Tflop/s for the full machine installed at LLNL and is expected to remain the fastest computer in the next few editions. However, the real value of a machine such as BG/L derives from the scientific breakthroughs that real applications can produce by successfully using its unprecedented scale and computational power. In this paper, we describe our experiences with eight large scale applications on BG/L from several application domains, ranging from molecular dynamics to dislocation dynamics and turbulence simulations to searches in semantic graphs. We also discuss the challenges we faced when scaling these codes and present several successful optimization techniques. All applications show excellent scaling behavior, even at very large processor counts, with one code even achieving a sustained performance of more than 100 Tflop/s, clearly demonstrating the real success of the BG/L design.
KW - Application scalability
KW - BlueGene/L
KW - Massively parallel architectures
KW - Performance study and optimization
UR - http://www.scopus.com/inward/record.url?scp=38849168828&partnerID=8YFLogxK
U2 - 10.1177/1094342007085025
DO - 10.1177/1094342007085025
M3 - Article
AN - SCOPUS:38849168828
SN - 1094-3420
VL - 22
SP - 33
EP - 51
JO - International Journal of High Performance Computing Applications
JF - International Journal of High Performance Computing Applications
IS - 1
ER -