TY - JOUR
T1 - A two-scale approach for efficient on-the-fly operator assembly in massively parallel high performance multigrid codes
AU - Bauer, S.
AU - Mohr, M.
AU - Rüde, U.
AU - Weismüller, J.
AU - Wittmann, M.
AU - Wohlmuth, B.
N1 - Publisher Copyright:
© 2017 IMACS
PY - 2017/12
Y1 - 2017/12
N2 - Large scale matrix-free finite element implementations save memory and are often significantly faster than implementations using classical sparse matrix techniques. They are especially well suited for massively parallel geometric multigrid solvers in combination with hierarchical hybrid grids on polyhedral domains. In the case of constant coefficients, the number of different stencil entries depends only on the coarse grid and does not increase with the number of refinement levels. However, for non-polyhedral domains the situation changes. Then even for the Laplace operator, the element mapping leads to fine grid stencils that can vary from grid point to grid point. Traditional matrix-free techniques that are based on an element-wise assembly then result in a considerably increase in computational cost. To compensate for this shortcoming, we introduce a new two-scale approach that uses a surrogate operator. It exploits a piecewise polynomial approximation of the entries of the stencil of the fine grid operator with respect to the coarse mesh size. The low-cost evaluation of these surrogate polynomials results in an efficient stencil assembly on-the-fly for non-polyhedral domains. We discuss and illustrate numerically two-scale a priori bounds. The accuracy of the approximate solution can be further improved if combined with a double discretization technique. A careful performance analysis in combination with a hardware–aware code optimization based on the Execution–Cache–Memory model yields a significant speed up. Weak and strong scaling results illustrate the potential of this new two-scale approach within large scale PDE simulations.
AB - Large scale matrix-free finite element implementations save memory and are often significantly faster than implementations using classical sparse matrix techniques. They are especially well suited for massively parallel geometric multigrid solvers in combination with hierarchical hybrid grids on polyhedral domains. In the case of constant coefficients, the number of different stencil entries depends only on the coarse grid and does not increase with the number of refinement levels. However, for non-polyhedral domains the situation changes. Then even for the Laplace operator, the element mapping leads to fine grid stencils that can vary from grid point to grid point. Traditional matrix-free techniques that are based on an element-wise assembly then result in a considerably increase in computational cost. To compensate for this shortcoming, we introduce a new two-scale approach that uses a surrogate operator. It exploits a piecewise polynomial approximation of the entries of the stencil of the fine grid operator with respect to the coarse mesh size. The low-cost evaluation of these surrogate polynomials results in an efficient stencil assembly on-the-fly for non-polyhedral domains. We discuss and illustrate numerically two-scale a priori bounds. The accuracy of the approximate solution can be further improved if combined with a double discretization technique. A careful performance analysis in combination with a hardware–aware code optimization based on the Execution–Cache–Memory model yields a significant speed up. Weak and strong scaling results illustrate the potential of this new two-scale approach within large scale PDE simulations.
KW - ECM-model
KW - Large scale scaling results
KW - Massively parallel multigrid
KW - Matrix free on-the-fly assembly
KW - Surrogate operator
KW - Two-scale PDE discretization
UR - http://www.scopus.com/inward/record.url?scp=85026890303&partnerID=8YFLogxK
U2 - 10.1016/j.apnum.2017.07.006
DO - 10.1016/j.apnum.2017.07.006
M3 - Article
AN - SCOPUS:85026890303
SN - 0168-9274
VL - 122
SP - 14
EP - 38
JO - Applied Numerical Mathematics
JF - Applied Numerical Mathematics
ER -