43 - NHR PerfLab Seminar: High-Performance Implementations for High-Order Finite-Element Discretizations of PDEs/ClipID:45755 previous clip next clip

The automatic subtitles generated using Whisper Open AI in this video player (and in the Multistream video player) are provided for convenience and accessibility purposes. However, please note that accuracy and interpretation may vary. For more information, please refer to the FAQs (Paragraph 14).
Recording date 2022-11-08





Organisational Unit

Zentrum für Nationales Hochleistungsrechnen Erlangen (NHR@FAU)


Zentrum für Nationales Hochleistungsrechnen Erlangen (NHR@FAU)

NHR PerfLab Seminar talk on November 8, 2022

Speaker: Martin Kronbichler, University of Augsburg

Slides: https://hpc.fau.de/files/2022/10/Kronbichler_PerfLab_Nov2022.pdf

Abstract: My talk will present recent developments on matrix-free finite-element algorithms for numerically solving partial differential equations on complex geometries. The core ingredient is the computation of the integrals underlying the finite-element discretization on the fly. While this leads to algorithms with several hundreds of arithmetic operations per unknown and was traditionally considered too expensive compared to assembling a global sparse matrix, progress in performance engineering made it the fastest way to evaluate the matrix-vector product for practical cases of high-order discretizations with curvilinear unstructured hexahedral mesh elements or variable coefficients. The explanation is that the additional arithmetic work can be hidden behind the memory transfer of accessing the solution vectors, and in fact leverage a throughput close to simple finite difference stencils. I will present node-level performance results for high-order continuous and discontinuous Galerkin discretizations, including the case of adaptively refined meshes with hanging nodes. With the achieved high throughput of the matrix-vector product, we have observed that other operations in common iterative solvers, such as the vector operations in multigrid smoothers or the conjugate gradient method, now take a significant share of run time both on GPUs and CPUs. I will present results of loop fusion to increase data locality, which benefit CPUs with large L2 and L3 caches.

Speaker bio: Martin Kronbichler is a Professor at the University of Augsburg, Germany. He holds a diploma in applied mathematics from Technical University of Munich, Germany (2007) and a PhD degree in scientic computing with specialization in numerical analysis from Uppsala University, Sweden (2012). His research interests include high-order nite element methods for ow problems with matrix-free implementations, efcient numerical linear algebra, and their parallel and high-performance implementation on emerging exascale hardware using generic numerical software.

See https://hpc.fau.de/research/nhr-perflab-seminar-series/ for past and upcoming NHR PerfLab seminar talks.

More clips in this category "Friedrich-Alexander-Universität Erlangen-Nürnberg Zentralbereich"