PerficienCC - Performance and Efficiency in HPC with Custom Computing

DFG Project in call "Performance Engineering für wissenschaftliche Software"

Numerous research projects have demonstrated that accelerators like field-programmable gate arrays (FPGAs), many-cores and graphics processing units (GPUs) can achieve substantial performance and energy-efficiency benefits for high-performance computing. Still, accelerators are not pervasively used so far – even for applications that could very likely profit from accelerators. There are numerous reasons for the reluctance of HPC developers to adopt accelerators, such as: lack of technical knowledge; unclear benefit of the time invested for porting and optimization; lack of an estimate of the acceleration potential; poor quality of materials that teach the use of accelerators; few libraries that allow a black-box use of accelerators.

In this project, we are establishing a structured support and consulting process in our HPC center that supports HPC developers during the complete process from performance analysis and estimation of the acceleration potential up to the optimization of the runtime-critical parts of their applications. This process infuses a complementary expertise into teams of developers from computational sciences and thus improves the cost/benefit ratio of code porting and optimization. Thus ultimately allows these scientists to reduce the program runtime or to simulate larger or more complex systems.

To allow developers to profit from previous work we will translate frequently used functions into reusable libraries. Finally, we will develop teaching materials that are tailored to the needs of computational scientists and collect code examples that illustrate best practices.

We will focus the work in this project on the technological and application-related or method-related competencies of our compute center and its users. As research of our main users work is concentrated in three main domains (nanophotonics, molecular dynamics and quantum chemistry) we can reuse domain-specific methods and experiences for different codes. Technologically, we will focus on FPGAs as accelerator technology, because FPGAs have the highest potential to improve the energy efficiency of computation and the market for FPGAs is currently thriving. We see a fertile basis for our research and development efforts, driven by first initiatives to standardize software stacks for FPGAs, the beginning of hardware integration of FPGA technologies in processors, the introduction of CAPI as a general accelerator interface by IBM and the increased maturity of FPGA development tools. We expect that our substantial expertise in custom computing with FPGAs and our focus on few application domains provide ideal conditions to make a significant progress in the application of FPGAs in HPC and allow for demonstrating and quantifying the potential to improve the performance and energy efficiency with real HPC codes.

Funding agencyDeutsche Forschungsgemeinschaft (DFG)
Funding identifierPL 595/2-1

Publications

A Submatrix-Based Method for Approximate Matrix Function Evaluation in the Quantum Chemistry Code CP2K
M. Lass, R. Schade, T. Kühne, C. Plessl, in: Proc. International Conference for High Performance Computing, Networking, Storage and Analysis (SC), IEEE Computer Society, Los Alamitos, CA, USA, 2020, pp. 1127–1140.
Accurate Sampling with Noisy Forces from Approximate Computing
V. Rengaraj, M. Lass, C. Plessl, T. Kühne, Computation 8 (2020).
CP2K: An electronic structure and molecular dynamics software package - Quickstep: Efficient and accurate electronic structure calculations
T. Kühne, M. Iannuzzi, M.D. Ben, V.V. Rybkin, P. Seewald, F. Stein, T. Laino, R.Z. Khaliullin, O. Schütt, F. Schiffmann, D. Golze, J. Wilhelm, S. Chulkov, M.H.B.-H. Mohammad Hossein Bani-Hashemian, V. Weber, U. Borstnik, M. Taillefumier, A.S. Jakobovits, A. Lazzaro, H. Pabst, T. Müller, R. Schade, M. Guidon, S. Andermatt, N. Holmberg, G.K. Schenter, A. Hehn, A. Bussy, F. Belleflamme, G. Tabacchi, A. Glöß, M. Lass, I. Bethune, C.J. Mundy, C. Plessl, M. Watkins, J. VandeVondele, M. Krack, J. Hutter, The Journal of Chemical Physics 152 (2020).
A General Algorithm to Calculate the Inverse Principal p-th Root of Symmetric Positive Definite Matrices
D. Richters, M. Lass, A. Walther, C. Plessl, T. Kühne, Communications in Computational Physics 25 (2019) 564–585.
OpenCL Implementation of Cannon's Matrix Multiplication Algorithm on Intel Stratix 10 FPGAs
P. Gorlani, T. Kenter, C. Plessl, in: Proceedings of the International Conference on Field-Programmable Technology (FPT), IEEE, 2019.
A Massively Parallel Algorithm for the Approximate Calculation of Inverse p-th Roots of Large Sparse Matrices
M. Lass, S. Mohr, H. Wiebeler, T. Kühne, C. Plessl, in: Proc. Platform for Advanced Scientific Computing (PASC) Conference, ACM, New York, NY, USA, 2018.
Using Approximate Computing for the Calculation of Inverse Matrix p-th Roots
M. Lass, T. Kühne, C. Plessl, Embedded Systems Letters 10 (2018) 33–36.
Flexible FPGA design for FDTD using OpenCL
T. Kenter, J. Förstner, C. Plessl, in: Proc. Int. Conf. on Field Programmable Logic and Applications (FPL), IEEE, 2017.
Microdisk Cavity FDTD Simulation on FPGA using OpenCL
T. Kenter, C. Plessl, in: Proc. Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC), 2016.
Show all publications

Dr. Tobias Kenter

Paderborn Center for Parallel Computing (PC2)

Scientific Advisor FPGA Acceleration

Write email +49 5251 60-4340
business-card image

Dr. Michael Laß

Paderborn Center for Parallel Computing (PC2)

Scientific Advisor for FPGA Acceleration

Write email +49 5251 60-1722