PaScaL_TDMA 2.1: A register-resident multi-GPU tridiagonal matrix solver with optimized communication for large-scale CFD simulations

Published: 20 March 2026 | Version 3 | DOI: 10.17632/49z6fh94z3.3
Contributors:

Description

We present PaScaL_TDMA 2.1, a GPU-oriented release of the PaScaL_TDMA library [3] for efficiently solving large batches of distributed tridiagonal systems on modern multi-GPU platforms. Building on the original CPU-based PaScaL_TDMA formulation and the shared-memory buffering strategy introduced in PaScaL_TDMA 2.0 [2], version 2.1 reformulates the core kernels and communication path to better match the GPU execution model. CUDA threads are mapped to contiguous tridiagonal lines to achieve coalesced global-memory access, and the elimination kernels are optimized to a fully register-resident implementation to reduce memory traffic and synchronization. To lower inter-GPU overhead, the reduced-system assembly is performed via a single consolidated MPI_Alltoall exchange, and the kernel interface is restructured to eliminate descriptor transfers at launch. Benchmarks on the NURION system show that PaScaL_TDMA 2.1 reduces wall time from 0.127 s on dual-socket Intel Skylake CPUs to 9.2 ms on an NVIDIA A100 and 6.1 ms on an H100, corresponding to speedups of 14.0× and 20.7×, respectively. Strong- and weak-scaling studies quantify the performance gains from each optimization stage and demonstrate sustained scalability on multi-GPU systems. Finally, PaScaL_TDMA 2.1 is integrated into an immersed-boundary LES solver and validated through large-scale CFD simulations, including an industrial-scale cleanroom configuration with up to 128 A100 GPUs and O(10^10) degrees of freedom.

Files

Categories

High Performance Computing, Computational Fluid Dynamics, Computational Physics

Licence