Articles | Volume 17, issue 17
https://doi.org/10.5194/gmd-17-6887-2024
Development and technical paper | 13 Sep 2024

GPU-HADVPPM4HIP V1.0: using the heterogeneous-compute interface for portability (HIP) to speed up the piecewise parabolic method in the CAMx (v6.10) air quality model on China's domestic GPU-like accelerator

Kai Cao, Qizhong Wu, Lingling Wang, Hengliang Guo, Nan Wang, Huaqiong Cheng, Xiao Tang, Dongxing Li, Lina Liu, Dongqing Li, Hao Wu, and Lanning Wang


Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
  • RC1: 'Comment on gmd-2023-222', Anonymous Referee #1, 30 Jan 2024
    • AC1: 'Reply on RC1', Qizhong Wu, 23 Feb 2024
  • RC2: 'Comment on gmd-2023-222', Anonymous Referee #2, 05 Apr 2024
    • AC2: 'Reply on RC2', Qizhong Wu, 22 Apr 2024

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Kai Cao on behalf of the Authors (28 Apr 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (07 May 2024) by Patrick Jöckel
RR by Anonymous Referee #1 (08 May 2024)
RR by Anonymous Referee #2 (20 Jun 2024)
ED: Reconsider after major revisions (01 Jul 2024) by Patrick Jöckel
AR by Qizhong Wu on behalf of the Authors (16 Jul 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Publish as is (22 Jul 2024) by Patrick Jöckel
AR by Qizhong Wu on behalf of the Authors (24 Jul 2024)  Author's response   Manuscript 
Short summary
AMD’s heterogeneous-compute interface for portability (HIP) was used to port the piecewise parabolic method solver from NVIDIA GPUs to China's domestic GPU-like accelerators. The results show that the speedup on the GPU-like accelerator grows with the model scale, reaching up to 28.9 times. Multi-level parallelism achieves a speedup of 32.7 times on the heterogeneous cluster. A comparison of the results indicates that the GPU-like accelerators provide higher accuracy for geoscience numerical models.