Articles | Volume 16, issue 15
https://doi.org/10.5194/gmd-16-4367-2023
Development and technical paper | 01 Aug 2023

GPU-HADVPPM V1.0: a high-efficiency parallel GPU design of the piecewise parabolic method (PPM) for horizontal advection in an air quality model (CAMx V6.10)

Kai Cao, Qizhong Wu, Lingling Wang, Nan Wang, Huaqiong Cheng, Xiao Tang, Dongqing Li, and Lanning Wang

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
  • RC1: 'Comment on egusphere-2023-410', Anonymous Referee #1, 18 Apr 2023
  • CC1: 'Comment on egusphere-2023-410', Kai Cao, 20 Apr 2023
    • RC3: 'Reply on CC1', Anonymous Referee #1, 06 May 2023
  • RC2: 'Comment on egusphere-2023-410', Anonymous Referee #2, 05 May 2023
    • CC2: 'Reply on RC2', Kai Cao, 19 May 2023
  • AC1: 'Comment on egusphere-2023-410', Qizhong Wu, 01 Jun 2023

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Qizhong Wu on behalf of the Authors (01 Jun 2023): Author's response, Author's tracked changes, Manuscript
ED: Publish subject to minor revisions (review by editor) (05 Jun 2023) by Xiaomeng Huang
AR by Qizhong Wu on behalf of the Authors (12 Jun 2023): Author's response, Author's tracked changes, Manuscript
ED: Publish as is (20 Jun 2023) by Xiaomeng Huang
AR by Qizhong Wu on behalf of the Authors (24 Jun 2023): Manuscript
Short summary
Offline performance experiments show that GPU-HADVPPM on a V100 GPU achieves speedups of up to 1113.6× over its original version on an E5-2682 v4 CPU. After a series of optimization measures, the CAMx-CUDA model improves computing efficiency by 128.4× on a single V100 GPU card. A parallel architecture based on a hybrid MPI plus CUDA paradigm is also presented; it achieves a speedup of up to 4.5× when eight CPU cores and eight GPU cards are launched.