Articles | Volume 16, issue 15
https://doi.org/10.5194/gmd-16-4367-2023
Development and technical paper | 01 Aug 2023

GPU-HADVPPM V1.0: a high-efficiency parallel GPU design of the piecewise parabolic method (PPM) for horizontal advection in an air quality model (CAMx V6.10)

Kai Cao, Qizhong Wu, Lingling Wang, Nan Wang, Huaqiong Cheng, Xiao Tang, Dongqing Li, and Lanning Wang

Related authors

GPU-HADVPPM4HIP V1.0: using the heterogeneous-compute interface for portability (HIP) to speed up the piecewise parabolic method in the CAMx (v6.10) air quality model on China's domestic GPU-like accelerator
Kai Cao, Qizhong Wu, Lingling Wang, Hengliang Guo, Nan Wang, Huaqiong Cheng, Xiao Tang, Dongxing Li, Lina Liu, Dongqing Li, Hao Wu, and Lanning Wang
Geosci. Model Dev., 17, 6887–6901, https://doi.org/10.5194/gmd-17-6887-2024, 2024

Application of regional meteorology and air quality models based on the microprocessor without interlocked piped stages (MIPS) and LoongArch CPU platforms
Zehua Bai, Qizhong Wu, Kai Cao, Yiming Sun, and Huaqiong Cheng
Geosci. Model Dev., 17, 4383–4399, https://doi.org/10.5194/gmd-17-4383-2024, 2024

Related subject area

Atmospheric sciences
LIMA (v2.0): A full two-moment cloud microphysical scheme for the mesoscale non-hydrostatic model Meso-NH v5-6
Marie Taufour, Jean-Pierre Pinty, Christelle Barthe, Benoît Vié, and Chien Wang
Geosci. Model Dev., 17, 8773–8798, https://doi.org/10.5194/gmd-17-8773-2024, 2024

SLUCM+BEM (v1.0): a simple parameterisation for dynamic anthropogenic heat and electricity consumption in WRF-Urban (v4.3.2)
Yuya Takane, Yukihiro Kikegawa, Ko Nakajima, and Hiroyuki Kusaka
Geosci. Model Dev., 17, 8639–8664, https://doi.org/10.5194/gmd-17-8639-2024, 2024

NAQPMS-PDAF v2.0: a novel hybrid nonlinear data assimilation system for improved simulation of PM2.5 chemical components
Hongyi Li, Ting Yang, Lars Nerger, Dawei Zhang, Di Zhang, Guigang Tang, Haibo Wang, Yele Sun, Pingqing Fu, Hang Su, and Zifa Wang
Geosci. Model Dev., 17, 8495–8519, https://doi.org/10.5194/gmd-17-8495-2024, 2024

Source-specific bias correction of US background and anthropogenic ozone modeled in CMAQ
T. Nash Skipper, Christian Hogrefe, Barron H. Henderson, Rohit Mathur, Kristen M. Foley, and Armistead G. Russell
Geosci. Model Dev., 17, 8373–8397, https://doi.org/10.5194/gmd-17-8373-2024, 2024

Observational operator for fair model evaluation with ground NO2 measurements
Li Fang, Jianbing Jin, Arjo Segers, Ke Li, Ji Xia, Wei Han, Baojie Li, Hai Xiang Lin, Lei Zhu, Song Liu, and Hong Liao
Geosci. Model Dev., 17, 8267–8282, https://doi.org/10.5194/gmd-17-8267-2024, 2024

Short summary
Offline performance experiments show that GPU-HADVPPM on a V100 GPU achieves a speedup of up to 1113.6× over its original version on an E5-2682 v4 CPU. After a series of optimization measures, the CAMx-CUDA model improves computing efficiency by 128.4× on a single V100 GPU card. A parallel architecture based on an MPI plus CUDA hybrid paradigm is also presented, and it achieves up to a 4.5× speedup when launching eight CPU cores and eight GPU cards.
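
To make the hybrid setup and the reported metric concrete, the following is a minimal sketch rather than the authors' implementation. It assumes a round-robin binding of MPI ranks to GPU cards, which is a common convention but is not confirmed by the summary, and it defines speedup as reference wall time divided by accelerated wall time. The helper names `assign_device` and `speedup` are hypothetical, and the timings are made up for illustration.

```python
def assign_device(rank: int, n_gpus: int) -> int:
    """Map an MPI rank to a GPU index, round-robin (hypothetical helper)."""
    if n_gpus <= 0:
        raise ValueError("n_gpus must be positive")
    return rank % n_gpus


def speedup(t_reference: float, t_accelerated: float) -> float:
    """Speedup as the ratio of reference to accelerated wall time."""
    if t_accelerated <= 0:
        raise ValueError("t_accelerated must be positive")
    return t_reference / t_accelerated


# Eight MPI processes on eight GPU cards: each process drives its own card.
mapping = {rank: assign_device(rank, 8) for rank in range(8)}
print(mapping)

# Made-up timings in seconds, for illustration only (not values from the paper).
print(speedup(10.0, 2.0))
```

With eight processes and eight cards the mapping is one-to-one; with more processes than cards, the same rule would place several processes on one card, which may or may not suit a given model.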