A Geographically Weighted Gaussian Process Regression (GW-GPR) emulator of anthropogenic PM<sub>2.5</sub> from the GEOS-Chem High Performance (GCHP) 13.0.0 global chemical transport model

Wong, Anthony Y. H.; Eastham, Sebastian D.; Monier, Erwan; Selin, Noelle E.

doi:10.5194/gmd-19-3335-2026

Articles | Volume 19, issue 8

https://doi.org/10.5194/gmd-19-3335-2026

Articles | Volume 19, issue 8

Model description paper

27 Apr 2026

Model description paper |

| 27 Apr 2026

A Geographically Weighted Gaussian Process Regression (GW-GPR) emulator of anthropogenic PM_2.5 from the GEOS-Chem High Performance (GCHP) 13.0.0 global chemical transport model

Anthony Y. H. Wong, Sebastian D. Eastham, Erwan Monier, and Noelle E. Selin

Abstract

Chemical transport modelling has been an essential tool to study the impacts of socio-economic changes and policies on air quality and associated social costs due to human health impacts. However, high computational and human resource demands limit the use of state-of-the-art chemical transport models outside of the atmospheric science community. We address this limitation by training Geographically Weighted Gaussian Process Regressors (GW-GPR) on the outputs of a series of perturbation experiments from the GEOS-Chem High Performance global chemical transport model (GCHP 13.0.0). The Gaussian Process Regressor relates changes in annual mean surface anthropogenic PM_2.5 in each GCHP model grid cell to changes in short-lived air pollutant emissions and atmospheric CH₄ and CO₂ concentrations. In comparison to existing linearized and regionalized approaches, our method can account for sub-regional changes in air pollutant emission patterns and incorporates the non-linear response of secondary air pollutants to precursor and greenhouse gas emissions. We evaluate our emulator by predicting the global distribution of PM_2.5 in 2050 (relative to 2014) under 4 sets of climate and air pollution control policy scenarios. The emulator reproduces grid cell scale changes in anthropogenic PM_2.5 (R²=0.94–0.99 over the 4 scenarios tested), and associated global changes in premature mortalities at 95 % confidence level, while requiring <10 s of CPU time (vs. ∼3000 CPU hours for GCHP) for each scenario. We demonstrate the utility of the emulator by projecting global trends of population-weighted PM_2.5 from the AerChemMIP ensemble, where the emulator prediction falls within the ensemble range. To our knowledge, the GW-GPR emulator is the first global-scale emulator operating at grid cell scale with explicit consideration of non-linearities in atmospheric chemistry, climate change, and provides predictive uncertainties. The accuracy, speed and simplicity of the emulator also show the capability of machine learning algorithms in emulating global atmospheric chemistry models, and in making atmospheric chemistry modelling accessible for global climate/air pollution scenario analysis and integrated assessment.

Download & links

How to cite.

Received: 05 Jun 2025 – Discussion started: 14 Jul 2025 – Revised: 20 Mar 2026 – Accepted: 31 Mar 2026 – Published: 27 Apr 2026

1 Introduction

Fine (aerodynamic diameter ≤ 2.5 µm) particulate matter (PM_2.5) is among the most important air pollutants at global scale, threatening both human and ecosystem health. Globally, PM_2.5 exposure was estimated to be responsible for ∼4 million deaths in 2019 (Sang et al., 2022), and addressing health and environmental impacts from ambient air pollution has been included within the Sustainable Development Goals (goal 3.9.1 and 11.6.2) (United Nations, 2015). The conventional approach to evaluate the impacts of socio-economic changes and policy interventions on air quality involves producing the projected air pollutant emission inventories (and meteorological fields if direct impacts of climate change are considered) and feeding them as inputs to a chemical transport model to simulate the impacts on air pollutant concentration. Alternatively, the greenhouse gas (GHG) emission or concentration, and air pollutant emissions can be directly fed into chemistry-climate models or Earth system models to further include the feedback between atmospheric composition and other components of Earth system. This process is highly demanding in terms of human and computational resources, which limits its usage for policy analysis and integrated assessment.

To increase the accessibility of air quality modelling for the broader scientific and stakeholder communities, strategies have been developed to reduce the complexity of air quality modelling by drawing from full chemical transport model experiments, resulting in various reduced-form air quality models that are faster and easier to run while retaining reasonable accuracy. One approach involves dividing the world into regions. By assuming a linear relationship between air pollutant emissions in one region (source) and the air pollutant concentrations in other regions (receptor), source-receptor (SR) matrices are constructed by running a series of chemical transport model experiments with emission perturbed individually at each region. The SR matrices can then be used as a linearized global air quality model. This approach is also useful in spatial attribution of air pollution, which is applied in the Task Force on Hemispheric Transport of Air Pollutants (HTAP) (Galmarini et al., 2017; Liang et al., 2018). Designed to be a useful tool for science-policy analysis, the TM5-FASST model (Van Dingenen et al., 2018) computes SR matrices for 56 regions of the world, and subsequently processes the output into public health, agriculture and climate impact metrics. Another approach involves using output of full complexity chemical transport model to parameterize some physical and chemical processes, resulting in a reduced-order chemical transport model that can be run faster and with higher resolution, which are applicable for regional and global high-resolution (∼1–4 km) modelling with runtime of a few hundred CPU hours per model year (Tessum et al., 2017; Thakrar et al., 2022),. These SR and reduced-order models have been frequently applied in recent science and policy studies (e.g. Camilleri et al., 2023; Huang et al., 2023; Reis et al., 2022), showing the utility and demand for these outputs.

However, both SR and reduced-order models rely on several simplifying assumptions, which do not always hold. Many methods rely on linearizing the relationship between emissions and concentrations, which has been shown to be a reasonable approximation when the emission change is relatively small (Van Dingenen et al., 2018), but the formation rate of major secondary air pollutants such as inorganic PM_2.5 (Ansari and Pandis, 1998) are known to respond non-linearly to precursor emissions, especially when there are shifts in sulphate-nitrate-ammonium chemical regimes. This issue can be relevant when exploring a wide range of climate and air quality scenarios given the large range of possible air pollutant emissions and the discrepancy in the rates of change of different precursor emissions (NO_x vs. NH₃ vs. SO₂ for inorganic PM_2.5) (Atkinson et al., 2022a; Rao et al., 2017; Turnock et al., 2020). Existing SR matrices and reduced-order models also often ignore the effects of climate change on air pollution (e.g. changing precipitation and associated wet deposition, temperature effects on gas-aerosol partitioning and oxidation chemistry) (Jacob and Winner, 2009). Garcia-Menendez et al. (2015) find that under a high-warming scenario, climate change alone can increase population-weighted annual average PM_2.5 by 1.5 µg m⁻³ between 2000 and 2100 over the contiguous United States.

Recent innovations in regional reduced-form air quality models have moved beyond simple linear scaling, by applying non-linear regression techniques. Conibear et al. (2022) and Vander Hoorn et al. (2022) successfully used Gaussian Process Regression to emulate the grid cell scale response of annual mean PM_2.5 to a large range (−100 % to +50 %) of sectoral emission perturbations over China and the Perth greater metropolitan region respectively, with regional chemical transport model perturbation experiments as training data. Colette et al. (2022) applied multivariate quadratic regressions to emulate the simulated PM_2.5 response to emission control policies over Europe, achieving an accuracy of <2 % relative error in 95 % of grid cells. Meanwhile, a geographically weighted (i.e. using a weighted sum of regional emission changes as predictors to represent pollutant transport process) linear regression emulator was shown to reproduce PM_2.5 response to precursor emissions from the parent chemical transport model within 10 % accuracy over Europe (Pisoni et al., 2017).

Building on these regional scale applications, we combine Geographic Weighting and Gaussian Process Regression (GW-GPR) techniques to emulate the output of a global chemical transport model, GEOS-Chem High Performance (GCHP) driven by meteorological data from multiple climate simulations with the Community Atmosphere Model (CAM, collectively GCHP-CAM) (Eastham et al., 2023). This results in a global reduced-form anthropogenic PM_2.5 model that can account for spatially heterogenous pollutant emission changes and non-linearity in atmospheric chemistry under multiple climate scenarios without requiring simulated meteorological fields as input, and provide robust uncertainty estimates, without drastically increasing the computational cost. These properties would make our reduced-form model a highly viable candidate for specific use cases (e.g. ensemble modelling, building interactive tools, embedding in integrated assessment workflows).

2 Method

We emulate the response of annual mean surface anthropogenic PM_2.5 to changes in air pollutant emissions and climate modelled by a global chemical transport model, GEOS-Chem High Performance (GCHP). Here, we provide descriptions for our GCHP model setup and experiments, and the process of constructing and evaluating the emulator.

2.1 MIT Integrated Global System Model (IGSM) and its coupling with Community Atmosphere Model (CAM)

The climate scenarios are generated from the MIT IGSM framework (Fig. 1). The human system component of IGSM, the Economic Projection and Policy Analysis model version 7 (EPPA7) (Chen et al., 2022), is a global multi-sector (22 sectors) multi-region (18 regions) recursive-dynamic computable general equilibrium model. EPPA provides regionalized and sectorized consumption of different fuel types under the socioeconomic assumptions of each scenario. The associated greenhouse gas (GHG) and air pollutant emissions drive the MIT Earth System Model (MESM) (Sokolov et al., 2018) to simulate yearly global average atmospheric GHG concentration, and concentrations of zonally averaged climate and near-term climate forcers (NTCF, e.g. aerosols, O₃).

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f01

Figure 1Schematic of the IGSM-CAM modelling framework. Orange boxes represent modelling systems, purple boxes represent data sets. The red dashed box represents the MIT IGSM part of the framework.

Download

Since the output of IGSM is zonally-averaged, we simulate 3D meteorological fields using the IGSM-CAM framework (Monier et al., 2013) that links the IGSM to the National Center for Atmospheric Research Community Atmosphere Model (CAM) 3.1 (Collins et al., 2006). In this framework, CAM is driven by the IGSM output GHG concentrations, sea surface temperature anomalies, sea ice cover, and NTCF concentrations (Fig. 1). A pattern scaling algorithm is used to translate 2D NTCF output from IGSM to the 3D input fields required by CAM. The simulation outputs used in this study are described and evaluated in detail by Monier et al. (2015). IGSM-CAM is run with a horizontal resolution of 2°×2.5° on 26 vertical layers up to 2.2 hPa.

2.2 GEOS-Chem High Performance model driven by CAM meteorological fields (GCHP-CAM)

We use the GCHP-CAM modelling system, which was described and evaluated in Eastham et al. (2023), to simulate global PM_2.5 distribution, and its response to climate and pollutant emission changes. The modelling system is based on a customized version of GCHP 13.0.0 (The International GEOS-Chem User Community, 2021) that can be driven by the modelled meteorological fields derived from the IGSM-CAM framework. Here we provide a brief description of the modelling system and specific setups for our work.

GCHP (Eastham et al., 2018) simulates PM_2.5 by resolving the chemistry, transport, emission and deposition of relevant chemical species. Oxidant chemistry is simulated using a coupled VOC–CO–NO_x–O₃–aerosol-halogen chemical mechanism (Sherwen et al., 2016). GCHP is run at C48 (∼200 km) horizontal resolution with the same vertical layers with the IGSM-CAM simulations. The model output is remapped into a 2° latitude × 2.5° longitude horizontal grid conservatively (Jones, 1999). PM_2.5 includes contribution from nitrate, sulphate, ammonium, black carbon (BC), organic carbon (OC), fine dust, sea salt and secondary organic aerosols (SOA). The formation of secondary inorganic aerosols is simulated by considering the thermodynamic equilibrium of the NH $_{4}^{+}$ –Na⁺–SO $_{4}^{2 -}$ –NO $_{3}^{-}$ –Cl⁻–H₂O system through ISORROPIA II (Fountoukis and Nenes, 2007). Organic aerosols are assumed to be non-volatile. SOA formation follows a simple yield-based scheme that converts a fixed potion of isoprene, monoterpenes and other terpenoids into a lumped SOA precursor pool and another lumped SOA pool (Kim et al., 2015).

Biogenic volatile organic compounds (BVOC) emissions follow Guenther et al. (2012) with isoprene inhibition by CO₂ .(Possell and Hewitt, 2011; Tai et al., 2013) included. Soil NO_x emissions follow Hudman et al. (2012). While BVOC and soil NO_x emissions are both calculated online (and therefore respond to climate and atmospheric CO₂ concentration), mineral dust (Meng et al., 2021), lightning NO_x (Murray et al., 2012) and wildfire (from Global Fire Emission Database, version 4.1; Van Der Werf et al., 2017) emissions are held at 2014 level.

Aerosol concentrations are archived at daily time resolution, and subsequently processed into annual mean surface total anthropogenic PM_2.5. Total PM_2.5 mass (Eq. 1) is calculated from the aerosol concentration by considering aerosol hygroscopic growth at 35 % relative humidity, aligning with the PM_2.5 measurement standard of the United States Environmental Protection Agency (USEPA) (Latimer and Martin, 2019). Anthropogenic PM_2.5 mass (Eq. 2) is calculated by the above method, but only summing a subset of aerosol species (sulphate, nitrate, ammonium, BC and OC) while omitting other aerosol species that are driven by non-industrial sources (dust, sea salt and SOA):

\begin{matrix} (1) & \begin{aligned} Total {PM}_{2.5} & = 1.1 ({NH}_{4}^{+} + {NO}_{3}^{-} + {SO}_{4}^{2 -}) + BC + OC \\ + fine mineral dust + SOA + 1.86 (Sea Salt) \end{aligned} \\ (2) & \begin{aligned} Anthropogenic {PM}_{2.5} & = 1.1 ({NH}_{4}^{+} + {NO}_{3}^{-} + {SO}_{4}^{2 -}) \\ + BC + OC . \end{aligned} \end{matrix}

2.3 Generating PM_2.5 training data using GCHP-CAM

We conduct GCHP-CAM simulations for atmospheric composition for the year 2014 (with additional 3 months of simulations as spin up (output discarded) before the start of 2014) by applying IGSM-CAM simulated meteorology, anthropogenic emissions of air pollutants from the Community Emission Data System (Hoesly et al., 2018), and the monthly surface CH₄ concentration derived by spatially kriging the observations from National Oceanic and Atmospheric Administration Global Monitoring Laboratory Cooperative Air Sampling Network for 2014. The resulting modelled total and anthropogenic PM_2.5 concentrations serve as a baseline for subsequent comparisons.

To effectively sample the sensitivity of PM_2.5 over a wide range of climate and air pollution emissions, we generate the training set from a series of GCHP-CAM perturbation experiments by manipulating 9 input variables that affect PM_2.5 and oxidant concentration: 7 air pollutant emissions – NO_x, SO₂, NH₃, NMVOC, BC, OC, carbon monoxide (CO) – that are commonly provided by integrated assessment models (Gidden et al., 2019), CH₄ concentration, and global warming.

2.3.1 Parameterizing global warming

Representing the direct impacts of climate change is an important aspect of building climate-aware reduced-form atmospheric composition models. However, unlike the other 8 perturbed variables, global warming cannot be directly implemented as a scaling factor in GCHP-CAM. Some recent studies achieve this goal by including 3D meteorological fields from climate model output as predictors (e.g. Li et al., 2022, 2026). However, this could limit the utility of the model to scenarios where climate model outputs are archived in a correct format. We use a simpler parameterization of climate effects in our emulator to expand its applicability.

We use the GHG concentration and IGSM-CAM simulated meteorological fields from its high-warming “REF” scenario (10 W m⁻² in 2100, resulting in 4.3 °C warming in 2080–2100 vs. 1990–2009) to provide samples across a wide range of global warming and GHG concentration from 2000–2100. While climate change can affect PM_2.5 through pathways other than simply warming (e.g. precipitation, regional stagnation, mixing depth) (Jacob and Winner, 2009), changes in meteorological variables due to well-mixed GHG forcing can usually be parameterized as spatially-varying functions (“patterns”, which are specific to individual climate models) of global mean temperature (e.g. Lütjens et al., 2025), which is a function of total radiative forcing by GHG (with climate sensitivity specific to each climate/Earth system model).

Therefore, the effects of GHG-forced climate change on PM_2.5 can be parameterized by total radiative forcing by GHG (TRF), which largely simplifies the statistical modeling and increases its applicability by not requiring meteorological variables as inputs. This implies the relation between GHG-forced climate change and PM_2.5 can be statistically learned by regression algorithms at each grid cell, when TRF is included as one of the input variables.

In our emulator, we further parameterize TRF as atmospheric CO₂ concentration. In many climate scenarios, CO₂ is projected to dominate (68 %–85 %) TRF and its trend in the 21st century (Meinshausen et al., 2020). In addition, atmospheric CO₂ concentration also directly affects isoprene emission, which could affect atmospheric oxidant (e.g. OH, O₃) (e.g. Tai et al., 2013), and therefore potentially secondary inorganic aerosol formation.

Parameterizing climate effects as TRF/CO₂ concentration allows our statistical model to include climate effects without explicitly requiring meteorological fields as input, which makes our emulator easy to integrate within the workflow of integrated assessments and ensemble modelling/emulation. However, this introduces some potential sources of systematic errors: (1) atmospheric CO₂ concentration can misrepresent TRF under climate scenarios where the trend of CO₂ emission is decoupled with the trends of other GHG emissions; (2) the assumption of time-invariant local relationship between global mean temperature/TRF and local climate variables breaks down under overshoot scenarios and over locations with strong changes in local forcing (e.g. aerosol) and energy balance (e.g. albedo feedback, land use and land cover change) (Giani et al., 2024). While the influence of pollutant emissions on PM_2.5 under these scenarios can still be properly represented by our statistical model, the result from our emulator should be interpreted more cautiously under these types of climate scenarios. More advanced methods to parameterize climate effects (e.g. using cumulative GHG emissions, combining information from climate emulators) could be further explored in future work.

2.3.2 Generating training dataset through GCHP-CAM perturbation experiments

We generate 120 sets of scaling factors for the 9 input variables (range displaced in Table 1) following a Latin Hypercube Sampling (LHS) (McKay et al., 1979) strategy. However, the changes in emissions of different pollutants are correlated, since different air pollutants often share similar emission sources (e.g. combustion). To account for co-emissions of air pollutants, we calculate the spatial correlations of the emissions of the 7 air pollutants from CEDS between 2000 and 2017. The correlation of CH₄ and global warming with other variables are set to be 0 to provide independence between climate and air pollution control policies. We then use the Iman–Conover Transform (Conover and Iman, 1982) to impose the correlation matrix for the independent and uncorrelated pairs of LHS scaling factors. The Iman–Conover Transform first transforms the sample pairs to an approximately multivariate normal distribution. Then Cholesky decompositions are used to impose the correlation matrix on the distribution, resulting in a matrix that can be applied to rearrange the sample pairs by ranking. This results in correlated pairs of scaling factors (Fig. 2) that allow us to focus on sampling the more probable parts of the input space (due to co-emissions), while preserving the marginal distributions of individual variables (i.e. uniform distribution over their respective ranges).

Table 1Range of scaling factors and CO₂ concentration of the perturbation experiments. The range of atmospheric CO₂ concentration is derived from the range of CO₂ concentration between 2000–2100 under the “REF” scenario.

Download Print Version | Download XLSX

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f02

Figure 2Schematic of generating the training set by GCHP-CAM perturbation experiments using an Iman–Conover (IC) transformed Latin Hypercube Sampled (LHS) scaling factors. The orange box represents existing modelling systems, purple boxes represent data sets, and blue boxes represents output of the perturbation experiments.

Download

Figure 2 summarizes the workflow of the perturbation experiments. A 1-year perturbation simulation, again with an extra 3 months before as spin-up, is performed for each pair of global scaling factors applied to 2014 anthropogenic air pollutant emissions and surface CH₄ concentration. The CO₂ concentration is directly applied to calculate isoprene inhibition, and the corresponding climate effects are represented through driving the simulation with the IGSM-CAM simulated meteorological data from the year with the closest CO₂ concentration under “REF” scenario (e.g. a perturbation experiment having a CO₂ concentration of 446 ppm is driven by the IGSM-CAM simulated meteorological data at 2030 under “REF” scenario, as 2030 has the closet CO₂ concentration to 446 ppm among all years under “REF” scenario). The GCHP-CAM output we aim to emulate – changes in annual mean anthropogenic PM_2.5 concentration relative to 2014 baseline (ΔPM_2.5) – is calculated for each perturbation experiment as the training data set.

2.4 Training the Geographically Weighted Gaussian Process Regression (GW-GPR) emulator

We use Gaussian Process Regression (GPR) (Williams and Rasmussen, 1995) to relate the changes in pollutant emissions and climate with the corresponding changes in annual mean PM_2.5 concentration at grid cell scale, because of its effectiveness in handling non-linearity, good performance with small training set, and quantifying predictive uncertainties. GPR is a non-linear and non-parametric regression algorithm that does not require prior assumptions of the functional relationship between input and output variables. Instead, predictions are made by assuming the training and prediction sets follow a joint multivariate normal distribution (N):

\begin{matrix} (3) & (\begin{array}{c} Y_{1} \\ Y_{2} \end{array}) = N (\begin{array}{c} μ_{1}, Σ_{11}, Σ_{12} \\ μ_{2}, Σ_{21}, Σ_{22} \end{array}) \end{matrix}

where Y₁ is the random variable representing the prediction, and Y₂ is the random variable representing the response from training set. μ₁ and μ₂ are their means, and Σ₁₁, Σ₁₂, Σ₂₁, Σ₂₂ are the covariance matrix blocks. The prediction process can be viewed as finding ( $Y_{1} | Y_{2} = a) \sim N (μ^{'}, Σ^{'})$ , where a is the model output from training set. Thus, the mean (μ^′) and variance (Σ^′) of the prediction can be calculated as:

\begin{array}{l} (4) & μ^{'} = μ_{1} + Σ_{12} Σ_{22}^{- 1} (a - μ_{2}) \\ (5) & Σ^{'} = Σ_{11} - Σ_{12} Σ_{22}^{- 1} Σ_{21} . \end{array}

By setting the prior mean of prediction as 0 and proper normalization during training process, μ₁ and μ₂=0. Then μ^′ is essentially a sum of a weighted by the correlations between the training and prediction vectors ( $Σ_{12} Σ_{22}^{- 1}$ ). The elements of the correlation matrix take the form:

\begin{matrix} (6) & σ_{i j} = k (θ_{i} θ_{j}) \end{matrix}

where θ_i and θ_j are the input vectors at each training (i) or prediction (j) point, and k is the covariance function that can be chosen to control the shape and smoothness of the prediction. We use a sum of anisotropic (in the input space, not the physical distance described below) functions to represent the nature of our problem. These are smooth functions (rational quadratic function) with unknown points of chemical regime change + local interactions among variables (Matern 3/2 function) + noise from climate variability (white noise). “Training” the GPR essentially means optimizing the parameters of the covariance function k against the training data set.

We use the GPR as implemented in Scikit-learn version 1.3.2 (Pedregosa et al., 2011), and only train a GPR model for each populated (population density >1 person per km²) model grid cell. As the predictions are random variables, the uncertainty and confidence interval of each prediction can be calculated its standard deviation.

To emulate the process of chemical transport of emitted species, an isotropic 2D geographic weighting scheme is applied to calculate the effective air pollutant emission changes ( $Δ E_{weighted, x, i}$ ) at each grid cell x for each pollutant i:

\begin{matrix} (7) & Δ E_{weighted, x, i} = \sum_{y_{k}} e^{- d_{y, x}^{2} / 2 L_{i}^{2}} Δ E_{y, i} \end{matrix}

where ΔE_y,i is the emission change of pollutant i at individual grid cells considered within the dispersion range y_k (set to be within of 10 L_i of x for computational efficiency), d_y,x is the distance between grid cell y and x, and L_i is the dispersion length scale for pollutant i. Formally speaking, this implies at each grid cell x, our GW-GPR framework (f_x) predicts GCHP-CAM simulated ΔPM_2.5 using ΔE_weighted,x (the vector of $Δ E_{weighted, x, i}$ for all pollutants), and atmospheric CH₄ and CO₂ concentration as input:

\begin{matrix} (8) & Δ {PM}_{2.5} (x) = f_{x} (Δ E_{weighted, x}, {CH}_{4}, {CO}_{2}) . \end{matrix}

The geographic weighting scheme is implemented by the Gaussian Blurring algorithm as in Scipy version 1.10.1 (Virtanen et al., 2020). The input variables are normalized by their corresponding global maximum value after the geographic weighting. Since the output variables are not geographically weighted, and μ₂=0 simplifies computation, the output variables are normalized by local mean and maximum at each grid cell. We note that some previous regional studies (e.g. Pisoni et al., 2017) have treated the parameters of the geographic weighting scheme as optimizable hyperparameters. However, our GCHP-CAM experiments are conducted with uniform global scaling factors for emission fields. After the variable normalization procedure, training configurations with different geographic weighting scheme would effectively collapse to the same 120 sets of global scaling factors prescribed in the GCHP-CAM experiments. Therefore, our training set cannot be used to directly optimize L_i.

Instead, we choose a globally uniform set of L_i as an approximation: $L_{{NO}_{x}}$ , $L_{{NH}_{3}}$ and L_NMVOC=1 grid cell (=2° latitude × 2.5° longitude); $L_{{SO}_{2}}$ , L_BC and L_OC=2 grid cells; L_CO=3 grid cells.

To understand the utility of non-linear regression techniques, we conduct an experiment by training Multiple Linear Regressors (MLR) (instead of GPR) to represent f_x using identical input. We find that GPR increases the accuracy of the emulator over MLR (particularly over regions where the changes in NO_x and SO₂ versus NH₃ emissions are large enough to trigger non-linear responses in secondary inorganic aerosol formation) without incurring large increase computing resources required during the prediction process, therefore justifying the use of GPR over MLR. More details of the comparison are shown in Sect. 3.3.1

In addition, we train the GW-GPR emulator by choosing another set of globally uniform L_i based on the typical atmospheric lifetime of individual pollutants, which results in larger L_i for most pollutants. However, we find that such set of L_i increases the computing cost of the Gaussian blurring without providing improvements in emulating ΔPM_2.5. Therefore, we retain the choice of our original set of relatively small L_i, which provides enough distinction of dispersion length scales of different pollutants without invoking considerable additional computational cost. We also further explore the associated uncertainties by training the GW-GPR emulator with halved and doubled L_i. The result of all sensitivity simulations, and their implications on the accuracy and limitations of the emulator are discussed in Sect. 3.3.2.

2.5 Cross validation of the GW-GPR emulator

We evaluate the generalization ability of our models using the repeated random sub-sampling technique. For each repetition (Fig. 3), we randomly split the data into training (80 %) and testing (20 %) sets. New GW-GPR models are built from the synthetic training set and predictions are made over the synthetic testing set at grid cell scale. This process is repeated 10 times, and the performance metrics are calculated using all 10 synthetic testing sets and corresponding predictions. In addition, we perform Sobol global sensitivity analysis (Sobol', 2001) by drawing 1024 × (2 × number of input variables + 2) samples to compute the total sensitivity indices for each input variables using Saltelli sampling (Saltelli et al., 2010) at each grid cell using SALib 1.4.8 (Herman and Usher, 2017; Iwanaga et al., 2022), which helps us identify the importance of each input variable for different locations. The results from the cross validation and sensitivity analysis are shown in Sect. 3.2.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f03

Figure 3Schematic of the 10-fold random subsampling cross-validation procedure. At each “fold”, 80 % of the samples (96 runs) are used to train the emulator to predict the result from the other 20 % of the samples (24 runs). The prediction is then evaluated against the GCHP-CAM output ΔPM_2.5 for those 24 runs.

Download

2.6 Testing the emulator with IGSM-GAINS-TAPS combined air quality and climate legislation scenarios

To evaluate the emulator within the context of integrated assessment modelling, we evaluate the ability of the emulator in reproducing GCHP-CAM output anthropogenic PM_2.5 over 2 climate (Current Trend (CT) and Accelerated Action (AA), both generated from MIT IGSM) (Paltsev et al., 2023) ×2 air pollution control – Current Legislation (CLE) and Maximum Feasible Reduction (MFR), in total 4 scenarios (CT_CLE, CT_MFR, AA_CLE, AA_MFR) in 2050 (Fig. 4). CT assumes the implementation of Nationally Determined Contributions (NDCs) from Paris Agreement through 2030. Under CT, climate is not stabilized, and global mean temperature continues to increase. AA assumes the extension of these initial NDCs to align with the long-term goal of Paris Agreement, providing the ability to limit and stabilize anthropogenic warming to 1.5 °C at 2100 with at least 50 % probability. The CLE scenario assumes complying with existing region- and source-specific air pollutant emission limits, while the MFR scenario assumes increasing deployments of currently available lowest-emitting technologies.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f04

Figure 4Schematic of GW-GPR model evaluation using IGSM-GAIN-TAPS climate and air pollution scenarios. Orange boxes represent existing modelling systems, purple boxes represent data sets. Yellow represents our newly developed emulator (GW-GPR). Green and red arrows represent how GCHP-CAM and GW-GPR predicts changes in PM_2.5, respectively.

Download

The projected trends of air pollutant emission intensities under CLE and MFR scenarios are from Greenhouse gas-Air pollution INteractions and Synergies (GAINS) model (Amann et al., 2011), based on GAINS4/ECLIPSE (Evaluating the Climate and Air Quality Impacts of Short-lived Pollutants) v6b data (GAINS Developer Team, 2021; Klimont et al., 2017; Smith et al., 2020; Stohl et al., 2015). Future air pollutant emissions for each scenarios can be derived through the Tool for Air Pollution Scenarios (TAPS) (Atkinson et al., 2022a) by considering climate (fuel consumption) and air pollution (emission intensities) policies independently. The resulting regional (region definition shown in Fig. 5) and global air pollutant emissions of each of the 4 combined scenarios are shown in Fig. 6. The 4 scenarios have distinct emission scaling factors at each region, with less than 1 % of the region-scenario combinations having emission changes > 100 % (upper limit of training set). With these scenarios, we can test whether the emulator (trained on GCHP-CAM runs with globally uniform emission scaling factors) can perform reasonably under spatially heterogenous emission scaling factors, and within the intended range of emission changes.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f05

Figure 5IGSM region definition.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f06

Figure 6Total and regional air pollutant emissions for the four IGSM-GAINS-TAPS scenarios. Each row represents a climate scenario (Current Trend, CT; and Accelerated Actions, AA), and the emissions at 2014, and 2050 under Current LEgislation (CLE) and Maximum Feasible Reduction (MFR) air pollution control scenarios are shown for each species.

Download

We perform 10 years of GCHP-CAM simulations for each of the four IGSM-GAINS-TAPS scenarios with their respective anthropogenic air pollutant emissions, and CH₄ and CO₂ concentrations in 2050. The simulations are driven by IGSM-CAM meteorological fields from the “REF” scenario, with meteorological years (2031–2041 for AA and 2040–2050 for CT) chosen to match the CO₂ concentration and TRF (i.e. AA has similar CO₂ concentration and TRF at 2050 with “REF” over 2031–2041, and CT has similar CO₂ concentration and TRF at 2050 with “REF” over 2040–2050). The same GCHP-CAM input anthropogenic air pollutant emissions, and CH₄ and CO₂ concentrations in 2050 are fed into the GW-GPR emulator to estimate ΔPM_2.5 under each scenario, which is then compared to the multiannual mean ΔPM_2.5 simulated by GCHP-CAM (Sect. 3.3).

2.7 Demonstrating the utility of the emulator using AerChemMIP data

To demonstrate the utility of the emulator, particularly under global change scenarios, we also compare the output of the GW-GPR emulator with the output from the Aerosol Chemistry Intercomparison Project (AerChemMIP) under 2 Shared Socio-economic Pathways (SSP)-based scenarios: (1) the standard SSP3-7.0 “Regional Rivalry” scenario (radiative forcing = 7.0 W m⁻² at 2100); and (2) a variant of SSP3-7.0 with same socio-economic assumptions as the standard SSP3-7.0, but stronger air quality control measures, resulting in lower emissions of Near Term Climate Forcers (SSP3-7.0-lowNTCF) (Fujimori et al., 2017). AerChemMIP (Collins et al., 2017) is endorsed by the Coupled-Model Intercomparison Project 6 (CMIP 6) to quantify the impacts of aerosols and chemically reactive gases on climate. There are minimum model complexity requirements (atmosphere-ocean general circulation model with tropospheric aerosols driven by pollutant emission fluxes) to participate in AerChemMIP ensemble.

We calculate the anthropogenic PM_2.5 (sum of sulphate, nitrate, ammonium, BC and OC) from AerChemMIP archive in an identical manner as for GCHP-CAM (Eq. 2). We only include ensemble members with the output of all five anthropogenic PM_2.5 components available. The models selected and numbers of realizations included for each model are summarized in Table 2.

Table 2The numbers of ensemble members with all 5 anthropogenic PM_2.5 components (sulphate, nitrate, ammonium, BC, OC) available for SSP3-7.0 and SSP3-7.0-lowNTCF scenarios from the AerChemMIP archive.

Download Print Version | Download XLSX

The emulator calculates the decadal changes in anthropogenic PM_2.5 concentrations relative to 2020 over 2030–2090, using surface air pollutant emissions (calculated as anthropogenic + open burning emissions) and GHG concentration (Meinshausen et al., 2020) provided by the Input4MIP repository. As air pollutant emissions are provided every 10 years, one emulator prediction is done per decade. For each ensemble member in the AerChemMIP archive, the corresponding decadal changes in anthropogenic PM_2.5 are calculated by comparing the decadal average anthropogenic PM_2.5 with that of the first decade (2015–2024). The model-specific decadal changes in anthropogenic PM_2.5 are then calculated by averaging the result from all the ensemble members from the corresponding model. AerChemMIP and Input4MIP data are retrieved via the search engine of Earth System Grid Federation (ESGF) (https://aims2.llnl.gov, last access: 26 August 2024). The comparison between emulator predicted and AerChemMIP output trends in anthropogenic PM_2.5 will be discussed in Sect. 4.

2.8 Health impact calculation

For the four climate and air pollution control scenarios, we also estimate the impacts of changes in anthropogenic PM_2.5 on public health through premature mortalities. GCHP and emulator output are upsampled from 2°×2.5° to 0.5°×0.5° using the nearest neighbour algorithm, which matches the horizontal resolution of the age-specific population data we use (Gridded Population of the World version 4.11, last access: 19 April 2024) (Center For International Earth Science Information Network-CIESIN-Columbia University, 2018). Country-scale baseline age- and cause-specific mortality rates are provided by the World Health Organization (WHO) (WHO, 2018). The age- and cause-specific changes in the annual mortality due to chronic PM_2.5 exposure for scenario i (ΔMort_i) is calculated from the relative mortality risks under the baseline (total PM_2.5 from the 2014 baseline run) (RR_base) and each scenario i (RR_i):

\begin{matrix} (9) & Δ {Mort}_{i} = {Mort}_{base} (\frac{{RR}_{i} - {RR}_{base}}{{RR}_{base}}) \end{matrix}

where Mort_base is the age- and cause-specific mortalities from the WHO.

We use the age-specific non-linear Concentration Response Functions from the Global Exposure Mortality Model (Burnett et al., 2018) to calculate RR_i and RR_base for non-communicable diseases and lower respiratory infections attributable to outdoor PM_2.5 pollution. Since our emulator focuses on the changes and corresponding impacts in anthropogenic PM_2.5, the synthetic PM_2.5 concentration for scenario i at each grid cell is calculated as PM $_{2.5, base} + Δ$ PM_2.5,i, where PM_2.5,base is the modelled total PM_2.5 in year 2014, and ΔPM_2.5,i is the modelled/emulated change in anthropogenic PM_2.5 for scenario i.

3 Evaluation of the emulator

In this section we discuss and explain the performance of our GW-GPR emulators against GCHP-CAM simulations measured by the response of anthropogenic PM_2.5 pollution and associated premature mortalities.

3.1 Computing resource requirement

GCHP-CAM requires 2400–3000 CPU hours (Intel Xeon Processor E5-2679A v4, processor base frequency = 2.6 GHz) to simulate PM_2.5 for each 1-year run. The one-time operation of fitting the GW-GPR emulator requires 280 CPU hours (Intel Xeon Processor E5-2670, processor base frequency = 2.6 GHz). Once trained, the emulator requires approximately 10 CPU seconds to generate global PM_2.5 predictions for one scenario (Intel Xeon Processor Silver 4214R, processor base frequency = 2.4 GHz). This demonstrates the magnitude of the speed up offered by the emulator.

3.2 Emulator cross validation and sensitivity

In this sub section, we discuss the result of the cross validation and sensitivity test outline in Sect. 2.5. Figure 7 shows the result grid cell by grid cell comparison of changes in annual mean anthropogenic PM_2.5 (ΔPM_2.5) predicted by the GW-GPR emulator against that simulated by its parent model (GCHP-CAM) across all the data points generated by the random subsampling. The GW-GPR emulator can predict ΔPM_2.5 GCHP-CAM with reasonable accuracy (R²=0.97, mean absolute error (MAE) = 0.356 µg m⁻³) and minimal overall bias (mean bias (MB) = −0.012 µg m⁻³). Figure 8 shows the spatial distribution of grid cell scale MAE of the GW-GPR emulator, and the emulator output standard deviation from Eq. (4) (which can characterize the predictive uncertainty of emulator output). The largest MAE is found over Northern China and Northern India (up to 5 µg m⁻³), where the anthropogenic PM_2.5 and precursor emissions are very high in the base year of 2014. The emulator output standard deviation have similar magnitudes and spatial distributions (spatial R²=0.99) as MAE, confirming that emulator output standard deviation is an appropriate measure of the uncertainties of emulator predictions relative to the parent model.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f07

Figure 72D histogram from the grid cell by grid cell comparison between changes in annual mean anthropogenic PM_2.5 (ΔPM_2.5) predicted by the GW-GPR emulator (ΔPM_2.5, GW-GPR) and that simulated by GCHP-CAM (ΔPM_2.5, GCHP) from the 10-fold random sub-sampling cross-validation.

Download

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f08

Figure 8The mean absolute error (MAE) of emulator prediction against the parent model (GCHP-CAM), and the average standard deviation of emulator predictions from Eq. (4) (indicative of predictive uncertainties) at grid cell scale (240 data points at each grid cell).

Figure 9 shows the normalized Sobol Total Sensitivity Indices of the GW-GPW emulator to each of the input variables (in the unit of fractional rather than absolute changes), which measure how much each input variable is responsible for the variance in the output over the whole domain of input data, including the interaction among variables. In other words, the Sensitivity Indices indicate how important the specific input variable is in controlling the output. The importance of input variables is spatially heterogenous. Over North America and Europe, ΔPM_2.5 is mostly sensitive to SO₂ (46 %–57 % of total sensitivity index) and NH₃ (28 %–32 %), and to a lesser extent NO emissions (10 %–14 %). The pattern of total sensitivity indices over India and China are similar (9 %–13 % for NO, 21 %–37 % for NH₃ and 36 %–57 % for SO₂), but the sensitivity of ΔPM_2.5 to OC (13 % vs. 3 %–7 % over North America and Europe) is higher over these regions. For most of the rest of the northern hemisphere, ΔPM_2.5 is primarily sensitive to SO₂ emissions (e.g. >80 % over Mexico and Middle East). Over the southern hemisphere, ΔPM_2.5 remains highly sensitive to SO₂ emissions (47 % over Indonesia – 76 % over Brazil). In Brazil, Indonesia, and Africa, ΔPM_2.5 is also sensitive to OC emissions (15 %–38 %). Sensitivity of ΔPM_2.5 to BC is relatively low globally (mean = 0.006 over the globe). This is because BC is largely co-emitted with OC, while the OC emissions are always around 1–2 times larger than BC emission by mass. Thus, the variance attributable to BC is mostly captured by the variance attributable to OC. The sensitivity index of CO₂, CH₄, VOC and CO are also relatively low (<3 %) globally, except over the certain regions with low anthropogenic emissions (tropical Pacific islands, edge of the Amazon and central African rainforests), reflecting the fact that our emulator does not consider SOA in our definition of anthropogenic PM_2.5.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f09

Figure 9Spatial patterns of Sobol Total Sensitivity Indices (0–1) for each predictor for ΔPM_2.5. The indices indicate the fraction of output variance attributable to each input variables at each grid cell.

3.3 Comparing emulator performance for IGSM-GAINS-TAPS scenarios with GCHP-CAM

Figure 10 shows the GCHP-CAM and GW-GPR (emulator) output ΔPM_2.5 (2045–2054 mean vs. 2014) over each IGSM-GAINS-TAPS scenario at grid cell scale. The global performance metrics are shown in Table 3. Generally, the emulator performs comparably to that in the random subsampling evaluation (R²=0.94–0.99, MAE = 0.20–0.42 µg m⁻³). 58.7 % (82.9 %)–84.8 % (96 %) of the grid cells have emulator predictions agreeing with GCHP-CAM within 1 (2) standard deviation of emulator output respectively (computed with Eq. 5).

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f10

Figure 10Spatial patterns of GCHP-CAM and emulator predicted ΔPM_2.5 for each of the 4 IGSM-GAINS-TAPS scenarios at 2050 (relative to 2014). Only results in grid cells with population density > 1 person per km² are shown. The hatches show where GCHP-CAM output does not fall within the 95 % confidence interval of emulator prediction.

Table 3Gaussian Process Regression (GPR) and multilinear regression (MLR) emulator performance metrics (spatial coefficient of determination (R²), mean absolute error (MAE), mean bias (MB), computed at grid cell scale, N=2803) for each IGSM-GAINS-TAPS scenarios, relative to GCHP-CAM output. The rightmost column indicates the percentage of grid cells that the GPR emulator prediction agrees with GCHP-CAM output within 1 (2) standard deviation (prediction uncertainty of emulator).

Download Print Version | Download XLSX

When the predicted spatial distributions of ΔPM_2.5 are converted into premature mortalities using the GEMM CRF, we find that the GCHP-CAM and emulator output produce similar impacts on global premature mortalities over the 4 IGSM-GAINS-TAPS scenarios tested (differences within 1.2 %) that agree within the range of uncertainty due to the GEMM CRF parameters (Fig. 11). This shows the emulator's ability to reproduce both the magnitudes and spatial distributions of ΔPM_2.5 from GCHP-CAM, and the suitability of emulator output for public health impact calculation at global scale.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f11

Figure 11Changes in global annual premature mortality attributable to PM_2.5 exposure under each of the four scenarios between 2050 and 2014, calculated from the emulator and GCHP-Chem output ΔPM_2.5. The error bars represent the uncertainties due to GEMM CRF parameters, calculating by applying the 2.5 and 97.5 percentile estimate of the GEMM CRF parameters.

Download

In general, the emulator performs best over the western hemisphere (longitude < −20°), where the emulator error is within 2 µg m⁻³ (MAE = 0.1 µg m⁻³), and 77.4 % of emulator predictions agree with GCHP-CAM output within 1 emulator output standard deviation. In contrast, the emulator output shows consistent high bias of up to 2 µg m⁻³ over the 4 scenarios over the Sahel. Around the Bay of Bengal, emulator output does not agree with GCHP-CAM output with 1 emulator output standard deviation in a large portion of grid cells. In the subsections below, we will explore the potential sources of error by comparing the result presented above with that from alternative emulator architectures.

3.3.1 Comparison with linear model

For comparison, we train a multilinear regression (MLR) emulator with identical variables, geographic weighting and normalization schemes, and the performance metrics of the multilinear emulator is also shown in Table 3. In all scenarios, the MLR estimator has a larger global MAE than GPR (by 0.05 µg m⁻³ (19 %) in CT_CLE to 0.10 µg m⁻³ (40 %) in AA_CLE). However, in 3 out of 4 scenarios tested (except AA_MFR), the GPR emulator has higher MB than the MLR emulator, though the overall magnitudes of MB remain relatively small (within 0.1 µg m⁻³).

Figure 12 shows the spatial distribution of MLR and GPR error relative to their parent model (GCHP-CAM), and Fig. 13 shows the difference in absolute values of such errors between MLR and GPR. In the 4 IGSM-GAINS-TAPS scenarios, GPR predictions have less absolute error relative to the parent model than MLR in 57.6 % (CT_CLE) to 66.4 % (AA_MFR) of the grid cells. In all 4 scenarios, GPR outperforms MLR over the US (MAE = 0.05–0.14 µg m⁻³ for GPR vs. 0.17–0.29 µg m⁻³ for MLR), western and southern Europe (MAE = 0.05–0.06 µg m⁻³ for GPR vs. 0.20–0.26 µg m⁻³ for MLR), Middle East (MAE = 0.16–0.56 µg m⁻³ for GPR vs. 0.40–0.72 µg m⁻³ for MLR), South America (MAE = 0.08–0.14 µg m⁻³ for GPR vs. 0.14–0.16 µg m⁻³ for MLR), and South Asia.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f12

Figure 12GPR and MLR emulator errors relative to GCHP-CAM simulated ΔPM_2.5 over the 4 IGSM-GAINS-TAPS scenarios at 2050 (relative to 2014). Both emulators use the same geographic weighting scheme for pollutant emissions.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f13

Figure 13Difference in the absolute error (relative to GCHP-CAM output) between MLR and GPR emulation. Red (positive) indicate GPR is more accurate than MLR at the given grid cell, while blue (negative) indicates the opposite.

To further understand the utility of non-linear emulation, we analyse the shifts in the chemical regime of secondary inorganic aerosol formation calculating the Gas Ratio (GR) (Paulot and Jacob, 2014) over China at baseline year and under all 4 scenarios (Fig. 14):

\begin{matrix} (10) & GR = \frac{[{NH}_{3}] + [{NH}_{4}^{+}] - 2 [{SO}_{4}^{2 -}]}{[{HNO}_{3}] + [{NO}_{3}^{+}]} . \end{matrix}

GR < 0 indicates that secondary inorganic PM_2.5 is weakly sensitive to NH₃ emissions through adding NH $_{4}^{+}$ to existing SO $_{4}^{2 -}$ and HSO $_{4}^{-}$ ions. 0 < GR < 1 indicates that there is enough NH₃ to react with SO $_{4}^{2 -}$ , such that NH₃ and HNO₃ start partitioning into NH₄NO₃ aerosol, leading to strong sensitivity of secondary inorganic PM_2.5 to NH₃ emissions. In this regime, secondary inorganic PM_2.5 is more sensitive to NH₃ emissions. GR > 1 indicates that there is more than enough NH₃ to react with both SO $_{4}^{2 -}$ and HNO₃, and PM_2.5 sensitivity to NH₃ emissions will weaken continuously as GR keep increasing beyond 1 (Ansari and Pandis, 1998).

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f14

Figure 14Gas ratio (GR) over China, which indicate secondary inorganic PM_2.5 sensitivity to NH₃ emissions. Secondary inorganic PM_2.5 is weakly sensitive to NH₃ when GR < 0.0 < GR < 1 indicate stronger sensitivity of secondary inorganic PM_2.5 to NH₃ emissions. When GR > 1, sensitivity of secondary inorganic PM_2.5 to NH₃ emissions decreases as GR increases.

In northern China, GPR is slightly less accurate than MLR on average under the MFR scenarios (by 0.31 µg m⁻³ under AA_MFR and 0.38 µg m⁻³ under CT_MFR, measured by regional MAE), but considerably more accurate under CLE scenarios (by 2.01 µg m⁻³ under AA_CLE and 0.81 µg m⁻³ under CT_CLE). At the baseline year, GR over Northern China is largely between 0–1. Under all four scenarios, GR increases beyond 1 over northern China. However, the increases in GR are the strongest under AA_CLE, followed by AA_MFR, while CT_CLE and CT_MFR have lower GR than the two AA scenarios. This indicates stronger shifts in secondary inorganic PM_2.5 sensitivity to precursor emissions relative to the baseline year (and therefore more non-linearity) under the two AA scenarios (especially AA_CLE) than the two CT scenarios, which is more well-captured by GPR than MLR.

The results in this sub-section show that GPR generally outperforms MLR. When emission changes could potentially trigger non-linear aerosol chemistry, non-linear emulators can be significantly more accurate than linear emulators. This justifies the use of non-linear regression techniques (e.g. GPR) in developing air quality emulators, especially given that GPR only requires 25 % more runtime than MLR.

3.3.2 Sensitivity to dispersion length scales (L)

In addition to regression techniques, we also conduct 4 sensitivity tests of altering the dispersion length scales for individual pollutants i (L_i): (1) L_i=0 (no geographic weighting); (2) halving L_i, (3) doubling L_i, (4) directly using atmospheric lifetime of pollutant to approximate L_i. Figure 15 shows the changes in absolute error of emulator prediction when the geographic weighting scheme is disabled (i.e. L_i=0). Turning off the geographic weighting scheme worsens the performance of the emulator, increasing the global MAE by between 0.31 µg m⁻³ (AA_MFR) and 0.88 µg m⁻³ (CT_CLE), and locally absolute error by up to 29.1 µg m⁻³. This decline in model performance is much larger than that by switching from GPR to MLR, indicating the necessity of the geographic weighting scheme in our emulator.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f15

Figure 15Changes in absolute error (relative to GCHP-CAM output ΔPM_2.5) when no geographic weighting of pollutant emissions is implemented. Red (positive) indicates that turning off dispersion worsens the performance (increasing error), blue (negative) indicates the opposite.

Figure 16 shows the changes in absolute error of emulator predictions when L_i are halved. Halving L_i increases global MAE at all the scenarios by 0.05 µg m⁻³ (AA_MFR) to 0.21 µg m⁻³ (CT_CLE), especially over the Sahel and northern India, where absolute error increases by up to 13.2 µg m⁻³. Figure 17 shows changes in absolute error of emulator predictions when L_i are doubled. Doubling L_i leads to minor changes in global MAE across the 4 scenarios (−0.015 µg m⁻³ under CT_CLE to +0.015 µg m⁻³ under CT_MFR). The geographic pattern of emulator performance changes is similar across different scenarios. After doubling the dispersion length scale, the emulator performs better over the Sahel, China and Indochina by up to 4.8 µg m⁻³ locally, but worse over India, where the regional MAE increases by 0.28 µg m⁻³ (AA_MFR) to 1.01 µg m⁻³ (CT_CLE), and local error increases by up to 4.05 to 9.67 µg m⁻³ over the 4 scenarios.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f16

Figure 16Changes in absolute error (relative to GCHP-CAM output ΔPM_2.5) when the dispersion length scales (L_i in Eq. 7) for all pollutants in the geographic weighing scheme are halved. Red (positive) indicates that turning off dispersion worsens the performance (increasing error), blue (negative) indicates the opposite.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f17

Figure 17Changes in absolute error (relative to GCHP-CAM output ΔPM_2.5) when the dispersion length scales (L_i in Eq. 7) for all pollutants in the geographic weighing scheme are doubled. Red (positive) indicates that turning off dispersion worsens the performance (increasing error), blue (negative) indicates the opposite.

In the final sensitivity experiment, we assume L_i (in km) to be approximately equal to 10–20× atmospheric lifetime of pollutant i (τ_i) (Li and Cohen, 2021). Anthropogenic NO_x and NH₃ have very short τ (within a few hours) (Dammers et al., 2019; Lange et al., 2022). Also, HNO₃ (the main oxidized form of NO_x) (Muller et al., 1993) and NH₃ (Schrader and Brümmer, 2014) deposit rapidly. However, the secondary inorganic aerosol formed from anthropogenic NO_x and NH₃ have longer τ (3–5 d) (Bian et al., 2017). Balancing these two factors, we assume $L_{{NO}_{x}}$ and $L_{{NH}_{3}} = 1$ grid cell. SO₂ has longer τ (4–12 h for point sources (Fioletov et al., 2015), and 1–1.5 d at regional and global scale (Chen et al., 2025; Hardacre et al., 2021; Lee et al., 2011)) and lower deposition velocity (Hardacre et al., 2021) than NO_x and NH₃. Therefore, we assume $L_{{SO}_{2}} = 3$ grid cells. Anthropogenic NMVOC that has the most significant contribution to photochemistry and oxidation chemistry (e.g. xylene, toluene, ethylene, propylene) (Gu et al., 2021; Ran et al., 2011) typically have τ of a few hours to 2 d (Franco et al., 2022; Tiwari et al., 2010; Trentmann et al., 2003). Therefore, we assume L_VOC=2 grid cells. As recent studies suggests that τ_BC<5 d (Lund et al., 2018), and τ_BC≈τ_OC (Gao et al., 2022), we assume of $L_{BC} = L_{OC} = 7$ grid cells. As τ_CO is at the order of months (Khalil and Rasmussen, 1990), we choose L_CO=15 grid cells to avoid excess processing time by the geographic weighting scheme.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f18

Figure 18Changes in absolute error (relative to GCHP-CAM output ΔPM_2.5) when an alternate set of L_i ( $L_{{NH}_{3}} = L_{{NO}_{x}} = 1$ grid cell, L_VOC=2 grid cells, $L_{{SO}_{2}} = 3$ grid cells, $L_{BC} = L_{OC} = 7$ grid cells, L_CO=15 grid cells, informed by the atmospheric lifetime of each pollutant i (τ_i)) is applied to train the GW-GPR. Red (positive) indicates that using the alternate set of L_i worsens the performance (increasing error), blue (negative) indicates the opposite.

Figure 18 shows the changes in absolute error of emulator predictions with the set of L_i described above. The global performance metrics (MAE = 0.21–0.42 µg m⁻³, MB = 0.06–0.1 µg m⁻³) are very similar to the metrics obtained by training the GW-GPR emulation with the default set of L_i. The regional pattern of changes in emulator accuracy is consistent over the 4 scenarios tested. Using this alternate set of L_i reduces the error over southern China by up to 4 µg m⁻³, while increasing the error over western Africa and central Asia by up to 4 µg m⁻³. Over northern India and Bangladesh, the emulator error could locally increase or decrease by up to 5 µg m⁻³. However, the generally larger L_i increases the runtime of the geographic weighting scheme (around 15 s), while only 0.6 s is required to run the geographic weighting scheme using our default choice of L_i. Given the GW-GPR emulator can finish its prediction within 10 s for each scenario, such a large increase in runtime without consistent global improvement in emulator performance is not justified.

The results of these sensitivity tests illustrate that the accuracy of our emulator is sensitive to the choice of L_i. While our choice of a set of globally uniform set of L_i provides a reasonable first-order approximation to emulate pollutant dispersion, performance of the emulator could conceivably be improved by regional, or even grid cell specific L_i. However, this will greatly increase the computing power required to train the model, and potentially require many additional global change scenarios to train and benchmark the model.

4 Evaluating utility: comparison with AerChemMIP ensemble

Figure 19 shows the changes in global population-weighted average anthropogenic PM_2.5 simulated by the models in the AerChemMIP ensemble and GW-GPR emulator over 2030–2090 under SSP3-7.0 and SSP3-7.0-lowNTCF scenarios, relative to the 2015–2024 average. The solid lines represent the mean predictions, and the shaded areas represent the ranges of uncertainty (min/max for AerChemMIP ensemble, 1 standard deviation for emulator). Global population-weighted average ΔPM_2.5 from the emulator is within the range of AerChemMIP ensemble for both SSP3-7.0 and SSP3-7.0-lowNTCF scenarios over 2030–2090. The emulator predicted decadal mean global population-weighted average ΔPM_2.5 falls well within the range and differs by less than 1.15 µg m⁻³ with the mean of AerChemMIP ensemble for all decades under both scenarios.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f19

Figure 19Changes in global decadal mean population-weighted anthropogenic PM_2.5 relative to 2015–2024 mean, predicted by the models in AerChemMIP ensemble and emulator with the shades indicate the range of uncertainty (ensemble range for AerChemMIP, 1 standard deviation for emulator), under standard SSP3-7.0 and the SSP3-7.0 low Near-Term Climate Forcer (SSP3-7.0-lowNTCF) scenario.

Download

Figure 20 shows the multimodel minimum and maximum ΔPM_2.5 simulated by models in AerChemMIP and GW-GPR emulator predicted ΔPM_2.5 in 2050 and 2090 under SSP3-7.0 and SSP3-7.0-lowNTCF scenarios. Output from models in AerChemMIP is conservatively regridded to the same horizontal resolution for comparison. GW-GPR produces similar spatial patterns of ΔPM_2.5 as AerChemMIP models (e.g. large increases and decreases in PM_2.5 over northern China and northern India) in all four scenario-year combinations shown. Over major population centres in the northern Hemisphere (eastern North America, Europe, northern India), GW-GPR emulator predictions of ΔPM_2.5 largely fall within the range of AerChemMIP model output. This contributes to the agreement of global population-weighted average ΔPM_2.5 between GW-GPR emulator and models in AerChemMIP (Fig. 19).

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f20

Figure 20Multimodel minimum and maximum magnitude of ΔPM_2.5 simulated by models in AerChemMIP, and GW-GPR emulator predicted ΔPM_2.5 in 2050 and 2090 under SSP3-7.0 and SSP3-7.0-lowNTCF (abbreviated as lowNTCF in plot labels). Dots in the middle (GW-GPR) column indicate the grid cells where prediction of GW-GPR do not fall between the minimum and maximum of AerChemMIP simulation output.

However, there is systematic disagreement between GW-GPR and AerChemMIP model output over western Africa. Under SSP3-7.0, GW-GPR predicts 13.8 µg m⁻³, while models in AerChemMIP predicts a 4.4–10.7 µg m⁻³ increase in anthropogenic PM_2.5 in 2090 over Nigeria. Under SSP3-7.0-lowNTCF, GW-GPR predicts a 3.0 µg m⁻³ increase, while models in AerChemMIP predict a 1.8–3.0 µg m⁻³ decrease in anthropogenic PM_2.5 in 2090 over Nigeria.

https://gmd.copernicus.org/articles/19/3335/2026/gmd-19-3335-2026-f21

Figure 21Changes in air pollutant emissions (kg m⁻² s⁻¹) in 2090 relative to 2020 over equatorial Africa under SSP3-7.0-lowNTCF.

To explore the potential sources of error and bias of GW-GPR emulator, we conduct more detailed analysis of ΔPM_2.5 over equatorial Africa in 2090 under SSP3-7.0-lowNTCF. Figure 21 shows the changes in OC, BC, NO, SO₂ and NH₃ emissions over equatorial Africa. Over Nigeria, the magnitudes of NO (+145 %), SO₂ (+180 %) and NH₃ (+140 %) emission changes are well beyond the range prescribed in our training set (±100 %), which could lead to failure of the machine learning algorithms. We also recognize GW-GPR has consistent positive biases over GCHP-CAM (Fig. 10) over equatorial Africa that cannot be effectively eliminated by switching to MLR (Fig. 12). This hints that the mismatch between regional pollutant transport patterns and the prescribed geographic weighting scheme could be another possible source of error of GW-GPR over equatorial Africa. Theoretically, the isotropic geographic weighting scheme only emulates of primary pollutants and precursors of secondary pollutants as a purely diffusive process, but not directly emulating the dispersion of secondary pollutants nor the advective components of pollutant dispersion. This might also contribute to the failure of the emulator to predict the changes in regional aerosol background, especially when the emission changes are highly spatially heterogenous.

The comparison between GW-GPR and AerChemMIP output shows that the GW-GPR emulator can generate predictions of ΔPM_2.5 that are within the range of output from mainstream chemistry-climate models at global scale, while requiring much less computational resources (at the order of 10–100 CPU seconds per scenario) to run. This confirms the utility of GW-GPR in predicting the spatiotemporal changes in anthropogenic PM_2.5 exposure under global change scenarios. However, GW-GPR predictions must be interpreted cautiously when the changes in emissions are well beyond that prescribed in the training set (±100 %), or there is large spatial heterogeneity in pollutant emission changes within a region.

5 Discussion

In this work, we apply a classic emulator building workflow (carefully sampling the input space to create samples for training machine learning models) that has been widely applied in engineering (Alizadeh et al., 2020) to build a reduced-form global model of anthropogenic PM_2.5 from a global 3D chemical transport model, GCHP-CAM. Similar techniques (also often choosing Gaussian Process Regression as the machine learning algorithm) have been used for uncertainty analysis and parameter calibration in atmospheric chemistry modelling (Reyes-Villegas et al., 2023; Ryan and Wild, 2021; Wild et al., 2020), and directly emulate air quality models at local and regional scales (Conibear et al., 2021; Vander Hoorn et al., 2022). Our work extends this approach for global change scenarios, where climate change, inter-regional chemical transport and different chemical regimes pose another layer of challenges.

To address these challenges, there are a few unique features of the emulator architecture in comparison to other reduced-form global air quality models. We design the emulator to be usable for a wide range of integrated assessment modelling and policy evaluation, where new scenarios or greenhouse gas concentration and pollutant emissions are routinely generated, but complementary atmospheric simulations are not always available. Therefore, rather than directly using the meteorological fields as input (e.g. Chen et al., 2023), we parameterize anthropogenic climate change intensity as a function of atmospheric CO₂ concentration, and use geographic weighting of pollutant emissions (Pisoni et al., 2018) to approximate the effect of chemical transport. Rather than exploring the source-receptor relationships between pre-defined regions, the emulator is trained at individual grid cell scale. Therefore, the accuracy of the emulator is not affected by different definitions of regions and sub-regional changes in spatial patterns pollutant emissions, which is important for application across different integrated assessment frameworks. This also allow us to tackle the non-linearity in the atmospheric chemical system by exploring more combinations of pollutant emission changes (more efficiently via Latin Hypercube Sampling) and machine learning (via Gaussian Process Regression).

By analysing emulator performance at grid cell scale, we find the GW-GPR emulator successful in reproducing the global and regional changes in PM_2.5 simulated by GCHP-CAM under 4 climate and air quality legislation (IGSM-GAINS-TAPS) scenarios. To evaluate the utility of the emulator, we show that it provides results within the range of full-scale models in the AerChemMIP archive under SSP3-7.0 and SSP3-7.0-lowNTCF scenarios. This shows our emulator can provide reasonable predictions under climate scenarios (in this case, SSP3-7.0) other than that used to train the emulator (i.e. the 10 W m⁻² scenario from IGSM), with the caveat that the magnitudes of climate change (and therefore climate impact on PM_2.5) could be misrepresented under overshoot scenarios, large decoupling between emission trajectories of CO₂ and other important GHGs (e.g. high CH₄ scenarios), and a significant difference in climate response with that simulated by IGSM-CAM (see Sect. 2.3.1). We also find that the emulator may underperform when (1) the magnitude of pollutant emission changes is well beyond that prescribed in the training set (±100 %); (2) the geographic weighting scheme ignores the advective component of pollutant transport, therefore misrepresents region-specific directional pollutant transport patterns; (3) the spatial pattern of pollutant emission changes is highly heterogenous within a region. This points to some potential ways of improving the accuracy of the GW-GPR framework (e.g. fitting anisotropic geographic weighting schemes for each grid cells, expanding the training set), which can be explored in the future.

In addition to the mean, GPR also calculates the standard deviation of the prediction, which can be interpreted to characterize the statistical uncertainty of emulator output. As unforced climate variability directly contributes to the interannual variabilities of PM_2.5, previous studies recommend 10–20 years of averaging to robustly detect the changes in PM_2.5 over the contiguous United States (Brown-Steiner et al., 2018; Garcia-Menendez et al., 2017; Pienkosz et al., 2019), especially when the signal to be detected is smaller or comparable in magnitude to the underlying unforced climate variability. Due to limitations in computing time, we focus on exploring a wider range pollutant emission and climate change by generating each sample in the training set using 1 year of simulation, rather than running multiple years of simulation to generating robust signals amidst unforced climate variabilities for each set of perturbation experiment. This significant source of uncertainty, however, is captured by the uncertainty quantification algorithm of the GPR. Therefore, in addition to the magnitude, our emulator also provides the uncertainties in ΔPM_2.5, which can be important in quantifying the overall uncertainties of health impacts of future air pollution (Saari et al., 2019).

In combination with the emission intensity projections from GAINS, TAPS can translate integrated assessment model output to spatially explicit air pollutant emission inventories. Combining with the GW-GPR emulator (Fig. 2), we can potentially produce grid cell scale projection of anthropogenic PM_2.5 changes for any climate and air quality integrated assessment scenarios within seconds, as demonstrated by our IGSM-GAINS-TAPS emulation exercise. This opens up the possibility for including air quality impacts within climate and sustainability decision making and scientific analysis. As climate projections move towards including scenario design as part of the uncertainty (Guivarch et al., 2022; Lamontagne et al., 2018; O'Neill et al., 2016, 2020), climate and global change scenarios generated will increase by orders of magnitude (Lamontagne et al., 2018; Shindell and Smith, 2019), where the runtime required by reduced-order chemical transport models (a few hundred CPU hours per model year) could be a hurdle for large ensemble modelling, despite their advantage of having higher spatiotemporal resolution than global statistical emulators; and statistical emulators that require meteorological fields as input would not be applicable as 3D climate simulations are too computationally expensive to be conducted for individual scenario. Tools with proper balance between accuracy, computing resource and input data requirements become more important in enabling uncertainty analysis, impact assessment human-Earth system feedback research. Our work shows the potential of machine learning techniques in enabling rapid and accurate global air quality assessment. Future work includes applying the PM_2.5 emulator to study more global change scenarios, representing other important climate-sensitive emission sources (e.g. wetland methane, dust, fire), improving and extending the emulator to calculate changes in other pollutants (e.g. O₃) and local climate forcing, and building a software package and web interface to increase the accessibility of the emulator.

Code and data availability

The GW-GPR software is publicly available at https://doi.org/10.5281/zenodo.19391330 (Wong, 2026), with a brief user guide in the repository. A frozen version of the repository, which in addition contains the associated datasets required to reproduce the study and the figures presented in this manuscript, is available on Zenodo (https://doi.org/10.5281/zenodo.15484655, Wong, 2025). TAPS is publicly available at https://doi.org/10.5281/zenodo.7158380 (Atkinson et al., 2022b) . The GAINS data used in this study were processed by Atkinson et al. (2022a). The raw GAINS scenario data are available at https://gains.iiasa.ac.at/models (last access: 12 October 2021). The source code of GCHP 13.0.0 is available at https://doi.org/10.5281/zenodo.4618205 (The International GEOS-Chem User Community, 2021). Detailed descriptions and greenhouse gas emissions for CT and AA scenarios can be found within the MIT Joint Program on the Science and Policy for Global Change 2023 Global Change Outlook (Paltsev et al., 2023). The unprocessed CAM and GCHP output are available by contacting the corresponding authors.

Author contributions

AYHW, NES and SDE conceptualized the study. EM developed the underlying CAM variant and produced the meteorological data. SDE developed and evaluated the GCHP-CAM model. AYHW conducted GCHP-CAM simulations, preparing air pollutant emissions for the IGSM-GAINS-TAPS scenarios, GW-GPR modelling, acquiring and processing AerChemMIP, Input4MIP data. AYHW analyzed the results and wrote the manuscript with input from NES, SDE and EM. NES provided supervision and acquired funding for this project.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The computation of this study was performed on the Svante computing cluster, which is supported by MIT Center for Sustainability Science and Strategy. We acknowledge the climate modelling groups participating in AerChemMIP and make the model output available through ESGF, and the funding agencies supporting the above work. We thank Will Atkinson for developing TAPS, Emmie Le Roy for configuring GCHP-CAM, and other members of the BC3 team and Selin group for their valuable insight.

Financial support

This research was part of the Bringing Computation to the Climate Challenge (BC3) and is supported by Schmidt Sciences, LLC, through the MIT Climate Grand Challenges.

Review statement

This paper was edited by Fiona O'Connor and reviewed by three anonymous referees.

References

Alizadeh, R., Allen, J. K., and Mistree, F.: Managing computational complexity using surrogate models: a critical review, Res. Eng. Design, 31, 275–298, https://doi.org/10.1007/s00163-020-00336-7, 2020.

Amann, M., Bertok, I., Borken-Kleefeld, J., Cofala, J., Heyes, C., Höglund-Isaksson, L., Klimont, Z., Nguyen, B., Posch, M., Rafaj, P., Sandler, R., Schöpp, W., Wagner, F., and Winiwarter, W.: Cost-effective control of air quality and greenhouse gases in Europe: Modeling and policy applications, Environ. Model. Softw., 26, 1489–1501, https://doi.org/10.1016/j.envsoft.2011.07.012, 2011.

Ansari, A. S. and Pandis, S. N.: Response of inorganic PM to precursor concentrations, Environ. Sci. Technol., 32, 2706–2714, https://doi.org/10.1021/es971130j, 1998.

Atkinson, W., Eastham, S. D., Chen, Y.-H. H., Morris, J., Paltsev, S., Schlosser, C. A., and Selin, N. E.: A tool for air pollution scenarios (TAPS v1.0) to enable global, long-term, and flexible study of climate and air quality policies, Geosci. Model Dev., 15, 7767–7789, https://doi.org/10.5194/gmd-15-7767-2022, 2022a.

Atkinson, W., Eastham, S. D., Henry Chen, Y.-H., Morris, J., Paltsev, S., Schlosser, C. A., and Selin, N. E.: Code and data used in “A Tool for Air Pollution Scenarios (TAPS v1.0) to enable global, long-term, and flexible study of climate and air quality policies” (1.0) [Data set], Zenodo [code and data set], https://doi.org/10.5281/zenodo.7158380, 2022b.

Bian, H., Chin, M., Hauglustaine, D. A., Schulz, M., Myhre, G., Bauer, S. E., Lund, M. T., Karydis, V. A., Kucsera, T. L., Pan, X., Pozzer, A., Skeie, R. B., Steenrod, S. D., Sudo, K., Tsigaridis, K., Tsimpidi, A. P., and Tsyro, S. G.: Investigation of global particulate nitrate from the AeroCom phase III experiment, Atmos. Chem. Phys., 17, 12911–12940, https://doi.org/10.5194/acp-17-12911-2017, 2017.

Boucher, O., Denvil, S., Levavasseur, G., Cozic, A., Caubel, A., Foujols, M.-A., Meurdesoif, Y., Cadule, P., Devilliers, M., Dupont, E., and Lurton, T.: IPCC DDC: IPSL IPSL-CM5A2-INCA model output prepared for CMIP6 ScenarioMIP (1), WDC Climate, https://doi.org/10.26050/WDCC/AR6.C6SPIPICI, 2023.

Brown-Steiner, B., Selin, N. E., Prinn, R. G., Monier, E., Tilmes, S., Emmons, L., and Garcia-Menendez, F.: Maximizing ozone signals among chemical, meteorological, and climatological variability, Atmos. Chem. Phys., 18, 8373–8388, https://doi.org/10.5194/acp-18-8373-2018, 2018.

Burnett, R., Chen, H., Szyszkowicz, M., Fann, N., Hubbell, B., Pope, C. A., Apte, J. S., Brauer, M., Cohen, A., Weichenthal, S., Coggins, J., Di, Q., Brunekreef, B., Frostad, J., Lim, S. S., Kan, H., Walker, K. D., Thurston, G. D., Hayes, R. B., Lim, C. C., Turner, M. C., Jerrett, M., Krewski, D., Gapstur, S. M., Diver, W. R., Ostro, B., Goldberg, D., Crouse, D. L., Martin, R. V., Peters, P., Pinault, L., Tjepkema, M., Van Donkelaar, A., Villeneuve, P. J., Miller, A. B., Yin, P., Zhou, M., Wang, L., Janssen, N. A. H., Marra, M., Atkinson, R. W., Tsang, H., Thach, T. Q., Cannon, J. B., Allen, R. T., Hart, J. E., Laden, F., Cesaroni, G., Forastiere, F., Weinmayr, G., Jaensch, A., Nagel, G., Concin, H., and Spadaro, J. V.: Global estimates of mortality associated with longterm exposure to outdoor fine particulate matter, P. Natl. Acad. Sci. USA, 115, 9592–9597, https://doi.org/10.1073/pnas.1803222115, 2018.

Camilleri, S. F., Montgomery, A., Visa, M. A., Schnell, J. L., Adelman, Z. E., Janssen, M., Grubert, E. A., Anenberg, S. C., and Horton, D. E.: Air quality, health and equity implications of electrifying heavy-duty vehicles, Nat. Sustain., 6, 1643–1653, https://doi.org/10.1038/s41893-023-01219-0, 2023.

Center For International Earth Science Information Network-CIESIN-Columbia University: Gridded Population of the World, Version 4 (GPWv4): National Identifier Grid, Revision 11, https://doi.org/10.7927/H4TD9VDP, 2018.

Chen, W., Lu, X., Yuan, D., Chen, Y., Li, Z., Huang, Y., Fung, T., Sun, H., and Fung, J. C. H.: Global PM _2.5 Prediction and Associated Mortality to 2100 under Different Climate Change Scenarios, Environ. Sci. Technol., 57, 10039–10052, https://doi.org/10.1021/acs.est.3c03804, 2023.

Chen, Y., Van Der A, R. J., Ding, J., Eskes, H., Williams, J. E., Theys, N., Tsikerdekis, A., and Levelt, P. F.: SO₂ emissions derived from TROPOMI observations over India using a flux-divergence method with variable lifetimes, Atmos. Chem. Phys., 25, 1851–1868, https://doi.org/10.5194/acp-25-1851-2025, 2025.

Chen, Y.-H. H., Paltsev, S., Gurgel, A., Reilly, J. M., and Morris, J.: A Multisectoral Dynamic Model for Energy, Economic, and Climate Scenario Analysis, LCE, 13, 70–111, https://doi.org/10.4236/lce.2022.132005, 2022.

Colette, A., Rouïl, L., Meleux, F., Lemaire, V., and Raux, B.: Air Control Toolbox (ACT_v1.0): a flexible surrogate model to explore mitigation scenarios in air quality forecasts, Geosci. Model Dev., 15, 1441–1465, https://doi.org/10.5194/gmd-15-1441-2022, 2022.

Collins, W. D., Rasch, P. J., Boville, B. A., Hack, J. J., McCaa, J. R., Williamson, D. L., Briegleb, B. P., Bitz, C. M., Lin, S.-J., and Zhang, M.: The Formulation and Atmospheric Simulation of the Community Atmosphere Model Version 3 (CAM3), J. Climate, 19, 2144–2161, https://doi.org/10.1175/JCLI3760.1, 2006.

Collins, W. J., Lamarque, J.-F., Schulz, M., Boucher, O., Eyring, V., Hegglin, M. I., Maycock, A., Myhre, G., Prather, M., Shindell, D., and Smith, S. J.: AerChemMIP: quantifying the effects of chemistry and aerosols in CMIP6, Geosci. Model Dev., 10, 585–607, https://doi.org/10.5194/gmd-10-585-2017, 2017.

Conibear, L., Reddington, C. L., Silver, B. J., Chen, Y., Knote, C., Arnold, S. R., and Spracklen, D. V.: Statistical Emulation of Winter Ambient Fine Particulate Matter Concentrations From Emission Changes in China, GeoHealth, 5, e2021GH000391, https://doi.org/10.1029/2021GH000391, 2021.

Conibear, L., Reddington, C. L., Silver, B. J., Chen, Y., Knote, C., Arnold, S. R., and Spracklen, D. V.: Sensitivity of Air Pollution Exposure and Disease Burden to Emission Changes in China Using Machine Learning Emulation, GeoHealth, 6, e2021GH000570, https://doi.org/10.1029/2021GH000570, 2022.

Conover, W. J. and Iman, R. L.: Analysis of Covariance Using the Rank Transformation, Biometrics, 38, 715, https://doi.org/10.2307/2530051, 1982.

Consortium (EC-Earth): IPCC DDC: EC-Earth-Consortium EC-Earth3-AerChem model output prepared for CMIP6 AerChemMIP, https://doi.org/10.26050/WDCC/AR6.C6ACEEEEA, 2023.

Dammers, E., McLinden, C. A., Griffin, D., Shephard, M. W., Van Der Graaf, S., Lutsch, E., Schaap, M., Gainairu-Matz, Y., Fioletov, V., Van Damme, M., Whitburn, S., Clarisse, L., Cady-Pereira, K., Clerbaux, C., Coheur, P. F., and Erisman, J. W.: NH₃ emissions from large point sources derived from CrIS and IASI satellite observations, Atmos. Chem. Phys., 19, 12261–12293, https://doi.org/10.5194/acp-19-12261-2019, 2019.

Döscher, R., Acosta, M., Alessandri, A., Anthoni, P., Arsouze, T., Bergman, T., Bernardello, R., Boussetta, S., Caron, L.-P., Carver, G., Castrillo, M., Catalano, F., Cvijanovic, I., Davini, P., Dekker, E., Doblas-Reyes, F. J., Docquier, D., Echevarria, P., Fladrich, U., Fuentes-Franco, R., Gröger, M., V. Hardenberg, J., Hieronymus, J., Karami, M. P., Keskinen, J.-P., Koenigk, T., Makkonen, R., Massonnet, F., Ménégoz, M., Miller, P. A., Moreno-Chamarro, E., Nieradzik, L., Van Noije, T., Nolan, P., O'Donnell, D., Ollinaho, P., Van Den Oord, G., Ortega, P., Prims, O. T., Ramos, A., Reerink, T., Rousset, C., Ruprich-Robert, Y., Le Sager, P., Schmith, T., Schrödner, R., Serva, F., Sicardi, V., Sloth Madsen, M., Smith, B., Tian, T., Tourigny, E., Uotila, P., Vancoppenolle, M., Wang, S., Wårlind, D., Willén, U., Wyser, K., Yang, S., Yepes-Arbós, X., and Zhang, Q.: The EC-Earth3 Earth system model for the Coupled Model Intercomparison Project 6, Geosci. Model Dev., 15, 2973–3020, https://doi.org/10.5194/gmd-15-2973-2022, 2022.

Dunne, J. P., Horowitz, L. W., Adcroft, A. J., Ginoux, P., Held, I. M., John, J. G., Krasting, J. P., Malyshev, S., Naik, V., Paulot, F., Shevliakova, E., Stock, C. A., Zadeh, N., Balaji, V., Blanton, C., Dunne, K. A., Dupuis, C., Durachta, J., Dussin, R., Gauthier, P. P. G., Griffies, S. M., Guo, H., Hallberg, R. W., Harrison, M., He, J., Hurlin, W., McHugh, C., Menzel, R., Milly, P. C. D., Nikonov, S., Paynter, D. J., Ploshay, J., Radhakrishnan, A., Rand, K., Reichl, B. G., Robinson, T., Schwarzkopf, D. M., Sentman, L. T., Underwood, S., Vahlenkamp, H., Winton, M., Wittenberg, A. T., Wyman, B., Zeng, Y., and Zhao, M.: The GFDL Earth System Model Version 4.1 (GFDL-ESM 4.1): Overall Coupled Model Description and Simulation Characteristics, J. Adv. Model. Earth Syst., 12, e2019MS002015, https://doi.org/10.1029/2019MS002015, 2020.

Eastham, S. D., Long, M. S., Keller, C. A., Lundgren, E., Yantosca, R. M., Zhuang, J., Li, C., Lee, C. J., Yannetti, M., Auer, B. M., Clune, T. L., Kouatchou, J., Putman, W. M., Thompson, M. A., Trayanov, A. L., Molod, A. M., Martin, R. V., and Jacob, D. J.: GEOS-Chem High Performance (GCHP v11-02c): a next-generation implementation of the GEOS-Chem chemical transport model for massively parallel applications, Geosci. Model Dev., 11, 2941–2953, https://doi.org/10.5194/gmd-11-2941-2018, 2018.

Eastham, S. D., Monier, E., Rothenberg, D., Paltsev, S., and Selin, N. E.: Rapid Estimation of Climate–Air Quality Interactions in Integrated Assessment Using a Response Surface Model, ACS Environ. Au, 3, 153–163, https://doi.org/10.1021/acsenvironau.2c00054, 2023.

Fioletov, V. E., McLinden, C. A., Krotkov, N., and Li, C.: Lifetimes and emissions of SO₂ from point sources estimated from OMI, Geophys. Res. Lett., 42, 1969–1976, https://doi.org/10.1002/2015GL063148, 2015.

Fountoukis, C. and Nenes, A.: ISORROPIAII: A computationally efficient thermodynamic equilibrium model for K⁺–Ca²⁺–Mg²⁺–NH $_{4}^{+}$ –Na⁺–SO $_{4}^{2 -}$ –NO $_{3}^{-}$ –Cl⁻–H₂O aerosols, Atmos. Chem. Phys., 7, 4639–4659, https://doi.org/10.5194/acp-7-4639-2007, 2007.

Franco, B., Clarisse, L., Van Damme, M., Hadji-Lazaro, J., Clerbaux, C., and Coheur, P.-F.: Ethylene industrial emitters seen from space, Nat. Commun., 13, 6452, https://doi.org/10.1038/s41467-022-34098-8, 2022.

Fujimori, S., Hasegawa, T., Masui, T., Takahashi, K., Herran, D. S., Dai, H., Hijioka, Y., and Kainuma, M.: SSP3: AIM implementation of Shared Socioeconomic Pathways, Global Environ. Change, 42, 268–283, https://doi.org/10.1016/j.gloenvcha.2016.06.009, 2017.

GAINS Developer Team: GAINS 4.01 IAM model outputs, IIASA, https://gains.iiasa.ac.at/models (last access: 12 October 2021), 2021.

Galmarini, S., Koffi, B., Solazzo, E., Keating, T., Hogrefe, C., Schulz, M., Benedictow, A., Griesfeller, J. J., Janssens-Maenhout, G., Carmichael, G., Fu, J., and Dentener, F.: Technical note: Coordination and harmonization of the multi-scale, multi-model activities HTAP2, AQMEII3, and MICS-Asia3: simulations, emission inventories, boundary conditions, and model output formats, Atmos. Chem. Phys., 17, 1543–1555, https://doi.org/10.5194/acp-17-1543-2017, 2017.

Gao, C. Y., Heald, C. L., Katich, J. M., Luo, G., and Yu, F.: Remote Aerosol Simulated During the Atmospheric Tomography (ATom) Campaign and Implications for Aerosol Lifetime, J. Geophys. Res.-Atmos., 127, e2022JD036524, https://doi.org/10.1029/2022JD036524, 2022.

Garcia-Menendez, F., Saari, R. K., Monier, E., and Selin, N. E.: U.S. Air Quality and Health Benefits from Avoided Climate Change under Greenhouse Gas Mitigation, Environ. Sci. Technol., 49, 7580–7588, https://doi.org/10.1021/acs.est.5b01324, 2015.

Garcia-Menendez, F., Monier, E., and Selin, N. E.: The role of natural variability in projections of climate change impacts on U.S. ozone pollution, Geophys. Res. Lett., 44, 2911–2921, https://doi.org/10.1002/2016GL071565, 2017.

Giani, P., Fiore, A. M., Flierl, G., Ferrari, R., and Selin, N. E.: Origin and Limits of Invariant Warming Patterns in Climate Models, arXiv [preprint], https://doi.org/10.48550/ARXIV.2411.14183, 2024.

Gidden, M. J., Riahi, K., Smith, S. J., Fujimori, S., Luderer, G., Kriegler, E., Van Vuuren, D. P., Van Den Berg, M., Feng, L., Klein, D., Calvin, K., Doelman, J. C., Frank, S., Fricko, O., Harmsen, M., Hasegawa, T., Havlik, P., Hilaire, J., Hoesly, R., Horing, J., Popp, A., Stehfest, E., and Takahashi, K.: Global emissions pathways under different socioeconomic scenarios for use in CMIP6: a dataset of harmonized emissions trajectories through the end of the century, Geosci. Model Dev., 12, 1443–1475, https://doi.org/10.5194/gmd-12-1443-2019, 2019.

Gu, S., Guenther, A., and Faiola, C.: Effects of Anthropogenic and Biogenic Volatile Organic Compounds on Los Angeles Air Quality, Environ. Sci. Technol., 55, 12191–12201, https://doi.org/10.1021/acs.est.1c01481, 2021.

Guenther, A. B., Jiang, X., Heald, C. L., Sakulyanontvittaya, T., Duhl, T., Emmons, L. K., and Wang, X.: The model of emissions of gases and aerosols from nature version 2.1 (MEGAN2.1): An extended and updated framework for modeling biogenic emissions, Geosci. Model Dev., 5, 1471–1492, https://doi.org/10.5194/gmd-5-1471-2012, 2012.

Guivarch, C., Le Gallic, T., Bauer, N., Fragkos, P., Huppmann, D., Jaxa-Rozen, M., Keppo, I., Kriegler, E., Krisztin, T., Marangoni, G., Pye, S., Riahi, K., Schaeffer, R., Tavoni, M., Trutnevyte, E., Van Vuuren, D., and Wagner, F.: Using large ensembles of climate change mitigation scenarios for robust insights, Nat. Clim. Change, 12, 428–435, https://doi.org/10.1038/s41558-022-01349-x, 2022.

Hardacre, C., Mulcahy, J. P., Pope, R. J., Jones, C. G., Rumbold, S. T., Li, C., Johnson, C., and Turnock, S. T.: Evaluation of SO₂ , SO $_{4}^{2 -}$ and an updated SO₂ dry deposition parameterization in the United Kingdom Earth System Model, Atmos. Chem. Phys., 21, 18465–18497, https://doi.org/10.5194/acp-21-18465-2021, 2021.

Herman, J. and Usher, W.: SALib: An open-source Python library for Sensitivity Analysis, JOSS, 2, 97, https://doi.org/10.21105/joss.00097, 2017.

Hoesly, R. M., Smith, S. J., Feng, L., Klimont, Z., Janssens-Maenhout, G., Pitkanen, T., Seibert, J. J., Vu, L., Andres, R. J., Bolt, R. M., Bond, T. C., Dawidowski, L., Kholod, N., Kurokawa, J., Li, M., Liu, L., Lu, Z., Moura, M. C. P., O'Rourke, P. R., and Zhang, Q.: Historical (1750–2014) anthropogenic emissions of reactive gases and aerosols from the Community Emissions Data System (CEDS), Geosci. Model Dev., 11, 369–408, https://doi.org/10.5194/gmd-11-369-2018, 2018.

Horowitz, L. W., Naik, V., Sentman, L., Paulot, F., Blanton, C., McHugh, C., Radhakrishnan, A., Rand, K., Vahlenkamp, H., Zadeh, N. T., Wilson, C., Ginoux, P., He, J., John, J. G., Lin, M., Paynter, D. J., Ploshay, J., Zhang, A., and Zeng, Y.: IPCC DDC: NOAA-GFDL GFDL-ESM4 model output prepared for CMIP6 AerChemMIP (1), WDC Climate, https://doi.org/10.26050/WDCC/AR6.C6ACNGGFE, 2023.

Huang, X., Srikrishnan, V., Lamontagne, J., Keller, K., and Peng, W.: Effects of global climate mitigation on regional air quality and health, Nat. Sustain., 6, 1054–1066, https://doi.org/10.1038/s41893-023-01133-5, 2023.

Hudman, R. C., Moore, N. E., Mebust, A. K., Martin, R. V., Russell, A. R., Valin, L. C., and Cohen, R. C.: Steps towards a mechanistic model of global soil nitric oxide emissions: Implementation and space based-constraints, Atmos. Chem. Phys., 12, 7779–7795, https://doi.org/10.5194/acp-12-7779-2012, 2012.

Iwanaga, T., Usher, W., and Herman, J.: Toward SALib 2.0: Advancing the accessibility and interpretability of global sensitivity analyses, SESMO, 4, 18155, https://doi.org/10.18174/sesmo.18155, 2022.

Jacob, D. J. and Winner, D. A.: Effect of climate change on air quality, Atmos. Environ., 43, 51–63, https://doi.org/10.1016/j.atmosenv.2008.09.051, 2009.

Jones, P. W.: First- and Second-Order Conservative Remapping Schemes for Grids in Spherical Coordinates, Mon. Weather Rev., 127, 2204–2210, https://doi.org/10.1175/1520-0493(1999)127<2204:FASOCR>2.0.CO;2, 1999.

Kawamiya, M., Hajima, T., Tachiiri, K., Watanabe, S., and Yokohata, T.: Two decades of Earth system modeling with an emphasis on Model for Interdisciplinary Research on Climate (MIROC), Prog. Earth Planet. Sci., 7, 64, https://doi.org/10.1186/s40645-020-00369-5, 2020.

Kelley, M., Schmidt, G. A., Nazarenko, L. S., Bauer, S. E., Ruedy, R., Russell, G. L., Ackerman, A. S., Aleinov, I., Bauer, M., Bleck, R., Canuto, V., Cesana, G., Cheng, Y., Clune, T. L., Cook, B. I., Cruz, C. A., Del Genio, A. D., Elsaesser, G. S., Faluvegi, G., Kiang, N. Y., Kim, D., Lacis, A. A., Leboissetier, A., LeGrande, A. N., Lo, K. K., Marshall, J., Matthews, E. E., McDermid, S., Mezuman, K., Miller, R. L., Murray, L. T., Oinas, V., Orbe, C., García-Pando, C. P., Perlwitz, J. P., Puma, M. J., Rind, D., Romanou, A., Shindell, D. T., Sun, S., Tausnev, N., Tsigaridis, K., Tselioudis, G., Weng, E., Wu, J., and Yao, M.: GISS-E2.1: Configurations and Climatology, J. Adv. Model. Earth Syst., 12, e2019MS002025, https://doi.org/10.1029/2019MS002025, 2020.

Khalil, M. A. K. and Rasmussen, R. A.: The global cycle of carbon monoxide: Trends and mass balance, Chemosphere, 20, 227–242, https://doi.org/10.1016/0045-6535(90)90098-E, 1990.

Kim, P. S., Jacob, D. J., Fisher, J. A., Travis, K., Yu, K., Zhu, L., Yantosca, R. M., Sulprizio, M. P., Jimenez, J. L., Campuzano-Jost, P., Froyd, K. D., Liao, J., Hair, J. W., Fenn, M. A., Butler, C. F., Wagner, N. L., Gordon, T. D., Welti, A., Wennberg, P. O., Crounse, J. D., St. Clair, J. M., Teng, A. P., Millet, D. B., Schwarz, J. P., Markovic, M. Z., and Perring, A. E.: Sources, seasonality, and trends of southeast US aerosol: an integrated analysis of surface, aircraft, and satellite observations with the GEOS-Chem chemical transport model, Atmos. Chem. Phys., 15, 10411–10433, https://doi.org/10.5194/acp-15-10411-2015, 2015.

Klimont, Z., Kupiainen, K., Heyes, C., Purohit, P., Cofala, J., Rafaj, P., Borken-Kleefeld, J., and Schöpp, W.: Global anthropogenic emissions of particulate matter including black carbon, Atmos. Chem. Phys., 17, 8681–8723, https://doi.org/10.5194/acp-17-8681-2017, 2017.

Lamontagne, J. R., Reed, P. M., Link, R., Calvin, K. V., Clarke, L. E., and Edmonds, J. A.: Large Ensemble Analytic Framework for Consequence-Driven Discovery of Climate Change Scenarios, Earth's Future, 6, 488–504, https://doi.org/10.1002/2017EF000701, 2018.

Lange, K., Richter, A., and Burrows, J. P.: Variability of nitrogen oxide emission fluxes and lifetimes estimated from Sentinel-5P TROPOMI observations, Atmos. Chem. Phys., 22, 2745–2767, https://doi.org/10.5194/acp-22-2745-2022, 2022.

Latimer, R. N. C. and Martin, R. V.: Interpretation of measured aerosol mass scattering efficiency over North America using a chemical transport model, Atmos. Chem. Phys., 19, 2635–2653, https://doi.org/10.5194/acp-19-2635-2019, 2019.

Lee, C., Martin, R. V., Van Donkelaar, A., Lee, H., Dickerson, R. R., Hains, J. C., Krotkov, N., Richter, A., Vinnikov, K., and Schwab, J. J.: SO₂ emissions and lifetimes: Estimates from inverse modeling using in situ and global, space-based (SCIAMACHY and OMI) observations, J. Geophys. Res., 116, D06304, https://doi.org/10.1029/2010JD014758, 2011.

Li, C. and Cohen, R. C.: Space-Borne Estimation of Volcanic Sulfate Aerosol Lifetime, J. Geophys. Res.-Atmos., 126, e2020JD033883, https://doi.org/10.1029/2020JD033883, 2021.

Li, D., Jin, J., Wang, G., Pang, M., Zhang, W., and Liao, H.: A Transformer-based agent model of GEOS-Chem v14.2.2 for informative prediction of PM_2.5 and O₃ levels to future emission scenarios: TGEOS v1.0, Geosci. Model Dev., 19, 1703–1725, https://doi.org/10.5194/gmd-19-1703-2026, 2026.

Li, H., Yang, Y., Wang, H., Wang, P., Yue, X., and Liao, H.: Projected Aerosol Changes Driven by Emissions and Climate Change Using a Machine Learning Method, Environ. Sci. Technol., 56, 3884–3893, https://doi.org/10.1021/acs.est.1c04380, 2022.

Liang, C.-K., West, J. J., Silva, R. A., Bian, H., Chin, M., Davila, Y., Dentener, F. J., Emmons, L., Flemming, J., Folberth, G., Henze, D., Im, U., Jonson, J. E., Keating, T. J., Kucsera, T., Lenzen, A., Lin, M., Lund, M. T., Pan, X., Park, R. J., Pierce, R. B., Sekiya, T., Sudo, K., and Takemura, T.: HTAP2 multi-model estimates of premature human mortality due to intercontinental transport of air pollution and emission sectors, Atmos. Chem. Phys., 18, 10497–10520, https://doi.org/10.5194/acp-18-10497-2018, 2018.

Lund, M. T., Samset, B. H., Skeie, R. B., Watson-Parris, D., Katich, J. M., Schwarz, J. P., and Weinzierl, B.: Short Black Carbon lifetime inferred from a global set of aircraft observations, npj Clim. Atmos. Sci., 1, 31, https://doi.org/10.1038/s41612-018-0040-x, 2018.

Lütjens, B., Ferrari, R., Watson-Parris, D., and Selin, N. E.: The Impact of Internal Variability on Benchmarking Deep Learning Climate Emulators, J. Adv. Model. Earth Syst., 17, e2024MS004619, https://doi.org/10.1029/2024MS004619, 2025.

McKay, M. D., Beckman, R. J., and Conover, W. J.: Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output from a Computer Code, Technometrics, 21, 239–245, https://doi.org/10.1080/00401706.1979.10489755, 1979.

Meinshausen, M., Nicholls, Z. R. J., Lewis, J., Gidden, M. J., Vogel, E., Freund, M., Beyerle, U., Gessner, C., Nauels, A., Bauer, N., Canadell, J. G., Daniel, J. S., John, A., Krummel, P. B., Luderer, G., Meinshausen, N., Montzka, S. A., Rayner, P. J., Reimann, S., Smith, S. J., Van Den Berg, M., Velders, G. J. M., Vollmer, M. K., and Wang, R. H. J.: The shared socio-economic pathway (SSP) greenhouse gas concentrations and their extensions to 2500, Geosci. Model Dev., 13, 3571–3605, https://doi.org/10.5194/gmd-13-3571-2020, 2020.

Meng, J., Martin, R. V., Ginoux, P., Hammer, M., Sulprizio, M. P., Ridley, D. A., and Van Donkelaar, A.: Grid-independent high-resolution dust emissions (v1.0) for chemical transport models: application to GEOS-Chem (12.5.0), Geosci. Model Dev., 14, 4249–4260, https://doi.org/10.5194/gmd-14-4249-2021, 2021.

Monier, E., Scott, J. R., Sokolov, A. P., Forest, C. E., and Schlosser, C. A.: An integrated assessment modeling framework for uncertainty studies in global and regional climate change: the MIT IGSM-CAM (version 1.0), Geosci. Model Dev., 6, 2063–2085, https://doi.org/10.5194/gmd-6-2063-2013, 2013.

Monier, E., Gao, X., Scott, J. R., Sokolov, A. P., and Schlosser, C. A.: A framework for modeling uncertainty in regional climate change, Climatic Change, 131, 51–66, https://doi.org/10.1007/s10584-014-1112-5, 2015.

Muller, H., Kramm, G., Meixner, F., Dollard, G. J., Fowler, D., and Possanzini, M.: Determination of HNO₃ dry deposition by modified Bowen ratio and aerodynamic profile techniques, Tellus B, 45, 346, https://doi.org/10.3402/tellusb.v45i4.15735, 1993.

Murray, L. T., Jacob, D. J., Logan, J. A., Hudman, R. C., and Koshak, W. J.: Optimized regional and interannual variability of lightning in a global chemical transport model constrained by LIS/OTD satellite data, J. Geophys. Res., 117, 2012JD017934, https://doi.org/10.1029/2012JD017934, 2012.

NASA/GISS – NASA Goddard Institute For Space Studies: IPCC DDC: NASA-GISS GISS-E2.1G model output prepared for CMIP6 AerChemMIP (1), https://doi.org/10.26050/WDCC/AR6.C6ACGIGE1, 2023a.

NASA/GISS – NASA Goddard Institute For Space Studies: IPCC DDC: NASA-GISS GISS-E2.1G model output prepared for CMIP6 AerChemMIP ssp370-lowNTCF (1), https://doi.org/10.26050/WDCC/AR6.C6ACGIGE1SL, 2023b.

NASA/GISS – NASA Goddard Institute For Space Studies: IPCC DDC: NASA-GISS GISS-E2.1H model output prepared for CMIP6 CMIP (1), https://doi.org/10.26050/WDCC/AR6.C6CMGIGH1, 2023c.

O'Neill, B. C., Tebaldi, C., Van Vuuren, D. P., Eyring, V., Friedlingstein, P., Hurtt, G., Knutti, R., Kriegler, E., Lamarque, J.-F., Lowe, J., Meehl, G. A., Moss, R., Riahi, K., and Sanderson, B. M.: The Scenario Model Intercomparison Project (ScenarioMIP) for CMIP6, Geosci. Model Dev., 9, 3461–3482, https://doi.org/10.5194/gmd-9-3461-2016, 2016.

O'Neill, B. C., Carter, T. R., Ebi, K., Harrison, P. A., Kemp-Benedict, E., Kok, K., Kriegler, E., Preston, B. L., Riahi, K., Sillmann, J., Van Ruijven, B. J., Van Vuuren, D., Carlisle, D., Conde, C., Fuglestvedt, J., Green, C., Hasegawa, T., Leininger, J., Monteith, S., and Pichs-Madruga, R.: Achievements and needs for the climate change scenario framework, Nat. Clim. Change, 10, 1074–1084, https://doi.org/10.1038/s41558-020-00952-0, 2020.

Paltsev, S., Schlosser, C. A., Chen, H., Gao, X., Gurgel, A., Jacoby, H., Morris, J., Prinn, R. G., Reilly, J. M., Salunke, P., and Sokolov, A. P.: 2023 Global Change Outlook, MIT Joint Program on the Science and Policy of Global Change, https://cs3.mit.edu/publications/signature/2023-global-change-outlook (last access: 5 September 2025), 2023.

Paulot, F. and Jacob, D. J.: Hidden cost of U.S. agricultural exports: Particulate matter from ammonia emissions, Environ. Sci. Technol., 48, 903–908, https://doi.org/10.1021/es4034793, 2014.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, É.: Scikit-learn: Machine learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011.

Pienkosz, B. D., Saari, R. K., Monier, E., and Garcia-Menendez, F.: Natural Variability in Projections of Climate Change Impacts on Fine Particulate Matter Pollution, Earth's Future, 7, 762–770, https://doi.org/10.1029/2019EF001195, 2019.

Pisoni, E., Clappier, A., Degraeuwe, B., and Thunis, P.: Adding spatial flexibility to source-receptor relationships for air quality modeling, Environ. Model. Softw., 90, 68–77, https://doi.org/10.1016/j.envsoft.2017.01.001, 2017.

Pisoni, E., Albrecht, D., Mara, T. A., Rosati, R., Tarantola, S., and Thunis, P.: Application of uncertainty and sensitivity analysis to the air quality SHERPA modelling tool, Atmos. Environ., 183, 84–93, https://doi.org/10.1016/j.atmosenv.2018.04.006, 2018.

Possell, M. and Hewitt, C. N.: Isoprene emissions from plants are mediated by atmospheric CO₂ concentrations, Global Change Biol., 17, 1595–1610, https://doi.org/10.1111/j.1365-2486.2010.02306.x, 2011.

Ran, L., Zhao, C. S., Xu, W. Y., Lu, X. Q., Han, M., Lin, W. L., Yan, P., Xu, X. B., Deng, Z. Z., Ma, N., Liu, P. F., Yu, J., Liang, W. D., and Chen, L. L.: VOC reactivity and its effect on ozone production during the HaChi summer campaign, Atmos. Chem. Phys., 11, 4657–4667, https://doi.org/10.5194/acp-11-4657-2011, 2011.

Rao, S., Klimont, Z., Smith, S. J., Van Dingenen, R., Dentener, F., Bouwman, L., Riahi, K., Amann, M., Bodirsky, B. L., Van Vuuren, D. P., Aleluia Reis, L., Calvin, K., Drouet, L., Fricko, O., Fujimori, S., Gernaat, D., Havlik, P., Harmsen, M., Hasegawa, T., Heyes, C., Hilaire, J., Luderer, G., Masui, T., Stehfest, E., Strefler, J., Van Der Sluis, S., and Tavoni, M.: Future air pollution in the Shared Socio-economic Pathways, Global Environ. Change, 42, 346–358, https://doi.org/10.1016/j.gloenvcha.2016.05.012, 2017.

Reis, L. A., Drouet, L., and Tavoni, M.: Internalising health-economic impacts of air pollution into climate policy: a global modelling study, Lancet Planet. Health, 6, e40–e48, https://doi.org/10.1016/S2542-5196(21)00259-X, 2022.

Reyes-Villegas, E., Lowe, D., Johnson, J. S., Carslaw, K. S., Darbyshire, E., Flynn, M., Allan, J. D., Coe, H., Chen, Y., Wild, O., Archer-Nicholls, S., Archibald, A., Singh, S., Shrivastava, M., Zaveri, R. A., Singh, V., Beig, G., Sokhi, R., and McFiggans, G.: Simulating organic aerosol in Delhi with WRF-Chem using the volatility-basis-set approach: exploring model uncertainty with a Gaussian process emulator, Atmos. Chem. Phys., 23, 5763–5782, https://doi.org/10.5194/acp-23-5763-2023, 2023.

Ryan, E. and Wild, O.: Calibrating a global atmospheric chemistry transport model using Gaussian process emulation and ground-level concentrations of ozone and carbon monoxide, Geosci. Model Dev., 14, 5373–5391, https://doi.org/10.5194/gmd-14-5373-2021, 2021.

Saari, R. K., Mei, Y., Monier, E., and Garcia-Menendez, F.: Effect of Health-Related Uncertainty and Natural Variability on Health Impacts and Cobenefits of Climate Policy, Environ. Sci. Technol., 53, 1098–1108, https://doi.org/10.1021/acs.est.8b05094, 2019.

Saltelli, A., Annoni, P., Azzini, I., Campolongo, F., Ratto, M., and Tarantola, S.: Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index, Comput. Phys. Commun., 181, 259–270, https://doi.org/10.1016/j.cpc.2009.09.018, 2010.

Sang, S., Chu, C., Zhang, T., Chen, H., and Yang, X.: The global burden of disease attributable to ambient fine particulate matter in 204 countries and territories, 1990–2019: A systematic analysis of the Global Burden of Disease Study 2019, Ecotoxicol. Environ. Safe., 238, 113588, https://doi.org/10.1016/j.ecoenv.2022.113588, 2022.

Schrader, F. and Brümmer, C.: Land Use Specific Ammonia Deposition Velocities: a Review of Recent Studies (2004–2013), Water Air Soil Pollut., 225, 2114, https://doi.org/10.1007/s11270-014-2114-7, 2014.

Sepulchre, P., Caubel, A., Ladant, J.-B., Bopp, L., Boucher, O., Braconnot, P., Brockmann, P., Cozic, A., Donnadieu, Y., Dufresne, J.-L., Estella-Perez, V., Ethé, C., Fluteau, F., Foujols, M.-A., Gastineau, G., Ghattas, J., Hauglustaine, D., Hourdin, F., Kageyama, M., Khodri, M., Marti, O., Meurdesoif, Y., Mignot, J., Sarr, A.-C., Servonnat, J., Swingedouw, D., Szopa, S., and Tardif, D.: IPSL-CM5A2 – an Earth system model designed for multi-millennial climate simulations, Geosci. Model Dev., 13, 3011–3053, https://doi.org/10.5194/gmd-13-3011-2020, 2020.

Sherwen, T., Schmidt, J. A., Evans, M. J., Carpenter, L. J., Großmann, K., Eastham, S. D., Jacob, D. J., Dix, B., Koenig, T. K., Sinreich, R., Ortega, I., Volkamer, R., Saiz-Lopez, A., Prados-Roman, C., Mahajan, A. S., and Ordóñez, C.: Global impacts of tropospheric halogens (Cl, Br, I) on oxidants and composition in GEOS-Chem, Atmos. Chem. Phys., 16, 12239–12271, https://doi.org/10.5194/acp-16-12239-2016, 2016.

Shindell, D. and Smith, C. J.: Climate and air-quality benefits of a realistic phase-out of fossil fuels, Nature, 573, 408–411, https://doi.org/10.1038/s41586-019-1554-z, 2019.

Smith, S. J., Klimont, Z., Drouet, L., Harmsen, M., Luderer, G., Riahi, K., Van Vuuren, D. P., and Weyant, J. P.: The Energy Modeling Forum (EMF)-30 study on short-lived climate forcers: introduction and overview, Climatic Change, 163, 1399–1408, https://doi.org/10.1007/s10584-020-02938-5, 2020.

Sobol', I. M.: Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates, Math. Comput. Simul., 55, 271–280, https://doi.org/10.1016/S0378-4754(00)00270-6, 2001.

Sokolov, A., Kicklighter, D., Schlosser, A., Wang, C., Monier, E., Brown-Steiner, B., Prinn, R., Forest, C., Gao, X., Libardoni, A., and Eastham, S.: Description and Evaluation of the MIT Earth System Model (MESM), J. Adv. Model. Earth Syst., 10, 1759–1789, https://doi.org/10.1029/2018MS001277, 2018.

Stohl, A., Aamaas, B., Amann, M., Baker, L. H., Bellouin, N., Berntsen, T. K., Boucher, O., Cherian, R., Collins, W., Daskalakis, N., Dusinska, M., Eckhardt, S., Fuglestvedt, J. S., Harju, M., Heyes, C., Hodnebrog, Ø., Hao, J., Im, U., Kanakidou, M., Klimont, Z., Kupiainen, K., Law, K. S., Lund, M. T., Maas, R., MacIntosh, C. R., Myhre, G., Myriokefalitakis, S., Olivié, D., Quaas, J., Quennehen, B., Raut, J.-C., Rumbold, S. T., Samset, B. H., Schulz, M., Seland, Ø., Shine, K. P., Skeie, R. B., Wang, S., Yttri, K. E., and Zhu, T.: Evaluating the climate and air quality impacts of short-lived pollutants, Atmos. Chem. Phys., 15, 10529–10566, https://doi.org/10.5194/acp-15-10529-2015, 2015.

Tai, A. P. K., Mickley, L. J., Heald, C. L., and Wu, S.: Effect of CO₂ inhibition on biogenic isoprene emission: Implications for air quality under 2000 to 2050 changes in climate, vegetation, and land use, Geophys. Res. Lett., 40, 3479–3483, https://doi.org/10.1002/grl.50650, 2013.

Tessum, C. W., Hill, J. D., and Marshall, J. D.: InMAP: A model for air pollution interventions, PLoS ONE, 12, e0176131, https://doi.org/10.1371/journal.pone.0176131, 2017.

Thakrar, S. K., Tessum, C. W., Apte, J. S., Balasubramanian, S., Millet, D. B., Pandis, S. N., Marshall, J. D., and Hill, J. D.: Global, high-resolution, reduced-complexity air quality modeling for PM_2.5 using InMAP (Intervention Model for Air Pollution), PLoS ONE, 17, e0268714, https://doi.org/10.1371/journal.pone.0268714, 2022.

The International GEOS-Chem User Community: geoschem/gchp: GCHP 13.0.0 (13.0.0), Zenodo [code], https://doi.org/10.5281/zenodo.4618205, 2021.

Tiwari, V., Hanai, Y., and Masunaga, S.: Ambient levels of volatile organic compounds in the vicinity of petrochemical industrial area of Yokohama, Japan, Air Qual. Atmos. Health, 3, 65–75, https://doi.org/10.1007/s11869-009-0052-0, 2010.

Trentmann, J., Andreae, M. O., and Graf, H.: Chemical processes in a young biomass-burning plume, J. Geophys. Res., 108, 2003JD003732, https://doi.org/10.1029/2003JD003732, 2003.

Turnock, S. T., Allen, R. J., Andrews, M., Bauer, S. E., Deushi, M., Emmons, L., Good, P., Horowitz, L., John, J. G., Michou, M., Nabat, P., Naik, V., Neubauer, D., O'Connor, F. M., Olivié, D., Oshima, N., Schulz, M., Sellar, A., Shim, S., Takemura, T., Tilmes, S., Tsigaridis, K., Wu, T., and Zhang, J.: Historical and future changes in air pollutants from CMIP6 models, Atmos. Chem. Phys., 20, 14547–14579, https://doi.org/10.5194/acp-20-14547-2020, 2020.

United Nations: Transforming our world: the 2030 Agenda for Sustainable Development, https://sdgs.un.org/2030agenda (last access: 7 September 2025), 2015.

Vander Hoorn, S., Johnson, J. S., Murray, K., Smit, R., Heyworth, J., Lam, S., and Cope, M.: Emulation of a Chemical Transport Model to Assess Air Quality under Future Emission Scenarios for the Southwest of Western Australia, Atmosphere, 13, 2009, https://doi.org/10.3390/atmos13122009, 2022.

Van Der Werf, G. R., Randerson, J. T., Giglio, L., Van Leeuwen, T. T., Chen, Y., Rogers, B. M., Mu, M., Van Marle, M. J. E., Morton, D. C., Collatz, G. J., Yokelson, R. J., and Kasibhatla, P. S.: Global fire emissions estimates during 1997–2016, Earth Syst. Sci. Data, 9, 697–720, https://doi.org/10.5194/essd-9-697-2017, 2017.

Van Dingenen, R., Dentener, F., Crippa, M., Leitao, J., Marmer, E., Rao, S., Solazzo, E., and Valentini, L.: TM5-FASST: a global atmospheric source–receptor model for rapid impact analysis of emission changes on air quality and short-lived climate pollutants, Atmos. Chem. Phys., 18, 16173–16211, https://doi.org/10.5194/acp-18-16173-2018, 2018.

Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., Van Der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C. J., Polat, İ., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., Van Mulbregt, P., SciPy 1.0 Contributors, Vijaykumar, A., Bardelli, A. P., Rothberg, A., Hilboll, A., Kloeckner, A., Scopatz, A., Lee, A., Rokem, A., Woods, C. N., Fulton, C., Masson, C., Häggström, C., Fitzgerald, C., Nicholson, D. A., Hagen, D. R., Pasechnik, D. V., Olivetti, E., Martin, E., Wieser, E., Silva, F., Lenders, F., Wilhelm, F., Young, G., Price, G. A., Ingold, G.-L., Allen, G. E., Lee, G. R., Audren, H., Probst, I., Dietrich, J. P., Silterra, J., Webber, J. T., Slavič, J., Nothman, J., Buchner, J., Kulick, J., Schönberger, J. L., De Miranda Cardoso, J. V., Reimer, J., Harrington, J., Rodríguez, J. L. C., Nunez-Iglesias, J., Kuczynski, J., Tritz, K., Thoma, M., Newville, M., Kümmerer, M., Bolingbroke, M., Tartre, M., Pak, M., Smith, N. J., Nowaczyk, N., Shebanov, N., Pavlyk, O., Brodtkorb, P. A., Lee, P., McGibbon, R. T., Feldbauer, R., Lewis, S., Tygier, S., Sievert, S., Vigna, S., Peterson, S., More, S., Pudlik, T., Oshima, T., Pingel, T. J., Robitaille, T. P., Spura, T., Jones, T. R., Cera, T., Leslie, T., Zito, T., Krauss, T., Upadhyay, U., Halchenko, Y. O., and Vázquez-Baeza, Y.: SciPy 1.0: fundamental algorithms for scientific computing in Python, Nat. Meth., 17, 261–272, https://doi.org/10.1038/s41592-019-0686-2, 2020.

Watanabe, S., Hajima, T., Sudo, K., Abe, M., Arakawa, O., Ogochi, K., Arakawa, T., Tatebe, H., Ito, A., Ito, A., Komuro, Y., Nitta, T., Noguchi, M. A., Ogura, T., Ohgaito, R., Sekiguchi, M., Suzuki, T., Tachiiri, K., Takata, K., Takemura, T., Watanabe, M., Yamamoto, A., Yamazaki, D., Yoshimura, K., and Kawamiya, M.: IPCC DDC: MIROC MIROC-ES2H model output prepared for CMIP6 CMIP (1), https://doi.org/10.26050/WDCC/AR6.C6CMMIMIH, 2023.

WHO: WHO methods and data sources for country-level causes of death 2000–2016, https://terrance.who.int/mediacentre/data/ghe/GlobalCOD_method_2000_2016.pdf (last access: 16 January 2025), 2018.

Wild, O., Voulgarakis, A., O'Connor, F., Lamarque, J.-F., Ryan, E. M., and Lee, L.: Global sensitivity analysis of chemistry–climate model budgets of tropospheric ozone and OH: exploring model diversity, Atmos. Chem. Phys., 20, 4047–4058, https://doi.org/10.5194/acp-20-4047-2020, 2020.

Williams, C. and Rasmussen, C.: Gaussian processes for regression. Advances in neural information processing systems, MIT Press, https://proceedings.neurips.cc/paper_files/paper/1995/file/7cce53cf90577442771720a370c3c723-Paper.pdf (last access: 24 July 2024), 1995.

Wong, A.: ayhwong/GW-GPR: Initial version (V1.0), Zenodo [code], https://doi.org/10.5281/zenodo.19391330, 2026.

Wong, A. Y. H.: Models and Data for Reproducing the Result in “A Geographically Weighted Gaussian Process Regression Emulator of the GCHP 13.0.0 Global Air Quality Model, Zenodo [code and data set], https://doi.org/10.5281/ZENODO.15484655, 2025.

Articles

Short summary

We developed a fast and accurate computer tool that predicts how air pollution will change around the world under different climate and policy choices. Using machine learning and real model data, our tool can estimate changes in harmful fine particulate pollution in seconds instead of thousands of hours. This makes it easier for researchers and policymakers to explore future air quality and health impacts under a wide range of scenarios.

A Geographically Weighted Gaussian Process Regression (GW-GPR) emulator of anthropogenic PM2.5 from the GEOS-Chem High Performance (GCHP) 13.0.0 global chemical transport model

2.1 MIT Integrated Global System Model (IGSM) and its coupling with Community Atmosphere Model (CAM)

2.2 GEOS-Chem High Performance model driven by CAM meteorological fields (GCHP-CAM)

2.3 Generating PM2.5 training data using GCHP-CAM

2.3.1 Parameterizing global warming

2.3.2 Generating training dataset through GCHP-CAM perturbation experiments

2.4 Training the Geographically Weighted Gaussian Process Regression (GW-GPR) emulator

2.5 Cross validation of the GW-GPR emulator

2.6 Testing the emulator with IGSM-GAINS-TAPS combined air quality and climate legislation scenarios

2.7 Demonstrating the utility of the emulator using AerChemMIP data

2.8 Health impact calculation

3.1 Computing resource requirement

3.2 Emulator cross validation and sensitivity

3.3 Comparing emulator performance for IGSM-GAINS-TAPS scenarios with GCHP-CAM

3.3.1 Comparison with linear model

3.3.2 Sensitivity to dispersion length scales (L)

A Geographically Weighted Gaussian Process Regression (GW-GPR) emulator of anthropogenic PM_2.5 from the GEOS-Chem High Performance (GCHP) 13.0.0 global chemical transport model

2.3 Generating PM_2.5 training data using GCHP-CAM