Articles | Volume 17, issue 2
https://doi.org/10.5194/gmd-17-567-2024
https://doi.org/10.5194/gmd-17-567-2024
Development and technical paper
 | 
24 Jan 2024
Development and technical paper |  | 24 Jan 2024

The first application of a numerically exact, higher-order sensitivity analysis approach for atmospheric modelling: implementation of the hyperdual-step method in the Community Multiscale Air Quality Model (CMAQ) version 5.3.2

Jiachen Liu, Eric Chen, and Shannon L. Capps
Abstract

Sensitivity analysis in chemical transport models quantifies the response of output variables to changes in input parameters. This information is valuable for researchers engaged in data assimilation and model development. Additionally, environmental decision-makers depend upon these expected responses of concentrations to emissions when designing and justifying air pollution control strategies. Existing sensitivity analysis methods include the finite-difference method, the direct decoupled method (DDM), the complex variable method, and the adjoint method. These methods are either prone to significant numerical errors when applied to nonlinear models with complex components (e.g. finite difference and complex step methods) or difficult to maintain when the original model is updated (e.g. direct decoupled and adjoint methods). Here, we present the implementation of the hyperdual-step method in the Community Multiscale Air Quality Model (CMAQ) version 5.3.2 as CMAQ-hyd. CMAQ-hyd can be applied to compute numerically exact first- and second-order sensitivities of species concentrations with respect to emissions or concentrations. Compared to CMAQ-DDM and CMAQ-adjoint, CMAQ-hyd is more straightforward to update and maintain, while it remains free of subtractive cancellation and truncation errors, just as those augmented models do. To evaluate the accuracy of the implementation, the sensitivities computed by CMAQ-hyd are compared with those calculated with other traditional methods or a hybrid of the traditional and advanced methods. We demonstrate the capability of CMAQ-hyd with the newly implemented gas-phase chemistry and biogenic aerosol formation mechanism in CMAQ. We also explore the cross-sensitivity of monoterpene nitrate aerosol formation to its anthropogenic and biogenic precursors to show the additional sensitivity information computed by CMAQ-hyd. Compared with the traditional finite difference method, CMAQ-hyd consumes fewer computational resources when the same sensitivity coefficients are calculated. This novel method implemented in CMAQ is also computationally competitive with other existing methods and could be further optimized to reduce memory and computational time overheads.

1 Introduction

Ambient air pollution, including particulate matter (PM), poses significant threats to human health. According to the Global Burden of Disease study, among all risk factors, ambient PM pollution accounted for 4.7 % of the disability-adjusted life years (Murray et al., 2020) and over 4.1 million deaths (Fuller et al., 2022) in 2019. Therefore, understanding the complex physicochemical and atmospheric transport processes that lead to PM formation is essential to reducing PM and other harmful secondary atmospheric pollutants. Amongst atmospheric scientists, chemical transport models (CTMs) have become essential tools for interpreting observations of and examining inferences about formation processes of atmospheric pollutants. By solving the mass conservation equation for different species based on atmospheric dispersion and transport, photochemical processes, atmospheric chemistry, and aerosol processes, CTMs can provide estimates of primary and secondary air pollutants (Seinfeld and Pandis, 2016). Environmental decision-makers and researchers rely on CTMs to determine appropriate policies to control air pollution and predict atmospheric pollutant concentrations. Experimental studies (e.g. Ng et al., 2008) and measurement campaigns (e.g. Sareen et al., 2016) provide researchers with more insights about the anthropogenic and biogenic aerosol formation processes. These studies ultimately lead to developments and updates to the gas-phase chemistry and aerosol formation mechanisms in CTMs. For these newly added species and mechanisms in CTMs, understanding the sensitivity of aerosol species concentrations to emissions of their precursors is crucial for determining the priority of primary pollutant emission reductions to achieve atmospheric pollutant reduction objectives.

Sensitivity analysis methods have become invaluable for evaluating uncertainties, understanding concentration–emission relationships in CTMs, and assimilating observations of atmospheric pollutants to improve model parameters. Specifically, the kind of sensitivities described here are the partial derivative of one or more model outputs with respect to one or more model inputs. For instance, if the model has input variables X and output variables Y, the nth-order sensitivity coefficient of one output variable Yi to one input variable Xi can be represented as the nth-order partial derivative of Yi with respect to Xi, nYiXin (Cohan and Napelenok, 2011). Most sensitivity analysis techniques are formulations of the tangent linear model, which provides source-oriented sensitivities or, mathematically, one column of the Jacobian or Hessian at the model state. In contrast, the model adjoint provides receptor-oriented sensitivities or, mathematically, one row of the Jacobian at the model state. Two distinct approaches to developing these models are the continuous approach, in which the derivative of the underlying equation is formulated and then implemented numerically, and the discrete approach, in which the derivative of the numerical solution of the model is formulated (Sandu et al., 2005). Since the model adjoint provides source-oriented sensitivities and is not directly comparable with other methods (including the hyperdual-step method, which is the focus of this work), it will not be further discussed in the following sections. Other augmented model methods, including the Integrated Source Apportionment Method (Kwok et al., 2013, 2015), are based on a different approach and are thus also not discussed further in the following paragraphs.

The first-order sensitivity coefficient is usually the most useful for CTM applications because it describes the linear relationship between Xi and Yi. Higher-order sensitivities can be helpful when assessing the nonlinear relationships or dynamics among multiple input variables. Previous studies found that highly nonlinear concentration–emission responses commonly exist in CTMs, particularly for the formation process of PM (Hakami, 2004; Xu et al., 2018; Tian et al., 2010). Therefore, accurately determining the first-order and second-order relationships is useful for understanding concentration–emission responses in CTMs. Practically, the characteristics of an ideal sensitivity analysis method are numerical accuracy, computational efficiency, and minimal development (Lantoine et al., 2012).

Because analytical sensitivities are impractical for these models, researchers have employed a few numerical methods to calculate the first- and higher-order sensitivities in CTMs. One such method is the finite difference method (FDM), which is often designated the brute-force method. The FDM is based on the first- or higher-order approximation of the Taylor series expansion from a small perturbation (Boole, 1960). The sensitivities are calculated by running the model multiple times with incrementally different values for the input variables of interest and taking the difference in the resulting concentration fields. While this method is simple to understand and implement, truncation and subtractive cancellation errors can substantially reduce the accuracy of the calculated sensitivity coefficients, particularly for nonlinear input–output relationships (Fornberg, 1981). Truncation errors originate from neglected higher-order terms in the Taylor series expansion. For instance, suppose a policymaker is interested in calculating the effects of reducing SO2 emission on the total PM2.5 concentration. The sensitivity analysis indicates that the first-order partial derivative is positive and the second-order partial derivative is negative. In that case, only considering the first-order FDM approximation will overestimate the effect of reducing the SO2 emission on the total PM2.5 concentration. The truncation error can be minimized by taking a small perturbation step, thus eliminating the impact of higher-order sensitivity terms on the first-order result. However, smaller perturbation steps might lead to subtractive cancellation errors, which stem from the fact that computers cannot distinguish two numbers close to each other. If the perturbation size is within the numerical noise of the model, the numerator difference sometimes approaches zero or the sensitivity information might be meaningless, which causes an inherent tension between reducing the truncation error and the subtractive cancellation error for the FDM. Determining ideal perturbation sizes for different variables is challenging because the ideal perturbation size depends on the input species of interest and other parameters in the model. The necessity of selecting the proper perturbation size for each input variable of interest and running the model multiple times makes the FDM a computationally expensive method to obtain sufficiently accurate sensitivities from CTMs.

As a continuous, source-oriented approach, the decoupled direct method (DDM) eliminates the numerical accuracy issues of the FDM and improves the computational efficiency of calculating source-oriented sensitivities but only with a hefty development cost. Dunker (1981) introduced to atmospheric modelling the direct method, which involves formulating new sensitivity equations from the original model and solving both sets simultaneously. The direct method has been proven numerically unstable for solving stiff equations, which exist in many chemical transport models. On the other hand, DDM formulates sensitivity equations like the direct method but separately solves the original and sensitivity equations. This approach improves the computational efficiency and stability compared to the direct method. Yang et al. (1997) was the first to apply the DDM-3D method in a three-dimensional chemical transport model. Hakami et al. (2003) extended the method to a higher-order DDM (HDDM) in the gas phase of the Community Multiscale Air Quality Model (CMAQ), while Zhang et al. (2012) augmented CMAQ-HDDM to include the second-order sensitivities of PM2.5 concentration to NOx and SO2 emissions. Unlike the FDM, the DDM does not incur truncation or subtractive cancellation errors since separate equations are solved for the sensitivities. The DDM also allows the computation of sensitivities of many outputs to more than one input simultaneously, saving significant computation resources. The major disadvantage of DDM for CMAQ and other CTMs is the difficulty of co-development alongside ongoing scientific model development, which is one purpose of CTMs. The implementation of DDM requires writing sensitivity equations for nonlinear steps, which commonly exist in the chemistry and advection parts of CTMs (Fike and Alonso, 2011). New sensitivity equations must be written when CTMs are updated, reducing the ease of maintenance of DDM in complex CTMs and eliminating the opportunity for sensitivities to be used for evaluation in the process of developing new scientific modules within the CTMs.

The complex variable method (CVM) and the multicomplex step approach (MCX) are the methods most comparable to the hyperdual step method. Lyness and Moler (1967) introduced the concept of using imaginary space to propagate derivatives for functions in real space based on Cauchy integrals. Squire and Trapp (1998) made the idea practical through an elegant truncation of the Taylor series expansion of complex numbers which allows nearly exact first-order sensitivities if the imaginary perturbation is small enough. Constantin and Barrett (2014) applied the CVM on the adjoint method of the global CTM GEOS-Chem to compute near-exact sensitivities with receptor orientation in one order and source orientation in the other. Lantoine et al. (2012) developed a multicomplex number system to allow higher-order sensitivities to be calculated to machine precision for functions in real space. Berman et al. (2023) implemented MCX in the inorganic aerosol thermodynamic model ISORROPIA, which is used in CMAQ and other air quality models. These methods require the inclusion of a library of overloaded operators to treat the types of numbers required and the conversion of the model from real to complex space. The accuracy of these approaches is only limited by ensuring that the imaginary perturbation is small enough, which may require tuning depending on the complexity of the model. CVM does not require as much memory and computational time as MCX, but both contribute overhead. Both are very easily updated with new scientific modules. The main limitation of CVM and MCX is that sensitivities cannot be propagated through models, like CMAQ, that originally include calculations in imaginary space.

Finally, the method of interest in this work is the hyperdual-step method (HYD), which computes source-oriented first- and second-order sensitivities to machine precision. HYD relies on hyperdual numbers, which are a specific type of generalized complex number developed particularly for first- and second-order derivative calculations (Fike and Alonso, 2011). The HYD, like the CVM or MCX, is an approach based on a Taylor series expansion in a non-real number space. The unique mathematical properties of hyperdual numbers lead to an elegant calculation of first-, second-, and potentially higher-order sensitivities to machine precision without truncation or cancellation errors. Hyperdual numbers have been applied in numerical models in different fields of study to calculate exact first- and second-order derivatives. Cohen and Shoham (2015) applied hyperdual numbers to compute second-order derivatives in multibody kinetics problems. Tanaka et al. (2015) utilized hyperdual numbers to automatically differentiate hyperelastic material models. Rehner and Bauer (2021) applied hyperdual numbers to equation of state modelling and the calculation of critical points. This method, which is applicable to models with calculations in imaginary space, is accurate to machine precision, reasonably computationally expensive, and quite straightforward to update.

Here, we implement the HYD in CMAQ version 5.3.2 to develop the novel augmented model CMAQ-hyd, and we apply it to calculate the sensitivities of both inorganic and organic aerosol concentrations to their precursor emissions. To our best knowledge, this work represents the first implementation of hyperdual numbers to calculate first- and second-order sensitivities in a CTM. In Sect. 2, hyperdual numbers and the hyperdual-step method are introduced, as is the process of implementing and evaluating HYD in CMAQ. In Sect. 3, the evaluation of first- and second-order sensitivities from CMAQ-hyd is conducted, including the computational costs. In Sect. 4, CMAQ-hyd is applied to understand the influences of anthropogenic and biogenic emissions on select secondary organic aerosol (SOA) concentrations. This work provides an accurate and easily manageable method to compute first- and second-order sensitivities implemented in CMAQ version 5.3.2 and an example of its potential application in other complex models where the sensitivities are of interest.

2 Methods

2.1 Hyperdual numbers and the hyperdual-step method

A hyperdual number (Fike and Alonso, 2011) has four components and is characterized by

(1) H = a 0 + a 1 ϵ 1 + a 2 ϵ 2 + a 12 ϵ 12 ,

where a0, a1, a2, and a12 are real numbers and ϵ1, ϵ2, and ϵ12 are non-real parts. The three crucial properties which enable numerically exact first- and second-order sensitivity calculations are

(2)ϵ12=ϵ22=ϵ122=0(3)ϵ1ϵ2ϵ120(4)ϵ1ϵ2=ϵ12.

The squares of the non-real individual parts equal zero (Eq. 2). The non-real parts themselves do not equal anything in real space (Eq. 3). The multiplication of ϵ1 and ϵ2 is equal to the third non-real component ϵ12 (Eq. 4). The addition and multiplication of hyperdual numbers are commutative, and the definitions help eliminate the higher-order terms in a Taylor series expansion. A demonstration of several basic operations is provided in the Supplement, while a more detailed discussion of the mathematical properties of hyperdual numbers is given by Fike and Alonso (2011).

Akin to the Taylor series expansion about the real value of x in the finite difference method, the method of ascertaining sensitivities through a perturbation in hyperdual space is based on a Taylor series expansion in an orthogonal dimension of the number. Specifically, a hyperdual number with unity in a0 and unity in one of a1 or a2 is multiplied by the independent variable of interest. After model execution, a Taylor series expansion is applied to extract sensitivities. For instance, the hyperdual-step method is applied to a scalar function f(x) by multiplying x by the hyperdual number Hh=1.0+h1ϵ1+h2ϵ2, which results in

(5) f ( x H h ) = f ( x ) + ( x h 1 ϵ 1 + x h 2 ϵ 2 ) f ( x ) + 1 2 ! × ( x h 1 ϵ 1 + x h 2 ϵ 2 ) 2 f ′′ ( x ) + 1 3 ! ( x h 1 ϵ 1 + x h 2 ϵ 2 ) 3 f ′′′ ( x ) + ,

where “…” represents higher-order terms in the series. Eliminating all terms that are zero due to the definition of hyperdual numbers (Eq. 2) leads to

(6) f ( x H h ) = f ( x ) + ( x h 1 ϵ 1 + x h 2 ϵ 2 ) f ( x ) + x 2 h 1 h 2 ϵ 12 f ′′ ( x ) ,

where f(xHh) is a hyperdual number.

The properties of hyperdual numbers (Eqs. 2–4) lead to two significant results. First, all terms in the Taylor series expansion with derivatives higher than second order become zero because all values include ϵ12, ϵ22, or ϵ122. Second, the real component is unchanged. A more detailed expansion of terms can be found in Eq. (S7) in the Supplement or the original development of hyperdual numbers; it follows the multiplication rule between a hyperdual and a real number (Fike and Alonso, 2011). Finally, the first- and second-order derivatives are separated into different parts of the hyperdual number. The first-order derivative exists in either the ϵ1 or the ϵ2 term, while the second-order derivatives only exist in the ϵ12 term. The first- and second-order derivatives are

(7)f(x)=ϵ1part[fxHh]xh1=ϵ2part[fxHh]xh2(8)f′′(x)=ϵ12part[fxHh]x2h1h2,

where ϵ1part[], ϵ2part[], and ϵ12part[] represent functions that extract the a1, a2, or a12, respectively. Since the derivative computation process (Eqs. 6–9) does not involve subtractions or higher-order sensitivities, the first- and second-order sensitivities calculated by the hyperdual-step method are free from subtractive cancellation and truncation errors. This method (Eqs. 8 and 9) extends to vector operations to compute arrays of numerically exact derivatives. For instance, the partial first- and second-order derivatives for f(x), where x=[x1,x2,,xn], with respect to x1 through a hyperdual-step perturbation to x1 is

(9)f(x)x1=ϵ1part[f(xHh,x1)]x1h1=ϵ2part[f(xHh,x1)]x1h2(10)2f(x)x12=ϵ12part[f(xHh,x1)]x12h1h2.

Similarly, two different independent variables x1 and x2 may be perturbed simultaneously. In this case, two arrays of first-order sensitivity and one array of cross-sensitivity result:

(11)f(x)x1=ϵ1part[f(xHh)]x1h1(12)f(x)x2=ϵ2part[f(xHh)]x2h2(13)2f(x)x1x2=ϵ12part[f(xHh)]x1x2h1h2.

Therefore, the two variations of the hyperdual-step method will generate either one or two arrays of first-order sensitivities and one array of second-order or cross sensitivities with a single run of the model.

2.2 Community Multiscale Air Quality Model and the implementation of the hyperdual-step method

The Community Multiscale Air Quality Model (CMAQ), developed by the U.S. Environmental Protection Agency (EPA), is an Eulerian CTM which can predict air pollutant concentrations at regional and hemispheric scales (Byun and Schere, 2006). CMAQ represents advection, diffusion, gas-phase chemistry, aerosol processes, cloud processes, and photolysis. CMAQ has been applied to predict pollutant concentrations in the atmosphere (Liu et al., 2010; Sayeed et al., 2021), understand fundamental atmospheric chemistry and aerosol formation mechanisms (Zhu et al., 2018; Li et al., 2019), and guide policy-making processes (Chemel et al., 2014; Che et al., 2011; Li et al., 2019; Ring et al., 2018). CMAQ is used in the regulatory process of the US EPA when states, tribes, or local jurisdictions demonstrate how they will attain the National Ambient Air Quality Standard (NAAQS) and or comply with the Regional Haze Rule (Mebust et al., 2003). CMAQ solves the atmospheric diffusion equation shown in Eq. (14) to calculate the concentrations of gaseous and aerosol species in the atmosphere:

(14) c i t = - ( u c i ) + ( K c i ) + R i + E i ,

where ci, u, K, Ri, and Ei are the concentration of species i, the wind velocity vector, the diffusivity tensor, the change in concentration due to chemical reactions of species i, and the emission rate of species i, respectively. Species concentrations are stored in a multidimensional array and propagated through different scientific modules within the model. For this work, CMAQ was run with 12 km by 12 km horizontal resolution and with 35 vertical layers, 100 columns, and 80 rows over the Southeast US on 1 July 2016, GMT (U.S. Environmental Protection Agency, 2019). The gas-phase chemistry mechanism used is Carbon Bond 6 (Luecken et al., 2019).

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f01

Figure 1A schematic showing the original CMAQ model compared with CMAQ-hyd. (a) The original CMAQ model (green). (b) The CMAQ-hyd model. CMAQ-hyd incorporates the hyperdual number operator overloading module. The purple components in (b) represent changes made to the original CMAQ model. Changes were made to the modules and emission-processing parts of CMAQ. The first- and second-order sensitivities of species concentrations with respect to selected precursor concentrations or emissions are new outputs of the model.

Download

In CMAQ, the hyperdual-step method was implemented by strategically converting the model to use hyperdual numbers (Fig. 1). First, the operators were overloaded by translating a C-based library from Fike and Alonso (2011) to Fortran (“HDMod”) and augmenting it to treat multidimensional data as required by CMAQ. HDMod, which defines a hyperdual version of all possible calculations related to the chemical concentration array, was developed. The library includes basic arithmetic operations, such as addition and subtraction, as well as more advanced functions like trigonometric functions. Before being applied to CMAQ, the operator overloading library was separately validated by comparing it against analytical derivatives using a testing framework developed by Pellegrini and Russell (2016). Secondly, the CMAQ variable containing species concentration information and all other variables that depend on it were converted from real numbers to the newly defined hyperdual number type. The source code was carefully analysed to select only the necessary variables for conversion. Many variables in CMAQ do not need to be altered because they do not influence the main concentration array. This highly detailed process helped minimize the additional computational burden of the model since calculations with hyperdual numbers are more computationally intensive than those with real numbers. For instance, one hyperdual multiplication operation shown in Eq. (5) results in five more additions and nine more multiplications than an operation with real numbers. According to Fike and Alonso (2011), the computational cost of a hyperdual calculation ranges from 4 to 14 times higher than the original operation. Applying hyperdual numbers to all the variables in a CFD model results in an approximately 10 times higher computational cost (Fike and Alonso, 2011). Thirdly, the first- and second-order sensitivities of the species concentrations to perturbed emissions are included in the new hyperdual array, which is then saved to additional output files using the same structure as the output of the original concentration array. As a result, first- and second-order sensitivities can be propagated through the model without significantly modifying the source code. The modification efforts mainly focused on determining the variables that must be converted to the hyperdual type. Consequently, updating CMAQ-hyd when there are changes to the original model is a simple process that involves converting only the newly added variables to the hyperdual type. This simplicity is an advantage over other computational techniques, such as the DDM and adjoint method, which compute numerically exact sensitivities but require more complex and time-consuming update procedures, including writing new equations.

Several source code alterations were made to reduce the complexity of development and overcome the numerical instabilities related to hyperdual calculations in CMAQ's aerosol module, specifically within the inorganic thermodynamic module ISORROPIA (Fountoukis and Nenes, 2007; Nenes et al., 1998). For the simplicity of development, we applied a Fortran-90-compilant version of ISORROPIA to replace the original Fortran 77 version of ISORROPIA in CMAQ.

ISORROPIA, a key component of the aerosol module in CMAQ, is called in either the forward or the reverse mode. The forward mode of ISORROPIA takes the sum of gas and aerosol species concentrations, along with the relative humidity and temperature, to determine the partitioning of Aitken- and accumulation-mode species across the gas and aerosol phases in CMAQ. In the original CMAQ model, ISORROPIA is run in the forward mode without limiting the temperature and pressure of the simulation. The determination process for species concentrations involves an iterative method which is sometimes numerically unstable during iterations for upper layer cells with low temperature and pressure for sensitivity computations with the HYD. To increase the numerical stability of CMAQ-hyd, we implemented temperature and pressure constraints so that the forward-mode ISORROPIA is only called when the cell temperature exceeds 260 K and the cell pressure exceeds 20 000 Pa. A similar set of temperature and pressure limits were applied to the call of ISORROPIA in the adjoint of CMAQ (Zhao et al., 2020). These changes do not affect the species concentrations computed by CMAQ while ensuring that the sensitivity computation process is stable.

To calculate the dynamic equilibrium of coarse-mode aerosol species with the gas phase (Pilinis et al., 2000; Capaldo et al., 2000), CMAQ employs the reverse mode of ISORROPIA. The input to reverse-mode ISORROPIA includes concentrations of aerosol species, relative humidity, and temperature, and it results in partitioned concentrations in the solid, liquid, and gas phases for coarse-mode inorganic aerosols. The reverse-mode solution leads to unrealistic sensitivities calculated by the HYD when the aerosol pH is close to neutral. One previous study found that the reverse ISORROPIA fails to capture the actual behaviour of inorganic aerosol when the pH is close to 7 (Hennigan et al., 2015). To ensure the stability of the sensitivity calculations, the changes to the hyperdual components in the coarse-mode dynamic equilibrium are ignored when the pH of coarse-mode aerosol is close to neutral, which ensures that the real components are identical to the original model.

2.3 Evaluating sensitivities from CMAQ-hyd

CMAQ-hyd produces sensitivities that can be semi- or fully normalized for concentrations from any range of grid cells and times with respect to emissions or concentrations from any range of grid cells and times. Here, for the sake of illustration, we consider the semi-normalized sensitivities of time-averaged output concentrations of ground-level PM2.5 concentrations (CPM2.5,c,r,l=0,t) to input NOx (NO+NO2) emissions (ENOx,c,r,l,t) averaged over time, t, for any given cell as indicated by the column c and row r. The first-order semi-normalized sensitivities, sNOxPM2.5, and second-order semi-normalized sensitivities, sNOx(2)PM2.5, exemplify sensitivities relevant to environmental decision makers (Eqs. 15 and 16).

(15)sNOxPM2.5=CPM2.5,c,r,l=0,t|tENOx,c,r,l,t|tENOx,c,r,l,t|t(16)sNOx(2)PM2.5=2CPM2.5,c,r,l=0,t|tENOx,c,r,l,t2|tENOx,c,r,l,t2|t

Semi-normalized sensitivities reduce the complexity of interpretation by providing sensitivities in the units of concentration per percent change of emissions. The semi-normalized sensitivities also scale down the impact from cells with low emission rates, which is consistent with the concentration reduction that is realistic to expect. Similarly, the time-averaged, semi-normalized cross-sensitivity of PM2.5 to both NOx and monoterpene is denoted as sNOx,TERP(2)PM2.5, with ETERP representing the emission of monoterpenes (Eq. 17).

(17) s NO x , TERP ( 2 ) PM 2.5 = 2 C PM 2.5 , c , r , l = 0 , t | t E NO x , c , r , l , t | t E TERP , c , r , l , t | t × E NO x , c , r , l , t | t E TERP , c , r , l , t | t

The evaluation of the first order sensitivities of CMAQ-hyd is done by performing a comparison of the sensitivities calculated by the hyperdual-step method against those from the FDM (Eq. 18). The comparison is illustrated with an example of calculating cell-specific sensitivities of PM2.5 concentration to NOx emissions. The first-order sensitivity of PM2.5 concentration at the end of a 24 h simulation to a cumulative NOx emission perturbation is given by

(18) s NO x PM 2.5 , t = 24 C PM 2.5 , c , r , l , t = 24 inc - C PM 2.5 , c , r , l , t = 24 dec t = 0 t = 24 E NO x , c , r , l = 0 , t inc - t = 0 t = 24 E NO x , c , r , l = 0 , t dec × Σ t = 0 t = 24 E NO x , c , r , l = 0 , t orig ,

where the subscripts c, r, and l represent the column, row, and layer; the subscript t represents the time from the start of the model run; and the superscripts “inc”, “dec”, and “orig” represent the initial perturbation direction (i.e. increased, decreased, and original emissions, respectively). For instance, CPM2.5,c,r,l,t=24inc is the concentration of PM2.5 for each column, row, and layer 24 h into the run when there is an increase in NOx emissions throughout the model run. Unless otherwise noted, the relative perturbation sizes for first-order FDM calculations are 125 % and 75 % for domain-wide emissions. The average ground-layer sensitivities for the 24 h simulation time are computed. Previous studies have found that smaller perturbation sizes for inorganic aerosol sensitivity calculations in CMAQ using FDM are more prone to numerical noise (Zhang et al., 2012). The semi-normalized sensitivity of each cell is computed with the central difference method and is only an approximation of the actual sensitivity due to subtractive cancellation and truncation errors. The numerator is the difference between PM2.5 concentrations with persistent increases or decreases in NOx emissions. The denominator is the total emission perturbation of NOx emission. The sensitivities are semi-normalized by the sum of the NOx emissions in the base model run. The calculated first-order semi-normalized sensitivities will have units of µg m−3.

The semi-normalized sensitivity of PM2.5 concentrations with respect to NOx emissions using a hyperdual perturbation of Ha=1+a1ϵ1+a2ϵ2 is computed by the hyperdual-step method as

(19) s NO x PM 2.5 , t = 24 = ϵ 1 part C PM 2.5 , c , r , l , t = 24 orig a 1 = ϵ 2 part C PM 2.5 , c , r , l , t = 24 orig a 2 .

The first-order semi-normalized sensitivity can be computed with either the ϵ1 or the ϵ2 part. The ϵ1 or the ϵ2 part of the PM2.5 concentration is divided by the initial perturbation in the ϵ1 or ϵ2 space, respectively. The emissions in the denominator will cancel out with the semi-normalized emissions.

Although the FDM can be applied to compute second-order sensitivities in CMAQ, previous studies have shown that the results are noisy and highly dependent on the perturbation sizes (Zhao et al., 2020; Zhang et al., 2012). In order to evaluate the second-order sensitivities computed by the HYD method, we adopted a hybrid hyperdual–finite difference method (HYD-FDM). The HYD-FDM sensitivity calculation is given by

(20) s NO x ( 2 ) PM 2.5 ϵ 1 part C PM 2.5 , c , r , l , t = 24 inc a 1 t = 0 t = 24 E NO x , c , r , l = 0 , t inc - ϵ 1 part C PM 2.5 , c , r , l , t = 24 dec b 1 t = 0 t = 24 E NO x , c , r , l = 0 , t dec t = 0 t = 24 E NO x , c , r , l = 0 , t inc - t = 0 t = 24 E NO x , c , r , l = 0 , t dec × Σ t = 0 t = 24 E NO x , c , r , l = 0 , t orig 2 ,

where two separate simulations were run: one with increased and another with decreased initial NOx emissions. The emission perturbations for the two runs are Ha=1+a1ϵ1+a2ϵ2 for the run with increased initial NOx emission and Hb=1+b1ϵ1+b2ϵ2 for the run with decreased initial emission of NOx. HYD-FDM uses the regular finite difference on the difference between first-order sensitivities calculated by using the ϵ1 part of the hyperdual-step results. The sensitivity in this equation is an estimate and subject to numerical errors because it includes the usage of FDM.

The second-order sensitivity calculated by the hyperdual-step method is shown in Eq. (21) below:

(21) s NO x ( 2 ) PM 2.5 = ϵ 12 part C PM 2.5 , c , r , l , t = 24 orig a 1 a 2 .

The hyperdual-step method uses the ϵ12 part of the output variable, dividing it by the multiplication of a1 and a2. The second-order sensitivities calculated only by the HYD method are numerically exact. All the sensitivities are computed for each cell, and comparisons between the finite difference and the hyperdual-step method are performed on a cell-to-cell basis.

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f02

Figure 2The comparisons of first-order sensitivities on the ground layer calculated by the hyperdual-step method (HYD sensitivities) and FDM (FD sensitivities). The sensitivities are colour coded by the perturbed emissions (i.e. NOx, SO2, TERP, and APIN). Panels (a)(d) show the gas-phase species sensitivities with respect to their emissions. APIN denotes α-pinene and TERP denotes all other monoterpene species. Panels (e)(h) show examples of aerosol-phase products with respect to their precursors. ANO3 denotes the total aerosol-phase nitrate products. ASO4 denotes the total aerosol sulfate products. ΣAMT denotes the total aerosol photooxidation products from monoterpene. Panels (i)(n) show the total PM2.5 concentration with respect to gas-phase precursors. The sensitivities calculated are noted at the bottom-right corner of each panel with the notation pattern mentioned in Sect. 2.3.

Download

Table 1The slopes and R2 values from the linear regression of the first-order sensitivities of ground-layer species concentrations to domain-wide perturbations by the hyperdual-step method compared to finite-difference sensitivities. The gas-phase species sensitivities with respect to their emissions are on line three, where APIN denotes α-pinene and TERP denotes all other monoterpene species. Line four includes sensitivities of aerosol-phase products with respect to their precursors, where ANO3 denotes the total aerosol-phase nitrate products, ASO4 denotes the total aerosol sulfate products, and ΣAMT denotes the total aerosol photooxidation products from monoterpene. Line five includes the sensitivities of the total PM2.5 concentration with respect to each gas-phase precursor. A visual comparison of the agreement for each relationship is shown in Fig. 2.

Download Print Version | Download XLSX

3 Results and discussion

3.1 Evaluation of the first- and second-order sensitivities

We evaluated the implementation of CMAQ-hyd by comparing the first-order sensitivities of various species in CMAQ calculated by HYD with a hyperdual-step perturbation as described in Sect. 2.3 (HYD sensitivities) and FDM with a domain-wide emission perturbation (FD sensitivities). The FD sensitivities were computed with the difference between a 25 % increase and a 25 % decrease in domain-wide emissions using the central finite difference method. Overall, the different HYD and FD sensitivities agree well, as evidenced by the close alignment of the points on the blue identity line, which represents perfect agreement, in most of the panels of Fig. 2. The slope and R2 values for all comparisons are provided in Table 1, and additional slope and R2 values are provided in Tables S1 and S2 in the Supplement. Specifically, the slopes and R2 values for gas-phase species concentrations on the ground layer with respect to their emissions on the ground layer (Fig. 2a–d) are all 1.00 (Table 1), indicating minimal nonlinearity in these relationships, as expected.

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f03

Figure 3Comparisons of the first-order sensitivities of ground-layer aerosol nitrate (ANO3) concentration with respect to a domain-wide perturbation of SO2 emission calculated by the FDM method (a–c) and the HYD method. (a) FD sensitivities calculated with the central difference method and a 25 % perturbation each way (125 %, 75 %). (b) FD sensitivities calculated with the central difference method and a 10 % perturbation each way (110 %, 90 %). (c) FD sensitivities calculated with the central difference method and a 5 % perturbation each way (105 %, 95 %). (d) HYD sensitivities.

Secondary aerosol formation is a more nonlinear process, which is explored by using inorganic or organic aerosol concentrations with respect to select precursors (Fig. 2e–h). Nonlinearities in the modelled processes lead to discrepancies between HYD and FD sensitivities without tuning the FD sensitivity to capture the slope about the model state more exactly. The slopes and R2 values of the trendline between these HYD and FD sensitivities range from 0.99 to 1.00 and from 1.00 to 1.04 (Table 1), respectively. The comparisons between the HYD and FD sensitivities of sNOxANO3 and sSO2ASO4 show slight deviations from the identity line, indicating some nonlinearity in their formation processes (Fig. 2e and f). Most points representing the HYD and FD sensitivities of total monoterpene photooxidation products to monoterpenes (sTERPAMTNO3) and α-pinene (sAPINAMTNO3) remain on the identity line (Fig. 2g and h).

Regulatory models are often used to evaluate the response of total PM2.5 to emissions changes, so the sensitivities of the total PM2.5 concentration to the four different precursor emissions are evaluated (Fig. 2i–n). The HYD and FD sensitivities of sNOxPM2.5, sTERPPM2.5, and sAPINPM2.5 (Fig. 2i, k, and n) agree well, with slope and R2 values ranging from 0.96 to 1.03 and from 0.99 to 1.00, respectively (Table 1). However, the agreement between the HYD and FD sensitivities of sSO2PM2.5 (Fig. 2j) is much lower, with a slope of 0.65 and an R2 value of 0.63 (Table 1). Notably, although sSO2PM2.5 is usually positive, as evidenced by most of the points on the identity line, the sSO2PM2.5 calculated by FDM sensitivities has a few negative values where the HYD and FD sensitivities disagree. Because it is highly unlikely that an 25 % increase in SO2 emission leads to a decrease in PM2.5 concentration, the negative FD sensitivities likely arise from truncation errors inherent to the FDM since the perturbation sizes are large (i.e. a 25 % emission perturbation). Though it is possible to refine the perturbation size to one more suitable for this particular relationship of emissions to concentration, as demonstrated in the next section, this difference in one of 12 comparisons shows one of the strengths of HYD, which is the irrelevance of the perturbation size to the exactness of the resulting sensitivity.

To further illustrate the impact of nonlinear relationships between emissions and concentrations on FD sensitivities, we explored the sensitivity of ground-level aerosol nitrate to emissions of sulfur dioxide, sSO2ANO3, calculated with different perturbation sizes using the FDM. Our analysis revealed a low level of agreement between the FD and HYD sensitivities in the base case scenario where the domain-wide SO2 emission was perturbed by 125 % and 75 %, with a slope of 0.10 and an R2 of 0.30 (Table S1). The FD sensitivities calculated with the base case perturbation (125 %, 75 %) and two other perturbation size pairs (110 %, 90 %; 105 %, 95 %) are shown in Fig. 3. The FDM sensitivities calculated with different perturbation sizes are plotted in Fig. 3a–c, respectively. The FDM sensitivities exhibit similar behaviour to the HYD sensitivities over the continents. However, the inconsistency among the sensitivities calculated by FDM with different perturbation sizes over the ocean (Fig. 3) suggests that the FD sensitivities heavily depend on the perturbation sizes. This result demonstrates the relatively low credibility of FD sensitivities, particularly for highly nonlinear relationships where the truncation errors could be large. Notably, reducing perturbation sizes in the FDM did not lead to convergence with hyperdual sensitivities. This divergence may be attributed to the propagation of numerical noise from the model run to the calculated sensitivities as perturbation sizes decrease. This finding is consistent with the results in Zhang et al. (2012). Our findings highlight the importance of using other methods, including the HYD, which are not prone to truncation or cancellation errors for probing nonlinear relationships in CTMs.

We also compared the spatial distribution of HYD sensitivities (Fig. 4a) against the average (Fig. 4b) and the range (Fig. 4c) of the FD sensitivities with three different perturbation sizes. Differences are evident between the HYD and the average FD sensitivities in central North Carolina and Tennessee as well as off the coasts of Georgia and South Carolina. The HYD predicts slightly negative sensitivities in North Carolina and Tennessee, while the FDM predicts slightly positive values. The average FDM sensitivities off the coast of Georgia and South Carolina were noisy, with alternating positive and negative sensitivities, while the HYD sensitivities were much less noisy. In addition, the range of FDM sensitivities with different perturbation sizes was large (Fig. 4c), especially off the coast of Georgia and South Carolina. The results shown in Fig. 4b and c illustrate the dependence of FDM sensitivities on the perturbation sizes, especially for highly nonlinear relationships.

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f04

Figure 4Comparisons of the sensitivities of aerosol-phase nitrate (ANO3) with respect to SO2 emission calculated by HYD and FDM on a map. (a) The HYD sensitivities. (b) The average of the FD sensitivities with three different perturbation size pairs (125 %, 75 %; 110 %, 90 %; 105 %, 95 %). (c) The range of FD sensitivities with three different perturbation size pairs (125 %, 75 %; 110 %, 90 %; 105 %, 95 %).

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f05

Figure 5Comparisons of second-order sensitivities on the ground layer calculated by the hyperdual-step method (HYD sensitivities) and the hybrid HYD-FDM method (HYD-FDM sensitivities). The sensitivities are colour coded by the perturbed emissions (i.e. NOx, SO2, TERP, and APIN). Panels (a)(d) show the gas-phase species sensitivities with respect to their emissions. APIN denotes α-pinene and TERP denotes all other monoterpene species. Panels (e)(h) show examples of aerosol-phase products with respect to their precursors. ANO3 denotes the total aerosol-phase nitrate products. ASO4 denotes the total aerosol sulfate products. ΣAMT denotes the total aerosol photooxidation products from monoterpene. Panels (i)(n) show the total PM2.5 concentration with respect to gas-phase precursors. The sensitivities calculated are noted at the bottom-right corner of each panel with the notation pattern mentioned in Sect. 2.3.

Download

Table 2The slopes and R2 values from the linear regression of the second-order sensitivities of ground-layer species concentrations to domain-wide perturbations by the hyperdual-step method compared to finite difference sensitivities. The gas-phase species sensitivities with respect to their emissions are on line three, where APIN denotes α-pinene and TERP denotes all other monoterpene species. Line four includes sensitivities of aerosol-phase products with respect to their precursors, where ANO3 denotes the total aerosol-phase nitrate products, ASO4 denotes the total aerosol sulfate products, and ΣAMT denotes the total aerosol photooxidation products from monoterpene. Line five includes the sensitivities of the total PM2.5 concentration with respect to each gas-phase precursor. A visual comparison of the agreement for each relationship is shown in Fig. 5.

Download Print Version | Download XLSX

We also compared the second-order HYD sensitivities with those calculated from the hybrid HYD-FDM method (hybrid second-order sensitivities) described in Sect. 2.3 using one-to-one plots with identity lines for each panel (Fig. 4) along with the slope and R2 values (Table 2). Additional slopes and R2 values for second-order sensitivities can be found in Tables S1 and S2. Overall, the agreement between HYD and the hybrid second-order sensitivities is good except for those to SO2 emissions. This result can be attributed to the numerical errors in the first-order sensitivities to SO2, as illustrated in Figs. 2j and 3. Computing second-order sensitivities with the hybrid method, which includes FDM, is expected to add numerical noise. Except for sSO2(2)SO2, the hyperdual and hybrid second-order sensitivities of gas-phase species concentrations to emissions of the same species exhibit good agreement, with slopes and R2 values ranging from 0.84 to 0.85 and 0.84 to 0.86, respectively (Table 2). The hybrid second-order sensitivities are sometimes large, while HYD predicts close-to-zero sensitivities. This result is especially evident in sTERP(2)TERP (Fig. 5c) and sAPIN(2)APIN (Fig. 5d). This spread in the hybrid sensitivities likely originates from the FDM step, which is subject to numerical errors. Figure 5e–h show the HYD and HYD-FDM sensitivities of aerosol-phase product concentrations to the precursor emissions for this modelling period. Except for sSO2ASO4, the slope and R2 values range from 0.61 to 0.82 and 0.38 to 0.71, respectively (Table 2). The degree of agreement for sNOxANO3 is slightly lower than those for sTERPΣAMT and sAPINΣAMT, indicating more nonlinearity in the formation process from NOx to aerosol nitrate. The second-order sensitivities of total PM2.5 to different precursors demonstrate excellent agreement, with slope and R2 values ranging from 0.95 to 0.99 and from 0.96 to 0.99 (Table 2), again excluding the one to SO2. The second-order sensitivities of PM2.5 to NOx and α-pinene are primarily negative, while those to monoterpenes are positive. These findings have important implications for the formation process of PM2.5 from monoterpenes and α-pinene, which will be discussed in detail in the next section.

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f06

Figure 6The first- and second-order sensitivities of total aerosol monoterpene photooxidation products (ΣAMT) and PM2.5 with respect to monoterpene (TERP) and α-pinene (APIN) emissions plotted on a map. Panels (a)(d) show the first-order sensitivities and (e)(h) show the second-order sensitivities.

3.2 Sensitivities of biogenic aerosol formation in the Southeast US computed by CMAQ-hyd

In this section, the first- and second-order sensitivities of several biogenic aerosols to both anthropogenic and biogenic aerosol precursors in the Southeast US are explored. The importance of calculating second-order sensitivities is demonstrated through the spatial distributions of the first- and second-order sensitivities of total aerosol-phase monoterpene photooxidation product (ΣAMT) and PM2.5 concentrations (Fig. 6). The first-order sensitivities (Fig. 6a–d) are predominantly positive, indicating that an increase in either TERP or APIN emissions will lead to an increase in ground-layer ΣAMT and PM2.5 concentrations. While sTERPΣAMT (Fig. 6a) and sAPINΣAMT (Fig. 6b) have similar values, sTERPPM2.5 (Fig. 6c) is slightly larger than sAPINPM2.5 (Fig. 6d) due to the formation of other species, such as aerosol-phase monoterpene nitrate products (AMTNO3). The second-order sensitivities (Fig. 6e–h) provide additional information about how ΣAMT and PM2.5 concentrations respond to changes in APIN and TERP emissions. The sTERP(2)ΣAMT (Fig. 6e) and sAPIN(2)ΣAMT (Fig. 6f) are generally positive, indicating that the response of the concentration of ΣAMT to monoterpene and α-pinene emissions is convex. An increase in either monoterpene or α-pinene emissions will lead to increases in sTERPΣAMT and sAPINΣAMT. If we only consider first-order sensitivities, the effect of changes in TERP or APIN emissions on ΣAMT concentrations will be underestimated. On the other hand, sTERP(2)PM2.5 (Fig. 6g) is mostly negative, while sAPIN(2)PM2.5 (Fig. 6h) is mostly positive. The distinct behaviour of the second-order sensitivities of PM2.5 concentration to either TERP or APIN emissions exemplifies the importance of considering second-order sensitivities for these nonlinear formation processes. Only considering the first-order sensitivities often leads to the overestimation or underestimation of the effects. The accurate second-order sensitivity information can help researchers understand the relationships of concentration to emissions more thoroughly and develop emission control strategies for specific aerosol precursor emissions.

We used the formation of aerosol monoterpene nitrate, AMTNO3, as an example of the importance of computing the cross-sensitivity, especially for complex anthropogenic–biogenic aerosol formation processes. The formation of AMTNO3 is influenced primarily by two precursors: NOx and monoterpenes. NOx is primarily emitted anthropogenically, while monoterpenes primarily originate from biogenic sources. The first- and second-order sensitivities of AMTNO3 to NOx or TERP can help researchers and environmental decision makers estimate the nonlinear effects of emissions changes on concentrations of secondary pollutants. The cross-sensitivity of AMTNO3 with respect to both NOx and TERP emissions, sNOx,TERP(2)AMTNO3, is a valuable tool for answering complex research questions. For instance, researchers can use sNOx,TERP(2)AMTNO3 to predict how an increase in monoterpene emissions would affect the first-order sensitivities of AMTNO3 to NOx. Since biogenic emissions of monoterpenes are temperature dependent, understanding how anthropogenic emissions of NOx will affect AMTNO3 formation with changing terpene emissions in future scenarios is crucial for designing effective air pollution control strategies. Computing the cross-sensitivity is especially challenging with traditional methods since determining the proper perturbation for two species using FDM is even harder than calculating second-order sensitivities with FDM. The distinct values of sNOx(2)AMTNO3, sTERP(2)AMTNO3, and sNOx,TERP(2)AMTNO3 demonstrate the value of the HYD method (Fig. 7). The spatial distributions of sNOxAMTNO3 and sTERPAMTNO3 are included in Fig. S1 in the Supplement. Overall, the second-order sensitivities are negative over land in the Southeast US. sNOx(2)AMTNO3 is generally smaller than sTERP(2)AMTNO3, indicating that the relationship between AMTNO3 and TERP emissions is more nonlinear than that between AMTNO3 and NOx. The cross-sensitivities sNOx,TERP(2)AMTNO3 are mostly positive over the Southeast US. Based on the cross-sensitivity results, we can conclude that an increase in TERP emission will make the first-order sensitivity of AMTNO3 to NOx larger. A warmer climate in the future would likely increase the impact of anthropogenic NOx on the AMTNO3 concentration in the atmosphere. This kind of information is invaluable for researchers and environmental decision makers aiming to evaluate complex secondary organic aerosol formation with multiple anthropogenic and biogenic precursors.

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f07

Figure 7The second-order sensitivities and cross-sensitivities of aerosol-phase monoterpene nitrate products (AMTNO3) with respect to NOx and monoterpene (TERP) emissions plotted on a map. (a) Second-order sensitivities of AMTNO3 with respect to NOx. (b) Second-order sensitivities of AMTNO3 with respect to TERP. (c) Cross-sensitivities of AMTNO3 with respect to NOx and TERP.

https://gmd.copernicus.org/articles/17/567/2024/gmd-17-567-2024-f08

Figure 8Computational costs of CMAQ-hyd and standard CMAQ, and the computational costs of various modules in them. CMAQ-hyd and standard CMAQ simulations using different numbers of nodes (one, two, four, and eight) are shown on the x axis. The wall time is shown on the y axis.

Download

3.3 Computational cost of CMAQ-hyd

The practical application of any sensitivity analysis in CTMs depends heavily upon its computational cost. Previous works using operator overloading approaches resulted in high computational costs due to additional mathematical operations, making this approach computationally unfavourable compared to other existing methods. For instance, the implementation of CVM in GEOS-CHEM results in a 4.5-fold increase in computational overhead when compared to the standard model (Constantin and Barrett, 2014). To evaluate the computational efficiency of CMAQ-hyd, we compared it with a standard CMAQ model using different computational resources in a supercomputing cluster. The total wall times needed when different numbers of computational nodes are used for an identical run of the original CMAQ model and CMAQ-hyd are displayed in Fig. 8. The CMAQ-hyd and standard CMAQ runs were performed with one, two, four, and eight nodes of the supercomputing cluster. The configuration of computing resources is detailed in Sect. S4 of the Supplement. Profiling of the model was completed at the level of the scientific modules, special subroutines, or other important components of CMAQ. The scientific processes are Chem (gas-phase chemistry), Aero (aerosol dynamics and thermodynamics), Vdiff (vertical diffusion), Hadv (horizontal advection), Phot (photolysis), Cldproc (cloud processes), Hdiff (horizontal diffusion), and Zadv (vertical advection). MPI_Barrier is a special subroutine used for synchronizing processes among parallel processors after vertical diffusion and before the other three transport processes. Other processes necessary for CMAQ, including the I/O processes, are included in the “Other” category. Details about the high-performance computing cluster used can be found in the Supplement.

With the same computing resources, the total computation time of CMAQ-hyd is approximately 2.5 (2.44–2.56) times longer. Despite the additional computational burden, CMAQ-hyd remains computationally competitive with the traditional FDM when calculating derivatives. One run of CMAQ-hyd generates the same amount of first- and second-order sensitivity information as at least three runs of standard CMAQ. The relatively low computational cost of CMAQ-hyd compared to the previous operator overloading approach may be due to the selective modification of the source code. In contrast to GEOS-CHEM CVM (Constantin and Barrett, 2014), only the parts of the model that involve calculating the main species concentration array use hyperdual calculations.

The computational times of scientific modules in CMAQ-hyd generally scale well with increases in computational resources, similar to standard CMAQ. Chem, Aero, and Vdiff are the most computationally expensive modules in both CMAQ and CMAQ-hyd. The relative computational cost of Aero is higher in CMAQ-hyd than in standard CMAQ. The ratio of the computational time of Chem to that of Aero is 1.53 (1.49–1.56) for the CMAQ-hyd runs and 3.98 (3.85–4.19) for the standard CMAQ runs (Fig. 8). Future work can potentially reduce the computational cost by ignoring sensitivity propagations during the iterative root-finding process in select subroutines, since only the output concentrations from these subroutines are used in the later part of the model. This is also a significant advantage of any operator-overloading-based approach (Fike and Alonso, 2011). The computational time of each module is detailed in Table S3 in the Supplement, and the full relative percentage of the computational time of each module in eight runs is shown in Fig. S2 in the Supplement.

The MPI_Barrier function also scales well with an increasing number of processors. To a certain point, subdividing the domain further reduces the variability of the time required for scientific processes to be completed across different nodes, resulting in a reduction in the amount of time the program spends waiting for all processes to be synchronized. One important thing to note here is that the scaling of MPI_Barrier is dependent on the ratio of the number of nodes to the number of grid cells. A demonstration of the subdivision of the modelling domain by processors is shown in Fig. S3 in the Supplement.

The I/O process for newly added first- and second-order sensitivity output files increases the computational cost; however, the I/O of species concentration files has a much lower computational cost than other modules in CMAQ for this specific scenario. The I/O processes of CMAQ-hyd and CMAQ take 193 (181–206) s and 52 (47–56) s, respectively. The I/O process of CMAQ-hyd takes approximately 3.7 times longer on average than that of standard CMAQ. The overall memory overhead of CMAQ-hyd is approximately 25 GB for this simulation. A parallel input/output (I/O) approach may be applied to reduce the possibility of potential memory overflow in processor 0 (Wong et al., 2015).

4 Conclusion

We have demonstrated the implementation of the hyperdual-step method in CMAQ version 5.3.2 to formulate CMAQ-hyd. This novel model retains the majority of the CMAQ code with slight modifications in the declarations of selected variables and the addition of sensitivity computation modules. The novel model can be applied to compute exact first-order sensitivities, second-order sensitivities, and cross-sensitivities of pollutant concentrations to precursor emissions efficiently and accurately with a single model run. Compared with traditional sensitivity analysis methods, CMAQ-hyd is computationally competitive with conventional methods and easier to maintain than other existing advanced methods (DDM and adjoint). The development process of CMAQ-hyd is also more straightforward than that of other advanced methods, since all that is needed is to change the type of newly declared variables to hyperdual.

We developed and validated the hyperdual-step module “HDMod”, which is limited to analytically verifiable mathematical operations of hyperdual numbers. This module can also be applied to other numerical models where first- and second-order sensitivities are of interest. We further evaluated the development of CMAQ-hyd against the FDM and the FDM-HYD hybrid method to ensure the correctness of the implementation. During the evaluation process, CMAQ-hyd demonstrated the ability to compute sensitivities free from truncation and subtractive cancellation errors, unlike those calculated by the FDM. HDMod can potentially be applied to other numerical models written in Fortran to produce first- and second-order sensitivities.

The computation of second-order sensitivities is crucial for researchers and environmental decision makers who need to decide the priority and extent of controls on specific types of emissions to reduce atmospheric pollutant concentrations. For instance, the second-order sensitivity of PM2.5 concentration to monoterpenes and α-pinene provided additional information about the relationships of emissions to concentrations in CMAQ. With the additional second-order sensitivity information, the curvature of the concentration responses to emissions changes improves the estimate of how a specific pollutant concentration would respond to changes in emissions. The simplicity of computing cross-sensitivities with CMAQ-hyd is another advantage of this augmented model. Cross-sensitivities are especially useful in nonlinear processes with two precursors. Knowledge of the synergistic effect of anthropogenic and biogenic emissions on aerosol concentrations (e.g. NOx and monoterpene on AMTNO3) is essential for researchers to predict the dynamics between two potential pollutants and for environmental decision makers to propose policy implementations under different climate scenarios in the future.

Although CMAQ-hyd remains computationally competitive with the traditional finite-difference method, it is still computationally intensive and has a memory overhead. We plan to implement optimizations for iterative processes in CMAQ and to apply the parallel I/O approach to reduce the memory overhead on the compute node where all the information is gathered. The implementation of checkpointing of sensitivities after specific subroutines is also a potential advantage of CMAQ-hyd and will provide valuable information on how each module or even each line of the model affects the sensitivities, akin to a process analysis approach. This checkpointing feature cannot be easily implemented with other methods such as FDM, DDM, and the adjoint method.

In conclusion, we have developed and evaluated CMAQ-hyd, a novel augmented model to compute first-order sensitivities, second-order sensitivities, and cross-sensitivities free from subtractive cancellation and truncation errors in CMAQ. Our successful implementation also provides an example of the hyperdual-step method that may be applicable for other CTMs where sensitivities are helpful.

Code and data availability

CMAQv5.3.2 is available at https://github.com/USEPA/CMAQ/tree/5.3.2 (last access: 1 June 2021) and is archived at https://doi.org/10.5281/zenodo.4081737 (US EPA Office of Research and Development, 2020). The CMAQ-hyd model is archived at https://doi.org/10.5281/zenodo.10119026 (Liu et al., 2023). Both the CMAQv5.3.2 and the CMAQ-hyd models are under MIT licences. The input data for the simulation experiments are available at https://doi.org/10.15139/S3/IQVABD (US EPA, 2019).

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/gmd-17-567-2024-supplement.

Author contributions

JL developed the model code and performed the simulations with help from SLC. EC helped develop and test the “HDMod” library. JL prepared the manuscript with guidance from SLC.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors would also like to thank Ryan P. Russell for kindly providing the testing framework for multicomplex numbers, which inspired the development of the testing framework for hyperdual numbers.

Financial support

The work is supported by National Science Foundation CAREER Award grant no. 1944669 to Shannon L. Capps.

Review statement

This paper was edited by Yilong Wang and reviewed by Jixiang Li, Jaroslav Resler, and one anonymous referee.

References

Berman, B., Capps, S. L., Sauvageau, I., Gao, E., Eastham, S. D., and Russell, R. P.: ISORROPIA-MCX: Enabling Sensitivity Analysis With Multicomplex Variables in the Aerosol Thermodynamic Model, ISORROPIA, Earth Space Sci., 10, e2022EA002729, https://doi.org/10.1029/2022EA002729, 2023. 

Boole, G.: A Treatise on the Calculus of Finite Difference, 2nd Edn., Dover, ISBN 048649523X, 1960. 

Byun, D. and Schere, K. L.: Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System, Appl. Mech. Rev., 59, 51–77, https://doi.org/10.1115/1.2128636, 2006. 

Capaldo, K. P., Pilinis, C., and Pandis, S. N.: A computationally efficient hybrid approach for dynamic gas/aerosol transfer in air quality models, Atmos. Environ., 34, 3617–3627, https://doi.org/10.1016/S1352-2310(00)00092-3, 2000. 

Che, W., Zheng, J., Wang, S., Zhong, L., and Lau, A.: Assessment of motor vehicle emission control policies using Model-3/CMAQ model for the Pearl River Delta region, China, Atmos. Environ., 45, 1740–1751, https://doi.org/10.1016/j.atmosenv.2010.12.050, 2011. 

Chemel, C., Fisher, B. E. A., Kong, X., Francis, X. V., Sokhi, R. S., Good, N., Collins, W. J., and Folberth, G. A.: Application of chemical transport model CMAQ to policy decisions regarding PM2.5 in the UK, Atmos. Environ., 82, 410–417, https://doi.org/10.1016/j.atmosenv.2013.10.001, 2014. 

Cohan, D. S. and Napelenok, S. L.: Air Quality Response Modeling for Decision Support, Atmosphere, 2, 407–425, https://doi.org/10.3390/atmos2030407, 2011. 

Cohen, A. and Shoham, M.: Application of Hyper-Dual Numbers to Multibody Kinematics, J. Mech. Robot., 8, 011015, https://doi.org/10.1115/1.4030588, 2015. 

Constantin, B. V. and Barrett, S. R. H.: Application of the complex step method to chemistry-transport modeling, Atmos. Environ., 99, 457–465, https://doi.org/10.1016/j.atmosenv.2014.10.017, 2014. 

Dunker, A. M.: Efficient calculation of sensitivity coefficients for complex atmospheric models, Atmos. Environ. (1967), 15, 1155–1161, https://doi.org/10.1016/0004-6981(81)90305-x, 1981. 

Fike, J. and Alonso, J.: The Development of Hyper-Dual Numbers for Exact Second-Derivative Calculations, 49th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition, 2011-01-04, https://doi.org/10.2514/6.2011-886, 2011. 

Fornberg, B.: Numerical Differentiation of Analytic Functions, ACM T. Math. Software, 7, 512–526, https://doi.org/10.1145/355972.355979, 1981. 

Fountoukis, C. and Nenes, A.: ISORROPIA II: a computationally efficient thermodynamic equilibrium model for K+Ca2+Mg2+NH4+Na+SO42-NO3-ClH2O aerosols, Atmos. Chem. Phys., 7, 4639–4659, https://doi.org/10.5194/acp-7-4639-2007, 2007. 

Hakami, A.: Nonlinearity in atmospheric response: A direct sensitivity analysis approach, J. Geophys. Res., 109, D15303, https://doi.org/10.1029/2003jd004502, 2004. 

Hakami, A., Odman, M. T., and Russell, A. G.: High-order, direct sensitivity analysis of multidimensional air quality models, Environ. Sci. Technol., 37, 2442–2452, https://doi.org/10.1021/es020677h, 2003. 

Hennigan, C. J., Izumi, J., Sullivan, A. P., Weber, R. J., and Nenes, A.: A critical evaluation of proxy methods used to estimate the acidity of atmospheric particles, Atmos. Chem. Phys., 15, 2775–2790, https://doi.org/10.5194/acp-15-2775-2015, 2015. 

Kwok, R. H. F., Napelenok, S. L., and Baker, K. R.: Implementation and evaluation of PM2.5 source contribution analysis in a photochemical model, Atmos. Environ., 80, 398–407, https://doi.org/10.1016/j.atmosenv.2013.08.017, 2013. 

Kwok, R. H. F., Baker, K. R., Napelenok, S. L., and Tonnesen, G. S.: Photochemical grid model implementation and application of VOC, NOx, and O3 source apportionment, Geosci. Model Dev., 8, 99–114, https://doi.org/10.5194/gmd-8-99-2015, 2015. 

Lantoine, G., Russell, R. P., and Dargent, T.: Using Multicomplex Variables for Automatic Computation of High-Order Derivatives, ACM T. Math. Software, 38, 1–21, https://doi.org/10.1145/2168773.2168774, 2012. 

Li, Q., Borge, R., Sarwar, G., de la Paz, D., Gantt, B., Domingo, J., Cuevas, C. A., and Saiz-Lopez, A.: Impact of halogen chemistry on summertime air quality in coastal and continental Europe: application of the CMAQ model and implications for regulation, Atmos. Chem. Phys., 19, 15321–15337, https://doi.org/10.5194/acp-19-15321-2019, 2019. 

Liu, J., Chen, E., and Capps, S. L.: CMAQv5.3.2-hyd (5.3.2-hyd1.0.1), Zenodo [code], https://doi.org/10.5281/zenodo.10119026, 2023. 

Liu, X.-H., Zhang, Y., Cheng, S.-H., Xing, J., Zhang, Q., Streets, D. G., Jang, C., Wang, W.-X., and Hao, J.-M.: Understanding of regional air pollution over China using CMAQ, part I performance evaluation and seasonal variation, Atmos. Environ., 44, 2415–2426, https://doi.org/10.1016/j.atmosenv.2010.03.035, 2010. 

Luecken, D. J., Yarwood, G., and Hutzell, W. T.: Multipollutant modeling of ozone, reactive nitrogen and HAPs across the continental US with CMAQ-CB6, Atmos. Environ., 201, 62–72, https://doi.org/10.1016/j.atmosenv.2018.11.060, 2019. 

Lyness, J. and Moler, C.: Numerical Differentiation of Analytic Functions, SIAM J. Numer. Anal., 4, 202–210, https://doi.org/10.1137/0704019, 1967. 

Mebust, M. R., Eder, B. K., Binkowski, F. S., and Roselle, S. J.: Models-3 Community Multiscale Air Quality (CMAQ) model aerosol component 2. Model evaluation, J. Geophys. Res.-Atmos., 108, 4184, https://doi.org/10.1029/2001jd001410, 2003. 

Murray, C. J. L., Aravkin, A. Y., Zheng, P., Abbafati, C., Abbas, K. M., Abbasi-Kangevari, M., Abd-Allah, F., Abdelalim, A., Abdollahi, M., Abdollahpour, I., Abegaz, K. H., Abolhassani, H., Aboyans, V., Abreu, L. G., Abrigo, M. R. M., Abualhasan, A., Abu-Raddad, L. J., Abushouk, A. I., Adabi, M., Adekanmbi, V., Adeoye, A. M., Adetokunboh, O. O., Adham, D., Advani, S. M., Agarwal, G., Aghamir, S. M. K., Agrawal, A., Ahmad, T., Ahmadi, K., Ahmadi, M., Ahmadieh, H., Ahmed, M. B., Akalu, T. Y., Akinyemi, R. O., Akinyemiju, T., Akombi, B., Akunna, C. J., Alahdab, F., Al-Aly, Z., Alam, K., Alam, S., Alam, T., Alanezi, F. M., Alanzi, T. M., Alemu, B. W., Alhabib, K. F., Ali, M., Ali, S., Alicandro, G., Alinia, C., Alipour, V., Alizade, H., Aljunid, S. M., Alla, F., Allebeck, P., Almasi-Hashiani, A., Al-Mekhlafi, H. M., Alonso, J., Altirkawi, K. A., Amini-Rarani, M., Amiri, F., Amugsi, D. A., Ancuceanu, R., Anderlini, D., Anderson, J. A., Andrei, C. L., Andrei, T., Angus, C., Anjomshoa, M., Ansari, F., Ansari-Moghaddam, A., Antonazzo, I. C., Antonio, C. A. T., Antony, C. M., Antriyandarti, E., Anvari, D., Anwer, R., Appiah, S. C. Y., Arabloo, J., Arab-Zozani, M., Ariani, F., Armoon, B., Ärnlöv, J., Arzani, A., Asadi-Aliabadi, M., Asadi-Pooya, A. A., Ashbaugh, C., Assmus, M., Atafar, Z., Atnafu, D. D., Atout, M. M. D. W., Ausloos, F., Ausloos, M., Ayala Quintanilla, B. P., Ayano, G., Ayanore, M. A., Azari, S., Azarian, G., Azene, Z. N., Badawi, A., Badiye, A. D., Bahrami, M. A., Bakhshaei, M. H., Bakhtiari, A., Bakkannavar, S. M., Baldasseroni, A., Ball, K., Ballew, S. H., Balzi, D., Banach, M., Banerjee, S. K., Bante, A. B., Baraki, A. G., Barker-Collo, S. L., Bärnighausen, T. W., Barrero, L. H., Barthelemy, C. M., Barua, L., Basu, S., Baune, B. T., Bayati, M., Becker, J. S., Bedi, N., Beghi, E., Béjot, Y., Bell, M. L., Bennitt, F. B., Bensenor, I. M., Berhe, K., Berman, A. E., Bhagavathula, A. S., Bhageerathy, R., Bhala, N., Bhandari, D., Bhattacharyya, K., Bhutta, Z. A., Bijani, A., Bikbov, B., Bin Sayeed, M. S., Biondi, A., Birihane, B. M., Bisignano, C., Biswas, R. K., Bitew, H., Bohlouli, S., Bohluli, M., Boon-Dooley, A. S., Borges, G., Borzì, A. M., Borzouei, S., Bosetti, C., Boufous, S., Braithwaite, D., Breitborde, N. J. K., Breitner, S., Brenner, H., Briant, P. S., Briko, A. N., Briko, N. I., Britton, G. B., Bryazka, D., Bumgarner, B. R., Burkart, K., Burnett, R. T., Burugina Nagaraja, S., Butt, Z. A., Caetano Dos Santos, F. L., Cahill, L. E., Cámera, L. L. A., Campos-Nonato, I. R., Cárdenas, R., Carreras, G., Carrero, J. J., Carvalho, F., Castaldelli-Maia, J. M., Castañeda-Orjuela, C. A., Castelpietra, G., Castro, F., Causey, K., Cederroth, C. R., Cercy, K. M., Cerin, E., Chandan, J. S., Chang, K.-L., Charlson, F. J., Chattu, V. K., Chaturvedi, S., Cherbuin, N., Chimed-Ochir, O., Cho, D. Y., Choi, J.-Y. J., Christensen, H., Chu, D.-T., Chung, M. T., Chung, S.-C., Cicuttini, F. M., Ciobanu, L. G., Cirillo, M., Classen, T. K. D., Cohen, A. J., Compton, K., Cooper, O. R., Costa, V. M., Cousin, E., Cowden, R. G., Cross, D. H., Cruz, J. A., Dahlawi, S. M. A., Damasceno, A. A. M., Damiani, G., Dandona, L., Dandona, R., Dangel, W. J., Danielsson, A.-K., Dargan, P. I., Darwesh, A. M., Daryani, A., Das, J. K., Das Gupta, R., Das Neves, J., Dávila-Cervantes, C. A., Davitoiu, D. V., De Leo, D., Degenhardt, L., Delang, M., Dellavalle, R. P., Demeke, F. M., Demoz, G. T., Demsie, D. G., Denova-Gutiérrez, E., Dervenis, N., Dhungana, G. P., Dianatinasab, M., Dias Da Silva, D., Diaz, D., Dibaji Forooshani, Z. S., Djalalinia, S., Do, H. T., Dokova, K., Dorostkar, F., Doshmangir, L., Driscoll, T. R., Duncan, B. B., Duraes, A. R., Eagan, A. W., Edvardsson, D., El Nahas, N., El Sayed, I., El Tantawi, M., Elbarazi, I., Elgendy, I. Y., El-Jaafary, S. I., Elyazar, I. R., Emmons-Bell, S., Erskine, H. E., Eskandarieh, S., Esmaeilnejad, S., Esteghamati, A., Estep, K., Etemadi, A., Etisso, A. E., Fanzo, J., Farahmand, M., Fareed, M., Faridnia, R., Farioli, A., Faro, A., Faruque, M., Farzadfar, F., Fattahi, N., Fazlzadeh, M., Feigin, V. L., Feldman, R., Fereshtehnejad, S.-M., Fernandes, E., Ferrara, G., Ferrari, A. J., Ferreira, M. L., Filip, I., Fischer, F., Fisher, J. L., Flor, L. S., Foigt, N. A., Folayan, M. O., Fomenkov, A. A., Force, L. M., Foroutan, M., Franklin, R. C., Freitas, M., Fu, W., Fukumoto, T., Furtado, J. M., Gad, M. M., Gakidou, E., Gallus, S., Garcia-Basteiro, A. L., Gardner, W. M., Geberemariyam, B. S., Gebreslassie, A. A. A. A., Geremew, A., Gershberg Hayoon, A., Gething, P. W., Ghadimi, M., Ghadiri, K., Ghaffarifar, F., Ghafourifard, M., Ghamari, F., Ghashghaee, A., Ghiasvand, H., Ghith, N., Gholamian, A., Ghosh, R., Gill, P. S., Ginindza, T. G. G., Giussani, G., Gnedovskaya, E. V., Goharinezhad, S., Gopalani, S. V., Gorini, G., Goudarzi, H., Goulart, A. C., Greaves, F., Grivna, M., Grosso, G., Gubari, M. I. M., Gugnani, H. C., Guimarães, R. A., Guled, R. A., Guo, G., Guo, Y., Gupta, R., Gupta, T., Haddock, B., Hafezi-Nejad, N., Hafiz, A., Haj-Mirzaian, A., Haj-Mirzaian, A., Hall, B. J., Halvaei, I., Hamadeh, R. R., Hamidi, S., Hammer, M. S., Hankey, G. J., Haririan, H., Haro, J. M., Hasaballah, A. I., Hasan, M. M., Hasanpoor, E., Hashi, A., Hassanipour, S., Hassankhani, H., Havmoeller, R. J., Hay, S. I., Hayat, K., Heidari, G., Heidari-Soureshjani, R., Henrikson, H. J., Herbert, M. E., Herteliu, C., Heydarpour, F., Hird, T. R., Hoek, H. W., Holla, R., Hoogar, P., Hosgood, H. D., Hossain, N., Hosseini, M., Hosseinzadeh, M., Hostiuc, M., Hostiuc, S., Househ, M., Hsairi, M., Hsieh, V. C.-R., Hu, G., Hu, K., Huda, T. M., Humayun, A., Huynh, C. K., Hwang, B.-F., Iannucci, V. C., Ibitoye, S. E., Ikeda, N., Ikuta, K. S., Ilesanmi, O. S., Ilic, I. M., Ilic, M. D., Inbaraj, L. R., Ippolito, H., Iqbal, U., Irvani, S. S. N., Irvine, C. M. S., Islam, M. M., Islam, S. M. S., Iso, H., Ivers, R. Q., Iwu, C. C. D., Iwu, C. J., Iyamu, I. O., Jaafari, J., Jacobsen, K. H., Jafari, H., Jafarinia, M., Jahani, M. A., Jakovljevic, M., Jalilian, F., James, S. L., Janjani, H., Javaheri, T., Javidnia, J., Jeemon, P., Jenabi, E., Jha, R. P., Jha, V., Ji, J. S., Johansson, L., John, O., John-Akinola, Y. O., Johnson, C. O., Jonas, J. B., Joukar, F., Jozwiak, J. J., Jürisson, M., Kabir, A., Kabir, Z., Kalani, H., Kalani, R., Kalankesh, L. R., Kalhor, R., Kanchan, T., Kapoor, N., Karami Matin, B., Karch, A., Karim, M. A., Kassa, G. M., Katikireddi, S. V., Kayode, G. A., Kazemi Karyani, A., Keiyoro, P. N., Keller, C., Kemmer, L., Kendrick, P. J., Khalid, N., Khammarnia, M., Khan, E. A., Khan, M., Khatab, K., Khater, M. M., Khatib, M. N., Khayamzadeh, M., Khazaei, S., Kieling, C., Kim, Y. J., Kimokoti, R. W., Kisa, A., Kisa, S., Kivimäki, M., Knibbs, L. D., Knudsen, A. K. S., Kocarnik, J. M., Kochhar, S., Kopec, J. A., Korshunov, V. A., Koul, P. A., Koyanagi, A., Kraemer, M. U. G., Krishan, K., Krohn, K. J., Kromhout, H., Kuate Defo, B., Kumar, G. A., Kumar, V., Kurmi, O. P., Kusuma, D., La Vecchia, C., Lacey, B., Lal, D. K., Lalloo, R., Lallukka, T., Lami, F. H., Landires, I., Lang, J. J., Langan, S. M., Larsson, A. O., Lasrado, S., Lauriola, P., Lazarus, J. V., Lee, P. H., Lee, S. W. H., Legrand, K. E., Leigh, J., Leonardi, M., Lescinsky, H., Leung, J., Levi, M., Li, S., Lim, L.-L., Linn, S., Liu, S., Liu, S., Liu, Y., Lo, J., Lopez, A. D., Lopez, J. C. F., Lopukhov, P. D., Lorkowski, S., Lotufo, P. A., Lu, A., Lugo, A., Maddison, E. R., Mahasha, P. W., Mahdavi, M. M., Mahmoudi, M., Majeed, A., Maleki, A., Maleki, S., Malekzadeh, R., Malta, D. C., Mamun, A. A., Manda, A. L., Manguerra, H., Mansour-Ghanaei, F., Mansouri, B., Mansournia, M. A., Mantilla Herrera, A. M., Maravilla, J. C., Marks, A., Martin, R. V., Martini, S., Martins-Melo, F. R., Masaka, A., Masoumi, S. Z., Mathur, M. R., Matsushita, K., Maulik, P. K., McAlinden, C., McGrath, J. J., McKee, M., Mehndiratta, M. M., Mehri, F., Mehta, K. M., Memish, Z. A., Mendoza, W., Menezes, R. G., Mengesha, E. W., Mereke, A., Mereta, S. T., Meretoja, A., Meretoja, T. J., Mestrovic, T., Miazgowski, B., Miazgowski, T., Michalek, I. M., Miller, T. R., Mills, E. J., Mini, G., Miri, M., Mirica, A., Mirrakhimov, E. M., Mirzaei, H., Mirzaei, M., Mirzaei, R., Mirzaei-Alavijeh, M., Misganaw, A. T., Mithra, P., Moazen, B., Mohammad, D. K., Mohammad, Y., Mohammad Gholi Mezerji, N., Mohammadian-Hafshejani, A., Mohammadifard, N., Mohammadpourhodki, R., Mohammed, A. S., Mohammed, H., Mohammed, J. A., Mohammed, S., Mokdad, A. H., Molokhia, M., Monasta, L., Mooney, M. D., Moradi, G., Moradi, M., Moradi-Lakeh, M., Moradzadeh, R., Moraga, P., Morawska, L., Morgado-Da-Costa, J., Morrison, S. D., Mosapour, A., Mosser, J. F., Mouodi, S., Mousavi, S. M., Mousavi Khaneghah, A., Mueller, U. O., Mukhopadhyay, S., Mullany, E. C., Musa, K. I., Muthupandian, S., Nabhan, A. F., Naderi, M., Nagarajan, A. J., Nagel, G., Naghavi, M., Naghshtabrizi, B., Naimzada, M. D., Najafi, F., Nangia, V., Nansseu, J. R., Naserbakht, M., Nayak, V. C., Negoi, I., Ngunjiri, J. W., Nguyen, C. T., Nguyen, H. L. T., Nguyen, M., Nigatu, Y. T., Nikbakhsh, R., Nixon, M. R., Nnaji, C. A., Nomura, S., Norrving, B., Noubiap, J. J., Nowak, C., Nunez-Samudio, V., Oþoiu, A., Oancea, B., Odell, C. M., Ogbo, F. A., Oh, I.-H., Okunga, E. W., Oladnabi, M., Olagunju, A. T., Olusanya, B. O., Olusanya, J. O., Omer, M. O., Ong, K. L., Onwujekwe, O. E., Orpana, H. M., Ortiz, A., Osarenotor, O., Osei, F. B., Ostroff, S. M., Otstavnov, N., Otstavnov, S. S., Øverland, S., Owolabi, M. O., P A, M., Padubidri, J. R., Palladino, R., Panda-Jonas, S., Pandey, A., Parry, C. D. H., Pasovic, M., Pasupula, D. K., Patel, S. K., Pathak, M., Patten, S. B., Patton, G. C., Pazoki Toroudi, H., Peden, A. E., Pennini, A., Pepito, V. C. F., Peprah, E. K., Pereira, D. M., Pesudovs, K., Pham, H. Q., Phillips, M. R., Piccinelli, C., Pilz, T. M., Piradov, M. A., Pirsaheb, M., Plass, D., Polinder, S., Polkinghorne, K. R., Pond, C. D., Postma, M. J., Pourjafar, H., Pourmalek, F., Poznañska, A., Prada, S. I., Prakash, V., Pribadi, D. R. A., Pupillo, E., Quazi Syed, Z., Rabiee, M., Rabiee, N., Radfar, A., Rafiee, A., Raggi, A., Rahman, M. A., Rajabpour-Sanati, A., Rajati, F., Rakovac, I., Ram, P., Ramezanzadeh, K., Ranabhat, C. L., Rao, P. C., Rao, S. J., Rashedi, V., Rathi, P., Rawaf, D. L., Rawaf, S., Rawal, L., Rawassizadeh, R., Rawat, R., Razo, C., Redford, S. B., Reiner, R. C., Reitsma, M. B., Remuzzi, G., Renjith, V., Renzaho, A. M. N., Resnikoff, S., Rezaei, N., Rezaei, N., Rezapour, A., Rhinehart, P.-A., Riahi, S. M., Ribeiro, D. C., Ribeiro, D., Rickard, J., Rivera, J. A., Roberts, N. L. S., Rodríguez-Ramírez, S., Roever, L., Ronfani, L., Room, R., Roshandel, G., Roth, G. A., Rothenbacher, D., Rubagotti, E., Rwegerera, G. M., Sabour, S., Sachdev, P. S., Saddik, B., Sadeghi, E., Sadeghi, M., Saeedi, R., Saeedi Moghaddam, S., Safari, Y., Safi, S., Safiri, S., Sagar, R., Sahebkar, A., Sajadi, S. M., Salam, N., Salamati, P., Salem, H., Salem, M. R. R., Salimzadeh, H., Salman, O. M., Salomon, J. A., Samad, Z., Samadi Kafil, H., Sambala, E. Z., Samy, A. M., Sanabria, J., Sánchez-Pimienta, T. G., Santomauro, D. F., Santos, I. S., Santos, J. V., Santric-Milicevic, M. M., Saraswathy, S. Y. I., Sarmiento-Suárez, R., Sarrafzadegan, N., Sartorius, B., Sarveazad, A., Sathian, B., Sathish, T., Sattin, D., Saxena, S., Schaeffer, L. E., Schiavolin, S., Schlaich, M. P., Schmidt, M. I., Schutte, A. E., Schwebel, D. C., Schwendicke, F., Senbeta, A. M., Senthilkumaran, S., Sepanlou, S. G., Serdar, B., Serre, M. L., Shadid, J., Shafaat, O., Shahabi, S., Shaheen, A. A., Shaikh, M. A., Shalash, A. S., Shams-Beyranvand, M., Shamsizadeh, M., Sharafi, K., Sheikh, A., Sheikhtaheri, A., Shibuya, K., Shield, K. D., Shigematsu, M., Shin, J. I., Shin, M.-J., Shiri, R., Shirkoohi, R., Shuval, K., Siabani, S., Sierpinski, R., Sigfusdottir, I. D., Sigurvinsdottir, R., Silva, J. P., Simpson, K. E., Singh, J. A., Singh, P., Skiadaresi, E., Skou, S. T., Skryabin, V. Y., Smith, E. U. R., Soheili, A., Soltani, S., Soofi, M., Sorensen, R. J. D., Soriano, J. B., Sorrie, M. B., Soshnikov, S., Soyiri, I. N., Spencer, C. N., Spotin, A., Sreeramareddy, C. T., Srinivasan, V., Stanaway, J. D., Stein, C., Stein, D. J., Steiner, C., Stockfelt, L., Stokes, M. A., Straif, K., Stubbs, J. L., Sufiyan, M. A. B., Suleria, H. A. R., Suliankatchi Abdulkader, R., Sulo, G., Sultan, I., Szumowski, L., Tabarés-Seisdedos, R., Tabb, K. M., Tabuchi, T., Taherkhani, A., Tajdini, M., Takahashi, K., Takala, J. S., Tamiru, A. T., Taveira, N., Tehrani-Banihashemi, A., Temsah, M.-H., Tesema, G. A., Tessema, Z. T., Thurston, G. D., Titova, M. V., Tohidinik, H. R., Tonelli, M., Topor-Madry, R., Topouzis, F., Torre, A. E., Touvier, M., Tovani-Palone, M. R. R., Tran, B. X., Travillian, R., Tsatsakis, A., Tudor Car, L., Tyrovolas, S., Uddin, R., Umeokonkwo, C. D., Unnikrishnan, B., Upadhyay, E., Vacante, M., Valdez, P. R., Van Donkelaar, A., Vasankari, T. J., Vasseghian, Y., Veisani, Y., Venketasubramanian, N., Violante, F. S., Vlassov, V., Vollset, S. E., Vos, T., Vukovic, R., Waheed, Y., Wallin, M. T., Wang, Y., Wang, Y.-P., Watson, A., Wei, J., Wei, M. Y. W., Weintraub, R. G., Weiss, J., Werdecker, A., West, J. J., Westerman, R., Whisnant, J. L., Whiteford, H. A., Wiens, K. E., Wolfe, C. D. A., Wozniak, S. S., Wu, A.-M., Wu, J., Wulf Hanson, S., Xu, G., Xu, R., Yadgir, S., Yahyazadeh Jabbari, S. H., Yamagishi, K., Yaminfirooz, M., Yano, Y., Yaya, S., Yazdi-Feyzabadi, V., Yeheyis, T. Y., Yilgwan, C. S. , Yilma, M. T., Yip, P., Yonemoto, N., Younis, M. Z., Younker, T. P., Yousefi, B., Yousefi, Z., Yousefinezhadi, T., Yousuf, A. Y., Yu, C., Yusefzadeh, H., Zahirian Moghadam, T., Zamani, M., Zamanian, M., Zandian, H., Zastrozhin, M. S., Zhang, Y., Zhang, Z.-J., Zhao, J. T., Zhao, X.-J. G., Zhao, Y., Zhou, M., Ziapour, A., Zimsen, S. R. M., Brauer, M., Afshin, A., and Lim, S. S.: Global burden of 87 risk factors in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019, Lancet, 396, 1223–1249, https://doi.org/10.1016/s0140-6736(20)30752-2, 2020. 

Nenes, A., Pandis, S. N., and Pilinis, C.: ISORROPIA: A New Thermodynamic Equilibrium Model for Multiphase Multicomponent Inorganic Aerosols, Aquat. Geochem., 4, 123–152, https://doi.org/10.1023/a:1009604003981, 1998. 

Ng, N. L., Kwan, A. J., Surratt, J. D., Chan, A. W. H., Chhabra, P. S., Sorooshian, A., Pye, H. O. T., Crounse, J. D., Wennberg, P. O., Flagan, R. C., and Seinfeld, J. H.: Secondary organic aerosol (SOA) formation from reaction of isoprene with nitrate radicals (NO3), Atmos. Chem. Phys., 8, 4117–4140, https://doi.org/10.5194/acp-8-4117-2008, 2008. 

Pellegrini, E. and Russell, R. P.: On the Computation and Accuracy of Trajectory State Transition Matrices, J. Guid. Control Dynam., 39, 2485–2499, https://doi.org/10.2514/1.G001920, 2016. 

Pilinis, C., Capaldo, K. P., Nenes, A., and Pandis, S. N.: MADM-A New Multicomponent Aerosol Dynamics Model, Aerosol Sci. Tech., 32, 482–502, https://doi.org/10.1080/027868200303597, 2000. 

Rehner, P. and Bauer, G.: Application of Generalized (Hyper-) Dual Numbers in Equation of State Modeling, Front. Chem. Eng., 3, ISSN 2673-2718, https://doi.org/10.3389/fceng.2021.758090, 2021. 

Ring, A. M., Canty, T. P., Anderson, D. C., Vinciguerra, T. P., He, H., Goldberg, D. L., Ehrman, S. H., Dickerson, R. R., and Salawitch, R. J.: Evaluating commercial marine emissions and their role in air quality policy using observations and the CMAQ model, Atmos. Environ., 173, 96–107, https://doi.org/10.1016/j.atmosenv.2017.10.037, 2018. 

Sandu, A., Daescu, D. N., Carmichael, G. R., and Chai, T.: Adjoint sensitivity analysis of regional air quality models, J. Comput. Phy., 204, 222–252, 2005. 

Sareen, N., Carlton, A. G., Surratt, J. D., Gold, A., Lee, B., Lopez-Hilfiker, F. D., Mohr, C., Thornton, J. A., Zhang, Z., Lim, Y. B., and Turpin, B. J.: Identifying precursors and aqueous organic aerosol formation pathways during the SOAS campaign, Atmos. Chem. Phys., 16, 14409–14420, https://doi.org/10.5194/acp-16-14409-2016, 2016. 

Sayeed, A., Lops, Y., Choi, Y., Jung, J., and Salman, A. K.: Bias correcting and extending the PM forecast by CMAQ up to 7 d using deep convolutional neural networks, Atmos. Environ., 253, 118376, https://doi.org/10.1016/j.atmosenv.2021.118376, 2021. 

Seinfeld, J. H. and Pandis, S. N.: Atmospheric Chemistry and Physics: From Air Pollution to Climate Change, 3rd Edn., John Wiley & Sons, Hoboken, New Jersey, 2016. 

Squire, W. and Trapp, G.: Using Complex Variables to Estimate Derivatives of Real Functions, SIAM Rev., 40, 110–112, https://doi.org/10.1137/s003614459631241x, 1998. 

Tanaka, M., Sasagawa, T., Omote, R., Fujikawa, M., Balzani, D., and Schröder, J.: A highly accurate 1st- and 2nd-order differentiation scheme for hyperelastic material models based on hyper-dual numbers, Comp. Meth. Appl. Mech. Eng., 283, 22–45, doi10.1016/j.cma.2014.08.020, 2015. 

Tian, D., Cohan, D. S., Napelenok, S., Bergin, M., Hu, Y., Chang, M., and Russell, A. G.: Uncertainty Analysis of Ozone Formation and Response to Emission Controls Using Higher-Order Sensitivities, J. Air Waste Manage. Assoc., 60, 797–804, https://doi.org/10.3155/1047-3289.60.7.797, 2010. 

US EPA: CMAQ Model Version 5.3, 5.3.1, 5.3.2, 5.3.3 Input Data – 7/1/2016–7/14/2016 12 km Southeast US (V1), UNC Dataverse [data set], https://doi.org/10.15139/S3/IQVABD, 2019. 

US EPA Office of Research and Development: CMAQ (Version 5.3.2), Zenodo [code], https://doi.org/10.5281/zenodo.4081737, 2020.  

Wong, D. C., Yang, C. E., Fu, J. S., Wong, K., and Gao, Y.: An approach to enhance pnetCDF performance in environmental modeling applications, Geosci. Model Dev., 8, 1033–1046, https://doi.org/10.5194/gmd-8-1033-2015, 2015. 

Xu, L., Pye, H. O. T., He, J., Chen, Y., Murphy, B. N., and Ng, N. L.: Experimental and model estimates of the contributions from biogenic monoterpenes and sesquiterpenes to secondary organic aerosol in the southeastern United States, Atmos. Chem. Phys., 18, 12613–12637, https://doi.org/10.5194/acp-18-12613-2018, 2018. 

Yang, Y., Wilkinson, J. G., and Russell, A. G.: Fast, Direct Sensitivity Analysis of Multidimensional Photochemical Models, Environ. Sci. Technol., 31, 2859–2868, https://doi.org/10.1021/es970117w, 1997. 

Zhang, W., Capps, S. L., Hu, Y., Nenes, A., Napelenok, S. L., and Russell, A. G.: Development of the high-order decoupled direct method in three dimensions for particulate matter: enabling advanced sensitivity analysis in air quality models, Geosci. Model Dev., 5, 355–368, https://doi.org/10.5194/gmd-5-355-2012, 2012. 

Zhao, S., Russell, M. G., Hakami, A., Capps, S. L., Turner, M. D., Henze, D. K., Percell, P. B., Resler, J., Shen, H., Russell, A. G., Nenes, A., Pappin, A. J., Napelenok, S. L., Bash, J. O., Fahey, K. M., Carmichael, G. R., Stanier, C. O., and Chai, T.: A multiphase CMAQ version 5.0 adjoint, Geosci. Model Dev., 13, 2925–2944, https://doi.org/10.5194/gmd-13-2925-2020, 2020. 

Zhu, S., Horne, J. R., Montoya-Aguilera, J., Hinks, M. L., Nizkorodov, S. A., and Dabdub, D.: Modeling reactive ammonia uptake by secondary organic aerosol in CMAQ: application to the continental US, Atmos. Chem. Phys., 18, 3641–3657, https://doi.org/10.5194/acp-18-3641-2018, 2018. 

Download
Short summary
Air pollution harms human life and ecosystems, but its sources are complex. Scientists and policy makers use air pollution models to advance knowledge and inform control strategies. We implemented a recently developed numeral system to relate any set of model inputs, like pollutant emissions from a given activity, to all model outputs, like concentrations of pollutants harming human health. This approach will be straightforward to update when scientists discover new processes in the atmosphere.