FastCTM (v1.0): Atmospheric chemical transport modelling with a principle-informed neural network for air quality simulations

Lyu, Baolei; Huang, Ran; Wang, Xinlu; Wang, Weiguo; Hu, Yongtao

doi:https://doi.org/10.5194/gmd-18-6295-2025

Articles | Volume 18, issue 18

https://doi.org/10.5194/gmd-18-6295-2025

Articles | Volume 18, issue 18

Development and technical paper

25 Sep 2025

Development and technical paper |

| 25 Sep 2025

FastCTM (v1.0): Atmospheric chemical transport modelling with a principle-informed neural network for air quality simulations

Baolei Lyu, Ran Huang, Xinlu Wang, Weiguo Wang, and Yongtao Hu

Abstract

Chemical-transport models (CTMs) are indispensable for air-quality assessment and policy development, yet their operational use is hampered by high computational cost. We present FastCTM, a physics-informed neural-network emulator that rapidly predicts hourly concentrations of ten key pollutant variables: major PM_2.5 species (SO $_{4}^{2 -}$ , NO $_{3}^{-}$ , NH $_{4}^{-}$ , organic matter, elemental carbon, crustal material), coarse PM₁₀, SO₂, NO₂, CO, and O₃. FastCTM embeds five process-specific neural modules – primary emissions, horizontal transport, turbulent diffusion, chemical reactions and deposition within a unified framework. Given 1 h initial condition data, FastCTM can simulate future 24 h concentrations for ten air pollutants using corresponding meteorological fields and emissions as input. Trained on 2018–2022 WRF-CMAQ forecasts over China and evaluated on 2023 data, FastCTM reproduces CMAQ with mean RMSE (µg m⁻³) of 9.1, 11.9, 4.4, 4.0, 48.9, 10.9 and R² of 0.80, 0.81, 0.80, 0.83, 0.90, 0.70 for PM_2.5, PM₁₀, SO₂, NO₂, CO and O₃, respectively. Sensitivity tests confirm physically plausible responses to temperature, wind speed, boundary-layer height and precursor emissions. The modular architecture enables quantitative process analysis, offering CTM-like insight at GPU-accelerated speeds. In a nutshell, FastCTM provides a computationally efficient solution for air-quality simulations, sensitivity analysis, and process attribution with high accuracy and physical consistency.

Download & links

Article (PDF, 7242 KB)

Supplement (10949 KB)

Download & links

Article (7242 KB)
Full-text XML
Supplement (10949 KB)
BibTeX
EndNote

How to cite.

Received: 26 Oct 2024 – Discussion started: 20 Nov 2024 – Revised: 07 Aug 2025 – Accepted: 08 Aug 2025 – Published: 25 Sep 2025

1 Introduction

Effective air quality management requires accurate characterization of current and future pollution conditions to implement targeted emission control measures (Wang et al., 2010; Council, 2004). Driven by this demand, deterministic air quality numerical models have been developed to simulate the spatiotemporal variability and evolution of ambient air pollutants in the atmosphere (Hakami et al., 2003; Eder et al., 2006). In these models, such as the Community Multiscale Air Quality (CMAQ) model, atmospheric physical and chemical processes (e.g., emissions, chemical reactions, horizontal advection, and diffusion) are mathematically represented by partial differential equations. The air pollutant and species concentrations can then be calculated by solving these complex equations using numerical methods (Byun and Schere, 2006), which is often time-consuming (Leal et al., 2017) and requires substantial computational resources such as high-performance computing (Efstathiou et al., 2024).

Deep learning offers promising alternatives for developing rapid, data-driven CTMs by leveraging the capacity of neural networks to encode complex spatiotemporal patterns from large datasets (LeCun et al., 2015; He et al., 2016; Liao et al., 2020). These deep learning-based CTM models are expected to provide accurate simulations that are comparable to the current deterministic numerical CTMs while offering much higher computational efficiency and enhanced learning capabilities. However, progress has been hindered by challenges in designing neural architectures that simultaneously achieve high accuracy, interpretability, and long-term simulation stability and fidelity (Reichstein et al., 2019; Irrgang et al., 2021). In constructing deep learning-based CTM models, air quality modeling is often formulated as a sequence-to-sequence prediction problem (Shi et al., 2015; Zhang et al., 2024) to capture the spatiotemporal correlations among multiple variables. Consequently, previous studies have mainly focused on refining neural network's representation capabilities by proposing new neural-network structures to improve error back-propagation efficiencies and model encoding capabilities (Wang et al., 2018; Huang et al., 2021; Mao et al., 2021). For example, Xing et al. (2022) developed a deep learning-based module named deepCTM that mimics atmospheric photochemical modeling to simulate ozone concentrations. However, these deep learning-based CTMs are often structured as uninterpretable black-box models that generate simulations reflecting the cumulative effect of all physical and chemical processes. Such black-box models hinder error attribution, inspection of internal processes, and knowledge discovery (Reichstein et al., 2019).

Quantifying individual atmospheric processes enables a mechanistic interpretation of model predictions and identification of error sources (Liu et al., 2010). Motivated by this need, recent studies have developed models that learn specific atmospheric processes, such as chemical reactions and deposition, within CTM frameworks. Kelp et al. (2022) developed a neural network chemical solver for stable long-term global simulations of atmospheric chemistry, trained from the GEOS-Chem model. Xia et al. (2025) simulated 74 chemical species and 229 reactions following the SAPRC-99 mechanism using an artificial intelligence photochemistry (AIPC) scheme, achieving approximately 8-fold speed-up. Sturm and Wexler (2020) developed a mass- and energy-conserving framework for using machine learning to accelerate computations, demonstrating successful application in a photochemistry example. For the deposition process, Silva et al. (2019) proposed a deep learning parameterization for ozone dry deposition velocities that provided accurate predictions on independent new datasets, revealing the potential of neural networks to capture complex spatio-temporal latent processes. Liu et al. (2025) proposed a Neural Network Emulator, named ChemNNE, for rapid chemical concentration modelling, which achieved strong performance in both accuracy and efficiency. Although these successes, a gap remains in coupling these NN operators into a complete deep-learning CTM.

The main objective of our study is to develop and validate a principles-guided, neural network-based FastCTM, capable of simulating spatial-temporal fields of hourly concentrations of 10 criteria pollutants, including major species of PM_2.5 (SO $_{4}^{2 -}$ , NO $_{3}^{-}$ , NH $_{4}^{+}$ , organic matters and other inorganic components, coarse part in PM₁₀, CO, NO₂, SO_2, and O₃. FastCTM models individual atmospheric process: transport, diffusion, deposition, chemical reactions, and emissions. FastCTM is capable of performing analysis of internal chemical and physical processes, offering benefits like high computational speed, efficient data assimilation, and rapid model updates.

2 Data and Methods

2.1 Parent Model Simulations and Datasets

In this study, the FastCTM model was designed to replicate the parent model CMAQ, trained by learning CMAQ's underlying physical and chemical processes among multiple air pollutants, including the complicated chemical reaction, transport, diffusion, and deposition. CMAQ has a process analysis (PA) tool to separate out and quantify the contributions of individual physical and chemical processes to the changes in the predicted concentrations of a pollutant, which provides the opportunity to conduct a sensitivity analysis by comparing process contributions between CMAQ and FastCTM.

Weather and air quality simulations from 2018 to 2023 were conducted using a WRF-CMAQ modeling system consisting of three major components: (1) the meteorology component, the Weather Research and Forecasting model, WRF v3.4.1 (Michalakes et al., 2005; Skamarock et al., 2008), which provides meteorological fields; provides meteorological fields, (2) the emission component, which supplies gridded estimates of hourly emission rates for primary pollutants matched to model species, and (3) the CTM component, CMAQ v5.0.2 (Byun and Schere, 2006), which solves the governing physical and chemical equations to obtain 3-D pollutant concentration fields. WRF-CMAQ simulations are not two-way coupled, so weather and chemistry do not influence each other. We used hourly average concentrations of dominant PM_2.5 components of sulfate (SO₄), nitrate (NO₃), ammonium (NH₄), organic carbon (OC), and other components (EC and soil, etc.), and CO, SO₂, NO_2, and O₃ in the surface layer. The 10 species were selected based on their direct relevance to regulatory standards (e.g., PM_2.5, PM₁₀, O₃, NO₂, SO₂, and CO) and their dominance in driving health and environmental impacts in urban and industrial regions.

Meteorological variables used in this study include relative humidity (RH), air temperature (T), wind components (U, V) at surface 10 m height, precipitation (RN), cloud fraction (CFRAC), and planetary boundary layer height (PBLH). Wind speed (WS) was calculated from U and V. The data covered the whole of China at a horizontal resolution of 12 km with 372 × 426 grid cells. The simulation data from 2018–2022 are used as the training dataset, while the remaining simulation data in 2023 is used for independent evaluation. The surface topographic data (HGT, Fig. S1 in the Supplement, obtained from https://lta.cr.usgs.gov/GTOPO30, last access: 28 July 2023) and land cover data (Zhang et al., 2021) of urban and tree fraction (LULC) are also used to reflect the effects of land surface conditions in this study.

The original primary emissions used in the aforementioned WRF-CMAQ modelling system are used as input to the FastCTM. The large amount of emission data is grouped according to the simulated 10 pollutant variables. Specifically, the primary PM_2.5 emissions of SO₄, NO₃, NH₄, OC, and other components, and gaseous emissions including sulfur oxide (SO₂), nitrogen oxides (NO_x, including HONO, NO, and NO₂), ammonia (NH₃), volatile organic species (VOCs, including isoprene (ISOP), terpene (TERP), and other species of VOC) are used in the FastCTM. Annual average emissions of NO_x, SO₂, and VOC are respectively depicted in Figs. S2–S4 in the Supplement.

2.2 FastCTM Model Formulations

2.2.1 FastCTM Model Framework

The deterministic CTM models simulate emissions, transport, deposition, diffusion, and chemical transformations of gases and particles in the troposphere through numerically solving the governing equations as follows,

\begin{matrix} (1) & \frac{\partial C_{i}}{\partial t} = - \nabla \cdot (u C_{i}) + \nabla (K \nabla C_{i}) + R_{i} + E_{i} + D_{i} \end{matrix}

where C_i is the concentration of species i, u is the air fluid velocity, K is the eddy diffusivity tensor, R_i is the net rate of chemical generation of species i, E_i is the rate of direct addition of the species from primary emissions, and D_i is the deposition rate caused by both dry and wet depositions. A detailed description of CMAQ principles is available elsewhere (Byun and Schere, 2006; Appel et al., 2017). Inspired by numerical CTMs principles and equations, the guiding framework of FastCTM was also structured in a similar formulation to represent the dominant processes in order to simulate air pollutant spatiotemporal variations.

In the context of deep learning, hourly air quality simulation is a spatiotemporal sequence-to-sequence learning problem aimed at predicting the most probable future sequence of length K, given a previous sequence of length J, as shown in Eq. (2),

\begin{matrix} (2) & \begin{aligned} {\hat{Y}}_{t + 1} {\hat{Y}}_{t + K} & = \arg max p ([Y_{t - J + 1}, \dots, Y_{t}], \\ [X_{t - J + 1}, \dots, X_{t}, X_{t + 1}, \dots, X_{t + K}]) \end{aligned} \end{matrix}

Where the arg max (short for “argument of the maximum”) function is used to find the p class with the highest predicted probability. The $X_{t} \in R^{M \times N \times V_{X}}$ is the data of V_X input variables at the spatial grid of M×N at time t. The $Y_{t} \in R^{M \times N \times V_{Y}}$ is the data of V_Y predictive variables at time t. Specifically, the FastCTM simulates future K-hour air pollutant concentrations, given J-hour air pollutant concentrations $[Y_{t - J + 1}, \dots, Y_{t}]$ as initial fields and (K+J)-hour meteorological and emission conditions $[X_{t - J + 1}, \dots, X_{t}, X_{t + 1}, \dots, X_{t + K}]$ . Previous studies generally used multiple-step input data with J>1 to ensure sufficient spatial-temporal correlations contained in the training data (Sun et al., 2022; Xing et al., 2022). Instead, we use a 1 h initial pollutant concentration (J=1) to simulate 24 h air quality pollutants (K=24), to ensure FastCTM is dedicated to learning air quality changes between two neighboring hours as shown in Fig. 1a. In other words, at time t=0, FastCTM predicted K-hour air pollutant concentrations of $C_{t = 0} C_{t = 1}, \dots, C_{t = K - 1}$ , given the input air pollutant concentration in the previous hour $C_{t = - 1}$ and corresponding meteorological data and emissions at time $t = 0, 1, \dots, K - 1$ . The unit of concentrations is µg m⁻³ for all pollutants.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f01

Figure 1(a) General model workflow, and (b) the basic simulator module structure at the time step t of the deep learning simulation model FastCTM, designed according to Eq. (1). Arrows and boxes with different colours represent calculation modules of different atmospheric physical and chemical processes.

Download

The FastCTM model uses the basic simulator module (Fig. 1a) recursively for hourly simulations, using output air pollutant concentrations from one step as input to the next step basic simulator. In contrast to directly learning spatiotemporal correlations of predictand itself as in most previous studies (Wang et al., 2018; Shi et al., 2017), the basic simulator (Fig. 1b) is formulated following the atmospheric physical and chemical equations and constraints shown in Eq. (1), and is composed of five modules to respectively represent the physics-chemical processes to improve the model performance. The modules for each of the five processes in the basic simulator are described in the following section. The time step used in FastCTM was 60 s.

2.2.2 Primary Emissions Module

Primary pollutants are assumed to be directly emitted into the atmosphere and instantly well-mixed within the PBL. Therefore, hourly enhancement of air-pollutant concentrations caused by primary emissions could be described in the following Eq. (3).

\begin{matrix} (3) & E_{m, n, i, t} = \frac{1000 \times {PE}_{m, n, i, t}}{PBLH \times d x \times d y} \end{matrix}

Where $E_{m, n, k, t}$ refers to the concentration changes contributed by primary emissions at spatial coordinate (m,n) for species i at time t. The ${PE}_{m, n, i, t}$ is the corresponding total primary emissions within the grid cell per second, which has a unit of g s⁻¹. Considering that the cell size in the FastCTM is 12 km by 12 km, we have dx = 12 000 and dy = 12 000 in this study. The boundary layer height PBLH, is also in the unit of meters (m). Therefore, the resulting air pollutant concentration increases by primary emission $E_{m, n, i, t}$ has a unit of µg m⁻³.

2.2.3 Horizontal Transport Module

In the FastCTM, horizontal transport usually has a significant influence on air quality variations (Lang, 2013). In CMAQ, the regional transport is generally represented by the divergence of the product of wind field and air pollutant species as in Eq. (1), inferred from continuity equations and convection equations (Michalakes et al., 2001; Byun and Schere, 2006). By decomposing the air mass movement into two orthogonal directions of east–west (x) and north–south (y), they could be rewritten in the form shown in Eq. (4),

\begin{matrix} (4) & \nabla \cdot (u C_{i}) = \frac{\partial (C_{i} U)}{\partial x} + \frac{\partial (C_{i} V)}{\partial y} \end{matrix}

Where the wind field is represented as u, which is then decomposed into U and V, respectively, in the x and y directions.

In the deep learning framework, the partial equation in Eq. (4) could be rewritten in a discrete form as convolution operations and inner product calculations as shown in Eq. (5) with a finite difference method. The convolutional kernels of W_x and W_y were defined in an upwind scheme as shown in Eqs. (6) and (7). With the scheme, this transport module itself is mass-conserved, even though FastCTM is not mass-conserved as a whole.

\begin{matrix} (5) & \nabla \cdot (u C_{i}) = \frac{W_{x} \cdot (C_{i} \times U)}{d x} + \frac{W_{y} \cdot (C_{i} \times V)}{d y} \\ (6) & W_{x} = \{\begin{cases} [\begin{array}{ccc} - 1 & 1 & 0 \end{array}] & if U < 0 \\ [\begin{array}{ccc} 0 & - 1 & 1 \end{array}] & if U \geq 0 \end{cases} \\ (7) & W_{y} = \{\begin{cases} [\begin{array}{c} 0 \\ 1 \\ - 1 \end{array}] & if V < 0 \\ [\begin{array}{c} 1 \\ - 1 \\ 0 \end{array}] & if V \geq 0 \end{cases} \end{matrix}

2.2.4 Diffusion Module

Diffusion involves the physical and chemical processes that disperse pollutants in the atmosphere. It is influenced by meteorological conditions, i.e. atmospheric stability and humidity, and surface features, i.e., land terrains and vegetation (Jiang et al., 2021). The turbulence diffusion process ∇(K∇C_i) in Eq. (1) helps the spread of pollutants in the atmosphere. It is expressed as the second-order deviation of species concentrations as shown in Eq. (8). They could also be discretized to convolutional operations with the finite difference method as shown in Eq. (9), just like that in the horizontal transport process module.

\begin{matrix} (8) & \nabla (K \nabla C_{i}) = \frac{\partial}{\partial x} (K \frac{\partial C_{i}}{\partial x}) + \frac{\partial}{\partial y} (K \frac{\partial C_{i}}{\partial y}) \\ (9) & \nabla (K \nabla C_{i}) = \frac{W_{x} \cdot (K \times W_{x} \cdot C_{i})}{d x \times d x} + \frac{W_{y} \cdot (K \times W_{y} \cdot C_{i})}{d y \times d y} \\ (10) & K = {Encoder}_{K} ([T, RH, PRS, PBLH]) \end{matrix}

The turbulent diffusivity K is closely related to the meteorological conditions of the atmosphere and is simulated with an encoder module Encoder_K (Eq. 10). The input variables of the Encoder_K include temperature T, humidity RH, surface pressure PRS, and boundary layer height PBLH. The Encoder_K is determined to be a grid-to-grid regression model based on the Unet $+ +$ model with a nested structure (Zhou et al., 2018; Ronneberger et al., 2015). The Encoder_K model consists of 5 layers with each layer respectively composed of 16, 32, 64, 128 and 256 filters.

2.2.5 Chemical Reaction Module

Reduced-form models like InMAP (Tessum et al., 2017) and EASIUR (Gentry et al., 2023) focus on annual-average exposure, while FastCTM provides hourly-resolved simulations critical for real-time management. FastCTM quantifies hourly contributions from individual processes (transport, chemistry, emissions) via its modular design, rather than aggregating source impacts in reduced-form models (e.g., EASIUR's source-receptor matrices). Furthermore, FastCTM explicitly couples meteorology (PBLH, T, RH) with chemistry, whereas InMAP/APEEP (Muller and Mendelsohn, 2007) assume static meteorology, which limits their utility in capturing diurnal or synoptic-scale variations. Specifically, the air pollutant concentration changes caused by chemical reactions are represented in the following Eq. (11). In the equation, the rate of chemical reaction of species i is expressed as the product of a rate constant k and a term that is dependent on the concentrations of its reactants j (Carter, 1990; Carter and Atkinson, 1996).

\begin{matrix} (11) & R_{m, n, i, t} = k_{m, n, i, t} \times f (C_{m, n, j, t}) \\ (12) & k_{i} = {Encoder}_{k} ([T, RH, PRS, WS, PRE, CFRAC]) \end{matrix}

The reaction kinetics constant k is generally temperature-dependent. They could also be related to atmospheric pressures and moisture humidity in some reaction processes. Therefore, the reaction rate constant k is simulated using a spatial encoder function Encoder as shown in Eq. (12), which has the same structure as that of diffusion encoder modules (Eq. 10). There are 6 input variables of the Encoder_k including T, RH, PRS, WS, RN and CFRAC. The concentration processor f is designed as a simple multi-layer convolutional network with a kernel size of 1 to represent high-order and complex relations among different reactants.

2.2.6 Deposition Module

Air pollutant deposition refers to the process by which atmospheric pollutants are transferred to Earth's surfaces (land, water, vegetation) or removed from the air. This phenomenon plays a critical role in environmental pollution dynamics and ecosystem impacts. The deposition was closely influenced by meteorological conditions and surface characteristics (Janhäll, 2015). For example, high wind disperses pollutants, while turbulence enhances dry deposition. Forests and crops act as sinks due to large surface areas for adsorption. Air quality changes due to the deposition process are expressed linearly as the product of the deposition rate d and the corresponding air pollutants concentrations C, as shown in Eq. (13). The constant d is closely related to the current and previous meteorological conditions, terrains, and underlying land cover types. Therefore, they are all simulated with an Encoder module as shown in Eq. (14).

\begin{matrix} (13) & D_{m, n, i, t} = d_{m, n, i, t} \times C_{m, n, i, t} \\ (14) & d = {Encoder}_{d} ([WS, RH, RN, HGT, LULC]) \end{matrix}

The model structure and parameter configurations are also the same as that of Encoder_K and Encoder_k. The input data variables of Encoder_d include WS, RH, RN, HGT and LULC.

2.3 Model Training

The FastCTM was programmed with Python 3 on the deep learning framework TensorFlow (Abadi et al., 2016). The model was trained with the WRF-CMAQ operational forecast data in China for 2018–2022. Considering that on each day we had 120 h forecasts with a spatial coverage of 426 × 372 grid cells (each with a size of 12 × 12 km²) for 9 meteorological variables and I=10 air pollutant variables, the total training dataset has a size of $TD = R^{1826, 120, 426, 372, 19}$ , where 1826 represents the total counting days from 2018 to 2022. Since the model was set to predict 24 h PM_2.5 concentrations from 1 h input data, the total input sequence length was 25 h in each training step. Besides, the size M×N of the input data X_t to FastCTM was decided to be 150 × 150, equal to an area of 1800 × 1800 km² in 12 km resolution. Therefore, the input batch data for FastCTM in each step should be the size of $BD = R^{b, 25, 150, 150, 19}$ , where b is the batch size (determined as 1 in this study). The input data BD are randomly sliced from the whole training dataset TD in each training iteration, indicating each BD represents different spatial and temporal coverages. The random sampling tactics help the model learn inherent physical and chemical principles rather than just statistical spatiotemporal autocorrelations using data in a constant spatial area (Xing et al., 2022). Besides, the spatio-temporal random samples contain varied emissions, which would improve FastCTM adaptation to changing emission levels.

Even though five modules are defined in FastCTM, individual processes are not trained separately. The model was trained as a whole with hour-to-hour air pollutant concentrations, while each process could learn its parameters under the constraints of its dedicated formulation. Specifically, FastCTM was tuned to minimize the loss function L, which was determined to be L2 loss (Bühlmann and Yu, 2003) of the regularized mean squared error (MSE) as shown in Eq. (15). The model was optimized using the Adam optimizer (Kingma and Ba, 2014).

\begin{matrix} (15) & L = \frac{1}{J \times N \times M \times I} \sum_{t = 1}^{J} \sum_{m = 1}^{M} \sum_{n = 1}^{N} \sum_{i = 1}^{I} {(C_{m, n, i, t} - {\tilde{C}}_{m, n, i, t})}^{2} \end{matrix}

The learning rate was set to be 0.001, and the batch size to be 1. The FastCTM model was trained on one entry-level professional acceleration card of NVIDIA A40 with a running time of 10 h for every 10 000 iterations. A total of 300 000 iterations were performed until the remaining model loss stabilized.

2.4 Model Evaluation

FastCTM was assessed against CMAQ simulations using the same input emission data and meteorological fields. Starting from 00:00 local time on each day, the CMAQ model simulated 120 h forecasts in one cycle. There are 139 cycles in the evaluation year of 2023 due to data unavailability in the remaining days. The FastCTM model generated 119 h forecasts using a 1 h initial input condition. The 119 h forecasts in the leading hours from 2 to 120 by the two models were compared regarding to the corresponding leading time. For example, when we had a 120 h forecast starting at 00:00 on 1 January 2023 at Beijing Local Time (BLT), the data of 00:00 on 1 January 2023 were fed into FastCTM to get the 119 h forecasts until 23:00 on 5 January. The 10 species forecasts by FastCTM were compared against the CMAQ forecasts at each corresponding hour. The metrics of root mean square error (RMSE) and coefficient of determination (R²) were calculated daily in each of 119 leading hours on the difference in each of the 158 742 grid cells between CMAQ and FastCTM. Therefore, metrics of R² and RMSE were obtained on each lead hour on each day of the independent test year of 2023. The statistical values on each day are then averaged for the same leading hour for comparison.

The FastCTM was also assessed in terms of sensitivity analysis to emission inputs and meteorological fields. For meteorological variables, responses of six criteria pollutant concentrations to T, WS, and PBLH were calculated. For emissions, responses to paired variables of SO₂/NH₄ and NO_x/VOC were calculated. Besides, FastCTM's capability to simulate responses to emission changes was also evaluated by comparing with CMAQ simulations in 11 emission-intervention scenarios. Finally, the contributions of five internal processes of transport, diffusion, emission, reaction, and deposition were also analyzed and discussed for an example pollution episode.

3 Results

3.1 Forecast Performance by FastCTM

FastCTM has exhibited strong, stable performance in reproducing CMAQ forecasts over the 119 h forecast period evaluated for 2023 (Fig. 2). The average RMSE values for six criteria pollutants of PM_2.5, PM₁₀, SO₂, NO₂, CO, and O₃ are, respectively 9.1, 11.9, 4.4, 4.0, 48.9 and 10.9 µg m⁻³. For R² values, they are 0.8, 0.81, 0.8, 0.83, 0.9 and 0.7. As for PM_2.5 components, RMSE values are 1.68, 2.68, 1.52, 1.98 and 4.25 µg m⁻³, respectively for SO $_{4}^{2 -}$ , NO $_{3}^{-}$ , NH $_{4}^{+}$ , organic matters and other inorganic components, while the R² values are 0.72, 0.6, 0.3, 0.83 and 0.68. Compared to the ∼ 5 ppb (∼ 10.5 µg m⁻³) in the previous study by Xing et al. (2022), the FastCTM model has similar RMSE values in forecasting O₃. To test the influences of initial conditions on FastCTM long-term simulations, FastCTM forecasts using zero values as input air quality data were almost the same as those using ordinary input in the long leading hours. Results indicating that FastCTM simulations in long leading hours are not affected by initial conditions (Fig. S5 in the Supplement), just like deterministic CTMs (such as CMAQ). In other words, the insensitivities of FastCTM to initial conditions indicate that it has well learned and encoded the most physical and chemical principles in CMAQ CTM, rather than just spatio-temporal correlations among air quality sequences.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f02

Figure 2The evaluation performances of FastCTM forecasts against CMAQ forecasts in 2023. Panels (a) and (b) respectively show RMSE values of criteria pollutants and the PM_2.5 components. Panels (c) and (d) show R² values. It should be noted that the RMSE value of CO corresponds to the right axis in (a).

Download

Hourly RMSE values show clear diurnal variation with higher RMSE values in the nighttime than that in the daytime, which could be due to higher hourly concentrations of air pollutants in the nighttime, except for O₃ (Fig. S6 of the Supplement). Consistency between CMAQ and FastCTM, as characterized by R², is lower in the daytime. Since the FastCTM is a 2-D model only considering atmospheric processes within the boundary layer, lower consistency with the CMAQ model during daytime, possibly due to more vigorous vertical mixing. Strong vertical mixing of air pollutants to the height above PBLH has been found (Li et al., 2017; Tang et al., 2016), which may not be fully represented in FastCTM. It is important to note that the relatively low R² values observed for NH $_{4}^{+}$ . While CMAQ explicitly resolves NH $_{4}^{+}$ formation reactions, FastCTM does not explicitly encode these pathways. Instead, the neural network implicitly learns relationships between NH $_{4}^{+}$ and precursor emissions (NH₃, NO_x, SO₂) and meteorological variables (e.g., temperature, humidity). This simplification omits acid-base equilibria and aerosol thermodynamics, which are critical for partitioning NH $_{4}^{+}$ between gas and particle phases. The low R² for NH $_{4}^{+}$ primarily reflects FastCTM's simplified chemical mechanism in this part, which could be improved by adding related species in the simulation.

The spatial distributions of the mean absolute error (MAE) and the normalized mean absolute error (NMAE) are presented in Fig. 3. For all six pollutants under consideration, MAE values tend to be higher in polluted areas. In polluted environments, there are often multiple sources of emissions, complex chemical reactions, and variable meteorological conditions that can lead to greater discrepancies between the predicted concentrations of the two models. Conversely, the NMAE values exhibit an opposite trend, being lower in polluted areas. In these regions, the NMAE values typically hover around 0.2, in contrast to the relatively higher values of approximately 1 in cleaner areas. The NMAE is a normalized metric that takes into account the magnitude of the actual pollutant concentrations. A lower NMAE in areas with high pollution levels suggests that the FastCTM model is effectively capturing the overall magnitude and trends relative to the reference CMAQ simulation. The Air quality forecasts starting from 00:00 a.m. on 4 March 2023 (Fig. S7 in the Supplement) demonstrate FastCTM's strong capability in modelling the complex spatio-temporal changes in a large spatial domain and over a relatively long period and a large area.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f03

Figure 3Spatial distribution of mean absolute error (a, c, e, g, i, k) and normalized mean absolute error for the six criteria pollutants (b, d, f, h, j, l) of FastCTM compared with CMAQ in 2023.

Defining the warm season as the months from April to September and the winter and cold season as the remaining months, the FastCTM model exhibited comparable performances. As shown in Fig. 4 (with detailed information in Fig. S8 in the Supplement), the coefficient of determination R² values for the six criteria pollutants were 0.82, 0.8, 0.8, 0.82, 0.91, and 0.7 in the warm season, and 0.8, 0.79, 0.78, 0.83, 0.88, and 0.68 in the cold season, respectively. To assess the performance variations of FastCTM across different spatial locations, comparative evaluations were carried out in urban and rural areas as well as in inland and coastal regions. Generally, FastCTM demonstrated slightly higher accuracies in rural areas compared to urban areas (as presented in Fig. S9 in the Supplement). This outcome is reasonable given the more intricate emission and chemical processes prevalent in urban settings (Guo et al., 2014). Similarly, FastCTM exhibited comparable performances in inland areas to those in coastal areas, except for PM_2.5 and PM₁₀ (Fig. S10 in the Supplement).

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f04

Figure 4The mean evaluation R² values for all 119 leading hours of FastCTM forecasts in warm/cold seasons, rural/urban areas, and coastal/inland areas.

Download

To validate the FastCTM model, three land use regression (LUR) models were constructed, namely the linear regression model, the random forest model (with the number of trees set at 500), and the XGBoost model (with the booster specified as gbtree). These LUR models were developed using the same input meteorological data, emissions, and geophysical variables as FastCTM to ensure fair comparison. When compared with the FastCTM model, the performance of the LUR models was found to be significantly inferior, as demonstrated in the Table 1 and Figs. S10–S12 in the Supplement. For example, R² values for FastCTM range from 0.68–0.90, whereas the LUR models only achieve 0.06–0.33. This outcome is anticipated when we consider the complex nature of air quality dynamics in predicting future air quality. Air quality is not a static entity, but it varies both spatially and temporally, determined by the joint effects of local emissions, meteorological conditions, and surface features, etc. For instance, the transport of air pollution is a highly dynamic process that hinges on wind fields and air pollution concentrations in a reciprocal manner. The wind direction and speed dictate the trajectory along which pollutants travel, while the existing pollutant concentrations in different regions influence the overall dispersion and mixing patterns. LUR models, which on the other hand predominantly rely on local input data (Wong et al., 2021; Cheng et al., 2021), struggle to capture these intricate, non-local interactions. They cannot account for the far-reaching effects, such as wind-driven pollutant transport and the temporally accumulated changes in air quality over larger geographical areas. As far as we know, LUR models have been mostly applied in predicting air pollution fields in retrieval given corresponding air quality observations as training and constrained input data. They have been seldom used in air quality forecasts and simulations, as we have demonstrated with the FastCTM model.

Table 1Performance metrics of LUR models and FastCTM compared against CMAQ.

Download Print Version | Download XLSX

Annually, the daily air quality typically exhibits similar fluctuations to those in other years, which can be primarily attributed to the cyclical nature of meteorological conditions and pollutant emission patterns. The FastCTM model was trained using a comprehensive dataset spanning five years, from 2018 to 2022. In light of this, it was crucial to rule out the possibility that the model was merely reproducing historical averages during the test year of 2023. To this end, the daily national average concentrations of PM_2.5 and O₃ in 2023, as predicted by FastCTM, were meticulously compared with those simulated by CMAQ in the same test year, as well as with the CMAQ forecasts from the training years of 2018–2022. As illustrated in Fig. 5, the predictions made by FastCTM in 2023 align more closely with the actual CMAQ forecasts for that year, with R²=0.94 and 0.72, respectively, for PM_2.5 and O₃, rather than with the forecasts generated from the training data of 2018–2022, with R²=0.54 and 0.59. The NMB was also lower between FastCTM and CMAQ for the same year, 2023. These results not only validate the adaptive learning capabilities of the FastCTM model but also indicate that the model is not using a simplistic approach of averaging concentrations from the previous five years based on time of day. Hourly time series plots of air pollutant concentrations (Fig. S6 in the Supplement) further demonstrate that FastCTM appears to incorporate real-time meteorological feedback, adjust for shifts in emission patterns, and leverage its learned relationships to provide more accurate and contemporaneous predictions.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f05

Figure 5The daily FastCTM forecasts compared with CMAQ forecasts, respectively, in the training period of 2018–2022 and the evaluation period of 2023 for (a) PM_2.5 and (b) O₃. The gaps for FastCTM and CMAQ in 2023 are due to data unavailability these days.

Download

3.2 Sensitivity Analysis with FastCTM

The FastCTM model was trained with 5-year meteorological and air quality simulations by WRF-CMAQ. These simulations used an emission inventory that was identical for every year. In this condition, the FastCTM model has learned the relationships between the air quality and varied meteorology with fixed emissions input. Considering that the FastCTM model has exhibited high accuracy in an independent evaluation year 2023, when new meteorological fields are fed into FastCTM, the deep learning model should be able to simulate responses of air pollutant concentrations to meteorological variables. However, for the response of air pollutant concentrations to emissions, the training data do not contain relationships between inter-annual varied emissions and air quality under the condition of the same annual meteorological fields. Therefore, it is less expected for FastCTM to simulate reliable and correct response relationships between emissions and air quality. To validate these analyses, we calculated the sensitivities of simulated air pollutant concentrations to changes in meteorological variables and emissions.

3.2.1 Response of Air Pollutant Concentration to Meteorology

The responses of six criteria pollutants to meteorological changes simulated by FastCTM are evaluated as exhibited in Fig. 6. For ground-level temperature (T) elicited a distinct response in O₃ concentrations compared to the other five criteria pollutants. O₃ concentrations have slight negative responses to T in January, as shown in Fig. 6a, which is probably because higher temperatures increase NO? emissions, enhancing dilution. O₃ concentrations had the strongest positive responses in August among six pollutants, which is consistent with previous observation-based studies (Flaum et al., 1996). The O₃ had larger sensitivities when the air temperature was higher. The gaseous pollutants of CO, NO₂, and SO₂ show the strongest positive response to temperature, which could be caused by the shift of chemical equilibrium towards the higher release of these gaseous pollutants (Bassett and Seinfeld, 1983; Cox, 1982). The particulate matter pollutants, especially PM₁₀, have the weakest responses among six pollutants. Considering that there are dominating proportions of chemically inert species in particulates, the weak responses of PM_2.5 and PM₁₀ are expected.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f06

Figure 6The FastCTM predicted air pollutant percentage changes in response to changes of T, WS, and PBLH in Beijing on 2 January (a–c, respectively in the left column) and 1 August (d–f, respectively in the right column), 2023. The air pollutant concentrations are relative to those at the baseline meteorological conditions.

Download

For the wind speed and PBLH, the responses of pollutants have similar patterns for the same pollutant. First, O₃ concentrations exhibited patterns opposite to other pollutants both in January and August. Higher wind speed would increase the dispersion and transport of air pollutants (Feng et al., 2015; Lv et al., 2017), resulting in lower pollution levels, so concentrations decrease as wind speed increases, except for O₃. The contradictory response of ozone and particulate matter concentrations to PBLH is consistent with the analysis results of multiple-year observations (Liu and Tang, 2024). Theoretically, the air pollutant concentrations should exhibit an inverse relationship between air pollution concentrations and PBLH. The actual air pollutant concentration changes simulated by FastCTM generally fit the theory that there are negative nonlinear effects with increasing PBLH. Meanwhile, the sensitivity is stronger when the PBLH is lower (Fig. 6e and f), which is consistent with previous observation-based analysis (Wang et al., 2019; Su et al., 2020). The totally different relationship of O₃ to wind speed and PBLH compared to other pollutants could be due to its high dependence on chemical precursors, such as NO_x and VOC. Concentrations of these precursors could have an inverse relationship with O₃ at specific locations. The FastCTM model itself is trained with multi-year CMAQ simulations, indicating that it is preconditioned on varied meteorological fields with the same atmospheric physical and chemical rules. Therefore, the sensitivity of air quality simulations to meteorological variations could be well learned, especially with the discipline-based model FastCTM.

3.2.2 Response of Air Pollutant Concentration to Emission

The sensitivity analysis with a “brute force” method can be carried out with the FastCTM model quickly due to its high computational efficiency on GPU. The responses of PM_2.5 concentrations to doubled emissions of SO₂, NO_x were explored in the winter month of January 2023 (Fig. 7). For doubled NO_x, the PM_2.5 concentrations exhibited positive responses in most areas of China as shown in Fig. 7a. The largest increases occurred in North China, Heilongjiang province in Northeast China, the Yangtze River Delta, and Sichuan province. In these places, the NO_x emissions are relatively large. For doubled SO₂, PM_2.5 concentrations increased in almost all of China as shown in Fig. 7b. The response was larger in North China, Northeast China and the Sichuan basin. The PM_2.5 responses simulated by FastCTM were generally consistent with previous studies (Li et al., 2022).

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f07

Figure 7Average predictions of PM_2.5 concentrations in 5 lead-days with doubled emissions in January 2023. Panel (a) refers to predictions with doubled NO_x, and (b) refers to double SO₂.

As for ozone, its responses to doubled NO_x and VOC are explored as shown in Fig. 8. For NO_x emission, decreases in O₃ concentrations in polluted regions like North China, the Yangtze River Delta, and other highly industrial regions are well captured by FastCTM. The response is reasonable considering that these regions are generally abundant with NO_x emissions and at VOC-limited conditions. Doubling VOC emissions leads to a significant decrease in O₃ (Fig. S14 in the Supplement), since increased VOC would consume more O₃ in these regions. The spatial patterns of O₃ responses to NO_x and VOC are similar to a previous deep learning study trained by emission-controlled simulation data (Xing et al., 2022). However, due to the complex speciation of VOC emissions that is simplified in the FastCTM, uncertainties for responses of O₃ to VOC should be noted.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f08

Figure 8Average predictions of hourly O₃ concentrations in 5 lead-days with doubled NO_x emissions in July 2023.

The sensitivities of FastCTM simulations to emission interventions were contrasted with those of CMAQ. Specifically, CMAQ was employed to simulate 11 emission scenarios over the two-month periods of January and July 2019 in Southwest China (Huang et al., 2022). The alterations in emissions relative to the base case are presented in Table 1. Among these scenarios, 10 involved reduced emissions of major species, with only the no-control scenario exhibiting increased emissions. Utilizing the identical emissions and meteorological data, FastCTM also conducted simulations, which were then compared to those of CMAQ. For the 11 scenarios in question, the changes in air pollutant concentrations relative to the base case at the locations of 139 national air quality monitoring stations (Fig. S15 in the Supplement) were extracted and compared in the winter month of January 2019 (Fig. 9a) and in the summer month of July 2019 (Fig. 9b). The results indicated that, overall, the FastCTM simulations due to emissions changes were in good agreement with those of CMAQ, as reflected in two aspects. The correlation coefficient R values are around 0.9 for SO₂, NO_2, and O₃ in both summer and winter months. For PM_2.5 and PM₁₀, FastCTM exhibited higher consistency with CMAQ in July than in January, with R values around 0.6 for most cases. For CO, FastCTM has much better performance in January than in July, with R values of approximately 0.8 and 0.2. Considering that CO concentration changes are mostly due to physical dispersion and transport, the decreased performance is probably due to increased vertical mixing in summer, which is not fully represented in the 2D scheme of FastCTM. Specifically, in January 2019, except for NO₂, FastCTM responded to emission changes with an interquartile range (IQR, 25 %–75 % percentile) similar to that of CMAQ (Fig. S16). In July 2019, as depicted in Fig. S17, all the criteria pollutants except CO demonstrated a comparable degree of response to emission reductions.

Table 2The emission change details of the emission scenarios.

Download Print Version | Download XLSX

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f09

Figure 9Correlation coefficient R for responses of FastCTM and CMAQ to different emission scenarios and different air pollutants in January 2023 (a) and July 2023 (b).

Download

FastCTM model used a principles-constrained formulation framework. As shown in Eq. (4), atmospheric chemical reactions are in the Atkinson form, which independently estimates the reaction rate from meteorological conditions and polynomials of reactant concentrations in multiple powers. The principle-based formulation should be the reason for the relatively significant and reasonable response simulations of PM_2.5 and O₃ to precursor emissions, even though the FastCTM itself is not trained by emission-controlled CMAQ scenario simulations. The remaining uncertainties should be attributed to the reason that FastCTM only considered environmental chemical reactants in part, compared to that of the CMAQ model (Binkowski and Roselle, 2003).

3.3 Internal Processes Analysis with FastCTM

The FastCTM is a principles-guided deep neural network to individually simulate the dominant atmospheric physical and chemical processes as defined in Eq. (1). The processes are calculated numerically with critical parameters describing the processes being estimated by deep learning encoders. The hourly concentration changes equal the sum of the changes produced by each process. Figure 11 depicts an example during the nighttime of 13 January 2023, when hourly PM_2.5 concentration changes significantly. Between the two hours of 18:00 and 19:00, hourly PM_2.5 concentrations change markedly in neighbouring areas of Shandong, Hebei, and Henan provinces as shown in the red rectangle (denoted as Area A hereafter) in Fig. 11c. In this example, strong northern wind prevails, leading pollutants to move southward. For PM_2.5 concentration changes caused by primary emissions (Fig. 8d), it is determined by the primary emission and the mixing volumes determined by PBLH. PM_2.5 changes are mostly determined by the transport process (Fig. 11e) as its spatial pattern most closely resembles total PM_2.5 concentration changes. In the transport process, air pollutants move from one area to another, determined by the wind fields as shown in Eq. (4). When the northern clean air prevails as in Area A, changes should be negative in the upstream direction and positive in the downstream direction. The transport process simulated by FastCTM sticks to this pattern. As known to us, the diffusion process will bring pollutants from a region of high concentration to one of low concentration. Its contribution is low as shown in Fig. 11f, which is reasonable considering the relatively large grid cell size of 12 km and short simulation period of 1 h. PM_2.5 concentration changes caused by the diffusion process constituted a small proportion compared to other processes. The activities of chemical reactions are determined by both meteorological conditions and related precursor concentrations. PM_2.5 contribution changes between T1 and T2 caused by chemical reactions are lower in the areas to the north of Area A because the cold and clean air in this area is not favourable for chemical reactions. The deposition is the dominant process that led to PM_2.5 concentration reductions where regional transport was not significant. In general, deposition rates were proportional to PM_2.5 concentrations as shown in Fig. 8h (Davis and Swall, 2006). It should be noted that FastCTM simulated air quality in a 2-D domain rather than in 3-D. The deposition may also include the vertical transport of air pollutants to the upper air above PBL (Zhao et al., 2020).

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f10

Figure 10An example of the PM_2.5 concentration at T1 (18:00, a) and T2 (19:00, b) on 13 January 2023 (with the forecast leading time of 42 h) and hourly changes (c). Changes caused by each of the five dominant processes are depicted in (d)–(h).

Simulated contributions of five major processes to hourly PM_2.5 concentration changes are compared between FastCTM and CMAQ at 139 stations (Fig. S15) in the Sichuan-Chongqing region from 12 to 16 October 2024, as shown in boxplots of Fig. 11. Overall, the simulation results of the process contributions by FastCTM and its parent model CMAQ were relatively consistent. Higher degrees of consistency were found in simulations of emissions, advection processes, and diffusion processes between the two models. Contributions from chemical reactions of FastCTM exhibited overestimation compared to CMAQ, while contributions from deposition were underestimated. The differences in the simulated deposition and reaction contributions between the two models could be due to incomplete representation of influencing factors, given the complexity of the two processes. In general, the consistency between the two models provides confidence in the reliability of FastCTM for simulating and understanding the complex interplay of atmospheric processes that govern PM_2.5 levels.

https://gmd.copernicus.org/articles/18/6295/2025/gmd-18-6295-2025-f11

Figure 11Boxplots of hourly PM_2.5 contribution changes from five major atmospheric processes at 139 evaluation stations from 13 to 16 October 2024, simulated by (a) CMAQ and (b) FastCTM.

Download

4 Discussions

4.1 Model Accuracy and Uncertainty

One debatable concern is the accuracy of neural network (NN)-based components in integrated chemical transport models (CTMs) and the potential for amplified uncertainty when coupling multiple NN modules. Literature precedent suggests that individual NN emulators may exhibit lower accuracy compared to traditional physical parameterizations, but their integration could introduce unexplained uncertainties. This is a valid consideration that aligns with broader discussions in Earth system modeling about the trade-offs between computational efficiency and physical fidelity (Irrgang et al., 2021).

In FastCTM, we address this by adopting a principle-informed modular design where each module (transport, chemistry, deposition, etc.) is constrained by governing physical/chemical equations (e.g., Eqs. 3–14). This distinguishes it from unconstrained “black-box” NN models, as each process is guided by known atmospheric dynamics. For example, the transport module explicitly enforces mass conservation via upwind schemes (Eqs. 5–7), and the chemical reaction module links reaction rates to meteorological conditions (Eq. 12) based on kinetic theory. Our evaluation shows that FastCTM maintains high consistency with CMAQ across 119 h forecasts (Sect. 3.1), with R² values exceeding 0.8 for most pollutants, indicating that physical constraints effectively mitigate accuracy losses.

However, we acknowledge that uncertainty can accumulate when coupling modules, particularly for species involved in complex multi-process interactions due to limited chemical constraints in our current training datasets(e.g., NH $_{4}^{+}$ , Sect. 3.1). This is partly due to simplifications in FastCTM's chemical mechanism, which omits some aerosol thermodynamics included in CMAQ. Future work will reduce such uncertainties by incorporating additional species (e.g., VOCs) and refining process formulations by adding CMAQ's integrated process rate (IPR) data for supervised training of individual modules.

4.2 Choosing Neural Network Components over Traditional Parameterizations

One question might arise about the utility of replacing non-bottleneck CTM components (e.g., deposition) with NN solvers, given the argument that traditional parameterizations may already be accurate and fast. This highlights a critical design choice in FastCTM: balancing computational efficiency with fidelity to the parent model (CMAQ).

It is important to note that even non-bottleneck components in traditional CTMs can benefit from NN acceleration in integrated simulations. For example, CMAQ's deposition module, while not a primary computational burden, relies on parameterizations based on similarity theory and limited flux measurements (Janhäll, 2015), which may oversimplify complex surface-atmosphere interactions (e.g., vegetation-specific uptake). NN-based parameterizations have shown promise in improving such processes. Silva et al. (2019), for instance, developed a deep learning model for ozone dry deposition that outperformed traditional schemes in independent validation. In FastCTM, the deposition module (Eq. 14) leverages NN to capture nonlinear relationships between meteorology (e.g., wind speed, land cover) and deposition rates, while retaining compatibility with CMAQ's output.

Moreover, FastCTM's modular architecture allows flexible integration of traditional parameterizations as an option. For example, users could replace the NN-based deposition module with CMAQ's original parameterization if higher fidelity to that specific process is prioritized. This hybrid approach addresses concerns about unnecessary replacement of robust components while retaining the overall speed advantage of NN for bottleneck processes (e.g., chemical reactions, which dominate CTM runtime; Xia et al., 2025).

4.3 Beyond “Black Boxes”: Interpretability and Error Identification

A central goal of FastCTM is to advance beyond opaque deep learning models by enabling process-level interpretability, addressing concerns about error attribution. Traditional “black-box” NN models obscure how individual processes contribute to predictions, hindering error analysis. In contrast, FastCTM's modular design quantifies hourly contributions from transport, diffusion, emissions, chemistry, and deposition separately (Sect. 3.3), allowing targeted identification of error sources. For example, in the January 2023 pollution episode (Fig. 10), transport was found to dominate PM_2.5 concentration changes, while deposition acted as a secondary sink This process-level attribution aligns well with CMAQ's process analysis (Fig. 11), ensuring that uncertainties are traced to specific physical processes rather than being attributed to arbitrary model behavior.

We anticipate that incorporating abundant CMAQ's integrated process rate (IPR) data for supervised training of individual modules will further refine the FastCTM's process level predictions. However, a comprehensive process-oriented error analysis that would further enhancing interpretability, for instance isolating and quantifying whether transport or chemistry drives urban-rural accuracy discrepancies, requires long-term process simulations and systematic perturbations plus observational datasets (e.g., tracer experiments) to validate specific processes predictions from both CMAQ and FastCTM.

4.4 Limitations and Future Directions

FastCTM's current limitations include simplified vertical dynamics (2D boundary layer representation) and incomplete chemical mechanisms, which affect performance during vigorous daytime mixing (Sect. 3.1). A future extension to a 3D framework will improve representation of vertical transport and in-cloud chemistry. Additionally, while FastCTM efficiently reproduces CMAQ simulations, it does not claim superiority over traditional CTMs across all scenarios; rather, it serves as a complementary tool for applications requiring rapid simulations (e.g., ensemble forecasting, emission scenario screening).

By addressing these limitations and engaging with ongoing debates about NN integration in atmospheric modeling, FastCTM aims to bridge the gap between computational efficiency and physical rigor, providing a flexible framework for air quality research and management.

Data availability

The land use and land cover data are available at the Data Sharing and Service Portal of the Chinese Academy of Science (http://data.casearth.cn/en/sdo/detail/5ebe2a9908415d14083a4c24, last access: 23 June 2023, login required). The CTM simulation data and source code files of the exact version used to produce the results used in this paper are available at https://doi.org/10.5281/zenodo.13757211 on Zenodo (Lyu, 2024). The configuration files for running models of WRF v3.4.1 and CAMQ v5.0.2 are also available at https://doi.org/10.5281/zenodo.5152621 (Hu, 2021).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gmd-18-6295-2025-supplement.

Author contributions

BL and YH conceived the study. BL developed the model and codes. RH and XW contributed the CTM simulation data. BL and RH collected the observation data. BL analyzed data and wrote the paper with contributions from YH, RH, WW, and XW. RH managed the project.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

The findings in this research do not necessarily reflect the views of the sponsors.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Also, please note that this paper has not received English language copy-editing. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We thank the two anonymous reviewers for their constructive comments and suggestions.

Financial support

This research has been in part supported by the AiMa R&D Project (R#2016-004) of Hangzhou AiMa Technologies.

Review statement

This paper was edited by Volker Grewe and reviewed by two anonymous referees.

References

Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., and Devin, M.: Tensorflow: Large-scale machine learning on heterogeneous distributed systems, arXiv [preprint], https://doi.org/10.48550/arXiv.1603.04467, 2016.

Appel, K. W., Napelenok, S. L., Foley, K. M., Pye, H. O. T., Hogrefe, C., Luecken, D. J., Bash, J. O., Roselle, S. J., Pleim, J. E., Foroutan, H., Hutzell, W. T., Pouliot, G. A., Sarwar, G., Fahey, K. M., Gantt, B., Gilliam, R. C., Heath, N. K., Kang, D., Mathur, R., Schwede, D. B., Spero, T. L., Wong, D. C., and Young, J. O.: Description and evaluation of the Community Multiscale Air Quality (CMAQ) modeling system version 5.1, Geosci. Model Dev., 10, 1703–1732, https://doi.org/10.5194/gmd-10-1703-2017, 2017.

Bassett, M. and Seinfeld, J. H.: Atmospheric equilibrium model of sulfate and nitrate aerosols, Atmospheric Environment, 17, 2237–2252, https://doi.org/10.1016/0004-6981(83)90221-4, 1983.

Binkowski, F. S. and Roselle, S. J.: Models-3 Community Multiscale Air Quality (CMAQ) model aerosol component 1. Model description, Journal of Geophysical Research Atmospheres, 108, 335–346, 2003.

Bühlmann, P. and Yu, B.: Boosting With the L2 Loss, Publications of the American Statistical Association, 98, 324–339, 2003.

Byun, D. and Schere, K. L.: Review of the governing equations, computational algorithms, and other components of the Models-3 Community Multiscale Air Quality (CMAQ) modeling system, Applied Mechanics Reviews, 59, 51–77, 2006.

Carter, W. P. L.: A detailed mechanism for the gas-phase atmospheric reactions of organic compounds, Atmospheric Environment. Part A, 24, 481–518, https://doi.org/10.1016/0960-1686(90)90005-8, 1990.

Carter, W. P. L. and Atkinson, R.: Development and evaluation of a detailed mechanism for the atmospheric reactions of isoprene and NO_x, International Journal of Chemical Kinetics, 28, 497–530, https://doi.org/10.1002/(SICI)1097-4601(1996)28:7<497::AID-KIN4>3.0.CO;2-Q, 1996.

Cheng, B., Ma, Y., Feng, F., Zhang, Y., Shen, J., Wang, H., Guo, Y., and Cheng, Y.: Influence of weather and air pollution on concentration change of PM_2.5 using a generalized additive model and gradient boosting machine, Atmospheric environment, 255, 118437, https://doi.org/10.1016/j.atmosenv.2021.118437, 2021.

National Research Council: Air Quality Management in the United States, The National Academies Press, Washington, DC, https://doi.org/10.17226/10728, 2004.

Cox, R. A.: Chemical Transformation Processes for NoX Species in the Atmosphere, in: Studies in Environmental Science, edited by: Schneider, T. and Grant, L., Elsevier, 249–261, https://doi.org/10.1016/B978-0-444-42127-2.50027-0, 1982.

Davis, J. M. and Swall, J. L.: An examination of the CMAQ simulations of the wet deposition of ammonium from a Bayesian perspective, Atmospheric Environment, 40, 4562–4573, 2006.

Eder, B., Kang, D., Mathur, R., Yu, S., and Schere, K.: An operational evaluation of the Eta–CMAQ air quality forecast model, Atmospheric Environment, 40, 4894–4905, 2006.

Efstathiou, C. I., Adams, E., Coats, C. J., Zelt, R., Reed, M., McGee, J., Foley, K. M., Sidi, F. I., Wong, D. C., Fine, S., and Arunachalam, S.: Enabling high-performance cloud computing for the Community Multiscale Air Quality Model (CMAQ) version 5.3.3: performance evaluation and benefits for the user community, Geosci. Model Dev., 17, 7001–7027, https://doi.org/10.5194/gmd-17-7001-2024, 2024.

Feng, X., Li, Q., Zhu, Y., Hou, J., Jin, L., and Wang, J.: Artificial neural networks forecasting of PM_2.5 pollution using air mass trajectory based geographic model and wavelet transformation, Atmospheric Environment, 107, 118–128, https://doi.org/10.1016/j.atmosenv.2015.02.030, 2015.

Flaum, J. B., Rao, S. T., and Zurbenko, I. G.: Moderating the Influence of Meteorological Conditions on Ambient Ozone Concentrations, Journal of the Air & Waste Management Association, 46, 35–46, 1996.

Gentry, B. M., Robinson, A. L., and Adams, P. J.: EASIUR-HR: a model to evaluate exposure inequality caused by ground-level sources of primary fine particulate matter, Environmental Science & Technology, 57, 3817–3824, 2023.

Guo, S., Hu, M., Zamora, M. L., Peng, J., Shang, D., Zheng, J., Du, Z., Wu, Z., Shao, M., and Zeng, L.: Elucidating severe urban haze formation in China, Proceedings of the National Academy of Sciences, 111, 17373–17378, 2014.

Hakami, A., Odman, M. T., and Russell, A. G.: High-Order, Direct Sensitivity Analysis of Multidimensional Air Quality Models, Environmental Science & Technology, 37, 2442–2452, https://doi.org/10.1021/es020677h, 2003.

He, K., Zhang, X., Ren, S., and Sun, J.: Deep Residual Learning for Image Recognition, IEEE, https://doi.org/10.1109/CVPR.2016.90, 2016.

Hu, Y.: Configurations for running 12 km resolution WRF-CMAQ simulations in China (v1.0), Zenodo [code], https://doi.org/10.5281/zenodo.5152621, 2021.

Huang, L., Liu, S., Yang, Z., Xing, J., Zhang, J., Bian, J., Li, S., Sahu, S. K., Wang, S., and Liu, T.-Y.: Exploring deep learning for air pollutant emission estimation, Geosci. Model Dev., 14, 4641–4654, https://doi.org/10.5194/gmd-14-4641-2021, 2021.

Huang, R., Wang, X., Wang, C., Du, Y., Yan, B., Zhang, W., Luo, B., Zhang, W., and Hu, Y.: Future Year Air Quality Attainment Prediction Method Based on Design-Value and Relative Response Factor: A Case Study Focusing on Implementation Planning of the 14th Five-Year Plan in Sichuan Province, Acta Scientiarum Naturalium Universitatis Pekinensis, 58, 553–564, 2022.

Irrgang, C., Boers, N., Sonnewald, M., Barnes, E. A., Kadow, C., Staneva, J., and Saynisch-Wagner, J.: Towards neural Earth system modelling by integrating artificial intelligence in Earth system science, Nature Machine Intelligence, 3, 667–674, https://doi.org/10.1038/s42256-021-00374-3, 2021.

Janhäll, S.: Review on urban vegetation and particle air pollution–Deposition and dispersion, Atmospheric Environment, 105, 130–137, 2015.

Jiang, Z., Cheng, H., Zhang, P., and Kang, T.: Influence of urban morphological parameters on the distribution and diffusion of air pollutants: A case study in China, Journal of Environmental Sciences, 105, 163–172, 2021.

Kelp, M. M., Jacob, D. J., Lin, H., and Sulprizio, M. P.: An online-learned neural network chemical solver for stable long-term global simulations of atmospheric chemistry, Journal of Advances in Modeling Earth Systems, 14, e2021MS002926, https://doi.org/10.1029/2021MS002926, 2022.

Kingma, D. and Ba, J.: Adam: A Method for Stochastic Optimization, Computer Science, https://doi.org/10.48550/arXiv.1412.6980, 2014.

Lang, J.: A Monitoring and Modeling Study to Investigate Regional Transport and Characteristics of PM_2.5 Pollution, Aerosol & Air Quality Research, 13, 943–956, 2013.

Leal, A. M., Kulik, D. A., Smith, W. R., and Saar, M. O.: An overview of computational methods for chemical equilibrium and kinetic calculations for geochemical and reactive transport modeling, Pure and Applied Chemistry, 89, 597–643, 2017.

LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521, 436–444, https://doi.org/10.1038/nature14539, 2015.

Li, J., Dai, Y., Zhu, Y., Tang, X., Wang, S., Xing, J., Zhao, B., Fan, S., Long, S., and Fang, T.: Improvements of response surface modeling with self-adaptive machine learning method for PM_2.5 and O₃ predictions, Journal of Environmental Management, 303, 114210, https://doi.org/10.1016/j.jenvman.2021.114210, 2022.

Li, Z., Guo, J., Ding, A., Liao, H., Liu, J., Sun, Y., Wang, T., Xue, H., Zhang, H., and Zhu, B.: Aerosol and boundary-layer interactions and impact on air quality, National Science Review, 4, 810–833, 2017.

Liao, Q., Zhu, M., Wu, L., Pan, X., Tang, X., and Wang, Z.: Deep Learning for Air Quality Forecasts: a Review, Current Pollution Reports, 6, 399–409, https://doi.org/10.1007/s40726-020-00159-z, 2020.

Liu, X.-H., Zhang, Y., Xing, J., Zhang, Q., Wang, K., Streets, D., Jang, C., Wang, W.-X., and Hao, J.-M.: Understanding of regional air pollution over China using CMAQ, Part II. Process analysis and sensitivity of ozone and particulate matter to precursor emissions, Atmospheric Environment, 44, 3719–3727, https://doi.org/10.1016/j.atmosenv.2010.03.036, 2010.

Liu, Y. and Tang, G.: Contradictory response of ozone and particulate matter concentrations to boundary layer meteorology, Environmental Pollution, 343, 123209, https://doi.org/10.1016/j.envpol.2023.123209, 2024.

Liu, Z.-S., Clusius, P., and Boy, M.: Neural network emulator for atmospheric chemical ODE, Neural Networks, 184, 107106, https://doi.org/10.1016/j.neunet.2024.107106, 2025.

Lv, B., Cai, J., Xu, B., and Bai, Y.: Understanding the Rising Phase of the PM_2.5 Concentration Evolution in Large China Cities, Scientific Reports, 7, 46456, https://doi.org/10.1038/srep46456, 2017.

Lyu, B.: FastCTM(v1.0)_a principle-guided neural network for chemical transport modelling, Zenodo [data set], https://doi.org/10.5281/zenodo.13757211, 2024.

Mao, W., Wang, W., Jiao, L., Zhao, S., and Liu, A.: Modeling air quality prediction using a deep learning approach: Method optimization and evaluation, Sustainable Cities and Society, 65, 102567, https://doi.org/10.1016/j.scs.2020.102567, 2021.

Michalakes, J., Chen, S., Dudhia, J., Hart, L., Klemp, J., Middlecoff, J., and Skamarock, W.: Development of a next-generation regional weather research and forecast model, IEEE International Conference on High Performance Computing, Data, and Analytics, 11/1/2001, https://doi.org/10.1142/9789812799685_0024, 2001.

Michalakes, J., Dudhia, J., Gill, D., Henderson, T., Klemp, J., Skamarock, W., and Wang, W.: The Weather Research and Forecast Model: Software Architecture and Performance, https://doi.org/10.1142/9789812701831_0012, 2005.

Muller, N. Z. and Mendelsohn, R.: Measuring the damages of air pollution in the United States, Journal of Environmental Economics and Management, 54, 1–14, https://doi.org/10.1016/j.jeem.2006.12.002, 2007.

Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., and Prabhat: Deep learning and process understanding for data-driven Earth system science, Nature, 566, 195–204, https://doi.org/10.1038/s41586-019-0912-1, 2019.

Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for Biomedical Image Segmentation, Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Cham, 234–241, 2015.

Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-k., and Woo, W.-c.: Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting, Computer Science, https://doi.org/10.48550/arXiv.1506.04214, 2015.

Shi, X., Gao, Z., Lausen, L., Wang, H., Yeung, D. Y., Wong, W., and Woo, W.: Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model, Neural Information Processing Systems, https://doi.org/10.48550/arXiv.1506.04214, 2017.

Silva, S. J., Heald, C. L., Ravela, S., Mammarella, I., and Munger, J. W.: A deep learning parameterization for ozone dry deposition velocities, Geophysical Research Letters, 46, 983–989, 2019.

Skamarock, W. C., Klemp, J. B., Dudhia, J., Gill, D. O., Barker, D. M., Duda, M. G., Huang, X. Y., Wang, W., and Powers, J. G.: A description of the advanced research WRF version 3, NCAR Tech. Note, https://doi.org/10.13140/RG.2.1.2310.6645, 2008.

Sturm, P. O. and Wexler, A. S.: A mass- and energy-conserving framework for using machine learning to speed computations: a photochemistry example, Geosci. Model Dev., 13, 4435–4442, https://doi.org/10.5194/gmd-13-4435-2020, 2020.

Su, T., Li, Z., Zheng, Y., Luan, Q., and Guo, J.: Abnormally Shallow Boundary Layer Associated With Severe Air Pollution During the COVID-19 Lockdown in China, Geophysical Research Letters, https://doi.org/10.1029/2020GL090041, 2020.

Sun, H., Fung, J. C. H., Chen, Y., Li, Z., Yuan, D., Chen, W., and Lu, X.: Development of an LSTM broadcasting deep-learning framework for regional air pollution forecast improvement, Geosci. Model Dev., 15, 8439–8452, https://doi.org/10.5194/gmd-15-8439-2022, 2022.

Tang, G., Zhang, J., Zhu, X., Song, T., Münkel, C., Hu, B., Schäfer, K., Liu, Z., Zhang, J., Wang, L., Xin, J., Suppan, P., and Wang, Y.: Mixing layer height and its implications for air pollution over Beijing, China, Atmos. Chem. Phys., 16, 2459–2475, https://doi.org/10.5194/acp-16-2459-2016, 2016.

Tessum, C. W., Hill, J. D., and Marshall, J. D.: InMAP: A model for air pollution interventions, PLOS ONE, 12, e0176131, https://doi.org/10.1371/journal.pone.0176131, 2017.

Wang, C., Jia, M., Xia, H., Wu, Y., Wei, T., Shang, X., Yang, C., Xue, X., and Dou, X.: Relationship analysis of PM_2.5 and boundary layer height using an aerosol and turbulence detection lidar, Atmos. Meas. Tech., 12, 3303–3315, https://doi.org/10.5194/amt-12-3303-2019, 2019.

Wang, L., Jang, C., Zhang, Y., Wang, K., Zhang, Q., Streets, D., Fu, J., Lei, Y., Schreifels, J., He, K., Hao, J., Lam, Y.-F., Lin, J., Meskhidze, N., Voorhees, S., Evarts, D., and Phillips, S.: Assessment of air quality benefits from national air pollution control policies in China. Part II: Evaluation of air quality predictions and air quality benefits assessment, Atmospheric Environment, 44, 3449–3457, https://doi.org/10.1016/j.atmosenv.2010.05.058, 2010.

Wang, Y., Gao, Z., Long, M., Wang, J., and Yu, P. S.: PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning, ICML, https://doi.org/10.48550/arXiv.1804.06300, 2018.

Wong, P.-Y., Lee, H.-Y., Chen, Y.-C., Zeng, Y.-T., Chern, Y.-R., Chen, N.-T., Lung, S.-C. C., Su, H.-J., and Wu, C.-D.: Using a land use regression model with machine learning to estimate ground level PM_2.5, Environmental Pollution, 277, 116846, https://doi.org/10.1016/j.envpol.2021.116846, 2021.

Xia, Z., Zhao, C., Du, Q., Yang, Z., Zhang, M., and Qiao, L.: Advancing Sophisticated Photochemistry Simulation in Atmospheric Numerical Models With Artificial Intelligence PhotoChemistry (AIPC) Scheme Using the Feature-Mapping Subspace Self-Attention Algorithm, Journal of Advances in Modeling Earth Systems, 17, e2024MS004646, https://doi.org/10.1029/2024MS004646, 2025

Xing, J., Zheng, S., Li, S., Huang, L., Wang, X., Kelly, J. T., Wang, S., Liu, C., Jang, C., Zhu, Y., Zhang, J., Bian, J., Liu, T.-Y., and Hao, J.: Mimicking atmospheric photochemical modeling with a deep neural network, Atmospheric Research, 265, 105919, https://doi.org/10.1016/j.atmosres.2021.105919, 2022.

Zhang, X., Liu, L., Chen, X., Gao, Y., Xie, S., and Mi, J.: GLC_FCS30: global land-cover product with fine classification system at 30 m using time-series Landsat imagery, Earth Syst. Sci. Data, 13, 2753–2776, https://doi.org/10.5194/essd-13-2753-2021, 2021.

Zhang, Z., Zhang, S., Chen, C., and Yuan, J.: A systematic survey of air quality prediction based on deep learning, Alexandria Engineering Journal, 93, 128–141, https://doi.org/10.1016/j.aej.2024.03.031, 2024.

Zhao, J., Ma, X., Wu, S., and Sha, T.: Dust emission and transport in Northwest China: WRF-Chem simulation and comparisons with multi-sensor observations, Atmospheric Research, 241, 104978, https://doi.org/10.1016/j.atmosres.2020.104978, 2020.

Zhou, Z., Rahman Siddiquee, M. M., Tajbakhsh, N., and Liang, J.: UNet $+ +$ : A Nested U-Net Architecture for Medical Image Segmentation, Cham, 3–11, https://doi.org/10.1007/978-3-030-00889-5_1, 2018.

Articles

Short summary

FastCTM is a neural network model to simulate key criteria air pollution levels, offering an efficient alternative to traditional chemical transport models. Its structure is informed by the physical and chemical principles of the atmosphere, allowing it to learn and replicate complex atmospheric processes. FastCTM demonstrated matching accuracy to traditional models with less computational demand. It also provides analysis of how different atmospheric processes contribute to air quality changes.