Articles | Volume 19, issue 12
https://doi.org/10.5194/gmd-19-5491-2026
https://doi.org/10.5194/gmd-19-5491-2026
Model evaluation paper
 | 
25 Jun 2026
Model evaluation paper |  | 25 Jun 2026

Benchmarking ozone stress parameterizations in CLM5: a global mechanistic assessment of thresholds and memory effects

Peng Zhou, Jieming Chou, Li Dan, Jean-François Lamarque, Muhammad Bilal, Fang Li, Mengting Sun, Rebecca Buccholz, Desneiges Murray, Zhaoxiang Cao, Jing Peng, Kai Li, Fuqiang Yang, Wei Pan, Jinyan Chen, and Liwen Xing
Abstract

Tropospheric ozone remains a critical but uncertain driver of terrestrial productivity loss, and land surface models (LSMs) diverge markedly in how they represent vegetation ozone stress. We conduct a global, mechanistically consistent evaluation of three prominent ozone stress parameterization schemes, Sitch, Lombardozzi, and Li, within the Community Land Model version 5 (CLM5). Using unified meteorological and ozone forcing from CAM-chem and GSWP3.1, we designed five experiments to isolate the roles of ozone flux threshold selection and response function form. The mixed experiments using thresholds and response functions derived from the Sitch and Lombardozzi schemes were implemented without additional recalibration, allowing structural sensitivities to be evaluated consistently within the Li framework. Model output is benchmarked against MODIS and FLUXNET gross primary production (GPP) across spatial gradients, biomes, and among plant functional types (PFTs). All parameterizations capture the ozone–induced reduction in GPP relative to the ozone-free baseline, but their accuracy varies widely. The Li scheme, featuring PFT-specific thresholds and separate nonlinear responses for photosynthesis and stomatal conductance, best agrees with observed GPP patterns across scales. In contrast, the Lombardozzi scheme produces much larger reductions in high-flux regions. Analysis reveals that the structures of ozone response functions and memory-decay mechanisms primarily determine improvements in GPP simulation. Our results support a shift toward ozone parameterizations that couple stomatal flux with canopy phenology, dynamic water constraints, and regionally calibrated thresholds. These findings provide a transferable framework for quantifying ozone–carbon coupling in LSMs and highlight priorities for improving terrestrial biosphere models under atmospheric change.

Share
1 Introduction

Tropospheric ozone (O3), a typical secondary pollutant primarily produced by photochemical reactions of anthropogenic precursors under solar radiation, has been shown to threaten human health, crop yields, and ecosystem functioning as near-surface concentrations increase (Agathokleous et al., 2020; Turnock et al., 2020; Feng et al., 2022; Achebak et al., 2024). Ecological and agronomic studies indicate that, compared with counting exceedances based on concentration thresholds such as the accumulated exposure over a threshold of 40 ppb (AOT40), whether vegetation is truly damaged depends more on the actual uptake of ozone through stomata and its accumulation over time (Emberson et al., 2018; Paoletti et al., 2022; Pleijel et al., 2022). This insight has driven a shift from concentration-based exposure metrics to cumulative flux dose and calls for process representations that more closely capture flux controls and temporal memory at leaf and canopy scales (Mills et al., 2011; Montes et al., 2021; Moura et al., 2023). Consequently, metrics and parameterization schemes grounded in stomatal ozone flux are essential for quantifying ozone–vegetation interactions across space and time. They provide the methodological foundation for integrated carbon flux assessment of ozone flux thresholds, cumulative dose, and response functions within the ecosystem for land surface models (Karlsson et al., 2007; Mills et al., 2011). In land surface models, the scientific rationale for adopting ozone-stress parameterizations is that ozone injury is not directly determined by ambient concentration alone, but by the amount of ozone entering leaves through stomata, the capacity of plants to detoxify or tolerate that flux, and the subsequent physiological downregulation of photosynthesis and stomatal conductance (Sitch et al., 2007). Without such process-based parameterizations, models either neglect an important atmospheric stressor or treat ozone exposure in a manner disconnected from canopy physiology, water limitation, phenology, and differences in plant functional type (Fuhrer, 1996). Therefore, ozone-stress parameterizations provide the necessary mechanistic bridge between atmospheric ozone forcing and terrestrial carbon-cycle responses, allowing land surface models to diagnose when ozone damage is triggered, how it accumulates over time, and how strongly it suppresses GPP across different ecosystems.

Sitch et al. (2007) first introduced a flux-based exposure assessment in the Met Office Surface Exchange Scheme Top-down Representation of Interactive Foliage and Flora Including Dynamics (MOSES-TRIFFID) model, with the predominant form being PODY (phytotoxic O3 dose over a flux threshold of Y nmol O3 m−2 s−1), which represents the time integral of ozone flux above a threshold and in the original implementation applied a single linear attenuation factor to scale both photosynthesis and stomatal conductance. Lombardozzi et al. (2013) treated cumulative uptake of O3 (CUO) as a state variable in the Community Land Model version 5 (CLM5), first solving for an ozone-free state and then applying plant functional type-specific linear scalings to photosynthesis and stomatal conductance, while distinguishing sunlit and shaded leaves and introducing growing-season gating, leaf turnover, and longevity-related decay to limit unrealistic long-term accumulation. Li et al. (2024), also within CLM5, continued to base the parameterization scheme on above-threshold flux accumulation but selected an optimal ozone flux threshold in a data-driven manner and, from multiple candidate forms, identified the best response functions for photosynthesis and stomatal conductance for each plant functional type. These developments enable ozone stress to be embedded in ecosystem and land surface models in a flux-based manner, thereby creating a coherent linkage with assessments of the carbon and water cycles.

However, we still lack a systematic, comprehensive assessment of the performance of the Sitch, Lombardozzi, and Li ozone-stress parameterizations, and the relative strengths, weaknesses, and domains of applicability of these parameterizations remain unclear. Existing studies often evaluate under different model settings, ozone flux thresholds, and response function selections, and varying driving and validation data conditions, or test only a single (or two) parameterization scheme for ozone stress (Gong et al., 2020; Cao et al., 2024). The impact of ozone flux threshold setting, cumulative dose characterization, and response function morphology of the three parameterization schemes on carbon flux has not yet been quantified within a unified mechanistic framework. Therefore, it is difficult to objectively assess the superiority and applicability of these parameterization schemes across models with different spatiotemporal distributions, biomes, or vegetation types. Benefiting from the large-scale, assimilation-consistent Moderate Resolution Imaging Spectroradiometer (MODIS) gross primary production (GPP) (Running et al., 2004; Ma et al., 2024), the process-based and high-temporal-resolution FLUXNET site fluxes (Baldocchi et al., 2001; Pastorello et al., 2020), and spatiotemporally consistent ozone fields from chemical transport models (Lamarque et al., 2012), we are now able to evaluate, within a single mechanistic framework, a no-ozone baseline together with established and modified parameterization schemes across scales, regions, and vegetation types.

Building on these advances and the above modeling needs, we here address the research gap by asking: (1) how do the Sitch et al. (2007), Lombardozzi et al. (2015), and Li et al. (2024) ozone-stress schemes differ in their impacts on GPP when evaluated under consistent conditions? (2) To what extent do differences in ozone flux threshold choices versus response function forms contribute to the variability in simulated ozone damage? (3) Which scheme best reproduces observed GPP patterns across regions and plant functional types? To answer these questions, we modify and compare multiple ozone-stress parameterization schemes in CLM5 around three core elements: ozone flux thresholds, cumulative dose, and response functions. Using simulated ozone and meteorological reanalysis as forcings, and coupling them with the land model's native resistance network and canopy stratification, we comprehensively evaluate how alternative ozone flux threshold settings and response functional forms scale photosynthesis and stomatal conductance. In parallel, MODIS GPP and FLUXNET flux observations serve as constraints and independent validation to test robustness across plant functional types and regions at multiple scales. By juxtaposing a no-ozone baseline, representative existing parameterization schemes, and modified parameterization schemes, we aim to clarify the dominant contributions of ozone flux threshold selection and response-function uncertainty to the outcomes, thereby providing diagnostic, traceable modeling baselines and data evidence for studies that couple ozone with the carbon and water cycles. Notably, the mixed-parameterization experiments using thresholds and response functions derived from the Sitch and Lombardozzi schemes were not recalibrated within the Li framework and should therefore be interpreted primarily as structural-sensitivity experiments.

2 Materials and methods

2.1 Model platform and experimental setup

We evaluate the impact of ozone stress on vegetation productivity within the offline framework of the Community Terrestrial Systems Model version 5.3.075 (CTSM 5.3.075) (Lawrence et al., 2019). The land surface model uses the CLM5 photosynthesis and stomatal coupling scheme (Farquhar–Collatz photosynthesis and Medlyn stomatal conductance) and, on this basis, incorporates alternative ozone stress parameterizations to conduct comparative experiments (see Sect. 2.2) (Farquhar et al., 1980; Collatz et al., 1991, 1992; Medlyn et al., 2011). The model uses the I2000Clm50Sp compset (prescribed vegetation with the biogeochemical module disabled), so ozone acts solely as an external stressor on land-surface physiological processes and does not interact with the atmosphere through coupled feedbacks.

The numerical experiments use a global latitude–longitude grid (1.895° latitude by 2.5° longitude) with a 30 min time step. The simulation spans 30 years by cycling the atmospheric forcing and surface ozone fields from 2005 to 2014 three times. The first 10 years are discarded as spin-up, and only steady-state statistics from the final 20 years are analyzed. The control and sensitivity experiments include a no-ozone baseline (I2000), the Li parameterization, the Li with Lombardozzi thresholds and function, the Li with Sitch thresholds and function, and the Lombardozzi parameterization (Table 1). All other physical and physiological parameters are held constant to ensure that any differences are attributable solely to the ozone process.

Table 1Summary of model experiments used in this study.

Download Print Version | Download XLSX

The atmospheric forcing is taken from the Global Soil Wetness Project (GSWP3.1), which provides three-hourly reanalysis fields at 0.5°×0.5° resolution for 2005–2014, including near-surface air temperature, specific humidity, wind speed, surface pressure, incoming longwave and shortwave radiation, sunshine duration, and precipitation (Dirmeyer, 2011). Vegetation structure follows prescribed contemporary distributions and parameters, such as leaf area index (LAI) and canopy height, derived from MODIS and held constant across years. All forcing and initial datasets, including the year-2000 atmospheric CO2 concentration and the nitrogen and aerosol deposition fields, are taken from the standard datasets provided with CTSM 5.3.075.

Ozone forcing was obtained from our CAM-chem version 6 simulations of hourly near-surface ozone concentrations for 2005–2014 at 0.9°×1.25° horizontal resolution with 32 vertical levels (Tilmes et al., 2019; Emmons et al., 2020). Further details of the CAM-chem simulation configuration and experimental design are described in another of our studies (Murray et al., 2026). CAM-chem was run in specified-dynamics mode and employed the Model for Ozone and Related Chemical Tracers troposphere and stratosphere mechanism version 1 (MOZART-TS1) (Granier et al., 2019). Anthropogenic, biogenic, and biomass-burning emissions were taken from the Community Emissions Data System (CEDS), the Model of Emissions of Gases and Aerosols from Nature (MEGAN), and a standard fire emissions inventory (QFED), respectively (Wiedinmyer et al., 2011; Guenther et al., 2012; Darmenov and da Silva, 2015). Air temperature (T) together with the zonal (u) and meridional (v) wind components are taken from the Modern-Era Retrospective analysis for Research and Applications, Version 2 (MERRA-2) reanalysis (Gelaro et al., 2017).

To provide an independent comparison with the model results, we use the monthly MODIS GPP (MOD17A2) product at 0.05° resolution to assess consistency with our simulated results for the same period (Zhao et al., 2005). At the same time, we incorporate site-level FLUXNET flux observations (Pastorello et al., 2020), apply quality control and gap-filling to the site monthly time series, and pair the observations with the model outputs station by station and month by month, using only samples that are fully collocated in space and time as the observational benchmark for the evaluation.

2.2 Calculation of ozone flux and cumulative dose

2.2.1 Calculation principle of Sitch

Sitch et al. (2007) described ozone suppression of instantaneous net photosynthesis in MOSES-TRIFFID using the concept of accumulated ozone flux above a threshold (PODY, mmol m−2). They first compute the instantaneous stomatal ozone flux FO3 (nmol m−2 s−1), then integrate the above-threshold portion over time to obtain PODY, from which a photosynthetic inhibition factor is defined. At each time step, the instantaneous ozone flux FO3,t is calculated (in a form analogous to Ohm's law) and can be written as:

(1) F O 3 , t = c O 3 , t r b + r a + 1 1.67 g s ,

where cO3,t is the molar concentration of ozone (nmol m−3), t denotes the model time step. rb+ra is the sum of the quasi-laminar boundary-layer resistance and the aerodynamic resistance over the leaf surface (s m−1), gs is the leaf conductance to water vapor (m s−1), and 1.67 is the ratio of the diffusion resistance of ozone to that of water vapor.

For each time step t, the ozone flux above the threshold is defined:

(2) F O 3 , t > max F O 3 , t - Y , 0 ,

where Y is the vegetation-type-dependent critical ozone flux threshold.

Within a given accumulation time window, the ozone flux above the threshold in Eq. (2) is accumulated over time to obtain the total above-threshold dose:

(3) POD Y = t = 1 T F O 3 , t > Δ t ,

where Δt is the length of the time step (s).

(4) f = 1 - a POD Y ,

where a is a vegetation-type-specific sensitivity parameter. Net photosynthesis is modified to A=A0(1-aPODY), where A0 is the photosynthetic rate in the absence of O3. Similarly, stomatal conductance is modified to gs=gs0(1-aPODY), where gs0 is the stomatal conductance in the absence of O3.

Because gs is linearly related to the photosynthetic rate, combining Eqs. (1) to (3) with the conductance relationship yields a quadratic equation in gs, which can be solved analytically.

2.2.2 Calculation principle of Lombardozzi

Lombardozzi et al. (2015) accumulated the stomatal ozone flux exceeding the threshold over time to obtain the cumulative uptake (CUO, mmol m−2), and used a linear response factor to directly scale net photosynthesis and leaf stomatal conductance in the absence of O3. For a given flux threshold Y specific to the plant functional type (PFT) and a time step Δt, within a prescribed accumulation window, we define:

(5) CUO = t = 1 T max F O 3 , t - Y , 0 Δ t ,

(κ is the conversion factor from nmol to mmol.)

Subsequently, PFT-specific linear response factors are used to scale ozone-free photosynthesis and conductance:

(6)A=A0(1-bCUO),(7)gs=gs0(1-cCUO),

where b and c are regression coefficients specific to PFT.

2.2.3 Calculation principle of Li

Building on the framework mentioned above, Li et al. (2024) represented ozone stress as the cumulative ozone flux above a threshold over time and accounted for damage decay due to leaf turnover. The cumulative dose PODY is computed recursively at each time step t as:

(8) POD Y , t = 1 - δ t POD Y , t - 1 + F O 3 , t > Δ t ,

where PODY,t-1, and PODY,t denote the cumulative above-threshold dose at time steps t−1 and t, respectively.

δt is the fraction of decay caused by leaf turnover or abscission and is calculated as follows:

(9) δ t = 1 - exp - Δ LAI t L leaf

where Lleaf is the leaf lifetime (years), and LAIt−1 and LAIt are the leaf area index at time steps t−1 and t, respectively.

Table 2Ozone flux thresholds Y for different PFTs in the three ozone stress parameterizations.

Download Print Version | Download XLSX

After obtaining the growing season cumulative dose PODY, dimensionless response factors for net photosynthesis and stomatal conductance are constructed as:

(10)fA=hAPODY,(11)fg=hgPODY,

where hA and hg are inhibition functions for photosynthesis and conductance (with values between 0 and 1).

The three approaches differ fundamentally in how thresholds are prescribed and how response functions are formulated (Table 2). Sitch uses ozone flux above the threshold, expressed as PODY, as the core quantity. In this parameterization scheme, the ozone flux thresholds are prescribed by PFT, and a single linear inhibition factor is applied to scale photosynthesis and stomatal conductance simultaneously (both the ozone flux thresholds and the response functions involve assumptions). Lombardozzi instead integrates over time the part of the flux that exceeds 0.8 nmol m−2 s−1 to obtain CUO, uses a fixed threshold for all PFTs, and applies separate linear scalings of photosynthesis and conductance based on regression coefficients fitted for each PFT. Li adopts a data-driven approach to selecting the optimal threshold Y and uses PODY as the cumulative quantity. For the response functions, this approach no longer restricts the form to linearity; instead, it selects, for each PFT, the best form among several candidates (linear, exponential, logarithmic, etc.) based on statistical criteria, and then applies separate multiplicative scalings to photosynthesis and conductance.

Compared with Sitch, Lombardozzi first represents ozone damage as a continuously updated state variable by computing ozone-free photosynthesis and stomatal conductance within the model, and then applies scaling based on this. Second, the effects of ozone on photosynthesis and stomatal conductance are treated in a decoupled manner, with separate ozone factors applied to each process, thereby allowing the asynchronous responses of photosynthesis and conductance to ozone commonly observed in the field (Wittig et al., 2007; Lombardozzi et al., 2013). Third, at the implementation level, the parameterization is refined to canopy radiation layers, with ozone uptake calculated and accumulated separately for sunlit and shaded leaves. Finally, growing-season gating and leaf-turnover effects are introduced, so that damage accumulates only during the growing season (diagnosed using a leaf area index threshold), and a healing offset is applied to newly emerged leaves, while evergreen leaves receive a decay term based on their lifetime.

Compared with Lombardozzi, Li retains the two-stage structure of flux accumulation and physiological scaling but provides greater detail for each component. Second, the flux calculation still relies on the native resistance chain and meteorological forcing in the land surface model, updates the ratio of diffusion resistances between ozone and water vapor, and uses the internal model time-scale parameters consistently in both time integration and the conversion based on leaf lifetime. Finally, for the physiological representation of the cumulative dose, the parameterization scheme continues to accumulate dose separately for sunlit and shaded leaves, accumulates dose only during daytime when vegetation effectively absorbs solar radiation, and sets leaf area index thresholds by plant functional type, with long-term memory controlled through two mechanisms, namely lifetime decay of leaves in evergreen vegetation and phenological replacement of leaves in deciduous vegetation.

2.3 Model evaluation

To quantitatively evaluate how well each ozone stress parameterization reproduces the observations, we use complementary metrics including correlation, agreement in magnitude, and error amplitude. We perform statistical analysis at different temporal aggregation levels and across different PFTs. All statistics are based on strictly paired samples, using only records for which observations and simulations are available at the same site and time step. The units of the metrics follow those of the variable being evaluated.

  1. Root mean square error (RMSE): it measures the overall magnitude error between simulations and observations, is non-negative, and smaller values indicate better performance (Zhou et al., 2025a):

    (12) RMSE = 1 N i = 1 N P i - O i 2 ,

    where Pi and Oi are the simulated and observed values of the ith paired sample, respectively, and N is the number of paired samples.

  2. Relative bias: it is used to describe the bias between simulations and observations or between simulations and the no-ozone-stress case (Lamarque et al., 2013):

    (13) B % = P - O O × 100 % ,

    where P and O are the sample means of prediction and observation.

  3. Pearson correlation coefficient (r): it measures the temporal phase consistency between simulations and observations, takes values in [−1, 1], and values closer to 1 indicate stronger correlation (Zhou et al., 2025b):

    (14) r = cov ( P , O ) σ P σ O ,

    where σP and σO are the sample standard deviations, and cov(P, O) is the covariance between the simulated values P and the observed values O.

  4. Normalized standard deviation (NSD): It is used to characterize the ability to reproduce the variability (amplitude) and is defined as the ratio of the standard deviation of simulations to that of observations (Taylor, 2001):

    (15) NSD = σ P σ O .

    NSD = 1 indicates that the simulation variance is consistent with the observed variance, and NSD < 1 indicates attenuated variance.

  5. Nash–Sutcliffe efficiency (NSE): it measures the degree of improvement of the simulations relative to using the observational mean as a naive model. It has a theoretical range from −∞ to 1, and values closer to 1 indicate better performance (Nash and Sutcliffe, 1970):

    (16) NSE = 1 - i = 1 N P i - O i 2 i = 1 N O i - O 2 ,

    when NSE = 0, the performance is equivalent to predicting a constant value O; and NSE < 0 indicates performance worse than this naive benchmark.

3 Results

3.1 Spatial evaluation and verification of ozone stress parameterization schemes

To quantitatively compare the impacts of different ozone stress parameterizations on land surface productivity, Fig. 1 presents the spatial distribution of the ten-year mean GPP and its zonal mean from 2005 to 2014. This includes the no ozone case (Fig. 1a), the Li parameterization (Fig. 1b), the Li parameterization combined with the threshold and response function sets of Lombardozzi and Sitch (Fig. 1c and d), the Lombardozzi parameterization (Fig. 1e), and the GPP from the MODIS satellite product (Fig. 1f). Compared with the MODIS observations (Fig. 1f), the no ozone case (Fig. 1a) shows pronounced overestimation of GPP in tropical regions, particularly in humid low-latitude ecosystems, indicating that neglecting ozone stress can substantially exaggerate tropical productivity. Compared with Fig. 1a, all parameterizations that include ozone show a systematic suppression of GPP and exhibit pronounced heterogeneity, with particularly low GPP values in humid tropical regions. The zonal-mean curves show that the peak values at the equator in the ozone-parameterized cases are clearly lower than in the no-ozone case, indicating that tropical regions are more sensitive to ozone stress. Among the different parameterizations, the Li parameterization scheme and its combinations with the Lombardozzi and Sitch thresholds and functions (Fig. 1b–d) are closer to MODIS (Fig. 1f) in terms of zonal gradients and tropical contrast, whereas the Lombardozzi parameterization scheme (Fig. 1e) produces lower GPP in the tropics, which enlarges the deviations from observations in these regions.

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f01

Figure 1Decadal-mean (2005–2014) gross primary production (GPP) maps and zonal-mean profiles for (a) I2000 (no ozone stress), (b) Li, (c) Li with Lombardozzi thresholds and function, (d) Li with Sitch thresholds and function, (e) Lombardozzi, and (f) MODIS.

To test the ability of each parameterization scheme to reproduce the observations, Fig. 2 shows the spatial distribution of station-based RMSE of monthly GPP relative to FLUXNET and the corresponding zonal RMSE aggregated in 5° latitude bands. The figure shows that, across all five simulations and the MODIS product (Fig. 2a–f), RMSE is relatively low at high northern latitudes, whereas in the mid-latitudes the complexity of vegetation types leads to bands of abnormally high RMSE values. The high-resolution MODIS product has fewer extremely high values, indicating biases due to insufficient spatial resolution. Compared with FLUXNET, the no-ozone case (Fig. 2a) exhibits relatively large RMSE values in central African regions, reflecting pronounced overestimation under humid tropical conditions. However, at some stations in the middle and high latitudes, the no ozone case still shows lower RMSE values than the ozone-stress parameterization schemes. It is noteworthy that, compared with Fig. 2a (no ozone case), the Li parameterization and its combinations with the Lombardozzi and Sitch thresholds and functions (Fig. 2b–d) show some degree of improvement at different latitudes, indicating that optimal threshold settings for different PFTs (as in Li's scheme) reduce errors compared to using a single fixed threshold. The Lombardozzi parameterization (Fig. 2e) retains relatively large errors in some low-latitude bands. The zonal curves show that, compared with the no-ozone case, the overall variation in the ozone parameterizations is not very pronounced, but the inclusion of ozone stress yields a considerable reduction in errors in the tropics. The RMSE results for MODIS (Fig. 2f) provide some support for the reliability of the five simulations, as MODIS and FLUXNET show broadly similar RMSE patterns.

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f02

Figure 2Spatial patterns and zonal-mean RMSE of site-level monthly GPP relative to FLUXNET for 2005–2014 for (a) I2000 (no ozone stress), (b) Li, (c) Li with Lombardozzi thresholds and function, (d) Li with Sitch thresholds and function, (e) Lombardozzi, and (f) MODIS.

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f03

Figure 3Zonal-mean profiles of GPP and zonal-mean bias relative to FLUXNET for 2005–2014. (a) Zonal-mean GPP from FLUXNET, I2000 (no ozone stress), and MODIS; (b) corresponding zonal-mean biases; (c) zonal-mean GPP from Li, Li with Lombardozzi thresholds and function, and Li with Sitch thresholds and function; (d) corresponding zonal-mean biases; (e) zonal-mean GPP from Li with Lombardozzi thresholds and function and from Lombardozzi; (f) corresponding zonal-mean biases.

Download

To characterize the consistency between the model and the observations along the zonal gradient, Fig. 3 shows zonal profiles of GPP aggregated in 5° latitude bands (left panels) and their zonal biases relative to FLUXNET (right panels). Figure 3a and 3 provides the baseline comparison: the no-ozone case generally underestimates GPP in the midlatitudes of the Northern Hemisphere (around 20–50° N) and also shows substantial underestimation in the midlatitudes of the Southern Hemisphere, but exhibits pronounced overestimation in the tropics. MODIS as a whole tends to underestimate GPP (relative to FLUXNET), particularly at low latitudes and in the Southern Hemisphere midlatitudes. Only a few latitude bands show slight overestimation by MODIS, which is consistent with the findings of Wild et al. (2022). The bias statistics further support these zonal characteristics (Table 3). The no-ozone case shows an overall annual mean bias of 284 g C m−2 yr−1 relative to FLUXNET. Among the ozone-stress simulations, the Li parameterization scheme produces the smallest annual mean bias (420 g C m−2 yr−1), followed by the Li framework combined with Lombardozzi and Sitch thresholds and functions, with biases of 425 and 428 g C m−2 yr−1, respectively, whereas the Lombardozzi parameterization scheme exhibits the largest bias (566 g C m−2 yr−1). However, MODIS shows the smallest annual mean bias (229 g C m−2 yr−1) relative to FLUXNET. Figure 3c and d shows the improvements following ozone stress. The Li parameterization scheme, when combined with the thresholds and functions of Lombardozzi and Sitch, reproduces the zonal gradient of FLUXNET more accurately, particularly by substantially reducing the tropical overestimation in the no-ozone case, so that the bias curves are closer to the zero line. Comparison of the Lombardozzi parameterization scheme and the Li with Lombardozzi thresholds and function scheme in Fig. 3e and f shows that the former (Lombardozzi) displays a more pronounced positive bias in several low-latitude and midlatitude bands, whereas the combination of Lombardozzi's parameters within the Li framework can weaken this overestimation, approaching the FLUXNET benchmark. This suggests that some of the Lombardozzi scheme's bias is attributable to its fixed-threshold and linear-response assumptions, which the Li framework can partially mitigate.

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f04

Figure 4Mean bias in monthly GPP relative to FLUXNET across biomes for 2005–2014.

Download

Table 3Annual mean GPP bias relative to FLUXNET across different ozone parameterization schemes.

Download Print Version | Download XLSX

The monthly mean GPP bias aggregated over eleven biomes shows a pronounced dependence on biome type (Fig. 4). In flooded grasslands and savannas, montane grasslands and shrublands, tropical moist broadleaf forests, and some other biomes, the bias is positive and relatively large, indicating that most parameterization schemes underestimate GPP in these biomes (since model GPP < observed, bias = model  obs > 0). The positive bias of the Lombardozzi parameterization scheme is the most pronounced in several of these biomes. The no-ozone case generally exhibits smaller biases than ozone-stress parameterization schemes across several temperate and boreal biomes, suggesting that introducing ozone stress alone does not necessarily improve model performance and that additional physiological or environmental constraints may still be required to better reproduce observations. In contrast, the magnitude of the bias is smaller in tundra, temperate grasslands/savannas/shrublands, and boreal forests (taiga), and the Li parameterization scheme and its threshold/function variants (Li + Sitch and Li + Lombardozzi) are overall closer to zero bias. In deserts and xeric shrublands, the biases are near zero. In tropical and subtropical moist broadleaf forests, the no ozone case shows a slight negative bias (model > obs in that biome, indicating overestimation), which is reduced after ozone stress is introduced (the ozone schemes dampen the model's overestimation).

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f05

Figure 5Monthly GPP time series for 2005–2014 in five latitude bands (60–90, 30–60N, 0–30° N, 0–30, and 30–60° S), comparing FLUXNET, the no-ozone experiment, ozone-stress parameterization schemes, and MODIS.

Download

3.2 Temporal evaluation and verification of ozone stress parameterization schemes

The monthly GPP time series aggregated by latitude band show that the model generally reproduces the seasonal phase, but there are systematic differences in amplitude among the parameterization schemes (Fig. 5). At high northern latitudes (60–90° N), all schemes show peaks aligned with FLUXNET. MODIS, no ozone case, and the Li-family parameterization schemes (Li; Li with Lombardozzi thresholds/functions; Li with Sitch thresholds/functions) have amplitudes closest to the observations, while Lombardozzi shows a more pronounced underestimation of the summer peak. The no ozone case generally produces larger seasonal amplitudes than the ozone-stress parameterization schemes and, in several latitude bands, remains closer to FLUXNET during peak growing seasons, suggesting that introducing ozone stress alone cannot fully resolve the discrepancies between simulations and observations. In the midlatitudes of the Northern Hemisphere (30–60° N), all schemes systematically underestimate FLUXNET, and the degree of underestimation increases progressively from MODIS to the Li family of ozone stress schemes to Lombardozzi. This highlights differences arising from limitations in the model's intrinsic simulation skill and spatial resolution (a similar systematic underestimation is also found by Wild et al., 2022, for coarse-scale models). In the tropics and subtropics (0–30° N), seasonality is weaker and interannual variability is more pronounced. Except for the no-ozone case (which overestimates in some months), the other parameterization schemes all show some degree of underestimation, though the magnitude is smaller than in the northern midlatitudes. The low latitudes of the Southern Hemisphere (0–30° S) show a similar pattern, but the simulations agree better with the observations (the time-series curves overlap FLUXNET almost entirely for some schemes). At midlatitudes in the Southern Hemisphere (30–60° S), there is persistent underestimation during several growing seasons. It is noteworthy that, although these time series exhibit systematic biases, the results for the southern midlatitudes indicate that the Li parameterization scheme provides the best simulation (its curve is closest to the FLUXNET curve).

The monthly GPP time series aggregated by plant functional type (PFT) show that all parameterization schemes can generally reproduce the seasonal phase, but there are consistent and type-dependent systematic differences in amplitude (Fig. 6). For broadleaf trees and needleleaf trees (Fig. 6a and b), all five simulations and MODIS underestimate FLUXNET as a whole, with the underestimation of the growing season peak being most pronounced. In these forest PFTs, the no ozone case generally produces larger seasonal amplitudes and remains closer to FLUXNET during peak growing periods than most ozone-stress parameterization schemes. For shrubs (Fig. 6c), seasonality is weaker and interannual variability is larger; in the no-ozone case, schemes tend to overestimate, whereas the introduction of ozone stress leads to convergence among the schemes, most evident in the Li parameterization scheme (Li matches the observed shrub GPP level best). Grasslands and crops (Fig. 6d and e) show strong seasonal oscillations and are overall underestimated relative to the observations, but they generally perform better (closer to observations) than the broadleaf and needleleaf trees. As in Fig. 5, unavoidable systematic biases pose challenges for evaluation, but notably, the time series for shrubs again indicate that the Li parameterization scheme is optimal (it most effectively reduced the shrub GPP overestimation present in the no-ozone case).

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f06

Figure 6Monthly GPP time series for 2005–2014 by plant functional type (broadleaf trees, needleleaf trees, shrubs, grasses, and crops), comparing FLUXNET, the I2000 no-ozone experiment, ozone-stress parameterization schemes, and MODIS.

Download

3.3 Overall evaluation and verification of ozone stress parameterization schemes

The seasonal reproduction of monthly GPP by each parameterization scheme and product is at a moderate level, with r mostly in the range from 0.4 to 0.6, and the normalized standard deviation at most sites being less than 1 (Fig. 7). This indicates that the model and MODIS both show varying degrees of variance damping relative to FLUXNET (both underestimate variability). Among the ozone parameterization schemes, the Li scheme has correlation and normalized standard deviation values closer to FLUXNET than those of the others. The no-ozone case and the Li parameterization scheme yield higher correlations for most vegetation types, and the Li scheme is closest to the observed amplitude for shrubs, consistent with Fig. 6, which shows that the Li scheme significantly reduced shrub biases. Compared with ozone-stress parameterization schemes, the no-ozone case generally exhibits higher normalized standard deviations across several PFTs, indicating stronger preservation of seasonal variability. However, its correlations are not consistently the highest, and its performance for shrubs and crops is poorer than that of some ozone-stress parameterization schemes. The Li + Sitch, Li + Lombardozzi, and Lombardozzi parameterization schemes exhibit varying degrees of discrepancy, and their correlations and normalized standard deviations are, overall, smaller (worse) than those of the Li scheme. MODIS shows both lower correlation and lower normalized standard deviation for most types (partly reflecting that MODIS GPP is a remote-sensing estimate with its own uncertainties).

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f07

Figure 7Correlation (r) and normalized standard deviation (NSD) of monthly GPP (2005–2014) relative to FLUXNET for all sites combined and for five plant functional types, comparing the I2000 no-ozone experiment, four ozone-stress parameterization schemes (Li, Li with Lombardozzi thresholds and function, Li with Sitch thresholds and function, and Lombardozzi), and MODIS.

Download

Seasonal Nash–Sutcliffe efficiency (NSE) shows that MODIS consistently gives the highest agreement with FLUXNET, with excellent performance in winter and autumn (Fig. 8). Among the model experiments, the no ozone case generally maintains the highest NSE values in spring, summer, and autumn, indicating that the baseline simulation without ozone stress still reproduces seasonal variability better than most ozone-stress parameterization schemes at the large scale. For the ozone stress parameterization schemes, summer (JJA) is the weakest season for all schemes: no ozone case reaches only 0.10 and all schemes turn negative in NSE after ozone stress is introduced, indicating a deterioration in the fit to FLUXNET during peak growth months. In spring (MAM), the no-ozone case is NSE  0.26, while the ozone parameterization schemes range from 0.13 to 0.24 and are slightly lower overall. In autumn (SON), the no-ozone case is 0.21, while the ozone schemes range from 0.02 to 0.12 and remain lower. In winter (DJF), all model schemes perform well (NSE near 0.5–0.7), and the differences between the no-ozone case and the Li parameterization series are very small.

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f08

Figure 8Seasonal Nash–Sutcliffe efficiency (NSE) of monthly GPP relative to FLUXNET (2005–2014) for the I2000 no-ozone experiment, four ozone-stress parameterization schemes (Li, Li with Lombardozzi thresholds and function, Li with Sitch thresholds and function, and Lombardozzi), and MODIS. Panels (a)(d) correspond to spring (MAM), summer (JJA), autumn (SON), and winter (DJF), respectively.

Download

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f09

Figure 9Root mean square error (RMSE) of GPP relative to FLUXNET (2005–2014), evaluated from monthly data at three temporal aggregation scales (annual, seasonal, and monthly) for the I2000 no-ozone experiment, four ozone-stress parameterization schemes (Li, Li with Lombardozzi thresholds and function, Li with Sitch thresholds and function, and Lombardozzi), and MODIS.

Download

From the annual, seasonal, and monthly levels, RMSE increases markedly as the aggregation scale becomes finer (annual < seasonal < monthly), and importantly, the ranking of the parameterization schemes remains consistent across the different levels (Fig. 9). This consistency is similar to the interannual-scale RMSE results in the study of Wild et al. (2022). The no-ozone case consistently shows lower RMSE values than all ozone-stress parameterization schemes at the annual, seasonal, and monthly scales, suggesting that additional sources of model uncertainty beyond ozone stress may still contribute to discrepancies relative to FLUXNET. Among the model parameterization schemes, the Li scheme has the smallest error, followed by Li + Lombardozzi, Li + Sitch, and Lombardozzi; the Lombardozzi scheme has the largest error at each time-scale. MODIS yields the lowest RMSE at all three levels and is closest to FLUXNET among all model schemes. Overall, after ozone stress is introduced, the RMSE of the parameterization schemes does not decrease uniformly relative to the no-ozone experiment (in some cases, ozone stress introduces model bias, with persistent underestimation). Comparing the ozone stress parameterization schemes, the Li scheme shows the lowest RMSE at the annual, seasonal, and monthly levels.

The ten-year mean global total GPP shows that the no-ozone experiment produces a higher global total GPP, and that the inclusion of ozone stress reduces this total by about 10 % to 24 % across the schemes (Fig. 10). The no ozone case reaches 132.00 Pg C yr−1, substantially exceeding both MODIS and FLUXCOM estimates, indicating that neglecting ozone stress likely leads to an overestimation of global terrestrial productivity. Among the four schemes, the Li family of parameterization schemes (Li, Li + Lombardozzi, Li + Sitch) is closer to MODIS (112.51 Pg C yr−1). Additionally, the FLUXCOM upscaling estimate of ∼115.2 Pg C yr−1 suggests that the Li parameterization scheme is closest to the observations, whereas the Lombardozzi result is clearly lower (Li et al., 2024). In other words, the more substantial ozone damage in the Lombardozzi scheme yields a larger global GPP reduction (∼24 %) than appears warranted by satellite-derived GPP benchmarks (∼10 %–12 % reduction), while the Li scheme's more moderate effect (∼12 % reduction) aligns better with observed totals.

https://gmd.copernicus.org/articles/19/5491/2026/gmd-19-5491-2026-f10

Figure 10Decadal-mean (2005–2014) global total GPP and relative bias for four ozone-stress parameterization schemes and MODIS, compared with the I2000 no-ozone experiment. (a) Global total GPP; (b) relative bias (%).

Download

4 Discussion

The spatial heterogeneity and zonal structure of ozone impacts on productivity can be understood as the coupled outcome of multiple processes. High radiation, together with relatively open stomata, increases stomatal flux, thereby more easily triggering above-threshold accumulation and physiological suppression (Mills et al., 2018). In contrast, under low-temperature, drought, or stomatal-closure conditions, effective flux and cumulative dose can be suppressed even when ozone concentration is relatively high (Liu et al., 2024). These features suggest that the priority for parameterization schemes should be explicit gating and adaptive thresholds for radiation and turbulent transport, evapotranspiration and stomatal limitation, and phenology and canopy stratification (Clifton et al., 2020). From a mechanistic perspective, the Li framework parameterization scheme uses data-driven ozone flux thresholds and separate responses of photosynthesis and conductance, thereby improving its ability to represent flux triggering under tropical conditions with high radiation and humidity (Li et al., 2024; Pande et al., 2025). By contrast, the Lombardozzi parameterization scheme, which uses cumulative uptake as a state variable, exhibits a strong memory effect and accumulates ozone damage too rapidly under sustained high-flux conditions. Together with a relatively rigid scaling relation, it tends to amplify long-term suppression at low latitudes, leading to systematic GPP underestimation (Visser et al., 2021). Consistent with this, a recent model comparison in China showed that the Lombardozzi scheme produced GPP losses 2.5–4 times larger than those of the Sitch scheme, reflecting an over-accumulation of ozone damage in regions of high exposure (Cao et al., 2024). In terms of model structure, a weakly coupled pathway that first accumulates flux above the threshold and then scales a solution without ozone can partly avoid excessive coupled amplification of evapotranspiration, conductance, and photosynthesis, thereby reducing systematic bias in the tropics (Otu-Larbi et al., 2020). The comparison results show that what is decisive is not only whether ozone is represented, but also how ozone flux thresholds and response functions are defined and how memory and radiation stratification are treated at the canopy scale (Zhou et al., 2024).

Station-based RMSE, together with biome-mean bias, jointly reveal a three-way constraint arising from observations, simulations, and spatial resolution. On the one hand, complex land surfaces and diverse PFT combinations in the midlatitudes produce bands of significant errors, while higher-resolution remote sensing in the same regions shows fewer extreme high values, indicating that grid-scale mismatch and subgrid heterogeneity are essential sources of information loss (Wild et al., 2022). On the other hand, under the same forcing, the Li series of parameterization schemes shows convergence in RMSE and reduced bias across several latitude bands and biomes, indicating that a strategy in which ozone flux thresholds can be optimized, and response functions selected, is more favorable for transfer across ecological zones. In contrast, the Lombardozzi parameterization scheme tends to accumulate more strongly and impose more profound suppression in low latitudes and in some grass and shrub systems. It therefore requires region-specific adjustments to ozone flux thresholds and response functions (e.g., separate calibration for sensitive tropical ecosystems). Integrated statistics across multiple biomes also show that the biases exhibit pronounced biome dependence, underscoring that a one-size-fits-all ozone parameterization will lead to systematic errors in specific biomes (e.g., savannas, montane shrublands).

Experimental evidence from previous ozone exposure studies also suggests that the ozone flux thresholds currently adopted across different parameterization schemes remain highly uncertain across vegetation types. For example, experiments comparing loblolly pine (needleleaf) and yellow-poplar (broadleaf) showed stronger ozone sensitivity in the needleleaf species (Tjoelker and Luxmoore, 1991), whereas studies on Magnolia denudata and Cotinus coggygria indicated that broadleaf trees could be more ozone-sensitive than shrub species (Xu et al., 2021). Other controlled experiments further suggested a sensitivity gradient of soybean > tobacco > poplar, implying that crop species are generally more sensitive to ozone than many broadleaf woody species (Dai et al., 2019). However, the Lombardozzi parameterization scheme applies a uniform threshold of 0.8 nmol m−2 s−1 to all five vegetation categories, which may oversimplify species- and PFT-dependent detoxification capacity. Similarly, the original Sitch scheme was developed under limited observational constraints, lacking dedicated experimental evidence for shrub and grass vegetation types; therefore, shrub thresholds were assumed to be equivalent to those for broadleaf vegetation, while grass thresholds were assumed to be similar to those for crops. In contrast, the Li scheme assigns higher ozone sensitivity to needleleaf forests than to broadleaf forests, a result that is not always consistent with the experimental evidence summarized above. Taken together, these inconsistencies indicate that ozone flux thresholds remain insufficiently constrained and still require further recalibration across vegetation types and model frameworks.

This uncertainty was one of the primary motivations for introducing the mixed parameterization experiments in this study. By combining the Li framework with alternative threshold and response-function settings from the Lombardozzi and Sitch schemes, we aimed to isolate the effects of threshold definitions and response-function forms from other structural components of the parameterizations. Importantly, we did not perform additional retuning or recalibration of the empirical parameters, including the global parameter originally used in the Sitch scheme, when transferring these thresholds and response functions into the Li framework. Therefore, the mixed experiments should be interpreted as structural-sensitivity experiments rather than as optimized implementations. In addition, comparing the Li+Lombardozzi and original Lombardozzi schemes enabled us to distinguish further the role of cumulative memory effects from that of threshold and response-function choices alone. Our results suggest that differences in long-term ozone accumulation and canopy memory can substantially alter the magnitude and spatial pattern of simulated ozone damage, even under similar threshold settings.

Except for a few regions, most parameterization schemes still show systematic underestimation to varying degrees across several seasons. Therefore, we focused on the prominent overestimation of the no-ozone experiment relative to the observations in the low latitudes of the Southern Hemisphere (approximately 30° S to the equator) and in shrub-dominated regions, where the time series matched better after introducing ozone (Figs. 5 and 6c). This latitude band is primarily a transition zone from subtropical to tropical semiarid and monsoon climates, where intense radiation and relatively open stomata, together with enhanced boundary-layer mixing during the rainy season, tend to increase instantaneous stomatal flux (Khan et al., 2025). If the gating of above-threshold accumulation or CUO is insensitive to high afternoon vapor pressure deficit (VPD) and soil moisture constraints, or if the phenological switches and leaf-age decay associated with transitions between wet and dry seasons are not adequately represented, potential productivity will be overly amplified (Eghdami et al., 2022). Shrub ecosystems often have shallow roots and rapidly renewing leaves, coexist with shrub–grass mosaics, and are subject to grazing disturbance; in reality, stomatal regulation is more conservative and strongly driven by soil moisture pulses (Wang et al., 2018). When a parameterization scheme adopts forest-type ozone flux thresholds or fixed-shape response functions and does not distinguish the time-varying weighting of canopy flux under heterogeneous near-surface turbulence and radiation, the model tends to produce amplified GPP peaks after rainfall events or at the beginning of the growing season, resulting in systematic overestimation relative to FLUXNET (Vermeuel et al., 2024). In contrast, the Li parameterization scheme uses data-driven thresholds and selectable, decoupled response functions for photosynthesis and conductance, and its treatment of canopy stratification and phenological gating is more consistent with process constraints. As a result, in the band from 30° S to the equator and in shrub-rich regions, the Li scheme markedly reduces the overestimation and comes closer to FLUXNET.

Compared with the earlier CLM4.5-based study by Lombardozzi et al. (2015), an important difference emerges in the baseline productivity simulated in the absence of ozone stress. In CLM4.5, the no-ozone global GPP reached approximately 146.5 Pg C yr−1, whereas the corresponding no-ozone simulation in the present CLM5 framework is substantially lower (132 Pg C yr−1). This indicates that improvements in the underlying land-surface physiology and canopy process representation from CLM4.5 to CLM5 have already reduced part of the excessive global productivity bias. Consequently, introducing ozone stress in CLM4.5 moved the simulated GPP closer to observational benchmarks at the global scale, whereas in CLM5, the remaining correction space is smaller, and excessive ozone suppression can more readily lead to underestimation in some regions and across statistical metrics. This difference highlights that the apparent performance of ozone-stress parameterizations is strongly dependent on the baseline behavior of the host land-surface model.

From the perspective of global total GPP, the reasons why different ozone stress parameterization schemes produce varying degrees of reduction lie mainly in how ozone flux thresholds are determined and in the forms of the response functions. A framework in which thresholds are determined from data and the optimal choice can be made among several candidate functions helps calibrate, separately, when ozone stress is triggered and how strongly it decays after triggering. This maintains transparency and traceability when applied across regions. In contrast, configurations with overly strong cumulative memory more easily amplify the influence of historical fluxes in low-latitude ecosystems with high leaf longevity or long growing seasons. Our results here support the findings of Li et al. (2024) on a global scale: using PFT-specific optimal thresholds and nonlinear response curves (as in Li's scheme) yields roughly half the ozone-induced GPP reduction compared with a scheme with fixed thresholds and linear responses, thereby better aligning model GPP estimates with observational estimates. Thus, beyond the inclusion of ozone damage per se, the calibration and flexibility of the threshold and response parameters determine model outcomes.

5 Implications for future improvement in Earth system models

The significant differences in GPP bias across biomes reveal key directions for improving ozone-stress parameterization in models. In biomes with intense radiation, such as wetland grasslands and tropical moist broadleaf forests, models generally underestimate GPP, especially with the Lombardozzi scheme. Therefore, regional calibration of ozone flux thresholds and response functions is needed for these areas. In contrast, in low-stress regions such as tundra, temperate grasslands, and forests, the Li scheme, with optimized thresholds and decoupled response functions, performs best, suggesting that more refined model adjustments are required in these regions. In dry and desert biomes, a slight overestimation is observed, especially in the no-ozone simulations, indicating that the model's performance is generally good in these regions but still leaves room for further improvement when accounting for other environmental factors. In conclusion, optimizing the model to account for the characteristics of different biomes will improve the accuracy of ozone-stress simulations, thereby enhancing the reliability of climate predictions. In addition, although the Li scheme performs well across most vegetation types, especially in shrubs, where it significantly reduces bias (as shown in Fig. 7), it does not always perform best in crop types and under autumn conditions. Particularly in summer, the NSE values for all ozone stress schemes are relatively low, and after ozone stress is introduced, the model's fit to FLUXNET deteriorates. This indicates that, although the Li scheme performs well in certain biomes and seasons, further optimization is still needed for crop types and seasonal variations.

The process representation of ozone stress effects needs to shift from a paradigm of static ozone flux thresholds combined with fixed response functions to parameterization schemes based on a unified mechanistic framework that can be transferred across different vegetation types and regions. We should calibrate ozone flux thresholds for different vegetation types and areas in a data-driven manner, enabling optimal selection among multiple response functions while accounting separately for the independent suppression of photosynthesis and stomatal conductance. In parallel, we recommend introducing finer control over canopy stratification, phenological gating, and leaf-age memory, so that above-threshold accumulation or dose state variables can adapt across diurnal, seasonal, and growth stages, thereby avoiding systematic excessive downregulation caused by long-term memory.

Interactions between ozone stress and water stress need to be incorporated in parallel rather than corrected afterward. Under conditions of high temperature, intense radiation, and elevated VPD, whether above-threshold accumulation is triggered and how strongly it decays after triggering depend on the combined constraints imposed by evaporative demand, stomatal opening, and soil moisture. We recommend explicitly coupling the time-varying effects of VPD and soil moisture on stomatal and boundary-layer fluxes in the resistance-chain calculations, and setting robust gating for extreme afternoon conditions to reduce the degradation of model skill in summer. At the same time, for semiarid and shrub ecosystems, thresholds and response functions should be recalibrated using regional observations to avoid systematic over- or underestimation arising from the reuse of forest-type parameterization schemes. (Consistent with these recommendations, recent studies have called for jointly accounting for drought and ozone – e.g., Otu-Larbi et al. (2020) and Khan et al. (2025) – to better simulate vegetation risk.)

In addition to improvements to ozone stress parameterization schemes that need to infer thresholds, response functions, and water stress effects from in situ ground observations, previous studies have also discussed potential directions for development in terms of other explanatory variables such as leaf area, leaf thickness, and leaf mass per area, as well as additional environmental factors (Karlsson et al., 2007; Wittig et al., 2007; Bussotti, 2008; Feng et al., 2018; Hansen et al., 2019; Xu et al., 2020; Ma et al., 2023; Li et al., 2024). Here, we propose several potential directions from a remote sensing perspective for further improving model simulation ability and ozone stress parameterization schemes:

  1. Photosynthetic capacity and stress detection: solar-induced chlorophyll fluorescence and the near-infrared vegetation index can be used as joint constraints on photosynthetic potential and non-photochemical quenching (Porcar-Castell et al., 2014; Badgley et al., 2017). Meanwhile, microwave vegetation optical depth, together with passive microwave soil moisture, can be used to characterize canopy water content and soil moisture state. These could help ensure that stomatal opening/closure and the conditions under which above-threshold O3 uptake occurs are more accurately constrained under high-temperature, high-radiation conditions (Konings et al., 2017).

  2. Canopy structure and turbulence: multi-source leaf area index products and emerging canopy structure observations (e.g., from LiDAR) can be used to better constrain canopy layer weighting and vegetation structural representation (Xiao et al., 2019). Additionally, canopy height and surface roughness inferred from LiDAR and synthetic aperture radar can be used to reduce systematic bias under conditions of strong afternoon turbulence and intense radiation (Dubayah et al., 2020). This would refine how models partition ozone flux between tall, rough canopies and short, smooth ones, thereby improving the partitioning of flux to leaves (Visser et al., 2021).

  3. Plant traits and species-specific responses: leaf economic spectrum traits retrieved from hyperspectral observations – such as leaf mass per area (LMA), leaf thickness, and leaf nitrogen content – can be used to construct prior distributions of photosynthesis and conductance sensitivity for different vegetation types. This can support the dynamic, regional and seasonal updating of ozone flux thresholds and response functions (Serbin et al., 2014). For example, trait-based approaches (e.g., Ma et al., 2023) can inform which ecosystems have inherently higher ozone tolerance and should have higher threshold Y values in models.

  4. Data assimilation and uncertainty quantification: through joint assimilation of site observations and remote sensing data, ozone flux thresholds, response function shapes, and key trait parameters can be calibrated and their uncertainties quantified simultaneously using hierarchical Bayesian methods or surrogate models (MacBean et al., 2016). This would achieve consistent multi-scale constraints and provide a transferable evaluation framework spanning the site, regional, and global scales.

Additionally, artificial intelligence (AI) offers a potential technical pathway for further developing ozone stress parameterizations. On the one hand, AI-based surrogate models can approximate complex ozone flux calculations and nonlinear response processes while maintaining physical constraints, holding promise for reducing computational costs and thereby supporting the exploration of multi-parameter combinations (Lu and Ricciuto, 2019; Zhu et al., 2022). On the other hand, integrating site observations, remote sensing products, and model outputs into supervised or semi-supervised learning frameworks may help identify ozone flux thresholds, response function forms, and their dependencies on moisture, radiation, and phenological conditions from multi-source data, providing data-driven prior information for different vegetation types and regions (Sung et al., 2025). Furthermore, combining artificial intelligence with model–data assimilation may offer new pathways for jointly inverting ozone flux thresholds, response function parameters, and key leaf traits (Raoult et al., 2025). By integrating artificial intelligence with physical constraints, ecological mechanisms, and multi-scale observations, it is possible to develop more flexible ozone-stress parameterization frameworks that provide more robust support for global carbon cycle assessments and scenario projections.

6 Conclusion

Within a unified land surface model framework, we compared three representative ozone stress parameterization schemes and, by decomposing in layers the choices of ozone flux thresholds, the representation of cumulative dose, and the shapes of response functions, clarified that beyond whether ozone is included, the more critical question is how thresholds and responses are represented. The overall evidence shows that a diagnostic scheme that uses data-driven thresholds, models photosynthesis and stomatal conductance separately, and allows an optimal choice among multiple response functions is more favorable for transferable applications across regions, vegetation types, and time scales. In contrast, configurations with overly strong cumulative memory or rigid assumptions about thresholds and functions are more likely to amplify systematic biases in specific climate zones or habitat types. Based on the comprehensive evaluation, we conclude that the Li ozone stress parameterization scheme is the most robust and the most consistent with the observations in this study. Notably, the mixed experiments incorporating thresholds and response functions derived from the Sitch and Lombardozzi schemes were conducted without additional recalibration; therefore, the differences reported here primarily reflect structural sensitivities rather than fully optimized implementations of those parameterizations within the Li framework.

Large-scale remote sensing productivity benchmarks and high-resolution site-level flux observations are complementary in space, time, and process representation. This enables us to separate uncertainties in ozone thresholds and response functions from biases in forcing fields and observations. At the same time, the temporally and spatially consistent ozone forcing from the chemical transport model ensures mechanistic coherence between atmospheric inputs and land-surface processes. The resulting chain of evidence not only supports a quantitative discrimination of the relative performance of different ozone stress parameterization schemes but also points to directions for improvement. Namely, our results indicate that ozone flux thresholds need to be explicitly calibrated in both the regional dimension and the vegetation-type dimension; that response functions should remain flexible with respect to the nonlinear joint effects of high summer temperature, intense radiation, high VPD, and water constraints; and that canopy stratification together with phenological gating are key mechanisms for suppressing long-term over-memory and biases under extreme afternoon conditions. At a practical level, these findings help identify a prioritized set of ozone-related parameters and processes that warrant focused constraint in future model evaluation and model–data integration efforts.

Our results further suggest that the apparent benefit of introducing ozone stress depends strongly on the host land-surface model's baseline behavior. Compared with earlier CLM4.5-based studies, the no-ozone simulation in CLM5 already produces substantially lower global GPP, indicating that improvements in canopy physiology and land-surface process representation have reduced part of the intrinsic productivity bias. Consequently, although ozone stress effectively reduces the tropical overestimation observed in the no-ozone experiment, improvements in model performance remain spatially and temporally heterogeneous across evaluation metrics and ecosystems. This highlights that future ozone parameterizations should not only focus on representing ozone damage itself, but also on achieving balanced interactions with the underlying model structure, including canopy processes, phenology, hydrological constraints, and biome-dependent productivity dynamics.

We also highlight limitations and frontier needs that still require attention. First, representativeness differences between the site scale and the model grid scale, together with subgrid heterogeneity, continue to amplify biases in some biomes and climate zones. This calls for higher-resolution forcing and parameter fields, as well as joint assimilation of site observations and remote sensing data, to narrow this gap. Second, the parallel systems for water and ozone stress still need strengthening. Future work should explicitly couple, at the level of the resistance chain, the time-varying modulation of stomatal flux and boundary-layer flux by vapor pressure deficit and soil moisture, and should use hierarchical Bayesian or surrogate-model approaches to calibrate the thresholds, response functions, and prior distributions of key leaf traits in ozone stress parameterization schemes. Third, regional calibration of thresholds and ecosystem-type-specific response functions still requires richer observational evidence, especially in semiarid shrublands, tropical monsoon regions, and cropland systems under strong management. Strengthening these aspects is expected to extend the diagnostic framework for ozone flux threshold calibration and response function selection developed in this study into a consistency evaluation tool applicable across multiple scenarios and models, thereby enhancing the transferability and credibility of assessments of ozone impacts on the carbon cycle.

Code and data availability

The code (including its license) for the two modified ozone-stress parameterization schemes used to simulate ozone-caused damage to vegetation in the process model is available at https://doi.org/10.5281/zenodo.18323484 (Zhou, 2026a). The input data for ozone concentration and the model simulation output data are accessible at https://doi.org/10.5281/zenodo.18324274 (Zhou, 2026b). The code for the Community Terrestrial Systems Model CTSM 5.3.075 is archived at https://doi.org/10.5281/zenodo.15174742 (CTSM Team and Damaseaux, 2025). The MODIS and FLUXNET GPP data are documented in Zhao et al. (2005) (https://doi.org/10.1016/j.rse.2004.12.011) and Pastorello et al. (2020) (https://doi.org/10.1038/s41597-020-0534-3), respectively.

Author contributions

PZ and JC conceived the study. FL provided the initial parameterization code, which was further modified and refined by PZ. JL led the ozone simulations and data analysis, with contributions from RB and DM, while PZ conducted the land surface process simulations and data analysis. PZ, JC, and MB jointly drafted the initial manuscript. LD, JL, MB, FL, and MS critically revised the manuscript. ZC, JP, KL, FY, WP, JC, LX, and other co-authors contributed to in-depth discussions of the study. LD coordinated the overall team effort and provided funding support.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We are grateful to Danica Lombardozzi, Sam Levis, Keith Oleson, Sam Rabin, Ryan Knox, Adrianna Foster, Jyoti Singh for their helpful discussions. We also thank the National Supercomputer Center in Guangzhou for providing the Tianhe 2 supercomputer used for the model simulations.

Financial support

TThis research was jointly supported by the National Key Research and Development Program of China (grant no. 2023YFF0805501) and the National Natural Science Foundation of China (grant nos. 42141017, 42261144687, 42075167, 41875137). Fang Li is supported by the Guangdong Major Project of Basic and Applied Basic Research (grant no. 2021B0301030007).

Review statement

This paper was edited by Hans Verbeeck and reviewed by two anonymous referees.

References

Achebak, H., Garatachea, R., Pay, M. T., Jorba, O., Guevara, M., Pérez García-Pando, C., and Ballester, J.: Geographic sources of ozone air pollution and mortality burden in Europe, Nat. Med., 30, 1732–1738, https://doi.org/10.1038/s41591-024-02976-x, 2024. 

Agathokleous, E., Feng, Z., Oksanen, E., Sicard, P., Wang, Q., Saitanis, C. J., Araminiene, V., Blande, J. D., Hayes, F., Calatayud, V., Domingos, M., Veresoglou, S. D., Peñuelas, J., Wardle, D. A., De Marco, A., Li, Z., Harmens, H., Yuan, X., Vitale, M., and Paoletti, E.: Ozone affects plant, insect, and soil microbial communities: A threat to terrestrial ecosystems and biodiversity, Sci. Adv., 6, eabc1176, https://doi.org/10.1126/sciadv.abc1176, 2020.  

Badgley, G., Field, C. B., and Berry, J. A.: Canopy near-infrared reflectance and terrestrial photosynthesis, Sci. Adv., 3, e1602244, https://doi.org/10.1126/sciadv.1602244, 2017. 

Baldocchi, D., Falge, E., Gu, L., Olson, R., Hollinger, D., Running, S., Anthoni, P., Bernhofer, C., Davis, K., Evans, R., Fuentes, J., Goldstein, A., Katul, G., Law, B., Lee, X., Malhi, Y., Meyers, T., Munger, W., Oechel, W., Paw U, K. T., Pilegaard, K., Schmid, H. P., Valentini, R., Verma, S., Vesala, T., Wilson, K., and Wofsy, S.: FLUXNET: A new tool to study the temporal and spatial variability of ecosystem-scale carbon dioxide, water vapor, and energy flux densities, B. Am. Meteorol. Soc., 82, 2415–2434, https://doi.org/10.1175/1520-0477(2001)082<2415:FANTTS>2.3.CO;2, 2001. 

Bussotti, F.: Functional leaf traits, plant communities and acclimation processes in relation to oxidative stress in trees: a critical overview, Global Change Biol., 14, 2727–2739, https://doi.org/10.1111/j.1365-2486.2008.01677.x, 2008. 

Cao, J., Yue, X., and Ma, M.: Simulation of ozone–vegetation coupling and feedback in China using multiple ozone damage schemes, Atmos. Chem. Phys., 24, 3973–3987, https://doi.org/10.5194/acp-24-3973-2024, 2024. 

Clifton, O. E., Fiore, A. M., Massman, W. J., Baublitz, C. B., Coyle, M., Emberson, L., Fares, S., Farmer, D. K., Gentine, P., Gerosa, G., Guenther, A. B., Helmig, D., Lombardozzi, D. L., Munger, J. W., Patton, E. G., Pusede, S. E., Schwede, D. B., Silva, S. J., Sörgel, M., Steiner, A. L., and Tai, A. P. K.: Dry deposition of ozone over land: processes, measurement, and modeling, Rev. Geophys., 58, e2019RG000670, https://doi.org/10.1029/2019RG000670, 2020. 

Collatz, G. J., Ball, J. T., Grivet, C., and Berry, J. A.: Physiological and environmental regulation of stomatal conductance, photosynthesis, and transpiration: A model that includes a laminar boundary layer, Agr. Forest Meteorol., 54, 107–136, https://doi.org/10.1016/0168-1923(91)90002-8, 1991. 

Collatz, G. J., Ribas-Carbo, M., and Berry, J. A.: Coupled photosynthesis–stomatal conductance model for leaves of C4 plants, Aust. J. Plant Physiol., 19, 519–538, https://doi.org/10.1071/PP9920519, 1992. 

CTSM Development Team and Damseaux, A.: AdrienDams/CTSM: sturm-paper (v1.0.0), Zenodo [code], https://doi.org/10.5281/zenodo.15174742, 2025. 

Dai, L., Feng, Z., Pan, X., Xu, Y., Li, P., Lefohn, A. S., Harmens, H., and Kobayashi, K.: Increase of apoplastic ascorbate induced by ozone is insufficient to remove the negative effects in tobacco, soybean and poplar, Environ. Pollut., 245, 380–388, https://doi.org/10.1016/j.envpol.2018.11.030, 2019. 

Darmenov, A. S. and da Silva, A. M.: The Quick Fire Emissions Dataset (QFED): documentation of versions 2.1, 2.2 and 2.4, in: NASA Technical Report Series on Global Modeling and Data Assimilation, Vol. 38, NASA/TM2015104606, edited by: Koster, R. D., National Aeronautics and Space Administration, Goddard Space Flight Center, Greenbelt, Maryland, USA, https://gmao.gsfc.nasa.gov/media/science_snapshot/k9bkl33mqdnhr9sql32cssvkg/Darmenov796.pdf (last access: 23 June 2026), 2015. 

Dirmeyer, P. A.: A history and review of the Global Soil Wetness Project (GSWP), J. Hydrometeorol., 12, 729–749, https://doi.org/10.1175/JHM-D-10-05010.1, 2011. 

Dubayah, R., Blair, J. B., Goetz, S., Fatoyinbo, T., Hansen, M., Healey, S., Hofton, M., Hurtt, G., Kellner, J., Luthcke, S., Armston, J., Tang, H., and Duncanson, L.: The Global Ecosystem Dynamics Investigation: High-resolution laser ranging of the Earth's forests and topography, Sci. Remote Sens., 1, 100002, https://doi.org/10.1016/j.srs.2020.100002, 2020. 

Eghdami, H., Werner, W., Büker, P., and Sicard, P.: Assessment of ozone risk to Central European forests: time series indicates perennial exceedance of ozone critical levels, Environ. Res., 203, 111798, https://doi.org/10.1016/j.envres.2021.111798, 2022. 

Emberson, L. D., Pleijel, H., Ainsworth, E. A., van den Berg, M., Ren, W., Osborne, S., Mills, G., Pandey, D., Dentener, F., Büker, P., Ewert, F., Koeble, R., and Van Dingenen, R.: Ozone effects on crops and consideration in crop models, Eur. J. Agron., 100, 19–34, https://doi.org/10.1016/j.eja.2018.06.002, 2018. 

Emmons, L. K., Schwantes, R. H., Orlando, J. J., Tyndall, G., Kinnison, D., Lamarque, J.-F., Marsh, D., Mills, M. J., Tilmes, S., Bardeen, C., Buchholz, R. R., Conley, A., Gettelman, A., Garcia, R., Simpson, I., Blake, D. R., Meinardi, S., and Pétron, G.: The chemistry mechanism in the Community Earth System Model version 2 (CESM2), J. Adv. Model. Earth Syst., 12, e2019MS001882, https://doi.org/10.1029/2019MS001882, 2020. 

Farquhar, G. D., von Caemmerer, S., and Berry, J. A.: A biochemical model of photosynthetic CO2 assimilation in leaves of C3 species, Planta, 149, 78–90, https://doi.org/10.1007/BF00386231, 1980. 

Feng, Z., Xu, Y., Kobayashi, K., Dai, L., Zhang, T., Agathokleous, E., Calatayud, V., Paoletti, E., Mukherjee, A., Agrawal, M., Park, R. J., Oak, Y. J., and Yue, X.: Ozone pollution threatens the production of major staple crops in East Asia, Nat. Food, 3, 47–56, https://doi.org/10.1038/s43016-021-00422-6, 2022. 

Feng, Z. Z., Buker, P., Pleijel, H., Emberson, L., Karlsson, P. E., and Uddling, J.: A unifying explanation for variation in ozone sensitivity among woody plants, Global Change Biol., 24, 78–84, https://doi.org/10.1111/gcb.13824, 2018. 

Fuhrer, J.: The critical level for effects of ozone on crops, and the transfer to mapping, in: Critical Levels for Ozone in Europe: Testing and Finalising the Concepts, edited by: Kärenlampi, L. and Skärby, L., University of Kuopio, Department of Ecology and Environmental Science, Kuopio, Finland, 27–43, ISBN 951-780-653-1, 1996. 

Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2), J. Climate, 30, 5419–5454, https://doi.org/10.1175/JCLI-D-16-0758.1, 2017. 

Gong, C., Lei, Y., Ma, Y., Yue, X., and Liao, H.: Ozone–vegetation feedback through dry deposition and isoprene emissions in a global chemistry–carbon–climate model, Atmos. Chem. Phys., 20, 3841–3857, https://doi.org/10.5194/acp-20-3841-2020, 2020. 

Granier, C., Darras, S., Denier van der Gon, H., Doubalova, J., Elguindi, N., Galle, B., Gauss, M., Guevara, M., Jalkanen, J.-P., Kuenen, J., Liousse, C., Quack, B., Simpson, D., and Sindelarova, K.: The Copernicus Atmosphere Monitoring Service global and regional emissions (April 2019 version), CAMS – Copernicus Atmosphere Monitoring Service, https://doi.org/10.24380/d0bn-kx16, 2019. 

Guenther, A. B., Jiang, X., Heald, C. L., Sakulyanontvittaya, T., Duhl, T., Emmons, L. K., and Wang, X.: The Model of Emissions of Gases and Aerosols from Nature version 2.1 (MEGAN2.1): an extended and updated framework for modeling biogenic emissions, Geosci. Model Dev., 5, 1471–1492, https://doi.org/10.5194/gmd-5-1471-2012, 2012. 

Hansen, E. M. Ø., Hauggaard-Nielsen, H., Launay, M., Rose, P., and Mikkelsen, T. N.: The impact of ozone exposure, temperature and CO2 on the growth and yield of three spring wheat varieties, Environ. Exp. Bot., 168, 103868, https://doi.org/10.1016/j.envexpbot.2019.103868, 2019. 

Karlsson, P. E., Braun, S., Broadmeadow, M., Elvira, S., Emberson, L., Gimeno, B. S., Le Thiec, D., Novak, K., Oksanen, E., Schaub, M., Uddling, J., and Wilkinson, M.: Risk assessments for forest trees: The performance of the ozone flux versus the AOT concepts, Environ. Pollut., 146, 608–616, https://doi.org/10.1016/j.envpol.2006.06.012, 2007. 

Khan, A. M., Clifton, O. E., Bash, J. O., Bland, S., Booth, N., Cheung, P., Emberson, L., Flemming, J., Fredj, E., Galmarini, S., Ganzeveld, L., Gazetas, O., Goded, I., Hogrefe, C., Holmes, C. D., Horváth, L., Huijnen, V., Li, Q., Makar, P. A., Mammarella, I., Manca, G., Munger, J. W., Pérez-Camanyo, J. L., Pleim, J., Ran, L., San Jose, R., Schwede, D., Silva, S. J., Staebler, R., Sun, S., Tai, A. P. K., Tas, E., Vesala, T., Weidinger, T., Wu, Z., Zhang, L., and Stoy, P. C.: Ozone dry deposition through plant stomata: multi-model comparison with flux observations and the role of water stress as part of AQMEII4 Activity 2, Atmos. Chem. Phys., 25, 8613–8635, https://doi.org/10.5194/acp-25-8613-2025, 2025. 

Konings, A. G., Williams, A. P., and Gentine, P.: Sensitivity of grassland productivity to aridity controlled by stomatal and xylem regulation, Nat. Geosci., 10, 284–288, https://doi.org/10.1038/ngeo2903, 2017. 

Lamarque, J.-F., Emmons, L. K., Hess, P. G., Kinnison, D. E., Tilmes, S., Vitt, F., Heald, C. L., Holland, E. A., Lauritzen, P. H., Neu, J., Orlando, J. J., Rasch, P. J., and Tyndall, G. K.: CAM-chem: description and evaluation of interactive atmospheric chemistry in the Community Earth System Model, Geosci. Model Dev., 5, 369–411, https://doi.org/10.5194/gmd-5-369-2012, 2012. 

Lamarque, J.-F., Dentener, F., McConnell, J., Ro, C.-U., Shaw, M., Vet, R., Bergmann, D., Cameron-Smith, P., Dalsøren, S., Doherty, R., Faluvegi, G., Ghan, S. J., Josse, B., Lee, Y. H., MacKenzie, I. A., Plummer, D., Shindell, D. T., Skeie, R. B., Stevenson, D. S., Strode, S., Zeng, G., Curran, M., Dahl-Jensen, D., Das, S., Fritzsche, D., and Nolan, M.: Multi-model mean nitrogen and sulfur deposition from the Atmospheric Chemistry and Climate Model Intercomparison Project (ACCMIP): evaluation of historical and projected future changes, Atmos. Chem. Phys., 13, 7997–8018, https://doi.org/10.5194/acp-13-7997-2013, 2013. 

Lawrence, D. M., Fisher, R. A., Koven, C. D., Oleson, K. W., Swenson, S. C., Bonan, G., Collier, N., Ghimire, B., van Kampenhout, L., Kennedy, D., Kluzek, E., Lawrence, P. J., Li, F., Li, H., Lombardozzi, D., Riley, W. J., Sacks, W. J., Shi, M., Vertenstein, M., Wieder, W. R., Xu, C., Ali, A. A., Badger, A. M., Bisht, G., van den Broeke, M., Brunke, M. A., Burns, S. P., Buzan, J., Clark, M., Craig, A., Dahlin, K., Drewniak, B., Fisher, J. B., Flanner, M., Fox, A. M., Gentine, P., Hoffman, F., Keppel-Aleks, G., Knox, R., Kumar, S., Lenaerts, J., Leung, L. R., Lipscomb, W. H., Lu, Y., Pandey, A., Pelletier, J. D., Perket, J., Randerson, J. T., Ricciuto, D. M., Sanderson, B. M., Slater, A., Subin, Z. M., Tang, J., Thomas, R. Q., Val Martin, M., and Zeng, X.: The Community Land Model version 5: Description of new features, benchmarking, and impact of forcing uncertainty, J. Adv. Model. Earth Syst., 11, 4245–4287, https://doi.org/10.1029/2018MS001583, 2019. 

Li, F., Zhou, Z., Levis, S., Sitch, S., Hayes, F., Feng, Z., Reich, P. B., Zhao, Z., and Zhou, Y.: Quantifying the role of ozone-caused damage to vegetation in the Earth system: a new parameterization scheme for photosynthetic and stomatal responses, Geosci. Model Dev., 17, 6173–6193, https://doi.org/10.5194/gmd-17-6173-2024, 2024. 

Liu, Y., Wang, L., Xia, J., Guo, H., and Wang, J.: Ozone enrichment and drought stress have more negative effects on invasive leguminous woody species than co-occurring native species, Environ. Exp. Bot., 217, 105580, https://doi.org/10.1016/j.envexpbot.2023.105580, 2024. 

Lombardozzi, D., Sparks, J. P., and Bonan, G.: Integrating O3 influences on terrestrial processes: photosynthetic and stomatal response data available for regional and global modeling, Biogeosciences, 10, 6815–6831, https://doi.org/10.5194/bg-10-6815-2013, 2013. 

Lombardozzi, D., Levis, S., Bonan, G., Hess, P. G., and Sparks, J. P.: The influence of chronic ozone exposure on global carbon and water cycles, J. Climate, 28, 292–305, https://doi.org/10.1175/JCLI-D-14-00223.1, 2015. 

Lu, D. and Ricciuto, D.: Efficient surrogate modeling methods for large-scale Earth system models based on machine-learning techniques, Geosci. Model Dev., 12, 1791–1807, https://doi.org/10.5194/gmd-12-1791-2019, 2019. 

Ma, Y., Yue, X., Sitch, S., Unger, N., Uddling, J., Mercado, L. M., Gong, C., Feng, Z., Yang, H., Zhou, H., Tian, C., Cao, Y., Lei, Y., Cheesman, A. W., Xu, Y., and Duran Rojas, M. C.: Implementation of trait-based ozone plant sensitivity in the Yale Interactive Terrestrial Biosphere model v1.0 to assess global vegetation damage, Geosci. Model Dev., 16, 2261–2276, https://doi.org/10.5194/gmd-16-2261-2023, 2023. 

Ma, Y., Guan, X., Chen, J. M., Ju, W., and Shen, H.: Improving MODIS gross primary productivity by bridging big-leaf and two-leaf light use efficiency models, J. Geophys. Res.-Biogeo., 129, e2023JG007737, https://doi.org/10.1029/2023JG007737, 2024. 

MacBean, N., Peylin, P., Chevallier, F., Scholze, M., and Schürmann, G.: Consistent assimilation of multiple data streams in a carbon cycle data assimilation system, Geosci. Model Dev., 9, 3569–3588, https://doi.org/10.5194/gmd-9-3569-2016, 2016. 

Medlyn, B. E., Duursma, R. A., Eamus, D., Ellsworth, D. S., Prentice, I. C., Barton, C. V. M., Crous, K. Y., De Angelis, P., Freeman, M., and Wingate, L.: Reconciling the optimal and empirical approaches to modelling stomatal conductance, Global Change Biol., 17, 2134–2144, https://doi.org/10.1111/j.1365-2486.2010.02375.x, 2011. 

Mills, G., Hayes, F., Simpson, D., Emberson, L., Norris, D., Harmens, H., and Büker, P.: Evidence of widespread effects of ozone on crops and (semi-)natural vegetation in Europe (1990–2006) in relation to AOT40- and flux-based risk maps, Global Change Biol., 17, 592–613, https://doi.org/10.1111/j.1365-2486.2010.02217.x, 2011. 

Mills, G., Sharps, K., Simpson, D., Pleijel, H., Frei, M., Burkey, K., Emberson, L., Uddling, J., Broberg, M., Feng, Z., Kobayashi, K., and Agrawal, M.: Closing the global ozone yield gap: quantification and co-benefits for multi-stress tolerance, Global Change Biol., 24, 4869–4893, https://doi.org/10.1111/gcb.14381, 2018. 

Montes, C. M., Demler, H. J., Li, S., Martin, D. G., and Ainsworth, E. A.: Approaches to investigate crop responses to ozone pollution: from O3-FACE to satellite-enabled modeling, Plant J., 108, 780–796, https://doi.org/10.1111/tpj.15501, 2021. 

Moura, B. B., Manzini, J., Paoletti, E., and Hoshika, Y.: A three-year free-air experimental assessment of ozone risk on the perennial Vitis vinifera crop species, Environ. Pollut., 338, 122626, https://doi.org/10.1016/j.envpol.2023.122626, 2023. 

Murray, D., Buchholz, R., R., Emmons, L. K., Honomichl, S., Tang, W., Tilmes, S., Barth, M., and Wymore, A. S. Evaluating CAM-chem modeled atmospheric wet deposition with observed long-term records, J. Geophys. Res.-Atmos., in press, 2026. 

Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models part I – A discussion of principles, J. Hydrol., 10, 282–290, https://doi.org/10.1016/0022-1694(70)90255-6, 1970. 

Otu-Larbi, F., Conte, A., Fares, S., Wild, O., and Ashworth, K.: Current and future impacts of drought and ozone stress on Northern Hemisphere forests, Global Change Biol., 26, 6218–6234, https://doi.org/10.1111/gcb.15339, 2020. 

Pande, P., Bland, S., Booth, N., Cook, J., Feng, Z., and Emberson, L.: Development of the DO3SE-Crop model to assess ozone effects on crop phenology, biomass, and yield, Biogeosciences, 22, 181–212, https://doi.org/10.5194/bg-22-181-2025, 2025. 

Paoletti, E., Sicard, P., Hoshika, Y., Fares, S., Badea, O., Pitar, D., Popa, I., Anav, A., Baesso Moura, B., and De Marco, A.: Towards long-term sustainability of stomatal ozone flux monitoring at forest sites, Sustain. Horiz., 2, 100018, https://doi.org/10.1016/j.horiz.2022.100018, 2022. 

Pastorello, G., Trotta, C., Canfora, E., Chu, H., Christianson, D., Cheah, Y.-W., Poindexter, C., Chen, J., Elbashandy, A., Humphrey, M., Isaac, P., Polidori, D., Reichstein, M., Ribeca, A., van Ingen, C., Vuichard, N., Zhang, L., Amiro, B., Ammann, C., Arain, M. A., Ardó, J., Arkebauer, T., Arndt, S. K., Arriga, N., Aubinet, M., Aurela, M., Baldocchi, D., Barr, A., Beamesderfer, E., Belelli Marchesini, L., Bergeron, O., Beringer, J., Bernhofer, C., Berveiller, D., Billesbach, D., Black, T. A., Blanken, P. D., Bohrer, G., Boike, J., Bolstad, P. V., Bonal, D., Bonnefond, J.-M., Bowling, D. R., Bracho, R., Brodeur, J., Brümmer, C., Buchmann, N., Burban, B., Burns, S. P., Buysse, P., Cale, P., Cavagna, M., Cellier, P., Chen, S., Chini, I., Christensen, T. R., Cleverly, J., Collalti, A., Consalvo, C., Cook, B. D., Cook, D., Coursolle, C., Cremonese, E., Curtis, P. S., D'Andrea, E., da Rocha, H., Dai, X., Davis, K. J., De Cinti, B., de Grandcourt, A., De Ligne, A., De Oliveira, R. C., Delpierre, N., Desai, A. R., Di Bella, C. M., di Tommasi, P., Dolman, H., Domingo, F., Dong, G., Dore, S., Duce, P., Dufrêne, E., Dunn, A., Dušek, J., Eamus, D., Eichelmann, U., ElKhidir, H. A. M., Eugster, W., Ewenz, C. M., Ewers, B., Famulari, D., Fares, S., Feigenwinter, I., Feitz, A., Fensholt, R., Filippa, G., Fischer, M., Frank, J., Galvagno, M., Gharun, M., Gianelle, D., Gielen, B., Gioli, B., Gitelson, A., Goded, I., Goeckede, M., Goldstein, A. H., Gough, C. M., Goulden, M. L., Graf, A., Griebel, A., Gruening, C., Grünwald, T., Hammerle, A., Han, S., Han, X., Hansen, B. U., Hanson, C., Hatakka, J., He, Y., Hehn, M., Heinesch, B., Hinko-Najera, N., Hörtnagl, L., Hutley, L., Ibrom, A., Ikawa, H., Jackowicz-Korczynski, M., Janouš, D., Jans, W., Jassal, R., Jiang, S., Kato, T., Khomik, M., Klatt, J., Knohl, A., Knox, S., Kobayashi, H., Koerber, G., Kolle, O., Kosugi, Y., Kotani, A., Kowalski, A., Kruijt, B., Kurbatova, J., Kutsch, W. L., Kwon, H., Launiainen, S., Laurila, T., Law, B., Leuning, R., Li, Y., Liddell, M., Limousin, J.-M., Lion, M., Liska, A. J., Lohila, A., López-Ballesteros, A., López-Blanco, E., Loubet, B., Loustau, D., Lucas-Moffat, A., Lüers, J., Ma, S., Macfarlane, C., Magliulo, V., Maier, R., Mammarella, I., Manca, G., Marcolla, B., Margolis, H. A., Marras, S., Massman, W., Mastepanov, M., Matamala, R., Matthes, J. H., Mazzenga, F., McCaughey, H., McHugh, I., McMillan, A. M. S., Merbold, L., Meyer, W., Meyers, T., Miller, S. D., Minerbi, S., Moderow, U., Monson, R. K., Montagnani, L., Moore, C. E., Moors, E., Moreaux, V., Moureaux, C., Munger, J. W., Nakai, T., Neirynck, J., Nesic, Z., Nicolini, G., Noormets, A., Northwood, M., Nosetto, M., Nouvellon, Y., Novick, K., Oechel, W., Olesen, J. E., Ourcival, J.-M., Papuga, S. A., Parmentier, F.-J., Paul-Limoges, E., Pavelka, M., Peichl, M., Pendall, E., Phillips, R. P., Pilegaard, K., Pirk, N., Posse, G., Powell, T., Prasse, H., Prober, S. M., Rambal, S., Rannik, Ü., Raz-Yaseef, N., Rebmann, C., Reed, D., Resco de Dios, V., Restrepo-Coupe, N., Reverter, B. R., Roland, M., Sabbatini, S., Sachs, T., Saleska, S. R., Sánchez-Cañete, E. P., Sanchez-Mejia, Z. M., Schmid, H. P., Schmidt, M., Schneider, K., Schrader, F., Schroder, I., Scott, R. L., Sedlák, P., Serrano-Ortíz, P., Shao, C., Shi, P., Shironya, I., Siebicke, L., Šigut, L., Silberstein, R., Sirca, C., Spano, D., Steinbrecher, R., Stevens, R. M., Sturtevant, C., Suyker, A., Tagesson, T., Takanashi, S., Tang, Y., Tapper, N., Thom, J., Tomassucci, M., Tuovinen, J.-P., Urbanski, S., Valentini, R., van der Molen, M., van Gorsel, E., van Huissteden, K., Varlagin, A., Verfaillie, J., Vesala, T., Vincke, C., Vitale, D., Vygodskaya, N., Walker, J. P., Walter-Shea, E., Wang, H., Weber, R., Westermann, S., Wille, C., Wofsy, S., Wohlfahrt, G., Wolf, S., Woodgate, W., Li, Y., Zampedri, R., Zhang, J., Zhou, G., Zona, D., Agarwal, D., Biraud, S., Torn, M., and Papale, D.: The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data, Sci. Data, 7, 225, https://doi.org/10.1038/s41597-020-0534-3, 2020. 

Pleijel, H., Danielsson, H., and Broberg, M. C.: Benefits of the Phytotoxic Ozone Dose (POD) index in dose–response functions for wheat yield loss, Atmos. Environ., 268, 118797, https://doi.org/10.1016/j.atmosenv.2021.118797, 2022. 

Porcar-Castell, A., Tyystjärvi, E., Atherton, J., van der Tol, C., Flexas, J., Pfündel, E. E., Moreno, J., Frankenberg, C., and Berry, J. A.: Linking chlorophyll a fluorescence to photosynthesis for remote sensing applications: mechanisms and challenges, J. Exp. Bot., 65, 4065–4095, https://doi.org/10.1093/jxb/eru191, 2014. 

Raoult, N., Douglas, N., MacBean, N., Kolassa, J., Quaife, T., Roberts, A. G., Fisher, R., Fer, I., Bacour, C., Dagon, K., Hawkins, L., Carvalhais, N., Cooper, E., Dietze, M. C., Gentine, P., Kaminski, T., Kennedy, D., Liddy, H. M., Moore, D. J. P., Peylin, P., Pinnington, E., Sanderson, B., Scholze, M., Seiler, C., Smallman, T. L., Vergopolan, N., Viskari, T., Williams, M., and Zobitz, J.: Parameter estimation in land surface models: Challenges and opportunities with data assimilation and machine learning, J. Adv. Model. Earth Syst., 17, e2024MS004733, https://doi.org/10.1029/2024MS004733, 2025. 

Running, S. W., Nemani, R. R., Heinsch, F. A., Zhao, M., Reeves, M., and Hashimoto, H.: A continuous satellite-derived measure of global terrestrial primary production, BioScience, 54, 547–560, https://doi.org/10.1641/0006-3568(2004)054[0547:ACSMOG]2.0.CO;2, 2004. 

Serbin, S. P., Singh, A., Desai, A. R., Dubois, S. G., Jablonski, A. D., Kingdon, C. C., Kruger, E. L., and Townsend, P. A.: Spectroscopic determination of leaf morphological and biochemical traits for northern temperate and boreal tree species, Ecol. Appl., 24, 1651–1669, https://doi.org/10.1890/13-2110.1, 2014. 

Sitch, S., Cox, P. M., Collins, W. J., and Huntingford, C.: Indirect radiative forcing of climate change through ozone effects on the land-carbon sink, Nature, 448, 791–794, https://doi.org/10.1038/nature06059, 2007. 

Sung, C.-H., Wijaya, C., Asri, A. K., Chern, Y.-R., Teo, T.-A., Liu, W.-Y., Lee, C.-H., Hsu, Y.-L., and Wu, C.-D.: A high-resolution machine learning and multi-source remote sensing approach for estimating net primary productivity in campus green spaces, Sci. Rep., 15, 44609, https://doi.org/10.1038/s41598-025-28350-6, 2025. 

Taylor, K. E.: Summarizing multiple aspects of model performance in a single diagram, J. Geophys. Res.-Atmos., 106, 7183–7192, https://doi.org/10.1029/2000JD900719, 2001. 

Tilmes, S., Hodzic, A., Emmons, L. K., Mills, M. J., Gettelman, A., Kinnison, D. E., Marsh, D. R., Vitt, F., Bardeen, C., Buchholz, R. R., Conley, A., Garcia, R. R., Gaston, C. J., Jo, D. S., Liu, X., Lamarque, J.-F., Núñez, C., Parworth, C., Richardson, M., Schmidt, A., and Sulprizio, M. P.: Climate forcing and trends of organic aerosols in the Community Earth System Model (CESM2), J. Adv. Model. Earth Syst., 11, 4323–4351, https://doi.org/10.1029/2019MS001827, 2019. 

Tjoelker, M. G. and Luxmoore, R. J.: Soil nitrogen and chronic ozone stress influence physiology, growth and nutrient status of Pinus taeda L. and Liriodendron tulipifera L. seedlings, New Phytol., 119, 69–81, https://doi.org/10.1111/j.1469-8137.1991.tb01009.x, 1991. 

Turnock, S. T., Allen, R. J., Andrews, M., Bauer, S. E., Deushi, M., Emmons, L., Good, P., Horowitz, L., John, J. G., Michou, M., Nabat, P., Naik, V., Neubauer, D., O'Connor, F. M., Olivié, D., Oshima, N., Schulz, M., Sellar, A., Shim, S., Takemura, T., Tilmes, S., Tsigaridis, K., Wu, T., and Zhang, J.: Historical and future changes in air pollutants from CMIP6 models, Atmos. Chem. Phys., 20, 14547–14579, https://doi.org/10.5194/acp-20-14547-2020, 2020. 

Vermeuel, M. P., Millet, D. B., Farmer, D. K., Ganzeveld, L. N., Visser, A. J., Alwe, H. D., Bertram, T. H., Cleary, P. A., Desai, A. R., and Helmig, D.: A vertically resolved canopy improves chemical transport model predictions of ozone deposition to north temperate forests, J. Geophys. Res.-Atmos., 129, e2024JD042092, https://doi.org/10.1029/2024JD042092, 2024. 

Visser, A. J., Ganzeveld, L. N., Goded, I., Krol, M. C., Mammarella, I., Manca, G., and Boersma, K. F.: Ozone deposition impact assessments for forest canopies require accurate ozone flux partitioning on diurnal time-scales, Atmos. Chem. Phys., 21, 18393–18411, https://doi.org/10.5194/acp-21-18393-2021, 2021. 

Wang, P., Li, X. Y., Wang, L., Wu, X., Hu, X., Fan, Y., and Tong, Y.: Divergent evapotranspiration partition dynamics between shrubs and grasses in a shrub-encroached steppe ecosystem, New Phytol., 219, 1325–1337, https://doi.org/10.1111/nph.15237, 2018. 

Wiedinmyer, C., Akagi, S. K., Yokelson, R. J., Emmons, L. K., Al-Saadi, J. A., Orlando, J. J., and Soja, A. J.: The Fire INventory from NCAR (FINN): a high resolution global model to estimate the emissions from open burning, Geosci. Model Dev., 4, 625–641, https://doi.org/10.5194/gmd-4-625-2011, 2011. 

Wild, B., Teubner, I., Moesinger, L., Zotta, R.-M., Forkel, M., van der Schalie, R., Sitch, S., and Dorigo, W.: VODCA2GPP – a new, global, long-term (1988–2020) gross primary production dataset from microwave remote sensing, Earth Syst. Sci. Data, 14, 1063–1085, https://doi.org/10.5194/essd-14-1063-2022, 2022. 

Wittig, V. E., Ainsworth, E. A., and Long, S. P.: To what extent do current and projected increases in surface ozone affect photosynthesis and stomatal conductance of trees? A meta-analytic review of the last 3 decades of experiments, Plant Cell Environ., 30, 1150–1162, https://doi.org/10.1111/j.1365-3040.2007.01717.x, 2007. 

Xiao, J., Chevallier, F., Gomez, C., Guanter, L., Hicke, J., Huete, A. R., Ichii, K., Ni, W., Pang, Y., Rahman, A. F., Sun, G., Yuan, W., Zhang, L., Zhang, X., and Zhao, M.: Remote sensing of the terrestrial carbon cycle: A review of advances over 50 years, Remote Sens. Environ., 233, 111383, https://doi.org/10.1016/j.rse.2019.111383, 2019. 

Xu, Y., Shang, B., Feng, Z., and Tarvainen, L.: Effect of elevated ozone, nitrogen availability and mesophyll conductance on the temperature responses of leaf photosynthetic parameters in poplar, Tree Physiol., 40, 484–497, https://doi.org/10.1093/treephys/tpaa007, 2020. 

Xu, Y., Shang, B., Peng, J., Feng, Z., and Tarvainen, L.: Stomatal response drives between-species difference in predicted leaf water-use efficiency under elevated ozone, Environ. Pollut., 269, 116137, https://doi.org/10.1016/j.envpol.2020.116137, 2021. 

Zhao, M., Heinsch, F. A., Nemani, R. R., and Running, S. W.: Improvements of the MODIS terrestrial gross and net primary production global data set, Remote Sens. Environ., 95, 164–176, https://doi.org/10.1016/j.rse.2004.12.011, 2005. 

Zhou, P.: Modified ozone stress parameterization schemes for vegetation damage simulation in CTSM 5.3: code, Zenodo [code], https://doi.org/10.5281/zenodo.18323484, 2026a. 

Zhou, P.: Modified ozone stress parameterization schemes for vegetation damage simulation in CTSM 5.3; data, Zenodo [data set], https://doi.org/10.5281/zenodo.18324274, 2026b. 

Zhou, P., Chou, J., Ye, S., Yang, L., Sun, M., Li, P., Wang, H., Luo, J., Cao, Z., Yao, Q., Zhang, H., and Pei, H.: A novel ensemble learning algorithm integrating WRF-CMAQ and downscaling models for hourly estimation of regional air pollution along with vegetation exposure risk detection, Int. J. Digit. Earth, 18, 2509825, https://doi.org/10.1080/17538947.2025.2509825, 2025a. 

Zhou, P., Chou, J., Sun, M., Zhao, W., Li, Y., Pei, H., Jin, H., and Bilal, M.: Quantifying vegetation health exposure risk and spatial inequality along with their impact on vegetation, Int. J. Digit. Earth, 18, 2506492, https://doi.org/10.1080/17538947.2025.2506492, 2025b. 

Zhou, X., Yue, X., Tian, C., and Lu, X.: Global assessment of climatic responses to ozone–vegetation interactions, Atmos. Chem. Phys., 24, 9923–9937, https://doi.org/10.5194/acp-24-9923-2024, 2024.  

Zhu, Q., Li, F., Riley, W. J., Xu, L., Zhao, L., Yuan, K., Wu, H., Gong, J., and Randerson, J.: Building a machine learning surrogate model for wildfire activities within a global Earth system model, Geosci. Model Dev., 15, 1899–1911, https://doi.org/10.5194/gmd-15-1899-2022, 2022. 

Download
Short summary
We assessed the impact of ozone damage representations in a land-surface model on simulations of vegetation productivity. Results varied depending on how ozone effects were triggered and how vegetation recovery was modeled. Schemes that incorporated vegetation-specific thresholds and memory effects on photosynthesis and water loss more accurately reflected spatial patterns, indicating directions for enhancing model realism and improving projections of ecosystem responses to ozone pollution.
Share