Articles | Volume 18, issue 23
https://doi.org/10.5194/gmd-18-9855-2025
https://doi.org/10.5194/gmd-18-9855-2025
Development and technical paper
 | 
10 Dec 2025
Development and technical paper |  | 10 Dec 2025

rsofun v5.1: a model-data integration framework for simulating ecosystem processes

Josefa Arán Paredes, Fabian Bernhard, Koen Hufkens, Mayeul Marcadella, and Benjamin D. Stocker
Abstract

Mechanistic vegetation models serve to estimate terrestrial carbon fluxes and climate impacts on ecosystems across diverse biotic and abiotic conditions. Systematically informing them with data is key for enhancing their predictive accuracy and estimating uncertainty. Here, we demonstrate and evaluate the Simulating Optimal FUNctioning (rsofun) R package, which provides a computationally efficient implementation of the P-model for site-scale simulations of ecosystem photosynthesis and the acclimation of photosynthetic traits, complemented with functionalities for Bayesian model-data integration and the estimation of model parameters and prediction uncertainty. We estimated model parameters simultaneously from observed time series of ecosystem gross primary productivity (GPP), and from globally distributed data on leaf carbon-13 isotopic discrimination (Δ13C) and the ratio of the maximum biochemical rates of carboxylation to electron transport (Vcmax/Jmax). The multi-target calibration yielded unbiased predictions for all variables simultaneously and produced similar distributions of prediction–observation residuals for both calibration and out-of-sample test data, indicating that the model generalises robustly across diverse environments. We found that a step-wise approach to successive model integration and calibration yielded best results, and that correlations among parameters related to the representation of water stress effects underpinned non-robust parameter estimations. This likely indicates a dominant source of model structural uncertainty related to the representation of the response of photosynthesis to dry conditions in soil and air.

Share
1 Introduction

The modelling of land ecosystem processes and structure, water, and carbon fluxes relies on both mechanistic and statistical approaches (Dietze et al.2018; Hartig et al.2012; Van Oijen et al.2005). Mechanistic models are formulated as mathematical descriptions of functional relationships between the abiotic environment and ecosystem states, rates, and dynamics. These descriptions reflect available theory and general empirical patterns and provide a means for translating hypotheses about governing principles and causal relationships into testable predictions (Marquet et al.2014), and for upscaling model-based estimates in geographical space and to novel environmental conditions. However, mechanistic models rely also on empirical descriptions of processes at varying levels of abstraction.

Mechanistic models have model parameters that are either specified or fitted to data. A great advantage of mechanistic models is that they explicitly link known physical constants with process representations (e.g., molecular mass of CO2 for diffusion and assimilation, or the gravitational constant and viscosity of water for its transport and transpiration). Other parameters may be specified based on independent measurements under controlled conditions (e.g., the activation energy of Arrhenius-type metabolic rates), or represent measurable plant functional traits, taken as constant over time and within plant functional types (PFTs – the basic unit in mechanistic vegetation models). Both types of parameters have traditionally been directly specified in models (“direct parameterization”, Hartig et al.2012). Yet other model parameters may not be directly observable and describe processes that are not explicitly resolved but can be described at a higher level of abstraction. Such parameters are often fitted to observational data such that the agreement between one (or several) related model predictions and observations is optimised. Parameter estimation for mechanistic vegetation models typically employs generic optimization algorithms or Bayesian statistical approaches and is often used to specify diverse types of parameters (except for universal physical constants). Bayesian methods have the advantage that they enable a systematic assessment of the correlation structure among multiple fitted parameters, provide a means to consider uncertainty in inputs, observations, models, and available prior information, generate probabilistic parameter estimations and model predictions, and provide a basis to quantify the constraints by various calibration target data and to identify errors arising from model structural choices (Bagnara et al.2015; Dietze et al.2018; Hartig et al.2012; van Oijen2017; Raj et al.2016; Van Oijen et al.2005; Xiao et al.2019).

As the number of parameters increases in state-of-the-art mechanistic vegetation models, taking into account multiple PFTs and ecosystem components (e.g. soil, microbes, hydrology), larger amounts of data and computing resources are required to fully explore the parameter space (Hartig et al.2012). This poses a limitation for systematic model-data integration and Bayesian parameter estimation. Eco-evolutionary optimality (EEO) principles have been proposed for reducing model complexity and for a robust grounding of models in governing principles (Franklin et al.2020; Harrison et al.2021). They enable parameter-sparse representations, limit the distinction of separate PFTs, and may enable better model generalisations to novel environmental regimes. As such EEO principles make predictions of plant functional traits that would otherwise have to be prescribed – typically as temporally fixed model parameters. Parameters in EEO models are considered to be universally valid, e.g., across different PFTs. Ideally, they represent known physical constants or quantities that can be measured independently. However, not all can be measured directly, e.g., the marginal cost of water in Cowan and Farquhar (1977), or the unit cost ratio in Prentice et al. (2014) and need to be fitted to data.

The P-model (Prentice et al.2014; Wang et al.2017; Stocker et al.2020) is an example of an EEO-guided model for terrestrial photosynthesis and its acclimation. It avoids the requirement for prescribing PFT-specific parameters of photosynthesis and stomatal regulation but instead predicts them from universal EEO principles for the full range of environmental conditions across the Earth's (C3 photosynthesis-dominated) biomes. Through its representation of the vegetation as a big leaf (Bonan et al.2021; De Pury and Farquhar1997), it represents the scaling between leaf traits and ecosystem-level photosynthetic CO2 uptake (gross primary productivity, GPP). However, despite its foundation on EEO theory and resulting parameter-sparseness, a small set of model parameters remains (Stocker et al.2020) (Table 2) and must be fitted to data.

Here, we explore the respective constraints provided by ecosystem-level fluxes and leaf-level traits for a probabilistic (Bayesian) multi-target estimation of the P-model parameters. The selection of observational target data types is motivated by their known effectiveness in model calibration and parameter estimation from previous work (Prentice et al.2014; Wang et al.2017; Stocker et al.2020). Specifically, we use observations-based GPP time series from multiple eddy covariance measurement sites, and compilations of globally distributed measurements of leaf traits, including the leaf carbon isotopic fractionation relative to the atmosphere (Δ13C, hereafter shortened to Δ), and the ratio of the maximum biochemical rate of carboxylation to electron transport (Vcmax/JmaxVJ). Data from these two leaf traits have previously been used for independently estimating separate model parameters in the P-model (Wang et al.2017). In Stocker et al. (2020), these independently estimated model parameters where then specified for model simulations (direct parametrization), and were not subject to model parameter calibration. Here, we demonstrate how the combined consideration of diverse data types within a Bayesian model-data integration framework – combining ecosystem flux data and leaf traits data – enables the simultaneous estimation of a comprehensive set model parameters that control functional dependencies of processes at multiple organisational levels – from the leaf to the canopy. This enables a better understanding of interdependencies between model parameters and a more reliable estimation of model prediction uncertainty.

Unbalanced observations of multiple calibration targets can lead to parameter estimates that compensate structural errors in the model, as shown previously with synthetic data (MacBean et al.2016; Oberpriller et al.2021; Cameron et al.2022). We are therefore interested in the questions (i) if the P-model can be calibrated in a consistent manner to these targets resulting in unbiased parameter estimates (relative to expected ranges based on our process-based interpretation of each parameter), and (ii) if the calibrated model generates unbiased predictions with consistent precision on independent test data set.

We start by providing a brief description of the P-model and introduce the P-model implementation in the Simulating Optimal FUNctioning rsofun version v5.1 modelling framework, made available as an R package (Stocker et al.2025). A more comprehensive description of the P-model can be found in Stocker et al. (2020). rsofun implements the P-model and its connection to data through Bayesian parameter optimisation and analysis. We demonstrate the functionalities implemented in and accessible through rsofun with different calibration setups – i.e., different combinations of model parameters subjected to Bayesian calibration and different observational data types used for calibration. Alternative calibration setups serve to elucidate the role of different observations in constraining estimates for different model parameters. rsofun functionalities demonstrated here include the flexible specification of likelihood functions for connecting model predictions to specific data types, Bayesian model calibration using Monte Carlo Markov Chain sampling, the analysis of posterior distributions of estimated parameters, and the estimation of prediction uncertainty, probabilistic predictions of GPP, VJ, and Δ, and their evaluation against out-of-sample test data.

2 Methods

2.1 P-model description

The P-model predicts the acclimation of leaf-level photosynthesis to a (slowly varying) environment based on EEO principles. It thereby yields a parameter-sparse representation of ecosystem-level quantities, generalising across (C3 photosynthesis-dominated) vegetation types and biomes. The P-model combines established theory for C3 photosynthesis following the Farquhar–von Caemmerer–Berry (FvCB) model (Farquhar et al.1980) with the Least-Cost hypothesis for the optimal balancing of water loss and carbon gain (Prentice et al.2014), and the coordination hypothesis (Wang et al.2017), which states that the light and Rubisco-limited assimilation rates (as described by the FvCB model) are equal for representative daytime environmental conditions.

The theory results in a prediction of the ratio of leaf-internal to ambient CO2 concentration (ci:caχ) as a function of the atmospheric environment, characterised by the following meteorological variables: daytime mean air temperature T (°C), daytime mean vapor pressure deficit D (Pa), and the atmospheric CO2 partial pressure (ca, Pa).

(1)χ=Γ*ca+1-Γ*caξξ+D(2)ξ=βK+Γ*1.6η*

Γ* is the photorespiratory compensation point (Pa). η* is the ratio the (temperature-dependent) viscosity of water, relative to its value at 25 °C, and K is the Michaelis–Menten coefficient for photosynthesis (Pa). β (unitless) is the unit cost ratio of carboxylation to transpiration in the EEO framework of Prentice et al. (2014), and is calibrated to data here (see Table 1). The functional dependency of Γ* on temperature and atmospheric pressure, the dependency of η* on temperature, and the dependency of K on temperature and atmospheric pressure are described in detail in Stocker et al. (2020) based on published work (Farquhar et al.1980; Bernacchi et al.2001; Huber et al.2009). Involved parameters are held fixed here and are not calibrated.

A set of corollary predictions, physically and physiologically consistent with the predicted χ, follow. The following predicted quantities are used for model-data integration here. A complete description of the mathematical derivation of these quantities from first principles is given in Stocker et al. (2020).

2.1.1 Isotope fractionation by photosynthesis

χ directly controls isotopic discrimination of carbon assimilates (Δ) relative to the atmospheric signature (δ13Ca) (Farquhar et al.1989, 1982; Lavergne et al.2020).

(3) Δ = a Δ + b Δ - a Δ χ - f Δ Γ * c a

Here, parameters represent the isotope fractionation from CO2 diffusion in air (aΔ=4.4 ‰, Craig1953), from Rubisco carboxylation (bΔ=27 ‰), and from photorespiration (fΔ=8.0 ‰, Ubierna and Farquhar2014). Also these parameters were held fixed here and not subjected to calibration.

2.1.2 Maximum rates of carboxylation and electron transport

For daytime conditions, averaged over multiple days, the P-model assumes the Rubisco carboxylation-limited and the electron transport-limited rates of photosynthesis to be equal:

(4) A C = A J .

Following the Farquhar–von Caemmerer–Berry (FvCB) model for C3 photosynthesis (Farquhar et al.1980; von Caemmerer and Farquhar1981), the maximum rate of carboxylation Vcmax can thus be expressed as

(5) V cmax = φ 0 I abs m m C ,

with

(6) m C = c i - Γ * c i + K ,

and with

(7) m = m 1 - c * m 2 / 3

and

(8) m = c i - Γ * c i + 2 Γ * .

Here, φ0 is the intrinsic quantum yield of photosystem II (mol mol−1) which depends on the leaf temperature T (see Eq. 14) – here taken as identical to air temperature. c* (unitless) is the unit cost of electron transport and is treated as a calibratable model parameter (see Table 2 for an overview of calibrated model parameters). Iabs is the photosynthetic photon flux density absorbed by the leaf.

Equation (7) accounts for a limited electron transport capacity (Jmax) such that m can also be written as

(9) m = m 1 1 + 4 φ 0 I abs J max 2 .

Again using AC=AJ, Jmax can be solved for and can be expressed as

(10) J max = 4 φ 0 I abs L - 2 - 1 ,

with

(11) L = V cmax c i + 2 Γ * φ 0 I abs c i + K .

The ratio VJ=Vcmax/Jmax is finally calculated by dividing respective values obtained with Eqs. (5) and (10).

2.1.3 Gross primary productivity

Gross primary productivity (GPP) can be expressed in the form of a light use efficiency model (Prentice et al.2024; Bao et al.2022; Monteith1972):

(12) GPP = fAPAR PPFD LUE ,

with fAPAR being the fraction of absorbed photosynthetically active radiation (unitless), PPFD being the photosynthetic photon flux density, PPFD (mol s−1 m−2), and LUE (g C mol−1) being the light use efficiency, calculated as

(13) LUE = φ 0 ( T ) f β ( θ ) M C m .

Here, fβ is the unitless soil moisture stress function, varying between 0 and 1 (see Eq. 15), with θ representing the plant-available soil water content in mm. MC is the molar mass of C (12.0107 g mol−1). Note that the application of Eq. (12) assumes GPP to scale linearly with absorbed light. This functional relationship is assumed here to describe the relationship between multi-day sums of GPP and PAR and emerges from the assumption of the Coordination Hypothesis (Wang et al.2017). However, it cannot be expected to describe the functional dependencies at shorter time scales, where the limitation by the electron transport capacity (Jmax) becomes effective (Mengoli et al.2022; Farquhar et al.1980).

Such acclimation is considered by employing the P-model theory to gradually varying environmental conditions (P, D, CO2, and PAR) where variations are damped by a low-pass filter with a characteristic, empirically determined time scale τ (days) (see Eq. A1 for the definition of the low-pass filter).

2.1.4 Quantum yield efficiency

The temperature dependency of the quantum yield efficiency φ0 is empirically parametrized as:

(14) φ 0 ( T ) = φ 0 * 1 + a φ T - b φ 2 , bounded to the range 0 , φ 0 * .

This is in contrast to the formulation used in Stocker et al. (2020), where aφ and bφ were effectively prescribed and not subjected to calibration.

2.1.5 Soil moisture stress

Soil moisture stress is computed as

(15) f β ( θ ) = 1 - θ - θ * 2 θ * 2 if θ θ * , 1 if θ > θ *

where θ* (mm) represents the threshold below which GPP is reduced. The soil water balance is simulated with a bucket model of soil water content (uniform total water storage capacity), whereby the dynamics of θ (mm) considers daily precipitation, the snow melt, a Priestley-Taylor-based evapotranspiration estimate, and runoff when the bucket is full. Except for implicitly enforcing fβ(θ=0)=0 here, this formulation follows the description in Stocker et al. (2020) and is based on the SPLASH model (Davis et al.2017).

2.2 Calibration setups

Different calibration setups were used to illustrate the constraints imposed by the different calibration targets for fitting different sets of parameters (Table 1).

Table 1Overview of targets used and parameters estimatable in the six calibration setups. For parameter description and units, see Table 2. Entries “S3” mean the posterior of that setup had been used as (truncated) prior.

Download Print Version | Download XLSX

Table 2Model parameters, their descriptions, and estimated values from different setups. Values refer to the Maximum A Posteriori (MAP) estimates. Values in square brackets represent the lower and upper bounds of uniform prior distributions or truncated normal prior distributions. 𝒩 represents a normal distribution with its mean and variance in parenthesis. Parameters that were held fixed are shown as empty cells.

a Truncated to [0.0, 40.0]. b Truncated to [187.5 to 228.2].

Download Print Version | Download XLSX

The choice of calibration setups was guided by our expectation of constraints on different model parameters, provided by the different target data types, given the model structure. Using common priors across (most) setups, the resulting parameter estimates can illustrate the constraints imposed by data alone. Setup S1 is chosen in view of the expected constraint of Δ data on model parameter β, Setup S2 for the expected constraint of VJ data on model parameters β and c*, Setup S3 for these data combined constraint on both parameters simultaneously. Setup S4 analyses the constraint of GPP data on all model parameters simultaneously. Setup S5 analyses the constraint of using all three data types on all model parameters simultaneously. Setup S6 reduces the “freedom” of Setup S5 in that the posteriors from Setup 3 for β and c* were used as priors. This corresponds to a stepwise approach and was chosen in view of results obtained for Setup S5.

We use the implementation of P-model in the rsofun R package (Stocker et al.2025) to model daily gross primary productivity, GPP, and leaf-level traits, namely the ratio of the rates of photosynthetic capacities in the FvCB model, VJ, and the isotopic fractionation of assimilated carbon, Δ. Implementation details of the rsofun framework are provided in Appendix B.

Three latent (not directly observable) parameters govern the optimality-guided water-carbon trade-off: the unit cost ratio, β, (governing the balancing of maintaining the carboxylation capacity versus the transpiration stream), the marginal cost of maintaining the electron transport rate, c*, and the quantum yield efficiency φ0 (parametrized with φ0*, aφbφ).

The parameters τ, β, c* have previously been calibrated separately to data and have been been specified as fixed model parameters in the P-model (Stocker et al.2020; Wang et al.2017). Here they were instead calibrated simultaneously to multiple calibration targets, together with additional parameters.

For simplicity, the same θ*, the soil water volume (mm) below which plants are stressed, was used across all sites.

Three different calibration targets were used in this study. Δ represents accumulated assimilates and is influenced by stomatal opening through the leaf-internal to ambient carbon dioxide ratio. Equation (2) indicates its dependency on the parameter β. VJ is assumed constant throughout the season. Observations of this ratio (Eqs. 5 and 10) are expected to inform the model parameters β and c*. GPP(t) observations represent daily values of ecosystem-level photosynthetic CO2 uptake fluxes (with acclimation of LUE(t)). Equations (3)–(12) illustrate the dependence of these observations to model parameters β, c*, as well as θ*, τ, φ0*, aφ, and bφ.

Note that Vcmax and Jmax both scale nearly linearly with φ0 (Eqs. 5 and 10). This dependency to φ0 mostly cancels out when considering the ratio of the two VJ.

2.3 Calibration target and test data

The data set consisted of 50 sites with GPP flux time series (172 055 site-dates in total), 49 sites with a total of 597 individual VJ observations (multiple individual plants and/or species per site sampled), and 325 sites with a total of 2357 Δ observations. Data for all variables were split by sites for model calibration (training) and out-of-sample testing (Fig. 2 and listed in Table S2). The split into training and test data sets was performed in a stratified manner according to vegetation type and land cover class (Beck et al.2018; Copernicus Climate Change Service2019; Hufkens and Stocker2025) to ensure balanced representations of each stratum in the test and training data set. Sites, classified as croplands or wetlands were removed, as well as any site with five or less complete years of data. We additionally required that a site contained more than 12 years of good-quality gap-free GPP data to be available as a training site. We used GPP data from 12 sites for training and data from the remaining 38 sites for testing. The VJ and the Δ data were split such that data from roughly 50 % of all sites in each stratum were used for training and testing, respectively.

GPP observations were taken from FluxDataKit (FDK v3.4.2) (Hufkens and Stocker2025; Pastorello et al.2020), which combines published releases of consistently processed eddy-covariance data from multiple regional networks (Ukkola2020; Warm Winter 2020 Team et al.2022; Drought 2018 Team et al.2020). GPP estimated through the nighttime flux partitioning was used (variable “GPP_NT_VUT_REF”, Reichstein et al.2005). For model calibration and evaluation, we filtered data to retain only (daily) values that were computed with at least 80 % measured or good-quality gap-filled (half-hourly) values. Based on visual inspection, the following site-year combinations exhibited spurious patterns and were additionally removed: ES-LJu, year 2006; US-Ho2, year 2007; CH-Dav, year 2010; US-Whs years 2016 and 2017.

Observations of VJ were obtained from the data compilation of Smith et al. (2019), containing data reported for top canopy from multiple sources (De Kauwe et al.2016; Keenan and Niinemets2016; Smith and Dukes2017; Kattge et al.2011; Wang et al.2018; Fürstenau Togashi et al.2018; Togashi et al.2018; Domingues et al.2010; Ferreira Domingues et al.2015; Serbin et al.2015; Tarvainen et al.2013; Ellsworth2016; Rogers et al.2017).

Observations of carbon-13 isotope discrimination in leaf material (Δ) were taken from a global data set (Cornwell et al.2018; Cornwell2025), subsetting only observations that were marked as C3 plants. We used Δ values that were derived from the isotopic signature of leaf material in relation to the atmospheric signature at the date and latitude measurements were made.

2.4 Forcing data

For simulations of GPP time series, daily meteorological measurements, obtained in parallel with GPP observations, were used as model forcing. Daily forcing data was taken from FluxDataKit (FDK v3.4.2) (Hufkens and Stocker2025; Pastorello et al.2020) and was derived from half-hourly data. The required daily variables are listed in Tab. S1. We used mean daytime air temperature and vapor pressure deficit (based on half-hourly values during daylight conditions), daily minimum and maximum temperatures, daily sum of precipitation, and daily means of net radiation, atmospheric pressure and CO2 concentration.

For predictions of the two leaf traits (Δ, VJ), the P-model was forced with average climate conditions during the growing season, derived from the global WorldClim data set (Fick and Hijmans2017) (comprising monthly averages of daily minimum, maximum and average temperature, vapor pressure, and solar radiation) and considering geographic positions of the sites.

The monthly WorldClim data were temporally disaggregated to daily values through polynomial interpolation (daily minimum, maximum, and average temperatures, cloud cover fraction, solar radiation, and vapor pressure). Interpolated daily maximum and minimum temperatures were then combined to an average daytime temperature using location-based day length assuming a sinusoidal temperature profile (Davis et al.2017; Peng et al.2023). The average daytime vapor pressure deficit (D) was derived from the average vapor pressure and daily maximum and minimum temperature.

Daily values were averaged to conditions representing the growing season. Growing season was defined as the period with daily average temperature above 0 °C. Then, daytime temperature, vapor pressure deficit D and solar radiation were averaged (mean) across all days of the growing season and used as model forcing (T, D, PAR) for a non-temporally resolved single prediction of VJ or Δ for each site. Atmospheric pressure P was derived from the ETOPO-1 digital elevation model (NOAA National Geophysical Data Center2009), using site positions and assuming standard atmospheric pressure. CO2 was taken from yearly mean values from the Mauna Loa record (Keeling et al.2017), using the corresponding observation year (or the year 2000 if observation year was unavailable).

2.5 Bayesian calibration

We estimated model parameters β, c*, τ, φ0*, aφ, bφ, and θ* (see Table 2 for a description of parameters) in multiple combinations of parameters and target data (Sect. 2.3 and Table 1). Parameter estimation was done through a Bayesian calibration approach, using Markov chain Monte Carlo (MCMC) sampling (Clark2004; Dietze et al.2013), using the DREAMzs sampling algorithm (Vrugt et al.2009) as implemented in BayesianTools (Hartig et al.2023). Eight independent chains were run, each for 100 000 iterations split among three internal chains, burn-in period was set to 30 000 and convergence was checked visuall with trace plots and Gelman–Rubin statistics (Gelman and Rubin1992). Parameters were calibrated to all sites' data simultaneously and are thus assumed to be universal across space and environmental conditions.

2.5.1 Likelihood

The choice of likelihood summarized our assumptions about different sources of uncertainties. Uncertainties in model inputs (parameters p and forcings x), in model structure f, and in the measured observations y of all target types (van Oijen2017) combine as

y+εy=fp,x+εx+εf

where εy and εf represent (unknown) observational errors and model structural errors, respectively. εx is the error in the forcing data.

For all target variables, we assumed an additive and normally distributed model error term around the model prediction (Trotsiuk et al.2020) and expressed the fit to observed data via the Gaussian log-likelihood

(16) L ( p y ) = 1 2 π σ 2 exp - 1 2 σ 2 ( y - f ( p , x ) ) 2

with target-specific standard deviations σGPP, σΔ, and σVJ. These standard deviations of the error model were estimated together with the model parameters (Table 2). Individual observations were considered independent from each other, thus the total likelihood for a dataset simply multiplied the likelihoods of each observation. With this likelihood, we neglected input error εx=0 and lumped together observational uncertainties and model structural uncertainties into a single “mismatch” or “residual” uncertainty (van Oijen2017; Dietze et al.2013).

Since the P-model is conceived as a single-big-leaf model (Fig. 1), it represents average properties and fluxes for the whole canopy. The estimated residual uncertainty thus contains also a potential uncertainty due to the scale mismatch between observation and model. Moreover, across-tree and across-species variabilities are also included since the likelihood was computed for each VJ and Δ observations of individual trees.

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f01

Figure 1P-model inputs, outputs, and target observations for the parameter estimation. The model takes as inputs static site information, time series of meteorological forcings, simulation parameters, and common model parameters for all sites. The simulation returns a time series of several ecosystem fluxes, acclimating leaf traits and ecosystem states. By comparing these outputs with measured traits and flux data in a Bayesian calibration routine, model parameters can be estimated. The structure (named list) of the input data object (driver) used for the model call within the rsofun R package is given in the “arrow boxes” on the left. Variable names of predicted quantities are given in the “arrow boxes” under “output” and are described in the main text.

Download

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f02

Figure 2Global maps of site locations (red dots) with observations of the three target variables in the training and test data sets for (a) GPP, (b) VJ, (c) Δ.

While describing leaf-level quantities at relatively high mechanistic detail, the link between the leaf and the canopy-scale was not explicitly resolved. Instead, an empirical approach for leaf-to-canopy scaling of GPP was employed by treating the quantum yield parameter φ0 to be representative for the canopy-scale and allowing it to be calibrated to ecosystem-level GPP flux data.

2.5.2 Priors

Prior distributions were defined based on prior knowledge and kept the same across all calibration setups, except for setup S6, where the posteriors of the model parameters β and c* estimated from setup S3, were used as priors (Table 2 and shown in grey in Fig. 3). These posteriors from setup S3 were characterized as uni-variant normal distribution. For β, it was additionally truncated to the mean ± three times the standard deviation.

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f03

Figure 3Prior and posterior distributions of the calibrated model parameters and error model terms for different setups. The maximum a posteriori (MAP) estimates are indicated with a solid line. See Table 2 for parameter descriptions.

Download

The prior knowledge on the acclimation time scale τ was approximated by a normal distribution (with mean and variance of 𝒩(14, 82)) based on prior findings (Mäkelä et al.2004; Liu et al.2024; Mengoli et al.2022) and truncated to the range from 0 to 40 d. For all other parameters, uniform priors with distinct ranges were used. Ranges for β, c*, and φ0* were specified to range between 10 % and 300 % of published estimates of 146 (unitless), 0.41 (unitless), and 0.05 mol mol−1, respectively, (Stocker et al.2020; Wang et al.2017). We chose wide uniform priors with the aim that posteriors would solely be informed by the used observations. The optimal temperature bφ and shape parameter aφ were specified to range between 10 and 30 °C and −0.004 and −0.001 °C−2. The soil moisture limitation threshold θ* was specified to range between 1 and 250 mm. Given the lack of prior knowledge on the error parameters characterizing the combined structural and observational uncertainty large, uninformative priors were assumed.

2.6 Prediction uncertainty

The parameter sets generated by the MCMC chains provide the basis for model prediction including an estimation of the uncertainty in the predictions on the train and test data set. Here, we propagated both characterized uncertainties: the parametric and the residual (structural and observational) uncertainty.

Retaining 20 samples from the combined Markov chains, representative of the joint parameter posterior distribution (including parameter correlations), we ran the P-model for each set of parameters to predict the target variables for the test and training data set. These 20 sets of predictions represented the parametric uncertainty of the model.

Additionally, for each prediction (i.e. target, site, and date combination) three observational errors were drawn from the residual error model (characterized by σGPP, σΔ, and σVJ) and added to the prediction. These 60 sets of predictions represented the combined parametric and residual uncertainty of the model. These samplings appeared sufficient, since results did not change upon sampling 50×10 parameter/error pairs. A third comparison with observations was based on one set of predictions using the Maximum A Posteriori estimate as a single set of parameters and without considering the residual uncertainty.

3 Results

3.1 Calibrated parameters

MCMC sampling with DREAMzs of 8 parallel, independent chains took between 2.5 h (S2) and 68 h (S6) to reach 100 000 iterations for the setups S1 to S6. The trace plot of the chain of setup S6, including computations of the scale reduction factors (Gelman and Rubin1992) are reported in the Appendix (Fig. C1).

Posterior distributions of the estimated parameters varied across the different setups (Fig. 3). Setup S1 (using only Δ as observational target) constrained β to a maximum a posteriori (MAP) of 207 (unitless), a median of 208 and with an inter-quartile range (IQR) from 203 to 213 and the residual prediction error σΔ to a MAP of 2.18 (unitless), which corresponds to a 10 % of the mean of predicted Δ. Other parameters were not informed by the calibration and their posteriors remained largely identical to their prior distributions. This reflects model structural dependencies of parameters and predicted quantities (Δ is independent of the other model parameters, see Eq. 3).

Setup S2 (using only VJ as observational target) constrained c* to a MAP of 0.214 (unitless), (median of 0.397, IQR from 0.347 to 0.428). As revealed by the posterior correlation analysis, β and c showed a strong correlation (r=0.86, Fig. C2). These compensating effects were disentangled when simultaneously calibrating to Δ and VJ in setup S3. This constrained β to a MAP of 207.1 (unitless), (median of 207.9, IQR from 203.5 to 212.3) and c* to MAP of 0.419 (unitless) (median of 0.425, IQR from 0.410 to 0.439), slightly higher than in setup S2, and avoided the correlation of posteriors (Figs. 3 and C3). The error model parameters associated with the two targets Δ and VJ were estimated to MAPs of 2.18 ‰ and 0.178 (unitless).

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f04

Figure 4Density plot of residuals between predicted and observed (a) Δ, (b) VJ and (c) GPP from the training (grey) and test sets (green) for calibration setups S1, S2, S4, and S6. Model output is computed with parametric uncertainty (filled area, based on 20 samples from the posterior distribution) and residual uncertainty (solid line, based on 3 samples from error model). Model outputs are compared against individual observations (dates from all sites pooled for GPP, and individual observations of each site for Δ and VJ).

Download

Setups that use GPP as observational target and uninformed priors – setups S4, and S5 – yield estimates of β that are at the lower bound of the uniform prior range – i.e. 14.6 (unitless) –, while τ is estimated to be exactly 1 d. This indicates that GPP observations “push” estimates of β towards extremely high unit costs of transpiration in relation to carboxylation, and that no smoothing of the daily meteorological conditions (Eq. A1) was necessary to optimize the likelihood of observing the GPP data. However, while improving the likelihood of GPP, the fit with Δ observations was deteriorated in these setups, as indicated by an offset between model predictions and observations (Figs. 4 and 5). Only setup S6, using GPP in combination with a truncated and prior for β, informed by the reduced setup S3, mitigates this offset. Also here, the posterior estimate of β came to lie at the border of the truncated region – 14.6 (unitless). The error model parameter associated with the GPP target was estimated to a MAP of 2.04 g C m−2 s−1 in setup S6, which was slightly smaller than the error in setups S4 and S5. In the posterior parameters of setup S6, correlations of r=0.89 remain between φ0* and c* and of r=0.81 between φ0 and bφ (Fig. C4).

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f05

Figure 5Predicted versus observed values for different target variables (Δ, VJ, and GPP along columns) and calibration setups (S1, S2, S3, S4, S5, and S6, along rows), evaluated on the test set. Model output is computed with Maximum A Posteriori parameter values (MAP) for each calibration setup. Note that the MAP parameters from setup S1 result in null-predictions of VJ and GPP. Color indicates density, red line indicates a linear regression. Green panel backgrounds indicate which variables were used as targets for model calibration in the corresponding setup. Model outputs are compared against individual observations (predicted and observed daily GPP values from all sites pooled and site-specific predictions against all observations from the respective site for Δ and VJ).

Download

3.2 Prediction uncertainty

Model predictions were unbiased and residuals were of similar magnitude when evaluated on the test and on the training data sets (Fig. 4), which indicates a good generalizability of the parametrized model. Including structural and observational uncertainties on top of parametric uncertainties only slightly increased the deviations between predicted and observed targets in setup S6, with strongest relative increases for GPP.

Δ and GPP predictions based on MAP parameter sets of setup S6 showed no magnitude-dependent bias (linear slopes of regressions between predicted and observed values were close to 1), whereas VJ showed a prediction range that appears too large when compared to the observed range (slope close to 0, Fig. 5). Setup S6 showed a slightly worse root mean squared error (RMSE) than setup S5 for GPP, but clearly reduced RMSE for Δ and VJ.

Time series of GPP of a few select years on training site data showed that the model successfully reproduced seasonal patterns and differences across site in GPP for most sites (Fig. C5), with some shortcomings in accurate simulations of GPP during under dry conditions, as seen for site US-Var, and a general low bias for the moist tropical site of GF-Guy. The model also tends to systematically overestimate GPP in the early growing season at US-MMS and US-PFa – a known bias (Luo et al.2023).

4 Discussion

This study showed that a model for ecosystem photosynthesis and its acclimation to the environment can be robustly parameterised and that its predictions of multiple variables generalise well across a wide range of environmental conditions. Multiple model parameters can be estimated simultaneously by using diverse calibration target data types, combining ecosystem flux time series and static, species-specific traits data. This demonstrates how the explicit representation of connections between traits and process rates enable model-data integration on the basis of diverse observations, obtained at multiple organisational levels – from the leaf to the canopy. The robustness of the parameter estimates is indicated by the convergence of the MCMC chains (Fig. C1) and the resulting narrow posterior distributions of Bayesian parameter estimates (Fig. 3). Despite being calibrated to only a relatively small set of sites with GPP data (N=12), the calibrated model generalizes well, as validated against an independent and much larger set of test data for GPP from 38 sites (Figs. 4 and 5).

Our results confirmed the expectation that the use of multiple observational targets yields more robust parameter estimates compared to a calibration setup that relies on a single data source, and that specific observation types imposed constraints for specific model parameters. Leaf carbon fractionation observations, Δ, allowed to constrain the unit cost ratio of carboxylation to transpiration, β (Fig. 3). Observed ratios of the biochemical rates of carboxylation to electron transport, VJ, constrained β and c*, albeit with strong correlations between them, indicating compensating effects and a lack of robustness in resulting parameter estimates (Figs. 3 and C2). The combination of both these observation types allowed to constrain both parameters simultaneously, avoiding correlations between parameter estimates. This indicates that the use of two observational targets simultaneously made use of their complementary information content for parameter estimation in our model (Fig. C3).

Despite the general robustness of parameter estimates, we found several limitations and aspects that indicate challenges for model calibration in our case. When observations of GPP were included in calibration setups, parameter estimates of β differed strongly from results obtained from setups that used only observations of Δ and tended towards the lower margin of the uniform prior range – substantially lower than the value used for direct parameterisation of the model in previous work (Wang et al.2017). β represents the ratio between the unit cost of carboxylation to transpiration within the EEO modelling framework applied here (Prentice et al.2014). An extremely low value of β implies relatively high costs associated with transpiration, which is driven by vapor pressure deficit D. The calibration tending towards low values of β potentially reflects a compensating effect for the lack of GPP reductions under conditions of dry soils, e.g., during the dry summer periods at the site US-Var (Fig. C5). In other words, this apparent lack of robustness of parameter estimate may indicate a misspecification of the model structure. This interpretation could be tested with targeted setups (e.g., removing dry sites from the calibration data set) or by alternative specifications of the soil moisture stress that better accounts for its limiting effect on GPP.

To address this challenge and avoid unexpectedly low estimates of the unit cost ratio parameter β, we resorted to a step-wise Bayesian calibration (MacBean et al.2016) and used the posterior distribution of setup S3 as a prior in setup S6. This resulted in the disappearance of offsets in Δ observation, a c* closer to, but not at, the upper limit of the prior (MAP = 0.58), and a sightly lower φ0* than in GPP-only setup (S4) or in the unconstrained GPP-Δ-VJ setup (S5). The step-wise posterior estimation and prior specification of P-model parameters in setup S6 yielded estimates of all parameters that compared favourably with previous estimates (Stocker et al.2020; Wang et al.2017). This study estimated (MAP): β=208 (unitless, compared with 146 and the range from 200 to 240), c*=0.58 (unitless, compared with 0.41), and φ0*=0.05 mol mol−1 (compared to 0.05). However, the MAP estimate of τ=1, indicated an instantaneous acclimation appeared to yield better agreement of model output with daily GPP observations, than a delayed acclimation using smoothed versions of the environmental conditions. This contrasts with previous estimations of the acclimation time scale being on the order of 14 to 15 d (Mäkelä et al.2004; Liu et al.2024; Mengoli et al.2022).

Remaining correlations in posterior parameters of setup S6 between φ0* and c* indicate some equifinality. These compensating effects indicate another potential implicit “misinterpretation” of what the parameter c* represents in setup S2 versus setup S6, and/or model structural error and inaccurate process representation. The Bayesian approach allows propagating the effect of this into predictive uncertainties by using posterior distributions instead of point-estimates of parameters. Still, this entanglement might be resolved in future calibrations by fixing one of these two parameters or by using additional observations. Future work is needed to look into causes of this posterior variability of c*, identify potential observational constraints, and potentially to revise related model structures.

The calibrated model of setup S6 showed unbiased predictions against observations in an independent test data set, indicating the model's generalizability. Based on the universal validity of the EEO parameters across plant functional types and biomes, the calibrated P-model can may be scaled to new locations sites and environmental conditions. However, it should be noted that our estimates of prediction uncertainty and the finding of robust generalisability only applies to environmental conditions that are within the domain of (or similarly distanced to) sites used for training and testing here (Ludwig et al.2023). Further caveats apply. The choice of the specification of soil moisture stress, which uses a single global parameter θ*, may be overly simplistic for describing diverse physiological responses to dry-downs across sites characterised by different soil texture (Fu et al.2022; Wankmüller et al.2024) and the model neglects the highly variable rooting zone water storage capacities across space globally (Stocker et al.2023). Although generalization to the test data set did globally show unbiased predictions across all sites (Fig. 4), shortcomings were visible for certain sites (e.g., US-Var, Fig. 5) and warrant a re-consideration and potential revision of the related process representations. An improved representation of soil moisture stress effects and across-site variations of critical soil moisture thresholds and storage capacities may potentially also mitigate the need for targeted interventions into the calibration procedure, e.g., through the truncation of the prior for β or the step-wise approach to model calibration. More specifically, θ* could potentially be expressed by additionally considering soil structural information (Wankmüller et al.2024), which could be provided as additional information for each site. Alternatively, the Bayesian approach allows for the use of hierarchical models (van Oijen2017) that could make use of potential, globally available predictors (or covariates) for the site-specific parameters.

Generally, design choices related to the Bayesian likelihood specification and parameter calibration were adequate to showcase the sensitivity of parameter estimates across setups and retrieve an unbiased, generalizable model. While simple, the additive Gaussian error model chosen for the likelihood was sufficient to illustrate the constraints. However, more appropriate functional forms could be used – for example, ones that account for the fact that GPP values should be non-negative and that observational errors tend to scale with the magnitude of the flux (Cameron et al.2022), or that better separate observation uncertainty from model structural uncertainty (van Oijen2017). Independent estimates of observation uncertainty could be included as fixed parameters or as informed priors. This would potentially lead to a reduced residual error representing more closely the structural model uncertainty. Potential estimates for these could be the error on GPP measurements or errors of a trait measurement to represent the model scale (i.e. the ecosystem-level averages across species instead of individual observations). Alternatively, fitting to average observations instead of individual observations might reduce the (independent) observational errors. This would mean fitting to averaged traits over different species at a given site or weekly cumulative fluxes instead of daily values.

A strength of the Bayesian approach is to update model parameters iteratively with additional data by using previous posteriors as new prior distributions. Our illustration of this approach in setup S6 relied on manual extraction and specification of these distributions. A future development step for the rsofun framework would thus be to support easier specifications of these type of stepwise calibration (MacBean et al.2016) directly through the model interface.

The model employed here, the P-model, describes light absorption, utilisation, and CO2 uptake at an aggregated organisational scale – the canopy-level. Despite this simplified representation, we found that parameter interactions and structural model errors can to lead biases in parameter estimates. The stepwise calibration approach allowed to rectify (likely biased) estimates of β that deviated from estimates used in previous work (Prentice et al.2014; Wang et al.2017; Stocker et al.2020). This violated the initial aim of simultaneously estimating all parameters, but is interpreted here as an indication of where to find structural deficiencies and target model improvements (Oberpriller et al.2021). More comprehensive vegetation models of larger scope, e.g., Dynamic Global Vegetation Models (Sitch et al.2024), will likely face similar challenges, considering that structural deficiencies are unavoidable and more parameter interactions may arise through feedbacks in the soil-plant system. Our findings indicate that stepwise approaches to model calibration can be a solution to this challenge.

While this study looked at seven model parameters simultaneously, several other parameters were held fixed, namely those involved in describing temperature and pressure dependencies of physical and physiological quantities, isotope fractionation, etc. This simplified the calibration and parameter estimation task, but assumed that the uncertainty stemming from these is negligible for the prediction of target variables. This is a strong assumption and may be relaxed in future attempts at calibrating P-model.

Lastly, the chosen likelihood ignored input data uncertainty εx. For sites from which GPP estimates were obtained, we expect small uncertainty in meteorological variables thanks to local measurements. In contrast, fAPAR data is obtained from satellite remote sensing data, which is likely to include uncertainties. For sites from which Δ and VJ data was obtained, input forcing was based on a global dataset of high spatial resolution (∼1 km around the equator). However, topographical effects and related micro-climata that deviate from the larger climate could be remaining sources of errors.

The rsofun R package provides a user-friendly and efficient implementation of the P-model and off-the-shelf model-data assimilation functionalities through its connection to the BayesianTools R package (Hartig et al.2023), while maintaining flexibility for altered calibration setups and likelihood definitions. P-model's computational efficiency offered the potential for effective model parameter and uncertainty estimation using Bayesian statistical methods in combination with flux and traits data here.

Our model implementation as an R package takes inspiration from the r3PG forest model (Trotsiuk et al.2020), and our implementation of model-data integration on the basis of ecosystem data serves similar, yet reduced aims and functionalities compared to PEcAn (https://pecanproject.github.io/index.html, last access: 25 November 2025) (LeBauer et al.2013). rsofun is designed to be minimally reliant on package dependencies and connections to specific data, while limiting the scope to a predefined set of process models (currently P-model, BiomeE at an experimentation stage). rsofun and the implementation of our simulations and analyses shown here in accompanying code (see code availability statement) and package vignettes provide a blueprint for model-data assimilation for modelling terrestrial photosynthesis.

5 Conclusions

This study used rsofun (available as an R package) to calibrate latent P-model parameters to a set of flux and traits data, obtained from 193 training sites and 231 test sites. Predictions across all test sites showed that the calibrated model generalized well, not showing any biases and similar prediction residuals between train and test data. The Bayesian calibration also exhibited challenges. Structural uncertainty, the imbalanced data set, and a potentially too simplistic likelihood lead to biased parameter estimates and predictions in the first calibration attempt. An alternative calibration setup made use of a stepwise calibration and the hierarchical design of the model structure and predictions, enabling a successive model integration and parameter calibration. The necessity for this approach to obtain robust and reliable parameter estimations may indicate model structural deficiencies, which we identified here as primarily being related to the representation of effects by dryness in soil and air.

Appendix A: Acclimation

Damped acclimation to daily environmental conditions was considered for the GPP prediction, while for Δ and VJ predictions used growing season average conditions.

The the low-pass filter of characteristic time scale τ (days) is defined as

(A1) T ( t + 1 ) = T ( t ) + 1 τ ( T ( t + 1 ) - T ( t ) ) , initialized as T ( 0 ) = T ( 0 ) ,

where T is the low-pass filtered quantity (here temperature) and T the daily observations, resulting in a daily time series of the damped quantity. Equivalent expressions with the same τ are used for P, D, CO2, and PAR.

Appendix B: The rsofun model framework

rsofun implements the P-model (Stocker et al.2020) and provides off-the-shelf methods for Bayesian (probabilistic) parameter and prediction uncertainty estimation. rsofun is distributed as an R package on R's central and public package repository. rsofun also implements the BiomeE vegetation demography model (Weng et al.2017, 2015). The latter is not further described here and is implemented at an experimental stage in rsofun version v5.1. The P-model implementation in rsofun is designed for time series simulations by accounting for temporal dependencies in the acclimation to a continuously varying environment. Function wrappers in R make the simulation workflow user-friendly and all functions and input forcing data structures are comprehensively documented (https://geco-bern.github.io/rsofun, last access: 25 November 2025).

In rsofun, model parameters can be calibrated using a calibration function calib_sofun( ), providing two modes of calibration, one based on generalised simulated annealing (GenSA R package) for global optimization (Xiang et al.2013) and one based on Markov chain Monte Carlo (MCMC) implemented by the BayesianTools R package, giving access to a wide variety of Bayesian methods (Hartig et al.2023). The former being fast, while the latter provides more informed parameter optimization statistics (Clark2004; Dietze et al.2013). This gives the option for both exploratory and more in-depth analysis of estimated parameters. A set of standard cost functions are provided for the calibration, facilitating the exploration of various metrics or target variables and the specification of calibrated model parameters. Furthermore, the vignettes accompanying the package (https://geco-bern.github.io/rsofun/articles/, last access: 25 November 2025) explain how to customise the calibration cost functions and interpret the calibration results.

Appendix C: Posterior parameter estimates

C1 Posterior parameter sampling

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f06

Figure C1Trace plot of MCMC sampling of all parameters in setup S6.

Download

C2 Posterior parameter correlations

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f07

Figure C2Parameter correlation of posterior parameters in setup S2 with 100 000 iterations and 30 000 burn-in.

Download

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f08

Figure C3Parameter correlation of posterior parameters in setup S3 with 100 000 iterations and 30 000 burn-in.

Download

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f09

Figure C4Parameter correlation of posterior parameters in setup S6 with 100 000 iterations and 30 000 burn-in.

Download

C3 Posterior parameter predictions

https://gmd.copernicus.org/articles/18/9855/2025/gmd-18-9855-2025-f10

Figure C5Time series plot of observed (black crosses) and modelled daily gross primary production (GPP) for setup S6 for years 2012 to 2015 of the training data set. Model output is computed with parametric uncertainty (green shaded area, based on 20 samples from the posterior) and structural uncertainty (grey shaded area, based on 3 samples from error model).

Download

Code and data availability

The rsofun R package can be installed from CRAN (https://cran.r-project.org/package=rsofun, last access: 25 November 2025) or directly from its source code on GitHub (publicly available at https://github.com/geco-bern/rsofun, last access: 25 November 2025 under an AGPLv3 licence). Versioned releases of this GitHub repository are deposited on Zenodo (https://doi.org/10.5281/zenodo.17313273Stocker et al.2025). Code to reproduce the analysis and plots presented here is contained in the repository at https://github.com/geco-bern/rsofun_doc (last access: 25 November 2025) and archived on Zenodo (https://doi.org/10.5281/zenodo.17204361Bernhard and Stocker2025). The model forcing and evaluation data for GPP sites are based on the publicly available data, prepared by FluxDataKit v3.4.2 (https://doi.org/10.5281/zenodo.14808331Hufkens and Stocker2025). Scripts for generating these data files from open access sources are contained in the “rsofun_doc” repository in the subdirectory “data-raw/”. The model forcing for Δ and VJ sites are based on the publicly available WorldClim (https://doi.org/10.1002/joc.5086Fick and Hijmans2017), ETOPO1 (https://doi.org/10.7289/V5C8276MNOAA National Geophysical Data Center2009), and Mauna Loa CO2 (https://doi.org/10.6075/J08W3BHWKeeling et al.2017) data. The model evaluation data for Δ sites are based on data associated with Cornwell et al. (https://doi.org/10.1111/geb.127642018) available at Cornwell (https://doi.org/10.5281/zenodo.152392202025). The model evaluation data for VJ sites are based on the freely available subset of the data from Smith et al. (https://doi.org/10.1111/ele.132102019). Scripts for generating these data are contained in the “rsofun_doc” repository in subdirectory “data-raw/”. Outputs of those scripts and of the analysis presented here are archived in the subdirectory “data/” and “analysis-output/”.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gmd-18-9855-2025-supplement.

Author contributions

BDS and FB designed the study. BDS wrote the Fortran code, JAP wrote the calibration implementation in the R package, sensitivity analysis and uncertainty estimation. MM and FB prepared the package for publication to CRAN. KH and JAP wrote the initial draft of the manuscript. FB processed the observational data, ran the analysis, and finalized the manuscript. All authors contributed to manuscript writing.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We thank Florian Hartig for advice on uncertainty modelling and the maintenance of the BayesianTools package, Volodymyr Trotsiuk for an initial template of the R package, and Colin Prentice for comments on the manuscript. We thank Nick Smith and Wang Han for inputs, data, and code sharing. This work is a contribution to the LEMONTREE (Land Ecosystem Models based On New Theory, obseRvations and ExperimEnts) project, funded through the generosity of Eric and Wendy Schmidt by recommendation of the Schmidt Futures program (Koen Hufkens). Calculations were performed on AMD EPYC 7742 (2.25, 3.4 GHz) processors on UBELIX (https://www.id.unibe.ch/hpc, last access: 25 November 2025), the HPC cluster at the University of Bern.

Financial support

This research has been supported by the Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (grant no. PCEFP2_181115).

Review statement

This paper was edited by Carlos Sierra and reviewed by three anonymous referees.

References

Bagnara, M., Sottocornola, M., Cescatti, A., Minerbi, S., Montagnani, L., Gianelle, D., and Magnani, F.: Bayesian optimization of a light use efficiency model for the estimation of daily gross primary productivity in a range of Italian forest ecosystems, Ecol. Model., 306, 57–66, https://doi.org/10.1016/j.ecolmodel.2014.09.021, 2015. a

Bao, S., Wutzler, T., Koirala, S., Cuntz, M., Ibrom, A., Besnard, S., Walther, S., Šigut, L., Moreno, A., Weber, U., Wohlfahrt, G., Cleverly, J., Migliavacca, M., Woodgate, W., Merbold, L., Veenendaal, E., and Carvalhais, N.: Environment-sensitivity functions for gross primary productivity in light use efficiency models, Agr. Forest Meteorol., 312, 108708, https://doi.org/10.1016/j.agrformet.2021.108708, 2022. a

Beck, H. E., Zimmermann, N. E., McVicar, T. R., Vergopolan, N., Berg, A., and Wood, E. F.: Present and future Köppen-Geiger climate classification maps at 1-km resolution, Sci. Data, 5, 180214, https://doi.org/10.1038/sdata.2018.214, 2018. a

Bernacchi, C. J., Singsaas, E. L., Pimentel, C., Portis Jr., A. R., and Long, S. P.: Improved temperature response functions for models of Rubisco-limited photosynthesis, Plant Cell Environ., 24, 253–259, https://doi.org/10.1111/j.1365-3040.2001.00668.x, 2001. a

Bernhard, F. and Stocker, B.: geco-bern/rsofun_doc: v1.0.2, Zenodo [code], https://doi.org/10.5281/zenodo.17204361, 2025. a

Bonan, G. B., Patton, E. G., Finnigan, J. J., Baldocchi, D. D., and Harman, I. N.: Moving beyond the Incorrect but Useful Paradigm: Reevaluating Big-Leaf and Multilayer Plant Canopies to Model Biosphere-Atmosphere Fluxes – a Review, Agr. Forest Meteorol., 306, 108435, https://doi.org/10.1016/j.agrformet.2021.108435, 2021. a

Cameron, D., Hartig, F., Minnuno, F., Oberpriller, J., Reineking, B., Van Oijen, M., and Dietze, M.: Issues in calibrating models with multiple unbalanced constraints: the significance of systematic model and data errors, Meth. Ecol. Evol., 13, 2757–2770, https://doi.org/10.1111/2041-210X.14002, 2022.  a, b

Clark, J. S.: Why environmental scientists are becoming Bayesians: Modelling with Bayes, Ecol. Lett., 8, 2–14, https://doi.org/10.1111/j.1461-0248.2004.00702.x, 2004. a, b

Copernicus Climate Change Service: Land cover classification gridded maps from 1992 to present derived from satellite observations, https://doi.org/10.24381/CDS.006F2C9A, 2019. a

Cornwell, W.: traitecoevo/leaf13C: v0.2.2, Zenodo [data set], https://doi.org/10.5281/zenodo.15239220, 2025. a, b

Cornwell, W. K., Wright, I. J., Turner, J., Maire, V., Barbour, M. M., Cernusak, L. A., Dawson, T., Ellsworth, D., Farquhar, G. D., Griffiths, H., Keitel, C., Knohl, A., Reich, P. B., Williams, D. G., Bhaskar, R., Cornelissen, J. H. C., Richards, A., Schmidt, S., Valladares, F., Körner, C., Schulze, E., Buchmann, N., and Santiago, L. S.: Climate and soils together regulate photosynthetic carbon isotope discrimination within C3 plants worldwide, Global Ecol. Biogeogr., 27, 1056–1067, https://doi.org/10.1111/geb.12764, 2018. a, b

Cowan, I. R. and Farquhar, G. D.: Stomatal function in relation to leaf metabolism and environment, in: Society for Experimental Biology Symposium, vol. 31, edited by: Jennings, D., Cambridge University Press, Cambridge, 471–505, ISSN 0081-1386, ISBN 9780521216173, 1977. a

Craig, H.: The geochemistry of the stable carbon isotopes, Geochim. Cosmochim. Ac., 3, 53–92, https://doi.org/10.1016/0016-7037(53)90001-5, 1953. a

Davis, T. W., Prentice, I. C., Stocker, B. D., Thomas, R. T., Whitley, R. J., Wang, H., Evans, B. J., Gallego-Sala, A. V., Sykes, M. T., and Cramer, W.: Simple process-led algorithms for simulating habitats (SPLASH v.1.0): robust indices of radiation, evapotranspiration and plant-available moisture, Geosci. Model Dev., 10, 689–708, https://doi.org/10.5194/gmd-10-689-2017, 2017. a, b

De Kauwe, M. G., Lin, Y., Wright, I. J., Medlyn, B. E., Crous, K. Y., Ellsworth, D. S., Maire, V., Prentice, I. C., Atkin, O. K., Rogers, A., Niinemets, Ü., Serbin, S. P., Meir, P., Uddling, J., Togashi, H. F., Tarvainen, L., Weerasinghe, L. K., Evans, B. J., Ishida, F. Y., and Domingues, T. F.: A test of the `one‐point method' for estimating maximum carboxylation capacity from field‐measured, light‐saturated photosynthesis, New Phytol., 210, 1130–1144, https://doi.org/10.1111/nph.13815, 2016. a

De Pury, D. G. G. and Farquhar, G. D.: Simple Scaling of Photosynthesis from Leaves to Canopies without the Errors of Big-leaf Models, Plant Cell Environ., 20, 537–557, https://doi.org/10.1111/j.1365-3040.1997.00094.x, 1997. a

Dietze, M. C., Lebauer, D. S., and Kooper, R.: On improving the communication between models and data, Plant Cell Environ., 36, 1575–1585, https://doi.org/10.1111/pce.12043, 2013. a, b, c

Dietze, M. C., Fox, A., Beck-Johnson, L. M., Betancourt, J. L., Hooten, M. B., Jarnevich, C. S., Keitt, T. H., Kenney, M. A., Laney, C. M., Larsen, L. G., Loescher, H. W., Lunch, C. K., Pijanowski, B. C., Randerson, J. T., Read, E. K., Tredennick, A. T., Vargas, R., Weathers, K. C., and White, E. P.: Iterative near-term ecological forecasting: Needs, opportunities, and challenges, P. Natl. Acad. Sci. USA, 115, 1424–1432, https://doi.org/10.1073/pnas.1710231115, 2018. a, b

Domingues, T. F., Meir, P., Feldpausch, T. R., Saiz, G., Veenendaal, E. M., Schrodt, F., Bird, M., Djagbletey, G., Hien, F., Compaore, H., Diallo, A., Grace, J., and Lloyd, J.: Co-limitation of photosynthetic capacity by nitrogen and phosphorus in West Africa woodlands, Plant Cell Environ., 33, 959–980, https://doi.org/10.1111/j.1365-3040.2010.02119.x, 2010. a

Drought 2018 Team, ICOS Ecosystem Thematic Centre, ICOS Ecosystem Thematic Centre, and Trotta, C.: Drought-2018 ecosystem eddy covariance flux product for 52 stations in FLUXNET-Archive format, ICOS, https://doi.org/10.18160/YVR0-4898, 2020. a

Ellsworth, D.: A global dataset of photosynthetic CO2 response curves measured in the field at controlled light, CO2 and temperatures, Western Sydney University, https://doi.org/10.4225/35/569434CFBA16E, 2016. a

Farquhar, G., O'Leary, M., and Berry, J.: On the Relationship Between Carbon Isotope Discrimination and the Intercellular Carbon Dioxide Concentration in Leaves, Funct. Plant Biol., 9, 121, https://doi.org/10.1071/PP9820121, 1982. a

Farquhar, G. D., von Caemmerer, S., and Berry, J. A.: A biochemical model of photosynthetic CO2 assimilation in leaves of C3 species, Planta, 149, 78–90, https://doi.org/10.1007/BF00386231, 1980. a, b, c, d

Farquhar, G. D., Ehleringer, J. R., and Hubick, K. T.: Carbon Isotope Discrimination and Photosynthesis, Annu. Rev. Plant Biol., 40, 503–537, https://doi.org/10.1146/annurev.pp.40.060189.002443, 1989. a

Ferreira Domingues, T., Ishida, F. Y., Feldpausch, T. R., Grace, J., Meir, P., Saiz, G., Sene, O., Schrodt, F., Sonké, B., Taedoumg, H., Veenendaal, E. M., Lewis, S., and Lloyd, J.: Biome-specific effects of nitrogen and phosphorus on the photosynthetic characteristics of trees at a forest-savanna boundary in Cameroon, Oecologia, 178, 659–672, https://doi.org/10.1007/s00442-015-3250-5, 2015. a

Fick, S. E. and Hijmans, R. J.: WorldClim 2: new 1‐km spatial resolution climate surfaces for global land areas, Int. J. Climatol., 37, 4302–4315, https://doi.org/10.1002/joc.5086, 2017. a, b

Franklin, O., Harrison, S. P., Dewar, R., Farrior, C. E., Brännström, Å., Dieckmann, U., Pietsch, S., Falster, D., Cramer, W., Loreau, M., Wang, H., Mäkelä, A., Rebel, K. T., Meron, E., Schymanski, S. J., Rovenskaya, E., Stocker, B. D., Zaehle, S., Manzoni, S., van Oijen, M., Wright, I. J., Ciais, P., van Bodegom, P. M., Peñuelas, J., Hofhansl, F., Terrer, C., Soudzilovskaia, N. A., Midgley, G., and Prentice, I. C.: Organizing principles for vegetation dynamics, Nat. Plants, 6, 444–453, https://doi.org/10.1038/s41477-020-0655-x, 2020. a

Fu, Z., Ciais, P., Makowski, D., Bastos, A., Stoy, P. C., Ibrom, A., Knohl, A., Migliavacca, M., Cuntz, M., Šigut, L., Peichl, M., Loustau, D., El-Madany, T. S., Buchmann, N., Gharun, M., Janssens, I., Markwitz, C., Grünwald, T., Rebmann, C., Mölder, M., Varlagin, A., Mammarella, I., Kolari, P., Bernhofer, C., Heliasz, M., Vincke, C., Pitacco, A., Cremonese, E., Foltýnová, L., and Wigneron, J.-P.: Uncovering the critical soil moisture thresholds of plant water stress for European ecosystems, Global Change Biol/, 28, 2111–2123, https://doi.org/10.1111/gcb.16050, 2022. a

Fürstenau Togashi, H., Prentice, I. C., Atkin, O. K., Macfarlane, C., Prober, S. M., Bloomfield, K. J., and Evans, B. J.: Thermal acclimation of leaf photosynthetic traits in an evergreen woodland, consistent with the coordination hypothesis, Biogeosciences, 15, 3461–3474, https://doi.org/10.5194/bg-15-3461-2018, 2018. a

Gelman, A. and Rubin, D. B.: Inference from Iterative Simulation Using Multiple Sequences, Stat. Sci., 7, https://doi.org/10.1214/ss/1177011136, 1992. a, b

Harrison, S. P., Cramer, W., Franklin, O., Prentice, I. C., Wang, H., Brännström, Å., de Boer, H., Dieckmann, U., Joshi, J., Keenan, T. F., Lavergne, A., Manzoni, S., Mengoli, G., Morfopoulos, C., Peñuelas, J., Pietsch, S., Rebel, K. T., Ryu, Y., Smith, N. G., Stocker, B. D., and Wright, I. J.: Eco-evolutionary optimality as a means to improve vegetation and land-surface models, New Phytol., 231, 2125–2141, https://doi.org/10.1111/nph.17558, 2021. a

Hartig, F., Dyke, J., Hickler, T., Higgins, S. I., O'Hara, R. B., Scheiter, S., and Huth, A.: Connecting dynamic vegetation models to data – an inverse perspective: Dynamic vegetation models – an inverse perspective, J. Biogeogr., 39, 2240–2252, https://doi.org/10.1111/j.1365-2699.2012.02745.x, 2012. a, b, c, d

Hartig, F., Minunno, F., and Paul, S.: BayesianTools: General-purpose MCMC and SMC samplers and tools for bayesian statistics, CRAN, https://doi.org/10.32614/CRAN.package.BayesianTools, 2023. a, b, c

Huber, M. L., Perkins, R. A., Laesecke, A., Friend, D. G., Sengers, J. V., Assael, M. J., Metaxa, I. N., Vogel, E., Mares̆, R., and Miyagawa, K.: New international formulation for the viscosity of H2O, J. Phys. Chem. Ref. Data, 38, 101–125, 2009. a

Hufkens, K. and Stocker, B.: FluxDataKit v3.4.2: A comprehensive data set of ecosystem fluxes for land surface modelling, Zenodo [data set], https://doi.org/10.5281/zenodo.14808331, 2025. a, b, c, d

Kattge, J., Díaz, S., Lavorel, S., Prentice, I. C., Leadley, P., Bönisch, G., Garnier, E., Westoby, M., Reich, P. B., Wright, I. J., Cornelissen, J. H. C., Violle, C., Harrison, S. P., Van Bodegom, P. M., Reichstein, M., Enquist, B. J., Soudzilovskaia, N. A., Ackerly, D. D., Anand, M., Atkin, O., Bahn, M., Baker, T. R., Baldocchi, D., Bekker, R., Blanco, C. C., Blonder, B., Bond, W. J., Bradstock, R., Bunker, D. E., Casanoves, F., Cavender‐Bares, J., Chambers, J. Q., Chapin Iii, F. S., Chave, J., Coomes, D., Cornwell, W. K., Craine, J. M., Dobrin, B. H., Duarte, L., Durka, W., Elser, J., Esser, G., Estiarte, M., Fagan, W. F., Fang, J., Fernández‐Méndez, F., Fidelis, A., Finegan, B., Flores, O., Ford, H., Frank, D., Freschet, G. T., Fyllas, N. M., Gallagher, R. V., Green, W. A., Gutierrez, A. G., Hickler, T., Higgins, S. I., Hodgson, J. G., Jalili, A., Jansen, S., Joly, C. A., Kerkhoff, A. J., Kirkup, D., Kitajima, K., Kleyer, M., Klotz, S., Knops, J. M. H., Kramer, K., Kühn, I., Kurokawa, H., Laughlin, D., Lee, T. D., Leishman, M., Lens, F., Lenz, T., Lewis, S. L., Lloyd, J., Llusià, J., Louault, F., Ma, S., Mahecha, M. D., Manning, P., Massad, T., Medlyn, B. E., Messier, J., Moles, A. T., Müller, S. C., Nadrowski, K., Naeem, S., Niinemets, Ü., Nöllert, S., Nüske, A., Ogaya, R., Oleksyn, J., Onipchenko, V. G., Onoda, Y., Ordoñez, J., Overbeck, G., Ozinga, W. A., Patiño, S., Paula, S., Pausas, J. G., Peñuelas, J., Phillips, O. L., Pillar, V., Poorter, H., Poorter, L., Poschlod, P., Prinzing, A., Proulx, R., Rammig, A., Reinsch, S., Reu, B., Sack, L., Salgado‐Negret, B., Sardans, J., Shiodera, S., Shipley, B., Siefert, A., Sosinski, E., Soussana, J., Swaine, E., Swenson, N., Thompson, K., Thornton, P., Waldram, M., Weiher, E., White, M., White, S., Wright, S. J., Yguel, B., Zaehle, S., Zanne, A. E., and Wirth, C.: TRY – a global database of plant traits, Global Change Biol., 17, 2905–2935, https://doi.org/10.1111/j.1365-2486.2011.02451.x, 2011. a

Keeling, R. F., Morgan, E. J., and Keeling, C. D.: Atmospheric Monthly In Situ CO2 Data – Mauna Loa Observatory, Hawaii. In Scripps CO2 Program Data, UC San Diego Library Digital Collections [data set], https://doi.org/10.6075/J08W3BHW, 2017. a, b

Keenan, T. F. and Niinemets, Ü.: Global leaf trait estimates biased due to plasticity in the shade, Nat. Plants, 3, 16201, https://doi.org/10.1038/nplants.2016.201, 2016. a

Lavergne, A., Voelker, S., Csank, A., Graven, H., de Boer, H. J., Daux, V., Robertson, I., Dorado-Liñán, I., Martínez-Sancho, E., Battipaglia, G., Bloomfield, K. J., Still, C. J., Meinzer, F. C., Dawson, T. E., Julio Camarero, J., Clisby, R., Fang, Y., Menzel, A., Keen, R. M., Roden, J. S., and Prentice, I. C.: Historical changes in the stomatal limitation of photosynthesis: empirical support for an optimality principle, New Phytol., 225, 2484–2497, https://doi.org/10.1111/nph.16314, 2020. a

LeBauer, D. S., Wang, D., Richter, K. T., Davidson, C. C., and Dietze, M. C.: Facilitating feedbacks between field measurements and ecosystem models, Ecol. Monogr., 83, 133–154, 2013. a

Liu, J., Ryu, Y., Luo, X., Dechant, B., Stocker, B. D., Keenan, T. F., Gentine, P., Li, X., Li, B., Harrison, S. P., and Prentice, I. C.: Evidence for widespread thermal acclimation of canopy photosynthesis, Nat. Plants, 10, 1919–1927, https://doi.org/10.1038/s41477-024-01846-1, 2024. a, b

Ludwig, M., Moreno-Martinez, A., Hölzel, N., Pebesma, E., and Meyer, H.: Assessing and improving the transferability of current global spatial prediction models, Global Ecol. Biogeogr., 32, 356–368, https://doi.org/10.1111/geb.13635, 2023. a

Luo, Y., Gessler, A., D'Odorico, P., Hufkens, K., and Stocker, B. D.: Quantifying effects of cold acclimation and delayed springtime photosynthesis resumption in northern ecosystems, New Phytol., 240, 984–1002, https://doi.org/10.1111/nph.19208, 2023. a

MacBean, N., Peylin, P., Chevallier, F., Scholze, M., and Schürmann, G.: Consistent assimilation of multiple data streams in a carbon cycle data assimilation system, Geosci. Model Dev., 9, 3569–3588, https://doi.org/10.5194/gmd-9-3569-2016, 2016. a, b, c

Mäkelä, A., Hari, P., Berninger, F., Hänninen, H., and Nikinmaa, E.: Acclimation of photosynthetic capacity in Scots pine to the annual cycle of temperature, Tree Physiol., 24, 369–376, https://doi.org/10.1093/treephys/24.4.369, 2004. a, b

Marquet, P. A., Allen, A. P., Brown, J. H., Dunne, J. A., Enquist, B. J., Gillooly, J. F., Gowaty, P. A., Green, J. L., Harte, J., Hubbell, S. P., O'Dwyer, J., Okie, J. G., Ostling, A., Ritchie, M., Storch, D., and West, G. B.: On Theory in Ecology, BioScience, 64, 701–710, https://doi.org/10.1093/biosci/biu098, 2014. a

Mengoli, G., Agustí-Panareda, A., Boussetta, S., Harrison, S. P., Trotta, C., and Prentice, I. C.: Ecosystem Photosynthesis in Land-Surface Models: A First-Principles Approach Incorporating Acclimation, J. Adv. Model. Earth Syst., 14, e2021MS002767, https://doi.org/10.1029/2021MS002767, 2022. a, b, c

Monteith, J. L.: Solar Radiation and Productivity in Tropical Ecosystems, J. Appl. Ecol., 9, 747, https://doi.org/10.2307/2401901, 1972. a

NOAA National Geophysical Data Center: ETOPO1 1 Arc-Minute Global Relief Model, NOAA National Geophysical Data Center [data set], https://doi.org/10.7289/V5C8276M, 2009. a, b

Oberpriller, J., Cameron, D. R., Dietze, M. C., and Hartig, F.: Towards robust statistical inference for complex computer models, Ecol. Lett., 24, 1251–1261, https://doi.org/10.1111/ele.13728, 2021. a, b

Pastorello, G., Trotta, C., Canfora, E., Chu, H., Christianson, D., Cheah, Y.-W., Poindexter, C., Chen, J., Elbashandy, A., Humphrey, M., Isaac, P., Polidori, D., Reichstein, M., Ribeca, A., van Ingen, C., Vuichard, N., Zhang, L., Amiro, B., Ammann, C., Arain, M. A., Ardö, J., Arkebauer, T., Arndt, S. K., Arriga, N., Aubinet, M., Aurela, M., Baldocchi, D., Barr, A., Beamesderfer, E., Marchesini, L. B., Bergeron, O., Beringer, J., Bernhofer, C., Berveiller, D., Billesbach, D., Black, T. A., Blanken, P. D., Bohrer, G., Boike, J., Bolstad, P. V., Bonal, D., Bonnefond, J.-M., Bowling, D. R., Bracho, R., Brodeur, J., Brümmer, C., Buchmann, N., Burban, B., Burns, S. P., Buysse, P., Cale, P., Cavagna, M., Cellier, P., Chen, S., Chini, I., Christensen, T. R., Cleverly, J., Collalti, A., Consalvo, C., Cook, B. D., Cook, D., Coursolle, C., Cremonese, E., Curtis, P. S., D'Andrea, E., da Rocha, H., Dai, X., Davis, K. J., Cinti, B. D., Grandcourt, A. D., Ligne, A. D., De Oliveira, R. C., Delpierre, N., Desai, A. R., Di Bella, C. M., Tommasi, P. D., Dolman, H., Domingo, F., Dong, G., Dore, S., Duce, P., Dufrêne, E., Dunn, A., Dušek, J., Eamus, D., Eichelmann, U., ElKhidir, H. A. M., Eugster, W., Ewenz, C. M., Ewers, B., Famulari, D., Fares, S., Feigenwinter, I., Feitz, A., Fensholt, R., Filippa, G., Fischer, M., Frank, J., Galvagno, M., Gharun, M., Gianelle, D., Gielen, B., Gioli, B., Gitelson, A., Goded, I., Goeckede, M., Goldstein, A. H., Gough, C. M., Goulden, M. L., Graf, A., Griebel, A., Gruening, C., Grünwald, T., Hammerle, A., Han, S., Han, X., Hansen, B. U., Hanson, C., Hatakka, J., He, Y., Hehn, M., Heinesch, B., Hinko-Najera, N., Hörtnagl, L., Hutley, L., Ibrom, A., Ikawa, H., Jackowicz-Korczynski, M., Janouš, D., Jans, W., Jassal, R., Jiang, S., Kato, T., Khomik, M., Klatt, J., Knohl, A., Knox, S., Kobayashi, H., Koerber, G., Kolle, O., Kosugi, Y., Kotani, A., Kowalski, A., Kruijt, B., Kurbatova, J., Kutsch, W. L., Kwon, H., Launiainen, S., Laurila, T., Law, B., Leuning, R., Li, Y., Liddell, M., Limousin, J.-M., Lion, M., Liska, A. J., Lohila, A., López-Ballesteros, A., López-Blanco, E., Loubet, B., Loustau, D., Lucas-Moffat, A., Lüers, J., Ma, S., Macfarlane, C., Magliulo, V., Maier, R., Mammarella, I., Manca, G., Marcolla, B., Margolis, H. A., Marras, S., Massman, W., Mastepanov, M., Matamala, R., Matthes, J. H., Mazzenga, F., McCaughey, H., McHugh, I., McMillan, A. M. S., Merbold, L., Meyer, W., Meyers, T., Miller, S. D., Minerbi, S., Moderow, U., Monson, R. K., Montagnani, L., Moore, C. E., Moors, E., Moreaux, V., Moureaux, C., Munger, J. W., Nakai, T., Neirynck, J., Nesic, Z., Nicolini, G., Noormets, A., Northwood, M., Nosetto, M., Nouvellon, Y., Novick, K., Oechel, W., Olesen, J. E., Ourcival, J.-M., Papuga, S. A., Parmentier, F.-J., Paul-Limoges, E., Pavelka, M., Peichl, M., Pendall, E., Phillips, R. P., Pilegaard, K., Pirk, N., Posse, G., Powell, T., Prasse, H., Prober, S. M., Rambal, S., Rannik, Ü., Raz-Yaseef, N., Rebmann, C., Reed, D., Dios, V. R. D., Restrepo-Coupe, N., Reverter, B. R., Roland, M., Sabbatini, S., Sachs, T., Saleska, S. R., Sánchez-Cañete, E. P., Sanchez-Mejia, Z. M., Schmid, H. P., Schmidt, M., Schneider, K., Schrader, F., Schroder, I., Scott, R. L., Sedlák, P., Serrano-Ortíz, P., Shao, C., Shi, P., Shironya, I., Siebicke, L., Šigut, L., Silberstein, R., Sirca, C., Spano, D., Steinbrecher, R., Stevens, R. M., Sturtevant, C., Suyker, A., Tagesson, T., Takanashi, S., Tang, Y., Tapper, N., Thom, J., Tomassucci, M., Tuovinen, J.-P., Urbanski, S., Valentini, R., van der Molen, M., van Gorsel, E., van Huissteden, K., Varlagin, A., Verfaillie, J., Vesala, T., Vincke, C., Vitale, D., Vygodskaya, N., Walker, J. P., Walter-Shea, E., Wang, H., Weber, R., Westermann, S., Wille, C., Wofsy, S., Wohlfahrt, G., Wolf, S., Woodgate, W., Li, Y., Zampedri, R., Zhang, J., Zhou, G., Zona, D., Agarwal, D., Biraud, S., Torn, M., and Papale, D.: The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data, Sci. Data, 7, 225, https://doi.org/10.1038/s41597-020-0534-3, 2020. a, b

Peng, Y., Prentice, I. C., Bloomfield, K. J., Campioli, M., Guo, Z., Sun, Y., Tian, D., Wang, X., Vicca, S., and Stocker, B. D.: Global terrestrial nitrogen uptake and nitrogen use efficiency, J. Ecol., 111, 2676–2693, https://doi.org/10.1111/1365-2745.14208, 2023. a

Prentice, I. C., Dong, N., Gleason, S. M., Maire, V., and Wright, I. J.: Balancing the costs of carbon gain and water transport: testing a new theoretical framework for plant functional ecology, Ecol. Lett., 17, 82–91, https://doi.org/10.1111/ele.12211, 2014. a, b, c, d, e, f, g

Prentice, I. C., Balzarolo, M., Bloomfield, K. J., Chen, J. M., Dechant, B., Ghent, D., Janssens, I. A., Luo, X., Morfopoulos, C., Ryu, Y., Vicca, S., and van Hoolst, R.: Principles for satellite monitoring of vegetation carbon uptake, Nat. Rev. Earth Environ., 5, 818–832, https://doi.org/10.1038/s43017-024-00601-6, 2024. a

Raj, R., Hamm, N. A. S., Tol, C. V. D., and Stein, A.: Uncertainty analysis of gross primary production partitioned from net ecosystem exchange measurements, Biogeosciences, 13, 1409–1422, https://doi.org/10.5194/bg-13-1409-2016, 2016. a

Reichstein, M., Falge, E., Baldocchi, D., Papale, D., Aubinet, M., Berbigier, P., Bernhofer, C., Buchmann, N., Gilmanov, T., Granier, A., Grünwald, T., Havránková, K., Ilvesniemi, H., Janous, D., Knohl, A., Laurila, T., Lohila, A., Loustau, D., Matteucci, G., Meyers, T., Miglietta, F., Ourcival, J.-M., Pumpanen, J., Rambal, S., Rotenberg, E., Sanz, M., Tenhunen, J., Seufert, G., Vaccari, F., Vesala, T., Yakir, D., and Valentini, R.: On the separation of net ecosystem exchange into assimilation and ecosystem respiration: review and improved algorithm, Global Change Biol., 11, 1424–1439, https://doi.org/10.1111/j.1365-2486.2005.001002.x, 2005. a

Rogers, A., Serbin, S. P., Ely, K. S., Sloan, V. L., and Wullschleger, S. D.: Terrestrial biosphere models underestimate photosynthetic capacity and CO2 assimilation in the Arctic, New Phytol., 216, 1090–1103, https://doi.org/10.1111/nph.14740, 2017. a

Serbin, S. P., Singh, A., Desai, A. R., Dubois, S. G., Jablonski, A. D., Kingdon, C. C., Kruger, E. L., and Townsend, P. A.: Remotely estimating photosynthetic capacity, and its response to temperature, in vegetation canopies using imaging spectroscopy, Remote Sens. Environ., 167, 78–87, https://doi.org/10.1016/j.rse.2015.05.024, 2015. a

Sitch, S., O'Sullivan, M., Robertson, E., Friedlingstein, P., Albergel, C., Anthoni, P., Arneth, A., Arora, V. K., Bastos, A., Bastrikov, V., Bellouin, N., Canadell, J. G., Chini, L., Ciais, P., Falk, S., Harris, I., Hurtt, G., Ito, A., Jain, A. K., Jones, M. W., Joos, F., Kato, E., Kennedy, D., Klein Goldewijk, K., Kluzek, E., Knauer, J., Lawrence, P. J., Lombardozzi, D., Melton, J. R., Nabel, J. E. M. S., Pan, N., Peylin, P., Pongratz, J., Poulter, B., Rosan, T. M., Sun, Q., Tian, H., Walker, A. P., Weber, U., Yuan, W., Yue, X., and Zaehle, S.: Trends and Drivers of Terrestrial Sources and Sinks of Carbon Dioxide: An Overview of the TRENDY Project, Global Biogeochem. Cy., 38, e2024GB008102, https://doi.org/10.1029/2024GB008102, 2024. a

Smith, N. G. and Dukes, J. S.: LCE: leaf carbon exchange data set for tropical, temperate, and boreal species of North and Central America, Ecology, 98, 2978–2978, https://doi.org/10.1002/ecy.1992, 2017. a

Smith, N. G., Keenan, T. F., Colin Prentice, I., Wang, H., Wright, I. J., Niinemets, Ü., Crous, K. Y., Domingues, T. F., Guerrieri, R., Yoko Ishida, F., Kattge, J., Kruger, E. L., Maire, V., Rogers, A., Serbin, S. P., Tarvainen, L., Togashi, H. F., Townsend, P. A., Wang, M., Weerasinghe, L. K., and Zhou, S.: Global photosynthetic capacity is optimized to the environment, Ecol. Lett., 22, 506–517, https://doi.org/10.1111/ele.13210, 2019. a, b

Stocker, B. D., Wang, H., Smith, N. G., Harrison, S. P., Keenan, T. F., Sandoval, D., Davis, T., and Prentice, I. C.: P-model v1.0: an optimality-based light use efficiency model for simulating ecosystem gross primary production, Geosci. Model Dev., 13, 1545–1581, https://doi.org/10.5194/gmd-13-1545-2020, 2020. a, b, c, d, e, f, g, h, i, j, k, l, m, n

Stocker, B. D., Tumber-Dávila, S. J., Konings, A. G., Anderson, M. C., Hain, C., and Jackson, R. B.: Global patterns of water storage in the rooting zones of vegetation, Nat. Geosci., 16, 250–256, https://doi.org/10.1038/s41561-023-01125-2, 2023. a

Stocker, B. D., Hufkens, K., Arán Paredes, J., Bernhard, F., and Marcadella, M.: geco-bern/rsofun: v5.1.0, Zenodo [code], https://doi.org/10.5281/zenodo.17313273, 2025. a, b, c

Tarvainen, L., Wallin, G., Räntfors, M., and Uddling, J.: Weak vertical canopy gradients of photosynthetic capacities and stomatal responses in a fertile Norway spruce stand, Oecologia, 173, 1179–1189, https://doi.org/10.1007/s00442-013-2703-y, 2013. a

Togashi, H. F., Atkin, O. K., Bloomfield, K. J., Bradford, M., Cao, K., Dong, N., Evans, B. J., Fan, Z., Harrison, S. P., Hua, Z., Liddell, M. J., Lloyd, J., Ni, J., Wang, H., Weerasinghe, L. K., and Prentice, I. C.: Functional trait variation related to gap dynamics in tropical moist forests: A vegetation modelling perspective, Perspect. Plant Ecol. Evol. System., 35, 52–64, https://doi.org/10.1016/j.ppees.2018.10.004, 2018. a

Trotsiuk, V., Hartig, F., and Forrester, D. I.: r3PG – An r package for simulating forest growth using the 3-PG process-based model, Meth. Ecol. Evol., 11, 1470–1475, https://doi.org/10.1111/2041-210X.13474, 2020. a, b

Ubierna, N. and Farquhar, G. D.: Advances in measurements and models of photosynthetic carbon isotope discrimination in C3 plants, Plant Cell Environ., 37, 1494–1498, https://doi.org/10.1111/pce.12346, 2014. a

Ukkola, A.: PLUMBER2: forcing and evaluation datasets for a model intercomparison project for land surface models v1.0, NCI Data Catalogue [data set], https://doi.org/10.25914/5FDB0902607E1, 2020. a

van Oijen, M.: Bayesian Methods for Quantifying and Reducing Uncertainty and Error in Forest Models, Curr. Forest. Rep., 3, 269–280, https://doi.org/10.1007/s40725-017-0069-9, 2017. a, b, c, d, e

Van Oijen, M., Rougier, J., and Smith, R.: Bayesian calibration of process-based forest models: bridging the gap between models and data, Tree Physiol., 25, 915–927, https://doi.org/10.1093/treephys/25.7.915, 2005.  a, b

von Caemmerer, S. and Farquhar, G. D.: Some relationships between the biochemistry of photosynthesis and the gas exchange of leaves, Planta, 153, 376–387, https://doi.org/10.1007/BF00384257, 1981. a

Vrugt, J. A., Ter Braak, C., Diks, C., Robinson, B. A., Hyman, J. M., and Higdon, D.: Accelerating Markov Chain Monte Carlo Simulation by Differential Evolution with Self-Adaptive Randomized Subspace Sampling, Int. J. Nonlin. Sci. Numer. Simul., 10, https://doi.org/10.1515/IJNSNS.2009.10.3.273, 2009. a

Wang, H., Prentice, I. C., Keenan, T. F., Davis, T. W., Wright, I. J., Cornwell, W. K., Evans, B. J., and Peng, C.: Towards a universal model for carbon dioxide uptake by plants, Nat. Plants, 3, 734–741, https://doi.org/10.1038/s41477-017-0006-8, 2017. a, b, c, d, e, f, g, h, i, j

Wang, H., Harrison, S. P., Prentice, I. C., Yang, Y., Bai, F., Togashi, H. F., Wang, M., Zhou, S., and Ni, J.: The China Plant Trait Database: toward a comprehensive regional compilation of functional traits for land plants, Ecology, 99, 500–500, https://doi.org/10.1002/ecy.2091, 2018. a

Wankmüller, F. J. P., Delval, L., Lehmann, P., Baur, M. J., Cecere, A., Wolf, S., Or, D., Javaux, M., and Carminati, A.: Global influence of soil texture on ecosystem water limitation, Nature, 635, 631–638, https://doi.org/10.1038/s41586-024-08089-2, 2024. a, b

Warm Winter 2020 Team, ICOS Ecosystem Thematic Centre, ICOS Ecosystem Thematic Centre, and Trotta, C.: Warm Winter 2020 ecosystem eddy covariance flux product for 73 stations in FLUXNET-Archive format – release 2022-1, ICOS, https://doi.org/10.18160/2G60-ZHAK, 2022. a

Weng, E., Farrior, C. E., Dybzinski, R., and Pacala, S. W.: Predicting vegetation type through physiological and environmental interactions with leaf traits: evergreen and deciduous forests in an earth system modeling framework, Global Change Biol., 23, 2482–2498, https://doi.org/10.1111/gcb.13542, 2017. a

Weng, E. S., Malyshev, S., Lichstein, J. W., Farrior, C. E., Dybzinski, R., Zhang, T., Shevliakova, E., and Pacala, S. W.: Scaling from individual trees to forests in an Earth system modeling framework using a mathematically tractable model of height-structured competition, Biogeosciences, 12, 2655–2694, https://doi.org/10.5194/bg-12-2655-2015, 2015. a

Xiang, Y., Gubian, S., Suomela, B., and Hoeng, J.: Generalized Simulated Annealing for Global Optimization: The GenSA Package, R J., 5, 13, https://doi.org/10.32614/RJ-2013-002, 2013. a

Xiao, J., Chevallier, F., Gomez, C., Guanter, L., Hicke, J. A., Huete, A. R., Ichii, K., Ni, W., Pang, Y., Rahman, A. F., Sun, G., Yuan, W., Zhang, L., and Zhang, X.: Remote sensing of the terrestrial carbon cycle: A review of advances over 50 years, Remote Sens. Environ., 233, 111383, https://doi.org/10.1016/j.rse.2019.111383, 2019. a

Short summary
Mechanistic vegetation models serve to estimate terrestrial carbon fluxes and climate impacts on ecosystems across diverse conditions. Here, we demonstrate and evaluate the rsofun R package, which provides a computationally efficient implementation of the P-model for site-scale simulations of ecosystem photosynthesis. Bayesian model fitting to observed fluxes and traits and evaluation on an independent test data set indicated robust calibration and unbiased prediction capabilities.
Share