Articles | Volume 17, issue 17
https://doi.org/10.5194/gmd-17-6725-2024
https://doi.org/10.5194/gmd-17-6725-2024
Methods for assessment of models
 | 
12 Sep 2024
Methods for assessment of models |  | 12 Sep 2024

Impacts of land-use change on biospheric carbon: an oriented benchmark using the ORCHIDEE land surface model

Thi Lan Anh Dinh, Daniel Goll, Philippe Ciais, and Ronny Lauerwald
Abstract

Land-use change (LUC) impacts biospheric carbon, encompassing biomass carbon and soil organic carbon (SOC). Despite the use of dynamic global vegetation models (DGVMs) in estimating the anthropogenic perturbation of biospheric carbon stocks, critical evaluations of model performance concerning LUC impacts are scarce. Here, we present a systematic evaluation of the performance of the DGVM Organising Carbon and Hydrology in Dynamic Ecosystems (ORCHIDEE) in reproducing observed LUC impacts on biospheric carbon stocks over Europe. First, we compare model predictions with observation-based gridded estimates of net and gross primary productivity (NPP and GPP), biomass growth patterns, and SOC stocks. Second, we evaluate the predicted response of soil carbon stocks to LUC based on data from forest inventories, paired plots, chronosequences, and repeated sampling designs. Third, we use interpretable machine learning to identify factors contributing to discrepancies between simulations and observations, including drivers and processes not resolved in ORCHIDEE (e.g. erosion, soil fertility). Results indicate agreement between the model and observed spatial patterns and temporal trends, such as the increase in biomass with age, when simulating biosphere carbon stocks. The direction of the SOC responses to LUC generally aligns between simulated and observed data. However, the model underestimates carbon gains for cropland-to-grassland conversions and carbon losses for grassland-to-cropland and forest-to-cropland conversions. These discrepancies are attributed to bias arising from soil erosion rate, which is not fully captured in ORCHIDEE. Our study provides an oriented benchmark for assessing the DGVMs against observations and explores their potential in studying the impact of LUCs on SOC stocks.

1 Introduction

The terrestrial biosphere, with its organic carbon stocks in biomass and soils, currently acts as a sink for anthropogenic CO2 emissions (Lal2008; Canadell and Schulze2014; IPCC2023; Friedlingstein et al.2023). It has long been known that land use and land-use changes (LUCs) significantly alter the quantity of carbon stored in both biomass and soil (Guo and Gifford2002; Laganière et al.2010; Deng et al.2014; Le Quéré et al.2015; Sanderman et al.2017). For example, afforestation and reforestation activities can increase biomass carbon stocks and, consequently, expand soil and litter carbon reserves. LUC-induced changes in soil organic carbon (SOC) stocks result from changes in the quality and quantity of litter inputs or decomposition processes driven by shifts in soil moisture and temperature regimes. As such, investigating the implications of LUC for biospheric carbon pools and fluxes becomes indispensable in shaping effective climate change mitigation strategies and fostering sustainable land management practices (Watson et al.2007; Arora and Boer2010; IPCC2022). A comprehensive understanding of these dynamics is essential for harnessing the potential of carbon sequestration in climate change mitigation efforts and achieving global sustainability goals (Lal2004; Canadell and Schulze2014).

Dynamic global vegetation models (DGVMs) serve as indispensable tools for estimating regional and global changes in biospheric carbon stocks in response to climate change and LUC (Nyawira et al.2016). However, accurate evaluation of DGVMs against observational data is crucial to assess their reliability in representing biomass and soil carbon dynamics. In addition, to the best of our knowledge, very few studies have comprehensively compared observed data and model simulations concerning tree biomass versus age across large spatial scales. Here, we present a benchmark procedure to comprehensively evaluate a DGVM's performance in reproducing LUC impacts on biomass carbon and SOC stocks using diverse observational data sources. The approach is applied to assess the performance of the Organising Carbon and Hydrology in Dynamic Ecosystems (ORCHIDEE) model (Krinner et al.2005).

In the recent past, a wide range of meta-analyses has been published, focusing on SOC changes following LUC (Guo and Gifford2002; Laganière et al.2010; Poeplau et al.2011; Poeplau and Don2015; Li et al.2018; Fohrafellner et al.2023). One of the key advantages of these meta-analyses is their utilisation of various quality checks to combine and aggregate local-scale measurements. Through this approach, the meta-analyses offer valuable insights into the representative ranges and averages of magnitudes and speed of changes in SOC stocks following LUC (Poeplau et al.2011; Li et al.2018). As a result, meta-analyses can be used to validate DGVMs' ability to reproduce SOC stock dynamics following LUC. Nevertheless, very few DGVMs have been evaluated against such meta-analyses (Nyawira et al.2016), while for most DGVMs, such an evaluation is yet to be performed.

Our goal is to create a universal benchmark that can be used by DGVMs in general, making it easier to evaluate how well these models simulate changes in biomass carbon and SOC stocks after LUC. We build this LUC–carbon benchmarking framework at a continental scale in Europe. To achieve this, we use a combination of diverse observational data sources and employ the ORCHIDEE model. This approach provides insights into and a more profound comprehension of the model processes as we compare them with the observations. The first step involves verifying whether the model reproduces carbon fluxes and stocks accurately. Next, we assess the simulated impact of LUC on SOC stock changes by comparing it with observational data from meta-analyses. Five LUC transitions will be considered: cropland to grassland (C to G), grassland to cropland (G to C), cropland to forest (C to F), grassland to forest (G to F), and forest to cropland (F to C). Then, we explore potential factors that may cause model bias when simulating changes in SOC stock for each LUC scenario. In the following, we (1) introduce materials, including a brief description of the ORCHIDEE model and observational databases used; (2) describe the model set-ups and comparison process used in this study; (3) assess the model's performance in reproducing carbon stocks, stock changes, and the major related carbon fluxes; (4) compare simulations against meta-analyses of observations of soil carbon dynamics following LUC and investigate potential factors that contribute to model bias; and (5) discuss the comparisons, sources of discrepancies, and challenges in model–data comparison.

2 Materials and methods

2.1 Organising Carbon and Hydrology in Dynamic Ecosystems model

ORCHIDEE version 2.2 is a state-of-the-art DGVM designed to simulate carbon, water, and energy fluxes from local sites to the global level (Krinner et al.2005). It calculates the energy and hydrology budget of the terrestrial biosphere at half-hourly intervals, distinguishing 15 plant functional types (PFTs; shown in Table 1) (Ducoudré et al.1993; de Rosnay and Polcher1998). In addition, it simulates vegetation phenology and carbon dynamics, including photosynthesis, maintenance and growth respiration, carbon allocation in vegetation biomass, production and decomposition of litter, and soil carbon dynamics, at daily time steps (Krinner et al.2005). The basic scheme of biospheric carbon cycling representation in ORCHIDEE is described in Appendix A.

ORCHIDEE is forced with meteorological data, wood harvest maps, soil texture, and land cover maps to prescribe the areal proportion of each PFT in each model grid cell for a given point in time. When land cover changes happen, PFT-level carbon stocks are redistributed from the shrinking PFT to the expanding one.

All simulations described in this study share the same forcing data. In detail, we employed the Climatic Research Unit and Japanese Reanalysis (CRU JRA) v2.3 dataset for meteorological forcings with a spatial resolution of 0.5°. This dataset is accessible for the period spanning 1901 to 2021 and is available at https://catalogue.ceda.ac.uk/uuid/38715b12b22043118a208acd61771917 (last access: 15 May 2023). The CRU JRA v2.3 data comprise 6-hourly records of various variables, including temperature at 2 m above ground, air pressure, specific humidity, wind speed, precipitation (rain and snow), and downward longwave and shortwave radiation. The land cover map is from the European Space Agency (ESA) LUH2v2 data (Lurton et al.2020), i.e. a combination of the ESA Climate Change Initiative land cover map (https://www.esa-landcover-cci.org/, last access: 15 May 2023) and the historical land-use harmonisation database (LUH2v2; Hurtt et al.2020). These data provide areal fractions for each of the 15 PFTs within individual cells of the modelling grid. The land cover map is updated annually, and LUC is represented as an abrupt transition of land cover at the beginning of each year. More subtle LUC changes, like changes in management intensity, are not considered due to a lack of historical data. Over standard historical gridded simulations, LUC change is treated as a continuous process, slightly increasing or decreasing the areal proportion of one or more PFTs at the detriment of others. The litter and SOC pools inherited from a disappearing PFT to a target PFT are merged with the existing litter and SOC pools of the target PFT, which already occupy a fraction of the grid cell. This dilution of a small amount of newly delivered litter and SOC brought from LUC into a large amount of SOC already existing in the target PFT area conserves mass but makes it impractical to compare SOC and litter change with observations because observations come from sites where 100 % of a PFT is converted to another.

Therefore, we built idealised LUC scenarios in which we assume an abrupt transition referring to a 100 % conversion from one PFT to another in a grid, meaning there is no dilution of old soil carbon signals into the new PFT area. This transition is based on homogeneously prescribed land cover consisting of one single PFT, not on changing land cover maps. This abrupt change run is necessary to make simulations comparable to observations at the site level, which consider local change from one PFT to another rather than a change in PFT mix from the landscape perspective usually taken by a DGVM such as ORCHIDEE. The wood harvest map is sourced from the LUH2v2 database. It provides the wood harvest data as annual areal flux rates of carbon in the extracted biomass (gC m−2 yr−1). This means that these data can be applied to different PFT maps without causing extreme flux rates due to inconsistent representation of forest area. The soil texture classification relies on the study of Post and Zobler (2000). This scheme distinguishes seven texture classes, which for ORCHIDEE are further aggregated into three texture classes (i.e. coarse, medium, fine), each associated with specific soil physical properties. This classification is essential for simulating the soil water budget, and through that, it also significantly affects vegetation dynamics. In addition, it impacts soil carbon dynamics by directly influencing the turnover rates of SOC through clay content and its presumed effect of enhancing the physical protection of the active SOC pool. In ORCHIDEE, the module of soil has an assumed globally uniform depth of 2 m. Note however that soil carbon is not depth discretised, and average values of soil temperature, moisture, and clay content are used.

Table 1ORCHIDEE plant functional types (PFTs) and PFT-specific parameters. Values in parentheses indicate the modifications in the simulation set-ups (detailed in Sect. 2.3). (Vcmax represents the maximal rate of carboxylation ( the potential photosynthetic capacity) (µmol m−2 s−1); Fgrowthresp is the fraction of gross primary production (GPP) which is lost as growth respiration.)

Download Print Version | Download XLSX

2.2 Observation-based data

To evaluate the model's performance concerning the dynamics of carbon stocks, we compare simulation results against observations of net primary production (NPP) and gross primary production (GPP), which are the primary factors controlling land carbon stocks; paired observations of above-ground biomass and plant age (Somogyi et al.2008; Schepaschenko et al.2017); observation-based maps of SOC; and SOC stock changes due to LUC. For the investigation of potential factors (detailed in Sect. 2.2.4) causing model bias in estimating changes in SOC stocks due to LUC, we used meteorological data from the CRU JRA dataset and soil-related data from Land Use and Cover Area frame Survey (LUCAS) soil surveys.

2.2.1 Primary production

Annual NPP data were derived from a comprehensive forest ecosystem database from Luyssaert et al. (2007), including a rigorous selection of single or multiple direct measurements and modelled fluxes. The model-generated fluxes in this database closely match the observed data because they were generated using a mechanistic process model with daily or more detailed climatological data, calibrated with site-specific parameters, and validated against site-specific measurements. NPP is reported at different levels, ranging from a single plant component (e.g. foliage or stem NPPs) to the complete plant. Here, we selected the most superficial aggregation level of the total NPP (i.e. the sum of above-ground (foliage + wood) and below-ground (coarse + fine roots) NPP components).

The observed annual GPP data were obtained from four datasets. The first dataset was, similar to the above NPP dataset, extracted from the global forest database from Luyssaert et al. (2007). Second, GPP data were also gathered from the FLUXNET 2015 dataset, including data from multiple regional flux networks (Pastorello et al.2020). This dataset collects eddy covariance measurements of carbon, water, and energy fluxes between the biosphere and atmosphere. It can be downloaded from https://fluxnet.org/data/fluxnet2015-dataset/ (Pastorello et al.2020). Over our study area, these GPP data are available mainly from 1996 to 2015. Thirdly, additional GPP data from European sites in 2020 were collected from the Integrated Carbon Observation System (ICOS), a European-wide greenhouse gas research infrastructure. Finally, GPP data were also gathered from Campioli et al. (2015). Like the comprehensive database of Luyssaert et al. (2007), Campioli et al. (2015) compiled the data from individual studies using harvest, biometric, eddy covariance, or process-based model estimates of primary production. In addition, this dataset includes data not only from forest sites but also from grassland and cropland sites.

More detailed information on the selected NPP and GPP sites from different sources can be found in the Supplement (Tables S1 to S4).

2.2.2 Biomass

The biomass dataset considered here includes in situ estimates for the different plant compartments (i.e. foliage, stem, and branch) and spans all of the European biomes (Fig. S1 in the Supplement). The dataset consists of a collection and harmonisation of available open forest inventory databases (e.g. Somogyi et al.2008; Schepaschenko et al.2017; Anderson-Teixeira et al.2018) already used to quantify ecological and environmental controls on the spatial variability of stand age (Besnard et al.2021). Despite the global nature of the dataset, given the current European scope of this analysis, here we focused on locations in Europe where the total above-ground biomass (AGB) could be estimated based on in situ measurements. The final dataset comprises 603 sites, including six PFTs (TeNE, TeBS, BoNE, BoBS, and a few sites of TeBE and BoNS). The average stand age is 58 years (with a standard deviation of 43), and the mean AGB is 6.4 kgC m−2 (with a standard deviation of 4.5).

2.2.3 Soil organic carbon

Data on SOC stocks were obtained from the Land Use and Cover Area frame Survey (LUCAS) collected during 2018 (Orgiazzi et al.2018). This dataset offers comprehensive information on various chemical and physical soil properties throughout the European region. The LUCAS sampling was conducted at different depths, primarily focusing on the fine-soil component of the top 20 cm of the soil column while excluding above-ground vegetation residues, grass, and litter. Site selection for our study was based on the availability of observed organic carbon content, bulk density, and the fraction of coarse fragments within the top 20 cm layer. In addition, the land-use information was consistently available for all samples.

We considered the latest surveys from LUCAS 2018 topsoil data (Fernandez-Ugalde et al.2022), which can be downloaded from the European Soil Data Centre's website https://esdac.jrc.ec.europa.eu/content/lucas-2018-topsoil-data. However, it is important to note that, at the time of writing this paper, the fraction of coarse fragments was not included in the LUCAS 2018 topsoil data and had to be obtained from a previous survey, LUCAS 2015 (Jones et al.2020). We downloaded and extracted the coarse-fragment data from https://esdac.jrc.ec.europa.eu/content/lucas2015-topsoil-data (last access: 3 August 2023) and then combined them with the LUCAS 2018 topsoil data. Furthermore, our analysis focused only on samples associated with forest, grassland, and cropland land uses, excluding other land-use types not represented by the PFTs of ORCHIDEE, such as shrubland or wetlands. In total, we identified and included 5150 sampling sites in our study.

The total SOC (in kg m−2) stocks were then calculated based on the following equation (Batjes1996):

(1) SOC = OC × BD × D × ( 1 - CF ) 100 ,

in which OC is organic carbon content (gC kg−1), BD is bulk density (gC cm−3), D is soil depth (cm), and CF is the volumetric fraction of coarse fragments (>2 mm).

We compiled data from 102 study sites sourced from 34 peer-reviewed publications (detailed in Table S5 in the Supplement) investigating the impact of LUC on soil carbon stocks in the European region. Our selection process included several criteria to identify relevant SOC data from these studies. Firstly, we focused on five specific LUC transitions: cropland to grassland (C to G), grassland to cropland (G to C), cropland to forest (C to F), grassland to forest (G to F), and forest to cropland (F to C). Secondly, we included only studies with paired plots, chronosequences, or repeated sampling designs. Paired plots involve assessing two adjacent sites – one that has not experienced LUC and has the original land cover and the other with a new land cover after LUC. Similarly, chronosequences utilise adjacent plots with different ages of new vegetation since conversion to another land-use type. Repeated or mono-site sampling involves the periodic collection of soil samples at the same location/site. A “space-for-time” approach, assuming that the SOC stocks of prior land use are in a steady state, is used in paired plots and chronosequences. Thirdly, we required information about whether the forest floor (i.e. the above-ground litter organic layer) was included in the sampling process for forest sites. Finally, additional relevant properties such as sampling depth, land-use history, age of current land use, and the unit of soil carbon stocks must be provided. The collected data were finally categorised into seven conversion types, as detailed in Table 2.

Table 2Number of study sites and samples, mean sampling depths with standard deviation, and mean current land-use age for the local-scale observations in the meta-analyses.

Download Print Version | Download XLSX

2.2.4 Additional data for model bias attribution

We considered two important meteorological variables, the air temperature at 2 m a.g.l. and precipitation, which are derived from the Climatic Research Unit and Japanese Reanalysis (CRU JRA) v2.3 dataset. This dataset is also used for the meteorological forcings in ORCHIDEE and is detailed in the next section (Sect. 2.3).

Soil-related data are obtained from Ballabio et al. (2019), who provided maps of soil chemical properties at 500 m spatial resolution across Europe using soil point data from LUCAS 2009/2012 soil surveys (Toth et al.2013). These datasets align with the observed SOC stocks (see Sect. 2.2.3) and are considered among the most reliable data sources for Europe (d'Andrimont et al.2020). Our focus was on three key properties: the soil carbon-to-nitrogen (CN) ratio, nitrogen (N), and phosphorus (P). The data were retrieved from https://esdac.jrc.ec.europa.eu/content/chemical-properties-european-scale-based-lucas-topsoil-data (last access: 3 August 2023). Additionally, our study considered the annual soil erosion rate in 2009 (Fendrich et al.2022; available at https://esdac.jrc.ec.europa.eu/themes/historical-reconstruction-erosion, last access: 3 August 2023), which we considered to be representative of the last few decades. The maps of all soil-related data are aggregated to 0.5° grids to match ORCHIDEE's resolution.

2.3 ORCHIDEE simulations

We conducted simulations with the ORCHIDEE model across Europe (33 to 70° N and 10° W to 40° E) for a straightforward comparison to observational data (Table 3). These simulations can be categorised into two groups: (a) realistic simulations for the historic period aimed at evaluating the ORCHIDEE model's ability to reproduce observed primary production (NPP, GPP), biomass carbon stocks, and SOC stocks and (b) idealised LUC simulations aimed at evaluating the biomass carbon stock changes and the effects of LUC on SOC stocks in terms of total magnitude and timing.

2.3.1 Historical simulations

Firstly, the realistic simulation, referred to as RLS, is based on the default configuration and parameters inspired by the TRENDY protocol (Sitch et al.2015), including two steps. In the first step of our simulations (FG1), we spun up the model to reach a steady state representative of the year 1950. This involved conducting a simulation over 340 years, with a 30-year loop of meteorological forcing data (1921–1950), as well as fixed values for atmospheric CO2 levels, PFT maps, and wood harvest, all corresponding to the year 1950. The PFT map employed here consists of 15 PFTs (Boucher et al.2020), as specified in Table 1. In the second step, we ran a transient simulation (FG2) from 1950 to 2020 using historical meteorology, CO2 concentrations, and wood harvest data. The FG2 was restarted from the last year of output of FG1. These RLS simulation outputs are evaluated against the observation-driven NPP, GPP, and SOC data, as detailed in Sect. 2.4.

Secondly, we performed a BM simulation (where BM refers to biomass assessment), which uses the same configuration as RLS but pre-describing the land cover with constant fractions of dominant PFTs in the EU (see Table 1) and no wood harvest, which ensures PFT carbon stocks for the observation period are not affected by LUC for the comparison with the observation of natural forest. In addition, a forest clear-cut simulation is performed before running the transient simulation, and during FG2 the biomass regrows from approximately zero. Thus, the simulated forest age was defined as the time since the beginning of the FG2 simulation.

Furthermore, in all simulations, we calibrated, by trial and error, two parameters, namely Vcmax and Fgrowthresp, specifically for the temperate needleleaf evergreen forest (TeNE) to reduce biases in NPP, GPP, and AGB (see more in Sect. 3.1 and 3.2). Our initial objective was to approximate the correct values of NPP and GPP, ultimately leading to an improved representation of AGB. In detail, Vcmax is adjusted based on the observed-to-simulated GPP ratio, and Fgrowthresp is gradually reduced to increase NPP and GPP values to be closer to the observations. The final adjusted values for these parameters are indicated in parentheses in Table 1.

2.3.2 Idealised LUC simulations

We conducted idealised LUC simulations, assuming the entire study area was covered by a single PFT. To ensure accurate comparisons between simulated results and meta-analyses from site-level carbon pool changes caused by LUC, regions where this PFT does not occur according to the PFT maps were excluded from the analysis. Then, we transformed this initial PFT into other PFTs, such as temperate broadleaf summergreen forest to temperate natural C3 grassland and C3 cropland (TeBS to TeGC3 or C3C, aberrations as presented in Table 1). These transformations exemplify the conversion of forest areas into grasslands or croplands (referred to as F-to-GC conversion). Note that ORCHIDEE simulates SOC stocks separately for each PFT, allowing us to simultaneously represent LUC from one PFT to two different PFTs. This is an improvement compared to other DGVMs that typically assign one value of SOC to all PFTs (e.g. the LPJ model, Sitch et al.2003).

The LUC simulations are somewhat similar to the RLS simulation, including two main processes, i.e. the spin-up simulation FG1 and the transient historical simulation FG2. In detail, we first ran the 340-year spin-up FG1F looping over 10 years of meteorological forcing (1901–1910) and fixed atmospheric CO2 concentrations and wood harvest as in 1900 in the F-to-GC simulation. Here, we fixed land cover to 100 % TeBS. At this stage, the biomass and SOC stocks are in equilibrium. In the second step, we ran a historical simulation FG2F from 1901 to 1950 for this same PFT (with historical meteorology, CO2 concentrations, and wood harvest data), restarting from the last year of the spin-up simulation FG1F. To perform the LUCs (in this case, from F to G and F to C), we changed the prescribed PFTs to C3 grassland and cropland (i.e. TeGC3 and C3C) and continued running the historical simulation FG2GCa from 1951 to 2020, restarting from FG2F. In addition, to study the LUC impact for a longer period, we extended the model run until 2100, looping over the last 20 years of meteorological forcing data (2001–2020). For this extended simulation, we kept the atmospheric CO2 fixed to the value in 2020. Although the projected climate is available (e.g. data from the Coupled Model Intercomparison Project Phase 6, Eyring et al.2016), historical data were used here to be compatible with the meta-analyses. Other LUC simulations, i.e. G to CF and C to FG, were set up similarly.

We acknowledge that defining 1950 as the same year of LUC in our simulations increases the uncertainties when comparing simulations to observations which relate to different years of LUC. Note that only a fraction of the studies from which we source the observed LUC impacts on SOC stocks specifies the year of LUC. To explore this source of uncertainty, we thus conducted tailored simulations matching the individual years of LUC reported in these studies and compared the results to simulations using 1950 as the year of LUC. Detailed information about these additional simulations is provided and discussed in Sect. S1 of the Supplement.

Table 3Main simulations.

Download Print Version | Download XLSX

2.4 Model–data comparisons

Our ORCHIDEE simulations generate outputs encompassing all grid cells at a resolution of 0.5° (≈50 km) over Europe. Conversely, observational data are typically collected at specific locations. To facilitate the comparison between the observed and simulated values, each observational site was matched with the closest corresponding ORCHIDEE cell. This approach ensures a comprehensive evaluation of the model's performance in relation to the observed data.

Table 4Model outputs corresponding to the simulations in Table 3.

Download Print Version | Download XLSX

2.4.1 Historical simulations

To compare simulated NPP and GPP against observations, we used NPP and GPP outputs from the FG2 RLS simulation for the respective observed years and corresponding PFTs. In cases where the PFT was absent or unclear in the observations (e.g. mixed forest), we assigned the dominant PFT in that particular site based on the ESA LUH2v2 land cover map. We then grouped the observed and simulated values by PFT and employed boxplots for comparison. The boxplot representation offers valuable insights into the statistical distribution of values, including the median, the 25th and 75th percentiles, the range of extreme data points, and any outliers.

Similarly, we used boxplot representation to evaluate AGB simulations categorised by PFT and age groups. The age groups are divided as follows: group 1 – 0 to 19 years, group 2 – 20 to 39 years, group 3 – 40 to 59 years, group 4 – 60 to 79 years, group 5 – 80 to 99 years, and group 6 – >99 years. ORCHIDEE simulates biomass for different plant compartments (e.g. leaves, wood, roots). To maintain consistency with the observations, simulated AGB was derived from the BM simulation by summing the biomass of leaves, above-ground sapwood and heartwood, and fruits.

We used three diagnostic measures to assess ORCHIDEE's performance in simulating soil carbon stocks (i.e. the FG2 RLS simulation's outputs). These measures include Pearson's correlation (COR; unitless), root mean square error (RMSE; in kg m−2), and relative RMSE (rRMSE; in %) between the observed and simulated SOC. The above-ground litter was excluded from the simulated SOC for comparison to the LUCAS data. The calculation of ORCHIDEE's outputs is detailed in Table 4.

2.4.2 Idealised LUC simulations

Soil profile data in meta-analyses are reported at various depths (as detailed in Table 2). To ensure uniform comparisons, we first standardised all soil carbon data, both observed and simulated, to represent SOC stocks in the top 30 cm, utilising the following depth function (Jobbágy and Jackson2000; Deng et al.2016):

(2) X 30 = 1 - β 30 1 - β d 0 × X d 0 ,

where X30 represents the soil carbon stocks in the top 30 cm, d0 is the original soil depth available in observations or simulations (in cm), Xd0 is the original soil carbon stocks, and β characterises relative rates of decrease with depth (β=0.9786, unitless). For instance, a simulated sample XORC at 2 m depth (d0=200) is converted into the topsoil sample using the equation X30=1-0.9786301-0.9786200×XORC=0.48×XORC.

We then used the absolute SOC stock change (ΔSOC; in kg m−2) as a variable for the comparison of soil carbon changes:

(3) Δ SOC = SOC LU 2 - SOC LU 1 ,

where LU1 corresponds to the land use before conversion, and LU2 is the land use after conversion. Similar to the observations, the simulated SOC for the prior land use is assumed to be in a steady state. For example, in the conversion from F to GC, the simulated SOCLU1 is set to be equal to the SOC value in 1950 from the FG2F simulation (Table 3). In contrast, the observed SOC measurements after land cover conversion are taken at various ages. A fitted carbon response function (CRF), detailed below, is derived for each conversion, describing the ΔSOC as a function of time. For the simulations, a distinct response function was derived from the simulation, corresponding to each meta-analyses site. Subsequently, the average simulated soil carbon response was computed across all these response functions. This aggregate response, referred to as “simulated CRF”, was then compared with the fitted or observed CRF obtained from the meta-analyses.

The observed CRF was constructed using diverse regression models, including linear regression, second- and third-order polynomial regressions, and single-term and two-term exponential models. Due to the limited size of the observed samples (as detailed in Table 2), a leave-one-out cross-validation (LOOCV) method (Stone1974; Dinh and Aires2022) was employed for the model selection process. This iterative approach facilitates the validation of each model's performance by training it on all data points but one and evaluating its prediction accuracy on the excluded data point. By repeating this process for all data points and assessing the overall performance, we can identify the best-performing model that generalises well to the entire dataset and the new samples. Finally, the models providing the most adequate description of the temporal dynamic of relative SOC stock changes were the linear function (Eq. 4) and the single-term exponential function (Eq. 5):

(4)ΔSOC=at+b,(5)ΔSOC=a×eb*t,

where t is the time after LUC (years), and a and b are regression coefficients. A detailed fitted CRF for each LUC in meta-analyses is presented in Table S6 in the Supplement. Furthermore, to better understand the accuracy and uncertainty of the fitted CRFs, we established approximate 95 % confidence intervals using simultaneous prediction bounds for the fitted functions. These confidence intervals visually represent the range of potential outcomes, providing valuable insights into the variability of the observed carbon stock change rate.

2.4.3 Factors explaining model bias

We used random forest (RF) (Breiman2001; Liaw and Wiener2002) to explore the factors contributing to bias in estimating SOC stock changes following LUC. The bias is calculated for each site observation taken from the meta-analyses (Table 2). For this, we compared the observed SOC stock changes per site with corresponding simulated values from the corresponding ORCHIDEE grid cells. Then we analysed which predictor variables best explain the site-to-site variations in model bias for each LUC scenario. Our chosen explanatory variables encompassed both meteorological variables (i.e. temperature at 2 m a.g.l. (T2m) and rainfall (Rain)) and key soil-related metrics (i.e. soil carbon-to-nitrogen ratio (CN), nitrogen (N), phosphorus (P), and soil erosion rate (ER)). Given the constraints of the available observations (Table 2), we also employed LOOCV here to assess the performance of the RF regression model for each LUC scenario. The model consists of 100 decision trees. Each tree is constructed independently and operates on a random subset of the data. During the LOOCV process, the model iterates through each sample in the dataset, systematically excluding one for validation in each iteration. Subsequently, the model is trained on the remaining samples, and feature importance is cumulatively assessed throughout each iteration. The performance of this LOOCV process is shown in Table S7 in the Supplement. The LUC scenarios with poor RF regression results are excluded. For the remaining cases, we then derived importance scores (Liaw and Wiener2002) associated with individual explanatory variables. These scores are then normalised or scaled from 0 to 1, with a value of 1 denoting the utmost relevance and 0 signifying the lowest relevance concerning the model bias.

3 The performance of the ORCHIDEE model

3.1 Net and gross primary productivity (NPP and GPP)

Figure 1 compares the simulated NPP and GPP values with site observations (Sect. 2.2.1). Both simulations and observations exhibit comparable ranges across various PFTs, notably showcasing good performance in temperate forests and temperate C3 grasslands, where the relative differences in the medians are around 10 %. However, the ORCHIDEE simulation results often present a narrower range than the observed site data. This difference can be attributed to the fact that the ORCHIDEE PFTs are a rigid classification of vegetation, with each PFT representing the average characteristics of various tree species. In contrast, differences between individual species within the same PFT class can be substantial (Poulter et al.2011). On the contrary, observations refer to individual species.

The calibration of Vcmax and Fgrowthresp parameters for TeNE (see Table 1) resulted in considerable improvement, in particular for NPP. The default parameterisation (Vcmax=35 and Fgrowthresp=0.28) resulted in the simulated NPP median deviating by approximately 32 % from the observed value. The adjusted parameters (Vcmax=44.45 and Fgrowthresp=0.1) reduced deviation to 5 %.

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f01

Figure 1Maps of net and gross primary productivity (NPP and GPP) sites, along with boxplots comparing observations (Obs in black) and ORCHIDEE (ORC in red) simulations. The grey boxes show outputs from ORCHIDEE's default configuration (i.e. without calibrating Vcmax and Fgrowthresp parameters) for comparison purposes. In each boxplot, the number in parentheses indicates the number of sites in each plant functional type (PFT) group.

3.2 Above-ground biomass (AGB)

The boxplots in Fig. 2 present AGB versus age comparisons between observations (in black) and simulations (in red) for four PFTs (TeNE, TeBS, BoNE, and BoBS) and each age group. Other types of forest PFTs were excluded due to the limited number of observed samples. Simulations capture the same trend as the observations: AGB increases quickly in young stands (i.e. <60 years old) and moderately saturates at later ages (>60 years old). Like the NPP and GPP comparisons, the observed AGBs appear in more extensive ranges and have more extremes than the simulated values for all considered PFTs.

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f02

Figure 2Maps of above-ground biomass (AGB) sites for four plant functional types (PFTs), including temperate needleleaf and broadleaf, as well as boreal needleleaf and broadleaf forests (TeNE, TeBS, BoNE, BoBS), along with boxplots comparing observations (Obs in black) and ORCHIDEE (ORC in red) simulations. In the boxplot of the TeNE forest (a), the grey boxes present outputs from ORCHIDEE's default configuration (i.e. without calibrating Vcmax and Fgrowthresp parameters), for comparison purposes. In each boxplot, the number in parentheses indicates the number of sites in each age group (group 1: 0–19 years, group 2: 20–39 years, group 3: 40–59 years, group 4: 60–79 years, group 5: 80–99 years, group 6: >99 years). The colour scale in the maps indicates the ORCHIDEE vegetation fraction.

Again, the adjustment of Vcmax and Fgrowthresp parameters for TeNE (see Table 1) improved the simulated AGB age curves significantly, while in the original set-up, the simulated values are much lower compared to the observed ones. This improvement is visually represented in the boxplot of the TeNE forest, in Fig. 2a: the grey boxes, representing AGB values obtained from the default settings, indicate a median deviation of about 60 % from the observed values (black boxes) across five age groups. Conversely, with our calibration, this deviation is reduced to less than 10 %, as indicated by the red boxes, which now closely align with the observed data represented by the black boxes.

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f03

Figure 3The same as the boxplots in Fig. 2 but for the comparison of the observed values for the BoNE forest with simulated values for the TeNE forest (a), as well as observed values for the BoBS forest with simulated values for the TeBS forest (b).

Download

Furthermore, Fig. 2 highlights a contrast in ORCHIDEE performance; notably, boreal forests exhibit lower biomass per age class than temperate ones, illustrated by BoNE versus TeNE and BoBS versus TeBS forests. Interestingly, observed biomass ranges for BoNE and BoBS forests closely resemble those of TeNE and TeBS forests. Further comparisons are detailed in Fig. 3, despite variations in site numbers between observations and simulations for each age group. This comparative approach provides insights into and offers a broader understanding of how the model's parameterisation performs in Europe. We found that the parameterisation of BoNE and BoBS may need improvement, as they appear less well fitted than TeNE and TeBS. This might raise questions about the relevance and necessity of using BoNE and BoBS as distinct PFTs when TeNE and TeBS demonstrate better alignment with the observed data in the study region.

3.3 Soil organic carbon (SOC)

The SOC map showing 5150 LUCAS samples is presented in Fig. 4a. We also generated a corresponding SOC map using the ORCHIDEE simulation weighted by the areal proportions of each PFT. The difference between our simulated SOC stocks from LUCAS data is presented in Fig. 4b. The correlation between the observed and simulated SOCs is 0.4 (and RMSE = 2.03 kg m−2, rRMSE = 50.31 %), indicating moderate agreement. However, it is essential to note that this general ORCHIDEE simulation does not represent peatlands or important factors such as land management, effects of soil erosion and translocation of SOC from eroded sides to colluvial sediments, topographic wetness, land-use history before 1900, soil class, and geochemistry of soil-forming substrates, etc. Therefore, achieving a correlation coefficient of 0.4 is already significant. Notably, ORCHIDEE underestimates SOC in certain regions, particularly in northern Europe. This discrepancy can be explained by the absence of peatlands in this particular version of ORCHIDEE.

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f04

Figure 4Maps showing the stocks after soil organic carbon (SOC in kg m−2) based on the LUCAS topsoil database (a) and the deviation of simulated SOC stocks from these observation-based estimates (b).

In the following, we classified 5150 SOC samples into three vegetation groups (forest, grass, and crop) based on the land-use information provided in the LUCAS dataset. The assessment of observed and simulated SOC stocks is illustrated through comparison scores (COR, RMSE, and rRMSE) in Table S8 within the Supplement, along with scatterplots and histograms in Fig. 5, showcasing variations across different grid scales: 0.5° × 0.5°, 1° × 1°, 2° × 2°, and 3° × 3° cells. This stepwise aggregation aims to enhance our understanding of how far spatial correlations between observed and modelled SOC stocks are scale dependent. At the 0.5° × 0.5° scale, the correlation between observed and simulated SOC for forest sites (Fig. 5a1) is relatively low (COR = 0.17, rRMSE = 59.15 %). However, the correlation values are significantly better for grassland and cropland sites (Fig. 5b1 and c1), reaching 0.53 and 0.42, respectively (with corresponding rRMSE values of 39.38 % and 35.98 %). Interestingly, the correlation scores improve for all vegetation types as we increase the grid-scale size and, thus, the level of spatial aggregation. For example, when examining the 3° × 3° scale, as illustrated in Fig. 5a2, b2, and c2, the correlation coefficients increase to 0.45, 0.68, and 0.59 for forest, grassland, and cropland sites, respectively. These improvements in correlation are accompanied by decreasing rRMSE values (by 10 % to 15 %), indicating a reduction in the differences between observed and simulated SOC values. This effect can be attributed to various factors, such as small-scale variations (Garten et al.2007) related to soil class, topography, and management history, which are not accounted for in ORCHIDEE but lose their importance at a higher level of spatial aggregation. On the other hand, the coarse large-scale spatial patterns are primarily influenced by climate differences, which are better represented in a DGVM such as ORCHIDEE.

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f05

Figure 5Scatterplots show the relationship between LUCAS topsoil data and derived ORCHIDEE soil organic carbon (SOCLUCAS topsoil versus SOCderived ORC, in kg m−2), along with their corresponding histograms. Plots are presented for two grid scales: 0.5° × 0.5° (a1, b1, c1) and 3° × 3° (a2, b2, c2). Darker colours indicate denser point concentrations. Complementary summary statistics are provided, including the mean and standard deviation (SD) values for each dataset, along with the correlation (COR) and relative root mean square error (rRMSE) between the two datasets. The corresponding maps are also presented in the Supplement (Figs. S2 to S4).

Download

4 SOC change following land-use change (LUC)

Figure 6 compares observed and simulated SOC changes for different LUC transitions (see Table 2). During the C-to-G conversion, there is an increase in SOC stocks. However, the simulated results give a smaller increase than those observed in meta-analyses. Specifically, after a 100-year conversion period, the simulated SOC stocks increase on average by a mere 0.73 ± 0.09 kg m−2, while the observed data show a much higher increase of 3.85 ± 1.33 kg m−2. The G-to-C conversion leads to a decrease in SOC stocks. The model agrees with the observed change in direction but has a slower rate. Notably, the observed data display a wide range of confidence interval levels, and the simulated CRFs closely align with the upper boundary of the confidence interval. This highlights the difficulty of accurately capturing real-world SOC dynamics due to significant variability in the observed data.

Regarding G-to-F conversion, simulations using both TeBS and TeNE show different trends compared to the observed CRFs, as shown in the G-to-FwoFF and G-to-FwoFF subplots in Fig. 6. However, they consistently fall within the 95 % confidence interval, regardless of whether the forest floor is included in the analysis. In addition, the observed data for G-to-F conversions display considerable variability over time, which partly accounts for the difficulty in accurately modelling the true impact of this conversion type.

The conversions of C to F and F to C show opposite trends, as presented in the C-to-FwoFF, C-to-FwFF, and F-to-CwoFF subplots in Fig. 6: C-to-F conversion leads to an increase in SOC and vice versa. Again, the averaged simulated CRF results align with the observed direction but indicate changes considerably slower than those reported in meta-analyses. In these two conversions, simulations with the TeBS forest appear closer to the observations than those with the TeNE forest.

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f06

Figure 6The absolute soil organic carbon changes (in kg m−2) from site observations in meta-analyses (black circles) and the fitted carbon response function (CRF; black lines) ±95 % confidence interval (dotted black lines) compared to simulated CRFs (magenta and green lines) for different land-use changes (LUCs; as presented in Table 2): cropland to grassland (C to G), grassland to cropland (G to C), grassland to forest (without and with forest floor G to FwoFF, G to FwFF), cropland to forest (C to FwoFF and C to FwFF), and forest to cropland (F to CwoFF). The first number in the parentheses indicates the number of study sites, and the second is the number of samples in the meta-analyses. Two distinct forest types, namely temperate broadleaf summergreen and temperate needleleaf evergreen, are considered for the forest sites in ORCHIDEE simulations (ORCTeBS, ORCTeNE).

Download

Figure 6 suggests that the key model biases are the systematic underestimation of SOC gain during C-to-G transition and losses during G-to-C and F-to-CwoFF conversions. Multiple factors could contribute to these observed underestimations. As depicted in Fig. 7, soil erosion rate plays a pivotal role in the discrepancies observed across all considered LUC conversions among the six chosen factors. Conversely, temperature appears relatively less influential overall, except notably in the C-to-G conversion. Rainfall considerably influences the differences between observed and simulated absolute SOC changes after the conversions from C to G, C to FwoFF, and F to CwoFF. Soil phosphorus, on the other hand, demonstrates significance in the conversions of G to FwoFF, F to CwoFF (particularly for the TeNE forest), and C to G. Furthermore, the six chosen factors demonstrate a relatively consistent behaviour across the two forest types, as shown by the magenta and green bars for TeBS and TeNE (see Fig. 7).

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f07

Figure 7Scaled feature importance scores resulting from random forest (RF) analysis showing the relationship between model bias and potential influencing factors (i.e. temperature 2 m a.g.l. (T2m), rain, soil carbon-to-nitrogen ratio (CN), soil nitrogen (N), soil phosphorus (P), and soil erosion rate (ER)) for different land-use changes (LUCs): cropland to grassland (C to G), grassland to cropland (G to C), grassland to forest (without forest floor G to FwoFF), cropland to forest (C to FwoFF), and forest to cropland (F to CwoFF). Results for other conversions are not shown since RF shows poor performance (Table S7 in the Supplement). Each score is normalised within the range of 0 to 1, where 1 signifies the highest relevance, and 0 indicates the lowest importance. Two distinct forest types, namely temperate broadleaf summergreen and temperate needleleaf evergreen, are considered for the forest sites (TeBS, TeNE).

Download

5 Discussion

5.1 Model performance for biosphere carbon stocks

The ORCHIDEE model shows a reasonable alignment with observed NPP, GPP (Fig. 1), and AGB trends (Fig. 2). Compared to observed data for all PFTs, the model shows narrower ranges. This dampened spatial variability may be due to the model's coarse resolution and constant parameter values for a given PFT, differing from species-specific observations affected by finer-scale environmental variations (Chang et al.2013). The latter emphasises the need to incorporate a sufficiently large population of observed sites. Additionally, model–data disagreement can be linked to not-well-enough constrained values for PFT-specific parameters. For instance, our findings indicated that calibrating Vcmax and Fgrowthresp based on NPP and GPP observations for the TeNE forest type improved the model's performance in simulating AGB for this specific PFT (as shown in Fig. 2). Furthermore, our comparative analysis implied that employing temperate PFTs rather than boreal PFTs can enhance model performance in simulating the biomass of the boreal forests. This result suggests that certain PFTs, particularly those linked to boreal forest types, may be redundant in ORCHIDEE biomass simulations in the European context.

For SOC stock simulation, a Pearson's correlation of 0.4 between observed and simulated SOC values (Fig. 4) is significant, given the absence of certain controlling factors and processes in the model version used. This score is similar to that in other DGVM models (Wu et al.2019; Seiler et al.2022). For example, Wu et al. (2019) demonstrated a correlation coefficient of approximately 0.45 between LPJ-GUESS (a global dynamic ecosystem model) and SoilGrids (an observation-driven global soil dataset) on a global scale and lower correlation scores among different land cover classes. In this study, SOC scores varied among vegetation groups (Fig. 5), with lower correlations for forest sites. The inclusion of up to six PFTs in forest groups, with poorly determined classifications in observations, contributes to the model–data discrepancy. In contrast, grass and crop groups exhibit improved correlations with fewer PFTs and better distinction in LUCAS data (Ballot et al.2023). Additionally, the smaller population of forest sites (Fig. 5) may account for the lower score compared with the other groups. Additionally, when examining different levels of resolution, we find that larger grid scales demonstrate a stronger correlation, which may be driven by climate patterns (Wang et al.2023). At smaller scales, other environmental controls, like soil types, soil chemistry, topography, and management, become more important (Garten et al.2007), which are not or only rudimentarily represented in ORCHIDEE. Implementing ORCHIDEE at a higher resolution using higher-resolution climate forcing (Anav et al.2010; Lafont et al.2012) can be challenging. This complexity arises from the fundamental reliance of the ORCHIDEE model on low-resolution environmental factors such as soil characteristics and erosion. Overcoming these inherent limitations in ORCHIDEE, as well as other DGVMs, can significantly improve model performance, particularly at more regional scales and higher spatial resolutions.

5.2 Impacts of LUC on soil carbon stocks

In pursuit of a more comprehensive evaluation, we explored the applicability of meta-analyses of site-level SOC changes for “pure” land cover transitions in assessing DGVMs' ability to simulate SOC stock responses to LUC. As discussed earlier, DGVMs, including ORCHIDEE, face challenges in simulating SOC stocks at a small scale, making it difficult to capture the SOC stock response at individual sites. Nevertheless, the model should be capable of matching average responses across broader regions.

In our comparison, we averaged the model responses over all grid cells encompassing the sites where LUC has occurred. This enabled us to compare the model's response to the meta-analysis data and its fitted CRF. Generally, the simulated results agree in terms of direction with observed data, notably the decrease in soil carbon stocks for G-to-C and F-to-C conversions and the opposite for C-to-G and C-to-F conversions (Fig. 6). As for G-to-F conversions, the simulations exhibit different trends from the observed CRFs but fall within the 95 % confidence interval (Fig. 6). In addition, the meta-analysis data exhibit considerable uncertainties, evident in the wide confidence intervals around the fits in Fig. 6. These uncertainties can be attributed to challenges related to data compatibility, methodological heterogeneity, and the diversity of ecosystems and LUC scenarios considered, as discussed in prior studies (Verburg et al.2011; Deng et al.2016; Fohrafellner et al.2023). Therefore, while meta-analyses offer valuable insights, their interpretation requires careful consideration and integration with site-specific observations.

Despite this alignment in direction, there are noticeable discrepancies in the magnitudes of SOC stock changes between the simulated and observed CRFs, i.e. the underestimated SOC gain during C-to-G conversion and underestimated SOC losses during G-to-C and F-to-CwoFF conversions. These differences could potentially be attributed to various factors that the model may not fully capture. For instance, our findings indicate that soil erosion rate significantly influences the model bias among six selected potential factors. In addition, the influence of varying land management practices can substantially shape the model bias (Nyawira et al.2016). These complexities underscore the challenges involved in accurately simulating local SOC dynamics. Further investigations or adjustments will be essential to reduce the biases and thereby enhance the accuracy of the model estimations. Additionally, our idealised assumption regarding the transition year in 1950 may introduce uncertainties to the model outputs. However, as shown in Sect. S1 of the Supplement, considering the actual transition year does not significantly enhance agreement with observations. This might be due to the limited number of available samples. It is also possible that the impact of climate change on LUC effects over the past century is not substantial. If the latter is true, using an idealised transition year should not create significant issues.

5.3 Challenges in model–data comparisons

Evaluating DGVM outputs against observational data is challenging, primarily due to constraints on the quantity and quality of existing long-term observational datasets. While observational data exist, their scarcity is evident, exemplified in Fig. 1, particularly in the instances of NPP and GPP sites for several PFTs like BoBS, TeBE, TeGC3, and BoGC3. Furthermore, as previously highlighted, substantial uncertainties persist in observed changes in SOC stocks when contrasted with anticipated changes. These limitations introduce intricacy into the process of calibrating and validating our models.

Another significant challenge arises from the long-lasting impact (e.g. >100 years) of historical LUC, particularly in the case of substantial events like erosion (Bakker et al.2005; Borrelli et al.2017). The absence of site history information hinders our ability to incorporate these effects into our simulations (Verburg et al.2011). Disregarding the influence of major historical LUC events may lead to accurate simulations but for the wrong reasons. This approach further complicates our ability to predict changes in SOC stocks. In addition, failing to simulate LUC impacts accurately can have significant consequences for forecasting future land carbon balances and influencing decisions related to climate change mitigation and land management. To gain a more comprehensive perspective, we consider assessing the relative importance of SOC stock changes versus biomass carbon stock changes over, for instance, a 30-year horizon. This analysis can be relevant for initiatives like the European Green Deal (European Council2019), as it could offer essential guidance for shaping policies related to carbon sequestration, sustainable land-use practices, and preserving ecosystem health.

6 Conclusions

Our research investigated the ability of the DGVM ORCHIDEE model to reproduce what is known from experimental studies about LUC impacts on biospheric carbon. We performed various comparisons between simulations and experimental data, including on-site measurements and data from meta-analyses.

Discrepancies between the model and data can be attributed to several factors, such as the grouping of vegetation in DGVMs, which often use a limited number of PFTs, unlike the species-specific observations. The coarse model resolution also contributes to discrepancies. For example, our spatially explicit simulation of SOC stocks has a spatial resolution of 0.5°, whereas, in reality, SOC stocks and their controlling factors vary at a much smaller scale. Our analysis also identifies potential factors contributing to model bias when studying the impact of LUCs on SOC. Various factors, such as soil erosion rate, phosphorus, or rainfall, can influence each type of LUC. Further studies are needed to explore these impacts more comprehensively.

In summary, this study enhances our understanding of using DGVMs for studying carbon dynamics and provides insights for future model development and applications. While ORCHIDEE was our chosen model, this methodology can be readily applied to other DGVMs using the same protocol.

Appendix A: ORCHIDEE carbon module

Figure A1 presents the basic scheme of biospheric carbon cycling representation in ORCHIDEE. Simulated carbon dynamics include the exchange of carbon between the atmosphere and various carbon pools in vegetation biomass and soils. Carbon dynamics are simulated for each PFT individually, distinguishing eight vegetation biomass pools (leaves, roots, above- and below-ground sapwood, above- and below-ground heartwood, fruits, and a plant carbohydrate reserve), four litter pools (structural and metabolic litter above and below the surface), and three SOC pools (active, slow, and passive soil carbon). The turnover time of SOC and litter pools is determined by various factors, including temperature and humidity of the soil. The litter is produced through senescence and death, and the latter can also be related to LUC when the original vegetation is destroyed to make space for the new PFT. Further, carbon fluxes occur from litter to SOC pools and between the three SOC pools, with a part of the transferred carbon lost to the atmosphere through heterotrophic respiration. The model does not consider nutrient cycling, depth distribution of SOC, or soil carbon losses through leaching and erosion. Detailed formulations of the main processes represented in the version of ORCHIDEE used in this study can be found in Appendix A of Krinner et al. (2005).

https://gmd.copernicus.org/articles/17/6725/2024/gmd-17-6725-2024-f08

Figure A1Basic structure of ORCHIDEE carbon module. The processes are denoted by rounded rectangles, while the reservoirs are represented by regular rectangles (accompanied by corresponding basic state variables in bold). The sub-processes are linked through carbon fluxes (depicted as black arrows). The figure is adapted from Krinner et al. (2005).

Code and data availability
Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/gmd-17-6725-2024-supplement.

Author contributions

All authors conceptualised this research. TLAD and RL designed the simulations, and TLAD implemented the simulations. TLAD collected the data. All authors discussed the analysing methods. TLAD conducted the analysis and wrote the paper. All authors discussed the results and revised the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

For the NPP and GPP datasets, we thank all site investigators, their funding agencies, the various regional flux networks (AfriFlux, AmeriFlux, AsiaFlux, CarboAfrica, CarboEurope-IP, ChinaFlux, Fluxnet-Canada, KoFlux, LBA, NECC, OzFlux, TCOS-Siberia, USCCC), and the Fluxnet project, whose support is essential for obtaining the measurements without which the type of integrated analyses conducted in this study would not be possible.

We thank Nuno Carvalhais and Simon Besnard for providing us access to the in situ biomass and age data, and we thank Sebastiaan Luyssaert for providing the NPP and GPP data.

We also thank Nuno Carvalhais and Wei Li for their valuable comments on the initial version of the manuscript.

This research received funding from the French state aid, managed by ANR under the Investissements d'avenir programme (ANR-16-CONV-0003), and the European Union’s Horizon Europe Research and Innovation programme under grant agreement no. 101060423.

Financial support

This research has been supported by the EU Horizon 2020 (grant no. 101060423) and the Agence Nationale de la Recherche (grant no. ANR-16-CONV-0003).

Review statement

This paper was edited by Sam Rabin and reviewed by two anonymous referees.

References

Anav, A., D'Andrea, F., Viovy, N., and Vuichard, N.: A validation of heat and carbon fluxes from high-resolution land surface and regional models, J. Geophys. Res.-Biogeo., 115, G04016, https://doi.org/10.1029/2009JG001178, 2010. a

Anderson-Teixeira, K. J., Wang, M. M. H., McGarvey, J. C., Herrmann, V., Tepley, A. J., Bond-Lamberty, B., and LeBauer, D. S.: ForC: a global database of forest carbon stocks and fluxes, Ecology, 99, 1507, https://doi.org/10.1002/ecy.2229, 2018. a

Arora, V. K. and Boer, G. J.: Uncertainties in the 20th century carbon budget associated with land use change, Glob. Change Biol., 16, 3327–3348, https://doi.org/10.1111/j.1365-2486.2010.02202.x, 2010. a

Bakker, M. M., Govers, G., Kosmas, C., Vanacker, V., van Oost, K., and Rounsevell, M.: Soil erosion as a driver of land-use change, Agr. Ecosyst. Environ., 105, 467–481, https://doi.org/10.1016/j.agee.2004.07.009, 2005. a

Ballabio, C., Lugato, E., Fernández-Ugalde, O., Orgiazzi, A., Jones, A., Borrelli, P., Montanarella, L., and Panagos, P.: Mapping LUCAS topsoil chemical properties at European scale using Gaussian process regression, Geoderma, 355, 113912, https://doi.org/10.1016/j.geoderma.2019.113912, 2019. a

Ballot, R., Guilpart, N., and Jeuffroy, M.-H.: The first map of crop sequence types in Europe over 2012–2018, Earth Syst. Sci. Data, 15, 5651–5666, https://doi.org/10.5194/essd-15-5651-2023, 2023. a

Batjes, N.: Total carbon and nitrogen in the soils of the world, Eur. J. Soil Sci., 47, 151–163, https://doi.org/10.1111/j.1365-2389.1996.tb01386.x, 1996. a

Besnard, S., Koirala, S., Santoro, M., Weber, U., Nelson, J., Gütter, J., Herault, B., Kassi, J., N'Guessan, A., Neigh, C., Poulter, B., Zhang, T., and Carvalhais, N.: Mapping global forest age from forest inventories, biomass and climate data, Earth Syst. Sci. Data, 13, 4881–4896, https://doi.org/10.5194/essd-13-4881-2021, 2021 (data available at: https://www.bgc-jena.mpg.de/geodb/projects/FileDetails.php, last access: 10 May 2023). a, b

Borrelli, P., Robinson, D. A., Fleischer, L. R., Lugato, E., Ballabio, C., Alewell, C., Meusburger, K., Modugno, S., Schütt, B., Ferro, V., Bagarello, V., Oost, K. V., Montanarella, L., and Panagos, P.: An assessment of the global impact of 21st century land use change on soil erosion, Nat. Commun., 8, 2013, https://doi.org/10.1038/s41467-017-02142-7, 2017. a

Boucher, O., Servonnat, J., Albright, A. L., Aumont, O., Balkanski, Y., Bastrikov, V., Bekki, S., Bonnet, R., Bony, S., Bopp, L., Braconnot, P., Brockmann, P., Cadule, P., Caubel, A., Cheruy, F., Codron, F., Cozic, A., Cugnet, D., D'Andrea, F., Davini, P., de Lavergne, C., Denvil, S., Deshayes, J., Devilliers, M., Ducharne, A., Dufresne, J.-L., Dupont, E., Éthé, C., Fairhead, L., Falletti, L., Flavoni, S., Foujols, M.-A., Gardoll, S., Gastineau, G., Ghattas, J., Grandpeix, J.-Y., Guenet, B., Guez, Lionel, E., Guilyardi, E., Guimberteau, M., Hauglustaine, D., Hourdin, F., Idelkadi, A., Joussaume, S., Kageyama, M., Khodri, M., Krinner, G., Lebas, N., Levavasseur, G., Lévy, C., Li, L., Lott, F., Lurton, T., Luyssaert, S., Madec, G., Madeleine, J.-B., Maignan, F., Marchand, M., Marti, O., Mellul, L., Meurdesoif, Y., Mignot, J., Musat, I., Ottlé, C., Peylin, P., Planton, Y., Polcher, J., Rio, C., Rochetin, N., Rousset, C., Sepulchre, P., Sima, A., Swingedouw, D., Thiéblemont, R., Traore, A. K., Vancoppenolle, M., Vial, J., Vialard, J., Viovy, N., and Vuichard, N.: Presentation and Evaluation of the IPSL-CM6A-LR Climate Model, J. Adv. Model. Earth Sy., 12, e2019MS002010, https://doi.org/10.1029/2019MS002010, 2020. a

Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001. a

Campioli, M., Vicca, S., Luyssaert, S., Bilcke, J., Ceschia, E., Chapin III, F. S., Ciais, P., Fernández-Martínez, M., Malhi, Y., Obersteiner, M., Olefeldt, D., Papale, D., Piao, S. L., Peñuelas, J., Sullivan, P. F., Wang, X., Zenone, T., and Janssens, I. A.: Biomass production efficiency controlled by management in temperate and boreal ecosystems, Nat. Geosci., 8, 843–846, https://doi.org/10.1038/ngeo2553, 2015. a, b

Canadell, J. G. and Schulze, E. D.: Global potential of biospheric carbon management for climate mitigation, Nat. Commun., 5, 5282, https://doi.org/10.1038/ncomms6282, 2014. a, b

Chang, J. F., Viovy, N., Vuichard, N., Ciais, P., Wang, T., Cozic, A., Lardy, R., Graux, A.-I., Klumpp, K., Martin, R., and Soussana, J.-F.: Incorporating grassland management in ORCHIDEE: model description and evaluation at 11 eddy-covariance sites in Europe, Geosci. Model Dev., 6, 2165–2181, https://doi.org/10.5194/gmd-6-2165-2013, 2013. a

d'Andrimont, R., Yordanov, M., Martinez-Sanchez, L., Eiselt, B., Palmieri, A., Dominici, P., Gallego, J., Reuter, H. I., Joebges, C., Lemoine, G., and van der Velde, M.: Harmonised LUCAS in-situ land cover and use database for field surveys from 2006 to 2018 in the European Union, Scientific Data, 7, 352, https://doi.org/10.1038/s41597-020-00675-z, 2020. a

de Rosnay, P. and Polcher, J.: Modelling root water uptake in a complex land surface scheme coupled to a GCM, Hydrol. Earth Syst. Sci., 2, 239–255, https://doi.org/10.5194/hess-2-239-1998, 1998. a

Deng, L., Liu, G.-B., and Shangguan, Z.-P.: Land-use conversion and changing soil carbon stocks in China's “Grain-for-Green” Program: a synthesis, Glob. Change Biol., 20, 3544–3556, https://doi.org/10.1111/gcb.12508, 2014. a

Deng, L., yu Zhu, G., sheng Tang, Z., and ping Shangguan, Z.: Global patterns of the effects of land-use changes on soil carbon stocks, Global Ecology and Conservation, 5, 127–138, https://doi.org/10.1016/j.gecco.2015.12.004, 2016. a, b

Dinh, T. L. A. and Aires, F.: Nested leave-two-out cross-validation for the optimal crop yield model selection, Geosci. Model Dev., 15, 3519–3535, https://doi.org/10.5194/gmd-15-3519-2022, 2022. a

Ducoudré, N. I., Laval, K., and Perrier, A.: SECHIBA, a New Set of Parameterizations of the Hydrologic Exchanges at the Land-Atmosphere Interface within the LMD Atmospheric General Circulation Model, J. Climate, 6, 248–273, https://doi.org/10.1175/1520-0442(1993)006<0248:SANSOP>2.0.CO;2, 1993. a

European Council: European Green Deal, https://www.consilium.europa.eu/en/policies/green-deal/ (last access: 20 October 2023), 2019. a

Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016, 2016. a

Fendrich, A. N., Ciais, P., Lugato, E., Carozzi, M., Guenet, B., Borrelli, P., Naipal, V., McGrath, M., Martin, P., and Panagos, P.: Matrix representation of lateral soil movements: scaling and calibrating CE-DYNAM (v2) at a continental level, Geosci. Model Dev., 15, 7835–7857, https://doi.org/10.5194/gmd-15-7835-2022, 2022. a

Fernandez-Ugalde, O., Scarpa, S., Orgiazzi, A., Panagos, P., Van Liedekerke, M., Maréchal, A., and Jones, A.: LUCAS 2018 Soil Module, KJ-NA-31-144-EN-N (online), Publications Office of the European Union, Luxembourg (Luxembourg), ISBN 978-92-76-54832-4 (online), https://doi.org/10.2760/215013, 2022 (data available at: https://esdac.jrc.ec.europa.eu/content/lucas-2018-topsoil-data, last access: 3 August 2023). a, b

Fohrafellner, J., Zechmeister-Boltenstern, S., Murugan, R., and Valkama, E.: Quality assessment of meta-analyses on soil organic carbon, SOIL, 9, 117–140, https://doi.org/10.5194/soil-9-117-2023, 2023. a, b

Friedlingstein, P., O'Sullivan, M., Jones, M. W., Andrew, R. M., Bakker, D. C. E., Hauck, J., Landschützer, P., Le Quéré, C., Luijkx, I. T., Peters, G. P., Peters, W., Pongratz, J., Schwingshackl, C., Sitch, S., Canadell, J. G., Ciais, P., Jackson, R. B., Alin, S. R., Anthoni, P., Barbero, L., Bates, N. R., Becker, M., Bellouin, N., Decharme, B., Bopp, L., Brasika, I. B. M., Cadule, P., Chamberlain, M. A., Chandra, N., Chau, T.-T.-T., Chevallier, F., Chini, L. P., Cronin, M., Dou, X., Enyo, K., Evans, W., Falk, S., Feely, R. A., Feng, L., Ford, D. J., Gasser, T., Ghattas, J., Gkritzalis, T., Grassi, G., Gregor, L., Gruber, N., Gürses, Ö., Harris, I., Hefner, M., Heinke, J., Houghton, R. A., Hurtt, G. C., Iida, Y., Ilyina, T., Jacobson, A. R., Jain, A., Jarníková, T., Jersild, A., Jiang, F., Jin, Z., Joos, F., Kato, E., Keeling, R. F., Kennedy, D., Klein Goldewijk, K., Knauer, J., Korsbakken, J. I., Körtzinger, A., Lan, X., Lefèvre, N., Li, H., Liu, J., Liu, Z., Ma, L., Marland, G., Mayot, N., McGuire, P. C., McKinley, G. A., Meyer, G., Morgan, E. J., Munro, D. R., Nakaoka, S.-I., Niwa, Y., O'Brien, K. M., Olsen, A., Omar, A. M., Ono, T., Paulsen, M., Pierrot, D., Pocock, K., Poulter, B., Powis, C. M., Rehder, G., Resplandy, L., Robertson, E., Rödenbeck, C., Rosan, T. M., Schwinger, J., Séférian, R., Smallman, T. L., Smith, S. M., Sospedra-Alfonso, R., Sun, Q., Sutton, A. J., Sweeney, C., Takao, S., Tans, P. P., Tian, H., Tilbrook, B., Tsujino, H., Tubiello, F., van der Werf, G. R., van Ooijen, E., Wanninkhof, R., Watanabe, M., Wimart-Rousseau, C., Yang, D., Yang, X., Yuan, W., Yue, X., Zaehle, S., Zeng, J., and Zheng, B.: Global Carbon Budget 2023, Earth Syst. Sci. Data, 15, 5301–5369, https://doi.org/10.5194/essd-15-5301-2023, 2023. a

Garten, C. T., Kang, S., Brice, D. J., Schadt, C. W., and Zhou, J.: Variability in soil properties at different spatial scales (1 m–1 km) in a deciduous forest ecosystem, Soil Biol. Biochem., 39, 2621–2627, https://doi.org/10.1016/j.soilbio.2007.04.033, 2007. a, b

Guo, L. B. and Gifford, R. M.: Soil carbon stocks and land use change: a meta analysis, Glob. Change Biol., 8, 345–360, https://doi.org/10.1046/j.1354-1013.2002.00486.x, 2002. a, b

Hurtt, G. C., Chini, L., Sahajpal, R., Frolking, S., Bodirsky, B. L., Calvin, K., Doelman, J. C., Fisk, J., Fujimori, S., Klein Goldewijk, K., Hasegawa, T., Havlik, P., Heinimann, A., Humpenöder, F., Jungclaus, J., Kaplan, J. O., Kennedy, J., Krisztin, T., Lawrence, D., Lawrence, P., Ma, L., Mertz, O., Pongratz, J., Popp, A., Poulter, B., Riahi, K., Shevliakova, E., Stehfest, E., Thornton, P., Tubiello, F. N., van Vuuren, D. P., and Zhang, X.: Harmonization of global land use change and management for the period 850–2100 (LUH2) for CMIP6, Geosci. Model Dev., 13, 5425–5464, https://doi.org/10.5194/gmd-13-5425-2020, 2020. a

IPCC: Climate Change and Land: IPCC Special Report on Climate Change, Desertification, Land Degradation, Sustainable Land Management, Food Security, and Greenhouse Gas Fluxes in Terrestrial Ecosystems, Cambridge University Press, https://doi.org/10.1017/9781009157988, 2022. a

IPCC: Climate Change 2023: Synthesis Report. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Core Writing Team, edited by: Lee, H. and Romero, J., IPCC, Geneva, Switzerland, https://doi.org/10.59327/IPCC/AR6-9789291691647, 2023. a

Jobbágy, E. G. and Jackson, R. B.: The Vertical Distribution of Soil Organic Carbon and its Relation to Climate and Vegetation, Ecol. Appl., 10, 423–436, https://doi.org/10.1890/1051-0761(2000)010[0423:TVDOSO]2.0.CO;2, 2000. a

Jones, A., Fernández-Ugalde, O., and Scarpa, S.: LUCAS 2015 Topsoil Survey, KJ-NA-30332-EN-N (online), Publications Office of the European Union, Luxembourg (Luxembourg), ISBN 978-92-76-21080-1 (online), https://esdac.jrc.ec.europa.eu/public_path/shared_folder/dataset/66/JRC121325_lucas_2015_topsoil_survey_final_1.pdf (last access: 3 August 2023), 2020. a

Krinner, G., Viovy, N., de Noblet-Ducoudré, N., Ogée, J., Polcher, J., Friedlingstein, P., Ciais, P., Sitch, S., and Prentice, I. C.: A dynamic global vegetation model for studies of the coupled atmosphere-biosphere system, Global Biogeochem. Cy., 19, GB1015, https://doi.org/10.1029/2003GB002199, 2005 (data available at: http://forge.ipsl.jussieu.fr/orchidee/browser/branches/ORCHIDEE_2_2, last access: 3 April 2023). a, b, c, d, e, f

Lafont, S., Zhao, Y., Calvet, J.-C., Peylin, P., Ciais, P., Maignan, F., and Weiss, M.: Modelling LAI, surface water and carbon fluxes at high-resolution over France: comparison of ISBA-A-gs and ORCHIDEE, Biogeosciences, 9, 439–456, https://doi.org/10.5194/bg-9-439-2012, 2012. a

Laganière, J., Angers, D. A., and Paré, D.: Carbon accumulation in agricultural soils after afforestation: a meta-analysis, Glob. Change Biol., 16, 439–453, https://doi.org/10.1111/j.1365-2486.2009.01930.x, 2010. a, b

Lal, R.: Soil Carbon Sequestration Impacts on Global Climate Change and Food Security, Science, 304, 1623–1627, https://doi.org/10.1126/science.1097396, 2004. a

Lal, R.: Carbon sequestration, Philos. T. R. Soc. B, 363, 815–830, https://doi.org/10.1098/rstb.2007.2185, 2008. a

Le Quéré, C., Moriarty, R., Andrew, R. M., Canadell, J. G., Sitch, S., Korsbakken, J. I., Friedlingstein, P., Peters, G. P., Andres, R. J., Boden, T. A., Houghton, R. A., House, J. I., Keeling, R. F., Tans, P., Arneth, A., Bakker, D. C. E., Barbero, L., Bopp, L., Chang, J., Chevallier, F., Chini, L. P., Ciais, P., Fader, M., Feely, R. A., Gkritzalis, T., Harris, I., Hauck, J., Ilyina, T., Jain, A. K., Kato, E., Kitidis, V., Klein Goldewijk, K., Koven, C., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lima, I. D., Metzl, N., Millero, F., Munro, D. R., Murata, A., Nabel, J. E. M. S., Nakaoka, S., Nojiri, Y., O'Brien, K., Olsen, A., Ono, T., Pérez, F. F., Pfeil, B., Pierrot, D., Poulter, B., Rehder, G., Rödenbeck, C., Saito, S., Schuster, U., Schwinger, J., Séférian, R., Steinhoff, T., Stocker, B. D., Sutton, A. J., Takahashi, T., Tilbrook, B., van der Laan-Luijkx, I. T., van der Werf, G. R., van Heuven, S., Vandemark, D., Viovy, N., Wiltshire, A., Zaehle, S., and Zeng, N.: Global Carbon Budget 2015, Earth Syst. Sci. Data, 7, 349–396, https://doi.org/10.5194/essd-7-349-2015, 2015. a

Li, W., Ciais, P., Guenet, B., Peng, S., Chang, J., Chaplot, V., Khudyaev, S., Peregon, A., Piao, S., Wang, Y., and Yue, C.: Temporal response of soil organic carbon after grassland-related land-use change, Glob. Change Biol., 24, 4731–4746, https://doi.org/10.1111/gcb.14328, 2018. a, b

Liaw, A. and Wiener, M.: Classification and Regression by randomForest, R News, 2, 18–22, https://CRAN.R-project.org/doc/Rnews/ (last access: 12 May 2023), 2002. a, b

Lurton, T., Balkanski, Y., Bastrikov, V., Bekki, S., Bopp, L., Braconnot, P., Brockmann, P., Cadule, P., Contoux, C., Cozic, A., Cugnet, D., Dufresne, J.-L., Éthé, C., Foujols, M.-A., Ghattas, J., Hauglustaine, D., Hu, R.-M., Kageyama, M., Khodri, M., Lebas, N., Levavasseur, G., Marchand, M., Ottlé, C., Peylin, P., Sima, A., Szopa, S., Thiéblemont, R., Vuichard, N., and Boucher, O.: Implementation of the CMIP6 Forcing Data in the IPSL-CM6A-LR Model, J. Adv. Model. Earth Sy., 12, e2019MS001940, https://doi.org/10.1029/2019MS001940, 2020. a

Luyssaert, S., Inglima, I., Jung, M., Richardson, A., Reichsteins, M., Papale, D., Piao, S., Schulzes, E., Wingate, L., Matteucci, G., Aragao, L., Aubinet, M., Beers, C., Bernhoffer, C., Black, K., Bonal, D., Bonnefond, J., Chambers, J., Ciais, P., Cook, B., Davis, K., Dolman, A., Gielen, B., Goulden, M., Grace, J., Granier, A., Grelle, A., Griffis, T., Gruenwald, T., Guidolotti, G., Hanson, P., Harding, R., Hollinger, D., Hutyra, L., Kolar, P., Kruijt, B., Kutsch, W., Lagergren, F., Laurila, T., Law, B., Le Maire, G., Lindroth, A., Loustau, D., Malhi, Y., Mateus, J., Migliavacca, M., Misson, L., Montagnani, L., Moncrieff, J., Moors, E., Munger, J., Nikinmaa, E., Ollinger, S., Pita, G., Rebmann, C., Roupsard, O., Saigusa, N., Sanz, M., Seufert, G., Sierra, C., Smith, M., Tang, J., Valentini, R., Vesala, T., and Janssens, I.: CO2 balance of boreal, temperate, and tropical forests derived from a global database, Glob. Change Biol., 13, 2509–2537, https://doi.org/10.1111/j.1365-2486.2007.01439.x, 2007. a, b, c, d

Luyssaert, S., Inglima, I., and Jung, M.: Global Forest Ecosystem Structure and Function Data for Carbon Balance Research, Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. [data set], https://doi.org/10.3334/ORNLDAAC/949, 2009. a

Nyawira, S. S., Nabel, J. E. M. S., Don, A., Brovkin, V., and Pongratz, J.: Soil carbon response to land-use change: evaluation of a global vegetation model using observational meta-analyses, Biogeosciences, 13, 5661–5675, https://doi.org/10.5194/bg-13-5661-2016, 2016. a, b, c

Orgiazzi, A., Ballabio, C., Panagos, P., Jones, A., and Fernández-Ugalde, O.: LUCAS Soil, the largest expandable soil dataset for Europe: a review, Eur. J. Soil Sci., 69, 140–153, https://doi.org/10.1111/ejss.12499, 2018. a

Pastorello, G., Trotta, C., Canfora, E., Chu, H., Christianson, D., Cheah, Y.-W., Poindexter, C., Chen, J., Elbashandy, A., Humphrey, M., Isaac, P., Polidori, D., Reichstein, M., Ribeca, A., van Ingen, C., Vuichard, N., Zhang, L., Amiro, B., Ammann, C., Arain, M. A., Ardö, J., Arkebauer, T., Arndt, S. K., Arriga, N., Aubinet, M., Aurela, M., Baldocchi, D., Barr, A., Beamesderfer, E., Marchesini, L. B., Bergeron, O., Beringer, J., Bernhofer, C., Berveiller, D., Billesbach, D., Black, T. A., Blanken, P. D., Bohrer, G., Boike, J., Bolstad, P. V., Bonal, D., Bonnefond, J.-M., Bowling, D. R., Bracho, R., Brodeur, J., Brümmer, C., Buchmann, N., Burban, B., Burns, S. P., Buysse, P., Cale, P., Cavagna, M., Cellier, P., Chen, S., Chini, I., Christensen, T. R., Cleverly, J., Collalti, A., Consalvo, C., Cook, B. D., Cook, D., Coursolle, C., Cremonese, E., Curtis, P. S., D'Andrea, E., da Rocha, H., Dai, X., Davis, K. J., Cinti, B. D., Grandcourt, A. d., Ligne, A. D., De Oliveira, R. C., Delpierre, N., Desai, A. R., Di Bella, C. M., Tommasi, P. d., Dolman, H., Domingo, F., Dong, G., Dore, S., Duce, P., Dufrêne, E., Dunn, A., Dušek, J., Eamus, D., Eichelmann, U., ElKhidir, H. A. M., Eugster, W., Ewenz, C. M., Ewers, B., Famulari, D., Fares, S., Feigenwinter, I., Feitz, A., Fensholt, R., Filippa, G., Fischer, M., Frank, J., Galvagno, M., Gharun, M., Gianelle, D., Gielen, B., Gioli, B., Gitelson, A., Goded, I., Goeckede, M., Goldstein, A. H., Gough, C. M., Goulden, M. L., Graf, A., Griebel, A., Gruening, C., Grünwald, T., Hammerle, A., Han, S., Han, X., Hansen, B. U., Hanson, C., Hatakka, J., He, Y., Hehn, M., Heinesch, B., Hinko-Najera, N., Hörtnagl, L., Hutley, L., Ibrom, A., Ikawa, H., Jackowicz-Korczynski, M., Janouš, D., Jans, W., Jassal, R., Jiang, S., Kato, T., Khomik, M., Klatt, J., Knohl, A., Knox, S., Kobayashi, H., Koerber, G., Kolle, O., Kosugi, Y., Kotani, A., Kowalski, A., Kruijt, B., Kurbatova, J., Kutsch, W. L., Kwon, H., Launiainen, S., Laurila, T., Law, B., Leuning, R., Li, Y., Liddell, M., Limousin, J.-M., Lion, M., Liska, A. J., Lohila, A., López-Ballesteros, A., López-Blanco, E., Loubet, B., Loustau, D., Lucas-Moffat, A., Lüers, J., Ma, S., Macfarlane, C., Magliulo, V., Maier, R., Mammarella, I., Manca, G., Marcolla, B., Margolis, H. A., Marras, S., Massman, W., Mastepanov, M., Matamala, R., Matthes, J. H., Mazzenga, F., McCaughey, H., McHugh, I., McMillan, A. M. S., Merbold, L., Meyer, W., Meyers, T., Miller, S. D., Minerbi, S., Moderow, U., Monson, R. K., Montagnani, L., Moore, C. E., Moors, E., Moreaux, V., Moureaux, C., Munger, J. W., Nakai, T., Neirynck, J., Nesic, Z., Nicolini, G., Noormets, A., Northwood, M., Nosetto, M., Nouvellon, Y., Novick, K., Oechel, W., Olesen, J. E., Ourcival, J.-M., Papuga, S. A., Parmentier, F.-J., Paul-Limoges, E., Pavelka, M., Peichl, M., Pendall, E., Phillips, R. P., Pilegaard, K., Pirk, N., Posse, G., Powell, T., Prasse, H., Prober, S. M., Rambal, S., Rannik, Ü., Raz-Yaseef, N., Rebmann, C., Reed, D., Dios, V. R. d., Restrepo-Coupe, N., Reverter, B. R., Roland, M., Sabbatini, S., Sachs, T., Saleska, S. R., Sánchez-Cañete, E. P., Sanchez-Mejia, Z. M., Schmid, H. P., Schmidt, M., Schneider, K., Schrader, F., Schroder, I., Scott, R. L., Sedlák, P., Serrano-Ortíz, P., Shao, C., Shi, P., Shironya, I., Siebicke, L., Šigut, L., Silberstein, R., Sirca, C., Spano, D., Steinbrecher, R., Stevens, R. M., Sturtevant, C., Suyker, A., Tagesson, T., Takanashi, S., Tang, Y., Tapper, N., Thom, J., Tomassucci, M., Tuovinen, J.-P., Urbanski, S., Valentini, R., van der Molen, M., van Gorsel, E., van Huissteden, K., Varlagin, A., Verfaillie, J., Vesala, T., Vincke, C., Vitale, D., Vygodskaya, N., Walker, J. P., Walter-Shea, E., Wang, H., Weber, R., Westermann, S., Wille, C., Wofsy, S., Wohlfahrt, G., Wolf, S., Woodgate, W., Li, Y., Zampedri, R., Zhang, J., Zhou, G., Zona, D., Agarwal, D., Biraud, S., Torn, M., and Papale, D.: The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data, Scientific Data, 7, 225, https://doi.org/10.1038/s41597-020-0534-3, 2020 (data available at: https://fluxnet.org/data/fluxnet2015-dataset/, last access: 27 June 2023). a, b, c

Poeplau, C. and Don, A.: Carbon sequestration in agricultural soils via cultivation of cover crops – A meta-analysis, Agr. Ecosyst. Environ., 200, 33–41, https://doi.org/10.1016/j.agee.2014.10.024, 2015. a

Poeplau, C., Don, A., Vesterdal, L., Leifeld, J., Van Wesemael, B., Schumacher, J., and Gensior, A.: Temporal dynamics of soil organic carbon after land-use change in the temperate zone – carbon response functions as a model approach, Glob. Change Biol., 17, 2415–2427, https://doi.org/10.1111/j.1365-2486.2011.02408.x, 2011. a, b

Post, W. M., and Zobler, L.: Global Soil Types, 0.5-Degree Grid (Modified Zobler), ORNL DAAC, Oak Ridge, Tennessee, USA [data set], https://doi.org/10.3334/ORNLDAAC/540, 2000. a

Poulter, B., Ciais, P., Hodson, E., Lischke, H., Maignan, F., Plummer, S., and Zimmermann, N. E.: Plant functional type mapping for earth system models, Geosci. Model Dev., 4, 993–1010, https://doi.org/10.5194/gmd-4-993-2011, 2011. a

Sanderman, J., Hengl, T., and Fiske, G. J.: Soil carbon debt of 12,000 years of human land use, P. Natl. Acad. Sci. USA, 114, 9575–9580, https://doi.org/10.1073/pnas.1706103114, 2017. a

Schepaschenko, D., Shvidenko, A., Usoltsev, V., Lakyda, P., Luo, Y., Vasylyshyn, R., Lakyda, I., Myklush, Y., See, L., McCallum, I., Fritz, S., Kraxner, F., and Obersteiner, M.: A dataset of forest biomass structure for Eurasia, Scientific Data, 4, 170070, https://doi.org/10.1038/sdata.2017.70, 2017. a, b

Seiler, C., Melton, J. R., Arora, V. K., Sitch, S., Friedlingstein, P., Anthoni, P., Goll, D., Jain, A. K., Joetzjer, E., Lienert, S., Lombardozzi, D., Luyssaert, S., Nabel, J. E. M. S., Tian, H., Vuichard, N., Walker, A. P., Yuan, W., and Zaehle, S.: Are Terrestrial Biosphere Models Fit for Simulating the Global Land Carbon Sink?, J. Adv. Model. Earth Sy., 14, e2021MS002946, https://doi.org/10.1029/2021MS002946, 2022.  a

Sitch, S., Smith, B., Prentice, I. C., Arneth, A., Bondeau, A., Cramer, W., Kaplan, J. O., Levis, S., Lucht, W., Sykes, M. T., Thonicke, K., and Venevsky, S.: Evaluation of ecosystem dynamics, plant geography and terrestrial carbon cycling in the LPJ dynamic global vegetation model, Glob. Change Biol., 9, 161–185, https://doi.org/10.1046/j.1365-2486.2003.00569.x, 2003. a

Sitch, S., Friedlingstein, P., Gruber, N., Jones, S. D., Murray-Tortarolo, G., Ahlström, A., Doney, S. C., Graven, H., Heinze, C., Huntingford, C., Levis, S., Levy, P. E., Lomas, M., Poulter, B., Viovy, N., Zaehle, S., Zeng, N., Arneth, A., Bonan, G., Bopp, L., Canadell, J. G., Chevallier, F., Ciais, P., Ellis, R., Gloor, M., Peylin, P., Piao, S. L., Le Quéré, C., Smith, B., Zhu, Z., and Myneni, R.: Recent trends and drivers of regional sources and sinks of carbon dioxide, Biogeosciences, 12, 653–679, https://doi.org/10.5194/bg-12-653-2015, 2015. a

Somogyi, Z., Teobaldelli, M., Federici, S., Matteucci, G., Pagliari, V., Grassi, G., and Seufert, G.: Allometric biomass and carbon factors database, iForest, 1, 107–113, https://doi.org/10.3832/ifor0463-0010107, 2008. a, b

Stone, M.: Cross-Validatory Choice and Assessment of Statistical Predictions, J. R. Stat. Soc. B, 36, 111–133, https://doi.org/10.1111/j.2517-6161.1974.tb00994.x, 1974. a

Toth, G., Jones, A., and Montanarella, L.: LUCAS Topsoil Survey. Methodology, Data and Results, Publications Office of the European Union, Luxembourg (Luxembourg), ISBN 978-92-79-32542-7 (online), https://doi.org/10.2788/97922, 2013. a

Verburg, P. H., Neumann, K., and Nol, L.: Challenges in using land use and land cover data for global change studies, Glob. Change Biol., 17, 974–989, https://doi.org/10.1111/j.1365-2486.2010.02307.x, 2011. a, b

Wang, X., Lin, D., Zhao, L., and Michalet, R.: The Relative Importance of Coarse-Scale Climate and Fine-Scale Nitrogen Availability Contrasts in Driving Home-Field Advantage Effects in Litter Decomposition, Ecosystems, 26, 1456–1467, https://doi.org/10.1007/s10021-023-00844-2, 2023. a

Warm Winter 2020 Team and ICOS Ecosystem Thematic Centre: Warm Winter 2020 ecosystem eddy covariance flux product for 73 stations in FLUXNET-Archive format – release 2022-1 (Version 1.0), ICOS Carbon Portal [data set], https://doi.org/10.18160/2G60-ZHAK, 2022. a

Watson, R. T., Noble, I. R., Bolin, B., Ravindranath, N. H., Verardo, D. J., and Dokken, D. J. (Eds.): Land Use, Land-Use Change, and Forestry. A Special Report of the Intergovernmental Panel on Climate Change, Cambridge University Press, The Edinburgh Building Shaftesbury Road, Cambridge CB2 2RU ENGLAND, ISBN 92-9169-114-3, https://www.ipcc.ch/report/land-use-land-use-change-and-forestry/ (last access: 13 October 2023), 2007. a

Wu, Z., Hugelius, G., Luo, Y., Smith, B., Xia, J., Fensholt, R., Lehsten, V., and Ahlström, A.: Approaching the potential of model-data comparisons of global land carbon storage, Scientific Reports, 9, 3367, https://doi.org/10.1038/s41598-019-38976-y, 2019. a, b

Download
Short summary
The study assesses the performance of the dynamic global vegetation model (DGVM) ORCHIDEE in capturing the impact of land-use change on carbon stocks across Europe. Comparisons with observations reveal that the model accurately represents carbon fluxes and stocks. Despite the underestimations in certain land-use conversions, the model describes general trends in soil carbon response to land-use change, aligning with the site observations.