<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-19-4547-2026</article-id><title-group><article-title>A hybrid framework for the spin-up and initialization of distributed coupled ecohydrological-biogeochemical models</article-title><alt-title>Spin-up and initialization of distributed soil biogeochemical models</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Lian</surname><given-names>Taiqi</given-names></name>
          
        <ext-link>https://orcid.org/0009-0001-9658-4375</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Zhang</surname><given-names>Ziyan</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-1497-0510</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Paschalis</surname><given-names>Athanasios</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Bonetti</surname><given-names>Sara</given-names></name>
          <email>sara.bonetti@epfl.ch</email>
        <ext-link>https://orcid.org/0000-0001-8856-3438</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Laboratory of Catchment Hydrology and Geomorphology, École Polytechnique Fédérale de Lausanne (EPFL),  1951 Sion, Switzerland</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Civil and Environmental Engineering, Imperial College London, London, UK</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Department of Civil and Environmental Engineering, University of Cyprus, Nicosia, Cyprus</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Sara Bonetti (sara.bonetti@epfl.ch)</corresp></author-notes><pub-date><day>27</day><month>May</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>10</issue>
      <fpage>4547</fpage><lpage>4565</lpage>
      <history>
        <date date-type="received"><day>30</day><month>September</month><year>2025</year></date>
           <date date-type="rev-request"><day>16</day><month>December</month><year>2025</year></date>
           <date date-type="rev-recd"><day>30</day><month>April</month><year>2026</year></date>
           <date date-type="accepted"><day>7</day><month>May</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Taiqi Lian et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026.html">This article is available from https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e124">Accurate initialization is a critical step in fully distributed ecohydrological and soil biogeochemical modeling applications, yet is often hindered by the computational cost of achieving steady-state conditions across large spatial domains. This study presents a novel initialization framework that combines a flux-tracking 1D spin-up with a random forest (RF) algorithm to efficiently generate spatially heterogeneous and topography-informed initial conditions accounting for lateral fluxes of water, carbon, and nutrients. The framework first performs a limited number of 1D simulations to obtain steady-state conditions in a subset of representative cells, then uses RF to extrapolate these results across the catchment. Applied to T&amp;C-BG-2D, a fully coupled distributed ecohydrological-soil biogeochemical model, the scheme reconstructs over 90 % of the spatial variability in soil carbon and nutrient patterns in terms of probability distribution similarity while reducing computational demands by up to 90 % compared to a fully distributed spin-up procedure. A sensitivity analysis across multiple simulation scenarios reveals that the number of tracked cells required, varying from 20 % to 40 % of total domain grid cells, depends on the catchment's spatial complexity and the environmental covariates embedded in the RF predictors. The framework developed here can be applied to other spatially distributed models that explicitly account for lateral transfer fluxes, enabling large-scale distributed ecohydrological-biogeochemical model initialization under constrained computational budgets.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e136">Process-based ecohydrological and biogeochemical models have become widely used tools to study the cycling of water, carbon, and nutrients, and to predict ecosystem responses to anthropogenic and climatic changes. However, initialization remains a critical challenge in ecohydrological modeling <xref ref-type="bibr" rid="bib1.bibx67 bib1.bibx17" id="paren.1"/>, as initial conditions strongly influence model outputs and can substantially reduce the model’s ability to predict short-term responses to climatic variability <xref ref-type="bibr" rid="bib1.bibx54" id="paren.2"><named-content content-type="pre">e.g.,</named-content></xref>. This issue becomes even more critical for models including soil biogeochemical components and aiming to simulate the coupled dynamics of water, vegetation, and soil biogeochemistry <xref ref-type="bibr" rid="bib1.bibx67 bib1.bibx11 bib1.bibx24" id="paren.3"/>. A common practice to specify initial conditions for model simulations is to run a model for a chosen time period until equilibrium (dynamic or steady state) is reached <xref ref-type="bibr" rid="bib1.bibx67 bib1.bibx53" id="paren.4"/>. The duration of this spin-up process largely depends on the chosen initial state, the model parameters, and the prescribed forcings <xref ref-type="bibr" rid="bib1.bibx66 bib1.bibx67 bib1.bibx55" id="paren.5"/>. For hydrological models, this stabilization period typically spans days to several years, depending on the variable of interest <xref ref-type="bibr" rid="bib1.bibx41 bib1.bibx53 bib1.bibx19 bib1.bibx2" id="paren.6"><named-content content-type="pre">e.g.,</named-content></xref>. In contrast, soil biogeochemical models often require much longer spin-up periods, sometimes up to thousands of years <xref ref-type="bibr" rid="bib1.bibx50 bib1.bibx49 bib1.bibx62" id="paren.7"/>, to stabilize soil carbon and nutrient pools and avoid unrealistic behavior caused by disequilibrium in biogeochemical fluxes <xref ref-type="bibr" rid="bib1.bibx67 bib1.bibx27 bib1.bibx1 bib1.bibx38" id="paren.8"/>.</p>
      <p id="d2e168">To reduce the computational burden of long spin-up periods, various strategies have been proposed for plot- or ecosystem-scale models. Analytical solutions under simplified forcing conditions can be used for some terrestrial biosphere models <xref ref-type="bibr" rid="bib1.bibx75 bib1.bibx70 bib1.bibx49 bib1.bibx52" id="paren.9"/>. <xref ref-type="bibr" rid="bib1.bibx67" id="text.10"/> introduced several acceleration techniques in the Biome-BGC model, including accelerated decomposition, nitrogen addition, and multivariate minimization. These approaches reduced spin-up time by up to 75 %. <xref ref-type="bibr" rid="bib1.bibx17" id="text.11"/> developed a “slow-relaxation scaling” method to gradually adjust soil carbon and nitrogen pools, improving the stability and realism of spin-up in the CENTURY model. Other approaches include performing spin-up of specific sub-modules only <xref ref-type="bibr" rid="bib1.bibx20" id="paren.12"/>, or integrating spin-up procedures with data assimilation techniques to reduce computational demands across multiple sites <xref ref-type="bibr" rid="bib1.bibx39" id="paren.13"/>. Table S1 in the Supplement presents a non-exhaustive list of spin-up methods and their applications across coupled ecohydrological-soil biogeochemical models applied at plot and ecosystem scales.</p>
      <p id="d2e186">Model initialization is even more challenging in spatially distributed coupled ecohydrological-biogeochemical models <xref ref-type="bibr" rid="bib1.bibx64 bib1.bibx56 bib1.bibx24" id="paren.14"><named-content content-type="pre">e.g.,</named-content></xref>, which require information on distinct soil carbon and nutrient pools, whose spatial distributions are largely modulated by a combination of climatic, pedogenic, topographic, and biological landscape attributes <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx24 bib1.bibx43" id="paren.15"/>. Due to the scarcity of spatially explicit data on soil biogeochemical pools, most models apply uniform or averaged initial conditions, followed by a spin-up period (typically a few years) to stabilize the system <xref ref-type="bibr" rid="bib1.bibx64 bib1.bibx76 bib1.bibx48" id="paren.16"><named-content content-type="pre">e.g.,</named-content></xref>. In the most extreme case, <xref ref-type="bibr" rid="bib1.bibx7" id="text.17"/> applied a spatially distributed spin-up of more than 2500 years to reach steady state of soil carbon and nitrogen pools. However, prolonged spin-up periods can become computationally prohibitive when applied to thousands of grid cells. Moreover, existing initialization techniques, often designed for plot-scale models or coarser spatial resolutions, are insufficient for spatially distributed systems, as they neglect the effects of lateral transfers of water, carbon, and nutrients among computational cells in evaluating the long-term steady state.</p>
      <p id="d2e205">Recent advances in artificial intelligence and machine learning (ML) have provided new opportunities for environmental modeling, including applications in hydrology, ecology, and soil science <xref ref-type="bibr" rid="bib1.bibx47 bib1.bibx29 bib1.bibx68 bib1.bibx5 bib1.bibx15 bib1.bibx63 bib1.bibx45" id="paren.18"><named-content content-type="pre">e.g.,</named-content></xref>. In spatially distributed models with limited data availability, ML methods, such as random forest (RF) approaches, offer a practical approach to estimate initial conditions. RF approaches have been successfully applied in hydrology and soil science (e.g., for landcover classification, soil moisture estimation, and mapping of soil hydraulic properties). These successes highlight their ability to extrapolate spatial distributions of soil carbon and nutrient pools using environmental covariates, based on a limited number of locations with known steady-state conditions <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx43 bib1.bibx74" id="paren.19"/>. In this context, if steady-state conditions are known for a subset of grid cells, RF can be used to infer the spatial distribution of biogeochemical pools across the entire catchment. However, a workflow for the effective use of RF approaches to initialize coupled, spatially-distributed ecohydrological-biogeochemical models is currently not available.</p>
      <p id="d2e217">The goal of this study is to develop a generalizable and computationally efficient methodology to initialize spatially distributed ecohydrological–biogeochemical models in complex catchments. Achieving this goal requires addressing three main challenges: (i) the high computational demand of fully coupled spin-up procedures for soil carbon and nutrient pools, (ii) the even greater computational costs of applying such spin-up in distributed 2D simulations, and (iii) the need for a strategy that can be generalized across catchments with different levels of topographic complexity and soil and vegetation variability. To overcome these limitations, we propose a novel initialization scheme that combines a plot-scale flux-tracking spin-up with a RF algorithm. The proposed spin-up procedure captures lateral transfers of water, carbon, and nutrients, while the RF model uses well-established key controlling factors (i.e., topography, vegetation, and soil information) to reconstruct spatial patterns of soil carbon and nutrient pools. We implement this framework in the recently developed fully distributed coupled ecohydrological-soil biogeochemical model T&amp;C-BG-2D <xref ref-type="bibr" rid="bib1.bibx24" id="paren.20"/> and test the following hypotheses: <list list-type="bullet"><list-item>
      <p id="d2e225">(H1) Spatially explicit spin-up is required to capture spatial variability in soil carbon and nutrient pools at the catchment scale, where grid-cell hydrological connectivity and topography-driven heterogeneity in water and energy states concurrently act to shape soil biogeochemical spatial patterns;</p></list-item><list-item>
      <p id="d2e229">(H2) Performing spin-up with lateral transport only at a subset of representative locations and extrapolating the results statistically with a RF model provides sufficient accuracy;</p></list-item><list-item>
      <p id="d2e233">(H3) The number of representative locations required depends on the overall heterogeneity of the catchment as well as on the specific predictors included in the RF model (e.g., additional soil or vegetation attributes);</p></list-item><list-item>
      <p id="d2e237">(H4) Uncoupled spin-up simulations (soil biogeochemistry dynamics with constant vegetation fluxes) provide adequate steady-state conditions for the model initialization.</p></list-item></list> To our knowledge, this is the first study to address initialization in a catchment-scale, fully distributed coupled ecohydrological-soil biogeochemical model, which mechanistically simulates coupled water, vegetation, and soil biogeochemical dynamics and explicitly accounts for water and solute lateral transport as well as topography-controlled variations in microclimatic conditions. The proposed approach provides a flexible and computationally efficient solution to spatial initialization and can be readily adopted in other modeling frameworks.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Materials and methods</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>T&amp;C-BG-2D model and study area</title>
      <p id="d2e257">In this work, we use the recently developed coupled ecohydrological-soil biogeochemical model, T&amp;C-BG-2D <xref ref-type="bibr" rid="bib1.bibx24" id="paren.21"/>, to evaluate the effect of different initialization schemes on the spatial distribution of soil carbon and nutrient pools. The original T&amp;C model <xref ref-type="bibr" rid="bib1.bibx8 bib1.bibx9" id="paren.22"/> simulates coupled water, energy, and carbon cycles at the land surface. It includes modules for soil moisture dynamics and vegetation processes such as plant phenology, carbon allocation, and tissue turnover <xref ref-type="bibr" rid="bib1.bibx8 bib1.bibx9 bib1.bibx10" id="paren.23"/>. The hydrological component simulates precipitation interception, transpiration, evaporation, infiltration, and snow/ice hydrology. Soil water movement is represented using a quasi-3D Richards equation, while surface water flow uses a kinematic wave approach <xref ref-type="bibr" rid="bib1.bibx44" id="paren.24"/>. A soil biogeochemistry component, T&amp;C-BG <xref ref-type="bibr" rid="bib1.bibx11" id="paren.25"/>, was integrated into the T&amp;C model to simulate soil carbon and nutrient dynamics, including nitrogen (N), phosphorus (P), and potassium (K). Vegetation growth is regulated by the availability of these nutrients, as plant tissue formation depends on flexible target stoichiometric constraints between carbon and nutrients, and nutrient limitation can constrain biomass production. The biogeochemistry module partitions plant litter into pools based on decomposability, and represents soil organic carbon (SOC) in three functional forms (mineral-associated, particulate, and dissolved). Soil organic nitrogen (SON) dynamics are tightly coupled to carbon through stoichiometric C : N ratios. The model simulates the temporal dynamics of SON, dissolved organic nitrogen (DON), and nitrogen stored in microbial and macrofauna biomass. It also explicitly tracks inorganic nitrogen pools – ammonium (NH<inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>) and nitrate (NO<inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>) – whose dynamics are regulated by mineralization, immobilization, biological uptake, leaching, volatilization, nitrification, and denitrification. A comprehensive description of T&amp;C-BG is available in <xref ref-type="bibr" rid="bib1.bibx11" id="text.26"/> and <xref ref-type="bibr" rid="bib1.bibx3" id="text.27"/>. Recently, <xref ref-type="bibr" rid="bib1.bibx24" id="text.28"/> extended T&amp;C-BG to a distributed framework (T&amp;C-BG-2D) by explicitly accounting for the lateral transport of seven mobile carbon and nutrient pools, including DOC, NH<inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, dissolved inorganic phosphorus (DIP), dissolved potassium ions (K<sup>+</sup>), DON, and dissolved organic phosphorus (DOP), considered as passive tracers transported following the water flow. Tracers are assumed to be well-mixed in each storage compartment after vertical/lateral transport. At each hourly time step, the model updates soil carbon and nutrient pools in each grid cell by accounting for laterally transported solutes, which in turn regulate local vegetation dynamics. This enables the simulation of coupled ecohydrological-soil biogeochemical dynamics with the explicit consideration of the role of topographic features and lateral transport in shaping spatio-temporal carbon and nutrient patterns across a landscape. For clarity, Table <xref ref-type="table" rid="T1"/> summarizes the different models referred to in this study. The proposed hybrid initialization technique is implemented in the coupled ecohydrological–biogeochemical T&amp;C-BG-2D model.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e348">Acronyms, full names, spatial scales, and brief description with key references of the models mentioned in this study. Note that the proposed hybrid initialization technique is for the fully distributed coupled ecohydrological-soil biogeochemical T&amp;C-BG-2D model.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="2.5cm"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="2.5cm"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="5.5cm"/>
     <oasis:colspec colnum="5" colname="col5" align="justify" colwidth="3cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Acronyms</oasis:entry>
         <oasis:entry colname="col2" align="left">Model full name</oasis:entry>
         <oasis:entry colname="col3" align="left">Scale</oasis:entry>
         <oasis:entry colname="col4" align="left">Description</oasis:entry>
         <oasis:entry colname="col5" align="left">Key references</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">T&amp;C</oasis:entry>
         <oasis:entry colname="col2" align="left">Tethys-Chloris</oasis:entry>
         <oasis:entry colname="col3" align="left">Plot and catchment scale</oasis:entry>
         <oasis:entry colname="col4" align="left">Mechanistic ecohydrological model that simulates coupled dynamics of  energy, water and vegetation at the  land surface</oasis:entry>
         <oasis:entry colname="col5" align="left">
                      <xref ref-type="bibr" rid="bib1.bibx8 bib1.bibx9" id="text.29"/>
                    </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">T&amp;C-BG</oasis:entry>
         <oasis:entry colname="col2" align="left">Tethys-Chloris-Biogeochemistry</oasis:entry>
         <oasis:entry colname="col3" align="left">Plot scale</oasis:entry>
         <oasis:entry colname="col4" align="left">Extension of T&amp;C to include modules simulating soil biogeochemistry and plant nutrient dynamics</oasis:entry>
         <oasis:entry colname="col5" align="left">
                      <xref ref-type="bibr" rid="bib1.bibx11" id="text.30"/>
                    </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">T&amp;C-BG-2D</oasis:entry>
         <oasis:entry colname="col2" align="left">Tethys-Chloris-Biogeochemistry-2 Dimensional</oasis:entry>
         <oasis:entry colname="col3" align="left">Catchment scale</oasis:entry>
         <oasis:entry colname="col4" align="left">Extension of T&amp;C-BG that considers lateral transport of carbon and nutrients</oasis:entry>
         <oasis:entry colname="col5" align="left">
                      <xref ref-type="bibr" rid="bib1.bibx24" id="text.31"/>
                    </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">RF</oasis:entry>
         <oasis:entry colname="col2" align="left">Random Forest</oasis:entry>
         <oasis:entry colname="col3" align="left">–</oasis:entry>
         <oasis:entry colname="col4" align="left">Machine learning algorithm used here to extrapolate spatially heterogeneous initial conditions from flux-tracking simulations</oasis:entry>
         <oasis:entry colname="col5" align="left">
                      <xref ref-type="bibr" rid="bib1.bibx4" id="text.32"/>
                    </oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e475">The domain used to test and showcase the spin-up procedure in this study is the Erlenbach catchment (Fig. <xref ref-type="fig" rid="F1"/>a), a 0.76 km<sup>2</sup> basin in the central Swiss Pre-Alps <xref ref-type="bibr" rid="bib1.bibx60" id="paren.33"/> where T&amp;C-BG-2D has already been applied <xref ref-type="bibr" rid="bib1.bibx24" id="paren.34"/>. We considered a spatial square grid resolution of 20 m. Based on a previous application of T&amp;C-BG-2D in this catchment <xref ref-type="bibr" rid="bib1.bibx24" id="paren.35"/>, we used meteorological data to force the model covering a 9-year period (1 October 2011 to 30 September 2020). During this time, the catchment received an average annual precipitation of 2200 mm, with a mean air temperature of 6.5 °C <xref ref-type="bibr" rid="bib1.bibx24" id="paren.36"/>. The spatial distribution of soil texture (here mostly loam and clay loam) was retrieved from <xref ref-type="bibr" rid="bib1.bibx16" id="text.37"/>, and vegetation cover was obtained from <xref ref-type="bibr" rid="bib1.bibx46" id="text.38"/>. As only one meteorological station (Erlenhöhe) is available in Erlenbach, constant lapse rate values typical of the region were assumed. For example, we considered a constant thermal lapse rate equal to <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.0045</mml:mn></mml:mrow></mml:math></inline-formula> °C m<sup>−1</sup> <xref ref-type="bibr" rid="bib1.bibx40 bib1.bibx24" id="paren.39"/>. The Erlenbach catchment is a suitable study area to showcase the spin-up and initialization procedure proposed here since model parameters have already been calibrated and the area is relatively small (thus reducing the computational costs) but still with a highly pronounced topography. Specific model settings and parameters for the simulations here can be found in <xref ref-type="bibr" rid="bib1.bibx24" id="text.40"/>.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e540">The Erlenbach catchment <bold>(a)</bold> and the modified catchment configurations <bold>(b–d)</bold> used to test the proposed initialization procedure. <bold>(a)</bold> Original catchment characteristics showing topography, stream network, vegetation cover, and sand and clay fractions. <bold>(b)</bold> Alternative vegetation scenarios: randomized grassland and forest cover (left), and homogeneous grassland (right). <bold>(c)</bold> Soil texture scenarios, represented in a sand–clay triangle: original soil texture (orange), randomized soil texture distribution (gray), and homogeneous soil texture (red). <bold>(d)</bold> Topographic scenarios generated by scaling the original digital elevation model by factors of 0.5 and 0.2.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f01.jpg"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Initialization procedure</title>
      <p id="d2e576">The core strategy for the initialization framework proposed here involves two main steps: (1) performing a plot-scale spin-up in a limited number of selected cells while explicitly accounting for lateral fluxes of water, carbon, and nutrients, and (2) using a RF model to extrapolate the resulting steady-state pools across the entire model domain. While the RF is not used here to predict an independent external reference state, it enables spatial generalization from the sampled tracked cells to the remaining ones across the catchment. A primitive version of this procedure was successfully applied in <xref ref-type="bibr" rid="bib1.bibx24" id="text.41"/>. However, no formal workflow was developed, and neither large-scale benchmarking nor a systematic sensitivity assessment of the method was conducted. In this study, we formalize the framework, evaluate its performance across multiple scenarios, and test its generality to enable seamless application in other models and study areas.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e584">Workflow of the model initialization procedure. <bold>(a)</bold> For each soil–vegetation combination, the T&amp;C model is used to obtain averaged water and plant fluxes, which are used in a 1D spin-up via the T&amp;C-BG biogeochemistry module using long-term average meteorological inputs. The resulting steady-state pools are then assigned to all cells with the same soil–vegetation combination <bold>(b)</bold>. <bold>(c)</bold> A 2D simulation (T&amp;C-BG-2D) is run with these initial conditions, tracking fluxes for all cells (benchmark) or for a given percentage <inline-formula><mml:math id="M9" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> of all grid cells. <bold>(d)</bold> For these cells, the T&amp;C-BG biogeochemistry module 1D spin-up with dynamic meteorology and imposed lateral fluxes is run to achieve a dynamic steady state in all tracked cells. <bold>(e)</bold> A random forest (RF) model is trained on results from different percentages (<inline-formula><mml:math id="M10" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>) of the subset of these cells using topographic, soil, and vegetation attributes as predictors to reconstruct the initial conditions for the entire catchment domain <bold>(f)</bold>, which are then used to initialize the final simulations <bold>(g)</bold>. For each simulation scenario, steps <bold>(e)</bold> to <bold>(g)</bold> are repeated for different percentages of tracked cells <inline-formula><mml:math id="M11" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> to investigate the optimal number of cells for the RF algorithm. <bold>(h)</bold> Additional plot-scale simulations are performed for 10 representative cells (5 forest, 5 grassland) to obtain a reference steady state using the fully coupled T&amp;C-BG model (i.e., also considering dynamic vegetation fluxes), which is compared to the initial states generated in step <bold>(a)</bold>. The solid arrows represent steps needed for the benchmark simulation, the dashed arrows represent the alternative machine learning extrapolation (blue) and the reference check (gray). See Table <xref ref-type="table" rid="T1"/> for details on models used in the spin-up procedure and their acronyms.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f02.png"/>

        </fig>

      <p id="d2e651">The scheme follows several steps, as shown in Fig. <xref ref-type="fig" rid="F2"/>. First, a plot-scale simulation using the T&amp;C model is run for each soil–vegetation combination to obtain climatologically-driven, nutrient-unstressed water and vegetation fluxes (dashed box in Fig. <xref ref-type="fig" rid="F2"/>a). Ideally, one should perform this step for each major vegetation and soil type combination. For the application here, given the limited variability in soil texture across the catchment, an averaged soil texture was considered. Then, we run the T&amp;C-BG biogeochemistry module to reach steady state (Fig. <xref ref-type="fig" rid="F2"/>a) using the long-term average meteorological inputs and averaged water and vegetation fluxes (from T&amp;C). The resulting steady-state carbon and nutrient pools are then uniformly assigned to all cells with the same land cover type, resulting in spatially homogeneous initial conditions for each soil–vegetation combination (see Fig. <xref ref-type="fig" rid="F2"/>b for an example in the Erlenbach catchment). Next, a 2D simulation is performed using the initial conditions obtained from step (b) (Fig. <xref ref-type="fig" rid="F2"/>c). In this simulation, lateral biogeochemical fluxes are tracked (i.e., lateral fluxes of DOC, NH<inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, DIP, K<sup>+</sup>, DON, and DOP) across all grid cells in the domain. Note that, in this step, available spatially variable information on soil texture <xref ref-type="bibr" rid="bib1.bibx16" id="paren.42"/> is used. The soil biogeochemistry module is then run using dynamic meteorological forcing (here repeating the 9 years of data available), incorporating the tracked lateral exchanges, until a dynamic steady state is reached (Fig. <xref ref-type="fig" rid="F2"/>d). The flux-tracking step (c) captures the influence of topography, although we should note that soil carbon and nutrient pools respond slowly to spatial gradients and are unlikely to fully equilibrate within a short simulation period. In the 2D simulation in Fig. <xref ref-type="fig" rid="F2"/>c, the entire available forcing period should be used to capture representative vegetation dynamics and to track biogeochemical fluxes as accurately as possible. In this case, the 9-year data were used, but a longer simulation would provide a more robust basis for the subsequent spin-up of the biogeochemical pools. A sensitivity analysis comparing the dynamic steady state of the biogeochemical pools using different lengths of the fluxes tracking period is presented in Fig. S1 in the Supplement. Using only 1 year for the 2D simulation in step (c) introduced a bias of 3 % in the steady-state SOC in grassland areas in step (d) compared to using all 9 available years.</p>
      <p id="d2e706">The case in which lateral fluxes are tracked in <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> % of the cells is considered here as a benchmark. In this case, the dynamic steady-state conditions from all grid cells are used to initialize the subsequent 9-year simulation (from Fig. <xref ref-type="fig" rid="F2"/>d–f). This approach provides the highest accuracy, as each cell undergoes its own spin-up process while accounting for lateral fluxes and topographic variability. However, such a comprehensive spin-up is computationally demanding and should not be repeated for every new case study, as it would undermine the purpose of an efficient initialization protocol. Our objective is therefore to identify the minimum fraction of tracked cells <inline-formula><mml:math id="M16" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> needed to approximate this benchmark with acceptable accuracy. For this purpose, a RF model <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx26" id="paren.43"/> is trained using a subset of tracked cells <inline-formula><mml:math id="M17" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (equal to 10 %, 20 %, 40 %, 60 %, or 80 %) to predict the spatial distribution of soil carbon and nutrient pools across the domain (Fig. <xref ref-type="fig" rid="F2"/>e). The covariates used as predictors include topographic features (elevation, slope, drainage area, and profile curvature), soil properties (sand content; clay content is additionally included in the random soil texture scenario described in the following section), and vegetation type (forest or grassland). In most RF models here, vegetation type was represented by integer class labels and used directly as a predictor. For vegetation-specific pools, such as above-ground woody carbon, separate RF models were trained for grassland and forest cells. A stratified random sampling approach based on vegetation classes was used for pixel selection, meaning that grid cells were first grouped according to vegetation type (in this case grassland and forest) and, within each group, pixels were randomly sampled without replacement according to a prescribed sampling fraction (10 %, 20 %, 40 %, 60 %, and 80 %). The steady-state pools predicted by the RF (from Fig. <xref ref-type="fig" rid="F2"/>e to f) are then used to initialize the final simulation over the period of interest (here 9 years, Fig. <xref ref-type="fig" rid="F2"/>g). Because the focus of this study is on robustness across different sampling fractions rather than on optimal sampling design, we adopted a stratified random sampling approach based on vegetation types. Given that the distributions of topographic, soil, and vegetation characteristics across the full domain are known in advance and do not exhibit pronounced extreme tails, this approach leads to training subsets that largely preserve these distributions and thus ensure representativeness (Fig. S2). In cases where predictors exhibit strong spatial heterogeneity or patchiness, more structured hierarchical sampling across multiple predictor bins could be implemented to ensure a balanced representation of environmental gradients.</p>
      <p id="d2e747">The performance of different initialization schemes (i.e., without RF (<inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> %) and with <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> %) is assessed by comparing the spatial distributions of SOC, DOC, and SON with the benchmark case (<inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> %). First, simulations without RF or dynamic lateral fluxes (the direct output of step (c) in Fig. <xref ref-type="fig" rid="F2"/>) are compared with the benchmark to address hypothesis H1, testing the need for spatially-informed spin-up to capture carbon and nutrient spatial variability. Second, RF-initialized simulations are compared with the benchmark to address H2, thus evaluating whether extrapolation from a subset of representative cells can reasonably approximate the benchmark.</p>
      <p id="d2e788">Random forest models consisting of 100 regression trees were trained to predict each carbon and nitrogen pool. This was implemented using the TreeBagger function in MATLAB R2024b (Statistics and Machine Learning Toolbox). Default settings were used for tree growth and, at each split, a random subset of predictors (one-third of the total predictors) was considered. The robustness of the RF model was validated using <inline-formula><mml:math id="M21" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-fold (<inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>) cross-validation <xref ref-type="bibr" rid="bib1.bibx61" id="paren.44"/>, and by evaluating the mean absolute percentage error (MAPE) and the normalized root mean squared error (NRMSE). NRMSE was calculated by normalizing RMSE by the range (maximum minus minimum) of the observed values, and both metrics were averaged across the 10 folds (see Table S2).</p>
      <p id="d2e813">A sensitivity analysis was also performed to assess the relative importance of individual predictors in the RF algorithm. The six predictors considered were elevation, slope, flow accumulation area, profile curvature, percentage of sand, and vegetation type. Each predictor was individually removed from the RF training process using <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % tracked cells under the original simulation scenario (see other simulation scenarios in Sect. <xref ref-type="sec" rid="Ch1.S2.SS3"/>). For each reduced predictor set, a new RF model was trained, and its ability to reconstruct the spatial patterns of carbon and nutrient pools was compared against the full model (with all six covariates). This analysis provides insight into the relative influence of individual predictors and helps  assess how predictor choice may affect the sampling requirements (hypothesis H3). The primary purpose of this analysis was pragmatic: to identify which predictors are essential for reproducing spatial variability, and which could be omitted when reliable spatial data are unavailable. Besides, for the original scenario and random soil scenario (see simulation scenarios in Sect. 2.3) with <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % tracked cells, we further implemented a SHAP-based interpretability analysis (Shapley values) <xref ref-type="bibr" rid="bib1.bibx28" id="paren.45"/> to quantify the importance of each predictor in the RF model.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Design and analysis of different catchment configurations</title>
      <p id="d2e853">In addition to the original simulation scenario (hereafter <italic>Original</italic>), a set of modified simulation setups was designed to evaluate the generality of the proposed initialization method, assess its performance, and evaluate variations in the required number of tracked cells and predictors under different levels of landscape complexity. Each simulation scenario introduces a targeted modification to one aspect of the catchment configuration, while keeping all other properties unchanged: <list list-type="bullet"><list-item>
      <p id="d2e861">Vegetation cover (Fig. <xref ref-type="fig" rid="F1"/>b): the existing vegetation is replaced either with a randomized spatial distribution while maintaining the original forest-to-grassland ratio (<italic>Random Veg</italic>), or with a uniform grassland cover across the entire domain (<italic>Homog. Veg</italic>).</p></list-item><list-item>
      <p id="d2e873">Soil texture (Fig. <xref ref-type="fig" rid="F1"/>c): the original soil texture map is replaced either with a randomized and extended spatial distribution (<italic>Random Soil</italic>, gray points in Fig. <xref ref-type="fig" rid="F1"/>c) or with a uniform soil texture across the domain (<italic>Homog. Soil</italic>, red point in Fig. <xref ref-type="fig" rid="F1"/>c). Note that the <italic>Random Soil</italic> scenario introduces a much wider soil texture predictor space.</p></list-item><list-item>
      <p id="d2e893">Topography (Fig. <xref ref-type="fig" rid="F1"/>d): this is modified by reducing the elevation of the highest point by factors of 0.5 and 0.2 (0.5 <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and 0.2 <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), while keeping the lowest elevation fixed and adjusting all intermediate elevations proportionally. As T&amp;C-BG-2D uses spatially distributed meteorological inputs derived from a fixed station and lapse rates, a flatter topography necessarily results in more spatially uniform meteorological forcings in these simulation scenarios. Note that the elevation of the meteorological station was also rescaled in these two simulation scenarios.</p></list-item></list></p>
      <p id="d2e920">In all cases, only one factor is varied per simulation scenario (i.e., vegetation, soil, or topography), while the remaining inputs are kept consistent with the <italic>Original</italic> simulation setup. For each scenario, the full initialization sequence described in Fig. <xref ref-type="fig" rid="F2"/> (steps a–g) is applied, with different RF initialization settings based on the percentage of tracked cells <inline-formula><mml:math id="M27" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (steps e–g). For clarity, simulations are denoted as <inline-formula><mml:math id="M28" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> %, where <inline-formula><mml:math id="M29" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> indicates the percentage of tracked cells used in the RF initialization (<inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, 10, 20, 40, 60, 80, or 100). <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % corresponds to simulations applying initial conditions from the 1D spin-up without RF or dynamic lateral fluxes (the direct output of step (c) in Fig. <xref ref-type="fig" rid="F2"/>), while <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> % represents benchmark simulations in which lateral fluxes in all cells are tracked.</p>
      <p id="d2e988">For the soil texture and topography scenarios, results for <inline-formula><mml:math id="M33" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> equal to 60 % and 80 % are omitted, as a satisfactory performance (i.e., more than 90 % of SOC and SON spatial variability captured) was already achieved with <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % of tracked cells. Note that, for the analysis related to soil texture, in step (a) of the procedure (see Fig. <xref ref-type="fig" rid="F2"/>) the average soil texture is used, while spatial variability in soil textural properties is only taken into account from step (c) onward. The implications of this assumption were investigated with additional dedicated numerical tests and are discussed in Sect. 4. Overall, the design of these scenarios allows us to evaluate how vegetation, soil, and topographic heterogeneity may affect the number of representative locations required for RF initialization (hypothesis H3).</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Steady state with coupled vegetation dynamics</title>
      <p id="d2e1020">In steps (a) and (d) of the initialization workflow (Fig. <xref ref-type="fig" rid="F2"/>), vegetation fluxes are held constant and only the biogeochemistry module is used for spin-up. This decoupling strategy further reduces computational costs, as instead of solving the full system of coupled ordinary differential equations, the model reduces to a single biogeochemical system that can be solved at minimal computational cost. Such an approach is commonly used to approximate near-equilibrium states in terrestrial biosphere models <xref ref-type="bibr" rid="bib1.bibx20" id="paren.46"/>. To evaluate the effect of neglecting vegetation dynamics during spin-up, we selected five cells for each vegetation type (forest and grassland) across a range of topographic conditions. For each selected cell, we compared the resulting steady state from the decoupled spin-up (using only the biogeochemistry module) to the steady state obtained from the fully coupled T&amp;C-BG model, simulating the long-term vegetation-soil biogeochemistry dynamics (Fig. <xref ref-type="fig" rid="F2"/>h). Details on the selected cells are provided in Fig. S2 and Table S3. For the fully coupled T&amp;C-BG model, the repeated 9-year meteorological data were used to force the model, and the simulations were initialized using the soil carbon and nutrient pools from step (b) in Fig. <xref ref-type="fig" rid="F2"/>. For every 9-year simulation, the long-term trend of all simulated soil carbon and nutrient pools was evaluated, using singular spectrum analysis to exclude interannual seasonalities. The steady state was considered reached when the long-term trends in all soil carbon and nutrient pools satisfied at least one of three predefined criteria over the final 9-year simulation period: (i) an absolute linear trend (slope) smaller than 0.1 [gC m<sup>−2</sup> d<sup>−1</sup> or gN m<sup>−2</sup> d<sup>−1</sup>], (ii) an absolute difference between the first and last values of the fitted linear trend smaller than 0.1 [gC m<sup>−2</sup> or gN m<sup>−2</sup>], or (iii) a relative change between the first and last values of the fitted linear trend smaller than 1 %. Note that the slope threshold of 0.1 [gC m<sup>−2</sup> d<sup>−1</sup> or gN m<sup>−2</sup> d<sup>−1</sup>] differs in units from those used in some previous studies <xref ref-type="bibr" rid="bib1.bibx13" id="paren.47"><named-content content-type="pre">e.g., 1 gC m<sup>−2</sup> yr<sup>−1</sup> in</named-content></xref>, reflecting the daily temporal resolution of our model outputs. However, when converted to annual units, the simulated slopes are of comparable magnitude and generally remain within the range of this stricter criterion (see Fig. S3 for an example). Then, the average of the final 3-year simulation period was adopted and compared to results from the uncoupled dynamics using the biogeochemistry module with constant vegetation fluxes. This numerical experiment was specifically designed to address hypothesis H4, testing whether uncoupled spin-up simulations provide adequate steady-state conditions compared to the fully coupled case.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Effect of different initialization settings on the spatial distribution of carbon and nutrient pools</title>
      <p id="d2e1199">Figure <xref ref-type="fig" rid="F3"/> shows the spatial distribution and associated probability density functions (PDFs) of 9-year averaged SOC, DOC, and SON (step (g) in Fig. <xref ref-type="fig" rid="F2"/>) under the <italic>Original</italic> setup. Results are shown for the benchmark simulation (<inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> %), the 2D simulation applying initial conditions from 1D spin-up without RF or dynamic lateral fluxes (<inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> %, direct output of step (c) in Fig. <xref ref-type="fig" rid="F2"/>), and RF-based initializations using different percentages of tracked cells <inline-formula><mml:math id="M49" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> (see also Fig. S4 for additional results). Compared to the <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % simulation, which neglects long-term topographic effects on the initial soil biogeochemical conditions, the benchmark case exhibits greater spatial variability in SOC and SON, highlighting the role of topography in shaping soil carbon and nutrient distributions. In contrast, the spatial distribution of DOC (a small fraction of SOC) remains nearly identical between the benchmark and all initialization settings, regardless of the percentage of tracked cells considered. This is likely because DOC is rapidly transported by water and its spatial variability is largely captured during the 9-year simulation, even without dynamic spin-up or RF-based initialization. Since the spatial performance of DOC is satisfactory even without RF, subsequent analyses and results presented here will focus on SOC and SON only. The PDFs of SOC and SON (Fig. <xref ref-type="fig" rid="F3"/>) further indicate that the <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % simulation underestimates both the spatial variability and median values. This bias arises because the <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % simulation uses steady-state conditions from a single location (the meteorological station), which would require much longer simulation time to reflect the effects of heterogeneous environmental conditions and lateral exchanges (see more details in Sect. <xref ref-type="sec" rid="Ch1.S3.SS3"/>). Note that the bimodality of the PDFs arises from the two different types of vegetation cover in the catchment.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e1286">Spatial distributions of <bold>(a)</bold> soil organic carbon (SOC), <bold>(b)</bold> dissolved organic carbon (DOC), and <bold>(c)</bold> soil organic nitrogen (SON) in the Erlenbach catchment with the <italic>Original</italic> setup under two initialization conditions: with 0 % cell tracking (0 % Cells) and with 100 % cell tracking (100 % Cells). Probability density functions (PDFs) of each variable are shown for simulations using RF initialization with <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> %, 40 %, and 80 %. Each is compared to the corresponding PDFs from the 0 % cells and 100 % cells simulations. The overlap area between PDFs (<inline-formula><mml:math id="M54" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> value) is used to quantify the similarity between the RF-based distributions and the benchmark. Dashed and dotted lines represent median values for grassland and forest areas, respectively.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f03.png"/>

        </fig>

      <p id="d2e1327">Interestingly, RF initialization using just <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % of the cells can reproduce approximately 75 % of the spatial variability in SOC (Fig. <xref ref-type="fig" rid="F3"/>). Model performance improves with increasing percentages of tracked cells, and using <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">80</mml:mn></mml:mrow></mml:math></inline-formula> % yields results that are nearly indistinguishable from the benchmark simulation. A notable performance jump occurs between <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> % and 40 %. For example, in the <italic>Original</italic> simulation scenario, the overlap area (<inline-formula><mml:math id="M58" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> value, quantifying the overlap between the PDFs of each simulation and the benchmark, with a value of <inline-formula><mml:math id="M59" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> close to 1 indicating high similarity between distributions) increases from 0.82 to 0.91 for SOC and from 0.86 to 0.92 for SON (Table <xref ref-type="table" rid="T2"/>). Considering <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula> as a threshold for satisfactory performance yields <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % of tracked cells being sufficient to accurately capture most of the spatial variability in SOC and SON (see Table <xref ref-type="table" rid="T2"/>) for the <italic>Original</italic> Erlenbach simulation scenario. Values of the Jensen–Shannon divergence, used to further compare similarities between PDFs, are reported in Table S4. Note, however, that the optimal percentage of tracked cells may vary depending on catchment characteristics such as vegetation composition, soil texture, topographic complexity, and background climate (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>).</p>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e1424">Overlap area (<inline-formula><mml:math id="M62" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> value) between the probability density functions (PDFs) of soil organic carbon (SOC) and soil organic nitrogen (SON) from different percentages of tracked cells and those from the benchmark simulation across multiple simulation scenarios. Values in parentheses under the <italic>Random Soil</italic> scenario represent results obtained when clay content was included as an additional predictor in the random forest model. Values greater than or equal to 0.90 are shown in bold.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="center"/>
     <oasis:colspec colnum="7" colname="col7" align="center"/>
     <oasis:thead>
       <oasis:row rowsep="1">

         <oasis:entry namest="col1" nameend="col2" align="center">Scenarios/Track cells [%] </oasis:entry>

         <oasis:entry colname="col3">10</oasis:entry>

         <oasis:entry colname="col4">20</oasis:entry>

         <oasis:entry colname="col5">40</oasis:entry>

         <oasis:entry colname="col6">60</oasis:entry>

         <oasis:entry colname="col7">80</oasis:entry>

       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1"><italic>Original</italic></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.75</oasis:entry>

         <oasis:entry colname="col4">0.82</oasis:entry>

         <oasis:entry colname="col5"><bold>0.91</bold></oasis:entry>

         <oasis:entry colname="col6"><bold>0.95</bold></oasis:entry>

         <oasis:entry colname="col7"><bold>0.97</bold></oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.81</oasis:entry>

         <oasis:entry colname="col4">0.86</oasis:entry>

         <oasis:entry colname="col5"><bold>0.92</bold></oasis:entry>

         <oasis:entry colname="col6"><bold>0.96</bold></oasis:entry>

         <oasis:entry colname="col7"><bold>0.98</bold></oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1"><italic>Random Veg</italic></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.80</oasis:entry>

         <oasis:entry colname="col4">0.83</oasis:entry>

         <oasis:entry colname="col5"><bold>0.92</bold></oasis:entry>

         <oasis:entry colname="col6"><bold>0.94</bold></oasis:entry>

         <oasis:entry colname="col7"><bold>0.97</bold></oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.84</oasis:entry>

         <oasis:entry colname="col4">0.87</oasis:entry>

         <oasis:entry colname="col5"><bold>0.92</bold></oasis:entry>

         <oasis:entry colname="col6"><bold>0.96</bold></oasis:entry>

         <oasis:entry colname="col7"><bold>0.98</bold></oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1"><italic>Homog. Veg</italic></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.83</oasis:entry>

         <oasis:entry colname="col4"><bold>0.91</bold></oasis:entry>

         <oasis:entry colname="col5"><bold>0.94</bold></oasis:entry>

         <oasis:entry colname="col6"><bold>0.97</bold></oasis:entry>

         <oasis:entry colname="col7"><bold>0.98</bold></oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.83</oasis:entry>

         <oasis:entry colname="col4"><bold>0.91</bold></oasis:entry>

         <oasis:entry colname="col5"><bold>0.92</bold></oasis:entry>

         <oasis:entry colname="col6"><bold>0.95</bold></oasis:entry>

         <oasis:entry colname="col7"><bold>0.98</bold></oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1"><italic>Random Soil</italic></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.58 (0.85)</oasis:entry>

         <oasis:entry colname="col4">0.68 (<bold>0.90</bold>)</oasis:entry>

         <oasis:entry colname="col5">0.76 (<bold>0.95</bold>)</oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.65 (0.88)</oasis:entry>

         <oasis:entry colname="col4">0.74 (0.89)</oasis:entry>

         <oasis:entry colname="col5">0.80 (<bold>0.96</bold>)</oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1"><italic>Homog. Soil</italic></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.74</oasis:entry>

         <oasis:entry colname="col4">0.85</oasis:entry>

         <oasis:entry colname="col5"><bold>0.92</bold></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.82</oasis:entry>

         <oasis:entry colname="col4"><bold>0.90</bold></oasis:entry>

         <oasis:entry colname="col5"><bold>0.95</bold></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1">0.5 <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.72</oasis:entry>

         <oasis:entry colname="col4">0.82</oasis:entry>

         <oasis:entry colname="col5"><bold>0.91</bold></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.75</oasis:entry>

         <oasis:entry colname="col4">0.85</oasis:entry>

         <oasis:entry colname="col5"><bold>0.91</bold></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1" morerows="1">0.2 <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col2">SOC</oasis:entry>

         <oasis:entry colname="col3">0.77</oasis:entry>

         <oasis:entry colname="col4">0.82</oasis:entry>

         <oasis:entry colname="col5"><bold>0.91</bold></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">SON</oasis:entry>

         <oasis:entry colname="col3">0.83</oasis:entry>

         <oasis:entry colname="col4">0.87</oasis:entry>

         <oasis:entry colname="col5"><bold>0.93</bold></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e1861">Black lines in Fig. <xref ref-type="fig" rid="F4"/> show the temporal evolution of the coefficient of variation (CV, defined as the ratio of standard deviation to the mean) for spatial SOC under different RF initialization settings in forested and grassland areas (corresponding results for SON are provided in Fig. S5). In the <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % case, CV increases steadily over the 9-year simulation, indicating that spatial variability has not reached a steady state. In contrast, simulations initialized using the RF scheme maintain relatively stable CV values throughout the simulation period, with variability reflecting responses to dynamic environmental drivers rather than spin-up imbalance. CV values gradually increase across RF initialization settings with higher tracking percentages (<inline-formula><mml:math id="M66" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>), approaching those of the benchmark. Notably, CV trajectories for RF settings using <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % and 100 % of tracked cells are nearly identical, confirming that <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % is sufficient to reproduce the full temporal dynamics of SOC spatial variability in the catchment here. Table S2 summarizes the robustness of the random forest models trained using 40 % of the tracked cells in the <italic>Original</italic> scenario. For most carbon and nitrogen pools, MAPE values are below 10 % and NRMSE values are below 0.1, indicating that the RF models exhibit satisfactory performance. These results provide direct support for H1, demonstrating that spatially explicit initialization is required to capture spatial variations of SOC and SON, as well as for H2, showing that spin-up at a subset of representative locations combined with RF extrapolation can satisfactorily reproduce benchmark dynamics.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e1915">Temporal evolution of the coefficient of variation (CV) of spatial soil organic carbon (SOC) distribution over a 9-year simulation period in forest (top panels) and grassland (bottom panels) under different initialization settings and simulation scenarios. Each column represents one initialization setting: no cells tracking and no random forest (<inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> %), <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % cells tracking and random forest (10 % Cells), <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % cells tracking and random forest (40 % Cells), and <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> % cells tracking (100 % Cells). Line colors and styles indicate results from different simulation scenarios, as shown in the legend.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f04.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Sensitivity across different catchment configurations</title>
      <p id="d2e1980">Table <xref ref-type="table" rid="T2"/> and Fig. <xref ref-type="fig" rid="F4"/> also report the performance of using different <inline-formula><mml:math id="M73" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> values in the RF model across different catchment configurations. Across all simulation scenarios, <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % can capture over 90 % of the spatial variability in both SOC and SON. An exception is observed in the <italic>Homog. Veg</italic> simulation scenario, where even <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> % of tracked cells yields satisfactory performance.</p>
      <p id="d2e2022">A comparison among <italic>Original</italic>, <italic>Random Veg</italic>, and <italic>Homog. Veg</italic> simulations suggests that the number of tracked cells required for satisfactory RF performance is more sensitive to the proportion of vegetation types than to their spatial arrangement. As expected, when vegetation cover is spatially homogeneous, fewer cells are needed to achieve accurate predictions.</p>
      <p id="d2e2034">In contrast, results from the <italic>Original</italic>, <italic>Random Soil</italic>, and <italic>Homog. Soil</italic> simulations indicate that reducing the spatial heterogeneity of soil texture lowers the absolute spatial variability of SOC but does not necessarily reduce the number of tracked cells required for RF initialization. This is likely because soil texture is already included as a predictor in the RF model, and because in the <italic>Original</italic> scenario soil texture is relatively uniform (i.e., sand content: mean <inline-formula><mml:math id="M76" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 41 %, standard deviation <inline-formula><mml:math id="M77" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 2 % based on <xref ref-type="bibr" rid="bib1.bibx16" id="text.48"/>). As a result, the model can capture soil-related variability effectively without the need for additional soil information. However, in the <italic>Random Soil</italic> scenario, where soil texture is highly heterogeneous, adding supplementary predictors such as the percentage of clay significantly improves the model’s ability to reproduce benchmark SOC spatial patterns (Table <xref ref-type="table" rid="T2"/>). This highlights the importance of including sufficient soil information when simulating spatial variability under heterogeneous soil conditions. Notably, the spatial CV is also higher in this scenario relative to the others, reflecting higher spatial variability of SOC. This is due to the expanded soil texture parameter space, especially variations in silt and clay contents, as this fine-particle-associated carbon contributes substantially to SOC <xref ref-type="bibr" rid="bib1.bibx58 bib1.bibx12 bib1.bibx34" id="paren.49"/> (see also Table S5). These findings are consistent with previous studies showing that soil properties are among the most influential predictors in digital mapping of SOC <xref ref-type="bibr" rid="bib1.bibx35 bib1.bibx74 bib1.bibx33" id="paren.50"><named-content content-type="pre">e.g.,</named-content></xref>.</p>
      <p id="d2e2080">Simulation results from the flatter topography scenarios (0.5 <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and 0.2 <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) indicate that a gentler topography does not necessarily lower the number of tracked cells needed for accurate RF initialization. In general, RF models require more samples when predictor distributions are not evenly distributed within their overall range of variation (e.g., when extreme values are underrepresented), in order to adequately represent the edges and tails of the input space <xref ref-type="bibr" rid="bib1.bibx47 bib1.bibx57" id="paren.51"/>. In our sampling design, the distributions of predictor variables were known in advance, and the selected cells were chosen to represent the full range of topographic and land cover features across the domain (Fig. S2). This targeted selection likely contributed to consistent RF performance across topographic conditions. It is also important to note that, in these simulations, the use of flatter topographies only results in reduced elevation differences, slopes, and curvatures, without altering the overall catchment shape, size, and contributing area, as the underlying catchment skeleton remains identical across scenarios. Additionally, the topographies were modified by tilting the surface around the lowest elevation point. While this can affect the average temperature values in the domain (and resulting absolute values of SOC and SON), it has no impact on the number of cells required within the same simulation configuration. Overall, these results highlight that the number of representative locations needed for satisfactory RF initialization is shaped by catchment-specific heterogeneity and predictor distributions, thereby supporting H3.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Bias analysis in relation to topography</title>
      <p id="d2e2116">Figure <xref ref-type="fig" rid="F5"/> presents the relationship between relative SOC bias (<inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi mathvariant="normal">SOC</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">simulation</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">benchmark</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">benchmark</mml:mi></mml:msub><mml:mo>×</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula>) and the topographic wetness index (TWI) from the <italic>Original</italic> simulation scenario with <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> %, 10 %, 20 %, 40 %, together with the PDFs of <inline-formula><mml:math id="M82" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>SOC. Additional results covering all simulation scenarios are provided in Figs. S6 and S7.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e2185">Density maps showing the relationship between relative SOC bias (<inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi mathvariant="normal">SOC</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">simulation</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">benchmark</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">benchmark</mml:mi></mml:msub><mml:mo>×</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula>) and Topographic Wetness Index (TWI) under <italic>Original</italic> simulation scenarios with <inline-formula><mml:math id="M84" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M85" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <bold>(a)</bold> 0 %, <bold>(b)</bold> 10 %, <bold>(c)</bold> 20 %, and <bold>(d)</bold> 40 %. <bold>(e)</bold> Probability density functions (PDFs) of <inline-formula><mml:math id="M86" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>SOC from <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % (in gray), and with RF-based initialization using <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % (red dashed line), 20 % (yellow dashed line), and 40 % (green) of tracked cells.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f05.png"/>

        </fig>

      <p id="d2e2299">Figure <xref ref-type="fig" rid="F5"/>a shows that the <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % initialization configuration results in a systematic underestimation of SOC in many cells, with deviations reaching up to 20 %–30 % in some cases. This is because the initial pools (Fig. <xref ref-type="fig" rid="F2"/>b) are derived from the 1D spin-up at the Erlenhöhe meteorological station, located at a relatively low elevation (1210 m a.s.l.) within the catchment. Since SOC generally increases with elevation due to temperature reductions <xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx65 bib1.bibx78" id="paren.52"/> – an effect captured by T&amp;C-BG-2D <xref ref-type="bibr" rid="bib1.bibx24" id="paren.53"/> – SOC in high-elevation areas gradually accumulates over time. However, the 9-year <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % simulation is insufficient for these areas to equilibrate, leading to a consistent underestimation (compared to the benchmark simulation, which instead captures the increasing trend of SOC with elevation) due to the lack of spatial information in the model initialization.</p>
      <p id="d2e2338">Interestingly, a positive relationship between <inline-formula><mml:math id="M91" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>SOC and TWI is observed in the range TWI <inline-formula><mml:math id="M92" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 11, where the majority of grid cells are located (Fig. <xref ref-type="fig" rid="F5"/>a). This trend can be explained primarily by the influence of water availability on soil respiration and, consequently, on SOC stocks. We note, however, that TWI is weakly but significantly correlated with elevation, and thus with air temperature (<inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.14</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>). This suggests that temperature may also contribute to the observed relationship, although its influence is relatively minor compared to that of water availability. When the <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % initialization is applied, SOC exhibits almost no trend with TWI (Fig. <xref ref-type="fig" rid="F6"/>a and b). However, the benchmark simulation shows a decreasing SOC trend with increasing TWI for TWI <inline-formula><mml:math id="M96" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 11, reflecting a long-term steady-state pattern shaped by soil respiration processes during spin-up. Simulations with <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % and <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % successfully capture the SOC–TWI relationship, with results from <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % being closer to the benchmark simulation. This relationship is consistent with previous findings showing an increase in soil respiration with water availability due to enhanced microbial activity <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx31 bib1.bibx71" id="paren.54"/>, as shown in Fig. <xref ref-type="fig" rid="F6"/>c and d. However, under excessively wet conditions (such as TWI <inline-formula><mml:math id="M100" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 11), this relationship reverses due to oxygen limitation <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx51" id="paren.55"/>. Figure <xref ref-type="fig" rid="F6"/>c and d further shows that while the <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % case reproduces the correct qualitative relationship between fast carbon fluxes (e.g., soil respiration) and TWI, correct flux magnitudes are only captured when <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % of the cells are tracked, and the performance is further improved for <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> %. Because the SOC distribution in the benchmark simulation shows much higher spatial variability than in the <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % case, the decreasing SOC–TWI trend from the benchmark dominates the <inline-formula><mml:math id="M105" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>SOC–TWI relationship, leading to an apparent positive correlation in Fig. <xref ref-type="fig" rid="F5"/>a. The PDFs in Fig. <xref ref-type="fig" rid="F5"/>e further confirm that the RF initialization scheme substantially reduces this topography-related bias. Using only <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % of the cells for RF training already eliminates most of the systematic bias (Fig. <xref ref-type="fig" rid="F5"/>b), while <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> % (Fig. <xref ref-type="fig" rid="F5"/>c) and <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % (Fig. <xref ref-type="fig" rid="F5"/>d) eliminate it almost entirely. The analysis here clearly highlights that the bias between plot-scale initialization and spatially-informed ones is strongly linked to grid-cell interactions via topography-controlled lateral transport.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e2564">Relationship between the topographic wetness index (TWI) and <bold>(a, b)</bold> soil organic carbon (SOC) and <bold>(c, d)</bold> soil respiration in the <italic>Original</italic> simulation scenario, shown separately for forest (left) and grassland (right). Red points and lines represent results from <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> % simulations; blue squares and lines show results from <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> % simulations. Filled circles indicate the median values within TWI bins of width 1, with vertical bars showing the <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">25</mml:mn></mml:mrow></mml:math></inline-formula> % range within each bin. Dashed red lines and dotted green lines present results from <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % and <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> %, respectively.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>The role of coupled vegetation dynamics in 1D soil biogeochemical spin-up</title>
      <p id="d2e2649">Figure <xref ref-type="fig" rid="F7"/> compares SOC values obtained from different simulation stages and spin-up strategies. The first column represents the initial SOC values (here, values in Fig. <xref ref-type="fig" rid="F2"/>b are used, but can be derived from the steady state in areas with similar land cover, soil, and climatic conditions, or from a reasonable initial guess). The next three columns show SOC values after 9, 18, and 27 years of simulation using the full T&amp;C-BG model (i.e., with coupled soil biogeochemical and vegetation dynamics). These three columns are used to exemplify the gradual convergence of SOC toward steady-state conditions. The gray box in the middle shows the SOC state obtained using the biogeochemistry-only spin-up with 9-year average vegetation fluxes after 1000 years (i.e., no coupled vegetation dynamics – equivalent to step (a) in Fig. <xref ref-type="fig" rid="F2"/>). Starting from this uncoupled spin-up steady state, an additional 27 years of simulation (<inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> years) were performed using the full T&amp;C-BG model, with results shown in the three subsequent central columns. Such trajectories should not be interpreted as generally indicative of the direction toward the final steady state, as SOC and SON may follow non-monotonic pathways in dynamic systems. In the present simulations, however, both SOC and SON evolve toward the long-term coupled reference steady state during the first 27 years. The red box on the far right indicates the reference steady state achieved through long-term simulation with the fully coupled T&amp;C-BG model, which considers the coupled vegetation-soil biogeochemistry dynamics. Each boxplot includes five representative sites with different vegetation types (see Fig. <xref ref-type="fig" rid="F2"/>h). Site-specific vegetation and soil information, along with SOC and SON pool sizes, are provided in Table S3. A corresponding analysis for SON is shown in Fig. S8.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e2674">Evolution of soil organic carbon (SOC) during different initialization schemes and simulation durations at 10 selected sites (Fig. <xref ref-type="fig" rid="F2"/>h). Blue/orange boxplots represent forest/grassland sites. Arrows indicate simulation transitions between stages. The initial values (far left) are followed by <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula>-year simulations using the fully coupled T&amp;C-BG model (solid green arrows). The average vegetation fluxes are extracted from the first 9 years to obtain a steady-state condition with the biogeochemistry-only module (outlined green arrow, middle gray box). This is followed by three additional 9-year simulations. A comprehensive long-term spin-up using the fully coupled T&amp;C-BG model (i.e., considering coupled vegetation and soil biogeochemical dynamics) is shown in the right red box. Striped arrows indicate the potential direction of SOC change if coupled simulations continue beyond the current duration.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/4547/2026/gmd-19-4547-2026-f07.png"/>

        </fig>

      <p id="d2e2697">As shown in Fig. <xref ref-type="fig" rid="F7"/>, the initial SOC values are substantially underestimated in the case here and gradually increase over the course of the simulation. After a 27-year simulation period with the fully coupled T&amp;C-BG model, the resulting SOC is still largely underestimated compared to the long-term steady state (red shading). Reaching steady state with the fully coupled model would require approximately 1000 years. A direct comparison of SOC between the uncoupled spin-up steady state (middle gray box in Fig. <xref ref-type="fig" rid="F7"/>) and the reference steady state from the fully coupled plot-scale spin-up (red shading) reveals an average underestimation of 19.2 % and 10.6 % for grassland and forested sites, respectively (see also Table S3). The underestimation is further reduced for the case of SON (see Fig. S8). This uncoupled spin-up steady state can be reached by first running the fully coupled model for only a short period (here 9 years) to obtain average vegetation fluxes, which are then used to drive the biogeochemistry-only spin-up. This provides a substantial gain in computational efficiency, as the computation time for decoupled spin-up is negligible (Table S6) despite the slight disagreement in steady states, thus supporting the choice of adopting an uncoupled spin-up approach as a first step (Fig. <xref ref-type="fig" rid="F2"/>a) within the proposed initialization framework and confirming H4.</p>
      <p id="d2e2707">In summary, the domain used here contains 1859 cells in total. Tracking lateral fluxes in all cells increases computational cost significantly (e.g., by approximately 50 % compared to a simulation without tracking any fluxes when <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> %), whereas tracking only <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> % of the cells has a less than 10 % impact on the overall spin-up procedure (Table S6). In Erlenbach, SOC requires over 300 years of simulation to reach steady state (Fig. S9) even without coupled vegetation-soil biogeochemical dynamics. The proposed initialization framework reduces this demand by collapsing the full 300-year 2D spin-up into two 9-year simulations (corresponding to Fig. <xref ref-type="fig" rid="F2"/>b and f), combined with a 1D plot-scale spin-up. Ideally, the hybrid spin-up procedure requires two 9-year 2D simulations instead of 300 years (i.e., 18 years in total, corresponding to approximately 6 % of the original simulation length). Wall-clock times of different components of the spin-up procedure are reported in Table S6. While these times are expected to change based on the specific computational platform used, for the case here the hybrid spin-up procedure resulted in a computational saving of approximately 90 % using the recommended <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % configuration.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
      <p id="d2e2757">Overall, our experiments confirm the four hypotheses outlined in the Introduction. Spatially explicit spin-up is necessary to capture spatial variability in soil carbon and nutrient pools (H1), while RF-based extrapolation from a subset of representative cells provides an efficient and accurate alternative to a fully distributed spin-up (H2). The required sampling fraction is not universal (H3) but depends on catchment-specific heterogeneity and the choice of predictors. Finally, an uncoupled biogeochemistry-only spin-up offers a computationally efficient first step (H4), yielding steady states sufficiently close to the fully coupled solution, hence justifying its use in practical applications. Beyond the associated computational savings, this framework incorporates lateral transport between computational cells during spin-up, explicitly accounting for topography-induced lateral water redistribution. In this respect, it differs from more conventional grouped representations of landscape heterogeneity in hydrological and land surface models, such as hydrological response units (HRUs), tiles, or other classes of cells sharing similar characteristics <xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx72 bib1.bibx69" id="paren.56"><named-content content-type="pre">e.g.,</named-content></xref>.</p>
      <p id="d2e2765">The approach proposed here is not limited to T&amp;C-BG-2D but can be readily applied to the initialization of other spatially distributed ecohydrological-soil biogeochemical frameworks. Many such models (e.g., RHESSys, <xref ref-type="bibr" rid="bib1.bibx64" id="altparen.57"/>; Flux-PIHM-BGC, <xref ref-type="bibr" rid="bib1.bibx56" id="altparen.58"/>) couple a hydrological component with a plot-scale vegetation and/or soil biogeochemistry module. In these systems, lateral transport of water and carbon can be tracked and incorporated into the plot-scale spin-up, which can then be combined with a RF algorithm for initialization across the entire catchment.</p>
      <p id="d2e2774">The Erlenbach catchment served here as a testbed to evaluate the effectiveness of the new initialization scheme. For this site, tracking only 40 % of the cells was sufficient for the RF model to reconstruct over 90 % of the spatial variability in key soil biogeochemical variables. However, this proportion (40 %) should not be interpreted as a universal threshold. Our scenario experiments indicate that the optimal percentage is catchment-dependent and shaped by the degree of heterogeneity in vegetation, soils, and topography. For instance, in the <italic>Homog. Veg</italic> scenario even 20 % of tracked cells were sufficient, whereas in the <italic>Random Soil</italic> scenario, additional soil predictors were needed to adequately capture SOC variability. Flatter topographies here did not necessarily reduce the required sample size, because the contributing area remained largely unchanged and soil–vegetation heterogeneity still needed to be sufficiently represented in the sampling. Sensitivity analysis suggests that vegetation type is the most important predictor in the RF model. Taken together, these findings indicate that the sampling fraction required for effective RF initialization depends on specific catchment characteristics. In particular, it is influenced by the level of heterogeneity in the landscape, the unevenness of RF predictor distributions, and the number and type of predictors considered (e.g., inclusion of additional soil information or vegetation attributes).</p>
      <p id="d2e2783">A sensitivity analysis of predictor importance in the RF model indicates that the type of vegetation is the most influential variable (Fig. S10), followed by elevation. This statement is further confirmed by the Shapley values reported in Tables S2 and S3. These findings align with previous studies showing that vegetation strongly influences the spatial distribution of SOC <xref ref-type="bibr" rid="bib1.bibx21 bib1.bibx73" id="paren.59"><named-content content-type="pre">e.g.,</named-content></xref> and that elevation is a dominant control in pronounced topographies such as Erlenbach because of the lapse rate control on air temperature <xref ref-type="bibr" rid="bib1.bibx60 bib1.bibx24" id="paren.60"/>. The relative importance of these predictors may vary in catchments with different climatic and topographic characteristics. Note that soil texture is relatively less important in the sensitivity analysis in Fig. S10. This is likely due to the relatively homogeneous soil texture distribution in Erlenbach. However, the <italic>Random Soil</italic> scenario (Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>) highlight the essential role of including soil information among the RF predictors, particularly in areas characterized by high soil spatial variability. Shapley values in Table S5 show that clay content is the most important predictor (after vegetation type) for soil carbon prediction, suggesting that clay content should be explicitly considered in generalized RF modeling frameworks.</p>
      <p id="d2e2800">While the proposed framework performs robustly in our tests, a few aspects deserve special attention and should be considered for applications in specific cases. In this study, the average soil texture was used in step (a) of the initialization procedure (Fig. <xref ref-type="fig" rid="F2"/>) for the <italic>Random soil</italic> scenario, and the resulting steady-state conditions were assigned to all cells. This simplification may introduce a potential bias, because soil textures can alter the degree of water limitation experienced by vegetation, plant fluxes, and ultimately the initialization outcome. To evaluate this effect, we tested three soil texture settings (sand (60 % sand, 10 % clay), loam (40.5 % sand, 26.5 % clay), and clay (20 % sand, 50 % clay)) in step (a) of Fig. <xref ref-type="fig" rid="F2"/>, and found that the resulting initialization states and spatial distributions are largely unchanged (see Fig. S11). However, this is likely due to the fact that vegetation in Erlenbach does not experience water limitation. Conversely, in drier regions, soil texture could become a major constraint and separate spin-ups for each soil type may be required instead of using an average soil texture. Apart from soil texture, we did not include elevation-based clusters in step (a) of the initialization procedure (Fig. <xref ref-type="fig" rid="F2"/>), which is potentially necessary if the catchment has significant elevation changes or it spans climatic regimes where processes such as permafrost occurrence or soil freezing are relevant. Furthermore, soil–vegetation coupling is inherently site-specific, and the decoupled spin-up here relies on a 9-year average of plant and soil dynamics. While this assumption is reasonable for the Erlenbach site, it may introduce biases in more extreme ecosystems (e.g., nutrient-limited environments, more variable climatic conditions), where long-term average plant-soil biogeochemical dynamics cannot be adequately captured within a 9-year period. We therefore suggest evaluating this assumption by running longer-term simulations for a small subset of grid cells and verifying that the plant-soil dynamics averaged over the chosen period do not introduce systematic biases relative to longer-term simulations. In addition to the sensitivity analysis discussed in the previous paragraphs, we recommend that, when implementing the proposed spin-up procedure in a new case study, the first step should be to assess the distribution of key environmental predictors in the study area (particularly vegetation type and soil texture) and to apply a proper sampling strategy to represent the predictor space. In more heterogeneous catchments, the sampling strategy could be extended by stratifying not only by vegetation type, but also by additional gradients such as elevation and soil type. The adequacy of the sampling can then be assessed by verifying whether the selected cells sufficiently cover the predictor space. It is also essential to evaluate whether the ecosystem is nutrient-limited and whether the meteorological forcing is representative of the long-term climatic conditions. Based on these considerations, a tracked-cell proportion of <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> %–40 % may serve as an initial guideline, with the final choice determined by the trade-off between target accuracy and available computational resources.</p>
      <p id="d2e2824">The initialization scheme proposed here provides steady-state initial conditions. This is indeed a convenient simplification (often adopted in soil biogeochemical modeling), which may not always be realistic, especially over long timescales and in certain contexts (e.g., long-term vegetation colonization, young forest development, post-disturbance recovery after wildfires; <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx18 bib1.bibx37 bib1.bibx77" id="altparen.61"/>). To apply the proposed approach in systems that are not in steady state, a possible strategy is to assume a pre-disturbance steady state, derive the initial conditions from that point, and then iteratively apply the initialization scheme over shorter time intervals. For example, in an area undergoing young forest growth, the initial condition may be approximated from the land use prior to forest establishment. A short-term simulation (e.g., 1 year) can then be used to obtain average plant fluxes. These fluxes can be applied in a longer simulation period (e.g., 5 years), followed by another short-term run to update the fluxes. This process introduces a loose coupling between vegetation dynamics and soil biogeochemistry, allowing the model to gradually adjust toward a consistent state. The simulation–flux update cycle can be repeated until model outputs align with observations. In such cases, only the 1D spin-up component of the initialization scheme requires modification, while the RF algorithm remains applicable without change. However, if only part of the catchment is affected by disturbance, separate RF models may be needed for disturbed and undisturbed regions.</p>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e2839">This study presents a novel initialization scheme for spatially distributed coupled ecohydrological-soil biogeochemical models, integrating a flux-tracking spin-up with a RF algorithm, thus contributing to ongoing efforts in the development and reliable application of fully coupled spatially distributed models. Our results underscore the importance of explicitly accounting for lateral fluxes and spatial heterogeneities in model initialization for reducing systematic biases in modeled patterns of soil carbon and nutrients. The proposed framework achieves a dynamic steady state efficiently and demonstrates robust performance of the RF model, which required tracking lateral fluxes for only a relatively limited number of cells to effectively represent spatial variability in the setup of initial conditions for carbon and nutrient pools. The number of cells required for lateral flux tracking is primarily determined by the spatial complexity of the catchment and the amount of spatial information contained in the predictors used in the RF model. While demonstrated using T&amp;C-BG-2D, the methodology proposed here is broadly applicable to other spatially distributed models, offering an efficient framework to improve the accuracy of distributed soil biogeochemical models by explicitly including spatial information in their spin up and initialization, while maintaining a reasonable computational effort. This will ultimately enhance not only our ability to reproduce spatial patterns of soil carbon and nutrients at the catchment scale but also our understanding of the role of topography in modulating coupled processes related to water, energy, and biogeochemical fluxes in complex landscapes.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e2846">The T&amp;C-BG-2D model code is available in Zenodo (<ext-link xlink:href="https://doi.org/10.5281/ZENODO.18084473" ext-link-type="DOI">10.5281/ZENODO.18084473</ext-link>, <xref ref-type="bibr" rid="bib1.bibx23" id="altparen.62"/>). The accompanying initialization procedures and setup files for both the two-dimensional and plot-scale simulations, as well as processed meteorological forcing used by the model and derived from publicly available datasets (<ext-link xlink:href="https://doi.org/10.16904/envidat.380" ext-link-type="DOI">10.16904/envidat.380</ext-link>, <xref ref-type="bibr" rid="bib1.bibx59" id="altparen.63"/>), are also archived in Zenodo (<ext-link xlink:href="https://doi.org/10.5281/ZENODO.17213868" ext-link-type="DOI">10.5281/ZENODO.17213868</ext-link>, <xref ref-type="bibr" rid="bib1.bibx25" id="altparen.64"/>). All simulations are based on MATLAB R2024b.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e2868">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/gmd-19-4547-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/gmd-19-4547-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e2877">TL and SB conceived the idea and designed the study. TL carried out the numerical simulations, while ZZ and AP performed additional plot-scale simulations for the fully coupled T&amp;C-BG model. TL and SB analyzed the results. TL drafted the first version of the manuscript, with significant contributions from SB. All authors contributed to the interpretation of the results and the writing of the final version.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e2883">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e2889">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e2895">Sara Bonetti and Taiqi Lian acknowledge funding from the Swiss National Science Foundation (SNSF, grant 10002612).  The article processing charges for this open-access  publication were covered by EPFL.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e2906">This paper was edited by Nathaniel Chaney and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Ammann et al.(2009)Ammann, Spirig, Leifeld, and Neftel</label><mixed-citation>Ammann, C., Spirig, C., Leifeld, J., and Neftel, A.: Assessment of the nitrogen and carbon budget of two managed temperate grassland fields, Agr. Ecosyst. Environ., 133, 150–162, <ext-link xlink:href="https://doi.org/10.1016/j.agee.2009.05.006" ext-link-type="DOI">10.1016/j.agee.2009.05.006</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Baroni et al.(2019)Baroni, Schalge, Rakovec, Kumar, Schler, Samaniego, Simmer, and Attinger</label><mixed-citation>Baroni, G., Schalge, B., Rakovec, O., Kumar, R., Schüler, L., Samaniego, L., Simmer, C., and Attinger, S.: A Comprehensive Distributed Hydrological Modeling Intercomparison to Support Process Representation and Data Collection Strategies, Water Resour. Res., 55, 990–1010, <ext-link xlink:href="https://doi.org/10.1029/2018WR023941" ext-link-type="DOI">10.1029/2018WR023941</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Botter et al.(2021)Botter, Zeeman, Burlando, and Fatichi</label><mixed-citation>Botter, M., Zeeman, M., Burlando, P., and Fatichi, S.: Impacts of fertilization on grassland productivity and water quality across the European Alps under current and warming climate: insights from a mechanistic model, Biogeosciences, 18, 1917–1939, <ext-link xlink:href="https://doi.org/10.5194/bg-18-1917-2021" ext-link-type="DOI">10.5194/bg-18-1917-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Breiman(2001)</label><mixed-citation>Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, <ext-link xlink:href="https://doi.org/10.1023/A:1010933404324" ext-link-type="DOI">10.1023/A:1010933404324</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Carranza et al.(2021)Carranza, Nolet, Pezij, and Van Der Ploeg</label><mixed-citation>Carranza, C., Nolet, C., Pezij, M., and Van Der Ploeg, M.: Root zone soil moisture estimation with Random Forest, J. Hydrol., 593, 125840, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2020.125840" ext-link-type="DOI">10.1016/j.jhydrol.2020.125840</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Chaney et al.(2018)Chaney, Van Huijgevoort, Shevliakova, Malyshev, Milly, Gauthier, and Sulman</label><mixed-citation>Chaney, N. W., Van Huijgevoort, M. H. J., Shevliakova, E., Malyshev, S., Milly, P. C. D., Gauthier, P. P. G., and Sulman, B. N.: Harnessing Big Data to Rethink Land Heterogeneity in Earth System Models, Hydrol. Earth Syst. Sci., 22, 3311–3330, <ext-link xlink:href="https://doi.org/10.5194/hess-22-3311-2018" ext-link-type="DOI">10.5194/hess-22-3311-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Christensen et al.(2008)Christensen, Tague, and Baron</label><mixed-citation>Christensen, L., Tague, C. L., and Baron, J. S.: Spatial patterns of simulated transpiration response to climate variability in a snow dominated mountain ecosystem, Hydrol. Process., 22, 3576–3588, <ext-link xlink:href="https://doi.org/10.1002/hyp.6961" ext-link-type="DOI">10.1002/hyp.6961</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Fatichi et al.(2012a)Fatichi, Ivanov, and Caporali</label><mixed-citation>Fatichi, S., Ivanov, V. Y., and Caporali, E.: A Mechanistic Ecohydrological Model to Investigate Complex Interactions in Cold and Warm Water-controlled Environments: 2. Spatiotemporal Analyses, J. Adv. Model. Earth Syst., 4, 2011MS000087, <ext-link xlink:href="https://doi.org/10.1029/2011MS000087" ext-link-type="DOI">10.1029/2011MS000087</ext-link>, 2012a. </mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Fatichi et al.(2012b)Fatichi, Ivanov, and Caporali</label><mixed-citation>Fatichi, S., Ivanov, V. Y., and Caporali, E.: A mechanistic ecohydrological model to investigate complex interactions in cold and warm water-controlled environments: 1. Theoretical framework and plot-scale analysis, J. Adv. Model. Earth Syst., 4, 2011MS000086, <ext-link xlink:href="https://doi.org/10.1029/2011MS000086" ext-link-type="DOI">10.1029/2011MS000086</ext-link>, 2012b.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Fatichi et al.(2016)Fatichi, Pappas, and Ivanov</label><mixed-citation>Fatichi, S., Pappas, C., and Ivanov, V. Y.: Modeling plant–water interactions: an ecohydrological overview from the cell to the global scale, WIREs Water, 3, 327–368, <ext-link xlink:href="https://doi.org/10.1002/wat2.1125" ext-link-type="DOI">10.1002/wat2.1125</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Fatichi et al.(2019)Fatichi, Manzoni, Or, and Paschalis</label><mixed-citation>Fatichi, S., Manzoni, S., Or, D., and Paschalis, A.: A Mechanistic Model of Microbially Mediated Soil Biogeochemical Processes: A Reality Check, Global Biogeochem. Cy., 33, 620–648, <ext-link xlink:href="https://doi.org/10.1029/2018GB006077" ext-link-type="DOI">10.1029/2018GB006077</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Feng et al.(2013)Feng, Plante, and Six</label><mixed-citation>Feng, W., Plante, A. F., and Six, J.: Improving Estimates of Maximal Organic Carbon Stabilization by Fine Soil Particles, Biogeochemistry, 112, 81–93, <ext-link xlink:href="https://doi.org/10.1007/s10533-011-9679-7" ext-link-type="DOI">10.1007/s10533-011-9679-7</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Friedlingstein et al.(2025)Friedlingstein, O'Sullivan, Jones, Andrew, Hauck, Landschützer, Le Quéré, Li, Luijkx, Olsen, Peters, Peters, Pongratz, Schwingshackl, Sitch, Canadell, Ciais, Jackson, Alin, Arneth, Arora, Bates, Becker, Bellouin, Berghoff, Bittig, Bopp, Cadule, Campbell, Chamberlain, Chandra, Chevallier, Chini, Colligan, Decayeux, Djeutchouang, Dou, Duran Rojas, Enyo, Evans, Fay, Feely, Ford, Foster, Gasser, Gehlen, Gkritzalis, Grassi, Gregor, Gruber, Gürses, Harris, Hefner, Heinke, Hurtt, Iida, Ilyina, Jacobson, Jain, Jarníková, Jersild, Jiang, Jin, Kato, Keeling, Klein Goldewijk, Knauer, Korsbakken, Lan, Lauvset, Lefèvre, Liu, Liu, Ma, Maksyutov, Marland, Mayot, McGuire, Metzl, Monacci, Morgan, Nakaoka, Neill, Niwa, Nützel, Olivier, Ono, Palmer, Pierrot, Qin, Resplandy, Roobaert, Rosan, Rödenbeck, Schwinger, Smallman, Smith, Sospedra-Alfonso, Steinhoff, Sun, Sutton, Séférian, Takao, Tatebe, Tian, Tilbrook, Torres, Tourigny, Tsujino, Tubiello, Van Der Werf, Wanninkhof, Wang, Yang, Yang, Yu, Yuan, Yue, Zaehle, Zeng, and Zeng</label><mixed-citation>Friedlingstein, P., O'Sullivan, M., Jones, M. W., Andrew, R. M., Hauck, J., Landschützer, P., Le Quéré, C., Li, H., Luijkx, I. T., Olsen, A., Peters, G. P., Peters, W., Pongratz, J., Schwingshackl, C., Sitch, S., Canadell, J. G., Ciais, P., Jackson, R. B., Alin, S. R., Arneth, A., Arora, V., Bates, N. R., Becker, M., Bellouin, N., Berghoff, C. F., Bittig, H. C., Bopp, L., Cadule, P., Campbell, K., Chamberlain, M. A., Chandra, N., Chevallier, F., Chini, L.P., Colligan, T., Decayeux, J., Djeutchouang, L. M., Dou, X., Duran Rojas, C., Enyo, K., Evans, W., Fay, A. R., Feely, R. A., Ford, D. J., Foster, A., Gasser, T., Gehlen, M., Gkritzalis, T., Grassi, G., Gregor, L., Gruber, N., Gürses, Ö., Harris, I., Hefner, M., Heinke, J., Hurtt, G. C., Iida, Y., Ilyina, T., Jacobson, A. R., Jain, A. K., Jarníková, T., Jersild, A., Jiang, F., Jin, Z., Kato, E., Keeling, R. F., Klein Goldewijk, K., Knauer, J., Korsbakken, J. I., Lan, X., Lauvset, S. K., Lefèvre, N., Liu, Z., Liu, J., Ma, L., Maksyutov, S., Marland, G., Mayot, N., McGuire, P. C., Metzl, N., Monacci, N. M., Morgan, E. J., Nakaoka, S.-I., Neill, C., Niwa, Y., Nützel, T., Olivier, L., Ono, T., Palmer, P. I., Pierrot, D., Qin, Z., Resplandy, L., Roobaert, A., Rosan, T. M., Rödenbeck, C., Schwinger, J., Smallman, T. L., Smith, S. M., Sospedra-Alfonso, R., Steinhoff, T., Sun, Q., Sutton, A. J., Séférian, R., Takao, S., Tatebe, H., Tian, H., Tilbrook, B., Torres, O., Tourigny, E., Tsujino, H., Tubiello, F., Van Der Werf, G., Wanninkhof, R., Wang, X., Yang, D., Yang, X., Yu, Z., Yuan, W., Yue, X., Zaehle, S., Zeng, N., and Zeng, J.: Global Carbon Budget 2024, Earth Syst. Sci. Data, 17, 965–1039, <ext-link xlink:href="https://doi.org/10.5194/essd-17-965-2025" ext-link-type="DOI">10.5194/essd-17-965-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Guo and Gifford(2002)</label><mixed-citation>Guo, L. B. and Gifford, R. M.: Soil carbon stocks and land use change: a meta analysis, Global Change Biol., 8, 345–360, <ext-link xlink:href="https://doi.org/10.1046/j.1354-1013.2002.00486.x" ext-link-type="DOI">10.1046/j.1354-1013.2002.00486.x</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Gupta et al.(2021)Gupta, Lehmann, Bonetti, Papritz, and Or</label><mixed-citation>Gupta, S., Lehmann, P., Bonetti, S., Papritz, A., and Or, D.: Global Prediction of Soil Saturated Hydraulic Conductivity Using Random Forest in a Covariate‐Based GeoTransfer Function (CoGTF) Framework, J. Adv. Model. Earth Syst., 13, e2020MS002242, <ext-link xlink:href="https://doi.org/10.1029/2020MS002242" ext-link-type="DOI">10.1029/2020MS002242</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Gupta et al.(2024)Gupta, Hasler, and Alewell</label><mixed-citation>Gupta, S., Hasler, J. K., and Alewell, C.: Mining soil data of Switzerland: New maps for soil texture, soil organic carbon, nitrogen, and phosphorus, Geoderma Reg., 36, e00747, <ext-link xlink:href="https://doi.org/10.1016/j.geodrs.2023.e00747" ext-link-type="DOI">10.1016/j.geodrs.2023.e00747</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Hashimoto et al.(2011)Hashimoto, Wattenbach, and Smith</label><mixed-citation>Hashimoto, S., Wattenbach, M., and Smith, P.: A new scheme for initializing process-based ecosystem models by scaling soil carbon pools, Ecol. Model., 222, 3598–3602, <ext-link xlink:href="https://doi.org/10.1016/j.ecolmodel.2011.08.011" ext-link-type="DOI">10.1016/j.ecolmodel.2011.08.011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Ilstedt(2003)</label><mixed-citation>Ilstedt, U.: Changes in soil chemical and microbial properties after a wildfire in a tropical rainforest in Sabah, Malaysia, Soil Biol. Biochem., 35, 1071–1078, <ext-link xlink:href="https://doi.org/10.1016/S0038-0717(03)00152-4" ext-link-type="DOI">10.1016/S0038-0717(03)00152-4</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Kim et al.(2018)Kim, Kwon, and Han</label><mixed-citation>Kim, K. B., Kwon, H.-H., and Han, D.: Exploration of warm-up period in conceptual hydrological modelling, J. Hydrol., 556, 194–210, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2017.11.015" ext-link-type="DOI">10.1016/j.jhydrol.2017.11.015</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Krinner et al.(2005)Krinner, Viovy, De Noblet‐Ducoudr, Oge, Polcher, Friedlingstein, Ciais, Sitch, and Prentice</label><mixed-citation>Krinner, G., Viovy, N., De Noblet‐Ducoudré, N., Ogée, J., Polcher, J., Friedlingstein, P., Ciais, P., Sitch, S., and Prentice, I. C.: A dynamic global vegetation model for studies of the coupled atmosphere‐biosphere system, Global Biogeochem. Cy., 19, 2003GB002199, <ext-link xlink:href="https://doi.org/10.1029/2003GB002199" ext-link-type="DOI">10.1029/2003GB002199</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Kunkel et al.(2022)Kunkel, Wells, and Hancock</label><mixed-citation>Kunkel, V., Wells, T., and Hancock, G.: Modelling soil organic carbon using vegetation indices across large catchments in eastern Australia, Sci. Total Environ., 817, 152690, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2021.152690" ext-link-type="DOI">10.1016/j.scitotenv.2021.152690</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Li et al.(2020)Li, Zhang, and Liu</label><mixed-citation>Li, J., Zhang, D., and Liu, M.: Factors controlling the spatial distribution of soil organic carbon in Daxing’anling Mountain, Sci. Rep., 10, 12659, <ext-link xlink:href="https://doi.org/10.1038/s41598-020-69590-y" ext-link-type="DOI">10.1038/s41598-020-69590-y</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Lian et al.(2025a)Lian, Fatichi, and Bonetti</label><mixed-citation>Lian, T., Fatichi, S., and Bonetti, S.: TeC_BG_2D Ecohydrological Model: V1.0.0, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/ZENODO.18084473" ext-link-type="DOI">10.5281/ZENODO.18084473</ext-link>, 2025a.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Lian et al.(2025b)Lian, Fatichi, Sthli, and Bonetti</label><mixed-citation>Lian, T., Fatichi, S., Stähli, M., and Bonetti, S.: Assessing Spatial Patterns of Carbon and Nutrient Dynamics in Catchments of Complex Topography, Water Resour. Res., 61, e2025WR040260, <ext-link xlink:href="https://doi.org/10.1029/2025WR040260" ext-link-type="DOI">10.1029/2025WR040260</ext-link>, 2025b.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Lian et al.(2025c)Lian, Zhang, Paschalis, and Bonetti</label><mixed-citation>Lian, T., Zhang, Z., Paschalis, A., and Bonetti, S.: TeC_BG_2D_Spin_up: V2.0.0, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/ZENODO.17213868" ext-link-type="DOI">10.5281/ZENODO.17213868</ext-link>, 2025c.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Loh(2002)</label><mixed-citation>Loh, W.-Y.: Regression trees with unbiased variable selection and interaction detection, Statistica sinica, JSTOR, 361–386, <uri>https://www.jstor.org/stable/24306967</uri> (last access: 24 May 2026), 2002.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Lugato et al.(2014)Lugato, Panagos, Bampa, Jones, and Montanarella</label><mixed-citation>Lugato, E., Panagos, P., Bampa, F., Jones, A., and Montanarella, L.: A new baseline of organic carbon stock in European agricultural soils using a modelling approach, Global Change Biol., 20, 313–326, <ext-link xlink:href="https://doi.org/10.1111/gcb.12292" ext-link-type="DOI">10.1111/gcb.12292</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Lundberg and Lee(2017)</label><mixed-citation>Lundberg, S. M. and Lee, S.-I.: A Unified Approach to Interpreting Model Predictions, in: Advances in Neural Information Processing Systems, vol. 30, edited by: Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., Curran Associates, Inc., <uri>https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf</uri> (last access: 24 May 2026), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Mahdavi et al.(2018)Mahdavi, Salehi, Granger, Amani, Brisco, and Huang</label><mixed-citation>Mahdavi, S., Salehi, B., Granger, J., Amani, M., Brisco, B., and Huang, W.: Remote sensing for wetland classification: a comprehensive review, GISci. Remote Sens., 55, 623–658, <ext-link xlink:href="https://doi.org/10.1080/15481603.2017.1419602" ext-link-type="DOI">10.1080/15481603.2017.1419602</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Manzoni et al.(2012)Manzoni, Schimel, and Porporato</label><mixed-citation>Manzoni, S., Schimel, J. P., and Porporato, A.: Responses of soil microbial communities to water stress: results from a meta-analysis, Ecology, 93, 930–938, <ext-link xlink:href="https://doi.org/10.1890/11-0026.1" ext-link-type="DOI">10.1890/11-0026.1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Manzoni et al.(2014)Manzoni, Schaeffer, Katul, Porporato, and Schimel</label><mixed-citation>Manzoni, S., Schaeffer, S. M., Katul, G., Porporato, A., and Schimel, J. P.: A theoretical analysis of microbial eco-physiological and diffusion limitations to carbon cycling in drying soils, Soil Biol. Biochem., 73, 69–83, <ext-link xlink:href="https://doi.org/10.1016/j.soilbio.2014.02.008" ext-link-type="DOI">10.1016/j.soilbio.2014.02.008</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Martin et al.(2014)Martin, Orton, Lacarce, Meersmans, Saby, Paroissien, Jolivet, Boulonne, and Arrouays</label><mixed-citation>Martin, M., Orton, T., Lacarce, E., Meersmans, J., Saby, N., Paroissien, J., Jolivet, C., Boulonne, L., and Arrouays, D.: Evaluation of modelling approaches for predicting the spatial distribution of soil organic carbon stocks at the national scale, Geoderma, 223–225, 97–107, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2014.01.005" ext-link-type="DOI">10.1016/j.geoderma.2014.01.005</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Martín-López et al.(2023)Martn-Lpez, Verchot, Martius, and Da Silva</label><mixed-citation>Martín-López, J. M., Verchot, L. V., Martius, C., and Da Silva, M.: Modeling the Spatial Distribution of Soil Organic Carbon and Carbon Stocks in the Casanare Flooded Savannas of the Colombian Llanos, Wetlands, 43, 65, <ext-link xlink:href="https://doi.org/10.1007/s13157-023-01705-3" ext-link-type="DOI">10.1007/s13157-023-01705-3</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Matus(2021)</label><mixed-citation>Matus, F. J.: Fine Silt and Clay Content Is the Main Factor Defining Maximal C and N Accumulations in Soils: A Meta-Analysis, Sci. Rep., 11, 6438, <ext-link xlink:href="https://doi.org/10.1038/s41598-021-84821-6" ext-link-type="DOI">10.1038/s41598-021-84821-6</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Meersmans et al.(2008)Meersmans, De Ridder, Canters, De Baets, and Van Molle</label><mixed-citation>Meersmans, J., De Ridder, F., Canters, F., De Baets, S., and Van Molle, M.: A multiple regression approach to assess the spatial distribution of Soil Organic Carbon (SOC) at the regional scale (Flanders, Belgium), Geoderma, 143, 1–13, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2007.08.025" ext-link-type="DOI">10.1016/j.geoderma.2007.08.025</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Moyano et al.(2013)Moyano, Manzoni, and Chenu</label><mixed-citation>Moyano, F. E., Manzoni, S., and Chenu, C.: Responses of soil heterotrophic respiration to moisture availability: An exploration of processes and models, Soil Biol. Biochem., 59, 72–85, <ext-link xlink:href="https://doi.org/10.1016/j.soilbio.2013.01.002" ext-link-type="DOI">10.1016/j.soilbio.2013.01.002</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Nave et al.(2011)Nave, Vance, Swanston, and Curtis</label><mixed-citation>Nave, L. E., Vance, E. D., Swanston, C. W., and Curtis, P. S.: Fire effects on temperate forest soil C and N storage, Ecol. Appl., 21, 1189–1201, <ext-link xlink:href="https://doi.org/10.1890/10-0660.1" ext-link-type="DOI">10.1890/10-0660.1</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Nemo et al.(2017)Nemo, Klumpp, Coleman, Dondini, Goulding, Hastings, Jones, Leifeld, Osborne, Saunders, Scott, Teh, and Smith</label><mixed-citation>Nemo, Klumpp, K., Coleman, K., Dondini, M., Goulding, K., Hastings, A., Jones, M. B., Leifeld, J., Osborne, B., Saunders, M., Scott, T., Teh, Y. A., and Smith, P.: Soil Organic Carbon (SOC) Equilibrium and Model Initialisation Methods: an Application to the Rothamsted Carbon (RothC) Model, Environ. Model. Assess., 22, 215–229, <ext-link xlink:href="https://doi.org/10.1007/s10666-016-9536-0" ext-link-type="DOI">10.1007/s10666-016-9536-0</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Ng et al.(2014)Ng, Bedford, and Miller</label><mixed-citation>Ng, G. C., Bedford, D. R., and Miller, D. M.: A mechanistic modeling and data assimilation framework for Mojave Desert ecohydrology, Water Resour. Res., 50, 4662–4685, <ext-link xlink:href="https://doi.org/10.1002/2014WR015281" ext-link-type="DOI">10.1002/2014WR015281</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Nigrelli et al.(2018)Nigrelli, Fratianni, Zampollo, Turconi, and Chiarle</label><mixed-citation>Nigrelli, G., Fratianni, S., Zampollo, A., Turconi, L., and Chiarle, M.: The altitudinal temperature lapse rates applied to high elevation rockfalls studies in the Western European Alps, Theor. Appl. Climatol., 131, 1479–1491, <ext-link xlink:href="https://doi.org/10.1007/s00704-017-2066-0" ext-link-type="DOI">10.1007/s00704-017-2066-0</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Noto et al.(2008)Noto, Ivanov, Bras, and Vivoni</label><mixed-citation>Noto, L. V., Ivanov, V. Y., Bras, R. L., and Vivoni, E. R.: Effects of initialization on response of a fully-distributed hydrologic model, J. Hydrol., 352, 107–125, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2007.12.031" ext-link-type="DOI">10.1016/j.jhydrol.2007.12.031</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Parras-Alcántara et al.(2015)Parras-Alcntara, Lozano-Garca, and Galn-Espejo</label><mixed-citation>Parras-Alcántara, L., Lozano-García, B., and Galán-Espejo, A.: Soil organic carbon along an altitudinal gradient in the Despeñaperros Natural Park, southern Spain, Solid Earth, 6, 125–134, <ext-link xlink:href="https://doi.org/10.5194/se-6-125-2015" ext-link-type="DOI">10.5194/se-6-125-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Parvizi and Fatehi(2025)</label><mixed-citation>Parvizi, Y. and Fatehi, S.: Geospatial digital mapping of soil organic carbon using machine learning and geostatistical methods in different land uses, Sci. Rep., 15, 4449, <ext-link xlink:href="https://doi.org/10.1038/s41598-025-88062-9" ext-link-type="DOI">10.1038/s41598-025-88062-9</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Paschalis et al.(2022)Paschalis, Bonetti, Guo, and Fatichi</label><mixed-citation>Paschalis, A., Bonetti, S., Guo, Y., and Fatichi, S.: On the Uncertainty Induced by Pedotransfer Functions in Terrestrial Biosphere Modeling, Water Resour. Res., 58, e2021WR031871, <ext-link xlink:href="https://doi.org/10.1029/2021WR031871" ext-link-type="DOI">10.1029/2021WR031871</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Pawusch et al.(2025)Pawusch, Scheurer, Nowak, and Maxwell</label><mixed-citation>Pawusch, L., Scheurer, S., Nowak, W., and Maxwell, R. M.: HydroStartML: A Combined Machine Learning and Physics-Based Approach to Reduce Hydrological Model Spin-up Time, Adv. Water Resour., 206, 105124, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2025.105124" ext-link-type="DOI">10.1016/j.advwatres.2025.105124</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Pazúr et al.(2021)Pazr, Huber, Weber, Ginzler, and Price</label><mixed-citation>Pazúr, R., Huber, N., Weber, D., Ginzler, C., and Price, B.: Cropland and grassland map of Switzerland based on Sentinel-2 data, EnviDat [data set], <ext-link xlink:href="https://doi.org/10.16904/ENVIDAT.205" ext-link-type="DOI">10.16904/ENVIDAT.205</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Peters et al.(2007)Peters, Baets, Verhoest, Samson, Degroeve, Becker, and Huybrechts</label><mixed-citation>Peters, J., Baets, B. D., Verhoest, N. E., Samson, R., Degroeve, S., Becker, P. D., and Huybrechts, W.: Random forests as a tool for ecohydrological distribution modelling, Ecol. Model., 207, 304–318, <ext-link xlink:href="https://doi.org/10.1016/j.ecolmodel.2007.05.011" ext-link-type="DOI">10.1016/j.ecolmodel.2007.05.011</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Qiu et al.(2023)Qiu, Niu, Baas, and Phanikumar</label><mixed-citation>Qiu, H., Niu, J., Baas, D. G., and Phanikumar, M. S.: An integrated watershed-scale framework to model nitrogen transport and transformations, Sci. Total Environ., 882, 163348, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2023.163348" ext-link-type="DOI">10.1016/j.scitotenv.2023.163348</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Qu et al.(2018)Qu, Maksyutov, and Zhuang</label><mixed-citation>Qu, Y., Maksyutov, S., and Zhuang, Q.: Technical Note: An efficient method for accelerating the spin-up process for process-based biogeochemistry models, Biogeosciences, 15, 3967–3973, <ext-link xlink:href="https://doi.org/10.5194/bg-15-3967-2018" ext-link-type="DOI">10.5194/bg-15-3967-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Randerson et al.(2009)Randerson, Hoffman, Thornton, Mahowald, Lindsay, Lee, Nevison, Doney, Bonan, Stckli, Covey, Running, and Fung</label><mixed-citation>Randerson, J. T., Hoffman, F. M., Thornton, P. E., Mahowald, N. M., Lindsay, K., Lee, Y., Nevison, C. D., Doney, S. C., Bonan, G., Stöckli, R., Covey, C., Running, S. W., and Fung, I. Y.: Systematic assessment of terrestrial biogeochemistry in coupled climate–carbon models, Global Change Biol., 15, 2462–2484, <ext-link xlink:href="https://doi.org/10.1111/j.1365-2486.2009.01912.x" ext-link-type="DOI">10.1111/j.1365-2486.2009.01912.x</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Ruiz et al.(2015)Ruiz, Or, and Schymanski</label><mixed-citation>Ruiz, S., Or, D., and Schymanski, S. J.: Soil Penetration by Earthworms and Plant Roots–Mechanical Energetics of Bioturbation of Compacted Soils, PLOS ONE, 10, e0128914, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0128914" ext-link-type="DOI">10.1371/journal.pone.0128914</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Schwarz et al.(2024)Schwarz, Ghersheen, Belyazid, and Manzoni</label><mixed-citation>Schwarz, E., Ghersheen, S., Belyazid, S., and Manzoni, S.: When and why microbial-explicit soil organic carbon models can be unstable, Biogeosciences, 21, 3441–3461, <ext-link xlink:href="https://doi.org/10.5194/bg-21-3441-2024" ext-link-type="DOI">10.5194/bg-21-3441-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Seck et al.(2015)Seck, Welty, and Maxwell</label><mixed-citation>Seck, A., Welty, C., and Maxwell, R. M.: Spin‐up behavior and effects of initial conditions for an integrated hydrologic model, Water Resour. Res., 51, 2188–2210, <ext-link xlink:href="https://doi.org/10.1002/2014WR016371" ext-link-type="DOI">10.1002/2014WR016371</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Senarath et al.(2000)Senarath, Ogden, Downer, and Sharif</label><mixed-citation>Senarath, S. U. S., Ogden, F. L., Downer, C. W., and Sharif, H. O.: On the calibration and verification of two‐dimensional, distributed, Hortonian, continuous watershed models, Water Resour. Res., 36, 1495–1510, <ext-link xlink:href="https://doi.org/10.1029/2000WR900039" ext-link-type="DOI">10.1029/2000WR900039</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Shi et al.(2013)Shi, Yang, Lawrence, Dickinson, and Subin</label><mixed-citation>Shi, M., Yang, Z.-L., Lawrence, D. M., Dickinson, R. E., and Subin, Z. M.: Spin-up processes in the Community Land Model version 4 with explicit carbon and nitrogen components, Ecol. Model., 263, 308–325, <ext-link xlink:href="https://doi.org/10.1016/j.ecolmodel.2013.04.008" ext-link-type="DOI">10.1016/j.ecolmodel.2013.04.008</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Shi et al.(2018)Shi, Eissenstat, He, and Davis</label><mixed-citation>Shi, Y., Eissenstat, D. M., He, Y., and Davis, K. J.: Using a spatially-distributed hydrologic biogeochemistry model with a nitrogen transport module to study the spatial variation of carbon processes in a Critical Zone Observatory, Ecol. Model., 380, 8–21, <ext-link xlink:href="https://doi.org/10.1016/j.ecolmodel.2018.04.007" ext-link-type="DOI">10.1016/j.ecolmodel.2018.04.007</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Simon et al.(2023)Simon, Glaum, and Valdovinos</label><mixed-citation>Simon, S. M., Glaum, P., and Valdovinos, F. S.: Interpreting random forest analysis of ecological models to move from prediction to explanation, Sci. Rep., 13, 3881, <ext-link xlink:href="https://doi.org/10.1038/s41598-023-30313-8" ext-link-type="DOI">10.1038/s41598-023-30313-8</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Six et al.(2002)Six, Conant, Paul, and Paustian</label><mixed-citation>Six, J., Conant, R. T., Paul, E. A., and Paustian, K.: Stabilization Mechanisms of Soil Organic Matter: Implications for C-saturation of Soils, Plant Soil, 241, 155–176, <ext-link xlink:href="https://doi.org/10.1023/A:1016125726789" ext-link-type="DOI">10.1023/A:1016125726789</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Stähli(2018)</label><mixed-citation>Stähli, M.: Longterm hydrological observatory alptal (central switzerland), EnviDat [data set], <ext-link xlink:href="https://doi.org/10.16904/envidat.380" ext-link-type="DOI">10.16904/envidat.380</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Stähli et al.(2021)Sthli, Seibert, Kirchner, Von Freyberg, and Van Meerveld</label><mixed-citation>Stähli, M., Seibert, J., Kirchner, J. W., Von Freyberg, J., and Van Meerveld, I.: Hydrological trends and the evolution of catchment research in the Alptal valley, central Switzerland, Hydrol. Process., 35, e14113, <ext-link xlink:href="https://doi.org/10.1002/hyp.14113" ext-link-type="DOI">10.1002/hyp.14113</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Stone(1976)</label><mixed-citation>Stone, M.: Cross-Validatory Choice and Assessment of Statistical Predictions (With Discussion), J. Roy. Stat. Soc. Ser. B, 38, 102, <ext-link xlink:href="https://doi.org/10.1111/j.2517-6161.1976.tb01573.x" ext-link-type="DOI">10.1111/j.2517-6161.1976.tb01573.x</ext-link>, 1976.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Sun et al.(2021)Sun, Goll, Chang, Ciais, Guenet, Helfenstein, Huang, Lauerwald, Maignan, Naipal, Wang, Yang, and Zhang</label><mixed-citation>Sun, Y., Goll, D. S., Chang, J., Ciais, P., Guenet, B., Helfenstein, J., Huang, Y., Lauerwald, R., Maignan, F., Naipal, V., Wang, Y., Yang, H., and Zhang, H.: Global Evaluation of the Nutrient-Enabled Version of the Land Surface Model ORCHIDEE-CNP v1.2 (R5986), Geosci. Model Dev., 14, 1987–2010, <ext-link xlink:href="https://doi.org/10.5194/gmd-14-1987-2021" ext-link-type="DOI">10.5194/gmd-14-1987-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Sun et al.(2023)Sun, Goll, Huang, Ciais, Wang, Bastrikov, and Wang</label><mixed-citation>Sun, Y., Goll, D. S., Huang, Y., Ciais, P., Wang, Y.-P., Bastrikov, V., and Wang, Y.: Machine Learning for Accelerating Process-based Computation of Land Biogeochemical Cycles, Global Change Biol., 29, 3221–3234, <ext-link xlink:href="https://doi.org/10.1111/gcb.16623" ext-link-type="DOI">10.1111/gcb.16623</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Tague and Band(2004)</label><mixed-citation>Tague, C. L. and Band, L. E.: RHESSys: Regional Hydro-Ecologic Simulation System – An Object-Oriented Approach to Spatially Distributed Modeling of Carbon, Water, and Nutrient Cycling, Earth Interact., 8, 1–42, <ext-link xlink:href="https://doi.org/10.1175/1087-3562(2004)8&lt;1:RRHSSO&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1087-3562(2004)8&lt;1:RRHSSO&gt;2.0.CO;2</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Teng et al.(2017)Teng, Zeng, Xiao, Huang, Zhou, Yan, and Wang</label><mixed-citation>Teng, M., Zeng, L., Xiao, W., Huang, Z., Zhou, Z., Yan, Z., and Wang, P.: Spatial variability of soil organic carbon in Three Gorges Reservoir area, China, Sci. Total Environ., 599-600, 1308–1316, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2017.05.085" ext-link-type="DOI">10.1016/j.scitotenv.2017.05.085</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Thornton et al.(2002)Thornton, Law, Gholz, Clark, Falge, Ellsworth, Goldstein, Monson, Hollinger, Falk, Chen, and Sparks</label><mixed-citation>Thornton, P., Law, B., Gholz, H. L., Clark, K. L., Falge, E., Ellsworth, D., Goldstein, A., Monson, R., Hollinger, D., Falk, M., Chen, J., and Sparks, J.: Modeling and measuring the effects of disturbance history and climate on carbon and water budgets in evergreen needleleaf forests, Agr. Forest Meteorol., 113, 185–222, <ext-link xlink:href="https://doi.org/10.1016/S0168-1923(02)00108-9" ext-link-type="DOI">10.1016/S0168-1923(02)00108-9</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Thornton and Rosenbloom(2005)</label><mixed-citation>Thornton, P. E. and Rosenbloom, N. A.: Ecosystem model spin-up: Estimating steady state conditions in a coupled terrestrial carbon and nitrogen cycle model, Ecol. Model., 189, 25–48, <ext-link xlink:href="https://doi.org/10.1016/j.ecolmodel.2005.04.008" ext-link-type="DOI">10.1016/j.ecolmodel.2005.04.008</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Tyralis et al.(2019)Tyralis, Papacharalampous, and Langousis</label><mixed-citation>Tyralis, H., Papacharalampous, G., and Langousis, A.: A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources, Water, 11, 910, <ext-link xlink:href="https://doi.org/10.3390/w11050910" ext-link-type="DOI">10.3390/w11050910</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>Verhoef et al.(2026)Verhoef, Zeng, Agam, Best, Bonetti, Boussetta, Chaney, Cuntz, Edwards, Gupta, Heitman, Huang, Jarvis, Jiang, Kandala, Kiałka, Lian, Mu, Nemes, Paulus, Raoult, Reddy, Romano, Sabot, Vanderborght, Van Der Ploeg, Van Oevelen, Wang, Wang, Weber, Marthews, and Weihermüller</label><mixed-citation>Verhoef, A., Zeng, Y., Agam, N., Best, M., Bonetti, S., Boussetta, S., Chaney, N., Cuntz, M., Edwards, J., Gupta, S., Heitman, J., Huang, M., Jarvis, N., Jiang, S., Kandala, R., Kiałka, F., Lian, T., Mu, M., Nemes, A., Paulus, S. J., Raoult, N., Reddy, K. N., Romano, N., Sabot, M., Vanderborght, J., Van Der Ploeg, M., Van Oevelen, P., Wang, Y., Wang, Y., Weber, T. K. D., Marthews, T., and Weihermüller, L.: Rethinking Soils in Land Surface Models, B. Am. Meteorol. Soc., 107, E665–E674, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-26-0003.1" ext-link-type="DOI">10.1175/BAMS-D-26-0003.1</ext-link>, 2026. </mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Xia et al.(2012)Xia, Luo, Wang, Weng, and Hararuk</label><mixed-citation>Xia, J. Y., Luo, Y. Q., Wang, Y.-P., Weng, E. S., and Hararuk, O.: A semi-analytical solution to accelerate spin-up of a coupled carbon and nitrogen land model to steady state, Geosci. Model Dev., 5, 1259–1271, <ext-link xlink:href="https://doi.org/10.5194/gmd-5-1259-2012" ext-link-type="DOI">10.5194/gmd-5-1259-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Yan et al.(2018)Yan, Bond-Lamberty, Todd-Brown, Bailey, Li, Liu, and Liu</label><mixed-citation>Yan, Z., Bond-Lamberty, B., Todd-Brown, K. E., Bailey, V. L., Li, S., Liu, C., and Liu, C.: A moisture function of soil heterotrophic respiration that incorporates microscale processes, Nat. Commun., 9, 2562, <ext-link xlink:href="https://doi.org/10.1038/s41467-018-04971-6" ext-link-type="DOI">10.1038/s41467-018-04971-6</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>Yang et al.(2025)Yang, Zhao, and Biswas</label><mixed-citation>Yang, Y., Zhao, R., and Biswas, A.: Delineating Dynamic Hydrological Response Units to Improve Simulations of Extreme Runoff Events in Changing Environments, J. Hydrol., 656, 133000, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2025.133000" ext-link-type="DOI">10.1016/j.jhydrol.2025.133000</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Yao et al.(2023)Yao, Dai, Gao, Yi, Wang, and Hu</label><mixed-citation>Yao, Y., Dai, Q., Gao, R., Yi, X., Wang, Y., and Hu, Z.: Characteristics and factors influencing soil organic carbon composition by vegetation type in spoil heaps, Front. Plant Sci., 14, 1240217, <ext-link xlink:href="https://doi.org/10.3389/fpls.2023.1240217" ext-link-type="DOI">10.3389/fpls.2023.1240217</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx74"><label>Zeraatpisheh et al.(2022)Zeraatpisheh, Garosi, Reza Owliaie, Ayoubi, Taghizadeh-Mehrjardi, Scholten, and Xu</label><mixed-citation>Zeraatpisheh, M., Garosi, Y., Reza Owliaie, H., Ayoubi, S., Taghizadeh-Mehrjardi, R., Scholten, T., and Xu, M.: Improving the spatial prediction of soil organic carbon using environmental covariates selection: A comparison of a group of environmental covariates, Catena, 208, 105723, <ext-link xlink:href="https://doi.org/10.1016/j.catena.2021.105723" ext-link-type="DOI">10.1016/j.catena.2021.105723</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>Zhan et al.(2003)Zhan, Xue, and Collatz</label><mixed-citation>Zhan, X., Xue, Y., and Collatz, G.: An analytical approach for estimating CO<sub>2</sub> and heat fluxes over the Amazonian region, Ecol. Model., 162, 97–117, <ext-link xlink:href="https://doi.org/10.1016/S0304-3800(02)00405-2" ext-link-type="DOI">10.1016/S0304-3800(02)00405-2</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx76"><label>Zhang et al.(2018)Zhang, Li, Zhu, Zheng, Liu, Tang, Su, Zhang, Ju, and Deng</label><mixed-citation>Zhang, W., Li, Y., Zhu, B., Zheng, X., Liu, C., Tang, J., Su, F., Zhang, C., Ju, X., and Deng, J.: A process-oriented hydro-biogeochemical model enabling simulation of gaseous carbon and nitrogen emissions and hydrologic nitrogen losses from a subtropical catchment, Sci. Total Environ., 616–617, 305–317, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2017.09.261" ext-link-type="DOI">10.1016/j.scitotenv.2017.09.261</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx77"><label>Zhang et al.(2023)Zhang, Chen, Wang, Wang, Lin, and Ye</label><mixed-citation>Zhang, Y., Chen, A., Wang, Z., Wang, X., Lin, Y., and Ye, C.: Soil organic carbon accumulation along a chronosequence of vegetation colonization on debris flow fans in Southwest China, Catena, 223, 106936, <ext-link xlink:href="https://doi.org/10.1016/j.catena.2023.106936" ext-link-type="DOI">10.1016/j.catena.2023.106936</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx78"><label>Zhu et al.(2019)Zhu, Feng, Qin, Cao, Zhang, Liu, Deo, Zhang, Li, and Li</label><mixed-citation>Zhu, M., Feng, Q., Qin, Y., Cao, J., Zhang, M., Liu, W., Deo, R. C., Zhang, C., Li, R., and Li, B.: The role of topography in shaping the spatial patterns of soil organic carbon, Catena, 176, 296–305, <ext-link xlink:href="https://doi.org/10.1016/j.catena.2019.01.029" ext-link-type="DOI">10.1016/j.catena.2019.01.029</ext-link>, 2019.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>A hybrid framework for the spin-up and initialization of distributed coupled ecohydrological-biogeochemical models</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Ammann et al.(2009)Ammann, Spirig, Leifeld, and
Neftel</label><mixed-citation>
      
Ammann, C., Spirig, C., Leifeld, J., and Neftel, A.: Assessment of the nitrogen and carbon budget of two managed temperate grassland fields, Agr. Ecosyst. Environ., 133, 150–162, <a href="https://doi.org/10.1016/j.agee.2009.05.006" target="_blank">https://doi.org/10.1016/j.agee.2009.05.006</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Baroni et al.(2019)Baroni, Schalge, Rakovec, Kumar, Schler,
Samaniego, Simmer, and Attinger</label><mixed-citation>
      
Baroni, G., Schalge, B., Rakovec, O., Kumar, R., Schüler, L., Samaniego, L., Simmer, C., and Attinger, S.: A Comprehensive Distributed Hydrological Modeling Intercomparison to Support Process Representation and Data Collection Strategies, Water Resour. Res., 55, 990–1010, <a href="https://doi.org/10.1029/2018WR023941" target="_blank">https://doi.org/10.1029/2018WR023941</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Botter et al.(2021)Botter, Zeeman, Burlando, and
Fatichi</label><mixed-citation>
      
Botter, M., Zeeman, M., Burlando, P., and Fatichi, S.: Impacts of fertilization on grassland productivity and water quality across the European Alps under current and warming climate: insights from a mechanistic model, Biogeosciences, 18, 1917–1939, <a href="https://doi.org/10.5194/bg-18-1917-2021" target="_blank">https://doi.org/10.5194/bg-18-1917-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Breiman(2001)</label><mixed-citation>
      
Breiman, L.: Random Forests, Mach. Learn., 45, 5–32,
<a href="https://doi.org/10.1023/A:1010933404324" target="_blank">https://doi.org/10.1023/A:1010933404324</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Carranza et al.(2021)Carranza, Nolet, Pezij, and Van
Der Ploeg</label><mixed-citation>
      
Carranza, C., Nolet, C., Pezij, M., and Van Der Ploeg, M.: Root zone soil
moisture estimation with Random Forest, J. Hydrol., 593, 125840, <a href="https://doi.org/10.1016/j.jhydrol.2020.125840" target="_blank">https://doi.org/10.1016/j.jhydrol.2020.125840</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Chaney et al.(2018)Chaney, Van Huijgevoort, Shevliakova, Malyshev,
Milly, Gauthier, and Sulman</label><mixed-citation>
      
Chaney, N. W., Van Huijgevoort, M. H. J., Shevliakova, E., Malyshev, S., Milly, P. C. D., Gauthier, P. P. G., and Sulman, B. N.: Harnessing Big Data to Rethink Land Heterogeneity in Earth System Models, Hydrol. Earth Syst. Sci., 22, 3311–3330, <a href="https://doi.org/10.5194/hess-22-3311-2018" target="_blank">https://doi.org/10.5194/hess-22-3311-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Christensen et al.(2008)Christensen, Tague, and
Baron</label><mixed-citation>
      
Christensen, L., Tague, C. L., and Baron, J. S.: Spatial patterns of simulated transpiration response to climate variability in a snow dominated mountain ecosystem, Hydrol. Process., 22, 3576–3588, <a href="https://doi.org/10.1002/hyp.6961" target="_blank">https://doi.org/10.1002/hyp.6961</a>,
2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Fatichi et al.(2012a)Fatichi, Ivanov, and
Caporali</label><mixed-citation>
      
Fatichi, S., Ivanov, V. Y., and Caporali, E.: A Mechanistic Ecohydrological Model to Investigate Complex Interactions in Cold and Warm Water-controlled Environments: 2. Spatiotemporal Analyses, J. Adv. Model. Earth Syst., 4, 2011MS000087, <a href="https://doi.org/10.1029/2011MS000087" target="_blank">https://doi.org/10.1029/2011MS000087</a>, 2012a.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Fatichi et al.(2012b)Fatichi, Ivanov, and
Caporali</label><mixed-citation>
      
Fatichi, S., Ivanov, V. Y., and Caporali, E.: A mechanistic ecohydrological model to investigate complex interactions in cold and warm water-controlled environments: 1. Theoretical framework and plot-scale analysis, J. Adv. Model. Earth Syst., 4, 2011MS000086, <a href="https://doi.org/10.1029/2011MS000086" target="_blank">https://doi.org/10.1029/2011MS000086</a>, 2012b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Fatichi et al.(2016)Fatichi, Pappas, and
Ivanov</label><mixed-citation>
      
Fatichi, S., Pappas, C., and Ivanov, V. Y.: Modeling plant–water
interactions: an ecohydrological overview from the cell to the global scale,
WIREs Water, 3, 327–368, <a href="https://doi.org/10.1002/wat2.1125" target="_blank">https://doi.org/10.1002/wat2.1125</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Fatichi et al.(2019)Fatichi, Manzoni, Or, and
Paschalis</label><mixed-citation>
      
Fatichi, S., Manzoni, S., Or, D., and Paschalis, A.: A Mechanistic Model of Microbially Mediated Soil Biogeochemical Processes: A Reality Check, Global Biogeochem. Cy., 33, 620–648, <a href="https://doi.org/10.1029/2018GB006077" target="_blank">https://doi.org/10.1029/2018GB006077</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Feng et al.(2013)Feng, Plante, and
Six</label><mixed-citation>
      
Feng, W., Plante, A. F., and Six, J.: Improving Estimates of Maximal Organic
Carbon Stabilization by Fine Soil Particles, Biogeochemistry, 112, 81–93,
<a href="https://doi.org/10.1007/s10533-011-9679-7" target="_blank">https://doi.org/10.1007/s10533-011-9679-7</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Friedlingstein et al.(2025)Friedlingstein, O'Sullivan, Jones, Andrew, Hauck, Landschützer, Le Quéré, Li, Luijkx, Olsen, Peters, Peters, Pongratz, Schwingshackl, Sitch, Canadell, Ciais, Jackson, Alin, Arneth, Arora, Bates, Becker, Bellouin, Berghoff, Bittig, Bopp, Cadule, Campbell, Chamberlain, Chandra, Chevallier, Chini, Colligan, Decayeux, Djeutchouang, Dou, Duran Rojas, Enyo, Evans, Fay, Feely, Ford, Foster, Gasser, Gehlen, Gkritzalis, Grassi, Gregor, Gruber, Gürses, Harris, Hefner, Heinke, Hurtt, Iida, Ilyina, Jacobson, Jain, Jarníková, Jersild, Jiang, Jin, Kato, Keeling, Klein Goldewijk, Knauer, Korsbakken, Lan, Lauvset, Lefèvre, Liu, Liu, Ma, Maksyutov, Marland, Mayot, McGuire, Metzl, Monacci, Morgan, Nakaoka, Neill, Niwa, Nützel, Olivier, Ono, Palmer, Pierrot, Qin, Resplandy, Roobaert, Rosan, Rödenbeck, Schwinger, Smallman, Smith, Sospedra-Alfonso, Steinhoff, Sun, Sutton, Séférian, Takao, Tatebe, Tian, Tilbrook, Torres, Tourigny, Tsujino, Tubiello, Van Der Werf, Wanninkhof, Wang, Yang, Yang, Yu, Yuan, Yue, Zaehle, Zeng, and Zeng</label><mixed-citation>
      
Friedlingstein, P., O'Sullivan, M., Jones, M. W., Andrew, R. M., Hauck, J.,
Landschützer, P., Le Quéré, C., Li, H., Luijkx, I. T., Olsen, A.,
Peters, G. P., Peters, W., Pongratz, J., Schwingshackl, C., Sitch, S.,
Canadell, J. G., Ciais, P., Jackson, R. B., Alin, S. R., Arneth, A., Arora,
V., Bates, N. R., Becker, M., Bellouin, N., Berghoff, C. F., Bittig, H. C.,
Bopp, L., Cadule, P., Campbell, K., Chamberlain, M. A., Chandra, N., Chevallier, F., Chini, L.P., Colligan, T., Decayeux, J., Djeutchouang, L. M., Dou, X., Duran Rojas, C., Enyo, K., Evans, W., Fay, A. R., Feely, R. A., Ford, D. J., Foster, A., Gasser, T., Gehlen, M., Gkritzalis, T., Grassi, G., Gregor, L., Gruber, N., Gürses, Ö., Harris, I., Hefner, M., Heinke, J., Hurtt, G. C., Iida, Y., Ilyina, T., Jacobson, A. R., Jain, A. K., Jarníková, T., Jersild, A., Jiang, F., Jin, Z., Kato, E., Keeling, R. F., Klein Goldewijk, K., Knauer, J., Korsbakken, J. I., Lan, X., Lauvset, S. K., Lefèvre, N., Liu, Z., Liu, J., Ma, L., Maksyutov, S., Marland, G., Mayot, N., McGuire, P. C., Metzl, N., Monacci, N. M., Morgan, E. J., Nakaoka, S.-I., Neill, C., Niwa, Y., Nützel, T., Olivier, L., Ono, T., Palmer, P. I., Pierrot, D., Qin, Z., Resplandy, L., Roobaert, A., Rosan, T. M., Rödenbeck, C., Schwinger, J., Smallman, T. L., Smith, S. M., Sospedra-Alfonso, R., Steinhoff, T., Sun, Q., Sutton, A. J., Séférian, R., Takao, S., Tatebe, H., Tian, H., Tilbrook, B., Torres,
O., Tourigny, E., Tsujino, H., Tubiello, F., Van Der Werf, G., Wanninkhof,
R., Wang, X., Yang, D., Yang, X., Yu, Z., Yuan, W., Yue, X., Zaehle, S.,
Zeng, N., and Zeng, J.: Global Carbon Budget 2024, Earth Syst. Sci. Data, 17, 965–1039, <a href="https://doi.org/10.5194/essd-17-965-2025" target="_blank">https://doi.org/10.5194/essd-17-965-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Guo and Gifford(2002)</label><mixed-citation>
      
Guo, L. B. and Gifford, R. M.: Soil carbon stocks and land use change: a meta
analysis, Global Change Biol., 8, 345–360, <a href="https://doi.org/10.1046/j.1354-1013.2002.00486.x" target="_blank">https://doi.org/10.1046/j.1354-1013.2002.00486.x</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Gupta et al.(2021)Gupta, Lehmann, Bonetti, Papritz, and
Or</label><mixed-citation>
      
Gupta, S., Lehmann, P., Bonetti, S., Papritz, A., and Or, D.: Global
Prediction of Soil Saturated Hydraulic Conductivity Using
Random Forest in a Covariate‐Based GeoTransfer Function (CoGTF) Framework, J. Adv. Model. Earth Syst., 13, e2020MS002242, <a href="https://doi.org/10.1029/2020MS002242" target="_blank">https://doi.org/10.1029/2020MS002242</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Gupta et al.(2024)Gupta, Hasler, and Alewell</label><mixed-citation>
      
Gupta, S., Hasler, J. K., and Alewell, C.: Mining soil data of Switzerland:
New maps for soil texture, soil organic carbon, nitrogen, and phosphorus,
Geoderma Reg., 36, e00747, <a href="https://doi.org/10.1016/j.geodrs.2023.e00747" target="_blank">https://doi.org/10.1016/j.geodrs.2023.e00747</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Hashimoto et al.(2011)Hashimoto, Wattenbach, and
Smith</label><mixed-citation>
      
Hashimoto, S., Wattenbach, M., and Smith, P.: A new scheme for initializing
process-based ecosystem models by scaling soil carbon pools, Ecol. Model., 222, 3598–3602, <a href="https://doi.org/10.1016/j.ecolmodel.2011.08.011" target="_blank">https://doi.org/10.1016/j.ecolmodel.2011.08.011</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Ilstedt(2003)</label><mixed-citation>
      
Ilstedt, U.: Changes in soil chemical and microbial properties after a wildfire in a tropical rainforest in Sabah, Malaysia, Soil Biol. Biochem., 35, 1071–1078, <a href="https://doi.org/10.1016/S0038-0717(03)00152-4" target="_blank">https://doi.org/10.1016/S0038-0717(03)00152-4</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Kim et al.(2018)Kim, Kwon, and Han</label><mixed-citation>
      
Kim, K. B., Kwon, H.-H., and Han, D.: Exploration of warm-up period in
conceptual hydrological modelling, J. Hydrol., 556, 194–210,
<a href="https://doi.org/10.1016/j.jhydrol.2017.11.015" target="_blank">https://doi.org/10.1016/j.jhydrol.2017.11.015</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Krinner et al.(2005)Krinner, Viovy, De Noblet‐Ducoudr, Oge,
Polcher, Friedlingstein, Ciais, Sitch, and Prentice</label><mixed-citation>
      
Krinner, G., Viovy, N., De Noblet‐Ducoudré, N., Ogée, J., Polcher, J.,
Friedlingstein, P., Ciais, P., Sitch, S., and Prentice, I. C.: A dynamic
global vegetation model for studies of the coupled atmosphere‐biosphere
system, Global Biogeochem. Cy., 19, 2003GB002199, <a href="https://doi.org/10.1029/2003GB002199" target="_blank">https://doi.org/10.1029/2003GB002199</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Kunkel et al.(2022)Kunkel, Wells, and
Hancock</label><mixed-citation>
      
Kunkel, V., Wells, T., and Hancock, G.: Modelling soil organic carbon using
vegetation indices across large catchments in eastern Australia, Sci. Total Environ., 817, 152690, <a href="https://doi.org/10.1016/j.scitotenv.2021.152690" target="_blank">https://doi.org/10.1016/j.scitotenv.2021.152690</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Li et al.(2020)Li, Zhang, and Liu</label><mixed-citation>
      
Li, J., Zhang, D., and Liu, M.: Factors controlling the spatial distribution of soil organic carbon in Daxing’anling Mountain, Sci. Rep., 10, 12659, <a href="https://doi.org/10.1038/s41598-020-69590-y" target="_blank">https://doi.org/10.1038/s41598-020-69590-y</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Lian et al.(2025a)Lian, Fatichi, and
Bonetti</label><mixed-citation>
      
Lian, T., Fatichi, S., and Bonetti, S.: TeC_BG_2D Ecohydrological Model: V1.0.0, Zenodo [code], <a href="https://doi.org/10.5281/ZENODO.18084473" target="_blank">https://doi.org/10.5281/ZENODO.18084473</a>, 2025a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Lian et al.(2025b)Lian, Fatichi, Sthli, and
Bonetti</label><mixed-citation>
      
Lian, T., Fatichi, S., Stähli, M., and Bonetti, S.: Assessing Spatial
Patterns of Carbon and Nutrient Dynamics in Catchments of Complex
Topography, Water Resour. Res., 61, e2025WR040260, <a href="https://doi.org/10.1029/2025WR040260" target="_blank">https://doi.org/10.1029/2025WR040260</a>, 2025b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Lian et al.(2025c)Lian, Zhang, Paschalis, and
Bonetti</label><mixed-citation>
      
Lian, T., Zhang, Z., Paschalis, A., and Bonetti, S.: TeC_BG_2D_Spin_up: V2.0.0, Zenodo [code], <a href="https://doi.org/10.5281/ZENODO.17213868" target="_blank">https://doi.org/10.5281/ZENODO.17213868</a>, 2025c.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Loh(2002)</label><mixed-citation>
      
Loh, W.-Y.: Regression trees with unbiased variable selection and interaction
detection, Statistica sinica, JSTOR, 361–386, <a href="https://www.jstor.org/stable/24306967" target="_blank"/> (last access: 24 May 2026), 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Lugato et al.(2014)Lugato, Panagos, Bampa, Jones, and
Montanarella</label><mixed-citation>
      
Lugato, E., Panagos, P., Bampa, F., Jones, A., and Montanarella, L.: A new
baseline of organic carbon stock in European agricultural soils using a
modelling approach, Global Change Biol., 20, 313–326, <a href="https://doi.org/10.1111/gcb.12292" target="_blank">https://doi.org/10.1111/gcb.12292</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Lundberg and Lee(2017)</label><mixed-citation>
      
Lundberg, S. M. and Lee, S.-I.: A Unified Approach to Interpreting Model
Predictions, in: Advances in Neural Information Processing Systems, vol. 30, edited by: Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., Curran Associates, Inc.,
<a href="https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf" target="_blank"/> (last access: 24 May 2026), 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Mahdavi et al.(2018)Mahdavi, Salehi, Granger, Amani, Brisco, and
Huang</label><mixed-citation>
      
Mahdavi, S., Salehi, B., Granger, J., Amani, M., Brisco, B., and Huang, W.:
Remote sensing for wetland classification: a comprehensive review, GISci.
Remote Sens., 55, 623–658, <a href="https://doi.org/10.1080/15481603.2017.1419602" target="_blank">https://doi.org/10.1080/15481603.2017.1419602</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Manzoni et al.(2012)Manzoni, Schimel, and
Porporato</label><mixed-citation>
      
Manzoni, S., Schimel, J. P., and Porporato, A.: Responses of soil microbial
communities to water stress: results from a meta-analysis, Ecology, 93,
930–938, <a href="https://doi.org/10.1890/11-0026.1" target="_blank">https://doi.org/10.1890/11-0026.1</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Manzoni et al.(2014)Manzoni, Schaeffer, Katul, Porporato, and
Schimel</label><mixed-citation>
      
Manzoni, S., Schaeffer, S. M., Katul, G., Porporato, A., and Schimel, J. P.: A theoretical analysis of microbial eco-physiological and diffusion limitations to carbon cycling in drying soils, Soil Biol. Biochem., 73, 69–83, <a href="https://doi.org/10.1016/j.soilbio.2014.02.008" target="_blank">https://doi.org/10.1016/j.soilbio.2014.02.008</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Martin et al.(2014)Martin, Orton, Lacarce, Meersmans, Saby,
Paroissien, Jolivet, Boulonne, and Arrouays</label><mixed-citation>
      
Martin, M., Orton, T., Lacarce, E., Meersmans, J., Saby, N., Paroissien, J.,
Jolivet, C., Boulonne, L., and Arrouays, D.: Evaluation of modelling
approaches for predicting the spatial distribution of soil organic carbon
stocks at the national scale, Geoderma, 223–225, 97–107,
<a href="https://doi.org/10.1016/j.geoderma.2014.01.005" target="_blank">https://doi.org/10.1016/j.geoderma.2014.01.005</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Martín-López et al.(2023)Martn-Lpez, Verchot, Martius, and
Da Silva</label><mixed-citation>
      
Martín-López, J. M., Verchot, L. V., Martius, C., and Da Silva, M.: Modeling the Spatial Distribution of Soil Organic Carbon and Carbon Stocks in the Casanare Flooded Savannas of the Colombian Llanos, Wetlands, 43, 65, <a href="https://doi.org/10.1007/s13157-023-01705-3" target="_blank">https://doi.org/10.1007/s13157-023-01705-3</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Matus(2021)</label><mixed-citation>
      
Matus, F. J.: Fine Silt and Clay Content Is the Main Factor Defining Maximal
C and N Accumulations in Soils: A Meta-Analysis, Sci. Rep., 11, 6438, <a href="https://doi.org/10.1038/s41598-021-84821-6" target="_blank">https://doi.org/10.1038/s41598-021-84821-6</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Meersmans et al.(2008)Meersmans, De Ridder, Canters, De Baets, and
Van Molle</label><mixed-citation>
      
Meersmans, J., De Ridder, F., Canters, F., De Baets, S., and Van Molle, M.: A
multiple regression approach to assess the spatial distribution of Soil
Organic Carbon (SOC) at the regional scale (Flanders, Belgium),
Geoderma, 143, 1–13, <a href="https://doi.org/10.1016/j.geoderma.2007.08.025" target="_blank">https://doi.org/10.1016/j.geoderma.2007.08.025</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Moyano et al.(2013)Moyano, Manzoni, and
Chenu</label><mixed-citation>
      
Moyano, F. E., Manzoni, S., and Chenu, C.: Responses of soil heterotrophic
respiration to moisture availability: An exploration of processes and models, Soil Biol. Biochem., 59, 72–85, <a href="https://doi.org/10.1016/j.soilbio.2013.01.002" target="_blank">https://doi.org/10.1016/j.soilbio.2013.01.002</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Nave et al.(2011)Nave, Vance, Swanston, and Curtis</label><mixed-citation>
      
Nave, L. E., Vance, E. D., Swanston, C. W., and Curtis, P. S.: Fire effects on temperate forest soil C and N storage, Ecol. Appl., 21, 1189–1201, <a href="https://doi.org/10.1890/10-0660.1" target="_blank">https://doi.org/10.1890/10-0660.1</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Nemo et al.(2017)Nemo, Klumpp, Coleman, Dondini, Goulding,
Hastings, Jones, Leifeld, Osborne, Saunders, Scott, Teh, and
Smith</label><mixed-citation>
      
Nemo, Klumpp, K., Coleman, K., Dondini, M., Goulding, K., Hastings, A., Jones, M. B., Leifeld, J., Osborne, B., Saunders, M., Scott, T., Teh, Y. A., and Smith, P.: Soil Organic Carbon (SOC) Equilibrium and Model Initialisation Methods: an Application to the Rothamsted Carbon (RothC) Model, Environ. Model. Assess., 22, 215–229, <a href="https://doi.org/10.1007/s10666-016-9536-0" target="_blank">https://doi.org/10.1007/s10666-016-9536-0</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Ng et al.(2014)Ng, Bedford, and Miller</label><mixed-citation>
      
Ng, G. C., Bedford, D. R., and Miller, D. M.: A mechanistic modeling and data
assimilation framework for Mojave Desert ecohydrology, Water Resour. Res., 50, 4662–4685, <a href="https://doi.org/10.1002/2014WR015281" target="_blank">https://doi.org/10.1002/2014WR015281</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Nigrelli et al.(2018)Nigrelli, Fratianni, Zampollo, Turconi, and
Chiarle</label><mixed-citation>
      
Nigrelli, G., Fratianni, S., Zampollo, A., Turconi, L., and Chiarle, M.: The
altitudinal temperature lapse rates applied to high elevation rockfalls
studies in the Western European Alps, Theor. Appl. Climatol., 131, 1479–1491, <a href="https://doi.org/10.1007/s00704-017-2066-0" target="_blank">https://doi.org/10.1007/s00704-017-2066-0</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Noto et al.(2008)Noto, Ivanov, Bras, and Vivoni</label><mixed-citation>
      
Noto, L. V., Ivanov, V. Y., Bras, R. L., and Vivoni, E. R.: Effects of
initialization on response of a fully-distributed hydrologic model, J. Hydrol., 352, 107–125, <a href="https://doi.org/10.1016/j.jhydrol.2007.12.031" target="_blank">https://doi.org/10.1016/j.jhydrol.2007.12.031</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Parras-Alcántara et al.(2015)Parras-Alcntara, Lozano-Garca, and
Galn-Espejo</label><mixed-citation>
      
Parras-Alcántara, L., Lozano-García, B., and Galán-Espejo, A.: Soil organic carbon along an altitudinal gradient in the Despeñaperros Natural Park, southern Spain, Solid Earth, 6, 125–134,
<a href="https://doi.org/10.5194/se-6-125-2015" target="_blank">https://doi.org/10.5194/se-6-125-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Parvizi and Fatehi(2025)</label><mixed-citation>
      
Parvizi, Y. and Fatehi, S.: Geospatial digital mapping of soil organic carbon
using machine learning and geostatistical methods in different land uses,
Sci. Rep., 15, 4449, <a href="https://doi.org/10.1038/s41598-025-88062-9" target="_blank">https://doi.org/10.1038/s41598-025-88062-9</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Paschalis et al.(2022)Paschalis, Bonetti, Guo, and
Fatichi</label><mixed-citation>
      
Paschalis, A., Bonetti, S., Guo, Y., and Fatichi, S.: On the Uncertainty
Induced by Pedotransfer Functions in Terrestrial Biosphere Modeling, Water Resour. Res., 58, e2021WR031871, <a href="https://doi.org/10.1029/2021WR031871" target="_blank">https://doi.org/10.1029/2021WR031871</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Pawusch et al.(2025)Pawusch, Scheurer, Nowak, and
Maxwell</label><mixed-citation>
      
Pawusch, L., Scheurer, S., Nowak, W., and Maxwell, R. M.: HydroStartML:
A Combined Machine Learning and Physics-Based Approach to Reduce
Hydrological Model Spin-up Time, Adv. Water Resour., 206, 105124,
<a href="https://doi.org/10.1016/j.advwatres.2025.105124" target="_blank">https://doi.org/10.1016/j.advwatres.2025.105124</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Pazúr et al.(2021)Pazr, Huber, Weber, Ginzler, and
Price</label><mixed-citation>
      
Pazúr, R., Huber, N., Weber, D., Ginzler, C., and Price, B.: Cropland and
grassland map of Switzerland based on Sentinel-2 data, EnviDat [data set], <a href="https://doi.org/10.16904/ENVIDAT.205" target="_blank">https://doi.org/10.16904/ENVIDAT.205</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Peters et al.(2007)Peters, Baets, Verhoest, Samson, Degroeve, Becker, and Huybrechts</label><mixed-citation>
      
Peters, J., Baets, B. D., Verhoest, N. E., Samson, R., Degroeve, S., Becker,
P. D., and Huybrechts, W.: Random forests as a tool for ecohydrological
distribution modelling, Ecol. Model., 207, 304–318,
<a href="https://doi.org/10.1016/j.ecolmodel.2007.05.011" target="_blank">https://doi.org/10.1016/j.ecolmodel.2007.05.011</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Qiu et al.(2023)Qiu, Niu, Baas, and Phanikumar</label><mixed-citation>
      
Qiu, H., Niu, J., Baas, D. G., and Phanikumar, M. S.: An integrated
watershed-scale framework to model nitrogen transport and transformations,
Sci. Total Environ., 882, 163348, <a href="https://doi.org/10.1016/j.scitotenv.2023.163348" target="_blank">https://doi.org/10.1016/j.scitotenv.2023.163348</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Qu et al.(2018)Qu, Maksyutov, and Zhuang</label><mixed-citation>
      
Qu, Y., Maksyutov, S., and Zhuang, Q.: Technical Note: An efficient method for accelerating the spin-up process for process-based biogeochemistry
models, Biogeosciences, 15, 3967–3973, <a href="https://doi.org/10.5194/bg-15-3967-2018" target="_blank">https://doi.org/10.5194/bg-15-3967-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Randerson et al.(2009)Randerson, Hoffman, Thornton, Mahowald,
Lindsay, Lee, Nevison, Doney, Bonan, Stckli, Covey, Running, and
Fung</label><mixed-citation>
      
Randerson, J. T., Hoffman, F. M., Thornton, P. E., Mahowald, N. M., Lindsay,
K., Lee, Y., Nevison, C. D., Doney, S. C., Bonan, G., Stöckli, R., Covey,
C., Running, S. W., and Fung, I. Y.: Systematic assessment of terrestrial
biogeochemistry in coupled climate–carbon models, Global Change Biol., 15, 2462–2484, <a href="https://doi.org/10.1111/j.1365-2486.2009.01912.x" target="_blank">https://doi.org/10.1111/j.1365-2486.2009.01912.x</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Ruiz et al.(2015)Ruiz, Or, and Schymanski</label><mixed-citation>
      
Ruiz, S., Or, D., and Schymanski, S. J.: Soil Penetration by Earthworms and Plant Roots–Mechanical Energetics of Bioturbation of Compacted Soils, PLOS ONE, 10, e0128914, <a href="https://doi.org/10.1371/journal.pone.0128914" target="_blank">https://doi.org/10.1371/journal.pone.0128914</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Schwarz et al.(2024)Schwarz, Ghersheen, Belyazid, and
Manzoni</label><mixed-citation>
      
Schwarz, E., Ghersheen, S., Belyazid, S., and Manzoni, S.: When and why
microbial-explicit soil organic carbon models can be unstable, Biogeosciences, 21, 3441–3461, <a href="https://doi.org/10.5194/bg-21-3441-2024" target="_blank">https://doi.org/10.5194/bg-21-3441-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Seck et al.(2015)Seck, Welty, and Maxwell</label><mixed-citation>
      
Seck, A., Welty, C., and Maxwell, R. M.: Spin‐up behavior and effects of
initial conditions for an integrated hydrologic model, Water Resour. Res., 51, 2188–2210, <a href="https://doi.org/10.1002/2014WR016371" target="_blank">https://doi.org/10.1002/2014WR016371</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Senarath et al.(2000)Senarath, Ogden, Downer, and
Sharif</label><mixed-citation>
      
Senarath, S. U. S., Ogden, F. L., Downer, C. W., and Sharif, H. O.: On the
calibration and verification of two‐dimensional, distributed, Hortonian,
continuous watershed models, Water Resour. Res., 36, 1495–1510,
<a href="https://doi.org/10.1029/2000WR900039" target="_blank">https://doi.org/10.1029/2000WR900039</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Shi et al.(2013)Shi, Yang, Lawrence, Dickinson, and
Subin</label><mixed-citation>
      
Shi, M., Yang, Z.-L., Lawrence, D. M., Dickinson, R. E., and Subin, Z. M.:
Spin-up processes in the Community Land Model version 4 with explicit
carbon and nitrogen components, Ecol. Model., 263, 308–325,
<a href="https://doi.org/10.1016/j.ecolmodel.2013.04.008" target="_blank">https://doi.org/10.1016/j.ecolmodel.2013.04.008</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Shi et al.(2018)Shi, Eissenstat, He, and Davis</label><mixed-citation>
      
Shi, Y., Eissenstat, D. M., He, Y., and Davis, K. J.: Using a
spatially-distributed hydrologic biogeochemistry model with a nitrogen
transport module to study the spatial variation of carbon processes in a
Critical Zone Observatory, Ecol. Model., 380, 8–21,
<a href="https://doi.org/10.1016/j.ecolmodel.2018.04.007" target="_blank">https://doi.org/10.1016/j.ecolmodel.2018.04.007</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Simon et al.(2023)Simon, Glaum, and
Valdovinos</label><mixed-citation>
      
Simon, S. M., Glaum, P., and Valdovinos, F. S.: Interpreting random forest
analysis of ecological models to move from prediction to explanation, Sci. Rep., 13, 3881, <a href="https://doi.org/10.1038/s41598-023-30313-8" target="_blank">https://doi.org/10.1038/s41598-023-30313-8</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Six et al.(2002)Six, Conant, Paul, and
Paustian</label><mixed-citation>
      
Six, J., Conant, R. T., Paul, E. A., and Paustian, K.: Stabilization Mechanisms of Soil Organic Matter: Implications for C-saturation of Soils, Plant Soil, 241, 155–176, <a href="https://doi.org/10.1023/A:1016125726789" target="_blank">https://doi.org/10.1023/A:1016125726789</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Stähli(2018)</label><mixed-citation>
      
Stähli, M.: Longterm hydrological observatory alptal (central switzerland), EnviDat [data set], <a href="https://doi.org/10.16904/envidat.380" target="_blank">https://doi.org/10.16904/envidat.380</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Stähli et al.(2021)Sthli, Seibert, Kirchner, Von Freyberg, and
Van Meerveld</label><mixed-citation>
      
Stähli, M., Seibert, J., Kirchner, J. W., Von Freyberg, J., and Van Meerveld, I.: Hydrological trends and the evolution of catchment research in the Alptal valley, central Switzerland, Hydrol. Process., 35, e14113,
<a href="https://doi.org/10.1002/hyp.14113" target="_blank">https://doi.org/10.1002/hyp.14113</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Stone(1976)</label><mixed-citation>
      
Stone, M.: Cross-Validatory Choice and Assessment of Statistical
Predictions (With Discussion), J. Roy. Stat. Soc. Ser. B, 38, 102,
<a href="https://doi.org/10.1111/j.2517-6161.1976.tb01573.x" target="_blank">https://doi.org/10.1111/j.2517-6161.1976.tb01573.x</a>, 1976.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Sun et al.(2021)Sun, Goll, Chang, Ciais, Guenet, Helfenstein, Huang, Lauerwald, Maignan, Naipal, Wang, Yang, and
Zhang</label><mixed-citation>
      
Sun, Y., Goll, D. S., Chang, J., Ciais, P., Guenet, B., Helfenstein, J., Huang, Y., Lauerwald, R., Maignan, F., Naipal, V., Wang, Y., Yang, H., and Zhang, H.: Global Evaluation of the Nutrient-Enabled Version of the Land Surface Model ORCHIDEE-CNP v1.2 (R5986), Geosci. Model Dev., 14,
1987–2010, <a href="https://doi.org/10.5194/gmd-14-1987-2021" target="_blank">https://doi.org/10.5194/gmd-14-1987-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Sun et al.(2023)Sun, Goll, Huang, Ciais, Wang, Bastrikov, and
Wang</label><mixed-citation>
      
Sun, Y., Goll, D. S., Huang, Y., Ciais, P., Wang, Y.-P., Bastrikov, V., and
Wang, Y.: Machine Learning for Accelerating Process-based Computation of Land
Biogeochemical Cycles, Global Change Biol., 29, 3221–3234,
<a href="https://doi.org/10.1111/gcb.16623" target="_blank">https://doi.org/10.1111/gcb.16623</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Tague and Band(2004)</label><mixed-citation>
      
Tague, C. L. and Band, L. E.: RHESSys: Regional Hydro-Ecologic
Simulation System – An Object-Oriented Approach to Spatially
Distributed Modeling of Carbon, Water, and Nutrient Cycling,
Earth Interact., 8, 1–42, <a href="https://doi.org/10.1175/1087-3562(2004)8&lt;1:RRHSSO&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1087-3562(2004)8&lt;1:RRHSSO&gt;2.0.CO;2</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Teng et al.(2017)Teng, Zeng, Xiao, Huang, Zhou, Yan, and
Wang</label><mixed-citation>
      
Teng, M., Zeng, L., Xiao, W., Huang, Z., Zhou, Z., Yan, Z., and Wang, P.:
Spatial variability of soil organic carbon in Three Gorges Reservoir
area, China, Sci. Total Environ., 599-600, 1308–1316,
<a href="https://doi.org/10.1016/j.scitotenv.2017.05.085" target="_blank">https://doi.org/10.1016/j.scitotenv.2017.05.085</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Thornton et al.(2002)Thornton, Law, Gholz, Clark, Falge, Ellsworth,
Goldstein, Monson, Hollinger, Falk, Chen, and
Sparks</label><mixed-citation>
      
Thornton, P., Law, B., Gholz, H. L., Clark, K. L., Falge, E., Ellsworth, D.,
Goldstein, A., Monson, R., Hollinger, D., Falk, M., Chen, J., and Sparks, J.:
Modeling and measuring the effects of disturbance history and climate on
carbon and water budgets in evergreen needleleaf forests, Agr. Forest Meteorol., 113, 185–222, <a href="https://doi.org/10.1016/S0168-1923(02)00108-9" target="_blank">https://doi.org/10.1016/S0168-1923(02)00108-9</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Thornton and Rosenbloom(2005)</label><mixed-citation>
      
Thornton, P. E. and Rosenbloom, N. A.: Ecosystem model spin-up: Estimating
steady state conditions in a coupled terrestrial carbon and nitrogen cycle
model, Ecol. Model., 189, 25–48, <a href="https://doi.org/10.1016/j.ecolmodel.2005.04.008" target="_blank">https://doi.org/10.1016/j.ecolmodel.2005.04.008</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Tyralis et al.(2019)Tyralis, Papacharalampous, and
Langousis</label><mixed-citation>
      
Tyralis, H., Papacharalampous, G., and Langousis, A.: A Brief Review of
Random Forests for Water Scientists and Practitioners and Their
Recent History in Water Resources, Water, 11, 910,
<a href="https://doi.org/10.3390/w11050910" target="_blank">https://doi.org/10.3390/w11050910</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Verhoef et al.(2026)Verhoef, Zeng, Agam, Best, Bonetti, Boussetta,
Chaney, Cuntz, Edwards, Gupta, Heitman, Huang, Jarvis, Jiang, Kandala,
Kiałka, Lian, Mu, Nemes, Paulus, Raoult, Reddy, Romano, Sabot,
Vanderborght, Van Der Ploeg, Van Oevelen, Wang, Wang, Weber, Marthews, and
Weihermüller</label><mixed-citation>
      
Verhoef, A., Zeng, Y., Agam, N., Best, M., Bonetti, S., Boussetta, S., Chaney, N., Cuntz, M., Edwards, J., Gupta, S., Heitman, J., Huang, M., Jarvis, N., Jiang, S., Kandala, R., Kiałka, F., Lian, T., Mu, M., Nemes, A., Paulus, S. J., Raoult, N., Reddy, K. N., Romano, N., Sabot, M., Vanderborght, J., Van Der Ploeg, M., Van Oevelen, P., Wang, Y., Wang, Y., Weber, T. K. D., Marthews, T., and Weihermüller, L.: Rethinking Soils in Land Surface Models, B. Am. Meteorol. Soc., 107, E665–E674, <a href="https://doi.org/10.1175/BAMS-D-26-0003.1" target="_blank">https://doi.org/10.1175/BAMS-D-26-0003.1</a>, 2026.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Xia et al.(2012)Xia, Luo, Wang, Weng, and
Hararuk</label><mixed-citation>
      
Xia, J. Y., Luo, Y. Q., Wang, Y.-P., Weng, E. S., and Hararuk, O.: A
semi-analytical solution to accelerate spin-up of a coupled carbon and
nitrogen land model to steady state, Geosci. Model Dev., 5, 1259–1271, <a href="https://doi.org/10.5194/gmd-5-1259-2012" target="_blank">https://doi.org/10.5194/gmd-5-1259-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Yan et al.(2018)Yan, Bond-Lamberty, Todd-Brown, Bailey, Li, Liu, and Liu</label><mixed-citation>
      
Yan, Z., Bond-Lamberty, B., Todd-Brown, K. E., Bailey, V. L., Li, S., Liu, C., and Liu, C.: A moisture function of soil heterotrophic respiration that
incorporates microscale processes, Nat. Commun., 9, 2562, <a href="https://doi.org/10.1038/s41467-018-04971-6" target="_blank">https://doi.org/10.1038/s41467-018-04971-6</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Yang et al.(2025)Yang, Zhao, and
Biswas</label><mixed-citation>
      
Yang, Y., Zhao, R., and Biswas, A.: Delineating Dynamic Hydrological Response
Units to Improve Simulations of Extreme Runoff Events in Changing Environments, J. Hydrol., 656, 133000, <a href="https://doi.org/10.1016/j.jhydrol.2025.133000" target="_blank">https://doi.org/10.1016/j.jhydrol.2025.133000</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Yao et al.(2023)Yao, Dai, Gao, Yi, Wang, and
Hu</label><mixed-citation>
      
Yao, Y., Dai, Q., Gao, R., Yi, X., Wang, Y., and Hu, Z.: Characteristics and
factors influencing soil organic carbon composition by vegetation type in
spoil heaps, Front. Plant Sci., 14, 1240217, <a href="https://doi.org/10.3389/fpls.2023.1240217" target="_blank">https://doi.org/10.3389/fpls.2023.1240217</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Zeraatpisheh et al.(2022)Zeraatpisheh, Garosi, Reza Owliaie, Ayoubi, Taghizadeh-Mehrjardi, Scholten, and Xu</label><mixed-citation>
      
Zeraatpisheh, M., Garosi, Y., Reza Owliaie, H., Ayoubi, S., Taghizadeh-Mehrjardi, R., Scholten, T., and Xu, M.: Improving the spatial
prediction of soil organic carbon using environmental covariates selection:
A comparison of a group of environmental covariates, Catena, 208, 105723,
<a href="https://doi.org/10.1016/j.catena.2021.105723" target="_blank">https://doi.org/10.1016/j.catena.2021.105723</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Zhan et al.(2003)Zhan, Xue, and Collatz</label><mixed-citation>
      
Zhan, X., Xue, Y., and Collatz, G.: An analytical approach for estimating CO<sub>2</sub> and heat fluxes over the Amazonian region, Ecol. Model., 162,
97–117, <a href="https://doi.org/10.1016/S0304-3800(02)00405-2" target="_blank">https://doi.org/10.1016/S0304-3800(02)00405-2</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>Zhang et al.(2018)Zhang, Li, Zhu, Zheng, Liu, Tang, Su, Zhang, Ju,
and Deng</label><mixed-citation>
      
Zhang, W., Li, Y., Zhu, B., Zheng, X., Liu, C., Tang, J., Su, F., Zhang, C.,
Ju, X., and Deng, J.: A process-oriented hydro-biogeochemical model enabling
simulation of gaseous carbon and nitrogen emissions and hydrologic nitrogen
losses from a subtropical catchment, Sci. Total Environ., 616–617, 305–317, <a href="https://doi.org/10.1016/j.scitotenv.2017.09.261" target="_blank">https://doi.org/10.1016/j.scitotenv.2017.09.261</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>Zhang et al.(2023)Zhang, Chen, Wang, Wang, Lin, and
Ye</label><mixed-citation>
      
Zhang, Y., Chen, A., Wang, Z., Wang, X., Lin, Y., and Ye, C.: Soil organic
carbon accumulation along a chronosequence of vegetation colonization on
debris flow fans in Southwest China, Catena, 223, 106936,
<a href="https://doi.org/10.1016/j.catena.2023.106936" target="_blank">https://doi.org/10.1016/j.catena.2023.106936</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Zhu et al.(2019)Zhu, Feng, Qin, Cao, Zhang, Liu, Deo, Zhang, Li, and Li</label><mixed-citation>
      
Zhu, M., Feng, Q., Qin, Y., Cao, J., Zhang, M., Liu, W., Deo, R. C., Zhang, C., Li, R., and Li, B.: The role of topography in shaping the spatial patterns of soil organic carbon, Catena, 176, 296–305,
<a href="https://doi.org/10.1016/j.catena.2019.01.029" target="_blank">https://doi.org/10.1016/j.catena.2019.01.029</a>, 2019.

    </mixed-citation></ref-html>--></article>
