CLM5-FruitTree: a new sub-model for deciduous fruit trees in the Community Land Model (CLM5)

Abstract. The inclusion of perennial, woody crops in land surface models (LSMs) is crucial for addressing their role in carbon (C) sequestration, food production, and water requirements under climate change. To help quantify the biogeochemical and biogeophysical processes associated with these agroecosystems, we developed and tested a new submodel, CLM5-FruitTree, for deciduous fruit orchards within the framework of the Community Land Model version 5 (CLM5). The model development included (1) a new perennial crop phenology description, (2) an adapted C and nitrogen allocation scheme, considering both storage and photosynthetic growth of annual and perennial plant organs, (3) typical management practices associated with fruit orchards, and (4) the parameterization of an apple plant functional type. CLM5-FruitTree was tested using extensive field measurements from an apple orchard in South Tyrol, Italy. Growth and partitioning of biomass to the individual plant components were well represented by CLM5-FruitTree, and average yield was predicted within 2.3 % of the observed values despite low simulated inter-annual variability compared to observations. The simulated seasonal course of C, energy, and water fluxes was in good agreement with the eddy covariance (EC) measurements owing to the accurate representation of the prolonged growing season and typical leaf area development of the orchard. We found that gross primary production, net radiation, and latent heat flux were highly correlated (r>0.94) with EC measurements and showed little bias (< ± 5 %). Simulated respiration components, sensible heat, and soil heat flux were less consistent with observations. This was attributed to simplifications in the orchard structure and to the presence of additional management practices that are not yet represented in CLM5-FruitTree. 
Finally, the results suggested that the representation of microbial and autotrophic respiration and energy partitioning in complex, discontinuous canopies in CLM5 requires further attention. The new CLM5-FruitTree sub-model improved the representation of agricultural systems in CLM5 and can be used to study land surface processes in fruit orchards at the local, regional, or larger scale.

suggested perennial agriculture as a possible measure to mitigate climate change and enhance food security (Glover et al., 2010), and many studies have recently investigated this potential for various fruit orchards (Wu et al., 2012; Scandellari et al., 2016; Hammad et al., 2020; Yasin et al., 2021). The study of water and irrigation requirements in fruit orchards has become another field of intense research due to the need for a more resilient agriculture in the context of climate change and water supply shortages (Maestre-Valero et al., 2017; El Jaouhari et al., 2018; O'Connell and Scalisi, 2021; Segovia-Cardozo et al., 2022). To answer questions related to C sequestration, water requirements, and sustainable food production of fruit orchards, a better understanding of the related ecosystem processes is vital (Fader et al., 2015).
Models with a comprehensive description of the carbon, water, and energy fluxes, such as global land surface models (LSMs), are a powerful tool to explore complex ecosystems like the abovementioned fruit orchards. The use of LSMs was recently extended to not only model the processes at the land-atmosphere interface, but also to study the response of ecosystems and water resources to climate change (Prentice et al., 2015;Fisher and Koven, 2020;Blyth et al., 2021). To quantify these effects, LSMs need to represent a wide range of land use and vegetation types. However, most LSMs consider only perennials such as deciduous and coniferous trees as well as major annual crops such as wheat, soy, or maize (Lawrence et al., 2018). Recently, some LSMs additionally included bioenergy crops (Schaphoff et al., 2018), while others group crops into a few generic crop types (Noilhan and Mahfouf, 1996;Krinner et al., 2005;Balsamo et al., 2009). Despite their significance, perennial crops, such as fruit trees, are rarely considered in LSMs, and attempts to include them in global and regional modelling environments are scarce (Fader et al., 2015;Cheng et al., 2020). An example of such an attempt is the inclusion of agricultural trees (e.g. grapes, cotton, and apple trees) in the Lund-Potsdam-Jena managed Land (LPJmL) model to improve the representation of Mediterranean agroecosystems (Fader et al., 2015). Here, agricultural trees were modelled as small trees, and fruit harvest was determined as the product of a plant-specific harvest index and the net primary productivity (NPP). Other authors parameterized oil palm trees, a perennial evergreen crop, in the Community Land Model (CLM) version 4.5 (Fan et al., 2015). Palm trees were represented by a new phenology where large palm leaves with fruit bunches emerge successively, leaves are pruned regularly, and harvest occurs once a month. 
Recently, two perennial grasses for energy production were parameterized in the latest version of the model, CLM5 (Cheng et al., 2020). Parameters for bioenergy crops were tuned using sensitivity analysis and observations, while harvest was represented by removing around 70 % of the aboveground biomass.
While the abovementioned studies describe some common features of perennial plants, they do not, or only partially, represent the seasonal deciduous phenology of fruit trees or the explicit modelling of fruit growth. Furthermore, key aspects such as C reserve accumulation and mobilization in the following spring are generally not considered, possibly due to necessary simplifications or because the drivers of these processes are still not fully understood (Le Roux et al., 2001;Neumann, 2020). The absence of perennial crops in LSMs introduces a significant bias in the representation of biogeophysical and biogeochemical processes in agroecosystems where this type of cultivation is prevalent. As a result, the response to climate change in terms of C sequestration, water requirements, or food production cannot be assessed adequately in regions such as the Mediterranean, where perennial, woody crops are very common and play a vital role in food security and economy (Fader et al., 2015;Lobianco and Roberto, 2006).
Although deciduous fruit trees share certain characteristics with natural vegetation and annual crops in LSMs such as CLM5, several particularities in their growth dynamics and management practices still prevent a meaningful simulation using currently available representations of vegetation. In this study, we therefore provide CLM5 with the ability to model perennial fruit trees and the associated processes. For this purpose, we developed a new sub-model named CLM5-FruitTree within the existing model framework of CLM5. CLM5-FruitTree combines elements of the broadleaf deciduous tree subroutine such as growth and C turnover of woody components, with distinctive phenological stages and a harvestable organ similar to the annual crop subroutine. We first describe the model conceptualization including the new phenology, carbon and nitrogen (CN) allocation, and management options. We further demonstrate the applicability of CLM5-FruitTree by parameterizing a new apple plant functional type (PFT). Finally, we evaluate and discuss the model performance using extensive field data from an apple orchard in South Tyrol, Italy.

Vegetation characterizations in CLM5
The latest version of the Community Land Model, CLM5, simulates the exchange of water, energy, C, and nitrogen (N) between land and atmosphere as well as their storage and transport on the land surface and in the subsurface, driven by climate variability and modulated by soil and vegetation states and characteristics. The land surface in CLM5 is characterized by one of five land units, namely glacier, lake, urban, vegetated, and crop. These units are further divided to capture the variability in soil, vegetation, and management options (i.e. irrigated or non-irrigated). Compared to previous model versions, CLM5 features various improvements in the representation of land use and vegetation modelling, such as plant CN cycling, soil and plant hydrology, and crop modelling (Lawrence et al., 2018;Lombardozzi et al., 2020).
Many of the C and N cycle components of CLM5 were originally derived from the Biome BioGeochemical Cycles (Biome-BGC) model (Thornton et al., 2002). Here, vegetation is represented conceptually by three different plant C and N pools that are maintained separately for the individual plant organs (leaf, live/dead stem, fine root, live/dead coarse root, and grain). The storage pools represent C and N reserves, the transfer pools serve as intermediate pools to separate fluxes in and out of the storage pools, and the display pools represent the actual growth of a given organ (Fig. 1). C made available through photosynthesis is first used to support maintenance respiration of live organs based on organ N content, temperature, and a constant base rate as proposed by Atkin et al. (2015). Dead stem and dead coarse root components are assumed to consist of dead xylem cells, without metabolic function (no C cost for maintenance). The remaining C can then be allocated to the growth of new tissue considering associated growth respiration costs. Maintenance respiration, growth respiration, and the C cost of N uptake from the soil comprise the autotrophic respiration component ($R_\mathrm{a}$) in CLM5. Plant material reaching the end of its lifespan feeds into different litter pools from where it progressively decomposes to soil organic matter under C losses through heterotrophic respiration ($R_\mathrm{h}$).
Simulating fruit orchards requires a module for perennial deciduous crops, which CLM5 currently lacks. Such a module must account for the perennial deciduous nature of fruit trees, which is similar to the existing representation of broadleaf deciduous trees (BDTs) included in Biome-BGC but with differences in phenological triggers, vegetation structure, and C partitioning. In addition, it must represent growth and harvest of the fruits and typical management practices, of which some are already conceptualized in the prognostic Biogeochemistry Crop Module (BGC-crop), while others are not yet implemented. The algorithm for the seasonal phenology of BDT controls initial leaf development and senescence, which mark the beginning and end of a growing season, based on temperature and day length thresholds. Once a new growth period is initiated, C and corresponding N accumulated in the previous season flow out of the storage pools into the transfer pools, from where they are gradually sent to the display pools (Fig. 1). During the active growth period, C and corresponding N storage pools are replenished based on specified C : N ratios of each plant organ. During leaf senescence, C and N pools feed the litter or coarse woody debris pool except for live stem and live coarse roots, which are mostly retained as structural woody tissue (dead stem and dead coarse roots).
BGC-crop, adopted from the prognostic crop module of the Agro-Ecosystem Integrated Biosphere Simulator (Agro-IBIS), currently features eight different annual crop species with interactive crop management options (i.e. irrigation and fertilization). Another 23 currently inactive crop types can be defined but have not been provided with specific crop parameters (Lombardozzi et al., 2020). Crop phenology and CN allocation follow three phenological phases: (1) from planting to leaf emergence, (2) from leaf emergence to the start of grain fill, and (3) from grain fill to grain maturity and harvest, which are controlled by temperature and growing degree-day (GDD) thresholds. Different to natural vegetation, crops have a grain pool representing the harvestable organ but no structural woody tissue. Furthermore, all assimilates are directed to the displayed pools, while the storage pools remain unused. At harvest, C and N from the grain pool are transferred to a grain product pool, while a small amount is kept to reseed the crop in the following year. All remaining plant parts feed the litter cycle (Fig. 1). The reader is referred to Lombardozzi et al. (2020) and the technical documentation of CLM5 for a more detailed description of the BDT and crop representation (Lawrence et al., 2018).
From the above description of the existing vegetation modules, the following limitations for the application of CLM5 to deciduous fruit trees arise.
(1) The current BGC-crop algorithm does not allow the simulation of perennial and/or woody crops. (2) The BDT phenology algorithm, although describing some characteristics common to fruit trees, lacks the capability to simulate a harvestable organ, individual development of different plant parts, and the separation of growth from C reserves of the previous year and photosynthetic growth of the current season. (3) Typical management practices of fruit orchards such as transplanting of tree seedlings and pruning are currently not represented in CLM5. (4) There is no parameterized fruit tree PFT in the default parameter set of CLM5.

Model conceptualization and technical implementation
To resolve the model limitations discussed in Sect. 2.1, we developed a new sub-model, CLM5-FruitTree, to model the ecosystem processes and exchanges of energy and matter of deciduous fruit trees grown in commercial orchards, with a focus on the simulation of biomass growth and yield. More specifically, for the implementation of CLM5-FruitTree, we introduced a new phenology subroutine that describes the main phenological development of fruit trees and includes triggers for seasonal orchard management practices typical under organic or conventional production. In addition, the CN allocation module as well as corresponding modules (C and N state and flux updates) were modified to reproduce the growth dynamics of fruit trees and to model the fates of C and N in the orchard system. The sub-model development does not include any changes to the existing calculation schemes for radiative transfer or momentum, heat, and water fluxes to explicitly account for the discontinuous canopy structure of tree rows and vegetated or non-vegetated alleys in fruit orchards. In-row and between-row planting distances and alley vegetation are not defined directly. Instead, the orchard structure and the area covered by the canopy are accounted for through parameterization of the leaf and stem area indices, the planting density, maximum canopy height, and aerodynamic parameters, similar to the implementation of crops and forests in CLM5. CLM5-FruitTree combines characteristics of both BDT and annual crops to simulate a perennial woody crop with a harvestable organ, making use of the existing concepts of storage, transfer, and display vegetation pools described in Sect. 2.1 (Fig. 1). Similar to the existing BDT phenology algorithm in CLM5, the fruit tree algorithm uses a perennial deciduous phenology with standing woody biomass and annual leaf shedding.
During the active growth period, however, the phenology and CN allocation of vegetative and harvestable organs are described by distinct growth phases and are driven by a GDD summation similar to the crop phenology.
An orchard is established by transplanting small tree seedlings from a nursery, a typical planting method for this type of cultivation (Wheaton et al., 1990; Corelli-Grappadelli and Marini, 2008). Once planted, the orchard remains productive according to a user-defined lifespan which, depending on fruit tree type and production system, typically ranges between 10 and 30 years (Demestihas et al., 2017; Cerutti et al., 2014). The sub-model makes no specific assumptions about the rootstock, but the effect of different rootstocks in terms of tree height and rooting depth can be set by the user via the respective parameters, ztopmx and root_dmx (Table C1). In CLM5-FruitTree, both stored C and current photosynthesis contribute to the growth of the fruit tree, as leaf and shoot development at the beginning of a growing season utilizes carbohydrate reserves and nitrogenous compounds that were accumulated during the previous season (Tromp, 1983; Oliveira and Priestley, 1988; Loescher et al., 1990). Deciduous fruit trees are dormant in winter and resume growth in spring after meeting species- and cultivar-specific chilling and heat requirements (Anderson et al., 1986; Faust et al., 1997; Zavalloni et al., 2006), which is represented in CLM5-FruitTree using the chilling and forcing model proposed by Cesaraccio et al. (2004). Early in the season, the canopy develops rapidly until it reaches maturity, typically by midsummer, while leaf shedding occurs when temperatures drop in autumn (Kozlowski, 1992; Loescher et al., 1990; Lakso et al., 1999). Fruit trees usually start flowering 3-4 weeks after bud break. Flowering is not explicitly represented in CLM5-FruitTree; instead, the sub-model assumes that fruit growth begins at the end of flowering (Lakso et al., 1999).
The implementation of flowering to include effects of non-optimal pollination, frost during flowering, or hormonal processes affecting fruit set and development is outside the scope of this development and of minor importance for large-scale simulations and processes at ecosystem level that are typically the focus of LSMs such as CLM5. Consequently, CLM5-FruitTree does not produce information on fruit size or number but only on total yield, which we consider adequate for most applications of the sub-model. Fruit growth is described by two stages, cell division and cell expansion, which together form a sigmoid growth curve observed for many fruit tree species such as apple, pear, and orange (Corelli-Grappadelli and Lakso, 2004; Jackson, 2011).
In the following, the new developments to account for the distinct phenology, CN allocation, and management practices of a fruit orchard are described in more detail. Other biochemical and biophysical processes such as photosynthesis, water and litter cycles, and fixation and uptake of N were not modified except for minor adaptations to the re-translocation of N and respiration to enable the use of certain parts of these scripts for the fruit tree PFT. The technical implementation of some features of the new phenology routine (transplanting, pruning, harvest, and final rotation) was based on CLM-Palm, a previous model development for palm trees in CLM4.5 (Fan et al., 2015, and unpublished code). References where code elements were directly reused or modified based on CLM-Palm are made in the published source code of CLM5-FruitTree (Dombrowski, 2022). Along with the new sub-model, an apple PFT was parameterized using one of the existing but thus far inactive crop types in CLM5, types 35 and 36 (rainfed and irrigated citrus).

Phenology
A new orchard life cycle is initialized by transplanting seedlings at the beginning of the year during dormancy. Tree growth thereafter is described by six post-planting phenological stages, namely (1) bud break, (2) fruit growth, (3) fruit ripening, (4) canopy maturity, (5) fruit maturity and harvest, and (6) start of leaf senescence (Fig. 2).
Bud break is predicted by a sequential model that first accumulates chill days followed by anti-chill days based on a predefined temperature threshold and chilling requirement (Cesaraccio et al., 2004). More information on the sequential model and the calibration of model parameters can be found in Appendix A. Outside the dormant period, leaf and fruit development occurs in parallel but with a time shift as fruit growth typically starts 4-5 weeks after bud break, while canopy development continues until mid-season and leaf senescence does not occur until after the fruits are harvested (Wünsche and Lakso, 2000;Goldschmidt and Lakso, 2005) (Fig. 2).
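The sequential bookkeeping of such a chill and anti-chill model can be illustrated with a minimal Python sketch. It is not the CLM5-FruitTree code: the daily chill and anti-chill values are taken as given inputs (their computation from temperature is described in Cesaraccio et al., 2004, and Appendix A), and the function name, the sign convention (chill days negative, anti-chill days positive), and the rule that bud break occurs once the balance returns to zero are assumptions made for illustration.

```python
def find_bud_break(chill_days, anti_chill_days, chill_requirement):
    """Return the index of the first day with predicted bud break, or None.

    chill_days: daily chill values, assumed <= 0 (hypothetical input)
    anti_chill_days: daily anti-chill values, assumed >= 0 (hypothetical input)
    chill_requirement: negative total that must be reached to break rest
    """
    balance = 0.0
    rest_broken = False
    for day, (chill, anti_chill) in enumerate(zip(chill_days, anti_chill_days)):
        if not rest_broken:
            # Phase 1: accumulate chill until the requirement is met.
            balance += chill
            if balance <= chill_requirement:
                rest_broken = True
        else:
            # Phase 2: anti-chill days offset the accumulated chill.
            balance += anti_chill
            if balance >= 0.0:
                return day
    return None


if __name__ == "__main__":
    chill = [-10.0, -10.0, -10.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
    anti = [0.0, 0.0, 0.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0]
    print(find_bud_break(chill, anti, chill_requirement=-30.0))
```

The two phases are strictly sequential in this sketch: anti-chill values are ignored until the chilling requirement has been reached.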
The thermal thresholds to reach phases (2)-(5) are defined as accumulated GDDs since bud break and can be adjusted by the user via the parameter file, which applies to all parameters listed in Table C1 of the Appendix. GDDs are determined as the difference between the average daily air temperature and a base temperature of 4 °C with a maximum daily increment of 26 degree days (Eq. 1). Different to the existing deciduous phenology, leaf senescence is triggered not by day length but by the drop of the daily mean temperature below a critical temperature threshold, in this case the base temperature. This approach was selected since many fruit trees that belong to the Rosaceae family (e.g. apple, pear, plum, and cherry) are unaffected by photoperiod and are instead controlled by temperature (Heide and Prestrud, 2005). The last day of the leaf senescence period marks the beginning of dormancy. The new phenology subroutine of CLM5-FruitTree also controls C reserve dynamics, stem and root turnover, and final rotation, which involves removing and replanting trees when the maximum orchard lifespan is reached.

Figure 1. Schematic of the main phenology and C allocation features of the broadleaf deciduous tree and annual crop representations in CLM5 as well as the new CLM5-FruitTree sub-model. C pools within the dashed boxes are the individual components that make up the displayed C pool (the same components can be found for the other main plant pools: storage and transfer pools, respectively). Carbon pools and fluxes in green were reused for CLM5-FruitTree, while pools and fluxes in brown were modified or newly added.
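The degree-day accumulation and the temperature-based senescence trigger described in this section can be sketched in Python as follows. This is an illustrative sketch rather than the model code: the function names are hypothetical, and clipping negative daily increments to zero is an assumption (the text only states the upper limit of 26 degree days).

```python
T_FREEZE_K = 273.15    # freezing temperature of water (K)
BASE_TEMP_C = 4.0      # base temperature stated in the text (deg C)
MAX_INCREMENT = 26.0   # maximum daily GDD increment stated in the text


def daily_gdd_increment(t2m_kelvin):
    """Degree days gained in one day from the daily mean 2 m air temperature."""
    raw = t2m_kelvin - T_FREEZE_K - BASE_TEMP_C
    # Assumption: days colder than the base temperature contribute zero.
    return min(max(raw, 0.0), MAX_INCREMENT)


def accumulate_gdd(daily_mean_temps_kelvin):
    """Running GDD sum since bud break for a sequence of daily mean temperatures."""
    total = 0.0
    series = []
    for temp in daily_mean_temps_kelvin:
        total += daily_gdd_increment(temp)
        series.append(total)
    return series


def senescence_triggered(t2m_kelvin, t_crit_c=BASE_TEMP_C):
    """True when the daily mean temperature falls below the critical threshold."""
    return (t2m_kelvin - T_FREEZE_K) < t_crit_c


if __name__ == "__main__":
    temps = [287.15, 270.0, 320.0]
    print(accumulate_gdd(temps))
    print(senescence_triggered(276.0))
```

In the sub-model the senescence check only becomes relevant after harvest; the sketch leaves that sequencing to the caller.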

Carbon and nitrogen allocation
CN allocation to the growth of new tissue (display pools) and to storage pools follows the phenological stages described in Sect. 2.2.1 (Fig. 2). A coupled CN allocation subroutine determines the fate of newly assimilated C from photosynthesis. A user-defined initial biomass can be assigned to leaf and fine root transfer pools via the transplant parameter (Table C1), while additionally 10 % of this biomass is assigned to the dead stem pool to define an initial stem area index >0. Each pool is also assigned the corresponding amount of N. Adjustments to this parameter have only little effect on the biomass growth and yield of the adult trees as the trees reach their maximum canopy height and develop their full leaf area index (LAI) within the first couple of years after transplanting. Thereafter, the potential allocation to the different plant components is based on allocation coefficients and allometric relationships between dead and live parts of stem and coarse root. Throughout the growing period until harvest, 5 % of the newly assimilated C is allocated to the storage pools, as defined by the fcur parameter, except for fruits, where all allocated C is assigned to the displayed pool. For all other organs, the remaining C is also allocated to the displayed C pools. At bud break, a fraction of the C in the storage pool of all plant components, except fruits, is transferred to the actively growing C pools over a period that can be specified by the newly added parameter ndays_stor. This is based on the assumption that resources are partially mobilized to support growth of new tissue (Oliveira and Priestley, 1988;Loescher et al., 1990). Lacking more specific knowledge of the exact fraction, the default of 0.5 used by the seasonal deciduous phenology in CLM5 is adopted for fruit trees.
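The split between display and storage pools and the spring release of reserves can be sketched as follows. The sketch is illustrative only: the function and argument names are hypothetical, the 5 % storage share and the 0.5 mobilized fraction are taken from the text, and releasing the mobilized C at a constant daily rate over ndays_stor days is an assumption (the text only states that the transfer takes place over that period).

```python
def split_new_carbon(c_allocated, organ, storage_fraction=0.05):
    """Split C allocated to one organ into (display, storage) shares.

    storage_fraction is a hypothetical name for the 5 % storage share
    mentioned in the text; fruits receive all allocated C in the display pool.
    """
    if organ == "fruit":
        return c_allocated, 0.0
    to_storage = storage_fraction * c_allocated
    return c_allocated - to_storage, to_storage


def daily_storage_release(c_storage_at_bud_break, ndays_stor, mobilized_fraction=0.5):
    """Daily C flux out of a storage pool after bud break.

    Assumption: the mobilized fraction is released at a constant rate
    over ndays_stor days.
    """
    if ndays_stor <= 0:
        raise ValueError("ndays_stor must be positive")
    return mobilized_fraction * c_storage_at_bud_break / ndays_stor


if __name__ == "__main__":
    print(split_new_carbon(100.0, "leaf"))
    print(split_new_carbon(100.0, "fruit"))
    print(daily_storage_release(200.0, ndays_stor=20))
```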
Before the start of fruit growth, phase (1), newly assimilated C and corresponding N are partitioned between leaf, stem, and root pools. The allocation coefficients are calculated according to a set of equations that were adapted from the Agro-IBIS crop phenology algorithm used in CLM5-BGC-crop (Lawrence et al., 2018):

Figure 2. Fruit tree phenological stages of (1) bud break at the end of dormancy, (2) the start of fruit growth, (3) fruit ripening, (4) canopy maturity, (5) harvest, and (6) the start of leaf senescence. The lengths of phenological stages (2)-(5) are determined by their respective growing degree-day (GDD) thresholds starting from bud break ($\mathrm{GDD}_\mathrm{leaf} = 0$), while stage (6) is determined by a critical temperature threshold ($T_\mathrm{crit}$). Coloured bars correspond to the time any plant organ is present in the field throughout a year.
where $\mathrm{GDD}_{T_{2\,\mathrm{m}}}$ are the accumulated growing degree days for the 2 m air temperature with maximum increments of 26 degree days, $T_{2\,\mathrm{m}}$ is the simulated 2 m air temperature in Kelvin, $T_\mathrm{f}$ is the freezing temperature of water and equals 273.15 K, $\mathrm{GDD}_\mathrm{leaf}$, $\mathrm{GDD}_\mathrm{fruit}$, and $\mathrm{GDD}_\mathrm{lfmat}$ are thermal thresholds for bud break, start of fruit growth, and canopy maturity, respectively, $b$ is an exponential factor, $a^{i}_\mathrm{leaf}$, $a^{i}_\mathrm{froot}$, and $a^{f}_\mathrm{froot}$ are initial and final values for the allocation coefficients to leaf ($a_\mathrm{leaf}$) and fine root ($a_\mathrm{froot}$), respectively, and $a_\mathrm{repr}$ and $a_\mathrm{livestem}$ are the allocation coefficients to fruit and live stem, respectively. Once fruit growth begins in phase (2), an increasing proportion of the assimilated C and corresponding N is allocated to this organ, causing leaf allocation to decline and fruit allocation to plateau at a high value once canopy maturity is reached. Allocation to fine roots and stem continues to decline and then settles at a constant value until harvest:

where $\mathrm{GDD}_\mathrm{mat}$ is the thermal threshold for fruit maturity and harvest, while $d_L$ and $d^{\mathrm{stem}}_{\mathrm{alloc}}$ are stem allocation decline factors.
After harvest and until the start of dormancy, all of the newly assimilated C is sent to the storage pools following the notion that, late in the season, assimilates are used mostly to fill up reserves that can be mobilized to resume growth in the following spring (Le Roux et al., 2001). Fruit trees store C in the perennial woody parts of the tree, from where it is re-mobilized to support the growth of new shoots, leaves, and fine roots (Oliveira and Priestley, 1988;Millard, 1996;Le Roux et al., 2001). Since in CLM5 separate storage pools are assigned to each plant organ, the newly added aleafstor parameter (Table C1) defines the fraction of allocatable C going to the leaf storage pool, while the remainder is split equally between roots and stem.
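The post-harvest distribution of allocatable C among the organ storage pools can be sketched as below. The function name and the dictionary keys are hypothetical, and the sketch does not specify which root and stem storage pools (fine or coarse, live or dead) receive the remainder, since the text does not state this.

```python
def postharvest_storage_split(c_allocatable, aleafstor):
    """Distribute post-harvest C among storage pools.

    aleafstor: fraction going to the leaf storage pool (Table C1);
    the remainder is split equally between roots and stem.
    """
    if not 0.0 <= aleafstor <= 1.0:
        raise ValueError("aleafstor must lie between 0 and 1")
    to_leaf = aleafstor * c_allocatable
    remainder_each = (c_allocatable - to_leaf) / 2.0
    return {"leaf": to_leaf, "root": remainder_each, "stem": remainder_each}


if __name__ == "__main__":
    # The value 0.4 is an arbitrary illustration, not the calibrated parameter.
    print(postharvest_storage_split(10.0, aleafstor=0.4))
```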
Fruit trees, similar to other deciduous species, have been observed to translocate N out of senescent leaves to be reused by other tree organs (Millard, 1996;Malaguti et al., 2001;Millard et al., 2006). Therefore, CLM5-FruitTree adopts the same N re-translocation strategy as used in the BDT phenology, during which N is removed from falling litter based on leaf and litter C : N ratios and the available C to pay for the extraction of N from increasingly recalcitrant litter pools. Subsequently, it is transferred to the plant N pool, from where it can be used for the growth of new plant tissue (Lawrence et al., 2018).

Representation of management practices
Furthermore, management practices such as fertilization and stem pruning are represented in the new sub-model. Fertilization is performed on a yearly basis after the occurrence of bud break, as N fertilization in early spring is still the most common practice in fruit orchards even though autumn fertilization or multiple applications via fertigation are also in use to increase fertilizer N use efficiency and reduce N losses (Sanchez et al., 1995;Carranca et al., 2018). We use the existing fertilization scheme of the crop phenology that adds fertilizer directly to the soil mineral N pool. A user-defined fertilization rate or amount can be applied as synthetic fertilizer or manure, respectively, although there currently is no difference in model behaviour for these two fertilizer types (Lawrence et al., 2018).
Winter pruning is a common practice in fruit orchards and may be performed throughout the winter to control the shape and size of fruit trees and partially to manage crop load (Grechi et al., 2008). In many intensive orchard production systems, pruning residues are mulched into the soil, possibly increasing C sequestration (Montanaro et al., 2010;Aguilera et al., 2015). Alternatively, residues may also be exported and treated as waste (Benyei et al., 2018) or utilized for energy production (Kazimierski et al., 2021). In CLM5-FruitTree, pruning is performed as the tree enters dormancy by removing a user-defined fraction, prune_fr (Table C1), of the dead stem from both storage and displayed C pools. We remove C from the dead stem pool instead of the live stem pool since the former is the main wood pool in CLM5 that receives 85 % of the C allocated to total new wood. Furthermore, the implemented live wood turnover in CLM5 converts live stem to dead stem at the end of the growing season to account for differences in maintenance respiration and C : N ratios between these tissue types (Lawrence et al., 2018). Hence the live stem C pool remains rather small and stable over the years, so that applying pruning to this pool would have little effect on total tree biomass. The pruning implemented in CLM5-FruitTree affects only the tree biomass and height that are calculated based on this biomass pool, which in turn affects the calculation of turbulent fluxes of sensible and latent heat. However, this effect is small, and since turbulent fluxes are generally low in winter, the exact timing of pruning does not play a significant role in the magnitudes of these fluxes. During the first 3 years after planting, trees are not pruned to allow some initial stem biomass to grow. 
The sub-model treats pruning residues in one of two ways to account for their possible difference in fate: (1) residues are added to the wood harvest pool and exported from the field or (2) residues are added to the woody debris pool, thus feeding the litter cycle.
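The pruning logic described above can be sketched as a single bookkeeping step. This is a hedged illustration, not the published source code: the function name, the dictionary keys, and the interpretation of "the first 3 years" as orchard years 1 to 3 are assumptions.

```python
def prune_dead_stem(display_c, storage_c, prune_fr, orchard_year,
                    export_residues, establishment_years=3):
    """Remove a fraction of dead stem C at the start of dormancy.

    orchard_year: 1-based year since planting (assumed convention).
    export_residues: True sends residues to the wood harvest pool,
    False sends them to the woody debris pool (litter cycle).
    """
    if not 0.0 <= prune_fr <= 1.0:
        raise ValueError("prune_fr must lie between 0 and 1")
    if orchard_year <= establishment_years:
        prune_fr = 0.0  # no pruning while the initial stem biomass builds up
    removed = prune_fr * (display_c + storage_c)
    return {
        "dead_stem_display": display_c * (1.0 - prune_fr),
        "dead_stem_storage": storage_c * (1.0 - prune_fr),
        "wood_harvest": removed if export_residues else 0.0,
        "woody_debris": 0.0 if export_residues else removed,
    }


if __name__ == "__main__":
    print(prune_dead_stem(1000.0, 100.0, prune_fr=0.2, orchard_year=5,
                          export_residues=True))
```

Carbon is conserved in the sketch: whatever leaves the dead stem pools appears in exactly one of the two residue pools.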
When the orchard reaches the end of its lifespan, C of all biomass pools (storage, transfer, and display) is sent to either the litter pool for leaves and fine roots or the wood harvest pool for live and dead stem and coarse roots, while any remaining C in the fruit pool is harvested. The orchard can then be replanted in the following year. Lastly, the standard irrigation routine implemented in CLM5 can be used for irrigated orchards by selecting the irrigated crop PFT.

Site data
Extensive field measurements from an apple-growing region in the Adige River valley, South Tyrol, Italy (46°21′ N, 11°16′ E; 240 m a.s.l.) were used to parameterize and test the new CLM5-FruitTree sub-model along with the new apple PFT (Zanotelli et al., 2013, 2015, 2019). Measurements were obtained from an approximately 0.5 ha irrigated apple orchard planted in 2000 with the Fuji apple cultivar grafted on M9 dwarfing rootstock. The apple trees were planted at a row and tree spacing of 3 × 1 m (3333 trees per hectare). A 1.8 m wide grass strip, mowed three times a year, was grown between the tree rows. Other management practices included regular pruning, spring fertilization of 7.5 gN m −2 yr −1 , and tillage of the soil directly underneath the trees (Zanotelli et al., 2013). Stand-related data included general stand characteristics and phenology observations, LAI, C : N ratios, rooting distribution at three depth ranges (0-20, 20-40, and 40-60 cm), measurements of the biomass growth of different tree organs at a monthly or seasonal interval, and fruit harvest information (Table 1). Furthermore, daily soil respiration measurements from a control and a trenching plot (with (R s ) and without (R h ) root respiration, respectively) were performed in 2010. Additionally, an eddy covariance (EC) station provided measurements of the turbulent exchange of trace gases and energy at the studied apple orchard between 2013 and 2015. The quality check, gap filling, and flux partitioning of collected data followed the procedure outlined in Reichstein et al. (2005). The average closure of the energy balance was 60 %. To correct for the closure failure, the missing energy was assigned to the latent (LE) and sensible (H) heat fluxes based on the daily Bowen ratio (Zanotelli et al., 2019).
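A Bowen-ratio closure correction of the kind mentioned above can be sketched as follows. This is a minimal illustration, assuming that the available energy is R n − G with no additional storage terms and that the correction is applied to daily mean fluxes; the function and variable names are hypothetical and do not come from Zanotelli et al. (2019).

```python
def close_energy_balance(rn: float, g: float, h: float, le: float):
    """Distribute the energy balance residual over H and LE.

    The measured Bowen ratio (H / LE) is preserved by scaling both turbulent
    fluxes with the same factor so that H + LE equals Rn - G.
    All fluxes are daily means in W m-2. Returns (H_corrected, LE_corrected).
    """
    available = rn - g          # assumed available energy (no storage terms)
    turbulent = h + le          # measured turbulent fluxes
    if turbulent == 0.0:
        raise ValueError("H + LE is zero; the Bowen ratio is undefined")
    scale = available / turbulent
    return h * scale, le * scale


# Example with a closure of 60 %: H + LE = 54 W m-2 against Rn - G = 90 W m-2.
h_corr, le_corr = close_energy_balance(rn=100.0, g=10.0, h=18.0, le=36.0)
```

Scaling both fluxes by one factor is algebraically the same as assigning the residual in proportion to the Bowen ratio, and it avoids a division by LE alone.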
Measured or derived fluxes included net ecosystem CO 2 exchange (NEE), ecosystem respiration (R eco ), gross primary production (GPP), LE, H, and evapotranspiration (ET) at half-hourly intervals. Furthermore, soil heat flux (G) measured at 5 cm depth as well as soil moisture measurements up to a depth of 60 cm are available. Table 1 gives a summary of the available data and measurement periods. A complete description of the measurement procedures and instruments can be found in Zanotelli et al. (2013, 2015, 2019). Meteorological data, recorded partly at the EC tower and partly at the Laimburg meteorological station located 4 km from the site (46°23′ N, 11°17′ E; 224 m a.s.l.), were used at an hourly time step to force the model. Measured data included precipitation, solar radiation, net radiation (R n , only at the EC tower), air temperature, air pressure (only at Laimburg), relative humidity, and wind speed. Measurements of incoming longwave radiation (LW in ) were available for 2010 only. For the remaining years (2011-2019), LW in was calculated following Konzelmann et al. (1994) and Sedlar and Hock (2009) and used as forcing (Appendix B). This was necessary since the internally calculated LW in in CLM5 was unrealistically low compared to the available measurements of LW in , leading to a significant bias in R n .
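For orientation, a cloud-dependent LW in estimate in the style of Konzelmann et al. (1994) can be sketched as below. The coefficients are the ones commonly cited for the original Greenland calibration and are passed as arguments because the values actually used in Appendix B are not given in this section; how the cloud fraction is derived is likewise left open. The function name and signature are hypothetical.

```python
SIGMA = 5.670374419e-8  # Stefan-Boltzmann constant (W m-2 K-4)


def lw_in_cloudy(t_air_k: float, vapour_pressure_pa: float,
                 cloud_fraction: float,
                 eps_overcast: float = 0.952, p: float = 4.0) -> float:
    """Incoming longwave radiation (W m-2) from screen-level variables.

    Clear-sky emissivity: eps_cs = 0.23 + 0.484 * (e / T) ** (1 / 8),
    with e in Pa and T in K. Cloud cover n blends eps_cs with the
    overcast emissivity: eps = eps_cs * (1 - n**p) + eps_overcast * n**p.
    Default coefficients are assumed, not taken from Appendix B.
    """
    if not 0.0 <= cloud_fraction <= 1.0:
        raise ValueError("cloud_fraction must lie in [0, 1]")
    eps_clear = 0.23 + 0.484 * (vapour_pressure_pa / t_air_k) ** 0.125
    weight = cloud_fraction ** p
    emissivity = eps_clear * (1.0 - weight) + eps_overcast * weight
    return emissivity * SIGMA * t_air_k ** 4


clear = lw_in_cloudy(273.15, 600.0, cloud_fraction=0.0)
overcast = lw_in_cloudy(273.15, 600.0, cloud_fraction=1.0)
```

With these defaults, an overcast sky yields a higher LW in than a clear sky at the same temperature and humidity, which is the direction of the correction described in the text.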

Model set-up
The model was set up in point mode to simulate the apple orchard in the Adige valley using available sand, clay, and organic matter fractions. The model was spun up for 200 years, first in accelerated decomposition and then in normal decomposition mode, until all state variables, such as total ecosystem soil C and soil water, reached equilibrium (Lawrence et al., 2018). For the model spin-up, the CRUNCEPv7 atmospheric forcing data set from 1986 to 2016 was used (Viovy, 2018). The apple orchard was then initiated using the newly developed sub-model and the apple PFT by selecting the site-specific management (i.e. fertilization with 7.5 gN m −2 yr −1 , irrigation, mulching of pruning material). Simulations were performed for a period of 10 years to mirror the time from orchard establishment in 2000 up to the start of the measurements in 2010 using 10 years (2010-2019) of the available meteorological data from Laimburg meteorological station. Simulations were then extended for another 6 years from 2010 to 2015 for model parameterization and performance evaluation purposes.

Parameterization
Key parameters of the new sub-model as well as other PFT-specific parameters were parameterized using the first 3 years of simulations between 2010 and 2012. The lengths of phenological stages and associated parameters were determined based on field observations of bud break, full bloom, and harvest as well as non-cultivar-specific apple phenology descriptions that were found in the literature (Appendix C). The length of the period where growth is supported out of reserves (ndays_stor) was calibrated based on the biomass measurements and the estimate by Zanotelli et al. (2013) that apple trees use stored carbohydrates in the first 2 months after bud break. C allocation coefficients were calculated based on the monthly measurements in 2010 by dividing the biomass growth of the individual plant organs by the total biomass increment. Subsequently, model parameters associated with the CN allocation subroutine (Eqs. 2-7) were calibrated manually to match the coefficients obtained from the observations and the overall biomass partitioning on a yearly basis. Parameter values for C : N ratios of all plant organs and maximum LAI were based on field observations in 2010 and 2010-2012, respectively. The specific leaf area (slatop) was calculated by dividing monthly measurements of LAI by leaf biomass and taking the average of the obtained values. Structural and morphological parameters such as maximum tree height (ztopmx), planting density (nstem), the ratio of stem height to radius at breast height (taper), or rooting depth (root_dmx) were adjusted based on site-specific information (Zanotelli et al., 2013). Initial biomass at transplanting was assumed to be 5 gC m −2 , resulting in an initial tree height of around 100 cm and a stem diameter of 16 mm. As seedlings are dormant at the time of transplanting, their LAI is 0.
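The two observation-based calculations described above (allocation coefficients and slatop) amount to simple ratios. The Python sketch below illustrates them; the function names and all numbers in the example are hypothetical placeholders, not values from the field campaign.

```python
def allocation_coefficients(organ_growth: dict) -> dict:
    """Fraction of the total biomass increment that went to each organ.

    organ_growth maps an organ name to its biomass growth over one
    measurement interval (any consistent unit, e.g. gC m-2).
    """
    total = sum(organ_growth.values())
    if total <= 0.0:
        raise ValueError("total biomass increment must be positive")
    return {organ: growth / total for organ, growth in organ_growth.items()}


def specific_leaf_area(lai: list, leaf_c: list) -> float:
    """Average of the monthly ratios LAI / leaf biomass (m2 leaf per gC)."""
    if len(lai) != len(leaf_c) or not lai:
        raise ValueError("need equally long, non-empty monthly series")
    ratios = [area / carbon for area, carbon in zip(lai, leaf_c)]
    return sum(ratios) / len(ratios)


# Hypothetical monthly increments (gC m-2) and LAI / leaf C pairs.
coeffs = allocation_coefficients(
    {"fruit": 60.0, "leaf": 10.0, "wood": 20.0, "root": 10.0})
sla = specific_leaf_area(lai=[2.0, 3.0], leaf_c=[100.0, 120.0])
```

The coefficients sum to 1 by construction, which makes them directly comparable with the allocation fractions produced by the CN allocation subroutine.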
The CLM5 root distribution parameter (rootprof_beta), which sets the root ratios at different depths, was calibrated by least squares regression of the measured root ratios at 0-20, 20-40, and 40-60 cm depths and the calculated ratios. Optical parameters for leaf transmittance and reflectance in the visible and near infrared (IR) were set to average values reported for apple by Bastías and Corelli-Grappadelli (2012). Stem reflectance and transmittance were assumed to be similar to other woody species and therefore set to the values used for BDT in CLM5, similar to the assumptions made by Fan et al. (2015) for the palm tree development in CLM4.5. The ratio of momentum roughness length to canopy top height (z0mr) was set to the average value of the ranges reported for apple and citrus orchards to account for the differences in canopy structure compared to annual crops and forest (Tanny and Cohen, 2003; de la Fuente-Sáiz et al., 2017). No specific values could be found for the ratio of displacement to top of canopy height (displar), the leaf orientation index (xl), or the intercept to calculate the top of canopy maintenance respiration base rate (lmr_intercept_atkin). These values were assumed to be comparable to other deciduous trees and thus set to the values used for BDT in CLM5. Parameters related to C reserve dynamics (e.g. fcur) and photosynthesis (e.g. the slope of the relationship between leaf N per unit area and the maximum rate of carboxylation at 25 °C, s_vcad) were adjusted to match observed LAI and productivity data. All parameters with their values and references to the literature are summarized in Table C1 of the Appendix.
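The least squares calibration of rootprof_beta can be illustrated with a short Python sketch. It assumes the Jackson-type profile in which the cumulative root fraction down to depth d (cm) is 1 − beta^d, and it assumes that the modelled fractions are renormalized over the three sampled layers before comparison. A plain grid search stands in for whatever optimizer was actually used, and the "observed" ratios in the example are synthetic.

```python
def layer_fractions(beta: float, edges_cm: list) -> list:
    """Root fraction per layer, renormalized over the sampled depth range.

    Cumulative fraction above depth d is 1 - beta**d, so a layer between
    top and bottom holds beta**top - beta**bottom.
    """
    raw = [beta ** top - beta ** bottom
           for top, bottom in zip(edges_cm[:-1], edges_cm[1:])]
    total = sum(raw)
    return [value / total for value in raw]


def fit_beta(observed: list, edges_cm: list,
             lo: float = 0.80, hi: float = 0.999, steps: int = 1990):
    """Grid search for the beta that minimizes the sum of squared errors."""
    best_beta, best_sse = lo, float("inf")
    for k in range(steps + 1):
        beta = lo + (hi - lo) * k / steps
        modelled = layer_fractions(beta, edges_cm)
        sse = sum((m - o) ** 2 for m, o in zip(modelled, observed))
        if sse < best_sse:
            best_beta, best_sse = beta, sse
    return best_beta, best_sse


edges = [0.0, 20.0, 40.0, 60.0]              # layer boundaries (cm)
synthetic_obs = layer_fractions(0.95, edges)  # synthetic, not field data
beta_hat, sse = fit_beta(synthetic_obs, edges)
```

Because the three layers are equally thick, the normalized fractions change monotonically with beta, so the squared error has a single minimum within the search range.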

Sensitivity analysis
A simple one-at-a-time sensitivity analysis was performed to further tune model parameters and assess the influence of newly added parameters on the simulation results. As a complete sensitivity analysis of all PFT-related parameters would have exceeded the scope of this study, the analysis focused on key parameters of the new phenology and CN allocation subroutines. Other potentially influential parameters were selected based on previously performed sensitivity analyses by Göhler et al. (2013) for CLM3.5 and by Cheng et al. (2020) and Dagon et al. (2020) for CLM5, taking into account differences between previous and current model versions. Each selected parameter was perturbed by ±10 %, ±20 %, and ±30 % while the others were kept fixed at the values of the control simulation (after initial parameterization). The goal here was not to perform an in-depth analysis covering the full range of possible parameter values but rather to provide a first indication of influential parameters in the new sub-model, similar to the approach of Fan et al. (2015). As a measure of sensitivity, the parameter effect (PE) was calculated for selected output variables from the average of 3 years of simulations (2013-2015) of the control and the perturbed simulations, using a formula adjusted from Luo et al. (2020). In this formula, X is a simulated value of the control or a perturbation run, ΔX is the summed absolute difference between the control and the perturbation run across all perturbations, k is the parameter perturbation factor, i is the ith variable across n = 6 selected output variables including GPP, NEE, R a , LE, maximum LAI, and yield, and j is the jth parameter across m selected parameters. PE i,j is a number between 0 and 1 that represents the sensitivity of an output variable i to the parameter j, with 1 meaning high and 0 meaning low sensitivity.
The parameters selected for sensitivity analysis are indicated in Table C1 of the Appendix.
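The one-at-a-time perturbation scheme can be sketched in Python as follows. The sketch is illustrative only: the normalization, which divides the summed absolute difference of each parameter by the largest value found for that output variable, is an assumption chosen to give an index between 0 and 1 and is not the formula adjusted from Luo et al. (2020). The toy model and its parameter names are hypothetical stand-ins for a CLM5 run.

```python
FACTORS = (-0.3, -0.2, -0.1, 0.1, 0.2, 0.3)  # perturbation factors k


def parameter_effects(model, control: dict, factors=FACTORS) -> dict:
    """Return pe[output][parameter] in [0, 1] for a one-at-a-time analysis."""
    base = model(control)
    delta = {out: {name: 0.0 for name in control} for out in base}

    for name, value in control.items():
        for k in factors:
            perturbed = dict(control)            # all other parameters fixed
            perturbed[name] = value * (1.0 + k)
            result = model(perturbed)
            for out in base:
                # Summed absolute difference between perturbed and control run.
                delta[out][name] += abs(result[out] - base[out])

    pe = {}
    for out, per_param in delta.items():
        largest = max(per_param.values())
        # Assumed normalization: scale by the most influential parameter.
        pe[out] = {name: (d / largest if largest > 0.0 else 0.0)
                   for name, d in per_param.items()}
    return pe


def toy_model(p: dict) -> dict:
    """Hypothetical stand-in for a model run returning two outputs."""
    return {"flux": 2.0 * p["a"] + 0.5 * p["b"], "lai": p["b"] ** 2}


pe = parameter_effects(toy_model, {"a": 1.0, "b": 1.0})
```

As in the analysis described above, each run changes exactly one parameter, so interactions between parameters are not captured by this index.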

Model performance evaluation
Modelling results are compared to observed biomass, yield, and LAI data as well as ecosystem fluxes retrieved from the EC measurements. Statistical indices for model performance evaluation include the Pearson coefficient of correlation (r), the root mean square error (RMSE), and the percent bias error (%bias). In their definitions, i is the time step, n is the total number of time steps, X i and X o i are simulated and observed values at each time step, respectively, µ and µ o are simulated and observed mean values, respectively, and σ and σ o are simulated and observed standard deviations.
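The three indices can be written out as a short Python sketch using their standard textbook definitions. Two choices are assumptions rather than statements from the text: standard deviations are computed with a divisor of n, and %bias is taken as the summed difference (simulated minus observed) relative to the summed observations, so that positive values indicate overestimation by the model.

```python
import math


def pearson_r(sim: list, obs: list) -> float:
    """Pearson correlation between simulated and observed series."""
    n = len(sim)
    mu, mu_o = sum(sim) / n, sum(obs) / n
    cov = sum((s - mu) * (o - mu_o) for s, o in zip(sim, obs)) / n
    sd = math.sqrt(sum((s - mu) ** 2 for s in sim) / n)
    sd_o = math.sqrt(sum((o - mu_o) ** 2 for o in obs) / n)
    return cov / (sd * sd_o)


def rmse(sim: list, obs: list) -> float:
    """Root mean square error, in the units of the input series."""
    n = len(sim)
    return math.sqrt(sum((s - o) ** 2 for s, o in zip(sim, obs)) / n)


def percent_bias(sim: list, obs: list) -> float:
    """Percent bias; positive when the simulation overestimates (assumed sign)."""
    return 100.0 * sum(s - o for s, o in zip(sim, obs)) / sum(obs)


sim, obs = [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]
stats = pearson_r(sim, obs), rmse(sim, obs), percent_bias(sim, obs)
```

The example shows why the indices are reported together: a simulation that is perfectly correlated with the observations can still carry a large RMSE and bias.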
3 Results and discussion

Sensitivity analysis
A total of 34 parameters were initially considered for the sensitivity analysis, of which the 13 most influential parameters (PE > 0.1 for at least one of the selected output variables) are shown in Fig. 3. GPP, NEE, R a , and yield have similar sensitivity patterns and are most sensitive to the leaf C : N ratio (leafcn) and the relationship between leaf N and the maximum rate of carboxylation at 25 °C (s_vcad). Together with the specific leaf area (slatop) and other constants, they control the maximum photosynthetic capacity in the photosynthesis calculation and thus largely influence total C assimilation. As expected, LAI is most influenced by parameters that control the CN allocation to leaves such as the initial leaf allocation coefficient (fleafi), the GDDs needed to reach canopy maturity (lfmat), the maximum LAI (laimx), photosynthetic parameters, and, to a smaller extent, the fraction of C allocated to the leaf storage pool to refill C reserves (aleafstor). The first three parameters influence leaf biomass and thus show a considerable effect on GPP, NEE, R a , and yield. The same output variables are affected in a similar fashion by the GDDs needed until fruit harvest (hybgdd), which control the amount of C allocated to fruits. LE is influenced largely by the parameter controlling stomatal conductance (medlynslope) and the photosynthetic parameters (leafcn, s_vcad).
Overall, photosynthetic parameters play a key role in determining the magnitude of the studied output variables, with an average PE value close to 0.7 across all six variables. Phenological parameters (top seven parameters in Fig. 3) are generally less influential for the same output variables, with average PE values up to 0.43. These findings are largely consistent with earlier studies of parameter sensitivity (Göhler et al., 2013; Cheng et al., 2020; Dagon et al., 2020; Luo et al., 2020). In contrast to Luo et al. (2020), we did not find a strong effect of the root distribution parameter (rootprof_beta) on LE, which can be attributed to the low water stress due to the irrigation management of the studied orchard.
While the one-at-a-time sensitivity analysis provides some insight into model sensitivity, the ranking of influential parameters is strongly influenced by the choice of parameters and output variables, the parameter perturbation strategy (i.e. percent change, linear sampling), and the index chosen as the sensitivity measure. Parameter tuning based on this analysis is further complicated since this approach does not consider parameter covariation, which is particularly strong for plant parameters that influence photosynthesis (Göhler et al., 2013). Selecting parameter values based on the individual best simulation hence does not necessarily yield the best overall result (Luo et al., 2020). We therefore decided to first adjust s_vcad to best match the observed average GPP. Subsequently, we further adjusted fleafi and hybgdd to improve the simulated biomass components and medlynslope to improve the LE flux.

Figure 3. Parameter effect (PE) as a measure of sensitivity of selected output variables to the most influential model parameters. Output variables include gross primary production (GPP), net ecosystem exchange (NEE), autotrophic respiration (R a ), maximum leaf area index (LAI max ), latent heat flux (LE), and yield. Parameters are post-harvest leaf allocation coefficient to storage (aleafstor), initial leaf allocation coefficient (fleafi), GDD to canopy maturity (lfmat), root allocation coefficients at the start of fruit development (arootf) and until harvest (arootf2), GDD needed until harvest (hybgdd), maximum LAI (laimx), fraction of allocation that goes to currently displayed growth (fcur), C : N ratios of fruits (graincn) and leaves (leafcn), specific leaf area at top of canopy (slatop), slope of the relationship between leaf N per unit area and the maximum rate of carboxylation at 25 °C (s_vcad), and the Medlyn slope of the conductance-photosynthesis relationship (medlynslope). For more details on the parameters, see Appendix C.

Modelling results
In the following, we present the modelling results according to the initial parameterization and the updated parameter values from the sensitivity analysis. Daily simulations or yearly sums are compared to observed biomass, yield, and LAI data as well as ecosystem fluxes retrieved from the EC measurements and soil moisture measurements aggregated to daily mean values.

Biomass growth and yield
The patterns in seasonal biomass allocation simulated by CLM5-FruitTree show good agreement with the monthly observations from 2010 (Fig. 4a). The beginning and end of the growing season are well captured. After bud break at the beginning of March, biomass is allocated to the vegetative organs of leaves, fine roots, and woody organs, and growth is supported by C and N reserves until the start of fruit growth in early May (50 d according to the ndays_stor parameter). In the following months, fruit biomass grows rapidly until harvest takes place in mid-October, following the typical sigmoidal growth curve that is well captured by the new phenology and CN allocation. Simulated leaf biomass peaks in mid-June and remains constant thereafter, with leaf senescence starting later in October when temperatures drop below 4 °C. Pruning is performed when the tree enters dormancy by removing 85 % of the stem biomass assimilated over the season according to the observed pruning amounts in the studied apple orchard (Zanotelli et al., 2013, 2015). From 2010 to 2012, the modelled percentage of biomass allocation to plant organs was generally in agreement with the observations (Zanotelli et al., 2015), with differences ranging between 1 % and 5 % for fruits, leaves, aboveground wood, and roots (Fig. 4b). Penzel et al. (2020) stated that different studies reported biomass allocation to fruits ranging from 50 % to 85 % depending on apple cultivar, suggesting considerable variability in allocation coefficients. This emphasizes the benefit of a cultivar-specific calibration in order to obtain realistic modelling results. On the other hand, it suggests that a more general parameterization that reflects an average apple tree may be necessary to apply CLM5-FruitTree at larger scales and across multiple cultivars.
The timings of initial leaf development in spring and leaf senescence in late autumn are sufficiently well captured by the implemented bud break prediction algorithm and the simple temperature threshold for leaf abscission, respectively (Fig. 5). Observed maximum LAI varied between 2.8 and 3.3 m 2 m −2 and occurred during the first half of July. The simulations reached similar values in 2010 and 2012, matching the observations, while the simulated LAI in 2011 underestimated the measurements due to a smaller C transfer from storage and lower solar radiation early in the growing season. The discrepancy between the low simulated LAI and the high observed LAI in 2011 could have been further exacerbated by a lighter pruning performed in the previous winter compared to other years (Zanotelli et al., 2013). Such practice is sometimes performed in an attempt to counteract the strong alternate bearing behaviour of the Fuji variety, which causes a substantial drop in yield following a high yielding year (Belleggia et al., 2010; Atay et al., 2013; Pasa et al., 2021). As a consequence of the light pruning, a larger number of vegetative and flower buds remained on the tree, leading to more growth and possibly contributing to the larger discrepancy between relatively high observed LAI and relatively low simulated LAI. The adjusted pruning is however based on a somewhat subjective assessment of the farmer, and information about the exact amount is hardly available. Thus, CLM5-FruitTree currently adopts a simplified pruning practice based on the removal of a fixed portion of the seasonal stem growth, which manages tree size and total woody biomass without affecting LAI.
Measured LAI showed a slow decline soon after maximum LAI was reached, while simulated values in contrast are assumed to remain constant until leaf senescence is initiated. The observed early decline may be an artefact of the sampling strategy used to determine LAI that extrapolated individual leaf area measurements to the whole tree, assuming a constant leaf distribution within the tree (Zanotelli et al., 2013). Another reason could be some premature leaf fall in the summer at the expense of the inner shadowed leaves, as observed during field sampling. Other studies suggest that the LAI of fruit trees generally stays constant until a rapid decline with the start of senescence (Lakso et al., 1999;Pallas et al., 2016), supporting the simulated LAI dynamic.
Simulated yield averaged 70 t ha −1 between 2010 and 2015 and was within 2.3 % of the observed average yield. While simulated yield varied between 61 and 76 t ha −1 , the observations showed a greater inter-annual variability (IAV), as exemplified in the case of the years 2012 (low yield of 51 t ha −1 ) and 2015 (high yield of 101 t ha −1 ) (Fig. 6). Low IAV of yield has also been observed in previous crop simulations with CLM5 for winter wheat (Boas et al., 2021), suggesting that certain drivers of IAV such as extreme environmental conditions (e.g. frost, heat, and hail) or plant pests and diseases and the resulting plant physiological responses (e.g. stress-induced leaf shedding or failure to flower) (Charrier et al., 2021) are missing or not represented with sufficient detail in CLM5. In the case of apple trees, yield is also tightly linked to the number of flowers and early fruit growth, which in turn depends on a complex interaction of the environmental conditions during winter dormancy and the start of the new growing season (Chmielewski et al., 2012; Corelli-Grappadelli and Lakso, 2004). Additionally, C reserves accumulated in the previous year (Greer et al., 2002) and crop load management play an important role in determining the final harvest (Penzel et al., 2020). The latter includes pruning or fruit thinning to ensure optimal fruit growth and to reduce the effect of alternate bearing. The low observed yield in 2012 may be a result of such behaviour. This phenomenon and the processes involved are not universal, so that different fruit trees may be bearing regularly, irregularly, or biannually (Hoblyn et al., 1937; Monselise and Goldschmidt, 1982). As such, alternate bearing and its treatment through pruning or fruit thinning cannot easily be generalized and are thus not currently implemented in CLM5-FruitTree, which could have further reduced simulated IAV.
Storage growth is considered in CLM5-FruitTree and exhibited an impact on the final yield of the following season, as shown by the sensitivity analysis of the aleafstor and fcur parameters (Fig. 3). However, its effect on fruit growth in CLM5-FruitTree is indirect since it supports leaf development in the early growth stage but does not directly contribute to fruit growth. Identifying the driving forces of reserve deposition and mobilization and their quantification remains an unsolved issue, and there is still no consistent formulation of this process in tree modelling (Le Roux et al., 2001; Allen et al., 2005). Predicting final yield in fruit orchards is further complicated by the fact that harvest is usually based on certain fruit quality traits such as firmness or soluble solids and can occur successively as fruits may not mature at the same time (Corelli-Grappadelli and Lakso, 2004; Musacchi and Serra, 2018). Within this context, the proposed simplifications of the C reserve dynamics and fruit harvest are likely contributing to the difference in observed and simulated yields. Considering the many specific challenges in modelling this apple cultivar, we believe that the yield predictions are satisfactory in the context of the sub-model development.

For the conversion of simulated fruit biomass in gram carbon per square metre to tons per hectare (Fig. 6), fruit C content was assumed to be 42 % of total dry weight, harvest efficiency 95 %, and fruit water content 83 % according to Zanotelli et al. (2013).
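The unit conversion from simulated fruit C to fresh yield can be made explicit with a short Python sketch that uses the three factors quoted above (C content 42 % of dry weight, harvest efficiency 95 %, water content 83 %). The function name and the order in which the factors are applied are assumptions for illustration.

```python
def fruit_c_to_fresh_yield(fruit_c_g_m2: float,
                           c_content: float = 0.42,
                           harvest_efficiency: float = 0.95,
                           water_content: float = 0.83) -> float:
    """Convert fruit C (gC m-2) to harvested fresh yield (t ha-1)."""
    dry_weight = fruit_c_g_m2 / c_content             # g dry matter m-2
    fresh_weight = dry_weight / (1.0 - water_content)  # g fresh matter m-2
    harvested = fresh_weight * harvest_efficiency      # g fresh matter m-2
    return harvested * 0.01                            # 1 g m-2 = 0.01 t ha-1


yield_t_ha = fruit_c_to_fresh_yield(100.0)
```

With these factors, a fruit pool of 100 gC m −2 at harvest corresponds to roughly 13.3 t ha −1 of fresh fruit, so a yield of around 70 t ha −1 implies a fruit C pool on the order of 500 gC m −2 .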

Carbon fluxes
As shown in Fig. 7, CLM5-FruitTree was able to capture the overall patterns of GPP, NEE, and R eco , particularly during the transition between dormancy periods and growing seasons (April to November). Simulated C fluxes are highly correlated with observations (r ≥ 0.84), while the RMSE ranges between 1.12 and 1.53 gC m −2 d −1 . Observed and simulated peak C fixation occurred in mid-June (Fig. 7a), corresponding to the maximum (negative) NEE (Fig. 7c) and maximum LAI (Fig. 5). Simulated NEE becomes negative (net carbon sink) around April and returns to positive (net carbon source) around November, in agreement with the observed dynamic (Fig. 7c). Observed yearly sums of GPP (NEE) were 1.60 (−0.49), 1.43 (−0.48), and 1.65 (−0.76) kgC m −2 yr −1 for 2013, 2014, and 2015, respectively. Simulated yearly sums of GPP (NEE) were 1.58 (−0.53), 1.56 (−0.51), and 1.53 (−0.57) kgC m −2 yr −1 for the same years, showing a negligible positive bias of on average 0.17 % for GPP (Fig. 7b) and a small underestimation (less negative) of on average 3.8 % for NEE (Fig. 7d). Simulated and observed R eco (Fig. 7e) generally increased until July because of the increase in air temperature and respiratory costs of the developing canopy and declined thereafter as air temperature started to drop. Simulations of R eco tend to slightly underestimate observations between April and late August and to overestimate observations during winter, although discrepancies are relatively small. Observed yearly sums of R eco were 1.13 (2013), 0.98 (2014), and 0.94 (2015) kgC m −2 yr −1 , while simulated values were 1.08, 1.08, and 0.99 kgC m −2 yr −1 , respectively. CLM5-FruitTree overestimated yearly R eco by on average 3.3 %, explaining most of the difference in observed and simulated NEE in 2013, while differences in 2014 and 2015 are due to a combination of small biases in both GPP and R eco .
Measured R eco showed irregular fluctuations in the early part of the growing season 2013 and mid to late season 2014 and 2015 that are not reproduced well by the model. These fluctuations mostly correspond to the observed temperature dynamics (not shown) as a result of the applied gap filling that is based on an air (or soil) temperature-R eco relationship (Reichstein et al., 2005). Such discrepancies between observed and simulated dynamics could be further explained by the occurrence of field management practices such as mowing of the grassed alleys or soil tillage under the tree rows, which are currently not represented in CLM5-FruitTree. Such practices could have led to a temporary rise in soil respiration (R s ) due to increased heterotrophic respiration (R h ) as discussed in Zanotelli et al. (2013). Indeed, soil tillage experiments performed in an apple orchard located on the Loess Plateau in Shaanxi Province in China were found to increase R s by 14 %-57 % depending on the tillage method (Hou et al., 2021). Zanotelli et al. (2013) measured a total R s of 801 ± 95 gC m −2 in 2010, contributing around 90 % to R eco , based on soil chamber measurements within the orchard (total soil respiration). The comparison to parallel measurements in a trenched plot produced a high ratio R h /R s of 0.77 for the apple orchard. In contrast, simulated R s was 510 gC m −2 , contributing merely 45 % to R eco for the same year, with a ratio R h /R s of 0.87. Simulated R eco was instead dominated by autotrophic respiration (R a ) due to high C costs for maintenance, mainly of leaf biomass (data not shown). Other studies found that R s contributed 56 %-67 % to R eco in irrigated citrus orchards of different ages that share common management practices (i.e. use of heavy machinery, irrigation, fertilization, tree pruning, and mulching) as well as structural similarities (e.g. planting in tree rows) with the studied apple orchard. 
Both aspects have a strong influence on soil respiration components in orchards (Martin-Gorriz et al., 2020). In forest ecosystems, where the magnitude of ecosystem fluxes was found to be somewhat comparable to orchards, R s contributed >60 % to R eco (Lasslop et al., 2012;Zanotelli et al., 2013).
In addition to the missing representation of certain management practices, CLM5-FruitTree currently does not account for an active ground cover in the orchard, which has been shown to enhance R s in an Italian olive orchard through increased fine root and microbial biomass (Turrini et al., 2017). Furthermore, the simplified representation of microbial activity in CLM5, through fixed respiration fractions for litter and soil organic matter pools, may limit the ability of CLM5-FruitTree to accurately represent soil respiration processes. Neglecting mycorrhizal respiration may also lead to an inadequate representation of R eco of the orchard, as measurements suggested a substantial contribution of 11 ± 6 % to total R s in an apple orchard (Tomè et al., 2016). Lastly, biases in simulated soil temperature, soil moisture content, and fine root density could further contribute to explaining the above-discussed differences, as these factors have a major effect on R s in apple orchards (Ceccon et al., 2011).
In contrast to the underestimation of R s in the model, the simulated R a of 693 gC m −2 was almost twice the measured value of 372 ± 195 gC m −2 . In our simulations, maintenance respiration comprised the main part of R a , with on average 78 %. The calculation of maintenance respiration in CLM5 (see Sect. 2.1) does not account for a lower or varying maintenance cost observed in mature apple orchard canopies compared to annual crops (Bepete and Lakso, 1997;Lakso et al., 1999). It therefore seems likely that the tissue maintenance costs in the orchard are overestimated in CLM5-FruitTree, accounting for on average 45 % of R a (28 % of R eco ). This could also explain the lower simulated carbon use efficiency (NPP/GPP) of 0.59 compared to 0.71 found by Zanotelli et al. (2013). Further work and more experimental data are needed to better understand the differences in modelled and observed respiration partitioning and to improve the performance of CLM5-FruitTree to adequately simulate the respiration components in fruit orchards.

Energy and water fluxes
The simulated seasonal course of the energy balance components R n , G, LE, and H agrees well with observed dynamics in the orchard (Fig. 8). CLM5-FruitTree shows a high performance in reproducing R n and LE, with r ≥ 0.97 and RMSE of 15.98 and 17.85 W m −2 , respectively (Fig. 8a and c). Due to the lack of LW in measurements, the CLM5-internal LW in calculation based on a clear-sky parameterization after Idso (1981) was used initially. This resulted in a significant underestimation of 5 % (511 MJ) for LW in and 18 % (471 MJ) for R n compared to the observations in 2010. The R n bias could be reduced by 14 % for the observed time series when LW in was calculated by considering cloud cover as described in Appendix B. This stresses the necessity of accounting for cloud cover, ideally combined with locally calibrated parameters, for an accurate calculation of LW in . The remaining small negative bias of 4.48 % in R n is due to negative simulated R n during the winter months (Fig. 8b), which may be a result of the higher reflectance of solar radiation from bare soil compared to a grass surface (Bryś et al., 2019). The model assumes a bare soil (except for stem area) during the dormancy period, as the grass-covered alleys in the orchard are not considered explicitly.
The simulated LE (Fig. 8c) shows similar dynamics and variability to the observations following the increase and decrease in GPP (Fig. 7a) and LAI (Fig. 5). Similarly to LE, modelled ET shows a high correlation coefficient of 0.97 and a small RMSE of 0.62 mm d −1 (Fig. 8i). Simulated ET exceeds observed ET by 1.1 mm d −1 on average during its peak in July, but the overall bias is almost negligible (Fig. 8j). Total observed ET is 901 (2013), 858 (2014), and 883 (2015) mm, while the corresponding simulated values are 916, 877, and 925 mm, respectively. When examining the order of magnitudes of the ET components, canopy transpiration takes up around 85 % of ET, followed by soil evaporation and canopy evaporation (data not shown). Typically, apple orchard ET represents a combined flux from the apple trees and the grassed alley system, which is not explicitly represented in CLM5-FruitTree since CLM5 currently does not consider inter-row grass coverage or intercropping. Ntshidi et al. (2021) found that the contribution of understory transpiration is high in young, non-bearing apple orchards but contributes less than 10 % to whole-orchard ET in mature orchards with high canopy cover, which may explain the good model performance despite not considering the grass cover.
Figure 7. Daily instantaneous (a, c, e) and cumulative (b, d, f) observed and simulated fluxes of gross primary production (GPP), net ecosystem exchange (NEE), and ecosystem respiration (R_eco) for the studied apple orchard between 2013 and 2015. Pearson's coefficient of correlation (r), the root mean square error (RMSE), and the percent bias (%bias) are displayed as statistical indices.
Simulated H and G are less consistent with the observations, with r values of 0.54 and 0.64, respectively, and large percent bias (Fig. 8e and g), which is partially due to the much smaller magnitudes of the two fluxes compared to R_n and LE. A possible reason for the lower amplitude of observed G (Fig. 8h) compared to simulated values may be the dampening effect of the grass cover providing additional shading during summer and insulation during winter (Bryś et al., 2019; Oorthuis et al., 2021). Observed H was rather constant throughout the year, with slightly higher values at the start and end of the growing season when the canopy was not yet fully developed or leaves were shedding. CLM5-FruitTree simulated a clear rise of H until April, closely following the observations, but H thereafter declined steeply in May, with negative values in August 2013 and 2015. Negative H during August corresponds to maximum LE and the main simulated irrigation season (June to September) that added 357 (2013), 281 (2014), and 517 mm (2015) of water to the orchard (Fig. 9a). In a study conducted with CLM4.5, intense irrigation was found to strongly influence the convective heat fluxes by increasing LE and decreasing H (Zeng et al., 2017). Although precise measurements of the irrigation amount in the orchard are not available for the studied period, the average yearly irrigation was estimated at around 200 mm, with no irrigation in 2014 due to sufficient rainfall (Montagnani et al., 2018). The difference in irrigation amounts may in part explain why the described phenomenon is not observed in the measurements. Indeed, negative simulated H in the summer months occurred as a result of strong evaporative cooling of ground and vegetation temperature through energy absorption by LE following irrigation, which caused simulated LE to exceed simulated R_n. This behaviour was not observed in the measurements, where LE rarely exceeded R_n, and was mostly due to an overestimation of simulated LE. Persisting model weaknesses in the partitioning of the energy balance were pointed out by a recent study examining land surface processes over a tropical rainforest using CLM4.5 and CLM5 and were linked to missing detail in the representation of the canopy and an oversensitivity of vegetation temperature to incoming solar radiation, among others (Song et al., 2020). As a result, the authors observed an overestimation of LE and unrealistically high day-to-night changes in G, which was also observed in this study when examining the model output at an hourly time step (results not shown).
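The link between LE exceeding R_n and negative H follows directly from the surface energy balance. The sketch below assumes a closed balance R_n = H + LE + G with G counted positive into the soil; the function name and the flux values are hypothetical and chosen only to illustrate the sign change. CLM5 itself solves the fluxes from temperature and humidity gradients rather than as a residual.

```python
def sensible_heat_residual(rn, le, g):
    """Sensible heat flux H (W m-2) as the residual of an assumed closed
    energy balance Rn = H + LE + G, with G positive into the soil."""
    return rn - le - g

# Hypothetical daily mean fluxes (W m-2) for a heavily irrigated summer day.
h_irrigated = sensible_heat_residual(rn=150.0, le=165.0, g=5.0)

# Hypothetical fluxes for a day with moderate evaporation.
h_moderate = sensible_heat_residual(rn=150.0, le=110.0, g=5.0)
```

Whenever LE is larger than R_n minus G, the residual H turns negative, meaning that heat is drawn from the air towards the evaporatively cooled surface.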
Energy partitioning in orchards is strongly influenced by the positioning and pruning of branches to optimize tree architecture for higher productivity, planting density, tree height, and LAI distribution (López-Olivari et al., 2016). Consequently, the contribution of H and LE can significantly differ in the discontinuous orchard canopy (grass-covered alleys between tree rows) compared to the closed canopies of annual crops (de la Fuente-Sáiz et al., 2017). Currently CLM5 is still limited to the assumption of a closed canopy structure that is uniform in space, and hence biases are likely to arise from this model limitation. Future developments towards integrating multi-layer schemes for canopy processes and the explicit representation of the canopy to improve the related processes are desirable for a more realistic representation of the orchard canopy structure.

Soil moisture variation
Simulated mean soil moisture (SM) at 5 cm depth was within 1.6 vol % of the observed value during the three observed growing seasons, despite the higher simulated irrigation amount (Fig. 9b). Simulated daily values show a greater variability than the measured data in response to precipitation and to frequent irrigation (Fig. 9a–b). In contrast, observed SM in the deeper soils (30–60 cm) was 3–11 vol % higher during the growing season compared to simulated values (Fig. 9c–d). Considering the total investigated soil depth, simulations exhibit a larger variability in SM throughout the year, with a general overestimation in winter and underestimation during the growing season (especially in the deeper soils). However, the collected SM data were limited to a single soil profile that may not adequately reflect the average soil moisture of the apple orchard, which should be considered when comparing measurements and simulations. Even though the measurements are incomplete, the constantly high observed SM in the deeper soils suggests an ample supply of water due to capillary rise from the shallow groundwater table that typically ranges between 1.2 and 1.85 m in the area (Montagnani et al., 2018). This process replenishes the water removed by ET processes and may explain the reduced need for irrigation compared to the simulations. Despite the shallow simulated groundwater table (generally 1.2 m depth), groundwater could not be used for root water uptake in the simulation as the rooting depth of the orchard was restricted to 0.8 m according to local measurements, and capillary rise is currently not implemented in CLM5.

Conclusions
The novel CLM5-FruitTree was developed to model perennial deciduous fruit orchards and thus extended the representation of agricultural systems in CLM5. The development included a new phenology subroutine to account for the perennial nature, prolonged growing season, and distinct phenological development of fruit trees compared to annual crops. Furthermore, C reserve dynamics of perennial deciduous trees were considered by adapting the CN allocation, and typical management practices associated with fruit orchards were represented, such as transplanting of seedlings and winter pruning. To evaluate the development, a new apple PFT was parameterized, and the model was set up and tested using extensive site data of a mature apple orchard in northern Italy.
One-by-one parameter sensitivity analysis revealed that photosynthetic parameters and parameters associated with canopy conductance have the highest influence on GPP, NEE, LE, and yield, while phenological parameters were more influential in biomass partitioning to the different plant organs. Due to the high number of model parameters and parameter covariation, future studies could propose a more comprehensive sensitivity analysis with a training data set consisting of multiple sites, which would give more insight into model sensitivity and could further improve the parameterization.
CLM5-FruitTree was able to capture the seasonal biomass development as well as the average relative partitioning of the total biomass into the different plant organs. The inclusion of C reserves next to photosynthetic growth was imperative to enable regrowth at the end of a dormancy period and influenced LAI development, total seasonal biomass, and yield. Average simulated yield was within 2.3 % of the observation even though CLM5-FruitTree showed a lower IAV likely due to the simplification of C reserve dynamics, specific management practices, and the alternate bearing behaviour exhibited by the Fuji apple cultivar.
The new phenology and CN allocation algorithms represented the seasonal course of carbon, water, and energy fluxes of the orchard well. The magnitude of ecosystem fluxes was particularly well captured for GPP, R_n, LE, and ET, with correlation coefficients > 0.94 and %bias < ±5 %. The model exhibited small biases in NEE and R_eco that were most likely caused by the overestimation of R_a, especially leaf maintenance respiration, and an underestimation of R_s. Possible reasons for the smaller simulated contribution of R_s to R_eco could be the missing representation of the grass-covered alleys, differences in simulated and actual soil temperature or organic matter content, and oversimplification of microbial respiration processes. Additionally, large negative biases in simulated H were found over most of the main irrigation season during summer as the model simulated a strong evaporative cooling of the surface temperature.
Further model developments should consider the improvement of canopy processes related to energy partitioning and the inclusion of an active ground cover in the orchard representation to improve the yearly energy budget calculations and possibly soil respiration. An explicit representation of the microbial community and a more flexible calculation of R a , i.e. considering tissue age, should also be the focus of future model improvements. While the particular alternate bearing of the Fuji variety posed a challenge in this specific study, the pruning routine that is currently implemented may be sufficient for most other apple cultivars and fruit tree species for which this behaviour is less pronounced or not exhibited. However, future developments could be envisioned once the model is further tested and applied. In addition, management practices such as mowing or soil tillage could further enhance the model capability of capturing the dynamics and fate of assimilated C. Fruit thinning is another common practice in orchards, but its implementation would be more challenging, as the current model structure does not represent individual fruits. This process could however be implicitly accounted for through parameterization of the C allocation to fruits. Finally, the application of the newly developed sub-model to different geographical regions and other types of fruit trees or apple cultivars is needed to further validate the model and give more insight into the transferability of the development to different types of orchards.
Overall, our results demonstrate the ability of the newly developed CLM5-FruitTree sub-model to represent the seasonal dynamics and magnitudes of growth and ecosystem fluxes in a deciduous fruit orchard. As such, this development constitutes an important contribution to a more comprehensive representation of the agricultural land surface in CLM5 by adding a perennial, woody crop to the existing annual crop types. This will allow for a more realistic evaluation of land use and climate change effects or water availability at regional scale such as the Mediterranean or parts of China and the US, where perennial agriculture such as fruit orchards covers large parts of the agricultural landscape.
Appendix A: Sequential model for bud break prediction
The bud break prediction in CLM5-FruitTree is based on the sequential model developed by Cesaraccio et al. (2004). Negative chill days (C_d) are accumulated from 1 November followed by positive anti-chill days (C_a) to overcome the different stages of tree dormancy, rest, and quiescence. The chilling requirement (C_R) defines the threshold for the accumulation of C_d and is reached when C_d ≤ C_R. Thereafter, C_a accumulation begins until C_R + C_a ≥ 0, at which point bud break occurs. The accumulation of C_d and C_a on a given day is calculated from maximum (T_x) and minimum (T_n) daily air temperature as well as a temperature threshold for chill accumulation (T_C) and varies depending on five possible temperature cases that relate T_x, T_n, T_C, and 0 °C to the daily mean air temperature (Table A1). The optimal values for C_R and T_C were calibrated based on bud break observations from 2010 to 2013 for the Adige site by minimizing the RMSE between observed and predicted bud break. The optimal value for C_R was −68, while T_C was 4 °C, resulting in an RMSE of 7.2 d.
Table A1. Chill day (C_d) and anti-chill day (C_a) calculation for five different temperature cases relating maximum (T_x) and minimum (T_n) air temperature to the air temperature threshold (T_C) and 0 °C; T_M is the mean air temperature.

Temperature cases | Chill days | Anti-chill days
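The accumulation logic of the sequential model described in Appendix A can be sketched as follows. This is a minimal illustration under stated assumptions, not the CLM5-FruitTree source code: the function name is hypothetical, the daily C_d and C_a increments are taken as precomputed inputs (their temperature-case formulas from Table A1 are not reproduced here), and anti-chill accumulation is assumed to start on the day after the chilling requirement is met.

```python
def predict_bud_break(daily_cd, daily_ca, chill_requirement=-68.0):
    """Return the 0-based day index (counted from 1 November) on which bud
    break is predicted, or None if it is not reached within the series.

    daily_cd: daily chill-day increments (values <= 0)
    daily_ca: daily anti-chill-day increments (values >= 0)
    chill_requirement: C_R (negative); -68 is the calibrated value in the text
    """
    cd_sum = 0.0
    ca_sum = 0.0
    chilling_met = False
    for day, (cd, ca) in enumerate(zip(daily_cd, daily_ca)):
        if not chilling_met:
            # Phase 1: accumulate negative chill days until C_d <= C_R.
            cd_sum += cd
            if cd_sum <= chill_requirement:
                chilling_met = True
        else:
            # Phase 2: accumulate anti-chill days until C_R + C_a >= 0.
            ca_sum += ca
            if chill_requirement + ca_sum >= 0.0:
                return day
    return None

# Hypothetical constant increments, used only to exercise the logic.
example_day = predict_bud_break([-10.0] * 40, [5.0] * 40)
```

With the hypothetical increments above, the chilling requirement is met on the seventh day and the anti-chill sum reaches 68 fourteen days later.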

Appendix B: Calculation of incoming longwave radiation
Incoming longwave radiation (LW_in) can be expressed based on the Stefan-Boltzmann law as

$$ LW_{in} = \varepsilon_{eff}\,\sigma\,T^{4} = \varepsilon_{cs}\,F\,\sigma\,T^{4}, \tag{B1} $$

where ε_eff is the effective emissivity, which can be expressed by multiplying the clear-sky atmospheric emissivity ε_cs by a cloud factor F (always ≥ 1) that expresses the increase in LW_in under cloudy conditions, σ is the Stefan-Boltzmann constant (5.67 × 10−8 W m−2 K−4), and T is the 2 m air temperature in Kelvin. Clear-sky emissivity was obtained using the Konzelmann et al. (1994) parameterization (Eq. B2), which expresses ε_cs as a function of e and T, where e is the vapour pressure in Pascal at 2 m. Equation (B1) can be rearranged to obtain F as follows:

$$ F = \frac{LW_{in}}{\varepsilon_{cs}\,\sigma\,T^{4}}. $$

F was calculated at an hourly interval using measured LW_in data from 2010, and ε_cs was calculated using Eq. (B2).
As proposed by Sedlar and Hock (2009), in the absence of cloud data, the cloud factor F can be parameterized as a function of the atmospheric transmissivity index τ, which is defined as follows:

$$ \tau = \frac{SW_{in}}{SW_{toa}}, $$

where SW_in is the incoming shortwave radiation, and SW_toa is the theoretical shortwave radiation received at the top of the atmosphere. Figure B1 shows the linear equation that was fitted to the relationship of F and τ for the year 2010. For the calculation of clear-sky emissivity, all data where τ was greater than 0.7 (N = 3863) were considered based on the suggestion by Campbell (1985).
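The chain of calculations in this appendix can be sketched as follows, assuming Eq. (B1) has the form LW_in = ε_cs F σ T^4 and that τ is the ratio of SW_in to SW_toa. All function names are hypothetical, ε_cs is taken as a precomputed input rather than derived from the Konzelmann et al. (1994) parameterization, and the slope and intercept in the example are made-up values, not the coefficients fitted in Fig. B1.

```python
SIGMA = 5.67e-8  # Stefan-Boltzmann constant, W m-2 K-4

def cloud_factor(lw_in, eps_cs, t_air):
    """Cloud factor F from measured LW_in, by rearranging Eq. (B1)."""
    return lw_in / (eps_cs * SIGMA * t_air ** 4)

def transmissivity(sw_in, sw_toa):
    """Atmospheric transmissivity index tau = SW_in / SW_toa."""
    return sw_in / sw_toa

def fit_linear(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def parameterized_cloud_factor(tau, slope, intercept):
    """Linear F(tau), constrained to F >= 1 as in the text."""
    return max(1.0, slope * tau + intercept)

def longwave_in(eps_cs, f_cloud, t_air):
    """Incoming longwave radiation (W m-2) from Eq. (B1)."""
    return eps_cs * f_cloud * SIGMA * t_air ** 4

# Hypothetical hourly sample: fit F(tau), then reconstruct LW_in.
slope, intercept = fit_linear([0.1, 0.3, 0.5, 0.7], [1.35, 1.25, 1.15, 1.05])
f_cloudy = parameterized_cloud_factor(transmissivity(100.0, 500.0), slope, intercept)
lw_est = longwave_in(eps_cs=0.75, f_cloud=f_cloudy, t_air=288.15)
```

Taking the maximum with 1 enforces the constraint F ≥ 1, so that clear-sky hours with high τ fall back to the clear-sky emissivity.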
For the nighttime values and for very low incoming shortwave radiation (SW_in < 15 W m−2), τ was gap-filled with the mean of the two surrounding values to obtain a complete time series of LW_in data. Figure B2 shows the results of the LW_in parameterization compared to LW_in calculated by CLM5 and to the observed data for the year 2010. As performance statistics, Pearson's r, the RMSE, and the percent bias are given.
Figure B1. Cloud factor F as a function of atmospheric transmissivity τ for hourly observations. The black line represents the linear equation for F(τ) and F ≥ 1. Clear-sky emissivity is parameterized based on Konzelmann et al. (1994).

Parameter | Description | Unit | Value | Source
… | … | … | … | Zanotelli et al. (2013, 2015) and the sequential model (Cesaraccio et al., 2004)
crit_temp | Critical temperature to initiate leaf senescence for fruit tree crops | K | 278.15 | Adjusted based on LAI measurements (Zanotelli et al., 2013)
grnfill * (GDDfruit) | GDD needed from bud break to beginning of fruit development | °days | 400 | Based on observed and commonly used values for apple trees (Zanotelli et al., 2013; Lakso et al., 2001; Neumann, 2020; Penzel et al., 2020)
grnrp * (GDDripe) | GDD needed from bud break to the fruit-ripening phase | °days | 1100 | Based on observed and commonly used values for apple trees (Lakso et al., 2001; Zanotelli et al., 2013; Neumann, 2020; Penzel et al., 2020)
huileaf (GDDleaf) | GDD accumulated at the moment of bud break (end of dormancy period) | °days | - | Calculated based on the sequential model for bud break prediction (Cesaraccio et al., 2004)
hybgdd * (GDDmat) | GDD needed from bud break until fruit harvest | °days | 2880 | Based on observed and commonly used values for apple trees (Lakso et al., 2001; Zanotelli et al., 2013; Neumann, 2020; Penzel et al., 2020)
laimx * | Maximum leaf area index | m2 m−2 | 3 | Based on observed and commonly used values for apple trees (Valancogne et al., 1999; Li et al., 2002; Zanotelli et al., 2013)
lfmat * (GDDlfmat) | GDD needed from bud break to canopy maturity | °days | 1350 | Based on observed and commonly used values for apple trees (Lakso et al., 2001; Zanotelli et al., 2013; Neumann, 2020; Penzel et al., 2020)
… | … | … | … | … (Lakso et al., 2001; Zanotelli et al., 2013; Penzel et al., 2020)
ndays_stor | Length of period for storage growth of fruit tree crops | d | 50 | Based on common values for fruit orchards (Kozlowski, 1992; DeJong and Grossman, 1994; Wünsche and Lakso, 2000)
perennial | … | … | … | … (Zanotelli et al., 2013)
aleafstor * | Leaf allocation coefficient for storage postharvest used in CN allocation | Unitless | 0.3 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
allconss * | Power to control the shape of the stem allocation curve | Unitless | 1.5 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
… | … | … | … | … (Zanotelli et al., 2013)
arootf2 * | Final root allocation coefficient until harvest | Unitless | 0.08 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
arooti * | Initial root allocation coefficient | Unitless | 0.7 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
astemf * | Final stem allocation coefficient | Unitless | 0.22 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
bfact * | Exponential factor used for fraction allocated to leaf | Unitless | −0.5 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
declfact * | Decline factor to control the shape of the stem allocation curve | Unitless | 4 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
fcur * | Fraction of C and N allocated to the displayed pools | Unitless | 0.95 | Tuned based on observed LAI and yield data (Zanotelli et al., 2013)
fleafi * | Initial leaf allocation coefficient | Unitless | 0.85 | Adjusted based on monthly biomass measurements (Zanotelli et al., 2013)
flivewd | Fraction of new wood that is live | Unitless | 0.15 | Same as BDT in CLM5
frootCN | Fine root C : N ratio | gC gN−1 | 32 | Average of six measurements (Zanotelli, 2010, unpublished data)
grainCN * | Fruit C : N ratio | gC gN−1 | 139 | Average of six measurements (Zanotelli, 2010, unpublished data)
leafCN * | Leaf C : N ratio | gC gN−1 | 19.7 | Average of six measurements (Zanotelli, 2010, unpublished data)
lflitCN | Litter C : N ratio | gC gN−1 | 60 | Average of four measurements (Zanotelli, 2010, unpublished data)
livewdCN | Livewood C : N ratio | gC gN−1 | 60 | Average of six measurements (Zanotelli, 2010, unpublished data)
transplant | Initial carbon for crops transplanted from nursery | gC m−2 | 5 |
Photosynthetic parameters
i_vcad * | Intercept of the relationship between leaf N per unit area and Vcmax25top | µmol CO2 m−2 s−1 | 5.2 | Adjusted in between BDT and crop
medlynslope * | Medlyn slope of conductance-photosynthesis relationship | µmol H2O µmol−1 CO2 | 8.2 | Tuned based on observed GPP and ET data (Zanotelli et al., 2015)
s_vcad * | Slope of the relationship between leaf N per unit area and Vcmax25top | µmol CO2 s−1 gN−1 | 34 | Tuned based on observed LAI and yield data (Zanotelli et al., 2013)
slatop * | Specific leaf area at top of canopy | m2 gC−1 | 0.028 | Mean value for the growing season based on LAI and leaf biomass measurements (Zanotelli et al., 2013)
Vegetation structure and management
displar | Ratio of displacement height to canopy top height | Unitless | 0.67 | Same as BDT in CLM5
mulch_pruning | Binary flag for mulching (1) or export (0) of pruning material | Unitless | 1 | Based on reported organic farming practices (Zanotelli et al., 2013)
prune_fr | Fraction of dead stem that is pruned | Unitless | 0.85 | Based on reported pruning quantity (Zanotelli et al., 2015)
nstem | Planting density | # m−2 | 0.33 | Based on reported planting density (Zanotelli et al., 2013)
taper | Ratio of stem height to radius at breast height | Unitless | 120 | Based on reported tree allometry and height (Zanotelli et al., 2013)
… | … | … | … | … (Tanny and Cohen, 2003) orchards
ztopmx | Maximum canopy height for crops | m | 3.6 | Based on reported tree heights (Zanotelli et al., 2013)

Code availability. The new CLM5-FruitTree sub-model is freely available via Zenodo at https://doi.org/10.5281/zenodo.6595378 (Dombrowski, 2022).
Data availability. Data from the Laimburg weather station were kindly provided by the station operator Martin Thalheimer (Research Centre for Agriculture and Forestry, Laimburg, Bolzano) upon request. All other data from the apple orchard in South Tyrol, Italy were provided by Damiano Zanotelli and his team and are licensed under the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/, last access: 23 June 2022). They can be made available upon request.
Author contributions. OD developed and modified the code for the sub-model, designed, performed, and analysed the simulations, and prepared the original draft of the manuscript. HB, HJHF, and CB supervised the research, and, together with DZ, contributed to the manuscript writing through review and editing.
Competing interests. The contact author has declared that neither they nor their co-authors have any competing interests.
Disclaimer. Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Acknowledgements. The authors are grateful to Damiano Zanotelli and his team for providing the field data of the apple orchard in South Tyrol, Italy. The authors thank Yuanchao Fan for sharing the source files for his development of CLM-Palm (Fan et al., 2015) that aided the development of the new CLM5-FruitTree sub-model. Furthermore, the authors thank Martin Thalheimer for providing the meteorological data from Laimburg meteorological station.
Financial support. This research has been supported by Horizon 2020 (ATLAS (grant no. 857125)) and by the Deutsche Forschungsgemeinschaft under Germany's Excellence Strategy (grant no. EXC-2070-390732324-PhenoRob).
The article processing charges for this open-access publication were covered by the Forschungszentrum Jülich.
Review statement. This paper was edited by Christoph Müller and reviewed by two anonymous referees.