Runoff evaluation in an Earth System Land Model for permafrost regions in Alaska

Huang, Xiang; Zhang, Yu; Gao, Bo; Abolt, Charles J.; Crumley, Ryan L.; Demir, Cansu; Fiorella, Richard P.; Busey, Bob; Bolton, Bob; Painter, Scott L.; Bennett, Katrina E.

doi:10.5194/gmd-19-1193-2026

Articles | Volume 19, issue 3

https://doi.org/10.5194/gmd-19-1193-2026

Articles | Volume 19, issue 3

Model evaluation paper

09 Feb 2026

Model evaluation paper |

| 09 Feb 2026

Runoff evaluation in an Earth System Land Model for permafrost regions in Alaska

Xiang Huang, Yu Zhang, Bo Gao, Charles J. Abolt, Ryan L. Crumley, Cansu Demir, Richard P. Fiorella, Bob Busey, Bob Bolton, Scott L. Painter, and Katrina E. Bennett

Abstract

Modeling of hydrological runoff is essential for accurately capturing spatiotemporal feedbacks within the land–atmosphere system, particularly in sensitive regions such as permafrost landscapes. However, substantial uncertainties persist in the terrestrial runoff parameterization schemes used in Earth system and land surface models. This is particularly true in permafrost regions, where landscape heterogeneity is high and reliable observational data are scarce. In this study, we evaluate the performance of runoff parameterization schemes in the Energy Exascale Earth System Model (E3SM) land model (ELM). Our proposed framework leverages simulation results from the Advanced Terrestrial Simulator (ATS), which is a physics-based integrated surface/subsurface hydrologic model that has been successfully evaluated previously in Arctic tundra regions. We used ATS to simulate runoff from 22 representative hillslopes in the Sagavanirktok River basin, located on the North Slope of Alaska, then compared the output with ELM's parameterized representation of total runoff. Results show that (1) ELM's total runoff was the same order of magnitude as the ATS simulations, and both models were similarly variable over time; (2) minor adjustments to coefficients in ELM's runoff parameterization improved the match between the ATS simulation and ELM's parameterized representation of annual and seasonal total runoff; (3) overall, runoff responses in ATS and ELM are more similar in flat hillslope environments compared to steep hillslopes; and (4) shallower active layer thicknesses and higher precipitation simulations resulted in lower correlations between the two models due to greater total runoff. By incorporating the optimized runoff coefficients from the Sagavanirktok River basin into ELM, the simulated total runoff better matched the streamflow observations at a small watershed located on the Seward Peninsula of Alaska. Our findings revealed important insights into the effectiveness of runoff parameterizations in land surface models and pathways for improving runoff coefficients in typical Arctic regions.

Download & links

Article (PDF, 2750 KB)

Supplement (491 KB)

Download & links

How to cite.

Received: 11 Apr 2025 – Discussion started: 21 May 2025 – Revised: 06 Oct 2025 – Accepted: 26 Dec 2025 – Published: 09 Feb 2026

1 Introduction

Runoff parameterization schemes play a critical role in the accuracy of Earth system models, particularly in sensitive environments such as high latitude permafrost regions. These areas are increasingly vulnerable to climate variability, and the hydrological responses associated with warming temperatures can have profound implications for ecosystems and water resources (Bring et al., 2016; Yang and Kane, 2020). The interplay between hydrology and climate dynamics in permafrost zones is complex because conditions such as vegetation, snow, soil wetness, ground ice content, and biogeochemical activities vary significantly over small spatial extents (Holmes et al., 2013; Bennett et al., 2022). At the same time, Earth system models and land surface models are designed for pan-Arctic scale simulations, creating a strong mismatch between the scale at which these processes occur and the models designed to represent them (Lique et al., 2016).

Despite the importance of runoff in accurately modeling ecosystem dynamics, considerable uncertainties remain in the parameterization schemes employed by land surface models. The accuracy of one process often depends on the scheme chosen for another, creating interdependencies that can complicate model accuracy. In permafrost regions, the presence of ice-rich permafrost can disrupt water infiltration processes, leading to increased surface runoff and altered drainage patterns (Kuchment et al., 2000; Walvoord and Kurylyk, 2016; Bennett et al., 2023). These heterogeneous conditions complicate efforts to accurately represent hydrological dynamics and highlight the necessity for improved modeling techniques. The scarcity of observational data and unmeasurable model parameters exacerbate these challenges, resulting in significant discrepancies between model outputs and real-world hydrological behavior (Bring et al., 2016; Clark et al., 2015, 2017). Addressing these uncertainties is essential for developing reliable predictive models that can support resource management and conservation efforts within rapidly changing Arctic ecosystems (Schädel et al., 2024).

Recent land model intercomparison projects (e.g., Boone et al., 2004; Lawrence et al., 2016; Collier et al., 2018; Mwanthi et al., 2024) have summarized various implementations of runoff schemes, ranging from simple bucket models to more advanced topography-based runoff models. These studies highlight significant variability in lateral surface runoff and subsurface runoff (baseflow) among different land models. Clark et al. (2015) emphasized the need to integrate groundwater-surface water interactions in Earth system models, while Maxwell et al. (2014) demonstrated the benefits of coupling surface and subsurface models for better predictions in complex landscapes. By improving soil freeze-thaw processes and incorporating soil heterogeneity, Liang and Xie (2001) and Swenson et al. (2012) achieved better runoff alignment with observed streamflow. Fan et al. (2019) identified lateral water flow as a crucial runoff component for the water cycle in the Arctic, with additional studies highlighting significant uncertainties in runoff parameterization schemes in high-latitude cold regions (e.g., Zheng et al., 2017; Hou et al., 2023; Abdelhamed et al., 2024). These efforts collectively highlight the pressing need to refine hydrological runoff simulations to improve predictions, particularly in permafrost regions as climate change intensifies.

Many previous studies have evaluated runoff parameterization by comparing different schemes against streamflow observations at large scales and coarse resolutions (e.g., Niu et al., 2007; Li et al., 2011; Swenson et al., 2012; Zheng et al., 2017; Li et al., 2024), however, high-quality streamflow data that can be used to validate runoff production are difficult to obtain in permafrost regions. This highlights the need for a more cost-effective and flexible framework to rapidly evaluate parameterization effectiveness using alternative approaches, for example, leveraging simulations from robust computational tools for physics-based permafrost thermal hydrology processes. The permafrost thermal hydrology capability (Painter et al., 2016) in the Advanced Terrestrial Simulator (ATS) (Coon et al., 2020) has emerged as a valuable tool in this regard. ATS has been successfully compared to snow depth, supra-permafrost water table depth, and vertical profiles of soil temperatures (Atchley et al., 2015; Harp et al., 2016; Jan et al., 2020) and to catchment-scale evapotranspiration and runoff (Painter et al, 2023) in continuous permafrost regions. ATS's permafrost thermal hydrology capabilities have been used in a variety of modeling studies (e.g., Atchley et al., 2015; Sjöberg et al., 2016; Jafarov et al., 2018; Abolt et al., 2020; Jan and Painter, 2020; Hamm and Frampton, 2021; Painter et al., 2023)

This study aims to evaluate and improve the parameterization of runoff processes in the Department of Energy's Energy Exascale Earth System Model (E3SM) Land Model (ELM) (e.g., Oleson et al., 2013; Bisht et al., 2018; Xu et al., 2022; Shi et al., 2024) using detailed simulations from ATS. We quantitatively assess ELM's runoff parameterization, focusing on total runoff rather than individual components separately. The method we detail in this work directly addresses the scale gap between local- to global-scale process representation in models, using intercomparison with local-scale ATS simulations and parameters updates in ELM to improve Arctic runoff processes. By adopting a total water mass balance perspective, this approach provides insights into the strengths and limitations of ELM's runoff schemes, ultimately enhancing its predictive capabilities in Arctic environments. Additionally, it offers a comprehensive understanding of how landscape features and thermal hydrological processes interact in permafrost regions.

2 Model description

2.1 ELM runoff parameterization schemes

The runoff parameterization within ELM is designed to represent how water moves across the land surface between grid cells and is influenced by numerous factors, including soil moisture, topography, vegetation cover, etc. ELM's runoff scheme (ported from CLM v4.5, Oleson et al., 2013) is based on a simple TOPMODEL-based concept with a simplified topography representation (Niu et al., 2005). The runoff in ELM is partitioned into surface and subsurface flows, both of which are assumed to be related to water storage, vertical infiltration, and groundwater-soil water interactions (Beven and Kirkby, 1979; Niu et al., 2005, 2007). There are more than twenty runoff components (variables) defined in ELM, but essentially, they can be categorized into three groups in a simulation with fixed land use: (i) surface runoff, (ii) subsurface runoff, and (iii) runoff from overland water bodies like wetlands, lakes, and glaciers. The top 10 layers in ELM considered soil to a depth of ∼ 3.8 m and are hydrologically and biogeochemical active. The remaining five ground layers in each column are considered to be dry bedrock that extend to a depth of ∼ 42.1 m. The discretization of the fifteen layers, including their depths and thicknesses, is described in Sect. S1 and Table S1 in the Supplement. Here, we only explicitly list the key equations representing the crucial components of the former two groups representing for surface/subsurface runoff; for a more detailed description of the underlying physics and complete formulations, readers are referred to existing literature (Oleson et al., 2013; Niu et al., 2005; Bisht et al., 2018; Liao et al., 2025).

Surface runoff is composed of two components: (i) outflow from the saturated portion of a grid cell with excess water, q_over , and (ii) outflow from surface water storage such as a pond, q_h2osfc. The first term is written as:

\begin{matrix} (1) & q_{over} = q_{liq, 0} f_{\max} \exp (- 0.5 f_{over} Z_{\nabla, perch}) \end{matrix}

where q_liq,0 is the sum of liquid precipitation reaching the ground and melt water from snow (kg m⁻² s⁻¹); f_max is the ratio of the area that has higher compound topographic index (CTI) values than the mean CTI value of the grid cell, with a consideration of geomorphological features; f_over is a decay factor that is often calibrated using the recession curve of the observed hydrograph, taken as 0.5 m⁻¹; Z_∇,perch is the perched groundwater table depth (m) within the thawed soil layers. The second term is formulated as:

\begin{matrix} (2) & q_{h 2 osfc} = k_{h 2 osfc} f_{connected} (W_{sfc} - W_{c}) \frac{1}{Δ t} \end{matrix}

where the storage coefficient k_h2osfc=sin(β) is a function of grid cell mean topographic slope β (in radians); f_connected is the fraction of the inundated portion of the interconnected grid cell, calculated as $f_{connected} = {(f_{h 2 osfc} - 0.5)}^{0.14}$ , which is equal to 0 if f_h2osfc is smaller than 0.5, and the f_h2osfc is the fraction of the area that is inundated. W_sfc represents surface storage water (kg m⁻²), determined by the surface-inundated fraction f_h2osfc, the slope β, the ponded water height, and microtopographic features. W_c is the amount of surface water present when f_h2osfc=0.5; and Δt is the model time step (0.5 h was used in this study).

Subsurface runoff is also composed of two components: (i) drainage in the frozen soils where the groundwater table remained dynamic under partially frozen conditions, q_drain, and (ii) drainage from the thawed active layer, q_drain,perch. The first term is based on the following exponential relationship:

\begin{matrix} (3) & q_{drain} = 10^{Θ_{ice}} \cdot 10 \sin (β) \cdot \exp (- 2.5 Z_{\nabla}) \end{matrix}

where Θ_ice is an exponent of the ice impedance factor. It is calculated as $Θ_{ice} = - 6 (\sum_{i = j w t}^{N_{levsoi}} S_{ice, i} Δ Z_{i} / \sum_{i = j w t}^{N_{levsoi}} Δ Z_{i})$ , where S_ice,i is the saturation degree of ice in soil layer i; ΔZ_i is the layer thickness; jwt is the index of the layer directly above the water table; and N_levsoi=15 refers to the total number of soil layers. Z_∇ is the groundwater table depth (m). It should be noted that for continuous permafrost or frozen soil, its drainage is equal to zero or tiny values, and here the last term in Eq. (3) is reduced to a very small value, i.e., 2.8 × 10⁻¹⁰. The second term refers to the lateral drainage from the perched saturated zone between layers N_perch and N_frost, written as:

\begin{matrix} (4) & \begin{aligned} q_{drain, perch} & = 10^{- 5} \sin (β) \\ [\sum_{i = N_{perch}}^{i = N_{frost}} 10^{- 6 (\frac{S_{ice, i} + S_{ice, i + 1}}{2})} k_{sat, i} Δ Z_{i}] \\ (Z_{frost} - Z_{\nabla, perch}) \end{aligned} \end{matrix}

where k_sat,i is soil hydraulic conductivity (m s⁻¹) at saturated unfrozen status in soil layer i. Z_frost is the frost table defined as the shallowest frozen layers having an unfrozen layer above it (m). Z_∇,perch is the perched groundwater table depth (m) within the thawed layers above icy permafrost ground.

In this study, the total runoff from ELM is calculated as the sum of the above four runoff components, expressed as $q_{total} = q_{drain} + q_{drain, perch} + q_{over} + q_{h 2 osfc}$ (with units in mass water flux, kg m⁻² s⁻¹).

2.2 ATS runoff generation schemes

ATS solves integrated surface/subsurface flow in complex topographic landscapes with complex soil structures, which can capture a wide array of processes and their interactions to produce a holistic system understanding of a system (Painter et al., 2016; Coon et al., 2020; Gao and Coon, 2022). As a physics-based hydrological model, ATS uses physically based representations for surface runoff, subsurface runoff, and river routing. Here, only the key governing equations are presented.

The subsurface variably saturated flow is based on the Richards equation with phase change to solve the conservation of water mass, written as:

\begin{matrix} (5) & \frac{\partial}{\partial t} [\emptyset (ω_{g} m_{g} s_{g} + m_{l} s_{l} + m_{i} s_{i})] = - \nabla \cdot (m_{l} q_{l}) + Q_{w} \end{matrix}

where ∅ is porosity; the subscripts g, l, and i refer to the gas, liquid, and ice phases; ω is the gaseous mole fraction (mol mol⁻¹) referring to a molar fraction of water vapor within all gas in the pore space; m is the molar density of a particular phase (mol m⁻³); s is saturation ( $s_{g} + s_{l} + s_{i} = 1)$ ; and Q_w refers to sources and sinks (mol s⁻¹). The Darcy velocity (m s⁻¹) is presented as $q_{l} = - k_{int} k_{rl} / μ_{l} (\nabla P_{l} + ρ_{l} g \nabla z)$ , where k_int is intrinsic permeability (m²), k_rl is relative permeability, μ_l is dynamic viscosity (Pa s), P_l is pressure head (Pa), ρ_l is water density (kg m⁻³), g is the gravitational acceleration (m s⁻²), and z is the vertical elevation (m). The vapor pressure in the pore space is assumed to be in equilibrium with the liquid phase above the freezing temperature and in equilibrium with the ice phase below freezing. The parameterizations and constitutive relationships, such as the van Genuchten soil water retention curve and water-ice phase transition functions, are omitted here. The conservation of energy in the subsurface assumes local thermal equilibrium among the ice, liquid, gas, and solid grains, presented as:

\begin{matrix} (6) & \begin{aligned} \frac{\partial}{\partial t} & [\sum_{j = l, g, i} \emptyset m_{j} s_{j} u_{j} + (1 - \emptyset) C_{e} T] = - \nabla \cdot (m_{l} h_{l} q_{l}) \\ + \nabla \cdot (λ_{e} \nabla T) + Q_{E} \end{aligned} \end{matrix}

where T is the temperature (K); u is the specific internal energy (J mol⁻¹); h is the specific enthalpy (J mol⁻¹); C_e and λ_e are the equivalent heat capacity (J m⁻³ K⁻¹) and thermal conductivity (W m⁻¹ K⁻¹) of the soil composite (liquid, ice, gas, and solid grains); Q_E is the thermal energy sources and sinks (W m⁻³).

The thermal surface flow with phase change is governed by three core equations (Painter et al., 2016): the mass balance for water in the liquid and ice phases, a diffusion wave approximation for surface flow extended to include an immobile ice phase, and the energy balance equation. The effects of surface water freezing and thawing are incorporated through a liquid–ice partitioning factor, or unfrozen fraction χ, which depends on the surface water temperature and varies smoothly from 0 to 1 as the temperature rises through the freezing point. These governing equations are expressed as follows:

\begin{matrix} (7) & \frac{\partial}{\partial t} [δ_{w} χ m_{l} + δ_{w} (1 - χ) m_{i}] + \nabla \cdot (δ_{w} χ m_{l} U_{w}) = Q_{ss} \\ (8) & U_{w} = - \frac{{(χ δ_{w})}^{2 / 3}}{n_{mann} {(∥\nabla Z_{s}∥ + ε)}^{1 / 2}} \nabla (Z_{s} + δ_{w}) \\ (9) & \begin{aligned} \frac{\partial}{\partial t} & [δ_{w} χ m_{l} u_{l} + δ_{w} (1 - χ) m_{i} u_{i}] + \nabla \cdot (δ_{w} χ m_{l} U_{w} h_{l}) \\ - \nabla \cdot [δ_{w} (χ κ_{l} + (1 - χ) κ_{i}) \nabla T_{s}] = Q_{ess} \end{aligned} \end{matrix}

where U_w is the surface flow velocity (m s⁻¹); Q_ss and Q_ess are the mass (mol m⁻² s⁻¹) and energy (W m⁻²) source/sink terms, respectively; δ_w is ponded depth (m); n_mann is Manning's coefficient (s m $^{- 1 / 3}$ ); ε is a small positive regularization (m) to keep the equations non-singular in regions with zero slope ratio. The ponded depth and surface elevation Z_s are defined in two dimensions (x–y), and the vector operators are to be interpreted accordingly.

The land surface energy is calculated either at the surface of a snowpack or ponded water, presented as:

\begin{matrix} (10) & \begin{aligned} (1 - α) Q_{sw}^{in} & + Q_{lw}^{in} + Q_{lw}^{out} + Q_{h} (T_{s}) + Q_{e} (T_{s}) \\ + Q_{c} (T_{s}) = 0 \end{aligned} \end{matrix}

where α is surface albedo and T_s is the surface temperature (K); $Q_{sw}^{in}$ is incoming shortwave radiation (W m⁻²); $Q_{lw}^{in}$ is incoming longwave radiation (W m⁻²); $Q_{lw}^{out}$ is outgoing longwave radiation (W m⁻²); Q_h is the sensible heat flux (W m⁻²); Q_e is the latent heat flux (W m⁻²); and Q_c is the conductive heat flux (W m⁻²).

The above Eqs. (5)–(10) represent the integrated surface–subsurface thermal hydrological processes. Continuity of primary scalar fields and fluxes (e.g., pressure, temperature, and water content) is enforced across the surface–subsurface interface. The fully coupled system is solved simultaneously to capture key hydrological dynamics, including freeze–thaw transitions and energy–water exchanges, enabling the generation of physically consistent hydrological outputs. Here, we consider the cumulative discharge (mol s⁻¹) at the downstream outlet as the total runoff from the simulation domain.

3 Methodology

3.1 Study areas

The first study area is located in the Sagavanirktok (Sag) River basin, located on the North Slope of Alaska (Fig. 1a). Temperatures in the Sag range from −25 °C in January to 15 °C in July, with approximately half of the annual precipitation occurring as snowfall from September through April, while summer rainfall contributes around 50 % of the total precipitation. The basin is characterized by broad alluvial valleys and rolling tundra topography underlain by continuous permafrost. Vegetation is dominated by Arctic tundra communities, including mosses, lichens, and dwarf shrubs. Soils are generally silty loams with an organic-rich active layer overlying mineral, and permafrost extends to considerable depths. The shallow active layer, typically less than 1.0 m, strongly regulates hydrological processes, as thaw depth controls infiltration, runoff generation, and subsurface drainage. These conditions make the Sag basin representative of cold, dry Arctic watersheds.

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f01

Figure 1Map of the (a) Teller27 and the (b) Sag River sites, and inset map of Alaska with sites indicated with red boxes. Detailed site maps (a) and (b) include watershed boundaries (red), rivers (blue), Sag River hillslopes (white dots, 22), the Teller27 discharge gauge (red dot), and the Digital Elevation Models (DEMs) for each site. The 2.5 km² Teller27 site, being much smaller and with much more limited elevational range than the Sag River site, is 10 times smaller than the Sag River basin elevational range (see legend). Sag hillslopes (22) are not found on the lower elevation North Slope coastal plain, below 250 m.

The second study area is the Teller watershed, a 2.5 km² drainage basin located approximately 27 miles from Nome on Teller Highway, located on the Seward Peninsula of Alaska (hereafter referred to as Teller27, Fig. 1b). Compared to the cold and dry climate of the Sag River basin, the Teller27 site experiences a warmer and wetter climate. The Sag site receives over twice the annual snowfall of Teller27, while the Teller27 site is 7–8 °C warmer on average (Gao and Coon, 2022). The watershed is situated in a landscape of rolling hills underlain by discontinuous permafrost, where thaw depth varies considerably across slope positions and is strongly influenced by soil and vegetation cover. The land cover is dominated by moist tundra ecosystems, including mosses, dwarf shrubs, and patches of willow along riparian zones. Soils generally consist of an organic-rich surface horizon underlain by silty to loamy mineral substrates. Streamflow measurements were collected at the Teller27 watershed river outlet (Busey et al., 2019) from 2016–2023. For additional climate, snow, subsurface properties, and permafrost at the Teller27 site, refer to Bennett et al. (2022), Jafarov et al. (2018), Léger et al. (2019), Thaler et al. (2023), and Wang et al. (2024).

3.2 Numerical Experiment Design

Our experimental design for this work consists of three main steps as follows. First, results from detailed ATS simulations at the Sag River basin were compared to ELM's runoff parameterization schemes. Specifically, ATS-simulated thaw depth, water table depth, and ice content were used in ELM's parameterization Eqs. (1)–(4), and the results were then compared to ATS-simulated total discharges. Second, runoff coefficients derived from the Sag site were implemented into ELM's source code and tested for transferability at the Teller27 site, without the need for additional ATS simulations. Third, ELM simulations at the Teller27 site were evaluated directly against observed streamflow data from the Teller watershed to assess the performance and generalizability of the adjusted runoff coefficients.

The ATS model was implemented to simulate local-scale hillslope hydrological processes at the Sag River site only. Meteorological forcing data for this region are sourced from the Daymet version 4 dataset (Thornton et al., 2020), developed at a daily timestep. ATS modeling follows three steps. First, a soil column with an initial temperature above freezing was subjected to bottom-up freezing by imposing a constant temperature of −10 °C at the bottom boundary until a steady-state frozen soil profile was formed. In this stage, only the subsurface flow-energy processes were included. The resulting pressure and temperature profiles were then used to initialize spin-up runs for each hillslope model until a cyclically steady state was achieved. During the spin-up phase, the full physics system was applied, including surface and subsurface thermal flow as well as surface energy balance processes, using smoothed meteorological forcings. Each cyclically steady hillslope was then used as the initial condition for transient simulations driven by real forcings. For all hillslope simulations, the bottom boundary temperature was fixed at −10 °C. Closed boundary conditions were applied to all other subsurface and surface boundaries except at the surface outlet, where a seepage face boundary condition was applied. Both the column and hillslope models were represented with three soil layers: the top two organic soil layers (i.e., acrotelm, catotelm) and the bottom mineral layer. The thickness and properties of the soil layers for hillslopes were determined based on the corresponding land cover types according to O'Connor et al. (2020). The soil properties of each soil layer are listed in Table S2. The thermal conductivity representation followed Atchley et al. (2015). For further details on ATS model mesh generation, boundary conditions, model setup, and initialization, refer to Jan et al. (2020), Gao and Coon (2022), and Coon et al. (2022). We randomly selected 22 hillslope sites within the Sag River basin, ensuring representation across a range of slope, aspect, and drainage area, providing a basis for sensitivity analysis.

ELM was implemented to simulate subgrid hillslope hydrological processes specifically for the Teller27 watershed. We configured ELM at a spatial resolution of 0.5°, incorporating finer-scale surface and land use input parameters, but not recently developed features such as the topounit or elevation bands that represent sub-grid variability (Hao et al., 2022; Tesfa and Leung, 2017). ELM was driven by 1-hourly meteorological forcing data from ERA5-Land (Muñoz-Sabater et al., 2021), ensuring realistic and high-temporal-resolution atmospheric inputs. We used the Offline Land Model Testbed (OLMT) to standardize the ELM case setup and model spin-up (e.g., Sinha et al., 2023). Model spin-up proceeds through two phases after Thornton and Rosenbloom (2005): the first phase features accelerated biogeochemical cycling, while the second phase uses standard biogeochemical reaction rates. These spin-up phases are run for 260 and 200 years, respectively, to ensure vegetation and biogeochemistry have approached a steady-state condition before beginning a transient run that spans 1850–2024. To validate the runoff from ELM, we used the streamflow measurements collected at the gage. To evaluate the influence of soil property parameters on ELM-simulated runoff, a series of parameter sensitivity analyses was conducted. In each simulation, one parameter was perturbed by ±50 % from its default average value (ELM's surface and land use files are extracted from the global 0.5° resolution files), while the values of all other parameters were fixed.

3.3 ELM runoff parameterization evaluation

We constructed variable-width hillslope models for each of the 22 hillslope sites to represent both vertical and lateral dynamics and simulated the fully coupled cryo-hydrological processes using ATS. To facilitate comparison with ELM, we post-processed the ATS simulation outputs by averaging the surface and subsurface variables across ATS's multiple horizontal units (columns), preserving the number of vertical layers (rows, Fig. 2). This produced a 1D multi-layer (rows), single column profile that mimics ELM's 15-layer subgrid-scale structure. From this 1D multi-layer representation, key model variables such as maximum seasonally thawed depth of the upper soil layers above the permafrost (referred to as active layer thickness, ALT), surface water content, water table depth, soil moisture and ice content were extracted and used as inputs to ELM's runoff parameterization equations (Eqs. 1–4) to compute surface and subsurface runoff components. The resulting total runoff from ELM was then compared against the cumulative runoff stimulated directly by ATS. Figure 2 illustrates the overall workflow, including the extraction and processing of ATS's variables, the calculation of ELM runoff components, and the comparison with ATS's discharge accumulation at the outlet, as well as the averaging of ATS horizontal units to a 1D multi-layer representation. Layers can be of variable depth, as indicated in Fig. 2.

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f02

Figure 2Map (a) Workflow for calculating ELM's total runoff, highlighting key variables, runoff components, and computational steps involved. (b) Averaging of ATS's simulation results, illustrating the transformation from high-resolution pseudo-2D ATS outputs averaged across all hillslopes, to simplified 1D ELM column representations.

Download

To obtain the optimized runoff coefficients, we formulated objective functions that minimize the difference between the ATS-simulated runoff and a weighted sum of ELM's runoff components. The optimization was carried out using a constrained numerical minimization algorithm, with non-negativity bounds imposed on all coefficients. This procedure was conducted across multiple hillslope sites and under both annual and seasonal (warm–cold) conditions, resulting in site-independent adjusted coefficients for each case. To assess the performance of the optimized runoff coefficient and quantify the differences between them, three statistical metrics were calculated: root mean square error (RMSE), mean absolute error (MAE), and Nash–Sutcliffe efficiency coefficient (NSE). These metrics were selected as complementary evaluation measures following standard hydrological model assessment practices (Moriasi et al., 2007). RMSE emphasizes large errors and highlights peak mismatches, MAE reflects the average magnitude of deviations and is less sensitive to outliers, and NSE evaluates overall model efficiency relative to the observed mean. Together, they provide a balanced and robust assessment of ELM runoff performance. They are defined as follows:

\begin{matrix} (11) & RMSE (c_{1}, c_{2}, c_{3}, c_{4}) = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}} \\ (12) & MAE (c_{1}, c_{2}, c_{3}, c_{4}) = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - {\hat{y}}_{i}| \\ (13) & NSE (c_{1}, c_{2}, c_{3}, c_{4}) = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - {\overline{y}}_{i})}^{2}} \end{matrix}

where y_i and ${\overline{y}}_{i}$ represent the ATS simulated value and its average value, respectively; ${\hat{y}}_{i} = c_{1} q_{drain} + c_{2} q_{drain, perch} + c_{3} q_{over} + c_{4} q_{h 2 osfc}$ is the calculated (or optimized) value based on ELM's equations. The coefficients c₁, c₂, c₃, and c₄ are weighting factors used to evaluate the relative contribution of each component. In the non-adjusted case, all four coefficients are set to 1.0, directly summing all runoff components. In the adjusted case, the coefficients are optimized through regression against the ATS-simulated runoff to best match the observed behavior. n is the number of annual or seasonal cumulative runoff values used in the comparison.

4 Results

4.1 Evaluation of the runoff schemes in ELM

ELM runoff results were evaluated in comparison with ATS's drainage at the outlet. In the plots below, we compared the annual and seasonal total cumulative runoff using both adjusted and non-adjusted coefficients c_i, under a variety of conditions across different hillslope sites (e.g., forcings, slope, ALT, etc.). The three columns in Fig. 3 correspond to different ELM parameterization cases: (1) unadjusted coefficients, (2) annually optimized coefficients, and (3) seasonally (warm–cold) optimized coefficients. The three rows represent different color-coded variables: slope (top), ALT (middle), and total precipitation (bottom).

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f03

Figure 3Comparison of cumulative total runoff between ATS and ELM models under various hillslope conditions. The different colors represent slope (a–c), active layer thickness (d–f), and total precipitation (g–i). Columns (from left to right) correspond to different coefficient settings in ELM: unadjusted (a, d, g), annually adjusted (b, e, h), and seasonally adjusted (c, f, i). The results are displayed for different runoff parameterization coefficients (as indicated on the x axis) in ELM and compared against ATS-simulated runoff.

Download

It can be seen that ELM's annual total cumulative runoff is generally smaller than ATS's runoff when using unadjusted runoff coefficients, with an NSE of 0.30 (Fig. 3a). However, when adjusted coefficients are applied, the two models show much better agreement, achieving an NSE of 0.68 when adjusted based on annual discharge (Fig. 3b) and 0.83 when optimized using separate adjustments for the warm and cold periods (Fig. 3c). The seasonal total cumulative runoff is calculated for the warm season (May to August) and cold season (September to April) within the calendar year. This improved alignment in the seasonal total cumulative runoff is also reflected in performance metrics, with the RMSE, MAE, and NSE increasing by 39 %, 41 %, and 22 %, respectively, over the total runoff without adjusted coefficients, respectively.

The total runoff during the cold season is normally lower than that of the warm season. This is primarily because when the ground is fully frozen, overland flow (q_over) becomes the only dominant component of total runoff, which occurs due to excess meltwater from snow after limited vertical infiltration. As shown by the diamond symbols in Fig. 3c, f, and i, ELM's results are generally well-aligned with ATS's drainage during cold seasons, though some values are slightly lower. In the cold season, no active layer exists, whereas it does exist during the warm season. Cold-season runoff on steeper slopes tends to be lower, likely due to limited infiltration in frozen soils, which reduces subsurface flow pathways. In contrast, during the warm season, total runoff generally increases with slope, as shown in Fig. 3c, likely due to enhanced overland flow and faster hydrological response. Despite this trend, the runoff alignment in the seasonal total runoff (Fig. 3c) remains better compared to the total annual (Fig. 3b). This indicates that runoff differences between the two models cannot be fully explained by variations in slope, highlighting the influence of additional factors such as model parameterizations and physical processes.

By carefully examining the results with the adjusted coefficients in Fig. 3b, we observed that ELM performs well in representing lower slopes compared to the ATS benchmark, with a few exceptions. On steeper slopes (> 8°), ELM predicts lower runoff values than ATS, with differences reaching up to 150 kg m⁻². Interestingly, ELM also underpredicts runoff on shallower slopes (< 2°), suggesting that runoff differences between the two models are influenced by more than just slope. This variation could arise from differences in how the two models handle topographic gradients, basin size, or spatial heterogeneity in climate forcings. Figure 3e highlights that ELM performs better with deeper ALT, likely because these active layers experience less freezing, allowing for increased water storage in the deeper thawed zones and reduced lateral drainage. Figure 3h reveals a strong relationship between total cumulative runoff and precipitation, and ELM generally captures lower precipitation events more accurately, leading to correspondingly lower total runoff. As expected, Fig. 3i reveals a clear trend between warm-season total runoff and precipitation magnitude. However, ELM tends to underpredict runoff under higher precipitation conditions. This underscores the critical role of climate forcing data in shaping model performance and highlights ELM's sensitivity to precipitation variability.

To better understand these findings, it may be important to consider spatial variations in terrain landforms (e.g., concave vs. convex) across the modeled hillslopes, as well as alternative hillslope representations through structural formulations (e.g., Swenson et al., 2019; Lawrence and Swenson, 2024). In addition, these watersheds span a broad latitudinal range (approximately 68 to 70° N), which likely introduces considerable climatic and topographic variability. For example, steeper slopes can promote rapid surface runoff, while gentler slopes with deeper thawed layers may enhance water retention and reduce lateral drainage.

This improved alignment between ELM and ATS's annual total cumulative runoff is also evident in time series comparisons of total runoff from a selected hillslope, shown in Fig. 4, both with and without adjusted coefficients. Figure 4a illustrates the results using the unadjusted coefficients, where ELM underestimates runoff peaks and exhibits a relatively weak correlation with ATS discharge. The RMSE, MAE, and NSE values (0.55, 0.22, and 0.48, respectively) indicate moderate agreement but suggest room for improvement, particularly in capturing peak runoff events. Figure 4b shows the impact of applying annually adjusted coefficients, which improves model performance, as reflected by reduced RMSE (0.51) and MAE (0.19), along with a higher NSE (0.56). The adjusted coefficients result in better agreement during high-runoff periods, though some discrepancies remain in capturing the full variability of runoff responses. Figure 4c presents the results with seasonally adjusted coefficients, which yield the best overall performance among the three cases. The RMSE and MAE decrease further to 0.46 and 0.18, respectively, while the NSE increases to 0.64, demonstrating a stronger correlation between ELM and ATS runoff. The seasonal adjustment appears to enhance the model's ability to capture the timing and magnitude of runoff peaks (see Fig. 4d), particularly during snowmelt periods. However, some deviations remain, likely due to limitations in parameterizing snowmelt-driven surface flow and subsurface hydrological interactions in ELM.

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f04

Figure 4Time series comparison of total runoff between ATS and ELM for a typical hillslope with an average slope of 6.6°, a watershed area of 2.43 km², and the ALT ranging from 0.43 to 0.59 m. Results are shown with (a) unadjusted coefficients, (b) annually-adjusted coefficients, and (c) seasonally-adjusted coefficients.

Download

These results highlight the importance of refining runoff coefficients and incorporating seasonal variations to improve the predictive capability of ELM, particularly in Arctic environments where snowmelt dynamics play a dominant role in runoff generation. In general, better alignment is observed when ATS's cumulative runoff remains below a certain threshold, such as 15 kg m⁻² d⁻¹. For larger runoff, ELM tends to underpredict total runoff, indicating limitations in its ability to represent higher runoff scenarios accurately.

4.2 Teller27 watershed evaluation

The improved ELM runoff coefficients were evaluated using observed streamflow data from the Teller27 watershed, spanning 2016–2019 (Busey et al., 2019). Results in Fig. 5 indicate that the adjusted coefficients significantly enhance the agreement between ELM-simulated runoff and observations. The unadjusted coefficients consistently underpredict runoff, with the largest discrepancy observed in 2016 (ELM: 374.66 kg m⁻², observed: 648.25 kg m⁻²). Both annually and seasonally adjusted coefficients reduce this gap, with the seasonally adjusted coefficients providing better performance, particularly in years with greater interannual variability in precipitation and thaw depth.

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f05

Figure 5Comparison of cumulative annual total runoff between ELM-simulated and observed data at the Teller27 watershed, Alaska, from 2016 to 2019. Symbol sizes represent ELM results using unadjusted coefficients, annually adjusted coefficients, and seasonally adjusted coefficients.

Download

Figure 6 presents the outcomes of the parameter sensitivity experiments, illustrating how changes in land surface properties influence simulated total runoff. It can be seen that total runoff is relatively insensitive to variations in surface soil properties such as clay, sand, and organic matter content, suggesting that soil texture and composition play a secondary role in controlling runoff dynamics within the model. Simulated runoff totals under these parameter perturbations remained within a relatively narrow envelope, regardless of whether baseline or optimized runoff coefficients were applied. This occurs because runoff is most due to overland flow during the spring snowmelt, when the ground is still frozen. These differences were notably smaller than the variations observed across different years or watersheds. The broader implications of these findings are further discussed in Sect. 5.2.

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f06

Figure 6Variations in annual cumulative total runoff from ELM simulations at the Teller27 watershed, Alaska for the mean of all years (2016–2019). Different bars groupings along the x axis illustrate scenarios where parameters are reduced or increased from their default average values in ELM's soil physical property and snow thermal conductivity. Bar colours represent results with unadjusted coefficients (teal), annually adjusted coefficients (green), and seasonally adjusted coefficients (grey). The mean annual cumulative runoff from Teller observations (mean all years 2016–2019) is given in the dashed line.

Download

Figure 7 shows a temporal comparison of weekly and monthly total runoff, while Table 1 quantifies performance using RMSE, MAE, and NSE metrics. The adjusted coefficients yield lower RMSE and MAE values and higher NSE scores than the unadjusted coefficients, confirming their effectiveness, despite being derived from a study site some distance from the Teller27 watershed. These results clearly demonstrate that ELM's subgrid-scale runoff scheme performs well at the monthly timescale, but struggles to capture sub-monthly (e.g., weekly) runoff variability. Although ELM's total runoff is of the same order of magnitude as the observed discharge at Teller creek, discrepancies arise during specific events. Notably, spikes in runoff occur during rapid snowmelt in early spring due to ponded surface water, and the baseflow during recession periods are poorly simulated.

https://gmd.copernicus.org/articles/19/1193/2026/gmd-19-1193-2026-f07

Figure 7Comparison of ELM-simulated (a) weekly and (b) monthly total runoff with observed data at the Teller27 watershed, Alaska. Simulated results are displayed using different runoff parameterization coefficients in ELM, represented by distinct colored lines. Rain and snow precipitation are shown as bar plots using the top x axis (time) and left y axis (mm) in both subplots.

Download

Table 1Performance metrics (RMSE, MAE, and NSE) for ELM-simulated total runoff compared to observed streamflow across all temporal scales (daily, weekly, and monthly) at the Teller27 watershed, Alaska.

Download Print Version | Download XLSX

The weekly average results (Fig. 7a) reveal significant discrepancies in runoff peaks with unadjusted coefficients, while those biases are reduced by the simulations with adjusted coefficients. This may be due to large peak flow simulated during snowmelt in ELM that is not represented in the observations, thus flows throughout the rest of the summer are too low in the simulations. We believe that this response may be occurring due to a frozen active layer that leads to fast runoff that is not observed in the hydrological records. While we observed this overestimate of flow peaks, the seasonally adjusted coefficients still perform better in capturing both the timing and magnitude of runoff peaks, perhaps because the seasonal coefficients adjust for some attenuation of runoff. The monthly results (Fig. 7b) show improved agreement with observed data during high runoff periods, particularly those driven by snowmelt and seasonal precipitation. This is likely because the monthly results accumulate flow responses over longer periods of time, accounting for and spreading out this large peak flow runoff response that was incorrectly simulated at the weekly time scale. For example, at the monthly scale, NSE improves markedly from 0.07 with unadjusted coefficients to 0.58 and 0.60 with annually and seasonally adjusted coefficients, respectively. These results highlight the efficacy of the adjusted coefficients and the critical role of incorporating seasonal variability into runoff parameterization for improving ELM's performance.

5 Discussion

Intercomparison studies to consider differences and similarities between a land surface scheme to a physically-based model are useful because they allow for quantitative evaluation of each model and its performance for different components of the hydrologic cycle. The technique applied here was to extract the land surface parameterization schemes and evaluate them within a fine-scale, physics model, to consider the total cumulative runoff from the land surface scheme (y^ELM_runoff) and compare it with the physics model estimates (y^ATS_runoff). This novel approach allowed us to focus on the formulations over other interacting effects that may have potentially obscured the results. We were then able to make an estimate of runoff coefficients that could be applied to improve the match between the physics-model and the land surface model. These coefficients, when applied to a small watershed located in sub-arctic Alaska, improved the land surface model (y^ELM_runoff) significantly, suggesting that this approach may allow for fine-tuning of runoff within similar systems. Overall, further research is needed to determine how flexible these coefficients are; however, this work was foundational in that it provided a novel approach to both model intercomparison as well as for model improvement.

A sensitivity analysis conducted as part of this study identified that soil properties may exert less influence on the runoff dynamics in the land surface model than snow accumulation and melt processes, as well as subsurface thermal hydrology. This finding links this component of our research to investigations into snow processes and model improvement in these same systems (e.g., Bennett et al., 2022; Clark et al., 2015). It is clear that to improve runoff processes within permafrost dominant systems, a holistic approach to understanding the system is required to determine the key processes and model components that require adjustment. This study provided an initial enquiry into this process that developed the foundation for our work towards improving the Earth system model for these Arctic systems.

5.1 Differences in simulated runoff from representative hillslopes

The apparent mismatch between y^ELM_runoff and y^ATS_runoff results from 22 representative hillslopes in the Sag River basin warrants a critical examination. However, such evaluations must consider the inherent differences between the two models. Several factors could contribute to these differences. First, a key distinction between ATS and ELM is their treatment of runoff generation processes. ATS implicitly simulates variably saturated flow, lateral subsurface flow and transport, and dynamic freeze-thaw cycles in a high-resolution 3D domain (Painter et al., 2016; Coon et al., 2020). In contrast, ELM utilizes parameterized equations to represent subgrid-scale heterogeneities, which are not expected to fully capture the complexity of permafrost hydrology (Bisht et al., 2018; Xu et al., 2024). This averaging process that reduces a 3D system into a pseudo-3D system in ATS can represent a loss of spatial variability in hydrological processes, particularly in heterogeneous permafrost regions where local-scale topography and subsurface heterogeneity play a critical role in runoff generation (Zhao and Li, 2015; Abolt et al., 2024). This simplification can also lead to discrepancies in how surface and subsurface runoff are partitioned (Liao et al., 2025), particularly in regions with high spatial variability in soil moisture, ice content, and thaw depth. For instance, ELM may under- or over-predict runoff during high precipitation events (see Fig. 5) compared with the spatially resolved, physics-based implementation in ATS.

Secondly, although y^ATS_runoff is treated as the benchmark in this study, it is not without uncertainties. The accuracy of y^ATS_runoff simulations depends on input data such as soil properties, meteorological forcings, and initial conditions, all of which contain inherent uncertainties (Harp et al., 2016; Jafarov et al., 2018; Zhang et al., 2023; Huang et al., 2022, 2024). In permafrost landscapes, soil heterogeneity is particularly difficult to characterize, and small variations in soil thermal and hydrological properties can lead to substantial differences in runoff predictions (Decharme et al., 2013; Vereecken et al., 2022).

Additionally, the accuracy of y^ELM_runoff predictions is highly dependent on the parameterization of surface hydrological processes. Although optimized coefficients have been implemented to improve agreement with y^ATS_runoff, these parameterizations may still inadequately capture the nonlinear interactions between infiltration, permafrost thaw, and lateral flow (Swenson et al., 2012; Liao et al., 2025). This issue is particularly relevant in ice-rich permafrost terrains, where abrupt changes in thaw depth and active layer dynamics can lead to nonlinear responses in runoff generation (e.g., Hinzman et al., 2022). The lack of explicit lateral flow representation in ELM further limits its ability to capture runoff redistribution processes that are well-resolved in ATS simulations.

Another factor that may contribute to the mismatch is the difference in how the two models resolve seasonal freeze/thaw processes, especially under varying precipitation and thawing conditions. Runoff generation in permafrost regions is highly sensitive to seasonal thawing and freezing dynamics, as well as precipitation regimes (e.g., Zhang et al., 2010; Guo et al., 2025). In particular, mismatches may arise during transitional periods such as spring snowmelt, when small differences in temperature and soil conditions can lead to substantial variations in runoff production.

Understanding these potential differences is crucial for interpreting the model responses and guiding future ELM model improvements. Addressing the uncertainties in ATS and refining the transformation of parameterization schemes between models could reduce these mismatches. Similarly, enhancing ELM's parameterization by incorporating insights from ATS simulations, such as better representation of lateral flow and freeze-thaw processes, could lead to improved alignment with ATS and more accurate predictions in Arctic environments under a changing climate.

5.2 Sensitivity and uncertainties of model performance to land surface parameters in ELM

The sensitivity analysis assessed the impact of various surface and subsurface parameters, including soil properties (clay, sand, and organic content) and snow thermal conductivity, on the simulated total y^ELM_runoff. Understanding the sensitivity of y^ELM_runoff simulations to these parameters is crucial for improving hydrological predictions in permafrost regions, where land surface processes interact with freeze-thaw dynamics in complex ways (Bisht et al., 2018; Walvoord and Kurylyk, 2016).

The limited sensitivity of total y^ELM_runoff to soil property variations observed in our ELM simulations raises important implications for land surface model development. Previous studies have shown that soil texture can strongly affect bidirectional water exchange between groundwater and the soil during freeze–thaw transitions (e.g., Xie et al., 2021; Huang and Rudolph, 2023; Yang et al., 2025). However, our findings suggest that these processes may have less influence on annual runoff generation in permafrost regions. This discrepancy may be explained by the dominant role of snowmelt dynamics and shallow subsurface hydrology in controlling surface runoff. In permafrost-dominated landscapes, runoff generation is often driven by the timing and intensity of snowmelt, seasonal freeze–thaw cycles, and the spatial distribution of near-surface permafrost. These factors likely outweigh the direct effects of variations in soil grain size or organic matter content (e.g., Swenson et al., 2012). The presence of an impermeable permafrost layer beneath the active layer restricts deep infiltration and causes excess water to remain near the surface, limiting the direct impact of soil properties on runoff partitioning. Similar findings have been reported in other permafrost hydrology studies, where hydraulic conductivity and soil texture exert minimal influence on runoff formation compared to freeze-thaw dynamics and snowmelt timing (e.g., Zhang et al., 1999; Walvoord and Kurylyk, 2016). However, further investigation is needed to evaluate whether subsurface water redistribution, active layer depth variability, and lateral flow dynamics could play a more significant role in influencing ELM's runoff performance.

In contrast to soil parameters, snow thermal conductivity exhibits a strong influence on simulated runoff, demonstrating its critical role in shaping hydrological responses in Arctic environments. An increase in snow thermal conductivity enhances heat transfer within the snowpack, leading to earlier and more rapid snowmelt. This, in turn, alters the seasonal timing of water availability and increases runoff magnitudes during peak melt periods (Musselman et al., 2017). Higher thermal conductivity results in faster warming of the snowpack, reducing the buffering effect of snow insulation and exposing the underlying soil to greater temperature fluctuations. This phenomenon has been observed in field studies, where changes in snow properties significantly impact the timing and magnitude of spring runoff (Würzer et al., 2016; Liljedahl et al., 2016). The results in Fig. 6 suggest that accurate representation of snow properties is essential for improving y^ELM_runoff predictions in permafrost landscapes. Over- or under-estimating snow thermal conductivity could lead to systematic biases in modeled y^ELM_runoff timing, potentially affecting the accuracy of hydrological assessments in Arctic watersheds.

Collectively, these findings reinforce the critical role of snowmelt-driven hydrological processes in shaping runoff dynamics in permafrost landscapes and illustrate key sensitivities within ELM's runoff parameterization. The results suggest that ELM's performance is particularly influenced by representations of snow accumulation and melt processes, as well as subsurface thermal hydrology. In particular, sensitivity to snow density, thermal conductivity, and freeze–thaw transitions points to the value of incorporating physically based formulations that capture snowpack variability (e.g., Bennett et al., 2022; Lackner et al., 2022; Tao et al., 2024; Wang et al., 2024) and lateral subsurface flow (e.g., Swenson et al., 2012; Liao et al., 2025). These process-level influences appear to exert a stronger control on y^ELM_runoff behavior than surface soil properties alone, underscoring their importance in cold-region hydrology and land surface modeling.

5.3 Implications for improving runoff parameterization coefficients in land surface models

Hydrological runoff-related parameters in land models are often calibrated against the observed streamflow data (e.g., Niu et al., 2007; Li et al., 2013), which can be limited or unavailable in remote permafrost regions. This study introduces a novel evaluation framework, which shifts the traditional paradigm of directly comparing coarse-scale land surface models to fine-scale physics-based models, by deriving optimized runoff coefficients by leveraging high-fidelity simulations from the integrated surface/subsurface hydrological simulators such as the ATS. These optimized coefficients are then incorporated into the land surface model, allowing for physics-informed improvements without direct site-based calibration. This approach offers a process-oriented alternative to conventional calibration, providing an avenue for improving parameterizations in data-scarce regions.

A key implication of this framework is its potential transferability across diverse Arctic watersheds. In this study, coefficients derived from the Sag River hillslopes were applied to the Teller27 watershed without additional tuning, resulting in significantly improved y^ELM_runoff performance at monthly and seasonal scales. One possible reason for this is that while these two systems are located in very different environments and some distance from each other, they are similar in that both exhibit moderately graded slopes and elevations, vegetation of grasses and shrubs, and both have some degree of permafrost (albeit discontinuous permafrost in Teller27). However, in this iteration of ELM, most of the sub-grid variability within these features are not represented, despite their importance to the influence of snowmelt runoff and other water balances (e.g., Beer, 2016; Shirley et al., 2025). This demonstration suggests that physically guided coefficients obtained through fine-scale process-resolving models may be generalized to other Arctic catchments with similar characteristics, offering a possible strategy for parameter refinement in Earth system land models at coarse scales. Such transferability is especially valuable for land model intercomparison projects that seek robust parameterizations applicable across diverse permafrost regions (e.g., Clark et al., 2015; Lawrence et al., 2019; Fan et al., 2019). However, more research is required to determine the extent of this transferability. Future applications of this approach at permafrost sites across the pan-Arctic, which is part of the next phase of this project, could further enhance the robustness and generalizability of this framework.

Our coefficient-calibration approach complements, but also differs from, the representative hillslope approach developed for the CLM (Swenson et al., 2019; Lawrence and Swenson, 2024). In CLM's representative hillslope approach, intrahillslope lateral subsurface flow is explicitly represented and scaled through representative hillslopes that account for slope, aspect, and lateral redistribution of water. This structural modification allows the model to capture lateral connectivity in a process-based manner without requiring high-resolution benchmarks. By contrast, our method leverages physics-based ATS simulations to calibrate and optimize runoff coefficients in ELM and then evaluates their transferability across sites. Whereas the CLM approach introduces structural reformulations of runoff schemes, our framework focuses on refining existing parameterizations through calibration against physics-based permafrost models. Taken together, these strategies represent complementary pathways for advancing runoff representation in permafrost regions and for bridging local-scale hydrological processes with land surface components of Earth system models. At the same time, ELM has recently developed an improved subgrid hillslope hydrologic connectivity that represents lateral water movement across topographic units (topounits) within a gridcell. This new implementation shares similarities with the CLM hillslope approach. Future evaluation of the runoff performance of that parameterization alongside our coefficient-calibration framework will be an important next step toward improving parameterization and scaling of hydrological processes in ELM. Such integration of coefficient calibration, representative hillslope formulations, and the new subgrid framework will be essential for capturing hydrological variability across diverse permafrost landscapes and for improving the predictive fidelity of Earth system land models under changing Arctic climate conditions.

6 Limitations and future work

Based on our analysis and discussion, we acknowledge several limitations that may be further improved in future studies:

Simplified watershed representation. The hillslopes used for ATS simulations in the Sag River basin are pseudo-2D variable-width simplifications of the 3D landscapes, which do not fully capture the heterogeneity of real Arctic landscapes, such as ice-wedge polygons, thermokarst features, and microtopographic variations. Future studies should incorporate many more diverse landscapes (up to hundreds of hillslope models) and ensure identical topographic representations across both models.

Transferability evaluation. The optimized runoff coefficients were derived from ATS simulations of Sag River hillslopes and then directly applied to the Teller Creek watershed without site-specific adjustments. This transferability was only evaluated at a single site (Teller27), further evaluation across diverse Arctic watersheds with long-term streamflow measurements is needed to build broader confidence in the generalizability of the approach. Moreover, because seasonal variability plays a key role in runoff generation, this approach may work reasonably well for colder Arctic regions but may be less applicable to sub-Arctic environments.

Limited consideration of lateral flow and subsurface heterogeneity. This study primarily focused on ELM's grid-scale runoff generation, neglecting lateral water movement and groundwater interactions. In Arctic environments, lateral subsurface flow can play a crucial role in redistributing water across permafrost landscapes, affecting both surface runoff and baseflow dynamics in different land units, which will be evaluated in future work.

Limited assessment of meteorological forcing biases. While the parameterization in this study is based on state variables derived from ATS simulations, rather than directly on precipitation or other forcings, model performance in applications such as the Teller Creek test can still be sensitive to uncertainties in meteorological inputs. In high-latitude regions, sparse station coverage and undercatch issues can introduce substantial uncertainty in precipitation, temperature, and radiation datasets. Future work should assess how these uncertainties propagate through the model and influence runoff simulations under different forcing scenarios.

7 Conclusions

We evaluated a land surface model's runoff parameterization using detailed fine-scale physics-based simulations of 22 hillslopes in the Sag River basin and identified empirical adjustments that improve the runoff parameterization. Seasonal optimization of these coefficients improved the model's ability to capture hydrological variability at monthly scales, particularly in snowmelt-driven runoff processes. The adjusted parameterization improved ELM simulated runoff from the Teller27 watershed. That demonstration of transferability of the adjusted parameterization is encouraging but needs further study across diverse Arctic catchments. Sensitivity analysis revealed that runoff in ELM is largely insensitive to soil properties but highly sensitive to snow thermal conductivity, underscoring the importance of accurate snow process representation in permafrost regions. These findings demonstrate the value of spatially resolved fine-scale simulators from physics-based models as benchmarks for refining land surface models and highlight the need for process-specific parameterization improvements in hydrological runoff schemes of land surface models.

Despite these advancements, challenges remain in capturing subsurface hydrological processes, including lateral flow, permafrost thaw dynamics, and active layer variability, which are critical for Arctic runoff simulations. Future improvements should focus on incorporating water redistribution within an ELM gridcell due to lateral flow, refining subgrid-scale hydrological parameterizations, and evaluating model updates across diverse Arctic catchments. By addressing these gaps, land surface models could achieve more accurate runoff predictions, ultimately enhancing their utility for climate impact assessments and water resource management in permafrost regions.

Code and data availability

All data sets used in this work are archived at the Environmental System Science Data Infrastructure for a Virtual Ecosystem (ESS-DIVE). ELM and ATS data sets will be archived here: https://doi.org/10.15485/2550570 (Huang et al., 2025). Data sets will become live once the paper is accepted. ERA5 forcing data to run the ELM model can be downloaded from the ECMWF Climate Data Store: (https://cds.climate.copernicus.eu/, last access: 18 November 2024). ATS forcings data can be retrieved from the Daymet version 4 dataset (https://doi.org/10.3334/ORNLDAAC/1850, Thornton et al., 2020). Streamflow data was derived from https://doi.org/10.5440/1618330 (Busey et al., 2019) and a log-in account is required to access the data. Data sets are in the transfer process between the NGEE Arctic data portal and the ESS DIVE repository (https://doi.org/10.15485/2550570, Huang et al., 2025). The description and codes of E3SM v3.0 (including ELM v3.0) are publicly available at https://doi.org/10.11578/E3SM/dc.20240301.3 (E3SM Project (DOE), 2024) and https://github.com/E3SM-Project/E3SM/releases/tag/v3.0.0 (last access: 18 November 2024), respectively.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gmd-19-1193-2026-supplement.

Author contributions

The study was conceptualized and designed by YZ, SP, XH and KB. XH completed the data analysis, visualization, and the original draft. All authors contributed to editing the manuscript.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The authors gratefully acknowledge Ian Shirley and Baptiste Dafflon for their support with data curation at the Teller Creek watershed. We also thank Peter Thornton for his technical assistance with the implementation and interpretation of the E3SM-ELM model. This work is financially supported by the Next Generation Ecosystem Experiment (NGEE) Arctic project from the Office of Biological and Environmental Research in the U.S. Department of Energy's Office of Science. The authors appreciate constructive suggestions and comments from anonymous reviewers.

Financial support

This research has been supported by the Biological and Environmental Research (NGEE-Arctic).

Review statement

This paper was edited by Yongze Song and reviewed by Mousong Wu and two anonymous referees.

References

Abolt, C. J., Young, M. H., Atchley, A. L., Harp, D. R., and Coon, E. T.: Feedbacks between surface deformation and permafrost degradation in ice wedge polygons, Arctic Coastal Plain, Alaska, Journal of Geophysical Research: Earth Surface, 125, e2019JF005349, https://doi.org/10.1029/2019JF005349, 2020.

Abolt, C. J., Atchley, A. L., Harp, D. R., Jorgenson, M. T., Witharana, C., Bolton, W. R., Schwenk, J., Rettelbach, T., Grosse, G., Boike, J., Nitze, I., Liljedahl, A. K., Rumpca, C. T., Wilson, C. J., and Bennett, K. E.: Topography controls variability in circumpolar permafrost thaw pond expansion, Journal of Geophysical Research: Earth Surface, 129, e2024JF007675, https://doi.org/10.1029/2024JF007675, 2024.

Abdelhamed, M. S., Razavi, S., Elshamy, M. E., and Wheater, H. S.: Assessment of a hydrologic-land surface model to simulate thermo-hydrologic evolution of permafrost regions, Journal of Hydrology, 645, 132161, https://doi.org/10.1016/j.jhydrol.2024.132161, 2024.

Atchley, A. L., Painter, S. L., Harp, D. R., Coon, E. T., Wilson, C. J., Liljedahl, A. K., and Romanovsky, V. E.: Using field observations to inform thermal hydrology models of permafrost dynamics with ATS (v0.83), Geoscientific Model Development, 8, 2701–2722, https://doi.org/10.5194/gmd-8-2701-2015, 2015.

Bennett, K. E., Miller, G., Busey, R., Chen, M., Lathrop, E. R., Dann, J. B., Nutt, M., Crumley, R., Dillard, S. L., Dafflon, B., Kumar, J., Bolton, W. R., Wilson, C. J., Iversen, C. M., and Wullschleger, S. D.: Spatial patterns of snow distribution in the sub-Arctic, The Cryosphere, 16, 3269–3293, https://doi.org/10.5194/tc-16-3269-2022, 2022.

Bennett, K. E., Schwenk, J., Bachand, C., Gasarch, E., Stachelek, J., Bolton, W. R., and Rowland, J. C.: Recent streamflow trends across permafrost basins of North America, Frontiers in Water, 5, 1099660, https://doi.org/10.3389/frwa.2023.1099660, 2023.

Beer, C.: Permafrost sub-grid heterogeneity of soil properties key for 3-D soil processes and future climate projections, Frontiers in Earth Science, 4, https://doi.org/10.3389/feart.2016.00081, 2016.

Beven, K. J. and Kirkby, M. J.: A physically based, variable contributing area model of basin hydrology, Hydrological Sciences Journal, 24, 43–69, 1979.

Bisht, G., Riley, W. J., Hammond, G. E., and Lorenzetti, D. M.: Development and evaluation of a variably saturated flow model in the global E3SM Land Model (ELM) version 1.0, Geoscientific Model Development, 11, 4085–4102, https://doi.org/10.5194/gmd-11-4085-2018, 2018.

Boone, A., Habets, F., Noilhan, J., Clark, D., Dirmeyer, P., Fox, S., Gusev, Y., Haddeland, I., Koster, R., Lohmann, D., Mahanama, S., Mitchell, K., Nasonova, O., Niu, G.-Y., Pitman, A., Polcher, J., Shmakin, A. B., Tanaka, K., van den Hurk, B., Vérant, S., Verseghy, D., Viterbo, P., and Yang, Z. L.: The Rhone-aggregation land surface scheme intercomparison project: An overview, Journal of Climate, 17, 187–208, 2004.

Busey, B., Wales, N., Newman, B., Wilson, C., and Bolton, B.: Surface Water: Stage, Temperature and Discharge, Teller Road Mile Marker 27, Seward Peninsula, Alaska, beginning 2016 (No. NGA185). Next-Generation Ecosystem Experiments (NGEE) Arctic, ESS-DIVE repository [data set], https://doi.org/10.5440/1618330, 2019.

Bring, A., Fedorova, I., Dibike, Y., Hinzman, L., Mård, J., Mernild, S. H., Prowse, T., Semenova, O., Stuefer, S., and Woo, M. K.: Arctic terrestrial hydrology: A synthesis of processes, regional effects, and research challenges, Journal of Geophysical Research: Biogeosciences, 121, 621–649, 2016.

Clark, M. P., Fan, Y., Lawrence, D. M., Adam, J. C., Bolster, D., Gochis, D. J., Hooper, R. P., Kumar, M., Ruby Leung, L., Scott Mackay, D., Maxwell, R. M., Shen, C., Swenson, S. C., and Zeng, X.: Improving the representation of hydrologic processes in Earth System Models, Water Resources Research, 51, 5929–5956, 2015.

Clark, M. P., Bierkens, M. F. P., Samaniego, L., Woods, R. A., Uijlenhoet, R., Bennett, K. E., Pauwels, V. R. N., Cai, X., Wood, A. W., and Peters-Lidard, C. D.: The evolution of process-based hydrologic models: historical challenges and the collective quest for physical realism, Hydrol. Earth Syst. Sci., 21, 3427–3440, https://doi.org/10.5194/hess-21-3427-2017, 2017.

Collier, N., Hoffman, F. M., Lawrence, D. M., Keppel-Aleks, G., Koven, C. D., Riley, W. J., Mu, M., and Randerson, J. T.: The International Land Model Benchmarking (ILAMB) system: design, theory, and implementation, Journal of Advances in Modeling Earth Systems, 10, 2731–2754, 2018.

Coon, E. T., Moulton, J. D., Kikinzon, E., Berndt, M., Manzini, G., Garimella, R., Lipnikov, K., and Painter, S. L.: Coupling surface flow and subsurface flow in complex soil structures using mimetic finite differences, Advances in Water Resources, 144, 103701, https://doi.org/10.1016/j.advwatres.2020.103701, 2020.

Coon, E., Berndt, M., Jan, A., Svyatsky, D., Atchley, A., Kikinzon, E., Harp, D., Manzini, G., Shelef, E., Lipnikov, K., Garimella, R., Xu, C., Moulton, D., Karra, S., Painter, S., Jafarov, E., and Molins, S.: Advanced Terrestrial Simulator (ATS) v1.2.0 (1.2.0), Zenodo [code], https://doi.org/10.5281/zenodo.6679807, 2022.

Decharme, B., Martin, E., and Faroux, S.: Reconciling soil thermal and hydrological lower boundary conditions in land surface models, Journal of Geophysical Research: Atmospheres, 118, 7819–7834, 2013.

E3SM Project (DOE): Energy Exascale Earth System Model v3.0.0, DOE [software], https://doi.org/10.11578/E3SM/dc.20240301.3, 2024.

Fan, Y., Clark, M., Lawrence, D. M., Swenson, S., Band, L. E., Brantley, S. L., Brooks, P. D., Dietrich, W. E., Flores, A., Grant, G., Kirchner, J. W., Mackay, D. S., McDonnell, J. J., Milly, P. C. D., Sullivan, P. L. Tague, C., Ajami, H., Chaney, N., Hartmann, A., Hazenberg, P., McNamara, J., Pelletier, J., Perket, J., Rouholahnejad-Freund, E., Wagener, T., Zeng, X., Beighley, E., Buzan, J., Huang, M., Livneh, B., Mohanty, B. P., Nijssen, B., Safeeq, M., Shen, C., van Verseveld, W., Volk, J., and Yamazaki, D.: Hillslope hydrology in global change research and earth system modelling, Water Resources Research, 55, 1737–1772, 2019.

Gao, B. and Coon, E. T.: Evaluating simplifications of subsurface process representations for field-scale permafrost hydrology models, The Cryosphere, 16, 4141–4162, https://doi.org/10.5194/tc-16-4141-2022, 2022.

Guo, L., Wang, G., Song, C., Sun, S., Li, J., Li, K., Huang, P., and Ma, J.: Hydrological changes caused by integrated warming, wetting, and greening in permafrost regions of the Qinghai-Tibetan Plateau, Water Resources Research, 61, e2024WR038465, https://doi.org/10.1029/2024WR038465, 2025.

Hamm, A. and Frampton, A.: Impact of lateral groundwater flow on hydrothermal conditions of the active layer in a high-Arctic hillslope setting, The Cryosphere, 15, 4853–4871, https://doi.org/10.5194/tc-15-4853-2021, 2021.

Hao, D., Bisht, G., Huang, M., Ma, P. L., Tesfa, T., Lee, W. L., Gu, Y., and Leung, L. R.: Impacts of sub-grid topographic representations on surface energy balance and boundary conditions in the E3SM land model: A case study in Sierra Nevada, Journal of Advances in Modeling Earth Systems, 14, e2021MS002862, https://doi.org/10.1029/2021MS002862, 2022.

Harp, D. R., Atchley, A. L., Painter, S. L., Coon, E. T., Wilson, C. J., Romanovsky, V. E., and Rowland, J. C.: Effect of soil property uncertainties on permafrost thaw projections: a calibration-constrained analysis, The Cryosphere, 10, 341–358, https://doi.org/10.5194/tc-10-341-2016, 2016.

Hinzman, A. M., Sjöberg, Y., Lyon, S., Schaap, P., and van der Velde, Y.: Using a mechanistic model to explain the rising non-linearity in storage discharge relationships as the extent of permafrost decreases in Arctic catchments, Journal of Hydrology, 612, 128162, https://doi.org/10.1016/j.jhydrol.2022.128162, 2022.

Holmes, R. M., Coe, M. T., Fiske, G. J., Gurtovaya, T., McClelland, J. W., Shiklomanov, A. I., Spencer, R. G. M., Tank, S. E., and Zhulidov, A. V.: Climate change impacts on the hydrology and biogeochemistry of Arctic rivers, Climatic change and global warming of inland waters, 26 pp., https://doi.org/10.1002/9781118470596, 2013.

Hou, Y., Guo, H., Yang, Y., and Liu, W.: Global evaluation of runoff simulation from climate, hydrological and land surface models, Water Resources Research, 59, e2021WR031817, https://doi.org/10.1029/2021WR031817, 2023.

Huang, X. and Rudolph, D. L.: Numerical study of coupled water and vapor flow, heat transfer, and solute transport in variably-saturated deformable soil during freeze-thaw cycles, Water Resources Research, 59, e2022WR032146, https://doi.org/10.1029/2022WR032146, 2023.

Huang, X., Rudolph, D. L., and Glass, B.: A coupled thermal-hydraulic-mechanical approach to modeling the impact of roadbed frost loading on water main failure, Water Resources Research, 58, e2021WR030933, https://doi.org/10.1029/2021WR030933, 2022.

Huang, X., Abolt, C. J., and Bennett, K. E.: How does humidity data impact land surface modeling of hydrothermal regimes at a permafrost site in Utqiaġvik, Alaska?, Science of the Total Environment, 912, 168697, https://doi.org/10.1016/j.scitotenv.2023.168697, 2024.

Huang, X., Demir, C., Gao, B., Zhang, Y., Abolt, C., Crumley, R., Fiorella, R., Busey, B., Bolton, B., Painter, S., and Bennett, K. E.: Data Files for Runoff Evaluation in an Earth System Land Model for Permafrost Regions. Next-Generation Ecosystem Experiments (NGEE) Arctic, ESS-DIVE repository [data set], https://doi.org/10.15485/2550570, 2025.

Jafarov, E. E., Coon, E. T., Harp, D. R., Wilson, C. J., Painter, S. L., Atchley, A. L., and Romanovsky, V. E.: Modeling the role of preferential snow accumulation in through talik development and hillslope groundwater flow in a transitional permafrost landscape, Environmental Research Letters, 13, 105006, https://doi.org/10.1088/1748-9326/aadd30, 2018.

Jan, A. and Painter, S. L.: Permafrost thermal conditions are sensitive to shifts in snow timing, Environmental Research Letters, 15, 084026, https://doi.org/10.1088/1748-9326/ab8ec4, 2020.

Jan, A., Coon, E. T., and Painter, S. L.: Evaluating integrated surface/subsurface permafrost thermal hydrology models in ATS (v0.88) against observations from a polygonal tundra site, Geoscientific Model Development, 13, 2259–2276, https://doi.org/10.5194/gmd-13-2259-2020, 2020.

Kuchment, L. S., Gelfan, A. N., and Demidov, V. N.: A distributed model of runoff generation in the permafrost regions, Journal of Hydrology, 240, 1–22, 2000.

Lackner, G., Domine, F., Nadeau, D. F., Lafaysse, M., and Dumont, M.: Snow properties at the forest–tundra ecotone: predominance of water vapor fluxes even in deep, moderately cold snowpacks, The Cryosphere, 16, 3357–3373, https://doi.org/10.5194/tc-16-3357-2022, 2022.

Lawrence, D. M. and Swenson, S. C.: Improving terrestrial hydrologic process representation in Earth System Models: Accounting for slope, aspect, and lateral water transfer through representative hillslopes, in: AGU Fall Meeting Abstracts, Vol. 2024, H13R-03, https://agu.confex.com/agu/agu24/meetingapp.cgi/Paper/1685795 (last access: 30 September 2025), 2024.

Lawrence, D. M., Hurtt, G. C., Arneth, A., Brovkin, V., Calvin, K. V., Jones, A. D., Jones, C. D., Lawrence, P. J., de Noblet-Ducoudré, N., Pongratz, J., Seneviratne, S. I., and Shevliakova, E.: The Land Use Model Intercomparison Project (LUMIP) contribution to CMIP6: rationale and experimental design, Geoscientific Model Development, 9, 2973–2998, https://doi.org/10.5194/gmd-9-2973-2016, 2016.

Lawrence, D. M., Fisher, R. A., Koven, C. D., Oleson, K. W., Swenson, S. C., Bonan, G., Collier, N., Ghimire, B., van Kampenhout, L., Kennedy, D., Kluzek, E., Lawrence, P. J., Li, F., Li, Vertenstein, M., Wieder, W. R., Xu, C. G., Ali, A. A., Badger, A. M., Bisht, G., van den Broeke, M., Brunke, M. A., Burns, S. P., Buzan, J., Clark, M., Craig, A., Dahlin, K., Drewniak, B., Fisher, J. B., Flanner, M., Fox, A. M., Gentine, P., Hoffman, F., Keppel-Aleks, G., Knox, R., Kumar, S., Lenaerts, J., Leung, L. R., Lipscomb, W. H., Lu, Y. Q., Pandey, A., Pelletier, J. D., Perket, J., Randerson, J. T., Ricciuto, D. M., Sanderson, B. M., Slater, A., Subin, Z. M., Tang, J. Y., Thomas, R. Q., Martin, M. V., and Zeng, X. B.: The Community Land Model Version 5: Description of New Features, Benchmarking, and Impact of Forcing Uncertainty, Journal of Advances in Modeling Earth Systems, 11, 4245–4287, 2019.

Léger, E., Dafflon, B., Robert, Y., Ulrich, C., Peterson, J. E., Biraud, S. C., Romanovsky, V. E., and Hubbard, S. S.: A distributed temperature profiling method for assessing spatial variability in ground temperatures in a discontinuous permafrost region of Alaska, The Cryosphere, 13, 2853–2867, https://doi.org/10.5194/tc-13-2853-2019, 2019.

Liang, X. and Xie, Z.: A new surface runoff parameterization with subgrid-scale soil heterogeneity for land surface models, Advances in Water Resources, 24, 1173–1193, 2001.

Liao, C., Leung, L. R., Fang, Y., Tesfa, T., and Negron-Juarez, R.: Representing lateral groundwater flow from land to river in Earth system models, Geoscientific Model Development, 18, 4601–4624, https://doi.org/10.5194/gmd-18-4601-2025, 2025.

Li, H., Huang, M., Wigmosta, M. S., Ke, Y., Coleman, A. M., Leung, L. R., Wang, A., and Ricciuto, D. M.: Evaluating runoff simulations from the Community Land Model 4.0 using observations from flux towers and a mountainous watershed, Journal of Geophysical Research: Atmospheres, 116, https://doi.org/10.1029/2011JD016276, 2011.

Li, H., Wigmosta, M. S., Wu, H., Huang, M., Ke, Y., Coleman, A. M., and Leung, L. R.: A physically based runoff routing model for land surface and Earth system models, Journal of Hydrometeorology, 14, 808–828, 2013.

Li, J., Gan, Y., Zhang, G., Gou, J., Lu, X., and Miao, C.: Quantifying the interactions of Noah-MP land surface processes on the simulated runoff over the Tibetan Plateau, Journal of Geophysical Research: Atmospheres, 129, e2023JD040343, https://doi.org/10.1029/2023JD040343, 2024.

Liljedahl, A. K., Boike, J., Daanen, R. P., Fedorov, A. N., Frost, G. V., Grosse, G., Hinzman, L. D., Iijma, Y., Jorgenson, J. C., Matveyeva, N., Necsoiu, M., Raynolds, M. K., Romanovsky, V. E., Schulla, J., Tape, K. D., Walker, D. A., Wilson, C. J., Yabuki, H., and Zona, D.: Pan-Arctic ice-wedge degradation in warming permafrost and its influence on tundra hydrology, Nature Geoscience, 9, 312–318, 2016.

Lique, C., Holland, M. M., Dibike, Y. B., Lawrence, D. M., and Screen, J. A.: Modeling the Arctic freshwater system and its integration in the global system: Lessons learned and future challenges, Journal of Geophysical Research: Biogeosciences, 121, 540–566, 2016.

Maxwell, R. M., Putti, M., Meyerhoff, S., Delfs, J.-O., Ferguson, I. M., Ivanov, V., Kim, J., Kolditz, O., Kollet, S. J., Kumar, M., Lopez, S., Niu, J., Paniconi, C., Park, Y.-J., Phanikumar, M. S., Shen, C., Sudicky, E. A., and Sulis, M.: Surface-subsurface model intercomparison: A first set of benchmark results to diagnose integrated hydrology and feedbacks, Water Resources Research, 50, 1531–1549, 2014.

Moriasi, D. N., Arnold, J. G., Van Liew, M. W., Bingner, R. L., Harmel, R. D., and Veith, T. L.: Model evaluation guidelines for systematic quantification of accuracy in watershed simulations, Transactions of the ASABE, 50, 885–900, 2007.

Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth System Science Data, 13, 4349–4383, https://doi.org/10.5194/essd-13-4349-2021, 2021.

Musselman, K. N., Clark, M. P., Liu, C., Ikeda, K., and Rasmussen, R.: Slower snowmelt in a warmer world, Nature Climate Change, 7, 214–219, 2017.

Mwanthi, A. M., Mutemi, J. N., Dyer, E., James, R., Opijah, F. J., Webb, T., Mutua, F., Washington, R., Senior, C., Segele, Z., and Artan, G.: Representation of land–atmosphere coupling processes over Africa in coupled model intercomparison project Phase 6, Climate Dynamics, 62, 8389–8401, 2024.

Niu, G. Y., Yang, Z. L., Dickinson, R. E., and Gulden, L. E.: A simple TOPMODEL-based runoff parameterization (SIMTOP) for use in global climate models, Journal of Geophysical Research: Atmospheres, 110, 1–15, 2005.

Niu, G. Y., Yang, Z. L., Dickinson, R. E., Gulden, L. E., and Su, H.: Development of a simple groundwater model for use in climate models and evaluation with Gravity Recovery and Climate Experiment data, Journal of Geophysical Research: Atmospheres, 112, 1–14, 2007.

O'Connor, M. T., Cardenas, M. B., Ferencz, S. B., Wu, Y., Neilson, B. T., Chen, J., and Kling, G. W.: Empirical models for predicting water and heat flow properties of permafrost soils, Geophysical Research Letters, 47, e2020GL087646, https://doi.org/10.1029/2020GL087646, 2020.

Oleson, K. W., Lawrence, D. M., Bonan, G. B., Fisher, R. A., Lawrence, P. J., and Muszala, S. P.: Technical description of version 4.5 of the Community Land Model (CLM), Technical description of version 4.5 of the Community Land Model (CLM) (2013) NCAR/TN-503+ STR, 503, https://doi.org/10.5065/D6RR1W7M, 2013.

Painter, S. L., Coon, E. T., Atchley, A. L., Berndt, M., Garimella, R., Moulton, J. D., Wilson, C. J., and Wilson, C. J.: Integrated surface/subsurface permafrost thermal hydrology: Model formulation and proof-of-concept simulations, Water Resources Research, 52, 6062–6077, 2016.

Painter, S. L., Coon, E. T., Khattak, A. J., and Jastrow, J. D.: Drying of tundra landscapes will limit subsidence-induced acceleration of permafrost thaw, Proceedings of the National Academy of Sciences, 120, e2212171120, https://doi.org/10.1073/pnas.2212171120, 2023.

Schädel, C., Rogers, B. M., Lawrence, D. M., Koven, C. D., Brovkin, V., Burke, E. J., Genet, H., Huntzinger, D. N., Jafarov, E., McGuire, A. D., Riley, W. J., and Natali, S. M.: Earth system models must include permafrost carbon processes, Nature Climate Change, 14, 114–116, 2024.

Shi, X., Wang, Y., Mao, J., Thornton, P. E., Riccuito, D. M., Hoffman, F. M., and Hao, Y.: Quantifying the long-term changes of terrestrial water storage and their driving factors, Journal of Hydrology, 635, 131096, https://doi.org/10.1016/j.jhydrol.2024.131096, 2024.

Shirley, I., Uhlemann, S., Peterson, J., Bennett, K., Hubbard, S. S., and Dafflon, B.: Disentangling the impacts of microtopography and shrub distribution on snow depth in a subarctic watershed: Toward a predictive understanding of snow spatial variability, Journal of Geophysical Research: Biogeosciences, 130, e2024JG008604, https://doi.org/10.1029/2024JG008604, 2025.

Sinha, E., Calvin, K. V., Bond-Lamberty, B., Drewniak, B. A., Ricciuto, D. M., Sargsyan, K., Cheng, Y., Carl Bernacchi, C., and Moore, C. E.: Modeling perennial bioenergy crops in the E3SM land model (ELMv2), Journal of Advances in Modeling Earth Systems, 15, e2022MS003171, https://doi.org/10.1029/2022MS003171, 2023.

Sjöberg, Y., Coon, E., K. Sannel, A. B., Pannetier, R., Harp, D., Frampton, A., Painter, S. C., and Lyon, S. W.: Thermal effects of groundwater flow through subarctic fens: A case study based on field observations and numerical modelling, Water Resources Research, 52, 1591–1606, 2016.

Swenson, S. C., Lawrence, D. M., and Lee, H.: Improved simulation of the terrestrial hydrological cycle in permafrost regions by the Community Land Model, Journal of Advances in Modeling Earth Systems, 4, 1–15, 2012.

Swenson, S. C., Clark, M., Fan, Y., Lawrence, D. M., and Perket, J.: Representing intrahillslope lateral subsurface flow in the community land model, Journal of Advances in Modeling Earth Systems, 11, 4044–4065, 2019.

Tao, J., Riley, W. J., and Zhu, Q.: Evaluating the impact of peat soils and snow schemes on simulated active layer thickness at pan-Arctic permafrost sites, Environmental Research Letters, 19, 054027, https://doi.org/10.1088/1748-9326/ad38ce, 2024.

Tesfa, T. K. and Leung, L.-Y. R.: Exploring new topography-based subgrid spatial structures for improving land surface modeling, Geoscientific Model Development, 10, 873–888, https://doi.org/10.5194/gmd-10-873-2017, 2017.

Thaler, E. A., Uhleman, S., Rowland, J. C., Schwenk, J., Wang, C., Dafflon, B., and Bennett, K. E.: High-Resolution Maps of Near-Surface Permafrost for Three Watersheds on the Seward Peninsula, Alaska Derived From Machine Learning, Earth and Space Science, 10, e2023EA003015, https://doi.org/10.1029/2023EA003015, 2023.

Thornton, M. M., Wei, Y., Thornton, P. E., Shrestha, R., Kao, S., and Wilson, B. E.: Daymet: Station-Level Inputs and Cross Validation Result for North America, Version 4, Oak Ridge, Tennessee, USA, ORNL DAAC [data set], https://doi.org/10.3334/ORNLDAAC/1850, 2020.

Thornton, P. E. and Rosenbloom, N. A.: Ecosystem model spin-up: Estimating steady state conditions in a coupled terrestrial carbon and nitrogen cycle model, Ecological Modelling, 189, 25–48, 2005.

Vereecken, H., Amelung, W., Bauke, S. L., Bogena, H., Brüggemann, N., Montzka, C., Vanderborght, J., Bechtold, M., Blöschl, G., Carminati, A., Javaux, M., Konings, A. G., Kusche, J., Neuweiler, I., Or, D., Steele-Dunne, S., Verhoef, A., Young, M., and Zhang, Y.: Soil hydrology in the Earth system, Nature Reviews Earth & Environment, 3, 573–587, 2022.

Walvoord, M. A. and Kurylyk, B. L.: Hydrologic impacts of thawing permafrost – A review, Vadose Zone Journal, 15, https://doi.org/10.2136/vzj2016.01.0010, 2016.

Wang, C., Shirley, I., Wielandt, S., Lamb, J., Uhlemann, S., Breen, A., Busey, R. C., Bolton, W. R., Hubbard, S., and Dafflon, B.: Local-scale heterogeneity of soil thermal dynamics and controlling factors in a discontinuous permafrost region, Environmental Research Letters, 19, 034030, https://doi.org/10.1088/1748-9326/ad27bb, 2024.

Würzer, S., Jonas, T., Wever, N., and Lehning, M.: Influence of initial snowpack properties on runoff formation during rain-on-snow events, Journal of Hydrometeorology, 17, 1801–1815, 2016.

Xie, H.-Y., Jiang, X.-W., Tan, S.-C., Wan, L., Wang, X.-S., Liang, S.-H., and Zeng, Y.: Interaction of soil water and groundwater during the freezing–thawing cycle: field observations and numerical modeling, Hydrology and Earth System Sciences, 25, 4243–4257, https://doi.org/10.5194/hess-25-4243-2021, 2021.

Xu, D., Bisht, G., Sargsyan, K., Liao, C., and Leung, L. R.: Using a surrogate-assisted Bayesian framework to calibrate the runoff-generation scheme in the Energy Exascale Earth System Model (E3SM) v1, Geoscientific Model Development, 15, 5021–5043, https://doi.org/10.5194/gmd-15-5021-2022, 2022.

Xu, D., Bisht, G., Tan, Z., Liao, C., Zhou, T., Li, H.-Y., and Leung, L. R.: Disentangling the hydrological and hydraulic controls on streamflow variability in Energy Exascale Earth System Model (E3SM) V2 – a case study in the Pantanal region, Geoscientific Model Development, 17, 1197–1215, https://doi.org/10.5194/gmd-17-1197-2024, 2024.

Yang, D. and Kane, D. L. (Eds.): Arctic hydrology, permafrost and ecosystems, Springer Nature, https://doi.org/10.1007/978-3-030-50930-9, 2020.

Yang, J., Zhang, R., Li, X., Wang, X., Dyck, M., Wang, L., Wu, Q., and He, H.: Soil property changes following a thaw-induced mass movement event in the permafrost region of the Qinghai–Tibetan Plateau, Catena, 252, 108850, https://doi.org/10.1016/j.catena.2025.108850, 2025.

Zhang, T., Barry, R. G., Knowles, K., Heginbottom, J. A., and Brown, J.: Statistics and characteristics of permafrost and ground-ice distribution in the Northern Hemisphere, Polar Geography, 23, 132–154, 1999.

Zhang, Y., Carey, S. K., and Quinton, W. L.: Evaluation of the algorithms and parameterizations for ground thawing and freezing simulation in permafrost regions, Journal of Geophysical Research: Atmospheres, 115, D22107, https://doi.org/10.1029/2007JD009343, 2010.

Zhang, Y., Jafarov, E., Piliouras, A., Jones, B., Rowland, J. C., and Moulton, J. D.: The thermal response of permafrost to coastal floodplain flooding, Environmental Research Letters, 18, 035004, https://doi.org/10.1088/1748-9326/acba32, 2023.

Zhao, W. and Li, A.: A review on land surface processes modelling over complex terrain, Advances in Meteorology, 2015, 607181, https://doi.org/10.1155/2015/607181, 2015.

Zheng, D., Van Der Velde, R., Su, Z., Wen, J., and Wang, X.: Assessment of Noah land surface model with various runoff parameterizations over a Tibetan River, Journal of Geophysical Research: Atmospheres, 122, 1488–1504, 2017.

Articles

Short summary

Predicting hydrological runoff in Arctic permafrost regions is difficult due to limited observations and complex terrain. We used a detailed physics-based model simulations to improve runoff estimates in a Earth system land model. Our method improved runoff accuracy and worked well across two different Arctic regions. This helps make runoff parameterization schemes more reliable for understanding water flow in permafrost areas under a changing climate.