<span style="" class="text typewriter">smash</span> v1.0: a differentiable and regionalizable high-resolution hydrological modeling and data assimilation framework

Colleoni, François; Huynh, Ngo Nghi Truyen; Garambois, Pierre-André; Jay-Allemand, Maxime; Organde, Didier; Renard, Benjamin; De Fournas, Thomas; El Baz, Apolline; Demargne, Julie; Javelle, Pierre

doi:https://doi.org/10.5194/gmd-18-7003-2025

Articles | Volume 18, issue 19

https://doi.org/10.5194/gmd-18-7003-2025

Articles | Volume 18, issue 19

Model description paper

10 Oct 2025

Model description paper |

| 10 Oct 2025

smash v1.0: a differentiable and regionalizable high-resolution hydrological modeling and data assimilation framework

François Colleoni, Ngo Nghi Truyen Huynh, Pierre-André Garambois, Maxime Jay-Allemand, Didier Organde, Benjamin Renard, Thomas De Fournas, Apolline El Baz, Julie Demargne, and Pierre Javelle

Abstract

The smash software is a differentiable and regionalizable framework enabling modular high-resolution hydrological modeling and data assimilation, from catchment to regional and country scales, for water research and operational applications. smash combines various process-based conceptual operators for vertical and lateral flows, which can be hybridized with a descriptor-to-parameter neural network for regionalization. smash features an efficient, differentiable Fortran solver using Tapenade to automatically derive the adjoint model that supports CPU forward–inverse parallel computing and spatially distributed optimization of large parameter vectors thanks to an accurate cost gradient, interfaced in Python using f90wrap. This article presents smash algorithms and their open-source code, documentation, and tutorials. It highlights foundational research, benchmarking on state-of-the-art datasets, and readiness for scientific and operational use. To ensure reproducibility, open-source datasets are used to demonstrate the main functionalities of smash, including parallel computation performances and the application of multiple spatially distributed conceptual model structures over a large catchment sample. These functionalities include uniform or spatially distributed calibration and regionalization by learning the relation between descriptors and parameters. The provided Python tool allows application to any other catchment from globally available datasets. Using CAMELS, as per recent articles, a median Kling–Gupta efficiency (KGE)>0.8 is obtained in local spatially distributed calibration for daily Génie Rural (GR)-like and variable infiltration capacity (VIC)-like model structures at $d x = 1^{'} 30^{''}$ (∼3 km) and KGE>0.6 in spatiotemporal validation in a regionalization context. The regionalization of a high-resolution hourly GR-like model structure at dx=500 m over a difficult Mediterranean flash-flood-prone case results in a Nash–Sutcliffe efficiency (NSE)>0.6 in spatiotemporal validation. The proposed differentiable and regionalizable spatially distributed modeling framework is designed for gradient-based variational data assimilation, applicable to initial state (not shown) and parameter estimation at multiple timescales, and is intended for collaborative research and operational applications. Additionally, smash supports the implementation of other differentiable hydrological and hydraulic models, as well as hybrid physics–AI models, further enhancing its versatility and applicability.

Download & links

Article (PDF, 8719 KB)

Download & links

How to cite.

Received: 13 Feb 2025 – Discussion started: 12 Mar 2025 – Revised: 18 Jul 2025 – Accepted: 21 Jul 2025 – Published: 10 Oct 2025

1 Introduction

Hydrological models are indispensable tools for understanding the functioning of hydrosystems, flood and low-flow forecasting, sustainable water management and infrastructure design, environmental protection, and adaptation to a changing climate. Indeed, measurements of hydrological responses are not ubiquitously available (e.g., Beven, 2011), while “everywhere relevant” (Bierkens et al., 2015) estimation of hydrological state fluxes is expected. A model is hence needed to extend and predict those quantities of interest based on available data.

High-resolution spatial datasets have become increasingly accessible, often on a global scale, and enable the description of topography–soil–vegetation properties and atmospheric variables. Examples include the ECMWF atmospheric reanalysis version 5 (ERA5) (Hersbach et al., 2020) and Multi-Source Weighted-Ensemble Precipitation (MSWEP) rainfall product (Beck et al., 2019), flow directions IHU (Eilander et al., 2021) from MERIT terrain elevations (Yamazaki et al., 2017), the SoilGrids pedology (Hengl et al., 2017), and daily discharge from Caravan-CAMELS (Kratzert et al., 2023; Addor et al., 2017), which are used hereafter. Such data can be directly exploited by grid-based spatially distributed hydrological models, whose development at “hyper-resolution” (1 km² or finer) is recognized as a “grand challenge for hydrology” to address water problems facing society (Wood et al., 2011; Bierkens et al., 2015).

Hydrological responses result from combined nonlinear vertical and lateral physical processes occurring at multiple scales in the critical zone, and their limited observability (e.g., Beven, 1989; Milly, 1994; Blöschl and Sivapalan, 1995; Refsgaard, 1997; Vereecken et al., 2019) makes hydrological modeling uncertain and difficult (e.g., Liu and Gupta, 2007)). In the absence of directly exploitable first principles in hydrology (e.g., Dooge, 1986), as opposed to flow mechanistic equations in continuous media such as river hydraulics, meteorology, or oceanography, and given the high heterogeneities of continental hydrosystem compartments and the lack of “scale-relevant theories” (Beven, 1987), process-based hydrological models generally include a certain amount of empiricism. This represents an avenue for the fusion of data assimilation (DA) and uncertainty quantification (UQ) with machine learning (ML) and deep learning (DL) techniques to better exploit the informative richness of multi-source data.

The differentiability of the forward numerical model is a key enabler for gradient-based optimization of high-dimensional parameter vectors, for example, in variational data assimilation for 1D or 2D hydraulic models (Monnier et al., 2016; Brisset et al., 2018) or in spatialized hydrology (Castaings et al., 2009; Jay-Allemand et al., 2020). While differentiability may appear unnecessary for simple lumped hydrological models with only a few parameters, where sampling-based calibration or gradient-free methods remain efficient, the situation changes drastically for spatially distributed models involving thousands of parameters. In such high-dimensional settings, exhaustive sampling becomes computationally infeasible. Numerical differentiability enables the computation of accurate gradients of the cost function or model outputs with respect to high-dimensional parameters, thereby facilitating the use of efficient gradient-based optimization methods. This is particularly important when coupling physical models with neural networks requiring accurate gradients, as demonstrated in recent work on learnable regionalization (Huynh et al., 2024 b) and internal flux correction (Huynh et al., 2024 a), with large-scale evaluations in Huynh et al. (2025). These approaches rely on numerically differentiable solvers and accurate gradients, enabling thousands of parameters to be trained effectively. This perspective aligns with Shen et al. (2023), who emphasize the importance and potential of differentiable modeling in geosciences, highlighting how it can enhance learning, inference, and integration of physical knowledge within hybrid modeling frameworks.

The “resolution–complexity continuum” (Clark et al., 2017) has been explored over the past 5 decades through various modeling approaches, ranging from point-scale processes numerically integrated at larger scales to spatially lumped representations of system responses (Hrachowitz and Clark, 2017). Among the diverse hydrological models and their underlying hypotheses, components generally describe water storage and transfer (e.g., Fenicia et al., 2011) through various combinations and parameterizations of vertical and lateral storage-flux operators. Several model comparison experiments have analyzed differences between various modeling approaches, evaluating performance in terms of streamflow modeling (Perrin et al., 2001; Reed et al., 2004; Duan et al., 2006; Orth et al., 2015) and internal states such as soil moisture (Orth et al., 2015; Bouaziz et al., 2021). Orth et al. (2015) concluded that “added complexity does not necessarily lead to improved performance of hydrological models”. Notably, parsimonious conceptual models, whether lumped or semi-lumped, have performed efficiently in large-sample studies (e.g., the Génie Rural (GR) model in Perrin et al., 2001, GRSD model in De Lavenne et al., 2019, GR and MORDOR models in Mathevet et al., 2020, FUSE models in Lane et al., 2019, and references therein). Large-sample studies have also been undertaken with spatially distributed models, including variable infiltration capacity (VIC) (Mizukami et al., 2017) with a multiscale parameter regionalization (MPR) (Samaniego et al., 2010) or with pixel-wise calibration on global maps of streamflow characteristics (Yang et al., 2019), a gridded version of Hydrologiska Byråns Vattenbalansavdelning (HBV) applied with MPR-like descriptor-to-parameter regressions on a global dataset (Beck et al., 2020), GloFas (Hirpa et al., 2018), National Hydrologic Model (NHM) (Towler et al., 2023), Wflow (Aerts et al., 2022; van Verseveld et al., 2024), and runoff-relevant parameters of Energy Exascale Earth System Model (E3SM) using a surrogate-assisted Bayesian framework (Xu et al., 2022). Differentiable numerical hydrological modeling has made significant progress in recent years (spatially distributed variational data assimilation (VDA) in Castaings et al., 2009; Lee et al., 2012; Jay-Allemand et al., 2020) for large catchment sample studies with hybrid physics–AI, both with lumped approaches (e.g., Feng et al., 2024) and with high-resolution spatially distributed frameworks (Huynh et al., 2024 b, 2025). These large-sample studies enable more general and statistically sound analyses of model performances (Andréassian et al., 2009; Gupta et al., 2014), addressing large-scale challenges with consistent methodologies across various scales and conditions.

All hydrological models are inherently conceptual, and calibration or learning is generally required due to limitations and uncertainties in their structure, parameter representativity, data availability, and initial and boundary conditions. These models are typically calibrated and validated using discharge time series at the catchment outlet(s) (Sebben et al., 2013). However, calibrating hydrological model parameters from sparse and integrative discharge data is a challenging inverse problem complicated by equifinality issues (Bertalanffy, 1968; Beven, 1993, 2001), especially for distributed models with a large number of cells and parameters (“curse of dimensionality”). Using spatially uniform parameters may not be the best way to exploit a spatially distributed model (under-parameterization), while fully distributed parameter calibration, which requires a gradient-based approach (Castaings et al., 2009; Lee et al., 2012; Jay-Allemand et al., 2020), faces over-parameterization. Therefore, a parameter regionalization approach using multi-linear descriptor-to-parameter transfer functions has been proposed for distributed models (Beck et al., 2020). More recently, this approach has been advanced with regionalization neural networks (Huynh et al., 2024 b) integrated into the differentiable spatialized smash model (Colleoni et al., 2022), which is the focus of the present article, introducing a new numerical code and conducting original tests on a large sample of catchments with open-source data. This approach also enables learning via cost functions based on hydrological signatures, which are obtained using automatic signal analysis algorithms applicable to large samples with smash (Huynh et al., 2023).

This article presents the computational framework smash dedicated to Spatially distributed Modeling and ASsimilation for Hydrology. The smash framework combines vertical and lateral flow operators, either process-based conceptual or hybrid with neural networks (which allows learning regionalization relations between descriptors and parameters), and performs high-dimensional optimization from multi-source data. It is based on an efficient and automatically differentiable Fortran solver enabling CPU parallel computing, which is interfaced in Python using f90wrap (Kermode, 2020; Jay-Allemand et al., 2022) (https://github.com/jameskermode/f90wrap, last access: 25 July 2025). This open-source smash code, version v1.0 (https://github.com/DassHydro/smash, last access: 25 July 2025), is presented here in terms of mathematical formulation, numerical modeling approach, and functionalities, and full details can be found in our research articles from which this software stems (Colleoni et al., 2022; Huynh et al., 2023, 2024 b) and in the online documentation (https://smash.recover.inrae.fr, last access: 25 July 2025). Note that smash has also been developed for operational applications. It is the core solver of the French flash flood forecasting system (Piotte et al., 2020). The proposed framework leverages adjoint-based VDA, enabling the simultaneous inference of high-dimensional and spatially distributed parameters (as illustrated) and initial states (implementation available and tested in smash v1.0 but not shown), applicable at both long and short timescales.

This article is organized as follows. Section 2 describes the smash forward model and the inverse algorithm. In Sect. 3 we describe the smash build system framework, documentation, and computational performance. Some applications of smash are demonstrated in Sect. 4 using open-source datasets, focusing on the contiguous US (CONUS) and on a high-resolution flash-flood-prone case study in France. Section 5 illustrates other aspects of smash not presented in Sect. 4, followed by conclusions in Sect. 6.

2 Model and optimization algorithm description

The smash framework contains various hydrological model structures with varying vertical and lateral flow operators and spatialized routing schemes. It is designed to simulate discharge hydrographs and hydrological states at any spatial location within a structured mesh, and it reproduces the hydrological response of contrasted catchments by taking advantage of spatially distributed meteorological forcings, physiographic data, and hydrometric observations. Cost function gradient maps with respect to tunable parameters are a key feature of smash and can easily be combined to gradients of external operators, such as a regionalization neural network (Huynh et al., 2024 b) with a chain rule in the context of high-dimensional optimization.

2.1 Forward model statement

Let Ω⊂ℝ² denote a 2D spatial domain that can contain one to many gauges, with x∈Ω being the spatial coordinate, $t \in]0, T]$ the physical time, and 𝒟_Ω a drainage plan over Ω. The spatially distributed rainfall–runoff model ℳ is a dynamic operator projecting the input fields of atmospheric forcings 𝓘 onto the fields of surface discharge Q, internal states h, and internal fluxes q, as expressed in Eq. (1):

\begin{array}{l} U (x, t) & = [Q, h, q] (x, t) \\ (1) & = M (D_{Ω}; I (x, t); [θ, h_{0}] (x)), \end{array}

with U(x,t) being the modeled state-flux variables and θ and h₀ being the spatially distributed parameters and initial states of the hydrological model.

The spatially distributed rainfall–runoff model ℳ is obtained by partial composition (each operator taking various other input data and parameters) of the flow operators as follows:

\begin{matrix} (2) & M = M_{hy} (., M_{rr} (., M_{snw})) . \end{matrix}

Several process-based conceptual operators are available in smash for composing a model:

Snow operator ℳ_snw. This optional operator simulates melt flux m_lt(x,t), which feeds the hydrological operator in addition to rain.
- zero: no module
- ssn: degree-day module
Hydrological operator ℳ_rr. The simulation occurs at the pixel scale of elementary runoff q_t(x,t), feeding the routing operator.
- gr4: GR-like module (Perrin et al., 2003; Mathevet, 2005)
- gr5: GR-like module (Le Moine, 2008; Ficchì et al., 2019)
- grd: GR-like module (Perrin et al., 2003; Jay-Allemand et al., 2020)
- loieau: GR-like module (Perrin et al., 2003; Folton and Arnaud, 2020)
- vic3l: VIC-like module adapted from Liang et al. (1994)
Routing operator ℳ_hy. Runoff is routed from pixel to pixel to obtain spatiotemporal discharge Q(x,t).
- lag0: instantaneous module
- lr: linear reservoir module
- kw: kinematic wave module (a classical 1D conceptual kinematic wave model, applied over a D8 drainage plan 𝒟_Ω without channel and solved numerically using a linearized implicit scheme) (Chow et al., 1998).

The operator chaining principle is schematized in Fig. 1 with input data and internal states and fluxes. The operators available in smash are listed above and further detailed in Appendix D and in the online documentation (https://smash.recover.inrae.fr/math_num_documentation/forward_structure.html, last access: 25 July 2025).

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f01

Figure 1Flowchart of input data, operator chaining to obtain the forward differentiable model ℳ that includes a learnable regionalization mapping ϕ (Huynh et al., 2024 b), and simulated states and fluxes. The forward model ℳ is obtained by partial composition (each operator taking various other input data and parameters) of the flow operators $M = M_{hy} (., M_{rr} (., M_{snw}))$ .

Download

Originally, a differentiable descriptor-to-parameter mapping ϕ can be used to constrain spatially distributed conceptual parameters θ(x) and initial states h₀(x) from physical descriptors D(x) for regionalization learning (Huynh et al., 2024 b):

\begin{matrix} (3) & [θ, h_{0}] (x) = ϕ (D (x), ρ), \forall x \in Ω, \end{matrix}

with D being the N_D-dimensional vector of physical descriptor maps covering the spatial domain Ω and ρ being the vector of the tunable regionalization parameters of the available mappings (written for θ only for brevity).

The first component is a set 𝒫 of multiple regression operators for each parameter of the forward hydrological model ℳ:
$\begin{array}{l} θ_{k} (x, D, ρ_{k}) = & s_{k} (α_{k, 0} + \sum_{d = 1}^{N_{D}} α_{k, d} D_{d}^{β_{k, d}} (x)), \\ (4) & \forall k \in [1 \dots N_{θ}] \end{array}$
where $s_{k} (z) = l_{k} + (u_{k} - l_{k}) / (1 + e^{- z})$ , and $\forall z \in R$ is a transformation based on a sigmoid function with values in [l_k,u_k] imposing constraints onto the forward model such that $l_{k} < θ_{k} (x) < u_{k}, \forall x \in Ω$ . The bounds l_k and u_k associated with each conceptual parameter θ_k are spatially uniform. The regional parameter control vector used for estimation in this case is $ρ \equiv {[{(ρ_{k})}_{k = 1}^{N_{θ}}]}^{T} \equiv {[{(α_{k, 0}, {(α_{k, d}, β_{k, d})}_{d = 1}^{N_{D}})}_{k = 1}^{N_{θ}}]}^{T}$ , and a multiple linear regression mapping is obtained by imposing $β_{k, d} = 1$ .
The second component is an artificial neural network (ANN) denoted 𝒩, consisting of a multi-layer perceptron, aimed at learning the descriptor-to-parameter mapping such that
$\begin{matrix} (5) & θ (x, D, ρ) = N (D (x), W, b), \forall x \in Ω, \end{matrix}$
where W and b are the weights and biases of the neural network composed of N_L dense layers respectively. The architecture of the neural network and the forward propagation is detailed in Huynh et al. (2024 b). Note that an output layer consisting of a scaling transformation is used to impose bound constraints as above. The regional control vector in this case is $ρ \equiv {[W, b]}^{T} \equiv {[{(W_{j}, b_{j})}_{j = 1}^{N_{L}}]}^{T}$ .

Note the following:

The available mappings for ϕ are also implemented to predict the initial state vector h₀ using physical descriptor fields that can include previous states and can be used for short-range data assimilation (not studied here).
By construction, the complete forward model ℳ is learnable in terms of parameter regionalization, through the regionalization mapping ϕ embedded into ℳ that is also differentiable, and its parameters ρ can be trained using a cost gradient as explained hereafter.

2.2 Inverse algorithm

Given observed and simulated discharge time series $Q^{*} = (Q_{g = 1 \dots N_{G}}^{*})^{T}$ and $Q = (Q_{g = 1 \dots N_{G}})^{T}$ , with N_G being the number of gauges over the study domain Ω, the model misfit to multi-site observations is measured through a cost function J that can be written as

\begin{matrix} (6) & J (Q^{*}, Q) = \sum_{g = 1}^{N_{G}} w_{g} j_{g} (Q_{g}^{*}, Q_{g}) + j_{reg}, \end{matrix}

with w_g being the weight associated with the cost function j_g at each gauge g, where $\sum_{g = 1}^{N_{g}} w_{g} = 1$ . This multi-gauge observation cost function is also used for mono-gauge calibration with N_G=1. A regularization term j_reg can be considered for ill-posed inverse problems (Jay-Allemand et al., 2020, 2024). In this study, equal weights were assigned to each gauge (i.e., $w_{g} = 1 / N_{G}$ ), which corresponds to minimizing the average of the individual cost functions. Additionally, no regularization term was applied.

The gauge cost function is defined as

\begin{matrix} (7) & j_{g} (Q_{g}^{*}, Q_{g}) = \sum_{c = 1}^{N_{C}} w_{c} j_{c} (Q_{g}^{*}, Q_{g}), \end{matrix}

with j_c being based on any efficiency metric (e.g., Nash–Sutcliffe efficiency, NSE; Kling–Gupta efficiency, KGE; Gupta et al., 2009) or a signature-based cost function including N_C continuous and event-based components (Huynh et al., 2023) and w_c being their relative weights. For the multi-score calibration strategy using continuous NSE and event-based flood signatures, see Huynh et al. (2023).

The global cost function J is defined as a convex and differentiable function, involving the response of the forward model ℳ through its output Q, and consequently depends on the model parameters θ and hence on the parameters ρ of the regional mapping ϕ when used (Eq. 3).

Therefore, the optimization problem is formulated as in Eq. (8):

\begin{matrix} (8) & \hat{ρ} = \arg min_{ρ} J (Q^{*}, M (., ϕ (., ρ))) . \end{matrix}

This high-dimensional inverse problem can be tackled with gradient-based optimization algorithms. A limited-memory quasi-Newton approach, such as L-BFGS-B (Zhu et al., 1997), is suitable for smooth objective functions, while an adaptive learning rate approach, exemplified by Adam (Kingma and Ba, 2014), is effective for non-smooth objective functions. These approaches necessitate obtaining the gradient ∇_ρJ of the cost function with respect to the tunable control parameter ρ obtained by solving the adjoint D_ρℳ of the forward model ℳ. The adjoint model is obtained by automatic differentiation using the Tapenade engine (Hascoet and Pascual, 2013) (https://team.inria.fr/ecuador/fr/tapenade/, last access: 25 July 2025). The complete forward model and VDA process are illustrated in Fig. 2.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f02

Figure 2Flowchart of the inverse algorithm that uses the cost gradient ∇_ρJ with respect to the tunable control parameter ρ obtained by solving the adjoint model D_ρℳ of the forward model ℳ, which is obtained by automatic source code differentiation and enabling accurate gradient computation (adapted from VDA course Monnier, 2024).

3 Computational software and performance

In this section, we focus on code architecture, documentation, and computational performance. smash is based on a computationally efficient Fortran core enabling parallel computations over large domains with OpenMP (Dagum and Menon, 1998) (https://www.openmp.org, last access: 25 July 2025) and is automatically differentiable with the Tapenade engine (Hascoet and Pascual, 2013) to generate the numerical adjoint model. It is interfaced in Python using f90wrap (Kermode, 2020) to provide a user-friendly and versatile interface for quick learning and efficient development and to make the wealth of Python modules and libraries (Table 1) developed by a large and active community directly accessible (data pre-/post-processing, geographic information system, deep learning, etc.).

Harris et al. (2020)Virtanen et al. (2020)pandas development team (2020)Kermode (2020)

Table 1External Python libraries used by smash.

Download Print Version | Download XLSX

3.1 From sources to ready-to-use Python library

smash contains a Python core for all user interface functions, both pre- and post-processing, and a Fortran core (with a few C files) for high-performance numerical computations. In order to produce a Python library, including binary files, which can be installed directly from the package manager, PyPI (https://pypi.org, last access: 25 July 2025), several steps are necessary. The first step is to generate the Fortran adjoint file from the Fortran sources. This is done via the Tapenade automatic differentiation engine (Hascoet and Pascual, 2013), which requires the use of Java. Next, the Fortran code is wrapped for use in Python. f90wrap (Kermode, 2020) builds on the capabilities of the popular F2PY (https://numpy.org/doc/stable/f2py, last access: 25 July 2025) utility by generating a simpler Fortran interface to the original Fortran sources, which is then suitable for wrapping with F2PY, together with a higher-level Pythonic wrapper that makes the existence of an additional layer transparent to the final user. The entire build system (except for the generation of the adjoint file, which is external for debugging reasons) is handled by meson (https://mesonbuild.com, last access: 25 July 2025), a multi-platform, multi-language open-source build system that allows us to generate smash binaries on Linux, macOS, and Windows quite easily (Fig. 3).

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f03

Figure 3The smash build system framework. It starts with source files written in Fortran and Python (0). In intermediate step (1), Fortran sources are processed by Tapenade to generate the adjoint code and wrapped using f90wrap to create Python interfaces (f90wrap-itf). The F2PY tool is then used to generate a Fortran/C/Python binding from the wrapped interfaces. During compilation step (2), the original Fortran sources, adjoint code, and f90wrap-itf are compiled with appropriate compilers to produce a binary Python/C extension module. Finally, in installation step (3), the Python module is assembled, combining the original Python sources with f90wrap-itf and the compiled binary Python/C extension module, making the high-performance Fortran code accessible from Python.

Download

3.2 Documentation

The smash online documentation (Fig. 4) is divided into four main sections:

Getting started. This section describes how to install smash from the Python package index PyPI (https://smash.recover.inrae.fr/getting_started, last access: 25 July 2025).
User guide. This section provides step-by-step examples (and scripts) from basic (simulation run) to complex (regionalization) applications of smash and input data conventions (https://smash.recover.inrae.fr/user_guide, last access: 25 July 2025).
API reference. This section details the different modules and the application programming interface. Modules are documented using the NumPy-style Python docstring (https://smash.recover.inrae.fr/api_reference, last access: 25 July 2025) .
Math/num documentation. This last section details the conceptual and mathematical basis of the forward and inverse modeling problems, their numerical resolution, and their optimization and estimation algorithms (https://smash.recover.inrae.fr/math_num_documentation, last access: 25 July 2025).

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f04

Figure 4smash documentation home page accessible at https://smash.recover.inrae.fr (last access: 25 July 2025).

All documentation is implemented using Sphinx (https://www.sphinx-doc.org/en/master, last access: 25 July 2025) to automatically compile and update an online version.

3.3 Computational performance

In this section, we compare the performance of smash in terms of computation time and memory usage between direct and adjoint runs (an adjoint run is equivalent to a single call to the adjoint model D_ρℳ here). The aim is to highlight the resources required to run smash on configurations similar to real cases. We compare smash over three zones, Sardinia, Great Britain/Ireland, and North America, at a spatial resolution of $1^{'} 30^{''}$ ( $\sim 3 km \times 3 km$ ) over a period of 1 year, from 31 July 2010–31 July 2011, randomly chosen at a daily time step. These three zones were chosen simply to provide three zones of variable surface area (Fig. 5). In addition to the three zones, with smash enabling different assemblies of operators, two structures are compared, s1 and s2, representing the simplest (ℳ_snw: zero, ℳ_rr: grd, and ℳ_hy: lag0) and the most complex (ℳ_snw: ssn, ℳ_rr: vic3l, and ℳ_hy: kw) structures in terms of the number of operations per cell respectively. All the simulations (1 year of simulation at a daily time step) were run on a server with AMD EPYC 7643 CPUs (Appendix E) and 255 GB of RAM.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f05

Figure 5Spatial representations of the three different geographical regions used in the performance benchmarks.

The range of computation times across all simulations (Fig. 6) varies from approximately 0.1 s for a direct run with eight threads in the Sardinia region using the s1 structure to just over an hour for an adjoint run with one thread in the North America region using the s2 structure. Systematically, regardless of the region, number of threads, or type of run, the difference in computation time between the s1 and s2 structures is about a factor of 2. Regarding the differences between a direct run and an adjoint run, the computation time factor varies depending on the number of threads, ranging from a factor of 12 for 1 thread to a factor of 6 for 16 threads. This difference highlights better thread scaling for the adjoint run, with a speedup of around 4 for a direct run and 7 for an adjoint run with 16 threads, likely because the adjoint run is more computationally demanding than the forward run. It is worth noting that in the case of the Sardinia region, which has the fewest grid cells, thread scaling is poor compared to the other two regions, even reaching the limit where thread overhead increases the computation time. Although the time-stepping loop cannot be parallelized, and the routing scheme in smash must be solved sequentially from upstream to downstream, allowing only partial parallelization over the entire spatial domain, the approach still offers a substantial reduction in computation time.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f06

Figure 6Benchmarking results for both direct (a–c) and adjoint (d–f) run simulations over a period of 1 year at a daily time step, using varying numbers of threads (from 1 to 16). Each plot corresponds to a different geographical region: Sardinia, Great Britain/Ireland, and North America from left to right.

Download

Regarding memory usage (Table 2), values range from 0.17 GB for a direct run in the Sardinia region with the s1 structure to 27 GB for an adjoint run in the North America region with the s2 structure. Systematically, memory usage is higher in the adjoint run than in the direct run and scales with the size of the domain. The main contributor to memory usage in the adjoint run is the forward sweep, which includes a time-stepping loop where iteration n depends on the results of previous iterations. The memory allocation during the forward sweep is freed during the backward sweep, but it still results in a significant memory peak. This memory peak has been considerably reduced in the smash version presented here by including checkpoints within the time-stepping loop. These checkpoints allow us to alternate between forward and backward sweeps, leading to much smaller memory peaks compared to a single sweep. The downside of using checkpoints is the increase in computation time, but this was considered less significant compared to the memory saving (see Hascoet and Pascual, 2013, for further details about forward and backward sweeps and checkpointing).

Table 2Memory usage in gigabyte (GB) for both direct and adjoint run simulations over a period of 1 year at a daily time step.

Download Print Version | Download XLSX

In conclusion, these computation times and memory usage demonstrate the feasibility of the model for large-scale applications. The critical point is parameter estimation. In the case of parameter estimation using a gradient-based optimizer, one or more adjoint runs are evaluated at each iteration, significantly multiplying the total computation time. As an example, Huynh et al. (2024 b) performed a calibration at a spatial resolution of 1 km over a domain of more than 20 000 cells and at a temporal resolution of 1 h over a period of 4 years. The calibration required 350 calls to the adjoint model, resulting in a computation time of around 180 h. Currently, memory usage in the adjoint run is less of a limiting factor than computation time for large-domain applications. Thus, further improving computation time is a priority to expand the model's application to finer spatial and temporal scales.

4 Applications

4.1 Numerical experiments presented

The main functionalities and operators of smash are illustrated in open-source global datasets over the contiguous US (CONUS) (Table 3, Fig. 7) and in a higher-resolution open-source regional dataset in France (Table 4, Fig. 7). Models in CONUS will be at a spatial resolution of $1^{'} 30^{''}$ ( $\sim 3 km \times 3 km$ ) and daily time step, while higher-resolution models will be set up in France at 500 m spatial resolution and an hourly time step. The numerical results presented are as follows:

split-sample temporal cross-validation of different model structure combinations over CONUS (Sect. 4.2)
regionalization over CONUS (Sect. 4.3)
high-resolution regionalization over the Aude River in France (Sect. 4.4).

(Beck et al., 2019)(Beck et al., 2019)(Oudin et al., 2005)(Hersbach et al., 2020)(Hersbach et al., 2020)(Eilander et al., 2021)(Yamazaki et al., 2017)(Hengl et al., 2017)(Hengl et al., 2017)(Beck et al., 2019)(Oudin et al., 2005)(Hersbach et al., 2020)(Beck et al., 2019)(Hersbach et al., 2020)Kratzert et al., 2023Addor et al., 2017

Table 3Model input data from open-source databases available worldwide used over CONUS: atmospheric forcings $I = {P, N, E, T}$ , flow direction map 𝒟_Ω, physical descriptors $D = {d_{1}, \dots, d_{6}}$ for regionalization based on the Beck et al. (2020) study, and discharge time series Q^*. The liquid and solid precipitation, P and N, derived from the Multi-Source Weighted-Ensemble Precipitation (MSWEP) (Beck et al., 2019), is divided into liquid and solid parts using a parametric S-shaped curve (Garavaglia et al., 2017) and is disaggregated from 0.1 to 0.025°. The temperature and potential evapotranspiration, T and E, derived from ERA5 (Hersbach et al., 2020), are disaggregated from 0.25 to 0.025°, using the Oudin formula (Oudin et al., 2005) to obtain the potential evapotranspiration. The flow direction, 𝒟_Ω, from MERIT Hydro IHU (Eilander et al., 2021), was upscaled from 0.008 to 0.025° using pyflwdir (https://github.com/Deltares/pyflwdir, last access: 25 July 2025) (Eilander, 2023). The topographic slope, d₁, is derived from MERIT DEM (Yamazaki et al., 2017), upscaled from 0.008 to 0.025° using the gdaldem slope (https://gdal.org/en/latest/programs/gdaldem.html, last access: 25 July 2025). The sand and clay content, d₂ and d₃, from SoilGrids (Hengl et al., 2017), was upscaled and reprojected from 250 m to 0.025°. The meteo-climatic data, d₄, d₅, and d₆, are derived from P and E. The discharge time series, Q^*, comes from Caravan-CAMELS (Kratzert et al., 2023; Addor et al., 2017).

Download Print Version | Download XLSX

(Champeaux et al., 2009)(Oudin et al., 2005)(Quintana-Seguí et al., 2008; Vidal et al., 2010)(Leblois and Sauquet, 1999)(Leblois and Sauquet, 1999)Organde et al. (2013)

Table 4Model input data from national open-source databases used over the Aude River in France: atmospheric forcings $I = {P, E}$ , flow direction map 𝒟_Ω, physical descriptors $D = {d_{1}, \dots, d_{7}}$ for regionalization, and discharge time series Q^*. The liquid precipitation, P, comes from the ANTILOPE J+1 Météo-France product (Champeaux et al., 2009), a radar–gauge reanalysis disaggregated from 1 km to 500 m. The potential evapotranspiration, E, is derived from the SAFRAN Météo-France temperature (Quintana-Seguí et al., 2008; Vidal et al., 2010) disaggregated from 8 km to 500 m, using the Oudin formula (Oudin et al., 2005) to obtain daily interannual potential evapotranspiration. The flow direction, 𝒟_Ω, comes from HydroDem (Leblois and Sauquet, 1999). The land cover data, d₁, d₂, d₃, and d₄, are derived from CORINE Land Cover 2018 (https://doi.org/10.2909/71c95a07-e296-44fc-b22b-415f42acfdf0, last access: 25 July 2025), rasterized at 50 m and upscaled to 500 m using the average resampling method. The topographic slope, d₅, is derived from the HydroDem DEM (Leblois and Sauquet, 1999) using the gdaldem slope. The drainage density, d₆, comes from Organde et al. (2013), representing the number of cells crossed by a river. The percentage of karst, d₇, comes from BDLISA (https://bdlisa.eaufrance.fr, last access: 25 July 2025), rasterized at 50 m and upscaled to 500 m using the average resampling method. The discharge time series, Q^*, comes from the HydroPortail Service Central Vigicrues (https://hydro.eaufrance.fr, last access: 25 July 2025).

Download Print Version | Download XLSX

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f07

Figure 7Location of the catchments for the CONUS (a) and France (b) applications. For the split-sample test over CONUS, all 482 catchments from the CAMELS dataset (Addor et al., 2017) are used (orange and black circles), whereas for the regionalization, a subset of 398 catchments is used (only orange circles), removing catchments whose performance is less than 0.75 KGE from local calibration. For the France application over the Aude River, a set of 25 sub-catchments is used for regionalization, with 12 upstream catchments (red-shaded regions) and 13 downstream catchments (gray-shaded regions).

4.2 CONUS – CAMELS: split-sample temporal cross-validation

4.2.1 Numerical experiment settings

A set of 482 catchments (Fig. 7) is modeled with the following experimental design:

A set of hydrological models is considered, including four GR-like structures (Perrin et al., 2003) (ℳ_rr: gr4, gr5, grd, loieau) and one VIC-like structure (Liang et al., 1994) (ℳ_rr: vic3l). The way in which these models are integrated into smash, which differs from the original models, is described in the documentation (https://smash.recover.inrae.fr/math_num_documentation/forward_structure.html, last access: 25 July 2025) in the forward structure section. For each hydrological model, the same snow module (ℳ_snw: ssn) and routing module (ℳ_hy: kw) are used. A description of the calibrated parameters is provided in Appendix A.
A split-sample temporal validation procedure (Klemeš, 1983) is set up, splitting the time window covered by hydrometric data into two complementary subsets over sub-periods of 7 years: p1 (from 1 August 2000 to 31 July 2007) and p2 (from 1 August 2007 to 31 July 2014) are both used for calibration and validation. For each period, the 10 preceding years are used as model “warm-up”.
Two calibration mappings on each catchment are tested, including spatially uniform parameters (gradient-free optimization) and spatially distributed parameters (gradient-based optimization).
A single-gauge cost function based on the KGE ( $J = 1 - KGE$ ) is used.

4.2.2 Results

The performance of the models resulting from the spatially uniform or distributed calibration is evaluated using the Kling–Gupta efficiency (KGE) for both the calibration and the validation in period p2 only for brevity in this software article (Fig. 8). Overall, performance is satisfactory, with a median between 0.8 and 0.87 for KGE over the calibration period and between 0.72 and 0.78 for KGE over the validation period. With regard to the calibration method, for any model, calibration and validation performances are better with a spatially distributed calibration. This is an expected result for the calibration period, given that spatially distributed calibration is over-parameterized and offers the maximum level of flexibility in the search for the optimal set of parameters, unlike spatially uniform calibration, which is under-parameterized, imposing a single parameter set for each catchment. However, despite this over-parameterization with calibration of spatially distributed parameters, which can lead to over-fitting over the calibration period, the models offer good performance in temporal validation. The differences between the structures are mainly explained by (i) the varying levels of model complexity, two parameters for the grd model and four for the gr5 model, and (ii) the expert knowledge of the different models, which influences, among other things, the choice of initial values, bounds, and parameters to be optimized. The smash historical development based on the GR-like models led to much more substantial expert knowledge than for the VIC-like model recently implemented. Summary statistics of the calibrated parameters are provided in Appendix A.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f08

Figure 8Comparison of the Kling–Gupta efficiency (KGE) performance of different smash hydrological models under spatially uniform and distributed calibration. The models evaluated include gr4, gr5, grd, loieau, and vic3l. Panel (a) shows results for the calibration, while panel (b) displays results for temporal validation in period p2. For each model, results are shown for spatially uniform (solid boxes) and spatially distributed (hatched boxes) calibrations, with the median value highlighted at the top of the boxplot.

Download

Concerning the spatial distribution of KGE values (Fig. 9), the results for the gr4 hydrological model after a spatially distributed calibration show that the best performances are located over the east and west sides of CONUS, while the worst performances are located over the Great Plains area. This spatial pattern of hydrological model performance has also been obtained in other studies (Newman et al., 2015; Beck et al., 2016; Mizukami et al., 2017).

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f09

Figure 9Spatial distribution of the Kling–Gupta efficiency (KGE) scores across different catchments for the gr4 model under local calibration with spatially distributed parameters. Panel (a) shows KGE scores during the calibration period, while panel (b) shows results for temporal validation, using the p2 period.

4.3 CONUS – CAMELS: regionalization

4.3.1 Numerical experiment settings

A set of 398 catchments (Fig. 7) from the CAMELS dataset (Addor et al., 2017) is evaluated in a regionalization context at a spatial resolution of $1^{'} 30^{''}$ and at a daily time step using worldwide databases (Table 3). The experimental design is as follows:

A subset of catchments from Sect. 4.2 is selected, eliminating catchments where KGE<0.75 from local calibration. This selection is made in order to avoid introducing catchments whose performance could greatly degrade the calibration metric in a multi-gauge context.
One hydrological model is considered, which is identical to the gr4 model in Sect. 4.2 with the same snow and routing module (ℳ_snw: ssn, ℳ_rr: gr4, and ℳ_hy: kw). A description of the calibrated parameters is provided in Appendix B.
A spatiotemporal validation procedure is set up by the following:
- splitting the time window covered by hydrometric data into two complementary subsets over sub-periods of 7 years: p1 (from 1 August 2000 to 31 July 2007) and p2 (from 1 August 2007 to 31 July 2014), with p1 used as the calibration period and p2 as the validation period (for each period, 10 years is used as model “warm-up”);
- randomly splitting the catchment set into four groups, calibrating on three of the groups with the fourth group held out and used for validation, and then rotating them such that each group is used for validation once.
Three calibration mappings across the whole CONUS are tested:
- Uniform. Spatially uniform parameters (gradient-free optimization) are used.
- Multi-linear. Multiple linear regression is used as a transfer function from descriptors to spatialized parameters (gradient-based optimization).
- ANN. A multi-layer perceptron composed of three hidden layers is used as a transfer function from descriptors to spatialized parameters (gradient-based optimization).
A multi-gauge cost function is used based on the average KGE of the calibrated catchments ( $J = \frac{1}{N_{G}} \sum^{N_{G}} 1 - KGE$ ).
A final calibration is performed over the total period p1+p2, including all gauges with the ANN mapping and the same multi-gauge cost function used to analyze the output model parameters and their correlations with input descriptors.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f10

Figure 10Spatiotemporal validation performance over period p2. The boxplots in panel (a) represent the distribution of Kling–Gupta efficiency (KGE) scores for three calibration methods: uniform, multi-linear, and artificial neural network (ANN). Median values are displayed at the top of each boxplot. The map in panel (b) illustrates the spatial distribution of the KGE values for the ANN mapping across different catchments.

4.3.2 Results

The regional calibration over the CAMELS dataset was performed on four groups of randomly selected catchments, as explained above. The performances in the spatial and/or temporal validation are shown in Fig. 10 and detailed by catchment group in Table B2 for the ANN mapping, which is the best performer. In the spatiotemporal validation, the most challenging extrapolation case, a uniform mapping leads to a median KGE of 0.5, while the two regionalization methods result in a KGE of 0.61 or 0.63 respectively for multi-linear and ANN mapping. These fairly good performances, obtained with a relatively simple setup in terms of descriptors and cost function in particular, are comparable with regionalization works in the literature (Mizukami et al., 2019; Beck et al., 2020; Feng et al., 2024). In a manner similar to the previous section (Sect. 4.2), the worst performances are found in the Great Plains and, more clearly than in the local calibrations, in the western part of the country.

Following the evaluation of performance in spatiotemporal validation, a regional calibration with the ANN mapping over the period including p1 and p2 and with all gauges is carried out. This calibration enables us to analyze the correlations between the physiographic descriptors and the parameters obtained (Fig. 11) in a more robust way than with the various spatiotemporal validation groups. The correlation matrix highlights significant linear correlations, notably between the melt coefficient (k_mlt) and the topographical slope (d₁), the size of the production reservoir (c_p) and the mean annual rainfall (d₄), and the moisture content (d₆) and the routing parameters (a_kw, b_kw) with the same moisture content (d₆). Conversely, the exchange parameter (k_exc), a parameter directly affecting the model's mass balance in a non-conservative way, shows almost no linear correlation with the descriptors and is almost spatially uniform over the whole domain around the value of 0. While a detailed regionalization study on CAMELS datasets using our original adjoint-based algorithms is beyond the scope of this software article, the achieved performance across this large sample already showcases the algorithm's potential for global applicability. It also demonstrates the algorithm's effectiveness in enforcing spatially distributed hydrologic model constraints at the pixel scale, leading to seamless parameter maps at a reasonable computational cost. Regarding computation times, the calibration with ANN mapping over periods p1 and p2 took 95 h. This calibration involved 350 iterations, corresponding to 350 calls to the adjoint model, and was performed using 16 threads. For comparison, a single adjoint model run takes approximately 16 min, whereas a direct model run takes around 5 min using the same number of threads.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f11

Figure 11Analysis of input descriptors and output model parameters for the ANN mapping. Spatial distribution of physical descriptors (d₁–d₆) in the top panel, with details provided in Table 3. Spatial distribution of calibrated hydrological parameters (k_mlt, c_p, c_t, k_exc, a_kw, b_kw) in the lower-left panel and linear correlation between descriptors and parameters in the lower-right panel.

Finally, leveraging the fully distributed nature of smash, regional streamflow maps can be generated. An example is shown in Fig. 12, which illustrates the dynamics of Hurricane Katrina over a 6 d period from 27 August to 1 September 2005. Notably, the routing model used in this exercise, the kinematic wave, was applied uniformly across the entire domain, including areas outside its validity range, such as downstream of major rivers on flat topography. Further work focuses on enriching smash with hydraulic models, starting with a 1D and 2D dynamic wave model that neglects the convective acceleration term but retains local acceleration and pressure gradient terms (Bates et al., 2010) for numerical implementation simplicity. Additionally, physics-based differential equations for hydrologic water balance at the pixel scale will be incorporated. Hybrid physics–AI formulations, which embed neural networks capable of learning parameterization and potentially uncertain model operators from data, can be explored thanks to the differentiable nature of the models within smash.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f12

Figure 12Streamflow dynamics during Hurricane Katrina from 27 August to 1 September 2005 for the ANN mapping. Each panel depicts the streamflow distribution across the affected region. To visualize the temporal evolution of the spatialized discharge pattern, note that the kinematic wave routing was applied on flat topography, i.e., out of its validity range.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f13

Figure 13Performance in spatiotemporal validation over period p2 using calibration on upstream gauges (triangles). The boxplots in panel (a) represent the distribution of Nash–Sutcliffe efficiency (NSE) scores for the three calibration methods: uniform, multi-linear, and ANN. Median values are displayed at the top of each boxplot. The map in panel (b) illustrates the spatial distribution of the NSE values for the ANN mapping for the downstream validation catchments.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f14

Figure 14Analysis of input descriptors and output model parameters for the ANN descriptor-to-parameter mapping. Spatial distribution of physical descriptors (d₁–d₇) in the top panel, with details provided in Table 4. Spatial distribution of calibrated hydrological parameters (c_p, c_t, k_exc, a_kw, b_kw) in the lower-left panel and linear correlation between descriptors and parameters in the lower-right panel.

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f15

Figure 15Observed and simulated streamflow of the six most downstream gauges of the Aude River for the ANN mapping. Each panel represents streamflow (m³ s⁻¹) for a specific gauge. The dashed black lines indicate observed values (Obs), while the solid red lines represent simulated values (Sim).

Download

https://gmd.copernicus.org/articles/18/7003/2025/gmd-18-7003-2025-f16

Figure 16Spatially distributed gradients of the cost function J with respect to the model parameters at initial and final iterations for ANN mapping. The first row shows the gradients $\nabla_{θ^{*}} J$ at the initial iteration, while the second row presents the gradients $\nabla_{\hat{θ}} J$ at the final iteration after optimization. Each column corresponds to the partial derivative of J with respect to a specific parameter: c_p, c_t, k_exc, a_kw, and b_kw. These gradients are used in the optimization process of the control vector ρ using $\nabla_{ρ} J = \nabla_{θ} J . \nabla_{ρ} θ$ with $θ = N (., ρ)$ , where 𝒩 is the multi-layer perceptron used.

Download

4.4 France – Aude River: high-resolution regionalization

4.4.1 Numerical experiment settings

A set of 35 catchments (Fig. 7) over the Aude River in France (Addor et al., 2017) is evaluated in a regionalization context at a spatial resolution of 500 m and at an hourly time step using national databases (Table 4). This section is similar to the previous one, using the same regionalization method, but with differences in the gauges selected for calibration and validation and differences in the cost function. This section focuses on national data at a finer spatiotemporal scale, applying the method at the watershed level, which is more relevant for operational flood forecasting. The experimental design is as follows:

One hydrological model is considered, identical to the gr4 model in Sect. 4.2 with the same routing module but without any snow modeling given the limited impact of snow in this Mediterranean basin (ℳ_snw: zero, ℳ_rr: gr4, ℳ_hy: kw). A description of the calibrated parameters is provided in Appendix C.
A spatiotemporal validation procedure is set up by the following:
- splitting the time window covered by hydrometric data into two complementary subsets over sub-periods of 4 years: p1 (from 1 August 2015 to 31 July 2019) and p2 (from 1 August 2019 to 31 July 2023), with p1 used as the calibration period and p2 as the validation period (for each period, 1 year is used as model “warm-up”)
- splitting the catchment set into two groups, upstream and downstream, calibrating on the upstream group and validating on the downstream group.
Three calibration mappings across the whole Aude River are tested:
- Uniform. Spatially uniform parameters (gradient-free optimization) are used.
- Multi-linear. Multiple linear regression is used as the transfer function from descriptors to spatialized parameters (gradient-based optimization).
- ANN. A multi-layer perceptron composed of three hidden layers is used as the transfer function from descriptors to spatialized parameters (gradient-based optimization).
A multi-gauge cost function is used based on the average NSE of the calibrated catchments ( $J = \frac{1}{N_{G}} \sum^{N_{G}} (1 - NSE)$ ).

4.4.2 Results

The results of the regional mappings were validated on downstream gauges following Huynh et al. (2024 b). The spatiotemporal validation performance, which assesses the model outside of the calibration gauges and period, is particularly challenging at such a high resolution and given the complex variabilities of physical factors and hydrological responses over this Mediterranean flash-flood-prone case. The results are shown in validation only, for brevity again, in Fig. 13. A uniform mapping yields a poor median NSE of 0.15, while descriptor-to-parameter mappings achieve 0.62 and 0.69 for multi-linear and ANN approaches respectively. Conceptual parameter maps obtained by learning from physical descriptors are shown in Fig. 14 for the ANN mapping only (with the best NSE result). The correlation matrix highlights significant correlations, especially between production capacity (c_p) and topographic slope (d₅) and between exchange parameter (k_exc) or routing parameter (a_kw) and topographic slope (d₅). For each parameter, a correlation is also found with vegetation cover rate (d₃) or forest cover rate (d₂). This illustrates the interpretability of our neural-network-based regionalization algorithm in the space of conceptual model parameters. Regarding computation times, the calibration with ANN mapping over period p1 took 31 h. This calibration involved 350 iterations, corresponding to 350 calls to the adjoint model, and was performed using 10 threads. For comparison, a single adjoint model run takes approximately 6 min, whereas a direct model run takes around 1 min and 30 s using the same number of threads. A key feature of smash is its ability to accurately and efficiently compute spatially distributed cost gradients, as shown in Fig. 16 in the conceptual parameter (θ) space for interpretability, in the case of a differentiable spatially distributed hydrological model that includes an NN-based regionalization mapping and a kinematic wave routing model (the partial differential equation numerical solver also being differentiated). Finally, simulated hydrographs are plotted for the six most downstream validation gauges in Fig. 15, with better reproduction for most downstream gauges in the present test configuration with the calibrated ANN regionalization (in agreements with results of Huynh et al., 2024 b, over the whole French Mediterranean region).

These performances are very encouraging since they were obtained with a relatively simple regionalization setup in a complex flash-flood-prone area. Further research with smash will focus on improving the versatility of the hydrological model to better account for high rainfall intensities (e.g., Peredo et al., 2022) or groundwater/karstic effects, with classical or hybrid differential equations capable of learning from data at multiple scales, and to enrich regionalization algorithms with advanced cost functions and spatial relaxation/regularization strategies. These improvements are necessary to better extract information with the VDA algorithm from multiple discharge gauges and other data sources (descriptors, satellite moisture, temperature, etc.). Additionally, incorporating more realistic hydraulic routing embedded within the differentiable hydrologic model will also enable the integration of hydraulic information (water levels, flow videos, etc.), as introduced in Pujol et al. (2022).

5 Other smash features

In addition to the core differentiable spatialized hydrological solvers and regionalization learning algorithms illustrated above, smash enables performing the following:

automatic hydrograph segmentation and flood detection over large samples (Huynh et al., 2023);
parameter calibration using signature-based cost functions (Huynh et al., 2023) in addition to continuous metrics;
parameter calibration using a spatial regularization term (Jay-Allemand et al., 2020);
initial state estimation, including with regionalization mapping, even over short time windows, which is applicable to short-range VDA for operational forecasting;
simulation of discharge ensembles from rainfall ensemble forecasting;
Bayesian approach for parameter estimation and uncertainty quantification, with the consideration of structural model errors and observation errors.

6 Conclusions

The recently released smash framework represents an advancement in modular, regionalizable, and differentiable numerical modeling, as well as in hydrological data assimilation. This conclusion synthesizes the key principles, implementation features, performance indicators, and future prospects of smash, as presented in this article.

smash is built around three foundational principles: a modular operator chaining, enabling flexible representation of vertical and lateral hydrological processes; a regionalization mapping through hybrid approaches, combining conceptual models with descriptor-to-parameter neural networks; and a robust inverse algorithm that supports VDA.

The software leverages automatic differentiation to facilitate gradient-based calibration. Its seamless integration with Python via f90wrap ensures user-friendly access and flexibility, complemented by an automatic build system that simplifies deployment. Furthermore, smash supports parallel computing on CPUs, significantly accelerating computations for large-scale applications.

In terms of hydrological modeling, smash achieves interesting results. Using CAMELS datasets, a median KGE>0.8 is observed in local spatially distributed calibration for daily GR-like and VIC-like model structures at $d x = 1^{'} 30^{''}$ (∼3 km). Additionally, regionalization learning across CONUS of conceptual parameters from physical descriptors yields KGE>0.6 in spatiotemporal validation. High-resolution hourly modeling at dx=500 m for Mediterranean flash-flood scenarios demonstrates NSE>0.6.

Planned enhancements to smash include the integration of additional differentiable hydrological, hydraulic, and land surface models; the expansion of hybrid physics–AI frameworks; and the refinement of data assimilation techniques. These advancements aim to further improve model accuracy, computational efficiency, and applicability in both research and operational settings.

Appendix A: CONUS – CAMELS: split-sample temporal cross-validation

Table A1Summary of model operators and their associated parameters. For each operator, the parameter name, description, initial value, and allowable value range are provided.

Download Print Version | Download XLSX

Table A2Summary statistics of the calibrated parameters across the set of 482 catchments. For each parameter, the median, standard deviation, and coefficient of variation ( $μ / σ$ ) are reported for the two calibration configurations: spatially uniform (“Uniform”) and spatially distributed (“Distributed”). For the spatially distributed calibration, statistics were computed based on the spatial average of parameter values within each catchment.

Download Print Version | Download XLSX

Appendix B: CONUS – CAMELS: regionalization

Table B1Summary of model operators and their associated parameters. For each operator, the parameter name, description, initial value, and allowable value range are provided.

Download Print Version | Download XLSX

Table B2Median KGE obtained in regionalization mapping calibration–validation over four groups of randomly selected basins.

Download Print Version | Download XLSX

Appendix C: France – Aude: high-resolution regionalization

Table C1Summary of model operators and their associated parameters. For each operator, the parameter name, description, initial value, and allowable value range are provided.

Download Print Version | Download XLSX

Appendix D: smash operators

This section describes the various operators available in smash with mathematical or numerical expressions, input data $[I, D] (x, t)$ , tunable conceptual parameters θ(x,t), and simulated states and fluxes $U (x, t) = [Q, h, q] (x, t)$ . These operators are written below for a given pixel x of the 2D spatial domain Ω and for a time t in the simulation window ]0,T].

D1 Snow operator ℳ_snw

zero

This snow operator simply means that there is no snow operator.
$m_{lt} (x, t) = 0$
Here m_lt is the melt flux.
ssn

This snow operator is a simple degree-day snow operator.

Update the snow reservoir state h_s for $t^{*} \in]t - 1, t[$ :
$h_{s} (x, t^{*}) = h_{s} (x, t - 1) + S (x, t) .$
Compute the melt flux m_lt:
$\begin{array}{l} m_{lt} (x, t) = \\ \{\begin{cases} 0 & if T_{e} (x, t) \leq 0 \\ min (h_{s} (x, t^{*}), k_{mlt} (x) \cdot T_{e} (x, t)) & otherwise. \end{cases} \end{array}$
Update the snow reservoir state h_s:
$h_{s} (x, t) = h_{s} (x, t^{*}) - m_{lt} (x, t),$
with m_lt being the melt flux, S the snow, T_e the temperature, k_mlt the melt coefficient, and h_s the state of the snow reservoir.

D2 Hydrological operator ℳ_hy

gr4

This hydrological operator is derived from the GR4 model (Perrin et al., 2003).
Interception

Compute interception evapotranspiration e_i:
$\begin{array}{l} e_{i} (x, t) = min (E (x, t), P (x, t) + m_{lt} (x, t) \\ + \tilde{h_{i}} (x, t - 1) \cdot c_{i} (x)) . \end{array}$
Compute the neutralized precipitation p_n and evapotranspiration e_n:
$\begin{array}{l} p_{n} (x, t) = max (0, P (x, t) + m_{lt} (x, t) \\ - c_{i} (x) (1 - \tilde{h_{i}} (x, t - 1)) - e_{i} (x, t)) \\ e_{n} (x, t) = E (x, t) - e_{i} (x, t) . \end{array}$
Update the normalized interception reservoir state $\tilde{h_{i}}$ :
$\begin{array}{l} \tilde{h_{i}} (x, t) & = \tilde{h_{i}} (x, t - 1) \\ + \frac{P (x, t) + m_{lt} (x, t) + e_{i} (x, t) - p_{n} (x, t)}{c_{i} (x)} . \end{array}$

Production

Compute the production infiltrating precipitation p_s and evapotranspiration e_s:
$\begin{array}{l} p_{s} (x, t) = & c_{p} (x) (1 - \tilde{h_{p}} (x, t - 1)^{2}) \\ \cdot \frac{\tanh (\frac{p_{n} (x, t)}{c_{p} (x)})}{1 + \tilde{h_{p}} (x, t - 1) \tanh (\frac{p_{n} (x, t)}{c_{p} (x)})} \\ e_{s} (x, t) = & \tilde{h_{p}} (x, t - 1) \cdot c_{p} (x) \cdot (2 - \tilde{h_{p}} (x, t - 1)) \\ \cdot \frac{\tanh (\frac{e_{n} (x, t)}{c_{p} (x)})}{1 + (1 - \tilde{h_{p}} (x, t - 1)) \tanh (\frac{e_{n} (x, t)}{c_{p} (x)})} . \end{array}$
Update the normalized production reservoir state $\tilde{h_{p}}$ :
$\tilde{h_{p}} (x, t^{*}) = \tilde{h_{p}} (x, t - 1) + \frac{p_{s} (x, t) - e_{s} (x, t)}{c_{p} (x)} .$
Compute the production runoff p_r:
$\begin{array}{l} p_{r} (x, t) = \\ \{\begin{cases} 0 if p_{n} (x, t) \leq 0 \\ p_{n} (x, t) - (\tilde{h_{p}} (x, t^{*}) - \tilde{h_{p}} (x, t - 1)) c_{p} (x) \\ otherwise . \end{cases} \end{array}$
Compute the production percolation p_erc:
$\begin{array}{l} p_{erc} (x, t) = & \tilde{h_{p}} (x, t^{*}) \cdot c_{p} (x) \\ \cdot (1 - {(1 + {(\frac{4}{9} \tilde{h_{p}} (x, t^{*}))}^{4})}^{- 1 / 4}) . \end{array}$
Update the normalized production reservoir state $\tilde{h_{p}}$ :
$\tilde{h_{p}} (x, t) = \tilde{h_{p}} (x, t^{*}) - \frac{p_{erc} (x, t)}{c_{p} (x)} .$

Exchange

Compute the exchange flux l_exc:
$l_{exc} (x, t) = k_{exc} (x) \cdot \tilde{h_{t}} (x, t - 1)^{7 / 2} .$

Transfer

Split the production runoff p_r into two branches (transfer and direct), p_rr and p_rd:
$\begin{array}{l} p_{rr} (x, t) = 0.9 (p_{r} (x, t) + p_{erc} (x, t)) + l_{exc} (x, t) \\ p_{rd} (x, t) = 0.1 (p_{r} (x, t) + p_{erc} (x, t)) . \end{array}$
Update the normalized transfer reservoir state $\tilde{h_{t}}$ :
$\tilde{h_{t}} (x, t^{*}) = max (0, \tilde{h_{t}} (x, t - 1) + \frac{p_{rr} (x, t)}{c_{t} (x)}) .$
Compute the transfer branch elemental discharge q_r:
$\begin{array}{l} q_{r} (x, t) = & \tilde{h_{t}} (x, t^{*}) \cdot c_{t} (x) \\ - {({(\tilde{h_{t}} (x, t^{*}) \cdot c_{t} (x))}^{- 4} + c_{t} (x)^{- 4})}^{- 1 / 4} . \end{array}$
Update the normalized transfer reservoir state $\tilde{h_{t}}$ :
$\tilde{h_{t}} (x, t) = \tilde{h_{t}} (x, t^{*}) - \frac{q_{r} (x, t)}{c_{t} (x)} .$
Compute the direct branch elemental discharge q_d:
$q_{d} (x, t) = max (0, p_{rd} (x, t) + l_{exc} (x, t)) .$
Compute the elemental discharge q_t:
$q_{t} (x, t) = q_{r} (x, t) + q_{d} (x, t),$
with q_t being the elemental discharge, P the precipitation, E the potential evapotranspiration, m_lt the melt flux from the snow operator, c_i the maximum capacity of the interception reservoir, c_p the maximum capacity of the production reservoir, c_t the maximum capacity of the transfer reservoir, k_exc the exchange coefficient, $\tilde{h_{i}}$ the state of the normalized interception reservoir, $\tilde{h_{p}}$ the state of the normalized production reservoir, and $\tilde{h_{t}}$ the state of the normalized transfer reservoir.
gr5

This hydrological operator is derived from the GR4 model (Le Moine, 2008). It consists of a GR4-like model structure (see above) with a modified exchange flux with two parameters to account for seasonal variations.
Interception

Same as gr4 Interception

Production

Same as gr4 Production

Exchange

Compute the exchange flux l_exc:
$l_{exc} (x, t) = k_{exc} (x) \cdot \tilde{h_{t}} (x, t - 1)^{7 / 2} .$

Transfer

Same as gr4 Transfer, with q_t being the elemental discharge, P the precipitation, E the potential evapotranspiration, m_lt the melt flux from the snow operator, c_i the maximum capacity of the interception reservoir, c_p the maximum capacity of the production reservoir, c_t the maximum capacity of the transfer reservoir, k_exc the exchange coefficient, a_exc the exchange threshold, $\tilde{h_{i}}$ the state of the normalized interception reservoir, $\tilde{h_{p}}$ the state of the normalized production reservoir, and $\tilde{h_{t}}$ the state of the normalized transfer reservoir.
grd

This hydrological operator is derived from the GR models and is a simplified structure used in Jay-Allemand et al. (2020).
Interception

Compute the interception evapotranspiration e_i:
$e_{i} (x, t) = min (E (x, t), P (x, t) + m_{lt} (x, t)) .$
Compute the neutralized precipitation p_n and evapotranspiration e_n:
$\begin{array}{l} p_{n} (x, t) = max (0, P (x, t) + m_{lt} (x, t) - e_{i} (x, t)) \\ e_{n} (x, t) = E (x, t) - e_{i} (x, t) . \end{array}$

Production

Same as gr4 Production

Transfer

Update the normalized transfer reservoir state $\tilde{h_{t}}$ :
$\tilde{h_{t}} (x, t^{*}) = max (0, \tilde{h_{t}} (x, t - 1) + \frac{p_{r} (x, t)}{c_{t} (x)}) .$
Compute the transfer branch elemental discharge q_r:
$\begin{array}{l} q_{r} (x, t) & = \tilde{h_{t}} (x, t^{*}) \cdot c_{t} (x) \\ - {({(\tilde{h_{t}} (x, t^{*}) \cdot c_{t} (x))}^{- 4} + c_{t} (x)^{- 4})}^{- 1 / 4} . \end{array}$
Update the normalized transfer reservoir state $\tilde{h_{t}}$ :
$\tilde{h_{t}} (x, t) = \tilde{h_{t}} (x, t^{*}) - \frac{q_{r} (x, t)}{c_{t} (x)} .$
Compute the elemental discharge q_t:
$q_{t} (x, t) = q_{r} (x, t),$
with q_t being the elemental discharge, P the precipitation, E the potential evapotranspiration, m_lt the melt flux from the snow operator, c_p the maximum capacity of the production reservoir, c_t the maximum capacity of the transfer reservoir, $\tilde{h_{p}}$ the state of the normalized production reservoir, and $\tilde{h_{t}}$ the state of the normalized transfer reservoir.
loieau

This hydrological operator is derived from the GR model (Folton and Arnaud, 2020).
Interception

Same as gr4 Interception

Production

Same as gr4 Production

Transfer

Split the production runoff p_r into two branches (transfer and direct), p_rr and p_rd:
$\begin{array}{l} p_{rr} (x, t) = 0.9 (p_{r} (x, t) + p_{erc} (x, t)) \\ p_{rd} (x, t) = 0.1 (p_{r} (x, t) + p_{erc} (x, t)) . \end{array}$
Update the normalized transfer reservoir state $\tilde{h_{c}}$ :
$\tilde{h_{c}} (x, t^{*}) = max (0, \tilde{h_{c}} (x, t - 1) + \frac{p_{rr} (x, t)}{c_{c} (x)}) .$
Compute the transfer branch elemental discharge q_r:
$\begin{array}{l} q_{r} (x, t) & = \tilde{h_{c}} (x, t^{*}) \cdot c_{c} (x) \\ - {({(\tilde{h_{c}} (x, t^{*}) \cdot c_{c} (x))}^{- 3} + c_{c} (x)^{- 3})}^{- 1 / 3} . \end{array}$
Update the normalized transfer reservoir state $\tilde{h_{c}}$ :
$\tilde{h_{c}} (x, t) = \tilde{h_{c}} (x, t^{*}) - \frac{q_{r} (x, t)}{c_{c} (x)} .$
Compute the direct branch elemental discharge q_d:
$q_{d} (x, t) = max (0, p_{rd} (x, t)) .$
Compute the elemental discharge q_t:
$q_{t} (x, t) = k_{b} (x) \cdot (q_{r} (x, t) + q_{d} (x, t)),$
with q_t being the elemental discharge, P the precipitation, E the potential evapotranspiration, m_lt the melt flux from the snow operator, c_a the maximum capacity of the production reservoir, c_c the maximum capacity of the transfer reservoir, k_b the transfer coefficient, $\tilde{h_{a}}$ the state of the normalized production reservoir, and $\tilde{h_{c}}$ the state of the normalized transfer reservoir.
vic3l

This hydrological operator is derived from the VIC model (Liang et al., 1994).
Canopy layer interception

Compute the canopy layer interception evapotranspiration e_c:
$\begin{array}{l} e_{c} (x, t) = & min (E (x, t) \tilde{h_{cl}} (x, t - 1)^{2 / 3}, \\ P (x, t) + m_{lt} (x, t) + \tilde{h_{cl}} (x, t - 1)) . \end{array}$
Compute the neutralized precipitation p_n and evapotranspiration e_n:
$\begin{array}{l} p_{n} (x, t) = max (0, P (x, t) + m_{lt} (x, t) \\ - (1 - \tilde{h_{cl}} (x, t - 1)) - e_{c} (x, t)) \\ e_{n} (x, t) = E (x, t) - e_{c} (x, t) . \end{array}$
Update the normalized canopy layer interception state $\tilde{h_{cl}}$ :
$\begin{array}{l} \tilde{h_{cl}} (x, t) = & \tilde{h_{cl}} (x, t - 1) + P (x, t) \\ - e_{c} (x, t) - p_{n} (x, t) . \end{array}$

Upper soil layer evapotranspiration

Compute the maximum infiltration i_m and the corresponding soil saturation infiltration i₀:
$\begin{array}{l} i_{m} (x, t) = (1 + b (x)) c_{usl} (x) \\ i_{0} (x, t) = i_{m} (x, t) \\ \cdot (1 - (1 - \tilde{h_{usl}} (x, t - 1))^{1 / (1 - b (x))}) . \end{array}$
Compute the upper soil layer evapotranspiration e_s:
$e_{s} (x, t) = \{\begin{cases} e_{n} (x, t) & if i_{0} (x, t) \geq i_{m} (x, t) \\ β (x, t) e_{n} (x, t) & otherwise, \end{cases}$
with β being the ARNO evapotranspiration beta function (Todini, 1996).

Update the normalized upper soil layer reservoir state $\tilde{h_{usl}}$ :
$\tilde{h_{usl}} (x, t) = \tilde{h_{usl}} (x, t - 1) - \frac{e_{s} (x, t)}{c_{usl} (x)} .$

Infiltration

Compute the maximum capacity c_umsl, soil moisture w_umsl, and relative state h_umsl of the first two layers:
$\begin{array}{l} c_{umsl} (x) = c_{usl} (x) + c_{msl} (x) \\ w_{umsl} (x, t - 1) = \tilde{h_{usl}} (x, t - 1) c_{usl} (x) \\ + \tilde{h_{msl}} (x, t - 1) c_{msl} (x) \\ h_{umsl} (x, t - 1) = \frac{w_{umsl} (x, t - 1)}{c_{umsl} (x)} . \end{array}$
Compute maximum i_m and infiltration i₀:
$\begin{array}{l} i_{m} (x, t) = & (1 + b (x)) c_{umsl} (x) \\ i_{0} (x, t) = & i_{m} (x, t) \\ \cdot (1 - (1 - h_{umsl} (x, t - 1))^{1 / (1 - b (x))}) . \end{array}$
Compute infiltration i:
$i (x, t) = \{\begin{cases} c_{umsl} (x) - w_{umsl} (x, t - 1) \\ if i_{0} (x, t) + p_{n} (x, t) > i_{m} (x, t) \\ \begin{array}{l} c_{umsl} (x) - w_{umsl} (x, t - 1) \\ - c_{umsl} (x) {(1 - \frac{i_{0} (x, t) + p_{n} (x, t)}{i_{m} (x, t)})}^{b (x) + 1} \end{array} \\ otherwise . \end{cases}$
Distribute infiltration between the upper two layers:
$\begin{array}{l} i_{usl} (x, t) = min ( & (1 - \tilde{h_{usl}} (x, t - 1)) c_{usl} (x), \\ i (x, t)) \\ i_{msl} (x, t) = min ( & (1 - \tilde{h_{msl}} (x, t - 1)) c_{msl} (x), \\ i (x, t) - i_{usl} (x, t)) . \end{array}$
Update the reservoir states:
$\begin{array}{l} \tilde{h_{usl}} (x, t) & = \tilde{h_{usl}} (x, t - 1) + i_{usl} (x, t) \\ \tilde{h_{msl}} (x, t) & = \tilde{h_{msl}} (x, t - 1) + i_{msl} (x, t) . \end{array}$
Compute runoff:
$q_{r} (x, t) = p_{n} (x, t) - (i_{usl} (x, t) + i_{msl} (x, t)) .$

Drainage

Compute the soil moisture in the first two layers:
$\begin{array}{l} w_{usl} (x, t - 1) & = \tilde{h_{usl}} (x, t - 1) c_{usl} (x) \\ w_{msl} (x, t - 1) & = \tilde{h_{msl}} (x, t - 1) c_{msl} (x) . \end{array}$
Compute the initial drainage flux:
$d_{umsl} (x, t^{*}) = k_{s} (x) \cdot \tilde{h_{usl}} (x, t - 1)^{p_{bc}} .$
Update the drainage flux:
$\begin{array}{l} d_{umsl} (x, t) = min (d_{umsl} (x, t^{*}), \\ min (w_{usl} (x, t - 1), c_{msl} (x) - w_{msl} (x, t - 1))) . \end{array}$
Update normalized reservoir states:
$\begin{array}{l} \tilde{h_{usl}} (x, t) & = \tilde{h_{usl}} (x, t - 1) - \frac{d_{umsl} (x, t)}{c_{usl} (x)} \\ \tilde{h_{msl}} (x, t) & = \tilde{h_{msl}} (x, t - 1) + \frac{d_{umsl} (x, t)}{c_{msl} (x)} . \end{array}$
The same approach is performed for drainage between medium and bottom soil layers. For brevity, we skip the first steps and directly give the update equations.

Update the normalized medium and bottom reservoir states:
$\begin{array}{l} \tilde{h_{msl}} (x, t) & = \tilde{h_{msl}} (x, t - 1) - \frac{d_{mbsl} (x, t)}{c_{msl} (x)} \\ \tilde{h_{bsl}} (x, t) & = \tilde{h_{bsl}} (x, t - 1) + \frac{d_{mbsl} (x, t)}{c_{bsl} (x)} . \end{array}$

Baseflow

Compute baseflow q_b:
$\begin{array}{l} q_{b} (x, t) = \\ \{\begin{cases} \frac{d_{sm} (x) d_{s} (x)}{w_{s} (x)} \tilde{h_{bsl}} (x, t - 1) \\ if \tilde{h_{bsl}} (x, t - 1) \leq w_{s} (x) \\ \begin{array}{l} \frac{d_{sm} (x) d_{s} (x)}{w_{s} (x)} \tilde{h_{bsl}} (x, t - 1) \\ + d_{sm} (x) (1 - \frac{d_{s} (x)}{w_{s} (x)}) {(\frac{\tilde{h_{bsl}} (x, t - 1) - w_{s} (x)}{1 - w_{s} (x)})}^{2} \end{array} \\ otherwise . \end{cases} \end{array}$
Update the normalized bottom soil layer reservoir:
$\tilde{h_{bsl}} (x, t) = \tilde{h_{bsl}} (x, t - 1) - \frac{q_{b} (x, t)}{c_{bsl} (x)},$
with q_t being the elemental discharge, P the precipitation, E the potential evapotranspiration, m_lt the melt flux from the snow operator, b the variable infiltration curve parameter, c_usl the maximum capacity of the upper soil layer, c_msl the maximum capacity of the medium soil layer, c_bsl the maximum capacity of the bottom soil layer, k_s the saturated hydraulic conductivity, p_bc the Brooks and Corey exponent, d_sm the maximum velocity of baseflow, d_s the nonlinear baseflow threshold maximum velocity, w_s the nonlinear baseflow threshold soil moisture, $\tilde{h_{cl}}$ the state of the normalized canopy layer, $\tilde{h_{usl}}$ the state of the normalized upper soil layer, $\tilde{h_{msl}}$ the state of the normalized medium soil layer, and $\tilde{h_{bsl}}$ the state of the normalized bottom soil layer.

D3 Routing operator ℳ_rr

lag0

This routing operator is a simple aggregation of upstream discharge to downstream discharge following the drainage plan.
Upstream discharge

Compute the upstream discharge q_up:
$q_{up} (x, t) = \{\begin{cases} 0 & if Ω_{x} = \emptyset \\ \sum_{k \in Ω_{x}} Q (k, t) & otherwise, \end{cases}$
where Ω_x is the set of upstream cells flowing into cell x.

Surface discharge

Compute the surface discharge Q:
$Q (x, t) = q_{up} (x, t) + α (x) q_{t} (x, t),$
where α(x) is a unit conversion factor from $mm \cdot Δ t^{- 1}$ to $m^{3} \cdot s^{- 1}$ for a single cell.

Q is the surface discharge, q_t is the elemental discharge, and Ω_x is a 2D spatial domain that corresponds to all upstream cells flowing into cell x, i.e., the whole upstream catchment. Note that Ω_x is a subset of Ω, Ω_x⊂Ω, and for the most upstream cells Ω_x=∅.
lr

This routing operator uses a linear reservoir to route upstream discharge to downstream discharge following the drainage plan.
Upstream discharge

Same as lag0 Upstream discharge

Surface discharge

Update the routing reservoir state h_lr:
$h_{lr} (x, t^{*}) = h_{lr} (x, t) + \frac{1}{β (x)} q_{up} (x, t),$
where β(x) is a conversion factor from $mm \cdot Δ t^{- 1}$ to $m^{3} \cdot s^{- 1}$ for the entire upstream domain Ω_x.

Compute the routed discharge q_rt:
$q_{rt} (x, t) = h_{lr} (x, t^{*}) (1 - \exp (\frac{- Δ t}{60 \times l_{lr}})) .$
Update the routing reservoir state h_lr:
$h_{lr} (x, t) = h_{lr} (x, t^{*}) - q_{rt} (x, t) .$
Compute the surface discharge Q:
$Q (x, t) = β (x) q_{rt} (x, t) + α (x) q_{t} (x, t),$
where α(x) is a conversion factor from $mm \cdot Δ t^{- 1}$ to $m^{3} \cdot s^{- 1}$ for a single cell.
kw

This routing operator is based on a conceptual 1D kinematic wave model that is numerically solved with a linearized implicit numerical scheme (Chow et al., 1998). This is applicable given the drainage plan 𝒟_Ω(x) that enables the routing problem to be reduced to 1D.

The kinematic wave model is a simplification of the 1D Saint-Venant hydraulic equations.

First, the mass conservation equation is written as
$\begin{matrix} (D1) & \frac{\partial A}{\partial t} + \frac{\partial Q}{\partial x} = q, \end{matrix}$
where $\partial_{□}$ denotes partial differentiation with respect to time or space, A is the cross-sectional flow area, Q is the discharge, and q represents lateral inflows.

The momentum equation is simplified by assuming that the water surface slope equals the bed slope; i.e., the flow is locally uniform and gradually varied:
$\begin{matrix} (D2) & S_{0} = S_{f}, \end{matrix}$
where S₀ is the bed slope, and S_f is the friction slope. This implies that the energy grade line is parallel to the channel bottom.

This simplification leads to an empirical relation between discharge and flow area or depth, as described by Chow et al. (1998):
$\begin{matrix} (D3) & A = a_{kw} Q^{b_{kw}}, \end{matrix}$
where a_kw and b_kw are two empirical constants that can also be related to the Manning friction law.

Injecting the parameterization from Eq. (D3) into the mass conservation equation (Eq. D1) yields the following one-equation form of the kinematic wave model (Chow et al., 1998):
$\begin{matrix} (D4) & \frac{\partial Q}{\partial x} + a_{kw} b_{kw} Q^{b_{kw} - 1} \frac{\partial Q}{\partial t} = q . \end{matrix}$
For the sake of clarity, the following variables are renamed for this section and the finite-difference numerical scheme.

Table D1Renamed variables.

Download Print Version | Download XLSX

Upstream discharge

Same as lag0 Upstream discharge with q_up denoted $Q_{i - 1}^{j}$ .

Surface discharge

Compute the intermediate variables d₁ and d₂:

\begin{array}{l} d_{1} & = \frac{Δ t}{Δ x}, \\ d_{2} & = a_{kw} b_{kw} {(\frac{Q_{i}^{j - 1} + Q_{i - 1}^{j}}{2})}^{b_{kw} - 1} . \end{array}

Compute the intermediate variables n₁, n₂, and n₃:

\begin{array}{l} n_{1} & = d_{1} Q_{i - 1}^{j}, \\ n_{2} & = d_{2} Q_{i}^{j - 1}, \\ n_{3} & = d_{1} \frac{q_{i}^{j - 1} + q_{i}^{j}}{2} . \end{array}

Compute the surface discharge $Q_{i}^{j}$ :

Q_{i}^{j} = Q (x, t) = \frac{n_{1} + n_{2} + n_{3}}{d_{1} + d_{2}},

with Q being the surface discharge, q_t the elemental discharge, a_kw the alpha kinematic wave parameter, b_kw the beta kinematic wave parameter, and Ω_x a 2D spatial domain that corresponds to all upstream cells flowing into cell x. Note that Ω_x is a subset of Ω, Ω_x⊂Ω, and for the most upstream cells Ω_x=∅.

Appendix E: CPU information

Architecture:             x86_64
  CPU op-mode(s):         32 bit, 64 bit
  Address sizes:          48 bits physical, 48 bits virtual
  Byte order:             Little Endian
CPU(s):                   192
  On-line CPU(s) list:    0-191
Vendor ID:                AuthenticAMD
  Model name:             AMD EPYC 7643 48-Core Processor
    CPU family:           25
    Model:                1
    Thread(s) per core:   2
    Core(s) per socket:   48
    Socket(s):            2
    Stepping:             1
    Frequency boost:      enabled
    CPU max MHz:          2300.0000
    CPU min MHz:          1500.0000
    BogoMIPS:             4591.48
Virtualization features:
  Virtualization:         AMD-V
Caches (sum of all):
  L1d:                    3 MiB (96 instances)
  L1i:                    3 MiB (96 instances)
  L2:                     48 MiB (96 instances)
  L3:                     512 MiB (16 instances)
NUMA:
  NUMA node(s):           2
  NUMA node0 CPU(s):      0-47,96-143
  NUMA node1 CPU(s):      48-95,144-191

Code and data availability

The source code of smash, version 1.0, is available and preserved on multiple platforms: GitHub at https://github.com/DassHydro/smash/tree/v1.0.2 (last access: 25 July 2025), PyPI at https://pypi.org/project/hydro-smash/1.0.2 (last access: 25 July 2025), and Zenodo at https://doi.org/10.5281/zenodo.14841726 (Colleoni et al., 2025 a) (last access: 25 July 2025). The datasets presented in this paper are also available on Zenodo at https://doi.org/10.5281/zenodo.14865491 (Colleoni et al., 2025 b) (last access: 25 July 2025). smash is released under the GPL-3 license and is developed openly at https://github.com/DassHydro/smash (last access: 25 July 2025). The documentation is accessible at https://smash.recover.inrae.fr (last access: 25 July 2025).

Author contributions

FC: lead developer of smash v1.0, conceptualization, numerical experiments, result analyses, paper preparation. NNTH: main developer of smash v1.0, conceptualization, result analyses, paper preparation. PAG: co-developer, conceptualization, research plan and supervision, result analyses, paper preparation, funding. MJA: co-developer of smash v1.0, main developer of the first wrapping and differentiable code, paper review. DO: main developer of the first Fortran code, paper review. BR: co-developer of smash v1.0, result analyses, research co-supervision, paper review. TDF, AEB, JD: contribution to co-development of smash v1.0, paper review. PJ: result analyses, paper review, funding.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The French national flood forecasting center, Service Central Vigicrues (ex. SCHAPI), is gratefully acknowledged for software development, operational application, and long-term collaboration on flood forecasting and data sharing. During the preparation of this work, the authors used Mistral AI in order to correct and improve the English language. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.

Financial support

This research was supported by the Agence Nationale de la Recherche MUFFINS project (grant no. ANR-21-CE04-0021-01).

Review statement

This paper was edited by Dalei Hao and reviewed by two anonymous referees.

References

Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, https://doi.org/10.5194/hess-21-5293-2017, 2017. a, b, c, d, e, f

Aerts, J. P. M., Hut, R. W., van de Giesen, N. C., Drost, N., van Verseveld, W. J., Weerts, A. H., and Hazenberg, P.: Large-sample assessment of varying spatial resolution on the streamflow estimates of the wflow_sbm hydrological model, Hydrol. Earth Syst. Sci., 26, 4407–4430, https://doi.org/10.5194/hess-26-4407-2022, 2022. a

Andréassian, V., Perrin, C., Berthet, L., Le Moine, N., Lerat, J., Loumagne, C., Oudin, L., Mathevet, T., Ramos, M.-H., and Valéry, A.: HESS Opinions “Crash tests for a standardized evaluation of hydrological models”, Hydrol. Earth Syst. Sci., 13, 1757–1764, https://doi.org/10.5194/hess-13-1757-2009, 2009. a

Bates, P. D., Horritt, M. S., and Fewtrell, T. J.: A simple inertial formulation of the shallow water equations for efficient two-dimensional flood inundation modelling, J. Hydrol., 387, 33–45, https://doi.org/10.1016/j.jhydrol.2010.03.027, 2010. a

Beck, H. E., van Dijk, A. I. J. M., de Roo, A., Miralles, D. G., McVicar, T. R., Schellekens, J., and Bruijnzeel, L. A.: Global-scale regionalization of hydrologic model parameters, Water Resour. Res., 52, 3599–3622, https://doi.org/10.1002/2015WR018247, 2016. a

Beck, H. E., Wood, E. F., Pan, M., Fisher, C. K., Miralles, D. G., van Dijk, A. I. J. M., McVicar, T. R., and Adler, R. F.: MSWEP V2 global 3-hourly 0.1° precipitation: methodology and quantitative assessment, B. Am. Meteorol. Soc., 100, 473–500, https://doi.org/10.1175/BAMS-D-17-0138.1, 2019. a, b, c, d, e, f

Beck, H. E., Pan, M., Lin, P., Seibert, J., van Dijk, A. I. J. M., and Wood, E. F.: Global fully distributed parameter regionalization based on observed streamflow from 4,229 headwater catchments, J. Geophys. Res.-Atmos., 125, e2019JD031485, https://doi.org/10.1029/2019JD031485, 2020. a, b, c, d

Bertalanffy, L. V.: General System Theory: Foundations, Development, Applications, G. Braziller, ISBN 0807600156, 1968. a

Beven, K.: Towards a new paradigm in hydrology, in: Water for the Future: Hydrology in Perspective, IAHS Publication, https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2020wr028091, 1987. a

Beven, K.: Changing ideas in hydrology – the case of physically-based models, J. Hydrol., 105, 157–172, https://doi.org/10.1016/0022-1694(89)90101-7, 1989. a

Beven, K.: Prophecy, reality and uncertainty in distributed hydrological modelling, Adv. Water Resour., 16, 41–51, https://doi.org/10.1016/0309-1708(93)90028-E, 1993. a

Beven, K.: How far can we go in distributed hydrological modelling?, Hydrol. Earth Syst. Sci., 5, 1–12, https://doi.org/10.5194/hess-5-1-2001, 2001. a

Beven, K. J.: Rainfall – Runoff Modelling, The Primer, John Wiley and Sons, LTD, https://doi.org/10.1002/9781119951001, 2011. a

Bierkens, M. F. P., Bell, V. A., Burek, P., Chaney, N., Condon, L. E., David, C. H., de Roo, A., Döll, P., Drost, N., Famiglietti, J. S., Flörke, M., Gochis, D. J., Houser, P., Hut, R., Keune, J., Kollet, S., Maxwell, R. M., Reager, J. T., Samaniego, L., Sudicky, E., Sutanudjaja, E. H., van de Giesen, N., Winsemius, H., and Wood, E. F.: Hyper-resolution global hydrological modelling: what is next?, Hydrol. Process., 29, 310–320, https://doi.org/10.1002/hyp.10391, 2015. a, b

Blöschl, G. and Sivapalan, M.: Scale issues in hydrological modelling: a review, Hydrol. Process., 9, 251–290, https://doi.org/10.1002/hyp.3360090305, 1995. a

Bouaziz, L. J. E., Fenicia, F., Thirel, G., de Boer-Euser, T., Buitink, J., Brauer, C. C., De Niel, J., Dewals, B. J., Drogue, G., Grelier, B., Melsen, L. A., Moustakas, S., Nossent, J., Pereira, F., Sprokkereef, E., Stam, J., Weerts, A. H., Willems, P., Savenije, H. H. G., and Hrachowitz, M.: Behind the scenes of streamflow model performance, Hydrol. Earth Syst. Sci., 25, 1069–1095, https://doi.org/10.5194/hess-25-1069-2021, 2021. a

Brisset, P., Monnier, J., Garambois, P.-A., and Roux, H.: On the assimilation of altimetric data in 1D Saint-Venant river flow models, Adv. Water Resour., 119, 41–59, https://doi.org/10.1016/j.advwatres.2018.06.004, 2018. a

Castaings, W., Dartus, D., Le Dimet, F.-X., and Saulnier, G.-M.: Sensitivity analysis and parameter estimation for distributed hydrological modeling: potential of variational methods, Hydrol. Earth Syst. Sci., 13, 503–517, https://doi.org/10.5194/hess-13-503-2009, 2009. a, b, c

Champeaux, J.-L., Dupuy, P., Laurantin, O., Soulan, I., Tabary, P., and Soubeyroux, J.-M.: Les mesures de précipitations et l'estimation des lames d'eau à Météo-France : état de l'art et perspectives, La Houille Blanche, 95, 28–34, https://doi.org/10.1051/lhb/2009052, 2009. a, b

Chow, V. T., Maidment, D. R., and Mays, L. W.: Applied Hydrology, in: McGraw-Hill Series in Water Resources and Environmental Engineering, McGraw-Hill, ISBN 9780070108103, https://wecivilengineers.wordpress.com/wp-content/uploads/2017/10/applied-hydrology-ven-te-chow.pdf (last access: 25 July 2025), 1998. a, b, c, d

Clark, M. P., Bierkens, M. F. P., Samaniego, L., Woods, R. A., Uijlenhoet, R., Bennett, K. E., Pauwels, V. R. N., Cai, X., Wood, A. W., and Peters-Lidard, C. D.: The evolution of process-based hydrologic models: historical challenges and the collective quest for physical realism, Hydrol. Earth Syst. Sci., 21, 3427–3440, https://doi.org/10.5194/hess-21-3427-2017, 2017. a

Colleoni, F., Garambois, P.-A., Javelle, P., Jay-Allemand, M., and Arnaud, P.: Adjoint-based spatially distributed calibration of a grid GR-based parsimonious hydrological model over 312 French catchments with SMASH platform, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2022-506, 2022. a, b

Colleoni, F., Huynh, N. N. T., Garambois, P.-A., Jay-Allemand, M., Organde, D., Renard, B., De Fournas, T., El Baz, A., Demargne, J., and Javelle, P.: SMASH v1.0.2, Zenodo [code], https://doi.org/10.5281/zenodo.14841726, 2025a. a

Colleoni, F., Huynh, N. N. T., Garambois, P.-A., Jay-Allemand, M., Organde, D., Renard, B., De Fournas, T., El Baz, A., Demargne, J., and Javelle, P.: SMASH v1.0.2 cases, Zenodo [data set], https://doi.org/10.5281/zenodo.14865491, 2025b. a

Dagum, L. and Menon, R.: OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, 5, 46–55, https://doi.org/10.1109/99.660313, 1998. a

De Lavenne, A., Andréassian, V., Thirel, G., Ramos, M.-H., and Perrin, C.: A regularization approach to improve the sequential calibration of a semidistributed hydrological model, Water Resour. Res., 55, 8821–8839, 2019. a

Dooge, J. C. I.: Looking for hydrologic laws, Water Resour. Res., 22, 46S–58S, https://doi.org/10.1029/WR022i09Sp0046S, 1986. a

Duan, Q., Schaake, J., Andréassian, V., Franks, S., Goteti, G., Gupta, H., Gusev, Y., Habets, F., Hall, A., Hay, L., Hogue, T., Huang, M., Leavesley, G., Liang, X., Nasonova, O., Noilhan, J., Oudin, L., Sorooshian, S., Wagener, T., and Wood, E.: Model Parameter Estimation Experiment (MOPEX): an overview of science strategy and major results from the second and third workshops, J. Hydrol., 320, 3–17, https://doi.org/10.1016/j.jhydrol.2005.07.031, 2006. a

Eilander, D.: pyFlwDir, Zenodo [code], https://doi.org/10.5281/zenodo.7759261, 2023. a

Eilander, D., van Verseveld, W., Yamazaki, D., Weerts, A., Winsemius, H. C., and Ward, P. J.: A hydrography upscaling method for scale-invariant parametrization of distributed hydrological models, Hydrol. Earth Syst. Sci., 25, 5287–5313, https://doi.org/10.5194/hess-25-5287-2021, 2021. a, b, c

Feng, D., Beck, H., de Bruijn, J., Sahu, R. K., Satoh, Y., Wada, Y., Liu, J., Pan, M., Lawson, K., and Shen, C.: Deep dive into hydrologic simulations at global scale: harnessing the power of deep learning and physics-informed differentiable models (δHBV-globe1.0-hydroDL), Geosci. Model Dev., 17, 7181–7198, https://doi.org/10.5194/gmd-17-7181-2024, 2024. a, b

Fenicia, F., Kavetski, D., and Savenije, H. H.: Elements of a flexible approach for conceptual hydrological modeling: 1. Motivation and theoretical development, Water Resour. Res., 47, 11, https://doi.org/10.1029/2010WR010174, 2011. a

Ficchì, A., Perrin, C., and Andréassian, V.: Hydrological modelling at multiple sub-daily time steps: model improvement via flux-matching, J. Hydrol., 575, 1308–1327, https://doi.org/10.1016/j.jhydrol.2019.05.084, 2019. a

Folton, N. and Arnaud, P.: Indicateurs sur la ressource en eau estimés par une modélisation pluie-débit régionalisée: la base de données Web LoiEau, La Houille Blanche, 106, 22–29, https://doi.org/10.1051/lhb/2020034, 2020. a, b

Garavaglia, F., Le Lay, M., Gottardi, F., Garçon, R., Gailhard, J., Paquet, E., and Mathevet, T.: Impact of model structure on flow simulation and hydrological realism: from a lumped to a semi-distributed approach, Hydrol. Earth Syst. Sci., 21, 3937–3952, https://doi.org/10.5194/hess-21-3937-2017, 2017. a

Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling, J. Hydrol., 377, 80–91, https://doi.org/10.1016/j.jhydrol.2009.08.003, 2009. a

Gupta, H. V., Perrin, C., Blöschl, G., Montanari, A., Kumar, R., Clark, M., and Andréassian, V.: Large-sample hydrology: a need to balance depth with breadth, Hydrol. Earth Syst. Sci., 18, 463–477, https://doi.org/10.5194/hess-18-463-2014, 2014. a

Harris, C. R., Millman, K. J., van der Walt, S. J., Gommers, R., Virtanen, P., Cournapeau, D., Wieser, E., Taylor, J., Berg, S., Smith, N. J., Kern, R., Picus, M., Hoyer, S., van Kerkwijk, M. H., Brett, M., Haldane, A., del Río, J. F., Wiebe, M., Peterson, P., Gérard-Marchant, P., Sheppard, K., Reddy, T., Weckesser, W., Abbasi, H., Gohlke, C., and Oliphant, T. E.: Array programming with NumPy, Nature, 585, 357–362, https://doi.org/10.1038/s41586-020-2649-2, 2020. a

Hascoet, L. and Pascual, V.: The Tapenade automatic differentiation tool: principles, model, and specification, ACM T. Math. Software, 39, 1–43, 2013. a, b, c, d

Hengl, T., Mendes de Jesus, J., Heuvelink, G. B. M., Ruiperez Gonzalez, M., Kilibarda, M., Blagotić, A., Shangguan, W., Wright, M. N., Geng, X., Bauer-Marschallinger, B., Guevara, M. A., Vargas, R., MacMillan, R. A., Batjes, N. H., Leenaars, J. G. B., Ribeiro, E., Wheeler, I., Mantel, S., and Kempen, B.: SoilGrids250m: global gridded soil information based on machine learning, PLoS One, 12, 1–40, https://doi.org/10.1371/journal.pone.0169748, 2017. a, b, c, d

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. a, b, c, d, e, f

Hirpa, F. A., Salamon, P., Beck, H. E., Lorini, V., Alfieri, L., Zsoter, E., and Dadson, S. J.: Calibration of the Global Flood Awareness System (GloFAS) using daily streamflow data, J. Hydrol., 566, 595–606, https://doi.org/10.1016/j.jhydrol.2018.09.052, 2018. a

Hrachowitz, M. and Clark, M. P.: HESS Opinions: The complementary merits of competing modelling philosophies in hydrology, Hydrol. Earth Syst. Sci., 21, 3953–3973, https://doi.org/10.5194/hess-21-3953-2017, 2017. a

Huynh, N. N. T., Garambois, P.-A., Colleoni, F., and Javelle, P.: Signatures-and-sensitivity-based multi-criteria variational calibration for distributed hydrological modeling applied to Mediterranean floods, J. Hydrol., 625, 129992, https://doi.org/10.1016/j.jhydrol.2023.129992, 2023. a, b, c, d, e, f

Huynh, N. N. T., Garambois, P.-A., Colleoni, F., Renard, B., Monnier, J., and Roux, H.: Multiscale learnable physical modeling and data assimilation framework: application to high-resolution regionalized hydrological simulation of flash flood, Authorea Preprints, https://doi.org/10.22541/au.170709054.44271526/v2, 2024a. a

Huynh, N. N. T., Garambois, P.-A., Colleoni, F., Renard, B., Roux, H., Demargne, J., Jay-Allemand, M., and Javelle, P.: Learning regionalization using accurate spatial cost gradients within a differentiable high-resolution hydrological model: application to the French Mediterranean region, Water Resour. Res., 60, e2024WR037544, https://doi.org/10.1029/2024WR037544, 2024b. a, b, c, d, e, f, g, h, i, j, k

Huynh, N. N. T., Garambois, P.-A., Renard, B., Colleoni, F., Monnier, J., and Roux, H.: A Distributed Hybrid Physics-AI Framework for Learning Corrections of Internal Hydrological Fluxes and Enhancing High-Resolution Regionalized Flood Modeling, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2024-3665, 2025. a, b

Jay-Allemand, M., Javelle, P., Gejadze, I., Arnaud, P., Malaterre, P.-O., Fine, J.-A., and Organde, D.: On the potential of variational calibration for a fully distributed hydrological model: application on a Mediterranean catchment, Hydrol. Earth Syst. Sci., 24, 5519–5538, https://doi.org/10.5194/hess-24-5519-2020, 2020. a, b, c, d, e, f, g

Jay-Allemand, M., Colleoni, F., Garambois, P.-A., Javelle, P., and Demargne, J.: SMASH – patially distributed Modelling and Assimilation for Hydrology: Python wrapping towards enhances research-to-operations transfer, IAHS 2022 – Montpellier, poster, https://hal.science/hal-03683657 (last access: 25 July 2025), 2022. a

Jay-Allemand, M., Demargne, J., Garambois, P.-A., Javelle, P., Gejadze, I., Colleoni, F., Organde, D., Arnaud, P., and Fouchier, C.: Spatially distributed calibration of a hydrological model with variational optimization constrained by physiographic maps for flash flood forecasting in France, Proc. IAHS, 385, 281–290, https://doi.org/10.5194/piahs-385-281-2024, 2024. a

Kermode, J. R.: f90wrap: an automated tool for constructing deep Python interfaces to modern Fortran codes, J. Phys. Condens. Matter, https://doi.org/10.1088/1361-648X/ab82d2, 2020. a, b, c, d

Kingma, D. P. and Ba, J.: Adam: a method for stochastic optimization, arXiv [preprint], https://doi.org/10.48550/arXiv.1412.6980, 2014. a

Klemeš, V.: Conceptualization and scale in hydrology, J. Hydrol., 65, 1–23, https://doi.org/10.1016/0022-1694(83)90208-1, 1983. a

Kratzert, F., Nearing, G., Addor, N., Erickson, T., Gauch, M., Gilon, O., Gudmundsson, L., Hassidim, A., Klotz, D., Nevo, S., Shalev, G., and Matias, Y.: Caravan – a global community dataset for large-sample hydrology, Scientific Data, 10, 61, https://doi.org/10.1038/s41597-023-01975-w, 2023. a, b, c

Lane, R. A., Coxon, G., Freer, J. E., Wagener, T., Johnes, P. J., Bloomfield, J. P., Greene, S., Macleod, C. J. A., and Reaney, S. M.: Benchmarking the predictive capability of hydrological models for river flow and flood peak predictions across over 1000 catchments in Great Britain, Hydrol. Earth Syst. Sci., 23, 4011–4032, https://doi.org/10.5194/hess-23-4011-2019, 2019. a

Leblois, E. and Sauquet, E.: Grid elevation models in hydrology – Part 1: Principales and literrature review; Part2: HydroDEM, User's manual, Tech. rep., Cemagref, https://hess.copernicus.org/articles/14/99/2010/hess-14-99-2010.pdf, 1999. a, b, c, d

Lee, H., Seo, D.-J., Liu, Y., Koren, V., McKee, P., and Corby, R.: Variational assimilation of streamflow into operational distributed hydrologic models: effect of spatiotemporal scale of adjustment, Hydrol. Earth Syst. Sci., 16, 2233–2251, https://doi.org/10.5194/hess-16-2233-2012, 2012. a, b

Le Moine, N.: Le bassin versant de surface vu par le souterrain: une voie d'amélioration des performances et du réalisme des modles pluie-débit ?, PhD thesis, Cemagref, UR HBAN, Antony, https://theses.hal.science/tel-02591478 (last access: 25 July 2025), 2008. a, b

Liang, X., Lettenmaier, D. P., Wood, E. F., and Burges, S. J.: A simple hydrologically based model of land surface water and energy fluxes for general circulation models, J. Geophys. Res.-Atmos., 99, 14415–14428, https://doi.org/10.1029/94JD00483, 1994. a, b, c

Liu, Y. and Gupta, H. V.: Uncertainty in hydrologic modeling: toward an integrated data assimilation framework, Water Resour. Res., 43, 2007. a

Mathevet, T.: Quels modeles pluie-debit globaux au pas de temps horaire? Développements empiriques et intercomparaison de modeles sur un large échantillon de bassins versants, PhD thesis, ENGREF, 463 pp., https://hal.inrae.fr/tel-02587642v1 (last access: 25 July 2025), 2005. a

Mathevet, T., Gupta, H., Perrin, C., Andréassian, V., and Le Moine, N.: Assessing the performance and robustness of two conceptual rainfall-runoff models on a worldwide sample of watersheds, J. Hydrol., 585, 124698, https://doi.org/10.1016/j.jhydrol.2020.124698, 2020. a

Milly, P.: Climate, interseasonal storage of soil water, and the annual water balance, Adv. Water Resour., 17, 19–24, https://doi.org/10.1016/0309-1708(94)90020-5, 1994. a

Mizukami, N., Clark, M. P., Newman, A. J., Wood, A. W., Gutmann, E. D., Nijssen, B., Rakovec, O., and Samaniego, L.: Towards seamless large-domain parameter estimation for hydrologic models, Water Resour. Res., 53, 8020–8040, https://doi.org/10.1002/2017WR020401, 2017. a, b

Mizukami, N., Rakovec, O., Newman, A. J., Clark, M. P., Wood, A. W., Gupta, H. V., and Kumar, R.: On the choice of calibration metrics for “high-flow” estimation using hydrologic models, Hydrol. Earth Syst. Sci., 23, 2601–2614, https://doi.org/10.5194/hess-23-2601-2019, 2019. a

Monnier, J.: Data Assimilation – Inverse Problems, Assimilation, Control, Learning, INSA Toulouse, https://www.math.univ-toulouse.fr/~jmonnie/Enseignement/CourseVDA.pdf (last access: 25 July 2025), 2024. a

Monnier, J., Couderc, F., Dartus, D., Larnier, K., Madec, R., and Vila, J.-P.: Inverse algorithms for 2D shallow water equations in presence of wet dry fronts: application to flood plain dynamics, Adv. Water Resour., 97, 11–24, https://doi.org/10.1016/j.advwatres.2016.07.005, 2016. a

Newman, A. J., Clark, M. P., Sampson, K., Wood, A., Hay, L. E., Bock, A., Viger, R. J., Blodgett, D., Brekke, L., Arnold, J. R., Hopson, T., and Duan, Q.: Development of a large-sample watershed-scale hydrometeorological data set for the contiguous USA: data set characteristics and assessment of regional variability in hydrologic model performance, Hydrol. Earth Syst. Sci., 19, 209–223, https://doi.org/10.5194/hess-19-209-2015, 2015. a

Organde, D., Arnaud, P., Fine, J.-A., Fouchier, C., Folton, N., and Lavabre, J.: Régionalisation d'une méthode de prédétermination de crue sur l'ensemble du territoire français: la méthode SHYREG, Revue des Sciences de l'Eau, 26, 65–78, 2013. a, b

Orth, R., Staudinger, M., Seneviratne, S. I., Seibert, J., and Zappa, M.: Does model performance improve with complexity? A case study with three hydrological models, J. Hydrol., 523, 147–159, https://doi.org/10.1016/j.jhydrol.2015.01.044, 2015. a, b, c

Oudin, L., Hervieu, F., Michel, C., Perrin, C., Andréassian, V., Anctil, F., and Loumagne, C.: Which potential evapotranspiration input for a lumped rainfall–runoff model?: Part 2 – Towards a simple and efficient potential evapotranspiration model for rainfall–runoff modelling, J. Hydrol., 303, 290–306, https://doi.org/10.1016/j.jhydrol.2004.08.026, 2005. a, b, c, d, e

pandas development team: pandas-dev/pandas: Pandas, Zenodo [code], https://doi.org/10.5281/zenodo.3509134, 2020. a

Peredo, D., Ramos, M.-H. V. A., and Oudin, L.: Investigating hydrological model versatility to simulate extreme flood events, Hydrolog. Sci. J., 67, 628–645, https://doi.org/10.1080/02626667.2022.2030864, 2022. a

Perrin, C., Michel, C., and Andréassian, V.: Does a large number of parameters enhance model performance? Comparative assessment of common catchment model structures on 429 catchments, J. Hydrol., 242, 275–301, https://doi.org/10.1016/S0022-1694(00)00393-0, 2001. a, b

Perrin, C., Michel, C., and Andrèassian, V.: Improvement of a parsimonious model for streamflow simulation, J. Hydrol., 279, 275–289, 2003. a, b, c, d, e

Piotte, O., Montmerle, T., Fouchier, C., Belleudy, A., Garandeau, L., Janet, B., Jauffret, C., Demargne, J., and Organde, D.: Les évolutions du service d'avertissement sur les pluies intenses et les crues soudaines en France, La Houille Blanche, 106, 75–84, https://doi.org/10.1051/lhb/2020055, 2020. a

Pujol, L., Garambois, P.-A., and Monnier, J.: Multi-dimensional hydrological-hydraulic model with variational data assimilation for river networks and floodplains, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2022-10, 2022. a

Quintana-Seguí, P., Le Moigne, P., Durand, Y., Martin, E., Habets, F., Baillon, M., Canellas, C., Franchisteguy, L., and Morel, S.: Analysis of near-surface atmospheric variables: validation of the SAFRAN analysis over France, J. Appl. Meteorol., 47, 92, https://doi.org/10.1175/2007JAMC1636.1, 2008. a, b

Reed, S., Koren, V., Smith, M., Zhang, Z., Moreda, F., Seo, D.-J., and DMIP Participants: Overall distributed model intercomparison project results, J. Hydrol., 298, 27–60, 2004. a

Refsgaard, J. C.: Parameterisation, calibration and validation of distributed hydrological models, J. Hydrol., 198, 69–97, https://doi.org/10.1016/S0022-1694(96)03329-X, 1997. a

Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter regionalization of a grid-based hydrologic model at the mesoscale, Water Resour. Res., 46, https://doi.org/10.1029/2008WR007327, 2010. a

Sebben, M. L., Werner, A. D., Liggett, J. E., Partington, D., and Simmons, C. T.: On the testing of fully integrated surface subsurface hydrological models, Hydrol. Process., 27, 1276–1285, https://doi.org/10.1002/hyp.9630, 2013. a

Shen, C., Appling, A. P., Gentine, P., Bandai, T., Gupta, H., Tartakovsky, A., Baity-Jesi, M., Fenicia, F., Kifer, D., Li, L., Liu, X., Ren, W., Zheng, Y., Harman, C. J., Clark, M., Farthing, M., Feng, D., Kumar, P., Aboelyazeed, D., Rahmani, F., Song, Y., Beck, H. E., Bindas, T., Dwivedi, D., Fang, K., Höge, M., Rackauckas, C., Mohanty, B., Roy, T., Xu, C., and Lawson, K.: Differentiable modelling to unify machine learning and physical models for geosciences, Nature Reviews Earth and Environment, 4, 552–567, https://doi.org/10.1038/s43017-023-00450-9, 2023. a

Todini, E.: The ARNO rainfall – runoff model, J. Hydrol., 175, 339–382, https://doi.org/10.1016/S0022-1694(96)80016-3, 1996. a

Towler, E., Foks, S. S., Dugger, A. L., Dickinson, J. E., Essaid, H. I., Gochis, D., Viger, R. J., and Zhang, Y.: Benchmarking high-resolution hydrologic model performance of long-term retrospective streamflow simulations in the contiguous United States, Hydrol. Earth Syst. Sci., 27, 1809–1825, https://doi.org/10.5194/hess-27-1809-2023, 2023. a

van Verseveld, W. J., Weerts, A. H., Visser, M., Buitink, J., Imhoff, R. O., Boisgontier, H., Bouaziz, L., Eilander, D., Hegnauer, M., ten Velden, C., and Russell, B.: Wflow_sbm v0.7.3, a spatially distributed hydrological model: from global data to local applications, Geosci. Model Dev., 17, 3199–3234, https://doi.org/10.5194/gmd-17-3199-2024, 2024. a

Vereecken, H., Weihermüller, L., Assouline, S., Šimůnek, J., Verhoef, A., Herbst, M., Archer, N., Mohanty, B., Montzka, C., Vanderborght, J., Balsamo, G., Bechtold, M., Boone, A., Chadburn, S., Cuntz, M., Decharme, B., Ducharne, A., Ek, M., Garrigues, S., Goergen, K., Ingwersen, J., Kollet, S., Lawrence, D. M., Li, Q., Or, D., Swenson, S., de Vrese, P., Walko, R., Wu, Y., and Xue, Y.: Infiltration from the pedon to global grid scales: an overview and outlook for land surface modeling, Vadose Zone J., 18, 180191, https://doi.org/10.2136/vzj2018.10.0191, 2019. a

Vidal, J.-P., Martin, E., Franchistéguy, L., Baillon, M., and Soubeyroux, J.-M.: A 50-year high-resolution atmospheric reanalysis over France with the Safran system, Int. J. Climatol., 30, 1627–1644, https://doi.org/10.1002/joc.2003, 2010. a, b

Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C. J., Polat, İ., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., van Mulbregt, P., and SciPy 1.0 Contributors: SciPy 1.0: fundamental algorithms for scientific computing in Python, Nat. Methods, 17, 261–272, https://doi.org/10.1038/s41592-019-0686-2, 2020. a

Wood, E. F., Roundy, J. K., Troy, T. J., van Beek, L. P. H., Bierkens, M. F. P., Blyth, E., de Roo, A., Döll, P., Ek, M., Famiglietti, J., Gochis, D., van de Giesen, N., Houser, P., Jaffé, P. R., Kollet, S., Lehner, B., Lettenmaier, D. P., Peters-Lidard, C., Sivapalan, M., Sheffield, J., Wade, A., and Whitehead, P.: Hyperresolution global land surface modeling: meeting a grand challenge for monitoring Earth's terrestrial water, Water Resour. Res., 47, https://doi.org/10.1029/2010WR010090, 2011. a

Xu, D., Bisht, G., Sargsyan, K., Liao, C., and Leung, L. R.: Using a surrogate-assisted Bayesian framework to calibrate the runoff-generation scheme in the Energy Exascale Earth System Model (E3SM) v1, Geosci. Model Dev., 15, 5021–5043, https://doi.org/10.5194/gmd-15-5021-2022, 2022. a

Yamazaki, D., Ikeshima, D., Tawatari, R., Yamaguchi, T., O'Loughlin, F., Neal, J. C., Sampson, C. C., Kanae, S., and Bates, P. D.: A high-accuracy map of global terrain elevations, Geophys. Res. Lett., 44, 5844–5853, https://doi.org/10.1002/2017GL072874, 2017. a, b, c

Yang, Y., Pan, M., Beck, H. E., Fisher, C. K., Beighley, R. E., Kao, S.-C., Hong, Y., and Wood, E. F.: In quest of calibration density and consistency in hydrologic modeling: distributed parameter calibration against streamflow characteristics, Water Resour. Res., 55, 7784–7803, https://doi.org/10.1029/2018WR024178, 2019. a

Zhu, C., Byrd, R. H., Lu, P., and Nocedal, J.: Algorithm 778: L-BFGS-B: Fortran Subroutines for Large-Scale Bound-Constrained Optimization, ACM T. Math. Software, 23, 550–560, 1997. a

smash v1.0: a differentiable and regionalizable high-resolution hydrological modeling and data assimilation framework

2.1 Forward model statement

2.2 Inverse algorithm

3.1 From sources to ready-to-use Python library

3.2 Documentation

3.3 Computational performance

4.1 Numerical experiments presented

4.2 CONUS – CAMELS: split-sample temporal cross-validation

4.2.1 Numerical experiment settings

4.2.2 Results

4.3 CONUS – CAMELS: regionalization

4.3.1 Numerical experiment settings

4.3.2 Results

4.4 France – Aude River: high-resolution regionalization

4.4.1 Numerical experiment settings

4.4.2 Results

D1 Snow operator ℳsnw

D2 Hydrological operator ℳhy

D3 Routing operator ℳrr

D1 Snow operator ℳ_snw

D2 Hydrological operator ℳ_hy

D3 Routing operator ℳ_rr