Potential-based thermodynamics with consistent  conservative cascade transport for implicit large  eddy simulation: PTerodaC<sup>3</sup>TILES version 1.0

Thuburn, John

doi:https://doi.org/10.5194/gmd-18-3331-2025

Articles | Volume 18, issue 11

https://doi.org/10.5194/gmd-18-3331-2025

Articles | Volume 18, issue 11

Model description paper

06 Jun 2025

Model description paper |

| 06 Jun 2025

Potential-based thermodynamics with consistent conservative cascade transport for implicit large eddy simulation: PTerodaC³TILES version 1.0

John Thuburn

Abstract

A new computational fluid dynamics code for large eddy simulation (LES) of the atmospheric boundary layer and convection is presented and made available. A key novelty is that moist thermodynamics is formulated in terms of thermodynamic potentials, ensuring thermodynamic consistency. Despite the apparent complexity of the thermodynamic potential approach, the model's performance demonstrates that it is feasible and effective at reasonable computational cost for three-dimensional simulations. Semi-implicit semi-Lagrangian numerical methods are used; such methods are unusual for simulating boundary layer and convective flows and are more typical of global atmospheric models. Moreover, the model includes no explicit scheme to represent subgrid-scale fluxes of scalars and momentum but relies instead on the mixing and dissipation resulting from the numerical methods used; in other words, it employs implicit LES (ILES). Sample results from several standard LES test cases show that the model's ability to capture the main aspects of the flows is comparable to other LES models. At the same time, the results highlight limitations of the ILES approach near the bottom boundary and suggest that ILES might need to be augmented in some way, for example, by distributing the convergence of surface fluxes over several model layers. Also, results for a marine stratocumulus case show a significant sensitivity to different options for the numerical methods and parameters used. Further development and application of the code would benefit from a deeper understanding of both the bottom boundary behaviour and the sensitivities to numerics.

Download & links

Article (PDF, 4607 KB)

Supplement (316 KB)

Download & links

How to cite.

Received: 15 Aug 2024 – Discussion started: 11 Oct 2024 – Revised: 04 Mar 2025 – Accepted: 13 Mar 2025 – Published: 06 Jun 2025

1 Introduction

Large eddy simulation (LES) has been an invaluable computational tool in atmospheric science since the early 1970s, both for advancing our understanding of complex atmospheric processes such as boundary layer turbulence and convection and for informing the development of parameterizations of those processes for use in weather and climate models. This article presents the formulation of a new LES code developed by the author with a threefold motivation:

to demonstrate the feasibility of using thermodynamic potentials to achieve a consistent representation of moist thermodynamics in a three-dimensional fluid dynamics code,
to evaluate the use in LES of the sort of numerical methods more usually used in synoptic- and global-scale models, and
to provide a modelling tool that is easy to set up and run and in which it is easy to set up new test cases and diagnostics.

The thermodynamics of moist air is complicated, and atmospheric models often make approximations. Some common approximations introduce inconsistencies between different aspects of the system or with the laws of thermodynamics – see Thuburn (2017 b) for examples. Thermodynamic consistency can be ensured by deriving all thermodynamic quantities from one of the four standard thermodynamic potentials, internal energy, enthalpy, Helmholtz free energy, or Gibbs function, expressed as a function of its natural variables. Provided any approximations are made directly to the thermodynamic potential before deriving other quantities, consistency is maintained. This approach has been proposed for use in ocean modelling for consistent treatment of the complex equation of state of seawater (IOC et al., 2010). More recently, the approach has been advocated for deriving consistent equation sets for the thermodynamics of moist air in atmospheric models (Vallis, 2017; Eldred et al., 2022; Staniforth, 2022). As an alternative, a model can be formulated and implemented directly in terms of the thermodynamic potential and its derivatives (Thuburn, 2017 b; Bowen and Thuburn, 2022 a, b); in this way different approximations to the thermodynamics can be implemented by modifying the minimal number of routines.

For the case of thermodynamic equilibrium (no supersaturation, no condensate in subsaturated air, all components and phases at the same temperature) and in the absence of ice, Thuburn (2017 b) presented a semi-implicit semi-Lagrangian model solving the compressible Euler equations in which the moist thermodynamics was formulated in terms of the Gibbs function for moist air. Whilst encouraging, this initial implementation suffered several limitations. First, because the formulation works in terms of the total Gibbs function for moist air, the water content must be partitioned into vapour and condensate whenever the Gibbs function is evaluated, complicating the calculation. More importantly, since the natural variables for the Gibbs function pressure p and temperature T are intensive variables, knowledge of p and T (and total specific humidity q) alone is insufficient to completely determine the equilibrium state, particularly the partition of water into its three phases, at the triple point. Thus, the implementation is restricted to two phases: vapour and liquid. Finally, the formulation of Thuburn (2017 b) could not represent important nonequilibrium effects such as the delayed freezing of supercooled cloud droplets or the evaporation of rain in subsaturated air.

These limitations were overcome by Bowen and Thuburn (2022 a, b). They formulated the thermodynamics in terms of the internal energy, whose natural variables are the extensive variables specific volume α and specific entropy η, avoiding the difficulty at the triple point. Moreover, by working with the individual internal energy potentials for dry air, water vapour, liquid water, and ice, rather than a combined thermodynamic potential for the air parcel, they were able to separate the calculation of the potentials from the calculation of the air parcel equilibrium state, simplifying the formulation. By expressing the evolution of a subset of variables in terms of thermodynamic forces and a resistivity matrix, they were able to account for departures from thermodynamic equilibrium, while seamlessly approaching the equilibrium case in the limit of zero resistivity.

The codes developed by Thuburn (2017 b) and Bowen and Thuburn (2022 a, b) were two-dimensional vertical slice models, and they were applied to simple buoyant bubble test problems. The apparent complexity of the thermodynamic potential approach, with accompanying concerns about its computational cost, might discourage model developers from pursuing the approach in three-dimensional models. A primary goal of the work described here is to demonstrate that the approach can be applied successfully, without excessive expense, in a three-dimensional model suitable for studying complex boundary layer and convective flows.

Traditionally, LES models for atmospheric applications are often based on relatively simple numerical methods supplemented by more or less sophisticated subgrid models (e.g. Siebesma et al., 2003). Typically, advection schemes are Eulerian and time stepping is explicit, so that stability requires the advective and diffusive Courant numbers to be less than some threshold value of order 1. Global weather and climate models, on the other hand, often use sophisticated (and relatively expensive) advection schemes that are stable for large advective Courant numbers (e.g. Temperton et al., 2001; Lin, 2004; Wood et al., 2014; Melvin et al., 2024), though they require the deformational Courant number to be bounded. Bartello and Thomas (1996) have argued that such large-time-step advection schemes are no longer cost-effective in flow regimes where the energy spectrum is shallower, the Lagrangian and Eulerian timescales become more comparable, and the deformational Courant number is much closer to the advective Courant number. Nevertheless, traditional LES codes are often run with time steps more than an order of magnitude smaller than could be used by a semi-implicit semi-Lagrangian scheme at the same resolution (e.g. Stevens et al., 2005, and compare section 5 below). This observation, combined with recent progress in improving the efficiency of conservative semi-implicit semi-Lagrangian solvers (Thuburn, 2024), encouraged the author to revisit the question by implementing a semi-implicit semi-Lagrangian LES model.

On a closely related point, global models with sophisticated advection schemes sometimes do not include a subgrid model to handle the turbulent downscale cascades of potential enstrophy and energy but rely instead on the dissipative nature of the advection scheme to play that role (e.g. Walters et al., 2017; ECMWF, 2023). In other words, they use a form of implicit large eddy simulation or ILES (e.g. Margolin et al., 2006; Grinstein et al., 2007). Although there have been some pioneering attempts to use or evaluate ILES for boundary layer and convective-scale atmospheric flows (Margolin et al., 1999; Brown et al., 2000; Smolarkiewicz and Prusa, 2002), and there is growing interest (e.g. Pressel et al., 2017; Souza et al., 2023), the approach is still far from mainstream, and there remain many open questions about its strengths and weaknesses and how they depend on details of the numerics. The need to answer these questions is becoming increasingly pressing as global prediction models begin to be used at the kilometre scale (Satoh et al., 2008; Stevens et al., 2019; Hohenegger et al., 2023; Tomassini et al., 2023). PTerodaC³TILES includes no subgrid model and uses the ILES approach. An important secondary goal for its development is to provide a tool to facilitate the study of the ILES properties of semi-implicit, semi-Lagrangian schemes for simulating convective-scale flows.

Finally, the author perceived a need for an LES tool that could be set up to run production science with minimal effort on a desktop machine or even a laptop. PTerodaC³TILES v1.0 comprises a single standalone fortran code and a namelist file. No external packages are needed other than a fortran compiler (with OpenMP shared memory parallel capability if desired). Just by selecting namelist options, the code can generate initial data and implement forcing terms for a number of standard test cases; new cases can easily be implemented by using the routines for existing cases as templates. To avoid large volumes of output, diagnostics are calculated online at run time. Many standard diagnostics are available via namelist switches; others can easily be implemented and output using existing routines as templates. Further details are given in the user manual, available from Zenodo; see the “Code availability” section.

Some aspects of the model formulation are novel or noteworthy; these are introduced briefly here and discussed in more detail in Sect. 2.

Unlike most atmospheric models, changes of phase of water and latent heat release are not treated as separate physics source terms but are fully integrated within the dynamical core's semi-implicit semi-Lagrangian time stepping (Thuburn, 2017 b; Thuburn et al., 2022; Bowen and Thuburn, 2022 a, b, Sect. 2.1 and 2.3 below). To maintain consistency of the thermodynamics, surface fluxes of water imply surface fluxes of mass. The same is true for the somewhat artificial sources of water specified in the domain interior for some test cases. Because of the model's Charney–Phillips vertical staggering, with total density stored at p levels and specific humidities stored at w levels, special care is needed to maintain that consistency (Sect. 2.7). Mass sources accompanying water sources are neglected in most atmospheric models.

In contrast to Thuburn (2017 b) and Bowen and Thuburn (2022 a, b), in PTerodaC³TILES conservative options are available for the advection of moisture and entropy variables. Although the numerical methods do not exactly conserve energy, energy conservation is significantly improved as a side effect of a conservative treatment of entropy and water (Thuburn, 2022).

Even with the use of cheap advection updates during the main solver iterations (Thuburn, 2024), advection remains one of the most expensive components of the model (along with the elliptic solver). Some modifications are made to the SLICE conservative semi-Lagrangian advection scheme (Zerroukat et al., 2009) to improve its efficiency, particularly to minimize the use of conditional code by avoiding searching. Geometrical calculations of coordinate line intersections to determine intermediate departure points are replaced by additional trajectory calculations, and information generated in remapping volume and mass is reused in remapping other fields (Sect. 2.6).

Linearization of the thermodynamics leads to an 11×14 linear subsystem at each model grid point that must be diagonalized in order to build the Helmholtz problem and to enable back substitution for the semi-implicit time stepping. To reduce what would otherwise potentially be a significant computational expense, the equations and unknowns are reordered to exploit the moderate sparsity of the thermodynamic subsystem, which is essentially the same at all grid points. The cost of the diagonalization is thereby reduced to about 30 % of the cost of a full Gaussian elimination for the case of equilibrium thermodynamics. The cost is further reduced by carrying out the diagonalization on all matrices in a grid column at the same time, with the inner loop over vertical levels, to improve vectorization (Sect. 2.10).

Some of the test cases implemented specify the use of Monin–Obukhov theory to compute surface momentum fluxes. Appendix A4 presents a slight reformulation of Monin–Obukhov theory that, with the aid of some curve fitting, enables the friction velocity U_* to be obtained without the need for an iterative calculation and guarantees the existence of a unique solution for U_* even when the validity of Monin–Obukhov theory breaks down.

Section 4 summarizes some of the ways in which the correctness of the formulation and implementation have been verified. Section 5 presents some sample results from standard LES test cases to demonstrate the performance of the model. The conclusions and areas where further work is needed are discussed in Sect. 6.

2 Model formulation

The formulation of PTerodaC³TILES is inspired by semi-implicit semi-Lagrangian dynamical cores such as ENDGame (Wood et al., 2014) and GungHo (Melvin et al., 2024) that iterate towards a (possibly off-centred) Crank–Nicolson time discretization along trajectories. Such a formulation has been found to be stable and robust at large time steps for synoptic-scale flow. A significant departure, though, is in the treatment of moist thermodynamics. First, to guarantee consistency, the thermodynamics is expressed in terms of thermodynamic potentials, in this case the internal energy. Second, processes such as phase changes and latent heat release are not treated as separate physics source terms but directly couple to the dynamics through the dependence of pressure and buoyancy on the thermodynamic state. The wide range of timescales associated with the thermodynamics, ranging from instantaneous for processes in equilibrium to many minutes for some nonequilibrium processes, is naturally handled by the semi-implicit time discretization.

A feature of the consistent treatment of thermodynamics is that any surface or interior source of moisture implies a corresponding source of mass (Sect. 2.7). Such a mass source is neglected in most atmospheric models.

PTerodaC³TILES version 1.0 includes only equilibrium thermodynamics, but almost all of the machinery is in place to deal with the nonequilibrium case. Also, this version includes no representation of microphysical processes; any condensate is simply carried along with the flow. The inclusion of a simple microphysics scheme is a priority for future work.

2.1 Continuous governing equations

The continuous equations to be solved are the same as those solved by Bowen and Thuburn (2022 a), extended to three spatial dimensions, and with the inclusion of Coriolis terms and external source terms for mass, momentum, and water. See that article for a derivation and in-depth discussion. For the purpose of exposition it is convenient to split the governing equations into dynamics and thermodynamics.

2.1.1 Dynamics

The subset of governing equations involving material derivatives is the following:

\begin{array}{l} (1) & \frac{1}{V} \frac{D}{D t} (V ρ) = S_{ρ}, \\ (2) & \frac{D u}{D t} + 2 Ω \times u + α \nabla p + \nabla Φ = S_{u}, \\ (3) & \frac{1}{V} \frac{D}{D t} (V ρ q) = ρ S_{q}, \\ (4) & \frac{1}{V} \frac{D}{D t} (V ρ η) = ρ J^{T} P + ρ S_{η}, \\ (5) & \frac{D X}{D t} = J + S_{X} . \end{array}

Here, ρ is the total fluid density, $u = (u, v, w)$ is the fluid (barycentric) velocity, q is the total specific humidity, η is the total specific entropy, $α = 1 / ρ$ is the total specific volume, p is the pressure, and Ω is the rotation vector of the frame of reference. Φ=gz is the geopotential with g the gravitational acceleration. S and S terms indicate external sources.

The vector $X = (q^{l}, q^{f}, q^{v} η^{v}, q^{l} η^{l}, q^{f} η^{f})^{T}$ (superscript T meaning transpose) encodes the additional thermodynamic information that needs to be predicted in the nonequilibrium case. Superscripts d, v, l, and f indicate dry air, water vapour, liquid water, and frozen water, respectively. J is a vector of thermodynamic fluxes, with P the corresponding thermodynamic forces, defined below. The expression J^TP in Eq. (4) gives the entropy source per unit mass due to nonequilibrium processes¹.

$D / D t$ is the material derivative. In Eqs. (1), (3), and (4), 𝒱 is the material volume element. These equations are written in a form that lends itself to numerical solution using a conservative semi-Lagrangian scheme. The material derivative in Eq. (5) is written in a form that anticipates discretization using an interpolating semi-Lagrangian scheme, on the assumption that it will be sufficient to advect the total specific humidity and entropy conservatively, with a cheaper non-conserving scheme for the components of X. Nevertheless, an option for conservative advection is available. In the equilibrium case, however, J is not needed (see Eq. 13 below), so advection of X becomes superfluous and is automatically switched off.

The user can also choose to include additional tracers stored either at p levels or at w levels (see Sect. 2.2). Since these do not feed back on the dynamics or thermodynamics, we largely omit them from further discussion.

2.1.2 Thermodynamics

The diagnostic equations describing the thermodynamics are as follows:

\begin{array}{l} (6) & q^{v} α^{v} - q^{d} α^{d} = 0, \\ (7) & (1 - δ^{l}) λ^{l} - δ^{l} q^{l} = 0, \\ (8) & (1 - δ^{f}) λ^{f} - δ^{f} q^{f} = 0, \\ (9) & q^{v} + q^{l} + q^{f} - q = 0, \\ (10) & q^{v} α^{v} + q^{l} α^{l} + q^{f} α^{f} - α = 0, \\ (11) & q^{d} η^{d} + q^{v} η^{v} + q^{l} η^{l} + q^{f} η^{f} - η = 0, \\ (12) & p + e_{α}^{d} + e_{α}^{v} = 0, \\ (13) & R J - (P - C) \equiv - R_{PE} = 0 . \end{array}

Where needed, the mass fraction of dry air is given by $q^{d} = 1 - q$ .

Equation (6) states that the water vapour and dry air occupy the same volume within an air parcel. Equations (9)–(11) state that the total specific humidity equals q, the total specific volume equals α, and the total specific entropy equals η.

Equation (12) expresses Dalton's law of partial pressures and is key for coupling the thermodynamics to the dynamics. Here, e^d(α^d,η^d), e^v(α^v,η^v), e^l(η^l), and e^f(η^f) are the internal energy potentials for dry air, water vapour, liquid water, and frozen water, respectively, expressed as functions of their natural variables (Appendix A2). As in Bowen and Thuburn (2022 a, b), condensate is assumed to be incompressible, with α^l and α^f specified constants, so we suppress the dependence of e^l on α^l and of e^f on α^f, and the pressure within any condensate (for example, as needed to compute the Gibbs function) is equal to the total pressure of the surrounding gas. Subscripts α and η on the species internal energies indicate partial derivatives with respect to the respective natural variables.

Equation (13) is a set of phenomenological equations relating thermodynamic fluxes J to the thermodynamic forces P that push the system towards equilibrium (de Groot and Mazur, 1984). R_PE is a residual that should be driven towards zero by the iterative solver. In the present system P is given by

\begin{matrix} (14) & \begin{aligned} P = & \nabla_{X} η = \frac{1}{T^{d}} (g^{v} - g^{l}, g^{v} - g^{f}, T^{d} - T^{v}, T^{d} - T^{l}, \\ {T^{d} - T^{f})}^{T}, \end{aligned} \end{matrix}

where the gradient ∇_Xη is taken at constant mass and energy, Tⁱ is the temperature of species i, and gⁱ is the Gibbs function for species i, $i \in {d, v, l, f}$ (Appendix A2). R is a symmetric and positive semi-definite resistivity matrix controlling the rate at which an air parcel approaches thermodynamic equilibrium. $C = η_{ref} (δ^{l} λ^{l}, δ^{f} λ^{f}, 0, 0, 0)^{T}$ is a vector of switched Lagrange multipliers, with $δ^{l}, δ^{f} \in {0, 1}$ , used in enforcing the constraints that q^l and q^f must be non-negative. For example, if the thermodynamic forces imply evaporation of liquid but q^l is already zero, then the liquid constraint is switched on (δ^l=1) and the solution for λ^l balances the relevant component of P. Finally, Eqs. (7) and (8) are a convenient way to express the complementarity conditions (e.g. Nocedal and Wright, 2006) that either q^l or λ^l must be zero and either q^f or λ^f must be zero. η_ref is an arbitrary reference value, here equal to 1000 J kg⁻¹ K⁻¹, introduced to ensure that Eqs. (7) and (8) are dimensionally correct.

Bowen and Thuburn (2022 b) show how R can be related to the thermal conductivity of air and the molecular diffusivity of water vapour in air for cloud droplets of a given radius. In PTerodaC³TILES v1.0 the resistivity R is set to zero, imposing local thermodynamic equilibrium. Nevertheless, almost all of the machinery is in place to handle the nonequilibrium case.

Note that PTerodaC³TILES has no explicit representation of subgrid variability in temperature or humidity, hence in condensate. Each grid cell is either entirely saturated or entirely unsaturated. In other words, it is an all-or-nothing representation of saturation and condensate.

2.2 Domain, grid, and discretization

The domain of PTerodaC³TILES v1.0 is rectangular and doubly periodic in the horizontal, with flat rigid boundaries at the bottom and top. Cartesian coordinates (x, y, z) are aligned with the domain. When a full three-dimensional Coriolis force is used, then the x and y directions are assumed to be east and north, respectively, but otherwise the model is agnostic about which direction is north.

A C-grid staggering (Arakawa and Lamb, 1977) is used in the horizontal, and a Charney–Phillips grid staggering (Charney and Phillips, 1953) is used in the vertical (Fig. 1). The choice of a Charney–Phillips vertical grid is unusual for LES models but is more common in global models. It avoids computational modes – oscillatory vertical profiles of thermodynamic variables that spuriously satisfy discrete hydrostatic balance – and gives more accurate coupling between vertical velocity and buoyancy on small vertical scales but makes it difficult to formulate an exactly energy-conserving scheme.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f01

Figure 1Schematic showing the placement of model variables on the vertically staggered Charney–Phillips grid. Superscript i can stand for any of d, v, l, and f.

Download

In the following, integer indices are used for p points, with an offset of $1 / 2$ in the relevant direction for velocity points. Vertical indices 1 and N_z correspond to the lowermost and uppermost p levels; vertical indices $1 / 2$ and $N_{z} + 1 / 2$ correspond to the w levels at the bottom and top model boundaries. Horizontal grid spacings Δx and Δy are uniform but need not be equal to each other. The vertical grid spacing Δz may be uniform or stretched. See Appendix A1 for the grid specification in the stretched case. All of the results shown in Sects. 4 and 5 use uniform Δz.

Simple finite-difference or finite-volume approximations are used for gradient and divergence operators. The components of the pressure gradient (and similarly the geopotential gradient) are needed at velocity points:

\begin{matrix} (15) & \begin{aligned} {\frac{\partial p}{\partial x}|}_{i + 1 / 2 j k} = \frac{p_{i + 1 j k} - p_{i j k}}{Δ x}, \\ {\frac{\partial p}{\partial y}|}_{i j + 1 / 2 k} = \frac{p_{i j + 1 k} - p_{i j k}}{Δ y}, \\ {\frac{\partial p}{\partial z}|}_{i j k + 1 / 2} = \frac{p_{i j k + 1} - p_{i j k}}{Δ z_{k + 1 / 2}}, \end{aligned} \end{matrix}

where $Δ z_{k + 1 / 2} = z_{k + 1} - z_{k}$ and with the obvious modifications to allow for periodic boundary conditions. The divergence of the velocity is needed at p points and is given by

\begin{matrix} (16) & \begin{aligned} {\nabla \cdot u|}_{i j k} & = \frac{u_{i + 1 / 2 j k} - u_{i - 1 / 2 j k}}{Δ x} + \frac{v_{i j + 1 / 2 k} - v_{i j - 1 / 2 k}}{Δ y} \\ + \frac{w_{i j k + 1 / 2} - w_{i j k - 1 / 2}}{Δ z_{k}}, \end{aligned} \end{matrix}

where $Δ z_{k} = z_{k + 1 / 2} - z_{k - 1 / 2}$ , with an analogous expression for the divergence of mass flux increments.

Averaging between velocity points and p points is needed at various places in the discretization. Horizontal averaging uses a simple two-point average with weights $1 / 2$ , indicated by ${\overline{(.)}}^{x}$ or ${\overline{(.)}}^{y}$ . For example, the values of α used in the horizontal components of Eq. (2) are given by $1 / {\overline{ρ}}^{x}$ or $1 / {\overline{ρ}}^{y}$ .

Because the vertical grid may be stretched, four different vertical averaging operators are possible:

${\overline{(.)}}^{w}$ , linear interpolation from p levels to w levels;
${\overline{(.)}}^{r}$ , piecewise constant conservative remapping from p levels to w levels;
${\overline{(.)}}^{p}$ , linear interpolation from w levels to p levels; and
${\overline{(.)}}^{s}$ , piecewise constant conservative remapping from w levels to p levels.

Complete expressions for these four operators along with some useful conservation and discrete product rule properties are given in Appendix B of Thuburn et al. (2022). For example, the averaging of velocity components to compute the Coriolis terms (Eq. 2) and departure points (Sect. 2.6) uses a combination of ${\overline{(.)}}^{x}$ , ${\overline{(.)}}^{y}$ , ${\overline{(.)}}^{p}$ , and ${\overline{(.)}}^{w}$ . When a value for p is needed at the bottom and top boundaries, a variant of the ${\overline{(.)}}^{w}$ operator is employed that uses linear extrapolation rather than the default constant extrapolation.

In order to ensure consistent and conservative transport of water and entropy, a key aspect of the model formulation is that a mass budget for a dual w-level density ${\overline{ρ}}^{r}$ that is consistent with the p-level ρ budget should be satisfied (Konor and Arakawa, 2000; Thuburn, 2022; Bendall et al., 2023). Achieving this consistency requires some care in the advection of w-level scalars (Sect. 2.6), both for SLICE and for the cheap advection updates, and also in the discrete formulation of surface sources and interior sources (Sect. 2.7).

In the following sections, for clarity, details of the spatial discretization are suppressed except to indicate where different vertical averaging operators are used.

2.3 Overview of information flow

Before delving further into details, it is useful to take an overview of the flow of information between the dynamics and thermodynamics parts of the model formulation (Fig. 2) to help clarify the organization of the linearization and the derivation of the Helmholtz problem in Sect. 2.8–2.10.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f02

Figure 2Schematic showing the flow of information between the dynamics, thermodynamics at p levels, and thermodynamics at w levels for the case of equilibrium thermodynamics. (For the nonequilibrium case the w-level thermodynamics retains some memory of qⁱ and ηⁱ.) Superscript i stands for any of d, v, l, or f. An overbar indicates a vertical averaging operation from p levels to w levels or vice versa.

Download

For the continuous equations, the role of the thermodynamics is to return the pressure p, given the density ρ, total water q, and total entropy η. To do so, the thermodynamics must determine how q and η are partitioned among the different components and phases q^v, q^l, q^f, η^d, η^v, η^l, and η^f.

The situation is more complicated when discretized with a Charney–Phillips vertical grid staggering, since p and ρ are stored at p levels while q and η are stored at w levels. Moreover, in order to obtain optimal coupling between vertical velocity and buoyancy, the w-level specific volume that appears in the vertical pressure gradient term in Eq. (2) must be calculated not simply from a vertically averaged density $1 / {\overline{ρ}}^{r}$ , but from the w-level η and q and a vertically averaged pressure ${\overline{p}}^{w}$ (Thuburn, 2017 a). It is convenient to express this requirement through an additional equation,

\begin{matrix} (17) & {\overline{p}}^{w} - p^{(w)} \equiv - R_{\overline{p}} = 0, \end{matrix}

where p^(w) is the pressure appearing in Eq. (12) on w levels, and $R_{\overline{p}}$ is a residual that should be driven to zero by the iterative solver. Thus, on w levels a full set of thermodynamic equations (Eqs. 6–13) is solved, while on p levels the following subset of thermodynamic equations is solved:

\begin{array}{l} (18) & {\overline{q^{v}}}^{p} α^{v (p)} + {\overline{q^{l}}}^{p} α^{l} + {\overline{q^{f}}}^{p} α^{f} - \frac{1}{ρ} = 0, \\ (19) & {\overline{q^{v}}}^{p} α^{v (p)} - {\overline{q^{d}}}^{p} α^{d (p)} = 0, \\ (20) & p + e_{α}^{d} + e_{α}^{v} = 0 . \end{array}

Equation (18) determines the p-level specific volume of water vapour α^v(p), and then Eq. (19) determines the p-level specific volume of dry air α^d(p). Finally, the pressure is computed from the dry air and water vapour internal energy potentials in terms of their p-level natural variables α^d(p), ${\overline{η^{d}}}^{p}$ , α^v(p), and ${\overline{η^{v}}}^{p}$ . Note that the specific volumes α^d(p) and α^v(p) are calculated directly from ρ on p levels rather than using vertical averages of the w-level values ${\overline{α^{d}}}^{p}$ , ${\overline{α^{v}}}^{p}$ , ensuring that p responds correctly and locally to changes in ρ.

2.4 Semi-implicit semi-Lagrangian scheme

Splitting the momentum equation into its horizontal and vertical components with $v = (u, v, 0)$ being the horizontal velocity, a semi-implicit semi-Lagrangian discretization of Eqs. (1)–(5) is

\begin{matrix} (21) & {[ρ - a Δ t S_{ρ}]}^{n + 1} - {[ρ + b Δ t S_{ρ}]}_{T}^{n} \equiv - R_{ρ} = 0, \end{matrix}

\begin{matrix} (22) & \begin{aligned} [v & {+ a Δ t ((2 Ω \times u)_{H} + α \nabla_{H} p - S_{v})]}^{n + 1} \\ - {[v - b Δ t ((2 Ω \times u)_{H} + α \nabla_{H} p - S_{v})]}_{D}^{n} \\ \equiv - R_{v} = 0, \end{aligned} \\ (23) & \begin{aligned} [w & {+ a Δ t ((2 Ω \times u)_{V} + α \frac{\partial p}{\partial z} + \frac{\partial Φ}{\partial z} - S_{w})]}^{n + 1} \\ - {[w - b Δ t ((2 Ω \times u)_{V} + α \frac{\partial p}{\partial z} + \frac{\partial Φ}{\partial z} - S_{w})]}_{D}^{n} \\ \equiv - R_{w} = 0, \end{aligned} \\ (24) & \begin{aligned} {[{\overline{ρ}}^{r} (η - a Δ t S_{η})]}^{n + 1} & - {[{\overline{ρ}}^{r} (η + b Δ t S_{η})]}_{T}^{n} \\ - Δ t {\overline{ρ}}^{r} J^{T} P^{n + 1} \equiv - R_{ρ η} = 0, \end{aligned} \\ (25) & \begin{aligned} {[{\overline{ρ}}^{r} (q - a Δ t S_{q})]}^{n + 1} & - {[{\overline{ρ}}^{r} (q + b Δ t S_{q})]}_{T}^{n} \\ \equiv - R_{ρ q} = 0, \end{aligned} \\ (26) & {[X - a Δ t S_{X}]}^{n + 1} - {[X + b Δ t S_{X}]}_{D}^{n} - Δ t J = 0 . \end{matrix}

Here, superscripts n and n+1 are time step indices. Subscripts H and V indicate horizontal and vertical components, respectively. Subscript D indicates a quantity interpolated or remapped to a semi-Lagrangian trajectory departure point or cell, while subscript T indicates a conservatively transported quantity defined by $V^{n + 1} ψ_{T} = [V ψ]_{D}^{n}$ .

Δt is the time step, and a and $b = 1 - a$ are off-centring parameters for the dynamics. To damp acoustic waves that might be generated by initial perturbations imposed to trigger turbulence or by the switching on of forcing terms at the initial time, a is smoothly adjusted from 1 to a user-specified value over the first 900 s of a model run. A value of a=0.51 (after the initial adjustment) is used for the results shown in Sect. 5.

The terms R_ρ, R_v, R_w, R_ρη, and R_ρq represent residuals in their respective equations. In the target discretization these residuals should be zero. The iterative solver described in the following sections attempts to drive those residuals to zero. No residual appears in Eq. (26) because that equation is used to diagnose the fluxes J so it is always satisfied exactly.

The subsystem of thermodynamics equations (Eqs. 6–13) is solved at step n+1, and it is Pⁿ⁺¹ that appears in Eq. (24), effectively giving a backward Euler treatment of the thermodynamics. The thermodynamic processes of interest typically involve relaxation towards equilibrium rather than oscillation about equilibrium. A backward Euler treatment should be sufficiently accurate for nonequilibrium processes whose timescale is much longer than Δt, such as evaporation of falling rain. For processes whose timescale is shorter than about $Δ t / 2$ , a backward Euler step is preferable to a Crank–Nicolson step since Crank–Nicolson can overshoot the equilibrium solution. If the form of the resistivity matrix implies that any part of the system is in equilibrium (e.g. T^v=T^d), then a backward Euler step is essential, since the equilibrium must be imposed at step n+1 and not as a time average.

2.5 Time-stepping algorithm

The semi-implicit semi-Lagrangian scheme described in Sect. 2.4 is both nonlinear in the unknown step n+1 values and nonlocal because unknown values at neighbouring grid points are coupled. The equations are solved using an iterative quasi-Newton algorithm (Algorithm 1). By elimination of unknowns, the linear system for the Newton update is reduced to a more or less standard Helmholtz problem, which is solved using a multigrid method (Sect. 2.11). To avoid the expense of computing conservative semi-Lagrangian transport multiple times per step, a single full advection calculation is made once at the start of the time step, and relatively cheap updates to the transport are made at each solver iteration (Zerroukat and Allen, 2020). These cheap transport updates use simple upwind or centred schemes and are made in such a way that the transport of scalars remains conservative and bounded (assuming conservative and bounded options have been chosen by the user) and consistent with the transport of mass at every solver iteration (Thuburn, 2024).

Algorithm 1Computations performed to take one model time step.

Compute time step n terms

Initialize step n+1 state variables to step n state

Full advection calculation: compute [.]_D and [.]_T terms

for ℓ=1 to N_ℓ do

Compute step n+1 terms based on iteration ℓ−1 values

Compute residuals

Build and solve the Helmholtz problem

Back-substitute, updating all transport terms and state variables

end for

The number of solver iterations takes a default value N_ℓ=3. This was found to be sufficient for all test cases simulated except DYCOMS, for which N_ℓ=4 was needed. This exceptional case is discussed in Sects. 4 and 5.

When cheap transport updates are employed, the resulting time integration scheme at solver convergence is not quite as written in Sect. 2.4, with semi-Lagrangian advection by the trajectory-average velocity. Rather, the net transport results from semi-Lagrangian advection using the first-guess trajectory-average velocity followed by a sequence of small corrections. Nevertheless, the end result is very close to the target discretization and appears to work well in practice.

An attractive aspect of the use of the consistent and conservative cheap transport updates is that, after the first solver iteration, the residuals in Eqs. (21), (24), and (25) and the equations for any advected tracers are very small and result only from changes in source terms between one iteration and the next (Thuburn, 2024). It is likely that this helps solver convergence (Sect. 4.4).

2.6 Advection

All advection terms are computed using semi-Lagrangian schemes. Different advection options are available depending on the advected variable and its location on the grid. All momentum equation components are advected with an interpolating (non-conservative) semi-Lagrangian scheme. Terms in the total fluid density equation and the mass fractions of any p-level tracers are advected using the mass-conserving SLICE scheme (Zerroukat et al., 2009). Advected terms in the water and entropy equations (Eqs. 24–26), as well as any w-level tracers, may be advected with either an interpolating semi-Lagrangian scheme or with SLICE.

Both the interpolating semi-Lagrangian scheme and SLICE use the cascade idea (Purser and Leslie, 1991) to replace a three-dimensional interpolation or remapping by a sequence of one-dimensional interpolations or remappings (Fig. 3).

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f03

Figure 3Departure points and cascade remapping in two dimensions. The red line D_f to A_f is an example full trajectory to a v-point arrival point A_f from the corresponding departure point D_f. The x and y coordinates of A_f are known, and the x and y coordinates of D_f are to be calculated. The blue line D_i to A_i is an example intermediate trajectory to A_i from D_i. The x coordinate of A_i and the y coordinate of D_i are known, and the y coordinate of A_i and the x coordinate of D_i are to be calculated. Once the intermediate and full departure points are found, fields are conservatively remapped in the x direction, from the model grid cells (straight black lines) to intermediate cells (dashed blue lines), and then in the s direction to the departure cells (red curves). The remapped field is then transported from its departure cells (e.g. pink cell) to the corresponding arrival cells (e.g. grey cell).

Download

The interpolating semi-Lagrangian scheme is based on cubic Lagrange interpolation. For the vertical interpolation, modifications are needed near the upper and lower boundaries. Between the uppermost pair of data points and between the lowermost pair of data points a modified cubic interpolation is used that uses only the two nearest data points and is very close to a linear interpolation. For terms in the u and v equations, when extrapolation above the uppermost data point is needed constant extrapolation is used. When extrapolation below the lowest data point is needed, the scheme uses either constant extrapolation or linear interpolation between the data value at level 1 and zero at the surface, according to whether the user selects a freeslip or noslip $⊔ ⊔$ boundary condition for the advection.

For the SLICE scheme, for each advected field the user may choose between piecewise constant, piecewise parabolic (Colella and Woodward, 1984), and parabolic spline (Zerroukat et al., 2007) remapping schemes. Experimentation suggests that, for most purposes, piecewise constant remapping is adequate for cell volume, divergence, and density. The results shown in Sect. 5 use this option, with parabolic spline method remapping for entropy and water. For each advected field the user has the option to use a simple limiter ensuring boundedness of the advected field. The limiter is redundant in the case of SLICE advection with piecewise constant remapping. In Sect. 5 the limiter is used for advection of entropy and water but not for advection of velocity components.

Some modifications to previous implementations of cascade advection schemes are made to reduce expensive searching and conditional code and to maximize reuse of information. First, in addition to the trajectory departure points, three-dimensional cascade interpolation or remapping requires two sets of intermediate departure points. Rather than construct these intermediate departure points by computing intersections between the arrival coordinate system defined by the model grid and the departure or Lagrangian coordinate system, as in previous work (e.g. Purser and Leslie, 1991; Nair et al., 2002; Zerroukat et al., 2002), here they are computed by separate trajectory calculations. The idea is illustrated in Fig. 3 for the two-dimensional case. A first guess followed by a single fixed point iteration is used for all trajectory calculations.

Second, we want to remap the density field using a volume coordinate to ensure consistency with the trajectory-average divergence (see below), and we want to remap water, entropy, and tracers in a mass coordinate to ensure that their advection is conservative, bounded if desired, and consistent with the density advection. However, neither the volume coordinate nor the mass coordinate is a simple function of cell index, so it might appear, at first glance, that expensive searching is needed to determine the necessary origin grid indices in these coordinates. This difficulty is circumvented with the aid of the following insight. To compute a one-dimensional remapping of a field f from an origin grid to a destination grid, the information needed comprises the origin grid cell average values of f, the origin grid coordinate intervals Δs_k, the origin grid indices k_i corresponding to destination grid cell edges i, and the cell fractions ξ_i in the s coordinate (Fig. 4). Fields are then remapped in the following sequence.

Remap cell volume in a geometrical coordinate x, y, or z; the origin grid indices k_i and cell fractions ξ_i are easily determined.
Remap density in a volume coordinate; the origin grid indices k_i are unchanged while the cell fractions ξ_i are obtained as a by-product of the volume remapping. For example, if Fig. 4 represents the remapping of volume in a geometrical coordinate, then the volume-coordinate cell fraction needed for the density remapping is given by the ratio of the dark shaded area to the full shaded area.
Remap p-level tracer mass fractions in a mass coordinate; the origin grid indices k_i are again unchanged while the cell fractions ξ_i are obtained as a by-product of the density remapping.
Remap w-level scalars in a mass coordinate. The origin grid indices k_i and cell fractions ξ_i must correspond to a mass coordinate based on ${\overline{ρ}}^{r}$ ; they can be computed from the mass coordinate k_i and ξ_i at the p levels immediately below and above the w level in question.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f04

Figure 4Information used for one-dimensional conservative remapping of a field between an origin grid (cell edges indicated in blue) and a destination grid (cell edges indicated in red). The remapping coordinate s may be a geometrical coordinate (x, y, or z), volume, or mass. Δs_k is an origin grid cell size in the s coordinate. k_i is the origin grid cell index in which the destination grid edge i sits. ξ_i is the fraction of the origin grid cell k_i, measured in the s coordinate, to the left of destination edge i.

Download

Thuburn et al. (2010) found that the accuracy of a semi-implicit semi-Lagrangian scheme with conservative semi-Lagrangian advection of density relies on the semi-Lagrangian departure volumes (departure areas in the shallow water context) being consistent with the trajectory-average divergence. Subsequent work showed that this divergence consistency criterion is crucial for stability too, unless a strong off-centring is employed. However, it appears very difficult to enforce this criterion directly within the trajectory calculations. Thuburn et al. (2010) solved this problem by conservatively advecting the divergence field, hence constructing the required departure cell areas, and using an area coordinate for the final SLICE remapping sweep in the advection of mass. Here a slightly different approach is taken to enable the remapping sequence discussed above. First, the divergence field is conservatively advected, and the information is used to compute the required departure cell volumes. These required departure cell volumes are compared with the actual departure cell volumes returned by SLICE, and the difference is used to compute a (small) divergent velocity increment sufficient to correct the discrepancy. The semi-Lagrangian advection of all other advected fields is then carried out, following which the machinery for making cheap advection updates is used to update the advection of all fields using this velocity increment and so satisfy divergence consistency.

One of the benefits of the Charney–Phillips vertical grid is that the colocation of w and η permits a tight coupling between vertical motion and buoyancy. However, when the entropy transport is computed in flux form, that tight coupling is lost unless the horizontal fluxes are corrected to account for the vertical gradients of η, $F_{x} = {\overline{ρ}}^{x} u$ , and $F_{y} = {\overline{ρ}}^{y} v$ (Thuburn, 2022). Thuburn (2022) showed how the horizontal remapping of w-level fields in SLICE could be modified to implement the required correction. Here, since the correction is generally small, a much simpler approach is taken. Before carrying out the main SLICE advection of entropy, a small transport correction is made to the entropy using the horizontal fluxes,

\begin{matrix} (27) & \begin{aligned} F_{x}^{η} = a^{w} b^{w} Δ z^{2} \frac{\partial F_{x}}{\partial z} {\overline{\frac{\partial {\overline{η}}^{x}}{\partial z}}}^{w}, \\ F_{y}^{η} = a^{w} b^{w} Δ z^{2} \frac{\partial F_{y}}{\partial z} {\overline{\frac{\partial {\overline{η}}^{y}}{\partial z}}}^{w}, \end{aligned} \end{matrix}

where a^w and b^w are the coefficients associated with the ${\overline{(.)}}^{w}$ operator. All quantities are evaluated at step n, and the fluxes are applied over the time step Δt. Subscripts $k + 1 / 2$ indicating the vertical level have been omitted for clarity. Equation (27) is a slight generalization of Eq. (7) of Thuburn (2022) to allow for a vertically stretched grid. An analogous correction is applied to the transport of all conservatively transported w-level scalars. However, because this simple formulation does not guarantee boundedness of advected scalars, the user may choose whether or not to include the correction. The correction is included for the results shown in Sect. 5.

In the back-substitution stage of the time step, cheap updates are made to all transported terms [.]_D and [.]_T to account for the velocity increments u^′, applied over the backward part of the time step aΔt. The transported terms in the momentum equation are updated using an advective-form first-order upwind scheme. The transported term in the density equation is updated using a flux-form scheme with mass flux increments $F^{'} = \hat{ρ} u^{'}$ , where the cell edge values $\hat{ρ}$ are given by a second-order centred scheme. The transported terms in any p-level tracer equations are updated using a flux-form scheme with tracer flux increments $F^{'}^{χ} = F^{'} \hat{χ}$ , where the cell edge values $\hat{χ}$ are given by a first-order upwind scheme. Finally, the transport terms for any w-level scalars are updated using a flux-form scheme with scalar flux increments $F^{'}^{χ} = {\overline{F^{'}}}^{d} \hat{χ}$ , with $\hat{χ}$ again given by a first-order upwind scheme. The notation ${\overline{(.)}}^{d}$ here indicates that horizontal mass flux increments have been mapped to w levels using the ${\overline{(.)}}^{r}$ operator and vertical mass flux increments have been mapped to p levels using the ${\overline{(.)}}^{p}$ operator for consistency with the dual mass budget (Thuburn et al., 2022). This way of constructing the scalar flux increments ensures that the scalar transport remains consistent with density transport and conservative and bounded if the full semi-Lagrangian transport was conservative and bounded.

2.7 Boundary and interior forcing terms

Each simulated case requires the implementation of appropriate boundary and interior forcing terms. For clarity of the code, these boundary and forcing terms are implemented on a case-by-case basis. For the surface fluxes, the cases that are already implemented include examples of the setups most commonly used in LES: heat and moisture fluxes as specified functions of time, heat and moisture fluxes determined from a specified surface temperature and relative humidity using a simple bulk model, momentum fluxes determined by a simple bulk model, and momentum fluxes determined by Monin–Obukhov similarity theory.² A case using Monin–Obukhov theory to determine heat and moisture fluxes has not yet been implemented.

This section highlights some aspects of how the boundary and interior forcing terms are implemented for consistency with the rest of the model formulation.

2.7.1 Consistent surface fluxes of mass and water

Under a careful and consistent treatment of the moist thermodynamics, a surface flux of water implies a surface flux of mass. Most atmospheric models neglect that flux of mass. Because of the Charney–Phillips vertical staggering and the fact that the predicted density is the total density rather than the dry density, care is needed in computing the density and water increments to ensure that the changes in both total mass and total water within the model agree with the time-integrated surface fluxes.

First consider the forward part of the time step, i.e. the […]ⁿ terms in Eqs. (21) and (25). Let ℱ^ρ be the surface flux of mass and ${\overline{ρ^{n}}}_{1 / 2}^{r} F^{q}$ be the surface flux of water (both in kg m⁻² s⁻¹). Recall that the w-level density at level $k + 1 / 2$ must equal the conservatively remapped p-level density ${\overline{ρ}}_{k + 1 / 2}^{r}$ . Thus, any change in total density at level 1 affects the density of water at level $3 / 2$ , ${\overline{ρ^{n}}}_{3 / 2}^{r} q_{3 / 2}$ , as well as the density of water at level $1 / 2$ , ${\overline{ρ^{n}}}_{1 / 2}^{r} q_{1 / 2}$ . Nevertheless, imposing the constraints that

only the density at level 1 may change, and the total change in column mass per unit area must equal ℱ^ρbΔt, and
only the specific humidity at level $1 / 2$ may change, and the total change in column water mass per unit area must equal ${\overline{ρ^{n}}}_{1 / 2}^{r} F^{q} b Δ t$ ,

leads to unique solutions for the increment in density at level 1,

\begin{matrix} (28) & δ ρ_{1} = \frac{F^{ρ} b Δ t}{Δ z_{1}}, \end{matrix}

and the increment in specific humidity at level $1 / 2$ ,

\begin{matrix} (29) & δ q_{1 / 2} = \frac{(ρ_{1}^{n} F^{q} - {\overline{q^{n}}}_{1}^{s} F^{ρ}) b Δ t}{ρ_{1}^{+} Δ z_{1 / 2}}, \end{matrix}

where $ρ_{1}^{+} = ρ_{1}^{n} + δ ρ_{1}$ , and we have used the fact that ${\overline{ρ}}_{1 / 2}^{r} \equiv ρ_{1}$ .

For the backward part of the time step, i.e. the […]ⁿ⁺¹ terms in Eqs. (21) and (25), an analogous argument leads to

\begin{matrix} (30) & δ ρ_{1} = \frac{F^{ρ} a Δ t}{Δ z_{1}} \end{matrix}

and

\begin{matrix} (31) & δ q_{1 / 2} = \frac{(ρ_{1}^{n + 1} F^{q} - {\overline{q^{n + 1}}}_{1}^{s} F^{ρ}) a Δ t}{ρ_{1}^{-} Δ z_{1 / 2}}, \end{matrix}

where $ρ_{1}^{-} = ρ_{1}^{n + 1} - δ ρ_{1}$ . Terms at time step n+1 are approximated by the latest available estimate.

The treatment of surface entropy fluxes is analogous, with q replaced by η and ℱ^q replaced by ℱ^η.

2.7.2 Surface drag

The various test cases implemented specify how the vertical flux of horizontal momentum is to be parameterized at the surface. Since there are no parameterized subgrid fluxes in the interior of the domain, the convergence of the parameterized momentum flux occurs entirely in the lowest model layer; i.e. it is applied to the u and v components at model level 1.

Some of the test cases implemented specify that the surface momentum flux is to be computed using Monin–Obukhov similarity theory. However, there are two difficulties. First, Monin–Obukhov similarity theory expresses the mean flow speed U(z) at some height z as a nonlinear function of the friction velocity U_*. What we require in a numerical model is to express the friction velocity, hence the surface momentum flux, in terms of the known flow speed at the lowest model level; however, in the usual approach, this requires the iterative solution of a nonlinear problem in each grid column. Second, and more seriously, in the stable case (negative surface buoyancy flux) in light winds Monin–Obukhov similarity theory is valid only over a limited height range that might be less than the height of the lowest model level z₁. In this case, with commonly used stability functions, the theory can produce either no solution or multiple solutions for U_*, given U(z₁). To avoid these difficulties, a slight reformulation of Monin–Obukhov theory is used to compute surface momentum fluxes (Appendix A4).

2.7.3 Consistent interior forcing terms

For test cases that require a source of moisture in the interior of the model domain, consistency between the p-level mass budget and the w-level dual mass budget requires that

\begin{matrix} (32) & {(ρ S_{q})}_{k + 1 / 2} = {({\overline{S_{ρ}}}^{r})}_{k + 1 / 2} . \end{matrix}

The simplest way to satisfy this condition is to specify the required moisture/mass source first on p levels and then remap conservatively to w levels. For simplicity, interior entropy sources are specified in the same way.

2.8 Linearized dynamics

To take a model time step, a quasi-Newton method is used to solve Eqs. (21)–(26) along with the p-level and w-level thermodynamic equations. After some number of quasi-Newton iterations, using the latest available estimates to evaluate step n+1 terms, the residuals R_ρ, R_v, R_w, R_ρη, R_ρq, R_PE, and $R_{\overline{p}}$ will generally be non-zero. We seek increments or updates to the step n+1 model variables that will reduce those residuals. This is done through an approximate linearization of the governing equations (this section and Sect. 2.9), leading to a large linear system for the increments. Despite the apparent complexity of this system, systematic elimination of unknowns, partly manually and partly numerically, leads to a nearly standard Helmholtz problem for a single unknown per model grid point: the pressure increment p^′ (Sect. 2.10). The Helmholtz problem can be solved efficiently using well-established methods; here a multigrid method is used. Once p^′ is found, the other increments can be found through back substitution (Sect. 2.11).

The approximate linearization of Eqs. (21)–(26) is

\begin{matrix} (33) & ρ^{'} + a Δ t \nabla \cdot F^{'} = R_{ρ}, \\ (34) & v^{'} + a Δ t (α_{*} \nabla_{H} p^{'} - S_{v}^{'}) = R_{v}, \\ (35) & w^{'} + a Δ t (α_{*} \frac{\partial p^{'}}{\partial z} + α^{'} \frac{\partial p_{*}}{\partial z}) = R_{w}, \end{matrix}

\begin{matrix} (36) & \begin{aligned} {\overline{ρ^{(ℓ + 1)}}}^{r} η^{'} + {\overline{ρ^{'}}}^{r} {(η - a Δ t S_{η})}^{(ℓ)} & - {[{\overline{ρ}}^{r} (η + b Δ t S_{η})]}_{T}^{'} \\ = R_{ρ η}, \end{aligned} \\ (37) & \begin{aligned} {\overline{ρ^{(ℓ + 1)}}}^{r} q^{'} + {\overline{ρ^{'}}}^{r} {(q - a Δ t S_{q})}^{(ℓ)} & - {[{\overline{ρ}}^{r} (q + b Δ t S_{q})]}_{T}^{'} \\ = R_{ρ q}, \end{aligned} \\ (38) & X^{'} - Δ t J^{'} = 0 . \end{matrix}

An asterisk on any variable indicates a reference value for the linearization. Here the most recent estimate for the step n+1 value is used as the reference value, avoiding the need to store additional three-dimensional fields. Certain terms in Eqs. (36) and (37) are explicitly evaluated at iteration number ℓ or ℓ+1. This specific way of writing the linearization ensures that, when Eqs. (36) and (37) are used in the back substitution, η and q are incremented exactly according to the consistent and conservative transport updates. Since the source terms S_ρ are generally non-stiff, their linearizations are mostly omitted. The exception is a linearization of the surface drag at model level 1 $S_{v}^{'} = - v^{'} / τ_{drag}$ , where τ_drag is a surface drag timescale computed alongside the surface momentum flux. This term is omitted from the Helmholtz problem (in principle it could be included) but is included in the back substitution.

In order to arrive at a standard Helmholtz problem for the pressure increments we must make a w^′N² term appear in the linearization, where N² is an appropriately defined buoyancy frequency squared. This requires us to work with a linearization of advective-form transport equations for η and q. Consider the η equation (Eq. 36). In the back substitution the transport increment is computed as

\begin{matrix} (39) & {[{\overline{ρ}}^{r} (η + b Δ t S_{η})]}_{T}^{'} = - a Δ t \nabla \cdot ({\overline{F^{'}}}^{d} \hat{η}) \end{matrix}

for some (upwind) cell edge values $\hat{η}$ . Thus, Eq. (36) becomes

\begin{matrix} (40) & {\overline{ρ^{(ℓ + 1)}}}^{r} η^{'} + {\overline{ρ^{'}}}^{r} {(η - a Δ t S_{η})}^{(ℓ)} + a Δ t \nabla \cdot ({\overline{F^{'}}}^{d} \hat{η}) = R_{ρ η} . \end{matrix}

Subtracting $\hat{η}$ times a vertical average of Eq. (33) and approximating ${\overline{ρ^{(ℓ + 1)}}}^{r}$ by a generic reference density ρ_* gives

\begin{matrix} (41) & η^{'} + a Δ t w^{'} \frac{\partial η}{\partial z} = \frac{R_{ρ η} - \hat{η} \overline{R_{ρ}}}{ρ_{*}} \equiv R_{η} . \end{matrix}

Proceeding in a similar way for the q equation (Eq. 37) gives

\begin{matrix} (42) & q^{'} + a Δ t w^{'} \frac{\partial q}{\partial z} = \frac{R_{ρ q} - \hat{q} \overline{R_{ρ}}}{ρ_{*}} \equiv R_{q} . \end{matrix}

2.9 Linearized thermodynamics

When the w-level thermodynamic state is updated, certain fields are diagnosed, ensuring that Eqs. (6)–(11) are satisfied exactly: q^d is set equal to 1−q, Eq. (9) gives q^v, Eq. (10) gives α^v, and Eq. (6) gives α^d. Also, the updating of terms in Eqs. (7) and (8) during back substitution is done in such a way that those equations remain exactly satisfied. Thus, no residual term appears in the linearized versions of these equations.

The linearized forms of Eqs. (6)–(11) are then as follows:

\begin{matrix} (43) & q^{v} α^{v}^{'} + α^{v} q^{v}^{'} - q^{d} α^{d}^{'} + α^{d} q^{'} = 0, \\ (44) & (1 - δ^{l}) λ^{l}^{'} - δ^{l} q^{l}^{'} = 0, \\ (45) & (1 - δ^{f}) λ^{f}^{'} - δ^{f} q^{f}^{'} = 0, \\ (46) & q^{v}^{'} + q^{l}^{'} + q^{f}^{'} - q^{'} = 0, \\ (47) & q^{v} α^{v}^{'} + α^{v} q^{v}^{'} + α^{l} q^{l}^{'} + α^{f} q^{f}^{'} - α^{'} = 0, \\ (48) & \begin{aligned} q^{d} η^{d}^{'} & - η^{d} q^{'} + q^{v} η^{v}^{'} + η^{v} q^{v}^{'} + q^{l} η^{l}^{'} + η^{l} q^{l}^{'} + q^{f} η^{f}^{'} \\ + η^{f} q^{f}^{'} - η^{'} = 0 . \end{aligned} \end{matrix}

Note that $q^{d}^{'} = - q^{'}$ has been used to eliminate increments of the dry mass fraction. The linearization of Eq. (17) is

\begin{matrix} (49) & {\overline{p^{'}}}^{w} - p^{(w)}^{'} = R_{\overline{p}} . \end{matrix}

With the aid of Eq. (38), the linearized phenomenological equations become

\begin{matrix} (50) & \frac{1}{Δ t} R X^{'} - P^{'} + C^{'} = R_{PE}; \end{matrix}

for brevity the details of P^′ and C^′ are suppressed.

Two modifications are then made to the linear system. First, a change of variable is made, introducing ${\tilde{λ}}^{l}^{'} = λ^{l}^{'} + q^{l}^{'}$ and ${\tilde{λ}}^{f}^{'} = λ^{f}^{'} + q^{f}^{'}$ . Second, η_ref times Eq. (44) is added to the q^l^′ phenomenological equation and η_ref times Eq. (45) is added to the q^f^′ phenomenological equation. These two modifications guarantee that the coefficient of q^l^′ in Eq. (44) and the coefficient of q^f^′ in Eq. (45) are nonzero and that the coefficients of ${\tilde{λ}}^{l}^{'}$ and ${\tilde{λ}}^{f}^{'}$ in the relevant phenomenological equations are nonzero. In this way, the sparsity pattern of the system matrix is known irrespective of the state of the switches δ^l and δ^f and is then effectively the same at all grid points.

The resulting system of linear equations may be compactly written

\begin{matrix} (51) & \tilde{M} Z^{'} = {\tilde{R}}_{M}, \end{matrix}

where $\tilde{M}$ is an 11×14 matrix, and $(Z^{'})^{T} = ((Y^{'})^{T}, q^{'}, α^{'}, η^{'})$ , where $(Y^{'})^{T} = (α^{v}^{'}, q^{l}^{'}, q^{f}^{'}, q^{v}^{'}, α^{d}^{'}, η^{d}^{'}, η^{v}^{'}, η^{l}^{'}, η^{f}^{'}, {\tilde{λ}}^{l}^{'}, {\tilde{λ}}^{f}^{'})$ . The different ordering of the rows and columns of $\tilde{M}$ compared to Bowen and Thuburn (2022 a) allows a better exploitation of the sparsity in the Gaussian elimination step discussed in Sect. 2.10.

The linearized versions of the p-level thermodynamic equations (Eqs. 18–20) are

\begin{array}{l} (52) & {\overline{q^{v}^{'}}}^{p} α^{v (p)} + {\overline{q^{v}}}^{p} α^{v (p)}^{'} + {\overline{q^{l}^{'}}}^{p} α^{l} + {\overline{q^{f}^{'}}}^{p} α^{f} + \frac{ρ^{'}}{ρ_{*}^{2}} = 0, \\ (53) & {\overline{q^{v}^{'}}}^{p} α^{v (p)} + {\overline{q^{v}}}^{p} α^{v (p)}^{'} + {\overline{q^{'}}}^{p} α^{d (p)} - {\overline{q^{d}}}^{p} α^{d (p)}^{'} = 0, \\ (54) & p^{'} + e_{α α}^{d} α^{d (p)^{'}} + e_{α η}^{d} {\overline{η^{d}^{'}}}^{p} + e_{α α}^{v} α^{v (p)^{'}} + e_{α η}^{v} {\overline{η^{v}^{'}}}^{p} = 0 . \end{array}

2.10 Derivation of the Helmholtz problem

In order to derive a Helmholtz problem, it will be useful to express all other thermodynamic increments at w levels (the components of Y^′) in terms of q^′, α^′, and η^′. This is done by carrying out a numerical Gaussian elimination on $\tilde{M}$ to leave

\begin{matrix} (55) & M Z^{'} = R_{M}, \end{matrix}

where M is of the form

\begin{matrix} (56) & M = (I C_{1} C_{2} C_{3}) . \end{matrix}

I is the 11×11 identity matrix, and C₁, C₂, and C₃ are columns of (generally) nonzero entries. For efficiency, the Gaussian elimination exploits the known sparsity pattern of the matrix $\tilde{M}$ , and, since the sparsity pattern is the same for all grid points, the elimination can be done without conditional code and can be implemented with the innermost loop over model levels.

The entries M_ij of the eliminated matrix M and the entries R_Mi of the eliminated right-hand side R_M are used in building the coefficients and right-hand side of the Helmholtz problem. A subset of them (rows 2, 3, and 7–11 of C₁, C₂, C₃, and R_M, corresponding to the equations for ( $λ^{l}^{'}, λ^{f}^{'}, q^{l}^{'}, q^{f}^{'}, η^{v}^{'}, η^{l}^{'}, η^{f}^{'}$ )) are saved for use in back substitution.

Next, we need an equation relating p^′, α^′, and w^′ at w levels and an analogous equation relating p^′, ρ^′, and w^′ at p levels. With the aid of Eq. (55), the pressure perturbation at w levels is given by

\begin{matrix} (57) & \begin{aligned} p^{(w)}^{'} & = - e_{α α}^{d} α^{d}^{'} - e_{α η}^{d} η^{d}^{'} - e_{α α}^{v} α^{v}^{'} - e_{α η}^{v} η^{v}^{'} \\ = \frac{\partial p}{\partial q} q^{'} + \frac{\partial p}{\partial α} α^{'} + \frac{\partial p}{\partial η} η^{'} + R_{TDP}, \end{aligned} \end{matrix}

where

\begin{array}{l} (58) & \frac{\partial p}{\partial q} = e_{α α}^{d} M_{5 12} + e_{α η}^{d} M_{6 12} + e_{α α}^{v} M_{1 12} + e_{α η}^{v} M_{7 12}, \\ (59) & \frac{\partial p}{\partial α} = e_{α α}^{d} M_{5 13} + e_{α η}^{d} M_{6 13} + e_{α α}^{v} M_{1 13} + e_{α η}^{v} M_{7 13}, \\ (60) & \frac{\partial p}{\partial η} = e_{α α}^{d} M_{5 14} + e_{α η}^{d} M_{6 14} + e_{α α}^{v} M_{1 14} + e_{α η}^{v} M_{7 14}, \\ (61) & R_{TDP} = e_{α α}^{d} R_{M 5} + e_{α η}^{d} R_{M 6} + e_{α α}^{v} R_{M 1} + e_{α η}^{v} R_{M 7} . \end{array}

Defining the sound speed c by

\begin{matrix} (62) & \frac{\partial p}{\partial α} = - ρ_{*}^{2} c^{2} \end{matrix}

and using Eqs. (36) and (37) to eliminate η^′ and q^′, Eq. (49) becomes

\begin{matrix} (63) & \frac{{\overline{p^{'}}}^{w}}{c^{2}} + ρ_{*}^{2} α^{'} + a Δ t ρ_{*} w^{'} \frac{N^{2}}{g} = R_{buoy}, \end{matrix}

where

\begin{matrix} (64) & ρ_{*} \frac{N^{2}}{g} = \frac{1}{c^{2}} \{\frac{\partial p}{\partial q} \frac{\partial q}{\partial z} + \frac{\partial p}{\partial η} \frac{\partial η}{\partial z}\} \end{matrix}

and

\begin{matrix} (65) & R_{buoy} = \frac{1}{c^{2}} \{R_{\overline{p}} + R_{TDP} + \frac{\partial p}{\partial q} R_{q} + \frac{\partial p}{\partial η} R_{η}\} . \end{matrix}

Deriving the analogous equation on p levels is a little more subtle because p^′ depends both on ρ^′ and, via qⁱ^′ and ηⁱ^′, on the w-level α^′ averaged to p levels. Setting qⁱ^′ and ηⁱ^′ to zero in Eqs. (52)–(54) shows that

\begin{matrix} (66) & {\frac{\partial p}{\partial ρ}|}_{{\overline{α}}^{p}} = \frac{1}{ρ_{*}^{2}} (\frac{e_{α α}^{d}}{{\overline{q^{d}}}^{p}} + \frac{e_{α α}^{v}}{{\overline{q^{v}}}^{p}}) \equiv {\hat{c}}^{2}, \end{matrix}

where $\hat{c}$ is a reduced sound speed (typically very close to and slightly larger than c). Hence

\begin{matrix} (67) & {\frac{\partial p}{\partial {\overline{α}}^{p}}|}_{ρ} = - ρ_{*}^{2} (c^{2} - {\hat{c}}^{2}), \end{matrix}

while $\partial p / \partial q$ and $\partial p / \partial η$ are given sufficiently accurately by their w-level values averaged to p levels. Proceeding in this way, Eq. (54) becomes

\begin{matrix} (68) & p^{'} - {\hat{c}}^{2} ρ^{'} + ρ_{*}^{2} (c^{2} - {\hat{c}}^{2}) {\overline{α^{'}}}^{p} - \frac{\partial p}{\partial q} {\overline{q^{'}}}^{p} - \frac{\partial p}{\partial η} {\overline{η^{'}}}^{p} = {\overline{R_{TDP}}}^{p} . \end{matrix}

Using Eq. (63) to eliminate α^′ and Eqs. (36) and (37) to eliminate η^′ and q^′ leaves

\begin{matrix} (69) & \begin{aligned} \frac{p^{'}}{{\hat{c}}^{2}} & - ρ^{'} + (\frac{1}{c^{2}} - \frac{1}{{\hat{c}}^{2}}) {\overline{{\overline{p^{'}}}^{w}}}^{p} + a Δ t {\overline{ρ_{*} w^{'} \frac{N^{2}}{g}}}^{p} \\ = {\overline{R_{buoy}}}^{p} - \frac{{\overline{R_{\overline{p}}}}^{p}}{{\hat{c}}^{2}} . \end{aligned} \end{matrix}

Next, use Eq. (63) to eliminate α^′ from Eq. (35) and combine the terms involving p^′ into a single operator:

\begin{matrix} (70) & ρ_{*} w^{'} + D_{1} (p^{'}) = \frac{ρ_{*} R_{w} + a Δ t g R_{buoy}}{1 + a^{2} Δ t^{2} N^{2}} \equiv ρ_{*} R_{w D p}, \end{matrix}

where

\begin{matrix} (71) & D_{1} (p^{'}) \equiv \frac{a Δ t}{1 + a^{2} Δ t^{2} N^{2}} (\frac{\partial p^{'}}{\partial z} + \frac{g}{c^{2}} {\overline{p^{'}}}^{w}) . \end{matrix}

Also, use Eq. (69) to eliminate ρ^′ from Eq. (33) and combine the terms involving w^′ into a single operator:

\begin{matrix} (72) & \begin{aligned} \frac{p^{'}}{{\hat{c}}^{2}} & + (\frac{1}{c^{2}} - \frac{1}{{\hat{c}}^{2}}) {\overline{{\overline{p^{'}}}^{w}}}^{p} + a Δ t \nabla_{H} \cdot (ρ_{*} v^{'}) + D_{2} (ρ_{*} w^{'}) \\ = {\overline{R_{buoy}}}^{p} - \frac{{\overline{R_{\overline{p}}}}^{p}}{{\hat{c}}^{2}} + R_{ρ}, \end{aligned} \end{matrix}

where

\begin{matrix} (73) & D_{2} (ρ_{*} w^{'}) \equiv a Δ t (\frac{\partial}{\partial z} ρ_{*} w^{'} + \frac{N^{2}}{g} {\overline{ρ_{*} w^{'}}}^{p}) . \end{matrix}

Finally, eliminate u^′, v^′, and w^′ using Eqs. (34) and (70) to obtain the Helmholtz problem

\begin{matrix} (74) & \begin{aligned} \frac{p^{'}}{{\hat{c}}^{2}} & + (\frac{1}{c^{2}} - \frac{1}{{\hat{c}}^{2}}) {\overline{{\overline{p^{'}}}^{w}}}^{p} - a^{2} Δ t^{2} \nabla_{H} \cdot \nabla_{H} p^{'} - D_{2} D_{1} (p^{'}) \\ = {\overline{R_{buoy}}}^{p} - \frac{{\overline{R_{\overline{p}}}}^{p}}{{\hat{c}}^{2}} + R_{ρ} - a Δ t \nabla_{H} \cdot R_{v} - D_{2} (ρ_{*} R_{w D p}) . \end{aligned} \end{matrix}

The form of the Helmholtz problem is slightly unusual in the appearance of the first two terms rather than a single term $p^{'} / c^{2}$ . As noted above, the ${\overline{{\overline{p^{'}}}^{w}}}^{p}$ term appears because of the dependence of p^′ on ${\overline{α^{'}}}^{p}$ and of α^′ on ${\overline{p^{'}}}^{w}$ . Nevertheless, this slightly different form does not affect the stencil of the discrete Helmholtz operator or the difficulty of solving the Helmholtz equation numerically.

The vertical part of the Helmholtz operator must be modified near the top and bottom boundaries to impose $w^{'} = 0$ there. This modification amounts to

omitting the contributions from levels $1 / 2$ and $N_{z} + 1 / 2$ in 𝒟₂(ρ_*R_wDp) on the right-hand side of Eq. (74) and
omitting the contributions 𝒟₁(p^′) from levels $1 / 2$ and $N_{z} + 1 / 2$ in 𝒟₂𝒟₁(p^′).

The switching of the Lagrange multipliers used to enforce non-negativity of q^l and q^f can significantly change the linearization of the thermodynamics, particularly the value of N². Therefore, the matrix M and the coefficients of the Helmholtz problem are rebuilt at every solver iteration.

2.11 Helmholtz solver and back substitution

The Helmholtz problem is solved using a horizontal multigrid method. Each smoother iteration uses a Jacobi method in the horizontal with a tridiagonal direct vertical solve. Key parameters of the multigrid solver – the depth of the V cycles, the number of smoother iterations on the coarsest grid, and the number of V cycles – are automatically chosen to ensure that the pressure increments are sufficiently accurate (Appendix A3).

Having found the solution for p^′, increments to other variables are found through back substitution:

u^′, v^′, and w^′ are found using Eqs. (34) and (70);
velocity increments are used to compute mass flux increments F^′ and hence ρ^′ using Eq. (33);
velocity and mass flux increments are used to update all transported terms and hence increment q and η; and
α^′ is computed using Eq. (63), and then ( $λ^{l}^{'}, λ^{f}^{'}, q^{l}^{'}, q^{f}^{'}, η^{v}^{'}, η^{l}^{'}, η^{f}^{'}$ ) are computed from q^′, η^′, and α^′ using the coefficients saved after the Gaussian elimination Eq. (55).

Before applying the increments ( $λ^{l}^{'}, λ^{f}^{'}, q^{l}^{'}, q^{f}^{'}, η^{v}^{'}, η^{l}^{'}, η^{f}^{'}$ ), a check is made to see if they would imply breaking any of the constraints q^l≥0, q^f≥0, λ^l≤0, and λ^f≤0. If they would, then, at that grid point, a partial increment is made to these seven variables such that the updated variables sit at the constraint boundary, and the corresponding δⁱ is switched from 0 to 1 or vice versa. All other variables receive their full increments.

After the completion of the solver iterations the model's pressure field is diagnosed using Eqs. (18)–(20). This ensures that the pressure is consistent with the other model thermodynamic fields and enables bit-reproducible restarts without the need to save the pressure field. During early stages of model development, Eqs. (18)–(20) were also used to recompute p at the end of each solver iteration. However, this formulation is adversely impacted by the switching of constraints, as follows. The total entropy always receives its full increment η^′. When constraint switching leads to partial increments η^v^′, η^l^′, and η^f^′, the dry entropy must take up the slack. Consequently, using Eqs. (18)–(20) to update the pressure can result in local pressure changes much larger than the p^′ returned by the Helmholtz problem, leading to large local residuals in the momentum equation. Therefore, in the present formulation, p is updated by adding the increment p^′ returned by the Helmholtz problem at each solver iteration. In the presence of constraint switching, this alternative way of updating the pressure greatly reduces the residuals in the momentum equation but leads to a larger residual $R_{\overline{p}}$ in Eq. (17). Nevertheless, the overall effect is beneficial, accelerating solver convergence.

3 Test cases

3.1 Generating initial data

The model includes general-purpose routines for constructing a horizontally uniform initial state in discrete hydrostatic balance given specified vertical profiles of a temperature variable, a humidity variable, and the horizontal wind components. The temperature variable may be potential temperature, liquid water potential temperature, virtual potential temperature, or temperature itself. The humidity variable may be total specific humidity or relative humidity.

PTerodaC³TILES version 1.0 has routines in place, which can be selected via namelist options, to generate initial data and forcing terms for the following test cases: ATEX (Stevens et al., 2001), ARM (Brown et al., 2002), BOMEX (Siebesma et al., 2003), BUBBLE (a 3D version of Bryan and Fritsch, 2002), CBL (a dry convective boundary layer, Sullivan and Patton, 2011), DYCOMS (Stevens et al., 2005), and a NEUTRAL boundary layer with shear. The user can easily implement new test cases by using existing routines as templates.

PTerodaC³TILES version 1.0 can also generate initial data for the LBA case (Grabowski et al., 2006). However, the current version cannot represent the microphysics and precipitation necessary to simulate this case successfully, and the forcing terms are not yet implemented.

3.2 Output and diagnostics

A variety of diagnostics are computed online and may be selected by the user for output. These include

time series of global diagnostics such as total mass and total water, along with accumulated source terms;
column diagnostics of horizontal means of model fields and derived quantities such as turbulent kinetic energy, vertical fluxes, and cloud fraction;
two-dimensional slices in the x–y, x–z, and y–z planes of the main model fields plus some derived quantities like cloud top height;
diagnostics of quasi-Newton solver convergence; and
diagnostics of quantities potentially related to model stability.

Full details are given in the user manual, available from Zenodo; see the “Code availability” section.

4 Verification

All components of the model have been thoroughly tested during development. This section highlights some aspects that require particularly careful checking or that can appear to work despite errors in formulation or coding.

4.1 Advection

The advection routines have been tested with specified velocity fields, independently of the rest of the model, to verify their overall accuracy, including correct behaviour at vertical and lateral boundaries and, where relevant, to confirm their conservation, consistency, and boundedness properties. It is particularly important that the w-level dual mass budget is satisfied (Sect. 2.2) so that w-level scalars are correctly advected. Conservation, consistency, and boundedness have also been verified in full model simulations to ensure that they are maintained by both the full cascade advection and the cheap advection updates.

4.2 Divergence consistency

It has been verified that the divergent velocity increments discussed in Sect. 2.6 correctly compensate for any discrepancy between the departure volumes computed by SLICE and the trajectory-average divergence. When these compensating increments are not included the model is found to become unstable, as expected theoretically.

4.3 Thermodynamics and linearized thermodynamics

The correctness and consistency of the formulation of thermodynamics in terms of internal energy potentials was verified in standalone code before incorporation in PTerodaC³TILES. The use of switched constraints and partial increments within the quasi-Newton solver allows q^l and q^f to be zero but not negative, as intended. For the PTerodaC³TILES implementation, the correctness of the various test case initial states, constructed to be in hydrostatic balance and thermodynamic equilibrium, and the broadly correct evolution of thermodynamic profiles and clouds provide further verification of the thermodynamics.

The correct linearization of the thermodynamics encoded in the matrix $\tilde{M}$ is critical to the success of the formulation but is fiddly and susceptible to errors. The linearization has been verified by comparing the actual changes in the left-hand sides of Eqs. (6)–(13) with the changes predicted by the linearization when individual thermodynamic variables are perturbed. This testing considered base states with and without liquid and frozen water to ensure that all cases were covered.

4.4 Solver convergence

Good solver convergence is also critical for the success of the formulation. A useful rule of thumb is that the residuals should decrease by roughly an order of magnitude per solver iteration so that only a small number of iterations is needed per time step. However, ensuring correct performance of the solver is far from trivial, as it depends on correct formulation and implementation of the linearization, derivation of the Helmholtz problem, solution of the Helmholtz problem, and back substitution. Therefore, the ability of the solver to correct known errors was directly verified, as follows.

A known solution for the model state at step n+1 is required. This may be a known steady state such as a horizontally uniform state in hydrostatic balance or a more complex three-dimensional state obtained by taking a sufficiently large number of solver iterations. When a small perturbation is made to this state, the solver should be able to fix that perturbation almost completely in one iteration. Testing included perturbing each model state variable in turn and considered perturbations with different spatial structures, including globally uniform, horizontally uniform vertically localized, horizontally localized vertically uniform, and localized at a single point in the domain interior or at a top or bottom boundary.

Figure 5 shows the maximum residuals in the q and u equations and the maximum pressure increment versus iteration number at hour 9 of the ARM case when both the dynamics and moist thermodynamics are very active. For the purpose of calculating these diagnostics, the model was restarted for a single time step, and 12 solver iterations were taken rather than the default 3. There is a very large drop in the maximum q residual between the first and second iterations associated with the cheap transport updates (Sect. 2.6). There is a rather small reduction in the pressure increment between the first and second iterations, probably related to constraint switching in the thermodynamics. Otherwise, all three quantities steadily decrease by nearly an order of magnitude per iteration. Note that the residuals and pressure increments calculated at the fourth iteration are a measure of the errors due to incomplete solver convergence that remain upon completion of the third iteration.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f05

Figure 5Maximum absolute value within the model domain of (a) q equation residual (dimensionless), (b) u equation residual (m s⁻¹), and (c) p^′ (Pa) versus iteration number. These diagnostics were computed for a single time step, restarting from hour 9 of the ARM case and taking 12 solver iterations.

Download

As discussed in Sect. 2.11, when the thermodynamic state switches between absence and presence of condensate, a subset of thermodynamic variables local to the switch receive only partial increments. It is important to understand the extent to which such partial increments adversely affect the solver convergence. The DYCOMS case is especially challenging in this regard. If the cloud deck begins to break up, then holes form in the cloud (e.g. Fig. 12). If the horizontal advective Courant number is greater than 1, then a very large number of grid cells per step switch between absence and presence of liquid water. Moreover, the extremely large gradients in humidity and entropy at cloud top, which get folded into the cloud holes, make the thermodynamic state at the cloud top and cloud hole edges very sensitive to the solver advection updates, so that switching can occur at the second and subsequent solver iterations.

An initial attempt to run the DYCOMS case at $128 \times 128 \times 300$ resolution ( $Δ x = Δ y = 25$ m, Δz=5 m) with N_ℓ=3 solver iterations resulted in failure of the model after about 32 min. Diagnostics indicated that the switching had not completely settled down after three solver iterations. Nevertheless, the number of switches decreases by 2 orders of magnitude per iteration, and increasing the number of iterations to N_ℓ=4 is sufficient for virtually all of the switching to settle down (Table 1), allowing the DYCOMS case to run successfully. Although constraint switching can slow solver convergence, no cases have been encountered in which constraint switching prevents solver convergence.

Table 1Representative number of grid points at which switches occurred at different solver iterations. The numbers were monitored over 10 time steps 1 h into the DYCOMS case. The total number of model grid points was $128 \times 128 \times 300$ .

Download Print Version | Download XLSX

4.5 Stability

The semi-implicit semi-Lagrangian formulation of PTerodaC³TILES should be stable for large acoustic, gravity-wave, and advective Courant numbers. Diagnostics optionally output by the model confirm that the model does run stably with advective Courant numbers greater than 1 and with very large acoustic Courant numbers. For example, in the ARM case presented in Sect. 5.2 the horizontal and vertical acoustic Courant numbers remain around 53 and 87, respectively. The horizontal advective Courant number in the x direction varies in the range 1.5 to 2, while the vertical advective Courant number peaks at about 3. Both the gravity-wave and convective Courant numbers $\sqrt{\max (N^{2})} Δ t$ and $\sqrt{\max (- N^{2})} Δ t$ , respectively, peak at around 0.6. For the DYCOMS case presented in Sect. 5.3, the horizontal and vertical acoustic Courant numbers remain around 49 and 344, respectively. The horizontal advective Courant numbers in both the x and y directions are greater than 1, while the vertical advective Courant number peaks at over 3. The gravity-wave Courant number is about 0.9 while the convective Courant number varies between 0.8 and 1.4.

Because the ILES formulation does not include an eddy-diffusion-based subgrid scheme, the diffusive Courant number is not relevant for model stability.

In all cases the model does become unstable if the time step is chosen too large. Further careful study is needed to understand exactly what limits the model stability. However, based on the theoretical properties of the methods and experience to date, likely candidates for instability mechanisms include the following.

When the deformational Courant number, or, more precisely, the maximum magnitude of the eigenvalues of bΔt∇u, approaches 1, the semi-Lagrangian trajectory calculations become increasingly inaccurate. This might lead to badly distorted departure cells such that the simple small-amplitude advecting velocity correction cannot restore divergence consistency (Sect. 2.6), or there might be other feedbacks via advection that amplify errors. Diagnostics of the components of Δt∇u for the cases presented in Sect. 5 suggest that the model is close to that regime in the surface layer where vertical shear is strongest.
If a²N²Δt² approaches −1, then the factor 1+a²N²Δt² in the denominator of Eqs. (70) and (71) approaches zero and those coefficients blow up. This is a known limitation of the semi-implicit treatment of gravity waves when the stratification becomes unstable (Davies et al., 2005); it can be mitigated by bounding the value of N² that appears in those coefficients. In PTerodaC³TILES no such bound is applied.
Any factors that inhibit convergence of the quasi-Newton solver can mean that the theoretical stability of a semi-implicit, semi-Lagrangian scheme is not attained. This could occur, for example,
- if the change in state over one time step is too large, hence too nonlinear, to be captured by the quasi-Newton linearization;
- if insufficient iterations are taken for constraint switching to settle down (as discussed in Sect. 4.4); and
- if terms omitted from the linearization, such as $u \partial η / \partial x$ , become important.
(Coriolis terms are omitted from the linearization Eqs. (34) and (35), but $| Ω | Δ t$ is unlikely to be large enough, in practice, for this to matter.)

It would be a valuable addition to the model to include mitigation strategies, such as adaptive time stepping, if the mechanisms limiting the model stability could be better understood and quantified.

4.6 Conservation

PTerodaC³TILES is designed to have closed budgets for mass, water, and entropy. This property requires conservative advection of these quantities by both SLICE and the cheap transport updates (Sect. 2.6), as well as careful formulation of surface and interior sources (Sect. 2.7), all taking into account the w-level dual mass budget (Sect. 2.2).

Figure 6 shows time series of various budget quantities for the ARM case. Figure 6a shows that the change in total mass (blue curve) agrees with the accumulated source of total mass, in this case due to the surface moisture flux (blue symbols), while the dry mass is exactly conserved. Figure 6b shows that the change in total water mass (black curve) agrees with the accumulated source of water (black symbols), even while some water condenses into liquid (red curve). Figure 6d shows that the change in total entropy (black curve) agrees with the accumulated source of entropy (black symbols). Figure 6c confirms that the momentum budget is not closed because the semi-Lagrangian advection of momentum is not conservative. Figure 6e shows that the change in total energy (black curve), which averages 432 W m⁻², is slightly less than the accumulated total energy source (black symbols). The difference, shown in Fig. 6f, is due to numerical dissipation of kinetic energy and mixing of water and entropy. This energy loss averages about 12 W m⁻².

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f06

Figure 6Global changes and accumulated sources versus time for key budget quantities for the ARM case. (a) Total mass and dry mass (kg m⁻²). (b) Total water; 100× global liquid water and frozen water (kg m⁻²). (c) u momentum and v momentum (kg m s⁻¹ m⁻²). (d) Entropy (J K⁻¹ m⁻²). (e) Energy (J m⁻²). (f) Numerical energy change, i.e. the actual energy change minus the change due to explicit sources (J m⁻²).

Download

Dissipation of kinetic energy and mixing of constituents and heat should result in a source of entropy that exactly compensates for this energy loss; however, this entropy source is currently neglected. See further discussion in Sect. 6. Because the current formulation assumes thermodynamic equilibrium in each grid cell, entropy sources due to departures from equilibrium (the J^TP term in Eq. 4) are zero.

4.7 Bit reproducibility

PTerodaC³TILES produces bit-reproducible results when restarted from a checkpoint file³. This property is invaluable for development and testing, as well as when rerunning sections of a simulation to obtain additional diagnostics. PTerodaC³TILES also produces bit-identical answers whether run with or without OpenMP shared memory parallelism, verifying the correctness of the parallel implementation.

5 Evaluation

This section presents results from some standard LES test cases to demonstrate the performance of PTerodaC³TILES version 1.0⁴. The same spatial resolution is used as in the original intercomparison articles defining the test case specifications. Although relatively coarse by current-day standards, this facilitates comparison with those published results and helps to highlight any limitations of PTerodaC³TILES version 1.0 that might be less conspicuous at finer resolution.

All results shown in this section used SLICE advection with piecewise constant remapping for density and parabolic spline remapping with a limiter and the Charney–Phillips grid correction for water and entropy. Semi-Lagrangian advection of velocity components used no limiter and the freeslip option for extrapolation near the bottom boundary.

5.1 BOMEX

The BOMEX test case (Siebesma et al., 2003) is based on observations made during the Barbados Oceanographic and Meteorological Experiment. It simulates a scenario of shallow cumulus over the ocean in which large-scale forcing, radiation, and turbulent and convective fluxes maintain a quasi-steady balance.

As in Siebesma et al. (2003), PTerodaC³TILES used a $64 \times 64 \times 75$ grid with $Δ x = Δ y = 100$ m, Δz=40 m. The time step was Δt=10 s and the simulation was run for 6 h.

Figure 7 shows time series of three key quantities: total cloud cover, liquid water path (LWP), and turbulent kinetic energy (TKE). All three time series agree broadly with Fig. 2 of Siebesma et al. (2003). After an initial spin-up during the first hour or so, the total cloud cover and LWP fluctuate but have little trend, while the TKE continues to grow slowly. The PTerodaC³TILES cloud cover is slightly lower and the TKE slightly larger than the ensemble means in Siebesma et al. (2003) but within the typical inter-model spread.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f07

Figure 7Time series of (a) total cloud cover, (b) liquid water path, and (c) turbulent kinetic energy for the BOMEX case.

Download

Figure 8 shows several horizontally averaged profiles from the BOMEX case. Figure 8b–d agree well with the corresponding figures from Siebesma et al. (2003) (their Figs. 6, 3d, and 4a). Figure 8a, e, and f also broadly agree with the corresponding figures from Siebesma et al. (2003) (their Figs. 3c, 4e, and 5a). However, an excessively strong shear layer has formed between model levels 1 and 2, consistent with the idea that the ILES approach poorly represents vertical subgrid transports near a horizontal boundary. This excessively strong shear layer leads to noise in the lowest two to three levels in the profiles of u, momentum flux, and TKE.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f08

Figure 8Vertical profiles of horizontally averaged quantities from the BOMEX case. (a) u and v (m s⁻¹). (b) Cloud fraction (dimensionless). (c) Liquid water q^l (dimensionless). (d) Vertical eddy flux of total water $〈 w^{'} q^{'} 〉$ (m s⁻¹). (e) Vertical eddy fluxes of momentum $〈 w^{'} u^{'} 〉$ and $〈 w^{'} v^{'} 〉$ (m² s⁻²). (f) TKE (m² s⁻²). Panels (a)–(c) are averaged over the last hour of simulation; panels (d)–(f) are averaged over the last 3 h.

Download

On a closely related point, numerical experimentation revealed that the momentum budget and the boundary layer u profile are strongly sensitive to the details of the semi-Lagrangian interpolation scheme for velocity components near the bottom boundary (Sect. 2.6). For example, switching on the limiter for velocity advection or using the noslip $⊔ ⊔$ bottom boundary extrapolation option resulted in a large spurious numerical source of eastward momentum and a significant shift to the right of the boundary layer u profile.

5.2 ARM

The ARM test case (Brown et al., 2002) simulates the diurnal evolution of shallow convection over land, starting from an initially stable and cloud-free boundary layer. The number, size, and depth of clouds evolve during the day, and clouds often overshoot their level of neutral buoyancy.

As in Brown et al. (2002), PTerodaC³TILES used a $96 \times 96 \times 110$ grid with $Δ x = Δ y = 66.7$ m, Δz=40 m. The time step was Δt=10 s and the simulation was run for 14.5 h. Large-scale forcing terms were omitted, since they have only a small effect on the simulation (Brown et al., 2002).

Figure 9 shows a time–height plot of cloud fraction and TKE. The evolution of the cloud fraction, which peaks at a little over 0.15 around 6 or 7 h, as well as the height of cloud base and cloud top, agrees well with Fig. 5 of Brown et al. (2002). The intensity of TKE in the boundary layer grows and then decays in concert with the strength of the surface heat flux, and there is a clear signature of the formation of gravity waves in and above the cloud layer.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f09

Figure 9TKE (shading) and cloud fraction (contours) versus time and height for the ARM case. The contour values are 0.0001, 0.05, 0.1, and 0.15. The peak TKE value is 1.74 m² s⁻².

Download

Figure 10 shows profiles of u and w variances at 3 and 9 h. The peak in w variance after 3 h is somewhat smaller than most ensemble members in Brown et al. (2002) (their Fig. 6), but otherwise these profiles agree well with Brown et al. (2002), including the secondary peak in w variance in the cloud layer at 9 h and the small peak in u variance near the boundary layer top at 3 h, which disappears after clouds form (see discussion in Brown et al., 2002, Sect. 3b).

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f10

Figure 10Vertical profiles of (a) w variance and (b) u variance for the ARM case at 3 and 9 h. The profiles are also averaged in time between minus and plus 30 min of the nominal diagnostic time.

Download

Figure 11 shows snapshots of cloud top height at selected times. The behaviour is consistent with that documented for the ARM case, with many small and shallow clouds at early times, gradually growing in depth, evolving towards fewer, larger, and deeper clouds with smaller total cloud cover at later times. The horizontal motion of the higher cloud tops is almost exactly in the x direction (the direction of the background geostrophic wind), while near cloud base there is a small component of motion in the y direction, leading to a distinct characteristic tilt to the clouds.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f11

Figure 11Snapshots of cloud top height (grey) for the ARM case at 6, 8, and 10 h. For this purpose, grid cells with $q^{l} + q^{f} > 10^{- 5}$ are defined to be cloudy (though q^f≡0 for ARM). The green background indicates no cloud in that column, and the contour interval is 100 m; refer to Fig. 9 for the lowest cloud base and highest cloud top at these times.

Download

5.3 DYCOMS

The DYCOMS test case (Stevens et al., 2005) simulates a very different boundary layer regime from BOMEX and ARM: a nocturnal stratocumulus cloud layer over the ocean. The cloud layer is capped by a very strong and sharp inversion, with q decreasing by a factor of 6 and a jump in potential temperature of more than 9 K over 5 m, the recommended vertical grid spacing. Observations suggest that the cloud cover should be maintained close to 100 %. However, LES often fail to maintain the cloud cover, since excessive mixing across the inversion can lead to evaporative cooling driving cloud-free downdraughts.

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f12

Figure 12Snapshots of cloud top height (pale grey) for the DYCOMS case at 1, 2, and 3 h. The blue background indicates no cloud in that column, and the contour interval is 50 m; refer to Fig. 14 for the lowest cloud base and highest cloud top at these times.

Download

The DYCOMS case is expected to be particularly testing for the PTerodaC³TILES formulation, for several reasons. First, there are no physical parameters that can be adjusted to control the strength of mixing near the inversion, since the mixing is entirely associated with the numerics. There is, however, some sensitivity to the choice of numerical options and parameters, as discussed below. Second, good convergence of the semi-implicit iterative solver depends on having a sufficiently good linearization. The presence of near discontinuities in the distributions of total entropy and total specific humidity mean that the linearized advection terms Eqs. (41) and (42) are necessarily less accurate. The solver convergence is also affected by the switching of thermodynamic constraints (Sects. 2.11 and 4.4). Once cloud-free downdraughts form (Fig. 12) and are advected across the grid with horizontal Courant numbers greater than 1, condensate appears or disappears at large numbers of grid points every step. Moreover, the large gradients in entropy and specific humidity mean that transport increments resulting from small velocity increments at one solver iteration are enough to cause constraint switching at the next solver iteration. Experience to date suggests that more solver iterations per step are required for DYCOMS than for other test cases in order for the constraint switching to settle down (Sect. 4.4).

As in Stevens et al. (2005), PTerodaC³TILES used a $96 \times 96 \times 300$ grid with $Δ x = Δ y = 35 m$ , Δz=5m. The time step was Δt=5s and the simulation was run for 4h.

Figure 13 (compare Stevens et al., 2005, Fig. 2) shows that in the PTerodaC³TILES simulation cloud cover gradually falls to about 40 % while the liquid water path falls to around 0.02 kg m⁻² (about 0.035 kg m⁻² for the ensemble mean in Stevens et al., 2005). The TKE settles down at about 200 kg s⁻², about half of the ensemble mean in Stevens et al. (2005) (noting the different units used). The gradual reduction in cloud cover is also clearly illustrated in Fig. 14. The figure confirms that turbulence grows initially both from the surface (due to the surface sensible heat flux) and from cloud top (due to cloud top radiative cooling).

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f13

Figure 13Time series of (a) total cloud cover, (b) liquid water path, and (c) turbulent kinetic energy for the DYCOMS case.

Download

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f14

Figure 14TKE (shading) and cloud fraction (contours) versus time and height for the DYCOMS case. The contour values are 0.3 to 0.9 in steps of 0.1 and 0.99. The peak TKE value is 1.16 m² s⁻².

Download

The PTerodaC³TILES simulation of cloud in the DYCOMS case appears to be quite sensitive to the choice of numerical options and parameters. For example, when the time step is reduced from 5 to 2 s, the cloud breakup is slowed, leaving 70 % cloud cover after 4 h. On the other hand, when parabolic spline remapping is replaced by piecewise parabolic method remapping for the advection of entropy and water, almost all cloud disappears after 4 h. Further experiments and diagnostics to understand these sensitivities would be valuable.

5.4 Computational cost

While noting the usual caveats that computational costs are sensitive to computing platform, compiler, output files written, and many other factors, it is nevertheless useful to give would-be users an idea of the computational cost of PTerodaC³TILES. Table 2 gives the wall-clock times for the three test cases discussed in this section. The cases were run on a 2022 MacBook Pro with an 8 core M2 chip. The gfortran compiler was used with the −O₃ optimization flag and OpenMP for shared memory parallelism. The parallel speed-up ranged from 574 % for BOMEX to 652 % for DYCOMS.

Table 2Computational cost of PTerodaC³TILES for three standard test cases on a 2022 MacBook Pro laptop.

Download Print Version | Download XLSX

The multigrid elliptic solver and the full transport calculation dominate the cost, accounting for approximately 40 % and 35 % of the total cost, respectively, with some variation from case to case. Creating the thermodynamic subsystem matrix and carrying out the Gaussian elimination account for less than 15 % of the total cost.

The cost of the model scales linearly with the number of time steps and hence almost inversely with the time step size Δt. The scaling is not exactly inverse because the multigrid scheme parameters take account of the acoustic Courant number (Appendix A3). The scaling of the cost with spatial resolution is affected by memory usage as well as floating point operations, so it will be strongly system dependent.

6 Conclusions and discussion

It has been demonstrated that a consistent treatment of moist thermodynamics, expressed in terms of internal energy potentials, can be incorporated, without excessive computational expense, in a three-dimensional computational fluid dynamics code suitable for the study of atmospheric boundary layers and shallow convection. In the current implementation the moist thermodynamics is fully incorporated within the dynamical core rather than treated as separate physics source terms.

The iterative solver for the semi-implicit time integration scheme requires a linearization of the thermodynamics, with elimination of unknowns to leave a typical Helmholtz problem for the pressure increment. The moderate sparsity and fixed sparsity pattern of the Jacobian matrix of the (w-level) thermodynamic subsystem are exploited to reduce the cost of the elimination.

The use of internal energy potentials has several advantages over the Gibbs function approach used by Thuburn (2017 b) (see Sect. 1). However, one disadvantage of the internal energy potential approach is that it does not permit seamlessly switching to a (quasi-)Boussinesq equation of state since e_α terms (used throughout the algorithm) would become undefined. A nearly Boussinesq fluid could be simulated by making the sound speed extremely large. Alternatively, if a Boussinesq option is a requirement, then the above difficulty could be avoided by using specific enthalpies as the potentials in formulating the thermodynamics (Chris Eldred, personal communication, 2024).

The numerical methods used by PTerodaC³TILES are more typical of those used in global weather and climate models than traditional LES models. The semi-implicit semi-Lagrangian scheme permits time steps significantly larger than are commonly used in traditional LES models at similar resolution. Experience to date, consistent with the theoretical properties of the numerical methods, suggests that the deformational Courant number is most often the factor limiting the maximum stable time step. However, there is no simple, easily monitored stability criterion (computing the eigenvalues of Δt∇u at every grid point and every step would be expensive), so a degree of experimentation is required to find a suitable time step for each simulation. An important caveat is that stability does not imply accuracy. For example, the DYCOMS case (Sect. 5.3) shows significant sensitivity to the size of the time step. Further investigation is required to quantify the extent to which accuracy declines as we push the time step towards the stability limit and which flow regimes, like DYCOMS, are especially sensitive to the time step.

For the BOMEX, ARM, and DYCOMS cases presented here, and other cases not discussed, PTerodaC³TILES can produce plausible simulations of standard LES test cases, comparable to other model results in the literature. These results are encouraging for the ILES approach and, more widely, for the use of global models at sub-kilometre resolution. At the same time, these results have given some useful initial indications of the limitations of the ILES approach, as well as highlighting areas in need of further investigation.

Most notably, ILES produces weak vertical fluxes of momentum and scalars near the bottom boundary. These weak fluxes lead to excessive vertical gradients, which, in turn, result in further errors. For example, in BOMEX the spuriously strong vertical shear exacerbates the conservation errors of the cubic semi-Lagrangian advection of momentum and makes the results very sensitive to the details of the semi-Lagrangian interpolation in the lowest levels. In an ARM simulation with finer vertical resolution near the surface (not shown), the spuriously weak mixing near the surface allows a thin layer of fog to form briefly during the first hour. Some initial experimentation showed benefits in distributing the convergence of the surface momentum flux over several layers near the surface rather than just a single layer. In future work it is planned to investigate this idea further, applied to scalars as well as momentum, and to try to determine how the optimal distribution depends on the flow regime as well as numerical factors such as the grid resolution and anisotropy.

Another broad area in need of further investigation is the sensitivity to numerical methods and parameters. The sensitivity of the DYCOMS case to the time step and the advection remapping scheme has been mentioned already. Other potentially significant factors include the grid isotropy, the use of different advection options such as the use of limiters and the Charney–Phillips grid correction (Eq. 27), and the modifications to momentum interpolation near the bottom boundary.

Taking inspiration from global models that predict potential temperature, PTerodaC³TILES has a closed budget for entropy rather than energy. The entropy source that should be associated with numerical mixing of scalars and dissipation of kinetic energy is neglected, resulting in a small but systematic energy loss (Fig. 6). If desired, the global numerical energy loss could easily be diagnosed and returned as an entropy source to close the energy budget. Initial attempts to diagnose the local numerical energy loss (not shown) suggest that the calculation is subtle and far from trivial and might not even be well defined, in large part because of the Charney–Phillips vertical grid staggering. Further work will be needed to diagnose the local numerical energy loss, if, indeed, it is possible at all. A strong motivation to continue these attempts is that such an estimate of energy dissipation would be a key input to a stochastic parameterization of backscatter (e.g. Mason and Thomson, 1992; Brown et al., 1994), which would be a useful extension to the model's functionality.

Two closely related priorities for future work are the inclusion of a simple microphysics scheme with precipitation and the inclusion of thermodynamic nonequilibrium effects. These developments will enhance the capabilities of PTerodaC³TILES, allowing it to be applied to a wider range of cases. Equally importantly, they will test whether the thermodynamic potential approach can be applied straightforwardly and efficiently to more complex physical processes, beyond the coupling of the equation of state to the dynamical core.

Appendix A

A1 Stretched vertical grid

When a stretched vertical grid is chosen, the height of the model level with index k is given by

\begin{matrix} (A1) & z_{k} = a_{(z)} \log (1 + d_{(z)} b_{(z)}^{k}) + c_{(z)}, \end{matrix}

where k is an integer for p levels and an integer plus $1 / 2$ for w levels. The parameters in Eq. (A1) are set so that the grid spacing approaches a stretching factor b_(z) per level for small k and a uniform grid spacing u_(z) for large k, with b_(z) and u_(z) specified by the user when creating initial data. In terms of N_z, the number of p levels, and $D_{z} = z_{N_{z} + 1 / 2} - z_{1 / 2}$ , the domain depth with $z_{1 / 2} = 0$ the height of the bottom boundary, the parameters are given by

\begin{array}{l} (A2) & a_{(z)} = u_{(z)} / \log b_{(z)}, \\ (A3) & d_{(z)} = \frac{1 - \exp (D_{z} / a_{(z)})}{b_{(z)}^{1 / 2} (b_{(z)}^{N_{z}} - \exp (D_{z} / a_{(z)}))}, \\ (A4) & c_{(z)} = D_{z} - a_{(z)} \log (1 + d_{(z)} b_{(z)}^{N_{z} + 1 / 2}) . \end{array}

Importantly, Eq. (A1) has the virtue of being invertible, allowing the cell index k to be determined for any height z without the need for expensive searching in the semi-Lagrangian departure-point calculations.

A2 Expressions for internal energy and related quantities

The expressions used for specific internal energy of dry air, water vapour, liquid water, and frozen water are

\begin{array}{l} (A5) & e^{d} (α^{d}, η^{d}) = C_{v}^{d} T_{0} \exp (\frac{η^{d} - C_{p}^{d}}{C_{v}^{d}}) {(\frac{α_{0}^{d}}{α^{d}})}^{R^{d} / C_{v}^{d}}, \\ (A6) & e^{v} (α^{v}, η^{v}) = C_{v}^{v} T_{0} \exp (\frac{η^{v} - C_{p}^{v}}{C_{v}^{v}}) {(\frac{α_{0}^{v}}{α^{v}})}^{R^{v} / C_{v}^{v}} + L_{00}^{s}, \end{array}

\begin{array}{l} (A7) & e^{l} (η^{l}) = C^{l} T_{0} \exp (\frac{η^{l} - C^{l}}{C^{l}} + \frac{L_{00}^{v} - α^{l} p_{0}^{sat}}{C^{l} T_{0}}) + L_{00}^{f}, \\ (A8) & e^{f} (η^{f}) = C^{f} T_{0} \exp (\frac{η^{f} - C^{f}}{C^{f}} + \frac{L_{00}^{s} - α^{f} p_{0}^{sat}}{C^{f} T_{0}}) . \end{array}

These expressions differ from those used by Bowen and Thuburn (2022 a) in that the constant $L_{00}^{s} / T_{0}$ has been subtracted from the specific entropy of all water phases.

For $i = d, v, l, f$ , the species temperature is given by

\begin{matrix} (A9) & T^{i} = e_{η}^{i} . \end{matrix}

For $i = d, v$ , the species partial pressure is given by

\begin{matrix} (A10) & p^{i} = - e_{α}^{i} . \end{matrix}

The Gibbs function for water vapour is given by

\begin{matrix} (A11) & g^{v} = e^{v} + α^{v} p^{v} - η^{v} T^{v}, \end{matrix}

while for $i = l, f$ ,

\begin{matrix} (A12) & g^{i} = e^{i} + α^{i} p - η^{i} T^{i}, \end{matrix}

with $p = p^{d} + p^{v}$ . (The Gibbs function for dry air is not needed.)

A3 Parameters for multigrid scheme

The tuning of parameters in numerical solvers for elliptic problems is often empirical and case-dependent. Here, both the Helmholtz problem and the properties of the multigrid solver are well understood, which allows some key parameters in the multigrid scheme to be set automatically.

Let $C_{ac}^{x} = c Δ t / Δ x$ and $C_{ac}^{y} = c Δ t / Δ y$ be the horizontal acoustic wave Courant numbers in the x and y directions, respectively, appropriate to whichever grid in the hierarchy is under consideration. Let $C_{ac} = \max (C_{ac}^{x}, C_{ac}^{y})$ . In the following calculations the temporal off-centring parameter a is effectively set to 1.

Each smoother iteration uses a direct solve in the vertical direction; thus, we must consider the action of the smoother on errors of different horizontal scales. The error amplification factor for a uniform error for one Jacobi smoother iteration is

\begin{matrix} (A13) & A_{ε} = 1 - \frac{μ}{1 + 2 ({(C_{ac}^{x})}^{2} + {(C_{ac}^{y})}^{2})}, \end{matrix}

where μ is the under-relaxation parameter, while the error amplification factor for a checkerboard pattern error is

\begin{matrix} (A14) & A_{ε} = 1 - μ - \frac{2 μ ({(C_{ac}^{x})}^{2} + {(C_{ac}^{y})}^{2})}{1 + 2 ({(C_{ac}^{x})}^{2} + {(C_{ac}^{y})}^{2})} . \end{matrix}

Under-relaxation parameter. The under-relaxation parameter is set to μ=0.8. This provides an appropriate compromise between the damping of large-scale errors on the coarsest grid, where C_ac≈1 implies A_ε≈0.85, and the damping of grid-scale errors on the finest grid, where C_ac≫1 implies A_ε≈0.6.
Depth of V cycles. As the horizontal grid is coarsened, the horizontal acoustic wave Courant number C_ac decreases, the horizontal part of the Helmholtz problem becomes more diagonally dominant, and the smoother iterations damp the error more and more quickly. Once the horizontal acoustic Courant number reaches about 1, there is little to be gained by coarsening the grid further. Thus, the desired V-cycle depth is set to the smallest depth needed to reduce the horizontal acoustic Courant number below 1. The maximum possible V-cycle depth permitted by the number of grid points in the x and y directions is also computed. The actual V-cycle depth is set equal to the minimum of the desired depth and the maximum permitted depth. If the maximum permitted depth is smaller than the desired depth, then a warning message printed and the scheme attempts to compensate by increasing the number of coarsest-grid smoother iterations.
Number of coarsest-grid smoother iterations. The rate at which large-scale errors are damped is determined by the damping of uniform errors on the coarsest grid. Using Eq. (A13) to estimate the large-scale error damping rate, the number of smoother iterations on the coarsest grid is chosen so that a uniform error is damped by a factor $1 / 10$ on each V cycle.
Number of V cycles. To ensure mass conservation, the back substitution computes density increments from the divergence of mass flux increments, with velocity increments computed from the gradient of pressure increments. As a consequence of the gradient and divergence operations, any grid-scale errors in the pressure increments due to imperfect convergence of the multigrid solver get amplified by a factor equal to the (fine-grid) horizontal acoustic Courant number squared. Thus, to ensure good convergence of the quasi-Newton solver, any grid-scale errors in the pressure increments computed by the multigrid solver must be much smaller, by a factor an order of magnitude smaller than $1 / C_{ac}^{2}$ , than the pressure increments themselves. Using the estimate from Eq. (A14) that $| A_{ε} | \approx | 1 - 2 μ |$ for large C_ac, we can estimate the total number of fine-grid smoother iterations required and hence the number of V cycles.

A4 Alternative formulation of Monin–Obukhov theory for surface momentum flux

Monin–Obukhov similarity theory gives the flow speed U(z) at height z in terms of the friction velocity U_* (e.g. Stull, 1988):

\begin{matrix} (A15) & U (z) = \frac{U_{*}}{κ} \{\log (z / z_{0}) + Ψ (ζ, ζ_{0})\}, \end{matrix}

where κ is the von Kármán constant; $ζ = z / L$ is a non-dimensional height; $ζ_{0} = z_{0} / L$ , where z₀ is the surface roughness length; $L = - U_{*}^{3} / F^{b}$ is the Obukhov length with F^b the surface buoyancy flux; and Ψ is an integrated stability function, discussed below. As written, Eq. (A15) must be solved iteratively to obtain U_*, given U(z) at some z. Moreover, in the stable case (F^b<0) with light winds Monin–Obukhov similarity theory is valid only for sufficiently small z and may be outside its range of validity at the height of the lowest model level; in that case, with commonly used expressions for Ψ, the solution for U_* might not be unique or might not exist (e.g. Fig. A1).

https://gmd.copernicus.org/articles/18/3331/2025/gmd-18-3331-2025-f15

Figure A1Dimensionless friction velocity ${\hat{U}}_{*} = ζ^{1 / 3}$ versus dimensionless wind speed $\hat{U} = 1 / \hat{s}$ at height z in the stable regime given by Monin–Obukhov theory with the Businger–Dyer stability function (Dyer, 1974, asterisks), with the Cheng and Brutsaert (2005) stability function (open circles), with the modified Cheng and Brutsaert stability function Eq. (A26) (filled circles), and with the fit Eq. (A29) (thick curve) (a) for $r = z_{0} / z = 0.001$ and (b) for $r = z_{0} / z = 0.1$ . Note that with the Businger–Dyer stability function there may be zero or two solutions for ${\hat{U}}_{*}$ , while with both the original and modified Cheng and Brutsaert stability functions there may be multiple solutions for ${\hat{U}}_{*}$ .

Download

Here a slight modification of the usual Monin–Obukhov similarity theory is presented that ensures the existence of a unique solution for U_* even when Monin–Obukhov similarity theory is outside its range of validity. (The modification does not, of course, extend that range of validity.) We focus on the case in which F^b is known and the task is to determine U_*. The case in which F^b is also to be determined is more complicated (Bull and Derbyshire, 1990).

Since the expressions used for stability functions are typically derived by curve fitting to observations or simulations, it seems reasonable to attempt to provide an inverse to Eq. (A15) with aid of some curve fitting. The fitting can be done in such a way as to guarantee the existence of a unique solution for U_* satisfying the reasonable requirement $U_{*} \to 0$ as U(z)→0.

It is convenient to define $r = z_{0} / z = ζ_{0} / ζ$ and to introduce a non-dimensional inverse flow speed,

\begin{matrix} (A16) & \hat{s} = \frac{{(- F^{b} z)}^{1 / 3}}{κ (U (z_{1}) + ε)}, \end{matrix}

where ε is a small safety parameter to prevent infinite $\hat{s}$ as U(z₁)→0. Then Eq. (A15) becomes

\begin{matrix} (A17) & \hat{s} = \frac{ζ^{1 / 3}}{{- \log r + Ψ (ζ, r ζ)}} . \end{matrix}

To parameterize the surface momentum flux we need to be able to compute $ζ = ζ (\hat{s}, r)$ ; ζ is a function of the two parameters $\hat{s}$ and r.

To construct a functional fit for $ζ (\hat{s}, r)$ , proceed as follows. For both the stable case and the unstable case, carry out an asymptotic expansion of Eq. (A17) for the limits of small $| \hat{s} |$ and large $| \hat{s} |$ and rearrange to obtain the limiting expressions for $ζ (\hat{s}, r)$ . Then seek a simple expression that agrees with the asymptotic expressions in the two limits.

For the unstable case we begin with the integrated stability function given by Benoit (1977):

\begin{matrix} (A18) & \begin{aligned} Ψ (ζ, r ζ) & = \log \{\frac{(x_{0}^{2} + 1) {(x_{0} + 1)}^{2}}{(x^{2} + 1) (x + 1)^{2}}\} \\ + 2 (\arctan (x) - \arctan (x_{0})), \end{aligned} \end{matrix}

where

\begin{matrix} (A19) & x = (1 - 15 ζ)^{1 / 4}, x_{0} = (1 - 15 r ζ)^{1 / 4} . \end{matrix}

We find

\begin{matrix} (A20) & ζ^{1 / 3} \sim \hat{s} \{- \log r - \frac{15}{4} (1 - r) (\log r)^{3} {\hat{s}}^{3}\} as \hat{s} \to 0 \end{matrix}

and

\begin{matrix} (A21) & ζ^{1 / 3} \sim \hat{s} {\{- \frac{1}{15} {(4 (r^{- 1 / 4} - 1))}^{4} {\hat{s}}^{- 3}\}}^{1 / 7} as \hat{s} \to \infty . \end{matrix}

These asymptotic limits are captured by the functional fit

\begin{matrix} (A22) & ζ^{1 / 3} = {(\tilde{a} + \tilde{b} {(\tilde{c} + (- \hat{s})^{\tilde{p}})}^{\tilde{q}})}^{\tilde{r}}, \end{matrix}

with the parameters given by

\begin{matrix} (A23) & \tilde{p} = 3; \tilde{q} = 2 / 7; \tilde{r} = 3 / (7 \tilde{p} \tilde{q}); \\ (A24) & \begin{aligned} \tilde{A} = - \log r; & \tilde{B} = \frac{15}{4} (1 - r) (\log r)^{3}; \\ \tilde{C} = {\{\frac{1}{15} {(4 (r^{- 1 / 4} - 1))}^{4}\}}^{1 / 7}; \end{aligned} \\ (A25) & \tilde{a} = {\tilde{A}}^{1 / \tilde{r}} - \tilde{b} {\tilde{c}}^{\tilde{q}}; \tilde{b} = {\tilde{C}}^{1 / \tilde{r}}; \tilde{c} = {(\frac{\tilde{B} {\tilde{A}}^{(1 - \tilde{r}) / \tilde{r}}}{\tilde{b} \tilde{q} \tilde{r}})}^{1 / (\tilde{q} - 1)} . \end{matrix}

For the stable case we begin with a modification of the integrated stability function given by Cheng and Brutsaert (2005):

\begin{matrix} (A26) & Ψ (ζ, r ζ) = a \log (\frac{ζ + {(c^{b} + ζ^{b})}^{1 / b}}{ζ_{0} + {(c^{b} + ζ_{0}^{b})}^{1 / b}}), \end{matrix}

with a=3, b=2.5, and c=0.5. (The original Cheng and Brutsaert (2005) scheme corresponds to a=6.1, b=2.5, and c=1. Figure A1 shows the effect of this modification.) We find

\begin{matrix} (A27) & ζ^{1 / 3} \sim \hat{s} \{- \log r + \frac{a}{c} (1 - r) (- \log r)^{3} {\hat{s}}^{3}\} as \hat{s} \to 0 \end{matrix}

and

\begin{matrix} (A28) & ζ^{1 / 3} \sim \hat{s} {- (1 + a) \log r} as \hat{s} \to \infty . \end{matrix}

These asymptotic limits are captured by the functional fit

\begin{matrix} (A29) & ζ^{1 / 3} = \tilde{c} + {(\tilde{a} + \tilde{b} \hat{s})^{\tilde{p}})}^{\tilde{q}}, \end{matrix}

with the parameters given by

\begin{matrix} (A30) & \tilde{p} = 3; \tilde{q} = - 3; \\ (A31) & \begin{aligned} \tilde{A} = - \log r; & \tilde{B} = \frac{a}{c} (1 - r) (- \log r)^{3}; \\ \tilde{C} = - (1 + a) \log r; \end{aligned} \\ (A32) & \tilde{a} = (\tilde{A} - \tilde{c})^{1 / \tilde{q}}; \tilde{b} = \tilde{B} {\tilde{a}}^{1 / \tilde{q}} / \tilde{q}; \tilde{c} = \tilde{C} . \end{matrix}

Code availability

The model code used to produce the results in this paper (PTerodaC³TILES 1.0, PTerodaC³TILES 1.2), an example namelist file, example plotting routines, and the user manual are available from Zenodo: https://doi.org/10.5281/zenodo.13899066 (Thuburn, 2025). The code is made available under the MIT licence.

The namelist files needed to create initial data for the BOMEX, ARM, and DYCOMS cases and to reproduce the simulations presented here, along with the plotting routines used to produce Figs. 5 to 14, are provided in the Supplement.

Data availability

No external data sets were used in this article.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gmd-18-3331-2025-supplement.

Author contributions

Apart from the implementation of the dry CBL and NEUTRAL cases noted below, all work presented was carried out by the author.

Competing interests

The author has declared that there are no competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The dry CBL and NEUTRAL cases were implemented by Georgios Efstathiou and Yuhang Tong, respectively. I am grateful to two reviewers whose thoughtful questions and comments helped to improve the manuscript.

Review statement

This paper was edited by Mohamed Salim and reviewed by Radouan Boukharfane and one anonymous referee.

References

Arakawa, A. and Lamb, V. R.: Computational design of the basic dynamical processes of the UCLA general circulation model, in: General Circulation Models of the Atmosphere, vol. 17 of Methods in Computational Physics, edited by: Chang, J., 173–265, Academic Press, ISBN 978-0124608177, 1977. a

Bartello, P. and Thomas, S. J.: The cost-effectiveness of semi-Lagrangian schemes, Mon. Weather Rev., 124, 2883–2897, 1996. a

Bendall, T., Wood, N., Thuburn, J., and Cotter, C. J.: A solution to the trilemma of the moist Charney-Phillips staggering, Q. J. Roy. Meteorol. Soc., 149, 262–276, 2023. a

Benoit, R.: On the integral of the surface layer profile-gradient functions, J. Appl. Meteorol., 16, 859–860, 1977. a

Bowen, P. and Thuburn, J.: Consistent and flexible thermodynamics in atmospheric models using internal energy as a thermodynamic potential. Part I: Equilibrium regime, Q. J. Roy. Meteorol. Soc., 148, 3730–3755, 2022a. a, b, c, d, e, f, g, h, i

Bowen, P. and Thuburn, J.: Consistent and flexible thermodynamics in atmospheric models using internal energy as a thermodynamic potential. Part II: Non-equilibrium regime, Q. J. Roy. Meteorol Soc., 148, 3540–3565, 2022b. a, b, c, d, e, f, g

Brown, A. R., Derbyshire, S. H., and Mason, P. J.: Large-eddy simulation of stable atmospheric boundary layers with a revised stochastic subgrid model, Q. J. Roy. Meteorol. Soc., 120, 1485–1512, 1994. a

Brown, A. R., MacVean, M. K., and Mason, P. J.: The effects of numerical dissipation in Large Eddy Simulations, J. Atmos. Sci., 57, 3337–3348, 2000. a

Brown, A. R., Cederwall, R. T., Chlond, A., Duynkerke, P. G., Golaz, J.-C., Khairoutdinov, M., Lewellen, D. C., Lock, A. P., MacVean, M. K., Moeng, C.-H., Neggers, R. A. J., Siebesma, A. P., and Stevens, B.: Large-eddy simulation of the diurnal cycle of shallow cumulus convection over land, Q. J. Roy. Meteorol. Soc., 128, 1075–1093, 2002. a, b, c, d, e, f, g, h

Bryan, G. H. and Fritsch, J. M.: A benchmark simulation for moist nonhydrostatic numerical models, Mon. Weather Rev., 130, 2917–2928, 2002. a

Bull, J. M. and Derbyshire, S. H.: Numerical solution of the surface layer equations, Turbulence and Diffusion Note No. 197, Met Office, 1990. a

Charney, J. G. and Phillips, N. A.: Numerical integration of the quasi-geostrophic equations for barotropic and simple baroclinic flow, J. Meteorol., 10, 71–99, 1953. a

Cheng, Y. and Brutsaert, W.: Flux-profile relationships for wind speed and temperature in the stable atmospheric boundary layer, Bound.-Lay. Meteorol., 114, 519–538, 2005. a, b, c

Colella, P. and Woodward, P.: The piecewise parabolic method (PPM) for gas-dynamical simulations, J. Comput. Phys., 54, 174–201, 1984. a

Davies, T., Cullen, M. J. P., Malcolm, A. J., Mawson, M. H., Staniforth, A., White, A. A., and Wood, N.: A new dynamical core for the Met Office's global and regional modelling of the atmosphere, Q. J. Roy. Meteorol. Soc., 131, 1759–1782, 2005. a

de Groot, S. R. and Mazur, P.: Non-equilibrium thermodynamics, Dover Publications, ISBN 13:978-0486153506, 1984. a

Dyer, A. J.: A review of flux-profile relationships, Bound.-Lay. Meteorol., 7, 363–372, 1974. a

ECMWF: IFS documentation – Cy48r1. Part III: Dynamics and numerical procedures, https://doi.org/10.21957/26f0ad3473, 2023. a

Eldred, C., Taylor, M., and Guba, O.: Thermodynamically consistent versions of approximations used in modelling moist air, Q. J. Roy. Meteorol. Soc., 148, 3184–3210, 2022. a

Grabowski, W. W., Bechtold, P., Cheng, A., Forbes, R., Halliwell, C., Khairoutdinov, M., Lang, S., Nasuno, T., Petch, J., Tao, W.-K., Wong, R., Wu, X., and Xu, K.-M.: Daytime convective development over land: A model intercomparison based on LBA observations, Q. J. Roy. Meteorol. Soc., 132, 317–344, 2006. a

Grinstein, F. F., Margolin, L. G., and Rider, W. J.: Implicit Large Eddy Simulation: Computing Turbulent Fluid Dynamics, Cambridge University Press, ISBN 978-0521869829, 2007. a

Hohenegger, C., Korn, P., Linardakis, L., Redler, R., Schnur, R., Adamidis, P., Bao, J., Bastin, S., Behravesh, M., Bergemann, M., Biercamp, J., Bockelmann, H., Brokopf, R., Brüggemann, N., Casaroli, L., Chegini, F., Datseris, G., Esch, M., George, G., Giorgetta, M., Gutjahr, O., Haak, H., Hanke, M., Ilyina, T., Jahns, T., Jungclaus, J., Kern, M., Klocke, D., Kluft, L., Kölling, T., Kornblueh, L., Kosukhin, S., Kroll, C., Lee, J., Mauritsen, T., Mehlmann, C., Mieslinger, T., Naumann, A. K., Paccini, L., Peinado, A., Praturi, D. S., Putrasahan, D., Rast, S., Riddick, T., Roeber, N., Schmidt, H., Schulzweida, U., Schütte, F., Segura, H., Shevchenko, R., Singh, V., Specht, M., Stephan, C. C., von Storch, J.-S., Vogel, R., Wengel, C., Winkler, M., Ziemen, F., Marotzke, J., and Stevens, B.: ICON-Sapphire: simulating the components of the Earth system and their interactions at kilometer and subkilometer scales, Geosci. Model Dev., 16, 779–811, https://doi.org/10.5194/gmd-16-779-2023, 2023. a

IOC, SCOR, and IAPSO: The international thermodynamic equation of seawater – 2010: Calculation and use of thermodynamic properties, Manuals and Guides No. 56, Intergovernmental Oceanographic Commission, http://www.TEOS-10.org (last access: 3 June 2025), 2010. a

Konor, C. S. and Arakawa, A.: Choice of a vertical grid in incorporating condensation heating into an isentropic vertical coordinate model, Mon. Weather Rev., 128, 3901–3910, 2000. a

Lin, S.-J.: A “Vertically Lagrangian” finite-volume dynamical core for global models, Mon. Weather Rev., 132, 2293–2307, 2004. a

Margolin, L. G., Smolarkiewicz, P. K., and Sorbjan, Z.: Large-eddy simulations of convective boundary layers using nonoscillatory differencing, Physica D, 133, 390–397, 1999. a

Margolin, L. G., Rider, W. J., and Grinstein, F. F.: Modeling turbulent flow with implicit LES, J. Turbulence, 7, n15, https://doi.org/10.1080/14685240500331595, 2006. a

Mason, P. J. and Thomson, D. J.: Stochastic backscatter in large-eddy simulations of boundary layers, J. Fluid Mech., 242, 51–78, 1992. a

Melvin, T., Shipway, B., Wood, N., Benacchio, T., Bendall, T., Boutle, I., Brown, A., Johnson, C., Kent, J., Pring, S., Smith, C., Zerroukat, M., Cotter, C., and Thuburn, J.: A mixed finite-element, finite-volume, semi-implicit discretisation for atmospheric dynamics: Spherical geometry, Q. J. Roy. Meteorol. Soc., 150, 4252–426, 2024. a, b

Nair, R. D., Scroggs, J. S., and Semazzi, F. H. M.: Efficient conservative global transport schemes for climate and atmospheric chemistry models, Mon. Weather Rev., 130, 2059–2073, 2002. a

Nocedal, J. and Wright, S. J.: Numerical Optimization, Springer, New York, ISBN 978-0387303031, 2006. a

Pressel, K. G., Mishra, S., Schneider, T., Kaul, C. M., and Tan, Z.: Numerics and subgrid-scale modeling in large eddy simulations of stratocumulus clouds, J. Adv. Model. Earth Syst., 9, 1342–1365, 2017. a

Purser, R. J. and Leslie, L. M.: An efficient interpolation procedure for high-order three-dimensional semi-Lagrangian models, Mon. Weather Rev., 119, 2492–2498, 1991. a, b

Satoh, M., Matsuno, T., Tomita, H., Miura, H., Nasuno, T., and Iga, S.: Nonhydrostatic icosahedral atmospheric model (NICAM) for global cloud resolving simulations, J. Comput. Phys., 227, 3486–3514, 2008. a

Siebesma, A. P., Bretherton, C. S., Brown, A., Chlond, A., Cuxart, J., , Duynkerke, P. G., Jiang, H., Khairoutdinov, M., Lewellen, D., Moeng, C.-H., Sanchez, E., Stevens, B., and Stevens, D. E.: A Large Eddy Simulation intercomparison study of shallow cumulus convection, J. Atmos. Sci., 60, 1201–1219, 2003. a, b, c, d, e, f, g, h

Smolarkiewicz, P. K. and Prusa, J. M.: VLES modeling of geophysical fluids with nonoscillatory forward-in-time schemes, Int. J. Num. Meth. Fluids, 39, 799–819, 2002. a

Souza, A. N., He, J., Bischoff, T., Waruszewski, M., Novak, L., Barra, V., Gibson, T., Srdhar, A., Kandala, S., Byrne, S., Wilcox, L. C., Kozdon, J., Giraldo, F. X., Knoth, O., Marshall, J., Ferrari, R., and Schneider, T.: The flux-differencing discontinuous Galerkin method applied to an idealized fully compressible nonhydrostatic dry atmosphere, J. Adv. Model. Earth Syst., 15, e2022MS003527, https://doi.org/10.1029/2022MS003527, 2023. a

Staniforth, A.: Global Atmospheric and Oceanic Modelling: Fundamental Equations, Cambridge University Press, ISBN 978-1108838337, 2022. a

Stevens, B., Ackerman, A. S., Albrecht, B. A., Brown, A. R., Chlond, A., Cuxart, J., Duynkerke, P. G., Lewellen, D. C., MacVean, M. K., Neggers, R. A. J., Sánchez, E., Siebesma, A. P., and Stevens, D. E.: Simulations of trade wind cumuli under a strong inversion, J. Atmos. Sci., 58, 1870–1891, 2001. a

Stevens, B., Moeng, C.-H., Ackerman, A. S., Bretherton, C. S., Chlond, A., de Roode, S., Edwards, J., Golaz, J.-C., Jiang, H., Khairoutdinov, M., Kirkpatrick, M. P., Lewellen, D. C., Lock, A., Müller, F., Stevens, D. E., Whelan, E., and Zhu, P.: Evaluation of Large-Eddy Simulations via observations of nocturnal marine stratocumulus, Mon. Weather Rev., 133, 1443–1462, 2005. a, b, c, d, e, f, g

Stevens, B., Satoh, M., Auger, L., Biercamp, J., Bretherton, C. S., Chen, X., Düben, P., Judt, F., Khairoutdinov, M., Klocke, D., Kodama, C., Kornblueh, L., Lin, S.-J., Neumann, P., Putman, W. M., Röber, N., Shibuya, R., Vanniere, B., Vidale, P. L., Wedi, N., and Zhou, L.: DYAMOND: the DYnamics of the Atmospheric general circulation Modeled On Non-hydrostatic Domains, Prog. Earth Planet. Sci., 6, 61, https://doi.org/10.1186/s40645-019-0304-z, 2019. a

Stull, R. B.: An Introduction to Boundary Layer Meteorology, Kluwer Academic, ISBN 978-9027727695, 1988. a

Sullivan, P. and Patton, E. G.: The effect of mesh resolution on convective boundary layer statistics and structures generated by Large-Eddy Simulation, J. Atmos. Sci., 68, 2395–2415, 2011. a

Temperton, C., Hortal, M., and Simmons, A.: A two-time-level semi-Lagrangian global spectral model, Q. J. Roy. Meteorol. Soc., 127, 111–127, 2001. a

Thuburn, J.: Vertical discretizations giving optimal representation of normal modes: General equations of state, Q. J. Roy. Meteorol. Soc., 143, 1714–1720, 2017a. a

Thuburn, J.: Use of the Gibbs thermodynamic potential to express the equation of state in atmospheric models, Q. J. Roy. Meteorol. Soc., 143, 1185–1196, 2017b. a, b, c, d, e, f, g, h

Thuburn, J.: Numerical entropy conservation without sacrificing Charney-Phillips grid optimal wave propagation, Q. J. Roy. Meteorol. Soc., 148, 2755–2768, 2022. a, b, c, d, e

Thuburn, J.: Consistent, conservative, and efficient advection updates for iterative-implicit atmospheric solvers, Q. J. Roy. Meteorol. Soc., 150, 2522–2532, 2024. a, b, c, d

Thuburn, J.: PTerodaC³TILES version 1.2: Potential based Thermodynamics with Consistent Conservative Cascade Transport for Implicit Large Eddy Simulation:, Zenodo [code], https://doi.org/10.5281/zenodo.13899066, 2025. a

Thuburn, J., Zerroukat, M., Wood, N., and Staniforth, A.: Coupling a mass-conserving semi-Lagrangian scheme (SLICE) to a semi-implicit discretisation of the shallow-water equations: Minimizing the dependence on a reference atmosphere, Q. J. Roy. Meteorol. Soc., 136, 146–154, 2010. a, b

Thuburn, J., Efstathiou, G. A., and McIntyre, W. A.: A two-fluid single-column model of turbulent shallow convection, Part II: Single-column model formulation and numerics, Q. J. Roy. Meteorol. Soc., 148, 3448–3469, 2022. a, b, c

Tomassini, L., Willett, M., Sellar, A., Lock, A., Walters, D., Whitall, M., Sanchez, C., Heming, J., Earnshaw, P., Rodriguez, J. M., Ackerley, D., Xavier, P., Franklin, C., and Senior, C. A.: Confronting the convective gray zone in the global configuration of the Met Office Unified Model, J. Adv. Model. Earth Syst., 15, e2022MS003418, https://doi.org/10.1029/2022MS003418, 2023. a

Vallis, G. K.: Atmospheric and Oceanic Fluid Dynamics, in: 2nd Edn., Cambridge University Press, ISBN 978-1107065505, 2017. a

Walters, D., Boutle, I., Brooks, M., Melvin, T., Stratton, R., Vosper, S., Wells, H., Williams, K., Wood, N., Allen, T., Bushell, A., Copsey, D., Earnshaw, P., Edwards, J., Gross, M., Hardiman, S., Harris, C., Heming, J., Klingaman, N., Levine, R., Manners, J., Martin, G., Milton, S., Mittermaier, M., Morcrette, C., Riddick, T., Roberts, M., Sanchez, C., Selwood, P., Stirling, A., Smith, C., Suri, D., Tennant, W., Vidale, P. L., Wilkinson, J., Willett, M., Woolnough, S., and Xavier, P.: The Met Office Unified Model Global Atmosphere 6.0/6.1 and JULES Global Land 6.0/6.1 configurations, Geosci. Model Dev., 10, 1487–1520, https://doi.org/10.5194/gmd-10-1487-2017, 2017. a

Wood, N., Staniforth, A., White, A., Allen, T., Diamantakis, M., Gross, M., Melvin, T., Smith, C., Vosper, S., Zerroukat, M., and Thuburn, J.: An inherently mass-conserving semi-implicit semi-Lagrangian discretisation of the deep-atmosphere global nonhydrostatic equations, Q. J. Roy. Meteorol. Soc., 140, 1505–1520, https://doi.org/10.1002/qj.2235, 2014. a, b

Zerroukat, M. and Allen, T.: SLIC: a Semi-Lagrangian Implicitly Corrected method for solving the compressible Euler equations, J. Comput. Phys., 421, 109739, https://doi.org/10.1016/j.jcp.2020.109739, 2020. a

Zerroukat, M., Wood, N., and Staniforth, A.: SLICE: A Semi-Lagrangian Inherently Conserving and Efficient scheme for transport problems, Q. J. Roy. Meteorol. Soc., 128, 2801–2820, 2002. a

Zerroukat, M., Wood, N., and Staniforth, A.: Application of the Parabolic Spline Method (PSM) to a multidimensional conservative transport scheme (SLICE), J. Comput. Phys., 225, 935–948, 2007. a

Zerroukat, M., Wood, N., and Staniforth, A.: An improved version of SLICE for conservative monotonic remapping on a C-grid, Q. J. Roy. Meteorol. Soc., 135, 541–546, 2009. a, b

This entropy source term does not include the effects of viscosity and mixing of air parcels; see Sect. 4.6.

The ATEX case definition calls for momentum fluxes to be determined by near-surface wind direction and a specified friction velocity. However, the ILES formulation performs poorly with that setup, so a simple bulk model is used instead. The specified friction velocity setup can be restored with very minor code changes.

In theory, the multigrid solver parameters automatically set by the model could change upon restart, breaking bit reproducibility. The author has not noticed any cases where this happens. Nevertheless, this loophole should be closed in a future model version.

⁴

While this paper was under review an error was noticed in the code for the Monin–Obukhov surface momentum flux calculation. Therefore, the results presented here for the ARM case use the bug-fixed version 1.2. The results for BOMEX and DYCOMS are unaffected.

Articles

Short summary

A new computational fluid dynamics code for simulating the atmospheric boundary layer and convection is presented. Moist thermodynamics is formulated via thermodynamic potentials, avoiding inconsistencies that can be introduced with conventional approaches. Numerical methods typical of weather and climate models are used, with no explicit subgrid scheme. Results highlight some advantages (e.g. large time steps) and disadvantages (e.g. weak vertical fluxes near the surface) of this approach.

Potential-based thermodynamics with consistent conservative cascade transport for implicit large eddy simulation: PTerodaC3TILES version 1.0

2.1 Continuous governing equations

2.1.1 Dynamics

2.1.2 Thermodynamics

2.2 Domain, grid, and discretization

2.3 Overview of information flow

2.4 Semi-implicit semi-Lagrangian scheme

2.5 Time-stepping algorithm

2.6 Advection

2.7 Boundary and interior forcing terms

2.7.1 Consistent surface fluxes of mass and water

2.7.2 Surface drag

2.7.3 Consistent interior forcing terms

2.8 Linearized dynamics

2.9 Linearized thermodynamics

2.10 Derivation of the Helmholtz problem

2.11 Helmholtz solver and back substitution

3.1 Generating initial data

3.2 Output and diagnostics

4.1 Advection

4.2 Divergence consistency

4.3 Thermodynamics and linearized thermodynamics

4.4 Solver convergence

4.5 Stability

4.6 Conservation

4.7 Bit reproducibility

5.1 BOMEX

5.2 ARM

5.3 DYCOMS

5.4 Computational cost

A1 Stretched vertical grid

A2 Expressions for internal energy and related quantities

A3 Parameters for multigrid scheme

A4 Alternative formulation of Monin–Obukhov theory for surface momentum flux

Potential-based thermodynamics with consistent conservative cascade transport for implicit large eddy simulation: PTerodaC³TILES version 1.0