Climate-model-informed deep learning of global soil moisture distribution

Klingmüller, Klaus; Lelieveld, Jos

doi:https://doi.org/10.5194/gmd-14-4429-2021

Articles | Volume 14, issue 7

https://doi.org/10.5194/gmd-14-4429-2021

Articles | Volume 14, issue 7

Development and technical paper

19 Jul 2021

Development and technical paper |

| 19 Jul 2021

Climate-model-informed deep learning of global soil moisture distribution

Klaus Klingmüller and Jos Lelieveld

Abstract

We present a deep neural network (DNN) that produces accurate predictions of observed surface soil moisture, applying meteorological data from a climate model. The network was trained on daily satellite retrievals of soil moisture from the European Space Agency (ESA) Climate Change Initiative (CCI). The predictors precipitation, temperature and humidity were simulated with the ECHAM/MESSy atmospheric chemistry–climate model (EMAC). Our evaluation shows that predictions of the trained DNN are highly correlated with the observations, both spatially and temporally, and free of bias. This offers an alternative for parameterisation schemes in climate models, especially in simulations that use but may not focus on soil moisture, which we illustrate with the threshold wind speed for mineral dust emissions. Moreover, the DNN can provide proxies for missing values in satellite observations to produce realistic, comprehensive and high-resolution global datasets. As the approach presented here could be similarly used for other variables and observations, the study is a proof of concept for basic but expedient machine learning techniques in climate modelling, which may motivate additional applications.

Download & links

Article (PDF, 4367 KB)

Supplement (3587 KB)

Download & links

How to cite.

Received: 23 Dec 2020 – Discussion started: 17 Feb 2021 – Revised: 17 May 2021 – Accepted: 29 May 2021 – Published: 19 Jul 2021

1 Introduction

For decades global climate and atmospheric chemistry models have relied on supercomputers with a traditional cluster architecture that utilise many powerful compute nodes in parallel. The individual nodes are typically very potent by themselves, using general purpose central processing unit (CPU) cores with large memory. They are required to serve the needs of diverse algorithms representing many different physical and chemical processes. The implementations of these algorithms quite often have a source code legacy with a history spanning multiple decades. This creates a dependency on the system type, which may limit progress in model studies once the CPU performance increase slows down, which has been typical in the recent past. In fact, due to challenges in the continued miniaturisation of semiconductors, such a slowdown is ongoing, which motivates the search for other performance gains (Leiserson et al., 2020).

Progress has been made to accelerate weather, climate and atmospheric–chemistry models with general-purpose graphics processing units (GPUs) (Yashiro et al., 2016; Alvanos and Christoudias, 2017; Sun et al., 2018; Fuhrer et al., 2018; Müller et al., 2019), but a wider utilisation is pending. Other high-performance computing applications, such as lattice quantum chromodynamics, which have a typically smaller codebase and less legacy code, swiftly exploited the computational resources of GPUs (Egri et al., 2007; Clark et al., 2010). A discipline that not only uses the full potential of GPUs but also builds its present success on the advent of GPU computing is machine learning, specifically deep learning (Krizhevsky et al., 2012). Strong commercial interest has even boosted the development of specialised machine learning hardware and software (e.g. Abadi et al., 2015; Jouppi et al., 2017; Markidis et al., 2018). Atmospheric and climate models can benefit from this development by making use of the computational capabilities of the new hardware to accelerate existing algorithms (Hatfield et al., 2019) or by applying machine learning techniques to complement existing, physical-process-based and (especially) empirical parameterisations (Chevallier et al., 2000).

Given the success of deep learning in many different disciplines (Schmidhuber, 2015; LeCun et al., 2015; Silver et al., 2016) it seems likely that atmospheric modelling and climate science in general can benefit from this methodology (not only in terms of performance gains), and promising results have been presented lately to this end (e.g. Kadow et al., 2020; Chandra et al., 2021). However, reservations about machine learning exist. In contrast to physical models, where the simulation result is deduced from the laws of physics and physical parameters, trained models often represent black boxes where the rules by which they compute the output tend to be non-transparent and cannot easily be modified to represent varied physical conditions. On the other hand, complex conventional, phenomenologically derived parameterisations at times also lack a clear physical meaning, and their parameters are often tuned to obtain realistic results, which is essentially a non-systematic form of training. Moreover, methods for interpreting machine learning models are emerging (Montavon et al., 2018; Kohoutová et al., 2020). Besides, depending on the application, it may be irrelevant whether a process is implemented as a black box or not if the scientific focus is on other processes. For instance, the present study was motivated by the need for reliable soil moisture data to develop a mineral dust emission scheme.

Soil moisture near the surface has a significant impact on the emissions of mineral dust (Klingmüller et al., 2016), and it is generally of great importance for weather and climate. It has been identified by the Global Observing System for Climate (GCOS) as an essential climate variable (ECV, Bojinski et al., 2014) and, for example, soil moisture can greatly impact mesoscale convective systems (Klein and Taylor, 2020). While detailed parameterisations of soil moisture exist (e.g. Ekici et al., 2014), many models, such as the ECHAM/MESSy atmospheric chemistry–climate model (EMAC) (Jöckel et al., 2006), still use relatively simple models.

On the other hand, because moisture at the surface affects the dielectric properties of the soil, it can be retrieved well remotely using microwaves, and an extensive global daily dataset covering the past 4 decades is provided by the European Space Agency (ESA) Climate Change Initiative (CCI) (Dorigo et al., 2017; Gruber et al., 2017, 2019).

To make use of the ESA CCI surface soil moisture data in climate models, they cannot be imported directly because the daily subsets have substantial gaps depending on the local retrieval conditions at overpass time. Moreover, merely importing observations would limit model applications to hindcasting. Therefore, we pursued an alternative approach and use the satellite data for supervised training of a deep neural network (DNN) to predict soil moisture based on modelled meteorological data. In doing so, we explored the potential of machine learning to complement physical models using an introductory example, which demonstrates the useful results that can be achieved with limited technical effort and which might instigate further applications.

This article is structured as follows: the datasets used are presented in Sect. 2, the DNN is introduced and evaluated in Sect. 3, and applications of the DNN are discussed in Sect. 4 before we draw conclusions in Sect. 5.

2 Data

We used results from a 10-year simulation with the ECHAM/MESSy atmospheric chemistry–climate model (EMAC) (Jöckel et al., 2006) version 2.52, covering the years 2006 to 2015. The exact configuration is described by Klingmüller et al. (2020). Horizontally, the setup employs a Gaussian T63 grid with approximately 1.9^∘ spacing. EMAC assimilates observational data by nudging temperature, vorticity and divergence above the boundary layer to meteorological reanalysis data of the European Centre for Medium-Range Weather Forecasts (ECMWF) and by using the sea surface temperature from the same dataset.

The EMAC output variables we considered are precipitation, surface temperature and humidity, which are preprocessed to daily average values. In addition, we made use of the static ecosystem rooting depth map of the online dust emission scheme, originally from Schenk and Jackson (2009).

The EMAC data were complemented by daily volumetric soil moisture (i.e. the ratio of water relative to soil volume) observations from the ESA CCI Soil Moisture Product Release v04.5 (Dorigo et al., 2017; Gruber et al., 2017, 2019). This dataset is representative of the soil moisture in the topmost few centimetres of soil (down to about 5 cm). We used the dataset that combines retrievals from active and passive spaceborne microwave instruments, which we aggregated from the original spatial grid with 0.25^∘ spacing to the Gaussian T63 grid of the EMAC results.

We subdivided the 10-year period covered by the EMAC simulation into a training period of 8 years from 2006 to 2013 and a test period of 2 years from 2014 to 2015. This choice maximises the length of the training period while keeping more than 1 year for testing, allowing the identification of interannual variations during the testing period. Moreover, using training and testing periods in chronological order, all predictions in the testing period represent forecasts. The test period was exclusively used to evaluate the DNN after training. Every third year of the training period (2008 and 2011) was used for validating and monitoring the training procedure.

3 DNN model

The basic concept of our approach is to relate the observed soil moisture to relevant quantities modelled by the global climate model. We applied a simple DNN architecture which operates on one grid cell at a time. As a consequence, the DNN can easily be integrated as a submodel in global climate models such as EMAC. The meteorological variables used as predictors are selected based on their availability in the climate simulation output and physical relevance: the rain rate represents the source of soil moisture, whereas surface temperature and atmospheric humidity control soil drying. To account for cumulative effects, in addition to the actual daily mean rain rate, surface temperature and specific humidity, we provided the DNN with the corresponding values lagged by 1 d, the mean of values with 2 to 3 d lag, and the mean of 4 to 7 d lagged values. According to preliminary tests, considering longer lags did not yield a significant improvement, but it might become relevant if the overall performance can be enhanced in future versions. To account for regional characteristics such as soil properties, the DNN uses longitude and latitude, encoded in the triple sin(lon), cos(lon), and sin(lat), as well as the local rooting depth. Likewise, to account for seasonality, for example due to vegetation variation, we supplied the DNN with the time of year encoded as $\sin (2 π t / a)$ and $\cos (2 π t / a)$ , where $t / a$ is the time in years. In total, this amounts to 18 input variables that had to be mapped to one output variable, i.e. the surface soil moisture.

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f01

Figure 1DNN architecture and hyperparameters. The input variables on the left are processed by four fully connected hidden layers of 512 units each to yield the soil moisture on the right. All four hidden layers use a rectified linear unit function activation and during the training use a dropout rate of 10 % to avoid overfitting. The output layer with only one unit applies a linear activation function to obtain the final result. The DNN operates on normalised variables with a standard deviation of 1 and mean of 0.

Download

For the above purpose, we employed a generic DNN of linearly stacked densely connected layers as illustrated in Fig. 1. Four hidden layers of 512 units with rectified linear activation are followed by the output layer with a single unit and linear activation. There is no strict formula for the number of layers and units per layer, but DNNs that are too small are not capable of learning complex rules, whereas DNNs that are too large are more at risk of overfitting and require more computational resources. Our values are a compromise that is proving to work well, but systematic optimisation could possibly yield better results. To generalise the DNN and prevent overfitting during training, we regularised it by applying a 10 % dropout rate to the hidden layers (Hinton et al., 2012) and stopped the training process as soon as the validation loss was no longer improving. Before training, all input and output variables were normalised independently to have a mean of 0 and a standard deviation of 1. Accordingly, before using the DNN for predictions, the same transformations had to be applied to the input data, and an inverse transformation had to be applied to the output.

We implemented the DNN and performed the training and inference using the TensorFlow library (Abadi et al., 2015) with the Keras interface (Chollet et al., 2015, 2017) for R (R Core Team, 2019). The training took about 40 min on an Nvidia Tesla V100 GPU accelerator. The much less computationally demanding predictions were evaluated on common desktop hardware.

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f02

Figure 2Comparison of observed and predicted volumetric surface soil moisture considering all daily grid cell values available globally during the years 2014 and 2015. The colours represent the density distribution of the scatter, and outliers are represented by dots. Pearson correlation coefficient r, root-mean-square error (RMSE), mean absolute error (MAE) and the number of data points n are provided in the upper-left corner.

Download

To assess the overall predictive power of the DNN, we compared the volumetric soil moisture calculations with the corresponding observations for all grid cells and days during the test period (2014 to 2015) where retrievals are available. Figure 2 shows the scatter of the 1247041 observation–prediction pairs. It demonstrates the remarkably high quality of the predictions, which are strongly correlated with the observations, with a Pearson correlation coefficient of 0.92, and do not show any bias, resulting in a small root-mean-square error of 0.033 and a mean absolute error of 0.024, which is an order of magnitude smaller than the average data values. Note that the Gaussian grid has more grid cells per area at higher latitudes, giving those regions more weight in this comparison, but the number of relevant grid cells in polar regions is small.

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f03

Figure 3Spatial correlation of predicted and observed volumetric soil moisture throughout the test period.

Download

While Fig. 2 combines the effect of the spatial and temporal variability, Fig. 3 shows the spatial correlation for each day separately, using all grid cells with observations during that day. The correlation coefficient attains high values around 0.92 throughout the test period and rarely drops below 0.9. Considering that the training takes place before the test period, it is noteworthy that there is no significant decline of the correlation over the two years. Essentially, at any time the DNN yields a realistic representation over the globe, predicting low soil moisture in arid regions and high soil moisture in wet areas.

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f04

Figure 4Correlation coefficient of the observed and predicted volumetric soil moisture time series in the individual grid cells during the test years 2014 and 2015. In the grey regions no observations are available during the this period.

Equally important is a realistic representation of the temporal variation for each grid cell. Due to the strong temporal variability of weather and precipitation in particular, this is more challenging, and we do not expect equally high correlation coefficients as those found for the spatial correlation. Nevertheless, Fig. 4 shows that the temporal correlation coefficients are globally high, with a mean of almost 0.7, which appears to be a good performance when compared with state-of-the-art model data (Dorigo et al., 2017). The correlation coefficient is larger than 0.5 in almost all grid cells except those representing the extremely dry soils of the Sahara, Rub' al Khali, Taklamakan and Gobi deserts. Related to this, a slight overestimation by the DNN towards the lower end of the soil moisture range can be identified in Fig. 2. Training and evaluation of the DNN are challenging in this range because satellite retrievals are both sparse and uncertain (see, e.g. Fig. S1 in the Supplement); therefore, depending on the application, some additional effort focusing on these regions might be required. For dust emissions, the soil moisture in these regions is too low to significantly influence the results, while it is more relevant in semi-arid regions, where observations and predictions are more reliable (Figs. 4 and S2 in the Supplement).

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f05

Figure 5Time series of the observed and predicted daily volumetric soil moisture values and the EMAC soil water in the grid cell centred at 49.4^∘ N, 7.5^∘ E in Europe.

Download

Considering the grid cell centred at 49.4^∘ N, 7.5^∘ E, the upper panel of Fig. 5 exemplifies how closely the predicted volumetric soil moisture time series resembles the observations. There is no bias in the predictions and the seasonal cycle is well represented. Moreover, the short-term variations in the predictions show a clear similarity to those of the observations. Both have a comparable amplitude and frequency, and characteristic features in the observed time series are reproduced by the predictions, e.g. in July 2014, October 2014, December 2014/January 2015 and March 2015 (Fig. S3 in the Supplement). These features occur irregularly and are not repeated year after year, which demonstrates that the DNN did not simply learn one representative climatology but instead utilises information from the meteorological data provided by the climate model. We reiterate that the nudged EMAC meteorology is close but not identical to the real atmosphere, and hence small deviations are expected.

For the realistic representation of the temporal soil moisture variation in each grid cell, the meteorological predictors are essential. The climatological model obtained by training the DNN and omitting the meteorological predictors still reaches an overall correlation coefficient of close to 0.9, but the global mean of temporal correlation coefficient drops substantially from close to 0.7 to below 0.5 (Fig. S4 in the Supplement). In addition to the interannual variations, the variability on timescales shorter than 1 month is lost by ignoring the meteorology (Fig. S5 in the Supplement). The relevance of the individual meteorological predictors for the temporal variations differs regionally. For example, in Europe the DNN predictions are sensitive to all meteorological predictors, but in arid and semi-arid regions of the Middle East they are not sensitive to reductions in rain (Figs. S6 to S8 in the Supplement). Here, all variability is inferred from temperature and humidity. This does not necessarily imply rain to be the least important driver of soil moisture variability in the region, rather it is the least reliable predictor due to the uncertainty in modelled precipitation.

The meteorological predictors are also required to make the DNN sensitive to climatic changes. Increasing temperature or decreasing humidity promotes soil drying, whereas increasing rain enhances the soil moisture, which is reproduced by the DNN (Figs. S6 to S8). Therefore, if climatic changes are represented in the meteorological predictors, the DNN soil moisture will respond to theses changes. The response might be limited because in the present configuration the DNN can substitute meteorological information with the knowledge of season and location. Consequently, for applications in long-term climate projections, training and evaluation on longer timescales and possibly refinements of the DNN architecture are advisable.

The spatial coordinate predictors are crucial for a realistic spatial variation in the soil moisture. They are the parameters encoding information on a wide range of surface properties, including soil and vegetation types, and substantially improve the DNN. Less relevant are the seasonal predictors, but, if provided, the DNN makes use of them so that the DNN predictions are sensitive to variations in the time of year (Figs. S6 to S8) and the temporal correlation is improved (Fig. S4).

4 Applications

The DNN operates on a single grid cell at a time and therefore can easily be incorporated in climate models, for example in the submodel core layer of EMAC. This provides realistic soil moisture values to other submodels, which can optionally replace the current parameterisation of soil hydrology. For instance, this could be advantageous for mineral dust emission schemes, which should account for reduced emissions from wet soils, but so far these have been limited by the inadequate representation of soil moisture in the topmost surface layer by the physically modelled soil water (Klingmüller et al., 2018).

The lower panel of Fig. 5 shows the time series of the physically modelled EMAC soil water. Evidently, this time series is largely unrelated to the observed surface soil moisture in the upper panel: the short-term variability is smaller, whereas the long-term variability is much larger, showing a strong decline during summer 2015 that is not present in the observations. Regardless of the question as to whether this decline reflects a true decline in the water content in deeper soil layers or only a model artefact, it is obviously impossible to map the EMAC soil water to a realistic representation of the observed surface soil moisture in the panel above. However, it is the latter that is required for parameterisations like the mineral dust emission scheme. Because the DNN presented here fulfils this requirement, we propose using it as an improvement over the EMAC computed soil water and as a viable alternative to more sophisticated physical soil moisture models. The algorithm cannot replace physically based soil moisture representations for first-principle process studies, but it is an accurate substitute for parameterisations that depend on limited empirical information.

Mineral dust emissions are predominantly caused by saltation bombardment where saltating particles on impact with the surface eject finer dust sediments or disintegrate to finer particles themselves. To activate and sustain a horizontal flux of saltating particles, the surface friction velocity of the air has to exceed a threshold that depends on the soil properties. Soil moisture increases this threshold, thereby reducing the dust emissions. We studied this effect using the parameterisation presented by Fécan et al. (1999),

\begin{matrix} (1) & \frac{u_{* t}}{u_{* td}} = \sqrt{1 + 1.21 (w - (0.0014 ϕ_{clay}^{2} + 0.17 ϕ_{clay}))^{0.68}}, \end{matrix}

where u_*t is the threshold surface friction velocity, u_*td is the corresponding threshold for dry soil, w is the gravimetric soil moisture in percent and ϕ_clay the soil clay fraction in percent. The equation is applied if the soil moisture exceeds $0.0014 ϕ_{clay}^{2} + 0.17 ϕ_{clay}$ , representing the minimum soil moisture required to induce an increase in the threshold. Like Astitha et al. (2012), we combined this soil moisture dependency with the threshold surface friction velocity parameterisation of Marticorena and Bergametti (1995), applied to a saltation particle diameter of 60 µm. The full equation for the threshold surface friction velocity is reproduced in Appendix A. We evaluated the equation for air density ρ_air = 1.2 kg m⁻³ using the clay fraction distribution from Shangguan et al. (2014). To convert the volumetric to gravimetric soil moisture we assumed a soil bulk density of 1600 kg m⁻³. For comparison, we converted the EMAC soil water from water volume per area to gravimetric soil moisture using the same bulk density and assuming the water to be evenly distributed over the soil column defined by the rooting depth.

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f06

Figure 6The surface friction velocity threshold above which dust is emitted, averaged over the Arabian Peninsula and Mesopotamia. The surface friction velocity threshold u_*t is computed using the parameterisation provided in the Appendix A. The threshold in terms of the wind speed at 10 m altitude u_10 m,t represents the climatological dataset retrieved by Pu et al. (2020).

Download

We focused our analysis on Mesopotamia and the Arabian Peninsula, where a significant correlation of soil moisture and dust emissions was reported (Klingmüller et al., 2016), and considered the regional average of the threshold surface friction velocity over the territory of Iraq, Israel, Jordan, Kuwait, Lebanon, Oman, Qatar, Saudi Arabia, the State of Palestine, Syria, the United Arab Emirates and Yemen. The threshold surface friction velocity during the test period is shown in Fig. 6 and was calculated using the observed, predicted and EMAC-calculated soil moisture. The results based on the observed and predicted soil moisture show good agreement and a strong seasonal cycle. During summer, the threshold calculated based on the DNN predictions tends to be slightly higher than that based on the observed values, indicating an effect of the aforementioned challenges in hyper arid regions. Nevertheless, the difference between both results is much smaller than the interannual variations, and the DNN result is in the range of the short-term variations in the observation-based results that are due to the varying retrieval coverage. In contrast, the result based on the EMAC soil water has little variability and is therefore inconsistent with the other two results, irrespective of the precise conversion factor used to obtain the gravimetric surface soil moisture. We conclude that the value of the EMAC soil water, which represents the total water including moisture in deeper soil layers, is limited in this context, but so far it has been the only estimate available for EMAC simulations.

Figure 6 additionally shows results from a recent dataset of Pu et al. (2020), who have retrieved a climatological monthly global distribution of the threshold in terms of the wind speed at 10 m altitude based on satellite and reanalysis data. Two versions of the data represent different assumptions used to identify dust events based on the dust optical depth (DOD). The seasonal cycle is similar to that of the predicted threshold surface friction velocity with a comparable relative amplitude and a similar but slightly shifted phase. Assuming a logarithmic wind profile, the predicted surface friction velocity threshold can be converted to the corresponding 10 m wind speed u_10 m,t = $u_{* t} \ln (10 m / z_{0}) / κ$ , where κ≈0.4 is the Von Kármán constant and z₀ the surface roughness. Consistent with Astitha et al. (2012), we used the surface roughness z₀ = 0.0001 m. According to the logarithmic profile, the threshold that Eq. (A1) yields for dry soils, i.e. the minimal threshold u_*td = 0.26 m s⁻¹, corresponds to the 10 m wind speed u_10 m,td = 7.5 m s⁻¹. This value is higher than most of the climatological values, however, the latter were derived based on 6-hourly wind speeds, whereas the parameterisation we used is meant for and applied to instantaneous surface friction velocities that vary on shorter timescales (only limited by the model time step, e.g. 12 min in our T63 simulation and even shorter for higher spatial resolutions) and reach higher peak values. Therefore, the threshold definitions differ and do not allow direct comparisons of the absolute values. Additionally, because the retrieval of the climatology does not account for dust transport, it may regionally underestimate the threshold.

The substantial variations in the threshold surface friction velocity obtained based on the observed soil moisture emphasise the relevance of soil moisture for dust emissions. The soil moisture predicted by the DNN is sufficiently realistic to reproduce these variations and to account for this important effect in global climate model simulations.

https://gmd.copernicus.org/articles/14/4429/2021/gmd-14-4429-2021-f07

Figure 7Global distribution of the observed and predicted volumetric soil moisture on a Northern Hemisphere winter (15 January 2012) and summer (15 July 2012) day in the training period.

In addition to the incorporation in global climate models, another application of the DNN is the reprocessing of remote sensing data. Based on meteorological input data, the DNN predicts the global daily soil moisture distribution consistent with the observations. In contrast to the observational datasets that have substantial gaps in regions and time periods where conditions do not allow retrievals, the meteorological input data does not have any missing values, and consequently the same applies to the predicted soil moisture. The latter can therefore also be used to consistently fill the gaps in the observations to obtain a complete daily global soil moisture dataset. Figure 7 shows the global distribution of the observed and predicted soil moisture on two example days from the training period, one during winter (15 January 2012) and the other during summer (15 July 2012), both in the Northern Hemisphere. Observations outside the training period can be processed as well (Fig. S9 in the Supplement), but to obtain optimal results the training should include the period of interest. Regardless of the extensive regions without observations, the prediction yields global values within a reasonable range, closely resembling the observations where available. Note that in regions or seasons where no or only few observations are available throughout the training period, the predictions have to be interpreted with caution. This applies to the rain forests, central deserts and regions permanently or seasonally frozen or covered by snow. In the complete soil moisture distribution the seasonal variations become apparent, promoting the DNN predictions for use in further studies, such as trend analyses. Moreover, in contrast to the incomplete observations, the optimised predictions can straightforwardly be assimilated into climate models.

5 Conclusions

We have presented a machine learning model that relates soil moisture to meteorological conditions. Informed by a climate model, this DNN is able to accurately predict satellite-based surface soil moisture observations, as demonstrated by our evaluation. Using the example of the threshold wind speed for mineral dust emissions, we showed that the DNN predictions can be used for improved representations of processes depending on the surface soil moisture within climate models.

The DNN in its present form should be regarded as a proof of concept, and there is room for improvement. The current DNN architecture, the simple stack of several densely connected layers, is very generic. While it is generally quite powerful, it is not tailored to our specific application and other concepts might be considered as well. Convolutional neural networks could exploit the spatial relationship of neighbouring grid cells and recurrent neural networks might more optimally account for the causal relationship of the soil moisture at successive days including long-term accumulative effects. The causal relation is partly addressed in our implementation by the consideration of lagged meteorological variables, representing a temporal convolution. However, the prediction is not informed about the conditions prior to 1 week in the past. This apparently works well for the surface soil moisture but is probably not sufficient for additional applications, such as those that require information about the moisture in deeper soil layers. The hyperparameters of the DNN, including the number of layers, the number of units per layer and the selection of input variables, are chosen to be appropriate for the problem but have not been systematically optimised. We conclude that there are various pathways for future developments that may enhance the DNN performance. Nevertheless, the present DNN setup can already be beneficial for applications such as online mineral dust emission schemes in climate models. Therefore, the trained DNN will be implemented as an EMAC submodel using the MESSy interface.

The CPU time required for the inferencing using the trained DNN is negligible compared to the total computational demand of a global climate model. Other applications of DNNs within climate models may be more demanding, in particular if they process three-dimensional data instead of only two-dimensional surface data and if they have to be called more often than once per day. In this case, GPUs or specialised inferencing hardware can be employed to evaluate the trained model, which by design efficiently utilises such accelerators. In this way, climate models can benefit from GPUs that are available in many supercomputers without the need for porting complex algorithms to GPUs, and from the rapid development of machine learning hardware.

For hindcasting applications, an alternative to implementing the trained soil moisture DNN into climate models is to import the consistent and comprehensive global soil moisture prediction from the DNN at runtime. Because such a dataset is also of general use, it appears to be promising to repeat the procedure presented here with high-resolution meteorological reanalysis data.

Overall, our example demonstrates that machine learning models informed by data from traditional, physical-process-based climate models can perform well in learning and predicting observational data. In return, they can complement process parameterisations in climate models, especially when the parameterisations rely on limited empirical data. Furthermore, they may help climate models to efficiently utilise recent hardware architectures.

Appendix A: Surface friction velocity threshold for dust emissions

The full equation for the threshold surface friction velocity u_*t used in Sect. 4 is as follows:

\begin{matrix} (A1) & \begin{aligned} u_{* t} = & 0.129 \sqrt{\frac{D_{p}}{ρ_{air}} (ρ_{p} g + \frac{0.006 g \sqrt{cm} s^{- 2}}{D_{p}^{5 / 2}})} \\ \times \{\begin{cases} \frac{1}{\sqrt{1.928 B^{0.092} - 1}} & B < 10 \\ (1 - 0.0858 e^{- 0.0617 (B - 10)}) & B \geq 10 \end{cases} \\ \times {(1 - \frac{\ln \frac{z_{o}}{z_{os}}}{\ln (0.35 {(\frac{10 cm}{z_{os}})}^{0.8})})}^{- 1} \\ \times \sqrt{\begin{array}{c} 1 + 1.21 max \\ \cdot {(0, (w - (0.0014 ϕ_{clay}^{2} + 0.17 ϕ_{clay})))}^{0.68} \end{array}} . \end{aligned} \end{matrix}

The key to Eq. (A1) is as follows.

D_p = 60 µm	saltation particle diameter
ρ_air	air density
ρ_p = 2.65 g cm⁻³	particle density
g = 9.80665 m s⁻²	gravitational acceleration
B = $\frac{u_{* t} D_{p}}{v}$	friction Reynolds number,
	initially B = $1331 (D_{p} / cm)^{1.56}$
	+ 0.38
v = 0.157 × 10⁻⁴ m² s⁻¹	kinematic viscosity of air
z_o = 0.01 cm	surface roughness length
z_os = 0.00333 cm	local roughness length of the
	uncovered surface
w	gravimetric soil moisture in %
ϕ_clay	clay fraction in %

Code and data availability

The ECHAM climate model is available to the scientific community under the MPI-M Software License Agreement (https://mpimet.mpg.de/en/science/modeling-with-icon/code-availability, last access: 11 November 2020, MPI-M, 2020). The Modular Earth Submodel System (MESSy) is continuously further developed and applied by a consortium of institutions. The usage of MESSy and access to the source code are licensed to all affiliates of institutions that are members of the MESSy Consortium. Institutions can become a member of the MESSy Consortium by signing the MESSy Memorandum of Understanding. More information can be found on the MESSy Consortium Website (https://www.messy-interface.org, last access: 11 November 2020, MESSy, 2020). The data and DNN parameters used in this study are available at https://edmond.mpdl.mpg.de/imeji/collection/eLt_AnQ98XFaaznl (last access: 22 June 2021) (Klingmüller, 2021).

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/gmd-14-4429-2021-supplement.

Author contributions

KK conceived the study, implemented the DNN and, supported by JL, wrote the manuscript. Both authors discussed the results and finalised the article.

Competing interests

The authors declare that they have no conflict of interest.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Acknowledgements

We acknowledge financial support from the MaxWater Initiative of the Max Planck Society.

Financial support

The article processing charges for this open-access publication were covered by the Max Planck Society.

Review statement

This paper was edited by Rohitash Chandra and reviewed by three anonymous referees.

References

Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mané, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viégas, F., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems, available at: https://www.tensorflow.org/ (last access: 14 July 2021), software available from tensorflow.org, 2015. a, b

Alvanos, M. and Christoudias, T.: GPU-accelerated atmospheric chemical kinetics in the ECHAM/MESSy (EMAC) Earth system model (version 2.52), Geosci. Model Dev., 10, 3679–3693, https://doi.org/10.5194/gmd-10-3679-2017, 2017. a

Astitha, M., Lelieveld, J., Abdel Kader, M., Pozzer, A., and de Meij, A.: Parameterization of dust emissions in the global atmospheric chemistry-climate model EMAC: impact of nudging and soil properties, Atmos. Chem. Phys., 12, 11057–11083, https://doi.org/10.5194/acp-12-11057-2012, 2012. a, b

Bojinski, S., Verstraete, M., Peterson, T. C., Richter, C., Simmons, A., and Zemp, M.: The Concept of Essential Climate Variables in Support of Climate Research, Applications, and Policy, B. Am. Meteorol. Soc., 95, 1431–1443, https://doi.org/10.1175/BAMS-D-13-00047.1, 2014. a

Chandra, R., Cripps, S., Butterworth, N., and Muller, R. D.: Precipitation reconstruction from climate-sensitive lithologies using Bayesian machine learning, Environ. Modell. Softw., 139, 105002, https://doi.org/10.1016/j.envsoft.2021.105002, 2021. a

Chevallier, F., Morcrette, J.-J., Chéruy, F., and Scott, N. A.: Use of a neural-network-based long-wave radiative-transfer scheme in the ECMWF atmospheric model, Q. J. Roy. Meteor. Soc., 126, 761–776, https://doi.org/10.1002/qj.49712656318, 2000. a

Chollet, F., Allaire, J., et al.: R Interface to Keras, available at: https://github.com/rstudio/keras (last access: 14 July 2021), GitHub, 2017. a

Chollet, F., et al.: Keras, available at: https://keras.io (last access: 14 July 2021), keras.io, 2015. a

Clark, M., Babich, R., Barros, K., Brower, R., and Rebbi, C.: Solving lattice QCD systems of equations using mixed precision solvers on GPUs, Comput. Phys. Commun., 181, 1517–1528, https://doi.org/10.1016/j.cpc.2010.05.002, 2010. a

Dorigo, W., Wagner, W., Albergel, C., Albrecht, F., Balsamo, G., Brocca, L., Chung, D., Ertl, M., Forkel, M., Gruber, A., Haas, E., Hamer, P. D., Hirschi, M., Ikonen, J., de Jeu, R., Kidd, R., Lahoz, W., Liu, Y. Y., Miralles, D., Mistelbauer, T., Nicolai-Shaw, N., Parinussa, R., Pratola, C., Reimer, C., van der Schalie, R., Seneviratne, S. I., Smolander, T., and Lecomte, P.: ESA CCI Soil Moisture for improved Earth system understanding: State-of-the art and future directions, Remote Sens. Environ., 203, 185–215, https://doi.org/10.1016/j.rse.2017.07.001, Earth Observation of Essential Climate Variables, 2017. a, b, c

Egri, G. I., Fodor, Z., Hoelbling, C., Katz, S. D., Nógrádi, D., and Szabó, K. K.: Lattice QCD as a video game, Comput. Phys. Commun., 177, 631–639, https://doi.org/10.1016/j.cpc.2007.06.005, 2007. a

Ekici, A., Beer, C., Hagemann, S., Boike, J., Langer, M., and Hauck, C.: Simulating high-latitude permafrost regions by the JSBACH terrestrial ecosystem model, Geosci. Model Dev., 7, 631–647, https://doi.org/10.5194/gmd-7-631-2014, 2014. a

Fuhrer, O., Chadha, T., Hoefler, T., Kwasniewski, G., Lapillonne, X., Leutwyler, D., Lüthi, D., Osuna, C., Schär, C., Schulthess, T. C., and Vogt, H.: Near-global climate simulation at 1 km resolution: establishing a performance baseline on 4888 GPUs with COSMO 5.0, Geosci. Model Dev., 11, 1665–1681, https://doi.org/10.5194/gmd-11-1665-2018, 2018. a

Fécan, F., Marticorena, B., and Bergametti, G.: Parametrization of the increase of the aeolian erosion threshold wind friction velocity due to soil moisture for arid and semi-arid areas, Ann. Geophys., 17, 149–157, https://doi.org/10.1007/s00585-999-0149-7, 1999. a

Gruber, A., Dorigo, W. A., Crow, W., and Wagner, W.: Triple Collocation-Based Merging of Satellite Soil Moisture Retrievals, IEEE T. Geosci. Remote, 55, 6780–6792, 2017. a, b

Gruber, A., Scanlon, T., van der Schalie, R., Wagner, W., and Dorigo, W.: Evolution of the ESA CCI Soil Moisture climate data records and their underlying merging methodology, Earth Syst. Sci. Data, 11, 717–739, https://doi.org/10.5194/essd-11-717-2019, 2019. a, b

Hatfield, S., Chantry, M., Düben, P., and Palmer, T.: Accelerating High-Resolution Weather Models with Deep-Learning Hardware, in: Proceedings of the Platform for Advanced Scientific Computing Conference, PASC '19, Association for Computing Machinery, New York, NY, USA, 12–14 June 2019, https://doi.org/10.1145/3324989.3325711, 2019. a

Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R.: Improving neural networks by preventing co-adaptation of feature detectors, CoRR, arXiv [preprint], 1207.0580, 2012. a

Jöckel, P., Tost, H., Pozzer, A., Brühl, C., Buchholz, J., Ganzeveld, L., Hoor, P., Kerkweg, A., Lawrence, M. G., Sander, R., Steil, B., Stiller, G., Tanarhte, M., Taraborrelli, D., van Aardenne, J., and Lelieveld, J.: The atmospheric chemistry general circulation model ECHAM5/MESSy1: consistent simulation of ozone from the surface to the mesosphere, Atmos. Chem. Phys., 6, 5067–5104, https://doi.org/10.5194/acp-6-5067-2006, 2006. a, b

Jouppi, N. P., Young, C., Patil, N., Patterson, D., Agrawal, G., Bajwa, R., Bates, S., Bhatia, S., Boden, N., Borchers, A., Boyle, R., Cantin, P.-l., Chao, C., Clark, C., Coriell, J., Daley, M., Dau, M., Dean, J., Gelb, B., Ghaemmaghami, T. V., Gottipati, R., Gulland, W., Hagmann, R., Ho, C. R., Hogberg, D., Hu, J., Hundt, R., Hurt, D., Ibarz, J., Jaffey, A., Jaworski, A., Kaplan, A., Khaitan, H., Killebrew, D., Koch, A., Kumar, N., Lacy, S., Laudon, J., Law, J., Le, D., Leary, C., Liu, Z., Lucke, K., Lundin, A., MacKean, G., Maggiore, A., Mahony, M., Miller, K., Nagarajan, R., Narayanaswami, R., Ni, R., Nix, K., Norrie, T., Omernick, M., Penukonda, N., Phelps, A., Ross, J., Ross, M., Salek, A., Samadiani, E., Severn, C., Sizikov, G., Snelham, M., Souter, J., Steinberg, D., Swing, A., Tan, M., Thorson, G., Tian, B., Toma, H., Tuttle, E., Vasudevan, V., Walter, R., Wang, W., Wilcox, E., and Yoon, D. H.: In-Datacenter Performance Analysis of a Tensor Processing Unit, in: ISCA '17: The 44th Annual International Symposium on Computer Architecture, ACM, New York, NY, USA, 24–28 June 2017, https://doi.org/10.1145/3079856.3080246, 2017. a

Kadow, C., Hall, D. M., and Ulbrich, U.: Artificial intelligence reconstructs missing climate information, Nat. Geosci., 13, 408–413, https://doi.org/10.1038/s41561-020-0582-5, 2020. a

Klein, C. and Taylor, C. M.: Dry soils can intensify mesoscale convective systems, P. Natl. Acad. Sci. USA, 117, 21132–21137, https://doi.org/10.1073/pnas.2007998117, 2020. a

Klingmüller, K.: Climate model-informed deep learning of global soil moisture distribution – data, available at: https://edmond.mpdl.mpg.de/imeji/collection/eLt_AnQ98XFaaznl (last access: 22 June 2021), Max Planck Society, 2021. a

Klingmüller, K., Pozzer, A., Metzger, S., Stenchikov, G. L., and Lelieveld, J.: Aerosol optical depth trend over the Middle East, Atmos. Chem. Phys., 16, 5063–5073, https://doi.org/10.5194/acp-16-5063-2016, 2016. a, b

Klingmüller, K., Metzger, S., Abdelkader, M., Karydis, V. A., Stenchikov, G. L., Pozzer, A., and Lelieveld, J.: Revised mineral dust emissions in the atmospheric chemistry–climate model EMAC (MESSy 2.52 DU_Astitha1 KKDU2017 patch), Geosci. Model Dev., 11, 989–1008, https://doi.org/10.5194/gmd-11-989-2018, 2018. a

Klingmüller, K., Karydis, V. A., Bacer, S., Stenchikov, G. L., and Lelieveld, J.: Weaker cooling by aerosols due to dust–pollution interactions, Atmos. Chem. Phys., 20, 15285–15295, https://doi.org/10.5194/acp-20-15285-2020, 2020. a

Kohoutová, L., Heo, J., Cha, S., Lee, S., Moon, T., Wager, T. D., and Woo, C.-W.: Toward a unified framework for interpreting machine-learning models in neuroimaging, Nat. Protoc., 15, 1399–1435, https://doi.org/10.1038/s41596-019-0289-5, 2020. a

Krizhevsky, A., Sutskever, I., and Hinton, G. E.: ImageNet Classification with Deep Convolutional Neural Networks, in: Advances in Neural Information Processing Systems, edited by: Pereira, F., Burges, C. J. C., Bottou, L., and Weinberger, K. Q., vol. 25, pp. 1097–1105, Curran Associates, Inc., available at: https://proceedings.neurips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf (last access: 14 July 2021), 2012. a

LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521, 436–444, https://doi.org/10.1038/nature14539, 2015. a

Leiserson, C. E., Thompson, N. C., Emer, J. S., Kuszmaul, B. C., Lampson, B. W., Sanchez, D., and Schardl, T. B.: There's plenty of room at the Top: What will drive computer performance after Moore's law?, Science, 368, eaam9744, https://doi.org/10.1126/science.aam9744, 2020. a

Markidis, S., Chien, S. W. D., Laure, E., Peng, I. B., and Vetter, J. S.: NVIDIA Tensor Core Programmability, Performance Precision, in: 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 21–25 May 2018, Vancouver, Canada, pp. 522–531, https://doi.org/10.1109/IPDPSW.2018.00091, 2018. a

Marticorena, B. and Bergametti, G.: Modeling the atmospheric dust cycle: 1. Design of a soil-derived dust emission scheme, J. Geophys. Res., 100, 16415, https://doi.org/10.1029/95JD00690, 1995. a

MESSy: The Modular Earth Submodel System, available at: https://www.messy-interface.org (last access: 11 November 2020), MESSy Consortium, 2020. a

Montavon, G., Samek, W., and Müller, K.-R.: Methods for interpreting and understanding deep neural networks, Digit. Signal Process., 73, 1–15, https://doi.org/10.1016/j.dsp.2017.10.011, 2018. a

MPI-M: Code availability, available at: https://mpimet.mpg.de/en/science/modeling-with-icon/code-availability (last access: 11 November 2020), Max Planck Institute for Meteorology, 2020. a

Müller, A., Deconinck, W., Kühnlein, C., Mengaldo, G., Lange, M., Wedi, N., Bauer, P., Smolarkiewicz, P. K., Diamantakis, M., Lock, S.-J., Hamrud, M., Saarinen, S., Mozdzynski, G., Thiemert, D., Glinton, M., Bénard, P., Voitus, F., Colavolpe, C., Marguinaud, P., Zheng, Y., Van Bever, J., Degrauwe, D., Smet, G., Termonia, P., Nielsen, K. P., Sass, B. H., Poulsen, J. W., Berg, P., Osuna, C., Fuhrer, O., Clement, V., Baldauf, M., Gillard, M., Szmelter, J., O'Brien, E., McKinstry, A., Robinson, O., Shukla, P., Lysaght, M., Kulczewski, M., Ciznicki, M., Pia̧tek, W., Ciesielski, S., Błażewicz, M., Kurowski, K., Procyk, M., Spychala, P., Bosak, B., Piotrowski, Z. P., Wyszogrodzki, A., Raffin, E., Mazauric, C., Guibert, D., Douriez, L., Vigouroux, X., Gray, A., Messmer, P., Macfaden, A. J., and New, N.: The ESCAPE project: Energy-efficient Scalable Algorithms for Weather Prediction at Exascale, Geosci. Model Dev., 12, 4425–4441, https://doi.org/10.5194/gmd-12-4425-2019, 2019. a

Pu, B., Ginoux, P., Guo, H., Hsu, N. C., Kimball, J., Marticorena, B., Malyshev, S., Naik, V., O'Neill, N. T., Pérez García-Pando, C., Paireau, J., Prospero, J. M., Shevliakova, E., and Zhao, M.: Retrieving the global distribution of the threshold of wind erosion from satellite data and implementing it into the Geophysical Fluid Dynamics Laboratory land–atmosphere model (GFDL AM4.0/LM4.0), Atmos. Chem. Phys., 20, 55–81, https://doi.org/10.5194/acp-20-55-2020, 2020. a, b

R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, available at: https://www.R-project.org/ (last access: 14 July 2021), 2019. a

Schenk, H. J. and Jackson, R. B.: ISLSCP II Ecosystem Rooting Depths, ORNL Distributed Active Archive Center, https://doi.org/10.3334/ORNLDAAC/929, 2009. a

Schmidhuber, J.: Deep learning in neural networks: An overview, Neural Networks, 61, 85–117, https://doi.org/10.1016/j.neunet.2014.09.003, 2015. a

Shangguan, W., Dai, Y., Duan, Q., Liu, B., and Yuan, H.: A global soil data set for earth system modeling, J. Adv. Model. Earth Sy., 6, 249–263, https://doi.org/10.1002/2013MS000293, 2014. a

Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel, T., and Hassabis, D.: Mastering the game of Go with deep neural networks and tree search, Nature, 529, 484–489, https://doi.org/10.1038/nature16961, 2016. a

Sun, J., Fu, J. S., Drake, J. B., Zhu, Q., Haidar, A., Gates, M., Tomov, S., and Dongarra, J.: Computational Benefit of GPU Optimization for the Atmospheric Chemistry Modeling, J. Adv. Model. Earth Sy., 10, 1952–1969, https://doi.org/10.1029/2018MS001276, 2018. a

Yashiro, H., Terai, M., Yoshida, R., Iga, S.-I., Minami, K., and Tomita, H.: Performance Analysis and Optimization of Nonhydrostatic ICosahedral Atmospheric Model (NICAM) on the K Computer and TSUBAME2.5, in: Proceedings of the Platform for Advanced Scientific Computing Conference, PASC '16, Association for Computing Machinery, New York, NY, USA, 8–10 June 2016, Lausanne, Switzerland, https://doi.org/10.1145/2929908.2929911, 2016. a

Articles

Short summary

Soil moisture is of great importance for weather and climate. We present a machine learning model that produces accurate predictions of satellite-observed surface soil moisture, based on meteorological data from a climate model. It can be used as soil moisture parametrisation in climate models and to produce comprehensive global soil moisture datasets. Moreover, it may motivate similar applications of machine learning in climate science.