the Creative Commons Attribution 4.0 License.
Deep Dive into Global Hydrologic Simulations: Harnessing the Power of Deep Learning and Physics-informed Differentiable Models (δHBV-globe1.0-hydroDL)
Dapeng Feng
Hylke Beck
Jens de Bruijn
Reetik Kumar Sahu
Yusuke Satoh
Yoshihide Wada
Jiangtao Liu
Ming Pan
Kathryn Lawson
Abstract. Accurate hydrological modeling is vital to characterizing how the terrestrial water cycle responds to climate change. Pure deep learning (DL) models have been shown to outperform process-based ones while remaining difficult to interpret. More recently, differentiable, physics-informed machine learning models with a physical backbone have systematically integrated physical equations and DL, predicting untrained variables and processes with high performance. However, it was unclear whether such models are competitive for global-scale applications with a simple backbone. Therefore, we use, for the first time at this scale, differentiable hydrologic models (full name δHBV-globe1.0-hydroDL, shortened to δHBV) to simulate the rainfall-runoff processes for 3753 basins around the world. Moreover, we compare the δHBV models to a purely data-driven long short-term memory (LSTM) model to examine their strengths and limitations. Both LSTM and the δHBV models provide competent daily hydrologic simulation capabilities in global basins, with median Kling-Gupta efficiency (KGE) values close to or higher than 0.7 (and 0.78 with LSTM for a subset of 1675 basins with long-term records), significantly outperforming traditional models. Moreover, regionalized differentiable models demonstrated stronger spatial generalization ability (median KGE 0.64) than a traditional parameter regionalization approach (median KGE 0.46) and even LSTM for ungauged region tests in Europe and South America. Nevertheless, relative to LSTM, the differentiable model was hampered by structural deficiencies in cold or polar regions, highly arid regions, and basins with significant human impacts. This study also sets the benchmark for hydrologic estimates around the world and builds foundations for improving global hydrologic simulations.
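For readers unfamiliar with the Kling-Gupta efficiency quoted in the abstract, the following is a minimal sketch of how the metric is commonly computed. It assumes the original 2009 formulation (correlation, variability ratio, and bias ratio combined as a Euclidean distance from the ideal point); the abstract does not state which KGE variant the authors used, and the function name `kge` is illustrative rather than taken from the hydroDL code.

```python
import math
from statistics import fmean, pstdev


def kge(sim, obs):
    """Kling-Gupta efficiency, assuming the 2009 formulation.

    sim, obs: equal-length sequences of simulated and observed streamflow.
    Returns 1 for a perfect match; lower values indicate a worse fit.
    """
    if len(sim) != len(obs) or len(sim) < 2:
        raise ValueError("sim and obs must have the same length (at least 2)")

    mu_s, mu_o = fmean(sim), fmean(obs)
    sd_s, sd_o = pstdev(sim), pstdev(obs)
    if sd_s == 0 or sd_o == 0 or mu_o == 0:
        raise ValueError("KGE is undefined for constant series or zero observed mean")

    # Pearson correlation between simulation and observation
    cov = fmean([(s - mu_s) * (o - mu_o) for s, o in zip(sim, obs)])
    r = cov / (sd_s * sd_o)

    alpha = sd_s / sd_o  # variability ratio
    beta = mu_s / mu_o   # bias ratio

    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)
```

Under this definition, a median KGE of 0.7 across basins means that half of the basins score at or above 0.7 on this combined measure of timing, variability, and bias.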
Dapeng Feng et al.
Status: open (until 01 Jan 2024)
RC1: 'Comment on gmd-2023-190', Anonymous Referee #1, 06 Nov 2023
This study presents simulations using a differentiable model at 3753 basins globally. It represents an advance in the field and is thus worthy of being published somewhere. However, the model used and the design of the experiments are almost the same as those published in the authors' previous papers. Readers of GMD would expect more progress in those aspects. Thus, I suggest a major revision.
Major comments
- The authors should introduce more details of the experimental design, such as the metrics used in the training, how many experiments (temporal generalization, PUB, and PUR; correct me if I am wrong), and the purpose of the experiments. Some details may have been presented somewhere else. I find this manuscript difficult to follow without reading the authors' previous publications.
- L124: what are the criteria for the selection? Can you describe the erroneous cases? Do the erroneous cases include the data processing error described in L350?
- L306-L317: Can you discuss more about comparing the traditional regionalization method and PUR? The PUR regionalization method can utilize a large number of observations to calibrate/train the differentiable model, whereas the traditional method can only use very limited samples. In other words, PUR may have a much higher chance of finding the optimal parameters than the traditional method.
Minor comments
- Title: the phrase, global hydrologic simulations, is misleading. The simulations are conducted at 3753 basins across the globe. It represents a concept different from the "global hydrologic simulations."
- L127: is the classification from Beck et al., 2020b?
- L26 & L212: How is the subset of 1675 basins selected? What is the objective of the selection?
- L223: can you describe more about the structural issues? Why does the explicit solution scheme introduce numerical errors here?
- L291-L293: please rewrite the sentence. It is difficult to read.
- L398, "the underrepresentation of the processes...": this conclusion is too general. The difficulty of representing arid/polar/anthropogenic processes is known before reading this paper. The conclusion should be specific.
Citation: https://doi.org/10.5194/gmd-2023-190-RC1
CEC1: 'Comment on gmd-2023-190', Juan Antonio Añel, 19 Nov 2023
Dear authors,
I have checked the "Code and Data Availability" section in your manuscript. I would like to point out that the sentence saying that an updated version of the code and data will be available upon acceptance is misleading. Obviously, if your manuscript is accepted for publication, you have to publish it with the most up-to-date code and data, and we do not accept "upon acceptance" statements. Right now, your statement seems to imply that you have not published your code and data in advance; however, they are included in the Zenodo repository. I would like to ask you to modify the sentence to avoid confusion.
Also, please clarify which PyTorch version you used for your work.
Regards,
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/gmd-2023-190-CEC1