GP-SWAT (v1.0): a two-level graph-based parallel simulation tool for the SWAT model

Zhang, Dejian; Lin, Bingqing; Wu, Jiefeng; Lin, Qiaoying

doi:https://doi.org/10.5194/gmd-14-5915-2021

Articles | Volume 14, issue 10

https://doi.org/10.5194/gmd-14-5915-2021

© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/gmd-14-5915-2021

© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 14, issue 10

Development and technical paper

|

30 Sep 2021

Development and technical paper |

| 30 Sep 2021

GP-SWAT (v1.0): a two-level graph-based parallel simulation tool for the SWAT model

Dejian Zhang, Bingqing Lin, Jiefeng Wu, and Qiaoying Lin

Download

Final revised paper (published on 30 Sep 2021)
Preprint (discussion started on 27 Jan 2021)

Interactive discussion

Status: closed

RC1: 'Comment on gmd-2020-429', Anonymous Referee #1, 17 Mar 2021

This manuscript presents a two-layer graph-based parallel simulation framework for the SWAT model, which can provide valuable reference for watershed management. I suggest the manuscript can be accepted after major revision.

1. The specific results of the parallel-computing performance should be presented in the abstract.

2. Line 95: It's better to state why this research want to propose a new parallelization scheme?

3. Section 2.1 should be integrated to the introduction part, which can help to figure out what is the missing part of existing researches.

4. Fig. 3: it is not convinect for most reader to read the codes. Some diagrams are needed to express the same meanings.

5. Fig 5: How can the actural speedup ratios larger than te theoretical ones?

6. Line 230: some details of the study areas should be added.

7. Line 235: it seems that the computation amount is not very large in the case study. I wonder how the proposed parallel computing system performs for the applications with many different amount of simulation units.

Citation: https://doi.org/10.5194/gmd-2020-429-RC1
RC2: 'Comment on gmd-2020-429', Liangjun Zhu, 13 May 2021

General comments:

This manuscript proposed a unified parallelization strategy for both watershed-level and subbasin-level parallelization and developed the GP-SWAT accordingly, which is valuable and useful for both developers and users in the scientific community. Except for the specific comments posted by Anonymous Referee #1 (https://gmd.copernicus.org/preprints/gmd-2020-429#RC1), I have some other specific comments.

Specific comments:

1. The phase “two-layer graph-based parallelization” is ambiguous and inexact. (1) Do you mean “layer” equals “level” since you use “model-level” (or "watershed-level", please unify the terms) and “subbasin-level” in the manuscript? (2) The “graph-based” is the parallelization strategy for “subbasin-level” not for “model-level”, since model runs are independent to each other and only their outputs are concerned.

2. In the introduction, the first paragraph stated that both the single model and numerous models require prohibitive execution time. However, the first three of the four methods introduced in the second paragraph are used for model-level applications. In my view, it is more clear to introduce the methods for alleviating computational burden for the model level applications and the single model, separately. And then both focus on parallel computing.

3. In the sentence around No 65, it is more precisely to cite Liu et al. (2016) and/or Zhu et al. (2019) rather than Liu et al. (2014).

4. In the sentence around No 80, “However, these methods have two major limitations: … complex computational facilities that may not be readily available…” should be reconsidered. In fact, many parallel computing models (e.g., MPI) can also be running on the personal computer and obtain a good speedup ratio.

5. This manuscript focuses on the parallelization of both model-level and subbasin-level but missed existing similar methods such as Zhu et al. (2019).

6. Overall, the introduction failed to raise the scientific issue clearly and precisely, that is, there is no unified parallelization strategy for both watershed-level and subbasin-level parallelization that do not need to reconstruct source code of hydrologic model to handle data communication among subbasins explicitly (e.g., using MPI). If this is correct, the title may also be changed accordingly.

7. What is the phrase “the current computation step” mean in the sentence around No 140? To my understanding, the proposed method is different from the spatial-temporal discretization proposed by Wang et al. (2013). For example, in Fig 4, Subbasin 1 and Subbasin 2 are executed for the entire simulation period (e.g., 5 years) first, then Subbasin 3 begins to run, and so on. This may lead to a poor load balance, and hence a low speed up ratio, especially for a single model run. Is this right?

8. In Section 4.1, all the results are compared through speed-up ratios, you should also give the actual execution times. I want to know the performance of the subbasin-level parallelization for a single model simulation compared with the original SWAT model.

9. Why use two study areas? From my perspective, the two case studies have no significant difference. We need more information about the study areas.

10. In sentences around No 235, it is weird that the numbers of HRUs per subbasin can be set the same.

Technical corrections:

1. Did you mean that the “Spark-SWAT” is the alias of the “GP-SWAT”?

2. In the code, all file paths are specific to the author’s computer. This is not suitable for code distribution. Even so, the modification of these paths should be clarified in the tutorial.

References:

Liu, J., Zhu, A.-X., Qin, C.-Z., Wu, H., and Jiang, J.: A two-level parallelization method for distributed hydrological models, Environmental Modelling & Software, 80, 175–184, https://doi.org/10.1016/j.envsoft.2016.02.032, 2016.

Wang, H., Fu, X., Wang, Y., and Wang, G.: A High-performance temporal-spatial discretization method for the parallel computing of river basins, Computers & Geosciences, 58, 62–68, https://doi.org/10.1016/j.cageo.2013.04.026, 2013.

Zhu, L.-J., Liu, J., Qin, C.-Z., and Zhu, A.-X.: A modular and parallelized watershed modeling framework, Environmental Modelling & Software, 122, 104526, https://doi.org/10.1016/j.envsoft.2019.104526, 2019.

Citation: https://doi.org/10.5194/gmd-2020-429-RC2
AC1: 'Comment on gmd-2020-429', Qiaoying Lin, 03 Jun 2021

The comment was uploaded in the form of a supplement: https://gmd.copernicus.org/preprints/gmd-2020-429/gmd-2020-429-AC1-supplement.pdf

Citation: https://doi.org/10.5194/gmd-2020-429-AC1

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Qiaoying Lin on behalf of the Authors (03 Jun 2021) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (18 Jun 2021) by Wolfgang Kurtz

RR by Anonymous Referee #1 (19 Jun 2021)

RR by Liangjun Zhu (21 Jun 2021)

Suggestions for revision or reasons for rejection

General comments:
The revised manuscript is greatly improved, especially the introduction section. Most of the comments have been well addressed. However, I still have some specific comments.
The basic idea of this manuscript is to alleviate the development burden of hydrological modelers to achieve high-performance watershed modeling without reconstruction of model code, which is novel and clearly stated. The implementation based on the SWAT model, i.e., GP-SWAT, must be helpful for the scientific community. Overall, I am glad to suggest an acceptance for publication after a minor revision.

Specific comments:
1. In Line 53-54, the author introduced three types of parallelization strategies, such as model-level, submodel-level, and spatial-decomposition. But, in my view, the author has confused the spatial-decomposition method with the submodel-level, i.e., Line 64-79 should be the spatial-decomposition method, or more precisely, the spatial(-temporal) decomposition method, and Line 80-90 should be the submodel-level method. I mean, the so-called submodel level is a special case derived from the spatial(-temporal) decomposition method. In such a case, each submodel is a full model executed on one part of the watershed (i.e., subbasin). Besides, each parallelization type should have a short and precise definition. Please consider my suggestion.
2. The title used “a…simulation framework”, but the introduction only listed some parallelization strategies (or named parallelization schemes). I would suggest introducing existing hydrological modeling frameworks based on parallel computing and raise their weakness. I think that will be the answer to the second comment of #referee 1 (Line 95: It's better to state why this research wants to propose a new parallelization scheme?). Also, in the main text, the author used “a two-level parallelization scheme”, why not “framework”, and what is the difference?
3. The authors claimed that “indeed, the actual speedup ratio that can be achieved is largely dependent on the structure of the stream network.” and “The intention of using two study areas in this study was to demonstrate how stream network complexities can affect GP-SWAT performance”. Although the revised manuscript added some more descriptions of the two study areas, I cannot find the quantitative or qualitative analysis of the different stream networks' structures and the consequent result differences. So, I may suggest only retain the Jinjiang study area. Or, if the author can give a calculation method of theoretical speedup ratio considering the structure of stream networks and the available computing resources, that will be much valuable to adopt the two distinct study areas.

Hide

ED: Reconsider after major revisions (02 Jul 2021) by Wolfgang Kurtz

AR by Qiaoying Lin on behalf of the Authors (23 Jul 2021) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (12 Aug 2021) by Wolfgang Kurtz

RR by Liangjun Zhu (13 Aug 2021)

ED: Publish as is (04 Sep 2021) by Wolfgang Kurtz

AR by Qiaoying Lin on behalf of the Authors (06 Sep 2021) Manuscript

Short summary

GP-SWAT is a two-layer model parallelization tool for a SWAT model based on the graph-parallel Pregel algorithm. It can be employed to perform both individual and iterative model parallelization, endowing it with a range of possible applications and great flexibility in maximizing performance. As a flexible and scalable tool, it can run in diverse environments, ranging from a commodity computer with a Microsoft Windows, Mac or Linux OS to a Spark cluster consisting of a large number of nodes.