Preprints
https://doi.org/10.5194/gmd-2020-158
https://doi.org/10.5194/gmd-2020-158
Submitted as: development and technical paper
 | 
29 Sep 2020
Submitted as: development and technical paper |  | 29 Sep 2020
Status: this preprint was under review for the journal GMD but the revision was not accepted.

Development and performance optimization of a parallel computing infrastructure for an unstructured-mesh modelling framework

Zhuang Liu, Yi Zhang, Xiaomeng Huang, Jian Li, Dong Wang, Mingqing Wang, and Xing Huang

Abstract. This paper describes the development and performance optimization of a parallel computing infrastructure for an unstructured-mesh global model (GRIST; Global-to-Regional Integrated forecast SysTem). The focus is on three major aspects that facilitate rapid iterative development, including parallel computing, index optimization and an efficient group I/O strategy. For parallel computing, the METIS tool is used for the partition of the global mesh, which is flexible and convenient for both the quasi-uniform and variable-resolution simulations. The scaling tests show that the partition method is efficient. To improve the cache efficiency, several mesh index reordering strategies are investigated to optimize the performance of the indirect addressing scheme used in the stencil calculations. The numerical results show that the indexing strategies are able to speed up the calculations, especially for running with a small number of processes. To overcome the bottleneck of poor I/O efficiency for the high-resolution or massively parallel simulations, a group parallel I/O method is implemented and proven to be of high efficiency in the numerical experiments. Altogether, these three aspects of the parallel computing toolkits are encapsulated in a few interfaces, which can be used for general parallel modelling on unstructured meshes.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Zhuang Liu, Yi Zhang, Xiaomeng Huang, Jian Li, Dong Wang, Mingqing Wang, and Xing Huang
 
Status: closed
Status: closed
AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment
Printer-friendly Version - Printer-friendly version Supplement - Supplement
 
Status: closed
Status: closed
AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment
Printer-friendly Version - Printer-friendly version Supplement - Supplement
Zhuang Liu, Yi Zhang, Xiaomeng Huang, Jian Li, Dong Wang, Mingqing Wang, and Xing Huang
Zhuang Liu, Yi Zhang, Xiaomeng Huang, Jian Li, Dong Wang, Mingqing Wang, and Xing Huang

Viewed

Total article views: 1,413 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
1,070 288 55 1,413 52 59
  • HTML: 1,070
  • PDF: 288
  • XML: 55
  • Total: 1,413
  • BibTeX: 52
  • EndNote: 59
Views and downloads (calculated since 29 Sep 2020)
Cumulative views and downloads (calculated since 29 Sep 2020)

Viewed (geographical distribution)

Total article views: 1,274 (including HTML, PDF, and XML) Thereof 1,272 with geography defined and 2 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 

Cited

Latest update: 13 Dec 2024
Download
Short summary
This paper describes several techniques for the parallelization and performance optimization of an unstructured-mesh global atmospheric model. The purpose of this research is to facilitate the rapid iterative model development. These techniques are general and can be used for other parallel modeling on unstructured meshes.