Articles | Volume 8, issue 9
https://doi.org/10.5194/gmd-8-2977-2015
https://doi.org/10.5194/gmd-8-2977-2015
Development and technical paper
 | 
30 Sep 2015
Development and technical paper |  | 30 Sep 2015

Development of efficient GPU parallelization of WRF Yonsei University planetary boundary layer scheme

M. Huang, J. Mielikainen, B. Huang, H. Chen, H.-L. A. Huang, and M. D. Goldberg

Download

Interactive discussion

Status: closed
Status: closed
AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment
Printer-friendly Version - Printer-friendly version Supplement - Supplement

Peer-review completion

AR: Author's response | RR: Referee report | ED: Editor decision
AR by Melin Huang on behalf of the Authors (11 Jun 2015)  Author's response   Manuscript 
ED: Referee Nomination & Report Request started (29 Jun 2015) by Adrian Sandu
RR by Anonymous Referee #1 (15 Jul 2015)
RR by Anonymous Referee #3 (21 Aug 2015)
ED: Publish subject to technical corrections (03 Sep 2015) by Adrian Sandu
AR by Melin Huang on behalf of the Authors (11 Sep 2015)  Author's response   Manuscript 
Download
Short summary
To expedite weather research and prediction, we have put tremendous effort into developing an accelerated implementation of the entire WRF model using GPU massive parallel computing architecture. This paper presents our efficient GPU-based design on WRF YSU PBL scheme. Using one NVIDIA Tesla K40 GPU, the GPU-based YSU PBL scheme achieves a speedup of 193x with respect to its runtime on 1 CPU core. We can even boost the speedup to 360x with respect to 1 CPU core as two K40 GPUs are applied.