Articles | Volume 8, issue 9
https://doi.org/10.5194/gmd-8-2977-2015
https://doi.org/10.5194/gmd-8-2977-2015
Development and technical paper
 | 
30 Sep 2015
Development and technical paper |  | 30 Sep 2015

Development of efficient GPU parallelization of WRF Yonsei University planetary boundary layer scheme

M. Huang, J. Mielikainen, B. Huang, H. Chen, H.-L. A. Huang, and M. D. Goldberg

Related authors

Comparisons of IASI-A and AATSR measurements of top-of-atmosphere radiance over an extended period
Manik Bali, Jonathan P. Mittaz, Eileen Maturi, and Mitchell D. Goldberg
Atmos. Meas. Tech., 9, 3325–3336, https://doi.org/10.5194/amt-9-3325-2016,https://doi.org/10.5194/amt-9-3325-2016, 2016
Short summary
Intel Xeon Phi accelerated Weather Research and Forecasting (WRF) Goddard microphysics scheme
J. Mielikainen, B. Huang, and A. H.-L. Huang
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmdd-7-8941-2014,https://doi.org/10.5194/gmdd-7-8941-2014, 2014
Revised manuscript has not been submitted

Related subject area

Atmospheric sciences
SCIATRAN software package (V4.6): update and further development of aerosol, clouds, surface reflectance databases and models
Linlu Mei, Vladimir Rozanov, Alexei Rozanov, and John P. Burrows
Geosci. Model Dev., 16, 1511–1536, https://doi.org/10.5194/gmd-16-1511-2023,https://doi.org/10.5194/gmd-16-1511-2023, 2023
Short summary
Deep learning models for generation of precipitation maps based on numerical weather prediction
Adrian Rojas-Campos, Michael Langguth, Martin Wittenbrink, and Gordon Pipa
Geosci. Model Dev., 16, 1467–1480, https://doi.org/10.5194/gmd-16-1467-2023,https://doi.org/10.5194/gmd-16-1467-2023, 2023
Short summary
An inconsistency in aviation emissions between CMIP5 and CMIP6 and the implications for short-lived species and their radiative forcing
Robin N. Thor, Mariano Mertens, Sigrun Matthes, Mattia Righi, Johannes Hendricks, Sabine Brinkop, Phoebe Graf, Volker Grewe, Patrick Jöckel, and Steven Smith
Geosci. Model Dev., 16, 1459–1466, https://doi.org/10.5194/gmd-16-1459-2023,https://doi.org/10.5194/gmd-16-1459-2023, 2023
Short summary
On the use of Infrared Atmospheric Sounding Interferometer (IASI) spectrally resolved radiances to test the EC-Earth climate model (v3.3.3) in clear-sky conditions
Stefano Della Fera, Federico Fabiano, Piera Raspollini, Marco Ridolfi, Ugo Cortesi, Flavio Barbara, and Jost von Hardenberg
Geosci. Model Dev., 16, 1379–1394, https://doi.org/10.5194/gmd-16-1379-2023,https://doi.org/10.5194/gmd-16-1379-2023, 2023
Short summary
Incorporation of aerosol into the COSPv2 satellite lidar simulator for climate model evaluation
Marine Bonazzola, Hélène Chepfer, Po-Lun Ma, Johannes Quaas, David M. Winker, Artem Feofilov, and Nick Schutgens
Geosci. Model Dev., 16, 1359–1377, https://doi.org/10.5194/gmd-16-1359-2023,https://doi.org/10.5194/gmd-16-1359-2023, 2023
Short summary

Cited articles

Betts, A., Hong, S.-Y., and Pan, H.-L.: Comparison of NCEPNCAR reanalysis with 1987 FIFE data, Mon. Weather Rev., 124, 1480–1498, 1996.
Bright, D. R. and Mullen, S. L.: The sensitivity of the numerical simulation of the southwest monsoon boundary layer to the choice of PBL turbulence parameterization in MM5, Weather Forecast., 17, 99–114, 2002.
Continental US (CONUS): WRF V3 Parallel Benchmark Page, Single domain, medium size, 12 km CONUS, Oct. 2001, available at: http://www2.mmm.ucar.edu/wrf/WG2/benchv3/#_Toc212961288, last access: 18 June 2008.
Durran, D. R. and Klemp, J. B.: The effects of moisture on trapped mountain lee waves, J. Atmos. Sci., 39, 2490–2506, 1982.
Hong, S.-Y. and Pan, H.-L.: Nonlocal boundary layer vertical diffusion in a Medium-Range Forecast model, Mon. Weather Rev., 124, 2322–2339, 1996.
Download
Short summary
To expedite weather research and prediction, we have put tremendous effort into developing an accelerated implementation of the entire WRF model using GPU massive parallel computing architecture. This paper presents our efficient GPU-based design on WRF YSU PBL scheme. Using one NVIDIA Tesla K40 GPU, the GPU-based YSU PBL scheme achieves a speedup of 193x with respect to its runtime on 1 CPU core. We can even boost the speedup to 360x with respect to 1 CPU core as two K40 GPUs are applied.