<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article"><?xmltex \bartext{Model description paper}?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-16-199-2023</article-id><title-group><article-title>WRF–ML v1.0: a bridge between WRF v4.3 and machine learning parameterizations and its application to atmospheric<?xmltex \hack{\break}?> radiative transfer</article-title><alt-title>WRF–ML 1.0: a bridge between WRF and machine learning parameterization</alt-title>
      </title-group><?xmltex \runningtitle{WRF--ML 1.0: a bridge between WRF and machine learning parameterization}?><?xmltex \runningauthor{X. Zhong et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Zhong</surname><given-names>Xiaohui</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-6706-2335</ext-link></contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Ma</surname><given-names>Zhijian</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Yao</surname><given-names>Yichen</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Xu</surname><given-names>Lifei</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Wu</surname><given-names>Yuan</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes">
          <name><surname>Wang</surname><given-names>Zhibin</given-names></name>
          <email>zhibin.waz@alibaba-inc.com</email>
        </contrib>
        <aff id="aff1"><institution>Damo Academy, Alibaba Group, Hangzhou 311121, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Zhibin Wang (zhibin.waz@alibaba-inc.com)</corresp></author-notes><pub-date><day>6</day><month>January</month><year>2023</year></pub-date>
      
      <volume>16</volume>
      <issue>1</issue>
      <fpage>199</fpage><lpage>209</lpage>
      <history>
        <date date-type="received"><day>7</day><month>September</month><year>2022</year></date>
           <date date-type="rev-request"><day>23</day><month>September</month><year>2022</year></date>
           <date date-type="rev-recd"><day>7</day><month>December</month><year>2022</year></date>
           <date date-type="accepted"><day>7</day><month>December</month><year>2022</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 </copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/.html">This article is available from https://gmd.copernicus.org/articles/.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e127">In numerical weather prediction (NWP) models, physical parameterization schemes are the most computationally expensive components, despite being greatly simplified. In the past few years, an increasing number of studies have demonstrated that machine learning (ML) parameterizations of subgrid physics have the potential to accelerate and even outperform conventional physics-based schemes. However, as the ML models are commonly implemented using the ML libraries written in Python, very few ML-based parameterizations have been successfully integrated with NWP models due to the difficulty of embedding Python functions into Fortran-based NWP models. To address this issue, we developed a coupler to allow the ML-based parameterizations to be coupled with a widely used NWP model, i.e., the Weather Research and Forecasting (WRF) model. Similar to the WRF I/O methodologies, the coupler provides the options to run the ML model inference with exclusive processors or the same processors for WRF calculations. In addition, to demonstrate the effectiveness of the coupler, the ML-based radiation emulators are trained and coupled with the WRF model successfully.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e139">Numerical weather prediction (NWP) models have become the most important tools for operational weather forecasting and have many applications in different domains, including energy, traffic, logistics, and planning <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx22" id="paren.1"/>. The physics-based parameterizations are essential in the NWP models as they approximate some processes that are either too small-scale to be explicitly resolved at the model grid resolution or too complex and not fully understood. Although many simplifications are made to parameterizations to reduce the computational cost, the calculations of physical parameterizations still account for a significant portion of the total computational time of NWP models <xref ref-type="bibr" rid="bib1.bibx31" id="paren.2"/>. Also, the parameterization schemes often contain uncertain parameters estimated from more faithful high-resolution simulations with statistical models <xref ref-type="bibr" rid="bib1.bibx28" id="paren.3"/>. For example, the parameters in radiative transfer parameterizations can be fitted  to the output of the most accurate line-by-line model <xref ref-type="bibr" rid="bib1.bibx6" id="paren.4"/>, and parameters in cloud turbulence parameterizations can be inferred from large-eddy simulations (LESs) <xref ref-type="bibr" rid="bib1.bibx17" id="paren.5"/>. However, other parameters can only be learned from observations as the related governing equations are unknown <xref ref-type="bibr" rid="bib1.bibx24" id="paren.6"/>.</p>
      <p id="d1e161">One alternative is to train machine learning (ML) models to replace the traditional physics-based parameterization schemes. The ML-based parameterizations have the potential to outperform traditional parameterizations with higher computational efficiency. For example, the radiative transfer parameterization scheme is one of the most computationally expensive components in NWP models, and it has the longest history of developing ML-based radiation emulators. <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx5" id="text.7"/> developed a neural network (NN) based longwave radiation parameterization (NeuroFlux) and has been used operationally in the European Centre for Medium-Range Weather Forecasts (ECMWF) four-dimensional variational data assimilation system. The NeuroFlux is 7 times faster than the previous scheme with comparable accuracy <xref ref-type="bibr" rid="bib1.bibx14" id="paren.8"/>. Recently, <xref ref-type="bibr" rid="bib1.bibx27" id="text.9"/> developed and used the neural network based radiation emulators in the operational weather forecasting model in the Korea Meteorological Administration. They demonstrated that using NN-based emulators frequently can improve real-time weather forecasting in terms of accuracy and speed compared to the original method, which infrequently uses the original radiation parameterization.</p>
      <p id="d1e173">Similarly, ML-based emulators have been developed for other parameterization schemes in NWP models. <xref ref-type="bibr" rid="bib1.bibx23" id="text.10"/> successfully developed and coupled an NN-based convection parameterization into an aquaplanet general circulation model (GCM). They showed that the NN-based parameterization was able to perform multi-year simulations, of which results were close to that of the original simulations. For planetary boundary layer (PBL) parameterization, <xref ref-type="bibr" rid="bib1.bibx31" id="text.11"/> used the inputs and outputs from Yonsei University (YSU) PBL scheme of the Weather Research and Forecasting (WRF) model to develop the NN-based parameterizations. They showed that the NN-based parameterization successfully simulated the vertical profiles of velocities, temperature, and water vapor within the PBL over the entire diurnal cycle. Urban land surface models (ULSMs) are reasonably fast, but none of them is best at predicting all the main surface fluxes <xref ref-type="bibr" rid="bib1.bibx8 bib1.bibx9" id="paren.12"/>. One solution is to run an ensemble of ULSMs coupled to the NWP models to improve predictions, which is technically difficult to implement and would require more computational time due to running multiple ULSMs simultaneously. <xref ref-type="bibr" rid="bib1.bibx18" id="text.13"/> developed an urban NN (UNN) to emulate the ensemble mean of 22 ULSMs. They demonstrate that the WRF model coupled with the UNN is more accurate than the WRF with a single well-known USLM. <xref ref-type="bibr" rid="bib1.bibx10" id="text.14"/> demonstrated the potential of an NN-based cloud cover parameterization to accurately reproduce the subgrid scale cloud fraction to improve the low-resolution climate model predictions.</p>
      <p id="d1e191">In many cases, ML-based parameterization schemes are only evaluated in offline settings due to the practical challenge of coupling ML-based emulators with NWP models. However, the offline performance of ML-based emulators does not necessarily reflect their online performance when coupled with other components of NWP models. ML-based parameterizations are often trained using high-level programming languages like Python and easy-to-use ML libraries such as PyTorch <xref ref-type="bibr" rid="bib1.bibx21" id="paren.15"/> and Keras <xref ref-type="bibr" rid="bib1.bibx11" id="paren.16"/>. However, the NWP codes are mainly written in Fortran, making it difficult to integrate directly with ML-based emulators. Researchers used multiple approaches to circumvent these issues. <xref ref-type="bibr" rid="bib1.bibx16" id="text.17"/> developed a single-layer NN software for developing NN-based radiation emulators in NWP models <xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx15 bib1.bibx27" id="paren.18"/>. Some researchers wrote a Fortran module that reproduces NN architectures using matrix–matrix multiplication with the weights saved from offline training <xref ref-type="bibr" rid="bib1.bibx3 bib1.bibx12 bib1.bibx23" id="paren.19"/>, which only works well for simple NN architectures such as the fully connected (FC) NNs and becomes increasingly difficult to implement for more complex NN structures. <xref ref-type="bibr" rid="bib1.bibx20" id="text.20"/> produced an open source software library, the Fortran–Keras Bridge (FKB), which enables users to access many features of the Keras application programming interface (API) in Fortran and implement NNs in Fortran code. However, the FKB can only support FC or dense layer NNs. Therefore, researchers cannot fully use the most advanced NN structures, such as convolution, self-attention, and variational autoencoder structures, to build powerful ML-based emulators for online applications. <xref ref-type="bibr" rid="bib1.bibx32" id="text.21"/> proposed an NN–GCM coupler to allow Python-based NNs to communicate with the Fortran-based GCM. Although using the coupler can make the ML-based emulators be deployed efficiently, its requirement for data transfer on a hard disk makes it hard to achieve speed-up.</p>
      <p id="d1e217">To address the abovementioned problems, we developed a WRF–ML coupler that allows any ML-based parameterization schemes to be coupled with the WRF model. The WRF model is selected as it is a popular open source NWP model used by many researchers and operational organizations. Also, to demonstrate the applicability of the coupler, we trained and coupled the ML-based radiation emulators with the WRF model. The ML-based radiation emulators were studied most among all the physical parameterization schemes and coupled with the ECMWF Integrated Forecasting System (IFS), the National Centers for Environmental Prediction (NCEP) Climate Forecast System (CFS), and the WRF model in the previous studies <xref ref-type="bibr" rid="bib1.bibx27 bib1.bibx19" id="paren.22"/>. Also, the rapid radiative transfer model for general circulation models (RRTMG; <xref ref-type="bibr" rid="bib1.bibx13" id="altparen.23"/>) is selected for radiation as it is widely used in weather and climate models, such as the NCEP Global Forecast System (GFS) and the China Meteorological Administration (CMA) Global/Regional Assimilation and Prediction System (GRAPES). To train the ML-based radiation emulators, the WRF model is run to generate a dataset for training and evaluation. We also illustrate the performance of the ML-based radiation emulators in the online setting. Lastly, we compare the accuracy and computational costs of the WRF simulations coupled with ML-based radiation emulators with the WRF simulations using original RRTMG schemes.</p>
      <p id="d1e226">The remainder of the paper is organized as follows. Section 2 describes the original and ML-based RRTMG module. Section 3 introduces the WRF I/O quilt processors and the WRF–ML coupler. Section 4 briefly presents the WRF model setup and ML-based radiation emulators. Section 5 presents the analysis and results in online settings. Conclusions and discussions are in Sect. 6.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Description of original and ML-based RRTMG model</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Description of original RRTMG module</title>
      <p id="d1e244">The RRTMG is developed by the Atmospheric and Environmental Research (AER) and comprises shortwave (SW) and longwave (LW) schemes that calculate SW and LW radiation fluxes and heating rates, respectively. Figure <xref ref-type="fig" rid="Ch1.F1"/> presents the pseudo-code for the original radiation module in the WRF model. At the beginning of the radiation driver module, the input and output variables are initialized. Then, the corresponding radiation calculations are started according to the parameters (i.e., ra_lw_physics and ra_sw_physics) specified in the configuration file (i.e., namelist.input). When the RRTMG schemes are selected, the subroutine rrtmg_lwinit is called to perform calculations needed to initialize the longwave calculations. Next, the subroutines rrtmg_lwrad and rrtmg_swrad are called to perform the RRTMG SW and LW radiation calculations.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e251">Pseudo-code for the original RRTMG module.</p></caption>
          <?xmltex \igopts{width=199.169291pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f01.png"/>

        </fig>

<sec id="Ch1.S2.SS1.SSS1">
  <label>2.1.1</label><title>Description of ML-based RRTMG module</title>
      <p id="d1e267">The original radiation computations process one atmospheric column at a time and loop over all the horizontal grid points. The three-dimensional variables associated with the grid cell (indexed as <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in the WRF model output files) are indexed as <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in memory by the WRF to make it more cache friendly and increase the cache hit rate. The index <inline-formula><mml:math id="M3" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M4" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M5" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> represent the west–east, south–north, and vertical directions, respectively. With the ML-based RRTMG model, inference can be implemented on a batch of data, which requires the input data to be packed into batches. The input features of ML-based radiation emulators have dimensions of <inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:mi>W</mml:mi><mml:mo>×</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M7" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M8" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula> represent the number of features and vertical levels, respectively. Therefore, the original three-dimensional variables are reshaped to be indexed as <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to match the dimensions requirement of ML-based input features (done by preprocess_layout in Fig. <xref ref-type="fig" rid="Ch1.F2"/>). The ML model inference is completed within the function infer_run. Unlike physics-based radiation schemes, ML-based schemes do not need to calculate intermediate variables such as optical depth. Instead, the ML-based model accomplishes the mapping from the input features to outputs. After inference is finished, ML model outputs will be post-processed to match the dimensions of the original WRF data array to continue model simulations.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e382">Pseudo-code for the ML-based RRTMG module.</p></caption>
            <?xmltex \igopts{width=113.811024pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f02.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>WRF–ML coupler</title>
      <p id="d1e401">This subsection briefly describes the WRF I/O quilt server techniques as a similar method is adopted in the WRF–ML coupler to have exclusive servers for ML model inference. Then the methodologies of the WRF–ML coupler are described in detail.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Description of WRF I/O quilt servers</title>
      <p id="d1e411">The WRF model is designed to run efficiently on parallel distributed-memory computing systems. To do this, the WRF applies domain decomposition to split the entire domain into nprocs (equal to <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>×</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) number of subdomains so that the total amount of work is divided into nprocs compute tasks, where <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the number of processors along the west–east and south–north directions. Furthermore, the WRF model contains an I/O layer to gather all the distributed data arrays onto the master message passing interface (MPI) rank 0 while other MPI ranks block until the master completes the write. However, as the domain size increases or resolution increases, leading to an increasing amount of information to write, the increasing time cost of writing output files becomes a bottleneck for the overall performance. Therefore, the WRF model also provides an option to use asynchronous I/O (quilt) servers that deal exclusively with I/O so that the compute processors can continue without waiting for writing output files. More specifically, the WRF allows users to specify the number of groups of I/O servers to allocate (nio_groups) and the number of I/O ranks per group (nio_tasks_per_group). Similar to the I/O quilt servers, the WRF–ML coupler also provides the option to use several processors for ML model inference exclusively, of which more details are described in the following subsection. While GPUs are typically more powerful than CPUs, both CPUs and GPUs are supported for inference. CPUs are cheaper than GPUs and more widely available, and using CPUs for inference can also avoid extra costs due to GPU–CPU data transfer.</p>
<sec id="Ch1.S3.SS1.SSS1">
  <label>3.1.1</label><title>Description of the WRF–ML coupler</title>
      <p id="d1e461">Figure <xref ref-type="fig" rid="Ch1.F3"/> shows that when the WRF model launches the initialization process, the subroutine infer_init is called to initialize the ML model inference services. Similar to the asynchronous I/O servers, several processes are assigned to exclusively work on ML model inference using either GPUs or CPUs. The number of inference processors and the number of inference ranks per group are specified in the configuration file by setting the newly added variables ninfer_tasks_per_group and ninfer_groups in the namelist.input file. The difference between the inference processors and the asynchronous I/O servers is that the inference processors apply the synchronous method, as WRF calculations run sequentially, and the subsequent calculations of the WRF model rely on the outputs from ML-based parameterizations. Usually, the number of inference processors is much smaller than that of the WRF compute processors, so a single inference processor must process data from multiple processors. The inference processor listens and receives data from the corresponding WRF compute processors through the MPI transmission while blocking the compute processes (see the right part in Fig. <xref ref-type="fig" rid="Ch1.F4"/>). After inference is finished, the inference process sends outputs of ML-based schemes back to the compute processors to continue the standard WRF calculations. In addition, the ninfer_tasks_per_group and ninfer_groups can be set to <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> to execute the ML model inference using the same processors as for the WRF compute processes. Then, memory copy is not needed between the WRF compute processors and inference processors.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e480">Framework of the WRF–ML coupler.</p></caption>
            <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f03.png"/>

          </fig>

      <p id="d1e489">The functions for implementing the ML-based model inference are all written in Python, commonly used by the machine learning community. To allow the Fortran-based WRF code to call those Python functions, it is necessary first to link the Fortran executables to the system Python library. iso_c_binding is an intrinsic module defined in Fortran 2003 for interoperability with C. As shown in the left part of Fig. <xref ref-type="fig" rid="Ch1.F3"/>, the iso_c_binding is imported within the Fortran code. Although no C code is needed, the C foreign function interface (CFFI) for Python is used to generate C-style binding interfaces within the Python script (see the middle part of Fig. <xref ref-type="fig" rid="Ch1.F3"/>). The interfaces for C-function are also defined in the Python scripts, which are used to generate a dynamic library that Fortran modules can link.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e499">Synchronous sequence implementation between WRF and ML-based parameterization.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f04.png"/>

          </fig>

      <p id="d1e508">This WRF–ML coupler allows the ML models to be easily coupled with the WRF model with minimal effort. Also, researchers are able to make full use of the state-of-the-art ML model structures. This methodology is made open source in the hope that more and more machine learning researchers will participate in improving the traditional NWP models.</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Experiment setup</title>
      <p id="d1e521">The ML-based radiation emulator is coupled with the WRF model to demonstrate the coupler's practicality. This section firstly describes the WRF model setup, ML-based radiation emulators' network structures, and offline evaluation metrics.</p>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>WRF model setup</title>
      <p id="d1e531">In this work, the WRF model version 4.3 is compiled using the GNU Fortran (gfortran version 6.5.1) compiler with the “dmpar” option. To generate a dataset for model training, the WRF model is run using the domain configuration shown in Fig. <xref ref-type="fig" rid="Ch1.F5"/>. The WRF model is set up within the GFS model grid with a single domain at a horizontal grid spacing of 5 km and has 190 <inline-formula><mml:math id="M14" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 170 grid points in the west–east and north–south directions (i.e., 32 300 grid cells). The number of vertical levels is 57, and the model top is 10 hPa. In addition, the WRF model is configured using physics schemes, including the New Thompson et al. scheme <xref ref-type="bibr" rid="bib1.bibx29" id="paren.24"/>, Bougeault–Lacarrère (BouLac) planetary boundary layer (PBL) scheme <xref ref-type="bibr" rid="bib1.bibx2" id="paren.25"/>, RUC land surface model <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx26" id="paren.26"/>, and RRTMG for both SW and LW radiation <xref ref-type="bibr" rid="bib1.bibx13" id="paren.27"/>. The WRF simulations were run for 3 d per month, and the initialization days were randomly selected. The first 2 d of every 3 d simulations are used for training, and the last day is used for testing. The model generates radiation inputs and outputs every 30 model minutes.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e558">Digital evaluation data of the single WRF domain with horizontal resolution at 5 km.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>ML-based radiation emulators and offline evaluations</title>
      <p id="d1e575">Many researchers <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx15 bib1.bibx27" id="paren.28"/> have previously used the FC networks to replace the radiation schemes within the operational NWP models. Additionally, <xref ref-type="bibr" rid="bib1.bibx30" id="text.29"/> showed that bidirectional recurrent NNs (RNNs) are more accurate than the feed-forward NNs (FNNs) for emulating atmospheric radiative transfer. <xref ref-type="bibr" rid="bib1.bibx33" id="text.30"/> also demonstrated that the bidirectional long short-term memory (Bi-LSTM) achieves the best accuracy in radiative transfer parameterization. Therefore, this study has trained and tested both FC networks and Bi-LSTM models. As an increasing number of trainable parameters of a ML model increases both accuracy and computational cost, finding a balance is crucial for applying ML models in operational NWP models. Table <xref ref-type="table" rid="Ch1.T1"/> illustrates the structures of five ML models, including two FC networks (models A and B) and three Bi-LSTM models (models C to E). Models A and D, as well as models B and E, have approximately the same number of parameters, respectively. More details about the model training and comparisons can be referenced in <xref ref-type="bibr" rid="bib1.bibx33" id="text.31"/>. All the ML-based emulators have been converted to ONNX and run using the ONNX Runtime library version 1.7.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e595">Description of different ML models used for emulating radiative transfer calculations, including description of model structures and total number of parameters.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Model name</oasis:entry>
         <oasis:entry colname="col2">Network structure</oasis:entry>
         <oasis:entry colname="col3">Number of layers</oasis:entry>
         <oasis:entry colname="col4">Number of nodes</oasis:entry>
         <oasis:entry colname="col5">Number of parameters</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(bidirectional layers</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">for Bi-LSTM models)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Model A</oasis:entry>
         <oasis:entry colname="col2">FC</oasis:entry>
         <oasis:entry colname="col3">5</oasis:entry>
         <oasis:entry colname="col4">30</oasis:entry>
         <oasis:entry colname="col5">70 248</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model B</oasis:entry>
         <oasis:entry colname="col2">FC</oasis:entry>
         <oasis:entry colname="col3">10</oasis:entry>
         <oasis:entry colname="col4">200</oasis:entry>
         <oasis:entry colname="col5">840 028</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model C</oasis:entry>
         <oasis:entry colname="col2">Bi-LSTM</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
         <oasis:entry colname="col4">16</oasis:entry>
         <oasis:entry colname="col5">6788</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model D</oasis:entry>
         <oasis:entry colname="col2">Bi-LSTM</oasis:entry>
         <oasis:entry colname="col3">3</oasis:entry>
         <oasis:entry colname="col4">32</oasis:entry>
         <oasis:entry colname="col5">67 844</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model E</oasis:entry>
         <oasis:entry colname="col2">Bi-LSTM</oasis:entry>
         <oasis:entry colname="col3">5</oasis:entry>
         <oasis:entry colname="col4">96</oasis:entry>
         <oasis:entry colname="col5">993 028</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e756">Figure <xref ref-type="fig" rid="Ch1.F6"/> shows the inference time of different ML-based radiation emulators on one Intel Xeon Platinum 8269CY CPU and one NVIDIA Tesla T4 GPU, respectively. Model E has the longest inference time and is much longer than model B, even though they have a similar number of parameters. This is because the FC networks have a much lower time complexity than LSTM models. Similarly, although the number of parameters of model A is about 10.3 and 1.03 times that of models C and D (see Table <xref ref-type="table" rid="Ch1.T1"/>), model A is still faster than models C and D.</p>
      <p id="d1e764">The performance of different ML-based radiation emulators is evaluated on the testing data in the offline setting. Table <xref ref-type="table" rid="Ch1.T2"/> summarizes the root mean square error (RMSE) of fluxes and heating rates which are the final outputs of the radiation schemes. Since SW fluxes at the surface and top-of-atmosphere (TOA) radiation are particularly important to climate and weather prediction, their RMSE is also shown in Table <xref ref-type="table" rid="Ch1.T2"/>. It is shown that model E is the most accurate, with RMSE of SW and LW fluxes of about 6.157 and 1.274 W m<inline-formula><mml:math id="M15" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, RMSE of SW and LW heating rates of about 0.051 and 0.095 K d<inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and SW fluxes at the surface and TOA of about 6.871 and 5.576 W m<inline-formula><mml:math id="M17" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. Model D has comparable accuracy to the best-performing model E while having only <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">14</mml:mn></mml:mrow></mml:math></inline-formula> of the parameters and a much shorter inference time. Furthermore, considering that model C only has the smallest number of parameters, model C has relatively good accuracy, while the RMSE of SW heating rate is 5 times higher than model E. On the other hand, models A and B have at least 4 times higher RMSE than model E in terms of heating rates, with RMSE of SW fluxes at the surface and top of the atmosphere at least 20 times higher than model E. In summary, it is demonstrated that the Bi-LSTM model is more accurate than FC networks for radiation emulation, and model D achieves the best balance between accuracy and computing efficiency.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e822">Inference time of different ML-based emulators run on one CPU core <bold>(a)</bold> and GPU <bold>(b)</bold> with different batch sizes. The number of vertical levels is 57.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f06.png"/>

        </fig>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e840">RMSE of different ML-based radiation emulators for test data in offline evaluation.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">SW flux</oasis:entry>
         <oasis:entry colname="col3">LW flux</oasis:entry>
         <oasis:entry colname="col4">SW heating rate</oasis:entry>
         <oasis:entry colname="col5">LW heating rate</oasis:entry>
         <oasis:entry colname="col6">Surface SW flux</oasis:entry>
         <oasis:entry colname="col7">TOA SW flux</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">W m<inline-formula><mml:math id="M19" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">W m<inline-formula><mml:math id="M20" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">K d<inline-formula><mml:math id="M21" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">K d<inline-formula><mml:math id="M22" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">W m<inline-formula><mml:math id="M23" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">W m<inline-formula><mml:math id="M24" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Model A</oasis:entry>
         <oasis:entry colname="col2">247.0</oasis:entry>
         <oasis:entry colname="col3">41.46</oasis:entry>
         <oasis:entry colname="col4">1.172</oasis:entry>
         <oasis:entry colname="col5">2.035</oasis:entry>
         <oasis:entry colname="col6">209.2</oasis:entry>
         <oasis:entry colname="col7">273.5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model B</oasis:entry>
         <oasis:entry colname="col2">195.2</oasis:entry>
         <oasis:entry colname="col3">29.64</oasis:entry>
         <oasis:entry colname="col4">0.946</oasis:entry>
         <oasis:entry colname="col5">2.172</oasis:entry>
         <oasis:entry colname="col6">173.3</oasis:entry>
         <oasis:entry colname="col7">216.3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model C</oasis:entry>
         <oasis:entry colname="col2">7.658</oasis:entry>
         <oasis:entry colname="col3">2.032</oasis:entry>
         <oasis:entry colname="col4">0.277</oasis:entry>
         <oasis:entry colname="col5">0.337</oasis:entry>
         <oasis:entry colname="col6">8.534</oasis:entry>
         <oasis:entry colname="col7">6.628</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model D</oasis:entry>
         <oasis:entry colname="col2">5.355</oasis:entry>
         <oasis:entry colname="col3">1.305</oasis:entry>
         <oasis:entry colname="col4">0.099</oasis:entry>
         <oasis:entry colname="col5">0.161</oasis:entry>
         <oasis:entry colname="col6">5.854</oasis:entry>
         <oasis:entry colname="col7">4.804</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model E</oasis:entry>
         <oasis:entry colname="col2">6.157</oasis:entry>
         <oasis:entry colname="col3">1.274</oasis:entry>
         <oasis:entry colname="col4">0.051</oasis:entry>
         <oasis:entry colname="col5">0.095</oasis:entry>
         <oasis:entry colname="col6">6.871</oasis:entry>
         <oasis:entry colname="col7">5.576</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e1108">Since it is difficult to know the performance of the ML-based emulators in the online settings based on offline evaluation, the following Section presents the online evaluations of different ML-based emulators coupled with the WRF model. In addition, the WRF model is also run with the original RRTMG schemes as a reference to evaluate the performance of the ML-based emulators.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Online evaluation results</title>
      <p id="d1e1121">The ML-based radiation emulators are coupled with the WRF model using the WRF–ML coupler. The WRF simulations coupled with ML-based emulators were run using three configurations: zero exclusive processes for inference, four exclusive inference processes on CPUs, and four exclusive inference processes on GPUs. Also, the total number of WRF processes is kept to 24 to ensure fair comparisons. The calculation time of radiation driver using different ML-based radiation emulators is compared. The WRF model is initialized at 12:00 UTC on a randomly selected day (i.e., 9 November) in the year 2021 and runs for three days, which is not part of the previously used dataset to ensure the ML-based emulators are evaluated on unseen data. Finally, the temperature at 2 m (<inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) and wind speed at 10 m (<inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">WS</mml:mi><mml:mrow><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) output from the WRF model are compared.</p>
<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>Computational efficiency evaluation of radiation driver</title>
      <p id="d1e1161">The calculation time of the WRF radiation driver using original RRTMG schemes and ML-based radiation emulators are profiled using the timing function rsl_internal_microclock provided by the WRF model. Table <xref ref-type="table" rid="Ch1.T3"/> shows that it takes about 1441 ms (ms: milliseconds) to run a radiation driver using the original RRTMG schemes. In terms of ML-based radiation emulators, using GPUs for inference is significantly faster than using CPUs for inference, especially for the Bi-LSTM model, as it is less computationally efficient than the FC networks given a similar number of parameters. When GPUs are used for inference, all ML-based emulators except model E are at least 3 times faster than the original RRTMG schemes. However, whether this speed-up is due to applying ML-based emulators or the upgrade in hardware (using GPUs instead of CPUs) is unclear and is beyond the scope of this paper. On the other hand, when CPUs are used for inference, having four processes exclusively for inference is slower than having inference executed on the WRF compute processors, which is probably due to the more significant increase in time cost due to data copy between CPUs than the decrease in time cost from having CPUs exclusively for inference.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e1169">Calculation time (in units of milliseconds) of radiation driver when using the original RRTMG schemes and the ML-based radiation emulators. The batch size is 32 300, as the domain has 190 <inline-formula><mml:math id="M27" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 170 grid points.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Total processes</oasis:entry>
         <oasis:entry namest="col2" nameend="col4" align="center">24 </oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Original RRTMG</oasis:entry>
         <oasis:entry namest="col2" nameend="col4" align="center">1441.24 ms </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ML inference processes</oasis:entry>
         <oasis:entry colname="col2">0</oasis:entry>
         <oasis:entry colname="col3">4 CPUs</oasis:entry>
         <oasis:entry colname="col4">4 GPUs</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model A</oasis:entry>
         <oasis:entry colname="col2">91.73</oasis:entry>
         <oasis:entry colname="col3">237.57</oasis:entry>
         <oasis:entry colname="col4">176.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model B</oasis:entry>
         <oasis:entry colname="col2">365.31</oasis:entry>
         <oasis:entry colname="col3">808.43</oasis:entry>
         <oasis:entry colname="col4">331.34</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model C</oasis:entry>
         <oasis:entry colname="col2">666.37</oasis:entry>
         <oasis:entry colname="col3">921.56</oasis:entry>
         <oasis:entry colname="col4">258.66</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model D</oasis:entry>
         <oasis:entry colname="col2">2620.92</oasis:entry>
         <oasis:entry colname="col3">3383.42</oasis:entry>
         <oasis:entry colname="col4">418.28</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model E</oasis:entry>
         <oasis:entry colname="col2">7166.46</oasis:entry>
         <oasis:entry colname="col3">10 330.05</oasis:entry>
         <oasis:entry colname="col4">1690.76</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e1310">Differences in temperature at 2 m at forecast horizons of 24, 48, and 72 h between WRF simulations coupled with ML models A, B, C, D, and E and the WRF simulation with the original RRTMG schemes for radiation emulation. The title of each figure indicates the mean absolute difference (MAD) of the WRF domain.</p></caption>
          <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f07.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><?xmltex \def\figurename{Figure}?><label>Figure 8</label><caption><p id="d1e1322">Differences in wind speed at 10 m at forecast horizons of 24, 48, and 72 h between WRF simulations coupled with ML models A, B, C, D, and E and the WRF simulation with the original RRTMG schemes for radiation emulation.</p></caption>
          <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/199/2023/gmd-16-199-2023-f08.png"/>

        </fig>

</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Prognostic validation</title>
      <p id="d1e1339">Figures <xref ref-type="fig" rid="Ch1.F7"/> and <xref ref-type="fig" rid="Ch1.F8"/> present the difference in 24, 48, and 72 h forecasts between the WRF simulations coupled with ML-based radiation emulators and the WRF simulation with original RRTMG schemes. On the spatial difference, the red and blue patterns indicate significantly positive and negative biases of ML-based simulations, respectively. On the other hand, green patterns indicate no or small difference from the original WRF simulations. A domain-averaged mean absolute difference (MAD) is also calculated to measure the overall performance of ML-based emulators in online settings. Notably, the difference does not increase with the simulation time, as the difference at 72 forecast hours is similar to that of 24 forecast hours. The WRF simulations coupled with FC networks (models A and B) have the worst performance, with a much larger difference from the original WRF simulation than WRF simulations coupled with Bi-LSTM models (models D and E) over the entire domain. Model B is more accurate than model A in offline settings but performs much worse than model A in the online evaluation. The MAD of <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">WS</mml:mi><mml:mrow><mml:mn mathvariant="normal">10</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is greater than 5.6 and 1.3, respectively, for model B, while the MAD of <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">WS</mml:mi><mml:mrow><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is slightly less than 2.3 and 1.0, respectively, for models A. Model C performs best among all the three FC networks, with the MAD slightly worse than the MAD of models D and E. Models D and E have a similar spatial distribution of difference, although model E has about 10 times the number of parameters as model D and a slightly higher accuracy in offline validation.</p>
      <p id="d1e1407">Overall, model E demonstrates the best performance, with the domain-averaged MAD smaller than 0.10 for both temperature at 2 m and wind speed at 10 m. Both model D and model E show a comparable forecast with the original WRF simulation over the entire domain. Overall, model D is more suitable to replace the original RRTMG schemes as it is more computationally efficient than model E. Using model D with GPU as inference processors is about 7 times faster.</p>
      <p id="d1e1410">In summary, the WRF–ML coupler enables all the ML-based radiation emulators to couple with the WRF models efficiently. Moreover, the WRF simulations coupled with ML models are run successfully using CPU or GPU for inference. Furthermore, the WRF–ML coupler can be a valuable tool for researchers to evaluate their ML-based parameterization in online settings.</p>
</sec>
</sec>
<sec id="Ch1.S6" sec-type="conclusions">
  <label>6</label><title>Summary and conclusions</title>
      <p id="d1e1422">As demonstrated in many previous studies, emulating the subgrid physics using ML models has the potential to be faster or, by training on observations or detailed high-resolution models, even more accurate than the conventional parameterization schemes. However, the fact that ML models are commonly implemented using Python while the NWP models are coded in Fortran makes it challenging to implement the ML-based parameterization in NWP models for operational applications. Previously, researchers usually hard-coded the operations of NNs into Fortran, which is time-consuming and prone to error as even minor changes to the ML model require rewriting the Fortran code. Some researchers used simple Fortran-based NN libraries, which could convert existing models trained in Python to models usable in the Fortran framework. However, these methods are only feasible for applying simple, dense, layer-based NNs. In this paper, we propose a WRF–ML coupler that provides researchers with the tool to implement ML models into NWP models with minimal effort. The advantage of this coupler is that more complex and advanced ML model structures are supported, as it can directly communicate with Python-based modules. The coupler is made open source in the hope that more and more researchers can integrate the ML-based parametrization to improve the traditional NWP models for more accurate weather or climate forecast. In addition, the methodologies also allow the integration of any Python scripts into the WRF model. An example of coupling the primitive Python subroutines into the WRF code is also provided in the code repository.</p>
      <p id="d1e1425"><?xmltex \hack{\newpage}?>Furthermore, the capability of the WRF–ML coupler was demonstrated by coupling the ML-based radiation parameterization into the WRF model. It was also illustrated that ML-based parameterizations' computational efficiency and accuracy should be balanced to achieve the best overall performance for operational applications.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e1433">The source code and data used in this are available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.7407487" ext-link-type="DOI">10.5281/zenodo.7407487</ext-link> <xref ref-type="bibr" rid="bib1.bibx34" id="paren.32"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e1445">YY trained the machine learning models and calculated the statistics of models' offline performance. XZ, YW, LX, and ZM modified the WRF source code and developed the WRF–ML coupler. XZ conducted the WRF model simulations coupled with the original RRTMG schemes for ML model training and offline evaluation. XZ and ZM conducted the WRF simulations coupled with ML-based radiation emulators. XZ wrote, reviewed, and edited the original draft; ZW supervised and supported this research and gave valuable opinions. All of the authors have contributed to and agreed to the published version of the paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e1451">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e1457">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e1463">This research has been supported by the Zhejiang Science and Technology Program under grant no. 2021C01017.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e1469">This paper was edited by Klaus Klingmüller and reviewed by Peter Ukkonen and Sergey Osipov.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Belochitski and Krasnopolsky(2021)}}?><label>Belochitski and Krasnopolsky(2021)</label><?label belochitski2021robustness?><mixed-citation>Belochitski, A. and Krasnopolsky, V.: Robustness of neural network emulations of radiative transfer parameterizations in a state-of-the-art general circulation model, Geosci. Model Dev., 14, 7425–7437, <ext-link xlink:href="https://doi.org/10.5194/gmd-14-7425-2021" ext-link-type="DOI">10.5194/gmd-14-7425-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx2"><?xmltex \def\ref@label{{{Bougeault} and {Lacarr{\`{e}}re}(1989)}}?><label>Bougeault and Lacarrère(1989)</label><?label Bougeault1989?><mixed-citation>Bougeault, P. and Lacarrère, P.: Parameterization of orographic
induced turbulence in a mesobeta scale model, Mon. Weather Rev., 117,
1872–1890, <ext-link xlink:href="https://doi.org/10.1175/1520-0493(1989)117&lt;1872:POOITI&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0493(1989)117&lt;1872:POOITI&gt;2.0.CO;2</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx3"><?xmltex \def\ref@label{{Chantry et~al.(2021)Chantry, Hatfield, Dueben, Polichtchouk, and
Palmer}}?><label>Chantry et al.(2021)Chantry, Hatfield, Dueben, Polichtchouk, and
Palmer</label><?label Chantry2021?><mixed-citation>Chantry, M., Hatfield, S., Dueben, P., Polichtchouk, I., and Palmer, T.:
Machine Learning Emulation of Gravity Wave Drag in Numerical Weather
Forecasting, J. Adv. Model. Earth Sy., 13,
e2021MS002477, <ext-link xlink:href="https://doi.org/10.1029/2021MS002477" ext-link-type="DOI">10.1029/2021MS002477</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx4"><?xmltex \def\ref@label{{Chevallier et~al.(1998)Chevallier, Ch{\'{e}}ruy, Scott, and
Ch{\'{e}}din}}?><label>Chevallier et al.(1998)Chevallier, Chéruy, Scott, and
Chédin</label><?label chevallier1998neural?><mixed-citation>
Chevallier, F., Chéruy, F., Scott, N., and Chédin, A.: A neural network
approach for a fast and accurate computation of a longwave radiative budget,
J. Appl. Meteorol., 37, 1385–1397, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx5"><?xmltex \def\ref@label{{Chevallier et~al.(2000)Chevallier, Morcrette, Ch{\'{e}}ruy, and
Scott}}?><label>Chevallier et al.(2000)Chevallier, Morcrette, Chéruy, and
Scott</label><?label chevallier2000use?><mixed-citation>
Chevallier, F., Morcrette, J.-J., Chéruy, F., and Scott, N.: Use of a
neural-network-based long-wave radiative-transfer scheme in the ECMWF
atmospheric model, Q. J. Roy. Meteor. Soc.,
126, 761–776, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx6"><?xmltex \def\ref@label{{Clough et~al.(2005)Clough, Shephard, Mlawer, Delamere, Iacono,
Cady-Pereira, Boukabara, and Brown}}?><label>Clough et al.(2005)Clough, Shephard, Mlawer, Delamere, Iacono,
Cady-Pereira, Boukabara, and Brown</label><?label clough2005atmospheric?><mixed-citation>
Clough, S., Shephard, M., Mlawer, E., Delamere, J., Iacono, M., Cady-Pereira,
K., Boukabara, S., and Brown, P.: Atmospheric radiative transfer modeling: a
summary of the AER codes, J. Quant. Spectrosc. Ra., 91, 233–244, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx7"><?xmltex \def\ref@label{{Coiffier(2011)}}?><label>Coiffier(2011)</label><?label Coiffier2011?><mixed-citation>Coiffier, J.: Fundamentals of Numerical Weather Prediction, 1st edn., Cambridge
University Press, ISBN 9780511734458, <ext-link xlink:href="https://doi.org/10.1017/CBO9780511734458" ext-link-type="DOI">10.1017/CBO9780511734458</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Grimmond et~al.(2010)Grimmond, Blackett, Best, Barlow, Baik, Belcher,
Bohnenstengel, Calmet, Chen, Dandou, Fortuniak, Gouvea, Hamdi, Hendry, Kawai,
Kawamoto, Kondo, Krayenhoff, Lee, Loridan, Martilli, Masson, Miao, Oleson,
Pigeon, Porson, Ryu, Salamanca, Shashua-Bar, Steeneveld, Tombrou, Voogt,
Young, and Zhang}}?><label>Grimmond et al.(2010)Grimmond, Blackett, Best, Barlow, Baik, Belcher,
Bohnenstengel, Calmet, Chen, Dandou, Fortuniak, Gouvea, Hamdi, Hendry, Kawai,
Kawamoto, Kondo, Krayenhoff, Lee, Loridan, Martilli, Masson, Miao, Oleson,
Pigeon, Porson, Ryu, Salamanca, Shashua-Bar, Steeneveld, Tombrou, Voogt,
Young, and Zhang</label><?label Grimmond2010?><mixed-citation>Grimmond, C. S. B., Blackett, M., Best, M. J., Barlow, J., Baik, J.-J.,
Belcher, S. E., Bohnenstengel, S. I., Calmet, I., Chen, F., Dandou, A.,
Fortuniak, K., Gouvea, M. L., Hamdi, R., Hendry, M., Kawai, T., Kawamoto, Y.,
Kondo, H., Krayenhoff, E. S., Lee, S.-H., Loridan, T., Martilli, A., Masson,
V., Miao, S., Oleson, K., Pigeon, G., Porson, A., Ryu, Y.-H., Salamanca, F.,
Shashua-Bar, L., Steeneveld, G.-J., Tombrou, M., Voogt, J., Young, D., and
Zhang, N.: The international urban energy balance models comparison project:
First results from phase 1, J. Appl. Meteorol. Clim.,
49, 1268–1292, <ext-link xlink:href="https://doi.org/10.1175/2010JAMC2354.1" ext-link-type="DOI">10.1175/2010JAMC2354.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx9"><?xmltex \def\ref@label{{Grimmond et~al.(2011)Grimmond, Blackett, Best, Baik, Belcher,
Bohnenstengel, Calmet, Chen, Coutts, Dandou, Fortuniak, Gouvea, Hamdi,
Hendry, , Kanda, Kawai, Kawamoto, Kondo, Krayenhoff, Lee, Loridan, Martilli,
Masson, Miao, Oleson, Ooka, , Pigeon, Porson, Ryu, Salamanca, Steeneveld,
Tombrou, Voogt, Young, and Zhang}}?><label>Grimmond et al.(2011)Grimmond, Blackett, Best, Baik, Belcher,
Bohnenstengel, Calmet, Chen, Coutts, Dandou, Fortuniak, Gouvea, Hamdi,
Hendry, , Kanda, Kawai, Kawamoto, Kondo, Krayenhoff, Lee, Loridan, Martilli,
Masson, Miao, Oleson, Ooka, , Pigeon, Porson, Ryu, Salamanca, Steeneveld,
Tombrou, Voogt, Young, and Zhang</label><?label Grimmond2011?><mixed-citation>Grimmond, C. S. B., Blackett, M., Best, M. J., Baik, J.-J., Belcher, S. E.,
Bohnenstengel, S. I., Calmet, I., Chen, F., Coutts, A., Dandou, A.,
Fortuniak, K., Gouvea, M. L., Hamdi, R., Hendry, M., Kanda, M., Kawai, T.,
Kawamoto, Y., Kondo, H., Krayenhoff, E. S., Lee, S.-H., Loridan, T.,
Martilli, A., Masson, V., Miao, S., Oleson, K., Ooka, R., Pigeon, G.,
Porson, A., Ryu, Y.-H., Salamanca, F., Steeneveld, G.-J., Tombrou, M., Voogt,
J., Young, D., and Zhang, N.: Initial results from Phase 2 of the
international urban energy balance model comparison, Int. J.
Climatol., 31, 244–272,  <ext-link xlink:href="https://doi.org/10.1002/joc.2227" ext-link-type="DOI">10.1002/joc.2227</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx10"><?xmltex \def\ref@label{{Grundner et~al.(2021)Grundner, Beucler, Iglesias-suarez, Gentine,
Giorgetta, and Eyring}}?><label>Grundner et al.(2021)Grundner, Beucler, Iglesias-suarez, Gentine,
Giorgetta, and Eyring</label><?label Grundner2021?><mixed-citation>Grundner, A., Beucler, T., Iglesias-suarez, F., Gentine, P., Giorgetta, M. A.,
and Eyring, V.: Deep Learning Based Cloud Cover Parameterization for ICON,
arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2112.11317" ext-link-type="DOI">10.48550/arXiv.2112.11317</ext-link>, 21 December 2021.</mixed-citation></ref>
      <ref id="bib1.bibx11"><?xmltex \def\ref@label{{Gulli and Pal(2017)}}?><label>Gulli and Pal(2017)</label><?label Gulli2017?><mixed-citation>
Gulli, A. and Pal, S.: Deep learning with Keras, 1st edn., Packt Publishing Ltd, ISBN 9781787128422, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx12"><?xmltex \def\ref@label{{Hatfield et~al.(2021)Hatfield, Chantry, Dueben, Lopez, Geer, and
Palmer}}?><label>Hatfield et al.(2021)Hatfield, Chantry, Dueben, Lopez, Geer, and
Palmer</label><?label Hatfield2021?><mixed-citation>Hatfield, S., Chantry, M., Dueben, P., Lopez, P., Geer, A., and Palmer, T.:
Building Tangent-Linear and Adjoint Models for Data Assimilation With Neural
Networks, J. Adv. Model. Earth Sy., 13, e2021MS002521, <ext-link xlink:href="https://doi.org/10.1029/2021MS002521" ext-link-type="DOI">10.1029/2021MS002521</ext-link>,
2021.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Iacono et~al.(2008)Iacono, Delamere, Mlawer, Shephard, Clough, and
Collins}}?><label>Iacono et al.(2008)Iacono, Delamere, Mlawer, Shephard, Clough, and
Collins</label><?label Iacono2008?><mixed-citation>Iacono, M. J., Delamere, J. S., Mlawer, E. J., Shephard, M. W., Clough, S. A.,
and Collins, W. D.: Radiative forcing by long-lived greenhouse gases:
Calculations with the AER radiative transfer models, J. Geophys.
Res.-Atmos., 113, D13103, <ext-link xlink:href="https://doi.org/10.1029/2008JD009944" ext-link-type="DOI">10.1029/2008JD009944</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx14"><?xmltex \def\ref@label{{Janisková et~al.(2002)}}?><label>Janisková et al.(2002)</label><?label Janiskovaetal2002?><mixed-citation>
Janisková, M., Mahfouf, J. F., Morcrette, J. J., and Chevallier, F.: Linearized radiation and cloud schemes in the ECMWF model: Development and evaluation, Q. J. Roy. Meteor. Soc., 128, 1505–1527, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx15"><?xmltex \def\ref@label{{Krasnopolsky et~al.(2010)Krasnopolsky, Fox-Rabinovitz, Hou, Lord, and
Belochitski}}?><label>Krasnopolsky et al.(2010)Krasnopolsky, Fox-Rabinovitz, Hou, Lord, and
Belochitski</label><?label krasnopolsky2010accurate?><mixed-citation>
Krasnopolsky, V., Fox-Rabinovitz, M., Hou, Y., Lord, S., and Belochitski, A.:
Accurate and fast neural network emulations of model radiation for the NCEP
coupled climate forecast system: Climate simulations and seasonal
predictions, Mon. Weather Rev., 138, 1822–1842, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx16"><?xmltex \def\ref@label{{Krasnopolsky(2014)}}?><label>Krasnopolsky(2014)</label><?label Krasnopolsky2014?><mixed-citation>Krasnopolsky, V. M.: NN-TSV, NCEP neural network training and validation
system; brief description of NN background and training software, NOAA,
<ext-link xlink:href="https://doi.org/10.7289/v5qr4v2z" ext-link-type="DOI">10.7289/v5qr4v2z</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx17"><?xmltex \def\ref@label{{Mason(1989)}}?><label>Mason(1989)</label><?label Mason1989?><mixed-citation>Mason, P. J.: Large-Eddy Simulation of the Convective Atmospheric Boundary
Layer, J. Atmos. Sci., 46, 1492–1516,
<ext-link xlink:href="https://doi.org/10.1175/1520-0469(1989)046&lt;1492:LESOTC&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1989)046&lt;1492:LESOTC&gt;2.0.CO;2</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx18"><?xmltex \def\ref@label{{Meyer et~al.(2022{\natexlab{a}})Meyer, Grimmond, Dueben, Hogan, and
Reeuwijk}}?><label>Meyer et al.(2022a)Meyer, Grimmond, Dueben, Hogan, and
Reeuwijk</label><?label Meyer2022?><mixed-citation>Meyer, D., Grimmond, S., Dueben, P., Hogan, R., and Reeuwijk, M. V.: Machine
Learning Emulation of Urban Land Surface Processes, J. Adv.
Model. Earth Sy., 14, e2021MS002744, <ext-link xlink:href="https://doi.org/10.1029/2021MS002744" ext-link-type="DOI">10.1029/2021MS002744</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{Meyer et~al.(2022{\natexlab{b}})Meyer, Hogan, Dueben, and
Mason}}?><label>Meyer et al.(2022b)Meyer, Hogan, Dueben, and
Mason</label><?label Meyer_2022?><mixed-citation>Meyer, D., Hogan, R. J., Dueben, P. D., and Mason, S. L.: Machine Learning
Emulation of 3D Cloud Radiative Effects, J. Adv. Model.
Earth Sy., 14, e2021MS002550, <ext-link xlink:href="https://doi.org/10.1029/2021ms002550" ext-link-type="DOI">10.1029/2021ms002550</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bibx20"><?xmltex \def\ref@label{{Ott et~al.(2020)Ott, Pritchard, Best, Linstead, Curcic, and
Baldi}}?><label>Ott et al.(2020)Ott, Pritchard, Best, Linstead, Curcic, and
Baldi</label><?label Ott2020?><mixed-citation>Ott, J., Pritchard, M., Best, N., Linstead, E., Curcic, M., and Baldi, P.: A
Fortran-Keras Deep Learning Bridge for Scientific Computing, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2004.10652" ext-link-type="DOI">10.48550/arXiv.2004.10652</ext-link>, 14 April 2020.</mixed-citation></ref>
      <ref id="bib1.bibx21"><?xmltex \def\ref@label{{Paszke et~al.(2017)Paszke, Gross, Chintala, Chanan, Yang, DeVito,
Lin, Desmaison, Antiga, and Lerer}}?><label>Paszke et al.(2017)Paszke, Gross, Chintala, Chanan, Yang, DeVito,
Lin, Desmaison, Antiga, and Lerer</label><?label Paszke2017?><mixed-citation>Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z.,
Desmaison, A., Antiga, L., and Lerer, A.: Automatic Differentiation in
PyTorch, in: NIPS 2017 Workshop on Autodiff, Long Beach, California, USA and 9 December 2017,
<uri>https://openreview.net/forum?id=BJJsrmfCZ</uri> (last access: 23 December 2022), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx22"><?xmltex \def\ref@label{{Pu and Kalnay(2018)}}?><label>Pu and Kalnay(2018)</label><?label Pu2018?><mixed-citation>Pu, Z. and Kalnay, E.: Numerical Weather Prediction Basics: Models, Numerical
Methods, and Data Assimilation, Springer Berlin Heidelberg, 67–97, <ext-link xlink:href="https://doi.org/10.1007/978-3-642-40457-3_11-1" ext-link-type="DOI">10.1007/978-3-642-40457-3_11-1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx23"><?xmltex \def\ref@label{{Rasp et~al.(2018)Rasp, Pritchard, and Gentine}}?><label>Rasp et al.(2018)Rasp, Pritchard, and Gentine</label><?label Rasp2018?><mixed-citation>Rasp, S., Pritchard, M. S., and Gentine, P.: Deep learning to represent subgrid
processes in climate models, P. Natl. Acad. Sci. USA, 115, 9684–9689, <ext-link xlink:href="https://doi.org/10.1073/PNAS.1810286115" ext-link-type="DOI">10.1073/PNAS.1810286115</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx24"><?xmltex \def\ref@label{{Schneider et~al.(2017)Schneider, Lan, Stuart, and
Teixeira}}?><label>Schneider et al.(2017)Schneider, Lan, Stuart, and
Teixeira</label><?label Schneider2017?><mixed-citation>Schneider, T., Lan, S., Stuart, A., and Teixeira, J.: Earth System Modeling
2.0: A Blueprint for Models That Learn From Observations and Targeted
High-Resolution Simulations, Geophys. Res. Lett., 44,
12396–12417, <ext-link xlink:href="https://doi.org/10.1002/2017GL076101" ext-link-type="DOI">10.1002/2017GL076101</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx25"><?xmltex \def\ref@label{{Smirnova et~al.(1997)Smirnova, Brown, and Benjamin}}?><label>Smirnova et al.(1997)Smirnova, Brown, and Benjamin</label><?label Smirnova1997?><mixed-citation>Smirnova, T. G., Brown, J. M., and Benjamin, S. G.: Performance of Different
Soil Model Configurations in Simulating Ground Surface Temperature and
Surface Fluxes, Mon. Weather Rev., 125, 1870–1884,
<ext-link xlink:href="https://doi.org/10.1175/1520-0493(1997)125&lt;1870:PODSMC&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0493(1997)125&lt;1870:PODSMC&gt;2.0.CO;2</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx26"><?xmltex \def\ref@label{{{Smirnova} et~al.(2000){Smirnova}, {Brown}, {Benjamin}, and
{Kim}}}?><label>Smirnova et al.(2000)Smirnova, Brown, Benjamin, and
Kim</label><?label Smirnova2000?><mixed-citation>Smirnova, T. G., Brown, J. M., Benjamin, S. G., and Kim, D.:
Parameterization of cold-season processes in the MAPS land-surface scheme,
J. Geophys. Res., 105, 4077–4086, <ext-link xlink:href="https://doi.org/10.1029/1999JD901047" ext-link-type="DOI">10.1029/1999JD901047</ext-link>,
2000.</mixed-citation></ref>
      <ref id="bib1.bibx27"><?xmltex \def\ref@label{{Song and Roh(2021)}}?><label>Song and Roh(2021)</label><?label Song&Rong2021?><mixed-citation>Song, H.-J. and Roh, S.: Improved Weather Forecasting Using Neural Network
Emulation for Radiation Parameterization, J. Adv. Model.
Earth Sy., 13, e2021MS002609, <ext-link xlink:href="https://doi.org/10.1029/2021MS002609" ext-link-type="DOI">10.1029/2021MS002609</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{{Stensrud(2013)}}?><label>Stensrud(2013)</label><?label Stensrud2013?><mixed-citation>Stensrud, D. J.: Parameterization schemes: keys to understanding numerical
weather prediction models, 2nd edn., Cambridge University Press, ISBN 9780511812590,
<ext-link xlink:href="https://doi.org/10.1017/CBO9780511812590" ext-link-type="DOI">10.1017/CBO9780511812590</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx29"><?xmltex \def\ref@label{{Thompson et~al.(2008)Thompson, Field, Rasmussen, and
Hall}}?><label>Thompson et al.(2008)Thompson, Field, Rasmussen, and
Hall</label><?label Thompson2008?><mixed-citation>Thompson, G., Field, P. R., Rasmussen, R. M., and Hall, W. D.: Explicit
Forecasts of Winter Precipitation Using an Improved Bulk Microphysics Scheme.
Part II: Implementation of a New Snow Parameterization, Mon. Weather
Rev., 136, 5095–5115, <ext-link xlink:href="https://doi.org/10.1175/2008MWR2387.1" ext-link-type="DOI">10.1175/2008MWR2387.1</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx30"><?xmltex \def\ref@label{{Ukkonen(2022)}}?><label>Ukkonen(2022)</label><?label ukkonen2022exploring?><mixed-citation>Ukkonen, P.: Exploring pathways to more accurate machine learning emulation of
atmospheric radiative transfer, J. Adv. Model. Earth
Sy., 14, e2021MS002875, <ext-link xlink:href="https://doi.org/10.1029/2021MS002875" ext-link-type="DOI">10.1029/2021MS002875</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx31"><?xmltex \def\ref@label{{Wang et~al.(2019)Wang, Balaprakash, and Kotamarthi}}?><label>Wang et al.(2019)Wang, Balaprakash, and Kotamarthi</label><?label WangJ2019?><mixed-citation>Wang, J., Balaprakash, P., and Kotamarthi, R.: Fast domain-aware neural network emulation of a planetary boundary layer parameterization in a numerical weather forecast model, Geosci. Model Dev., 12, 4261–4274, <ext-link xlink:href="https://doi.org/10.5194/gmd-12-4261-2019" ext-link-type="DOI">10.5194/gmd-12-4261-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx32"><?xmltex \def\ref@label{{Wang et~al.(2022)Wang, Han, Xue, Yang, and Zhang}}?><label>Wang et al.(2022)Wang, Han, Xue, Yang, and Zhang</label><?label WangX2022?><mixed-citation>Wang, X., Han, Y., Xue, W., Yang, G., and Zhang, G. J.: Stable climate simulations using a realistic general circulation model with neural network parameterizations for atmospheric moist physics and radiation processes, Geosci. Model Dev., 15, 3923–3940, <ext-link xlink:href="https://doi.org/10.5194/gmd-15-3923-2022" ext-link-type="DOI">10.5194/gmd-15-3923-2022</ext-link>, 2022.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx33"><?xmltex \def\ref@label{{Yao et~al.(2023)Yao, Zhong, Zheng, and Wang}}?><label>Yao et al.(2023)Yao, Zhong, Zheng, and Wang</label><?label YaoZhong2022?><mixed-citation>
Yao, Y., Zhong, X., Zheng, Y., and Wang, Z.: A Physics-Incorporated Deep
Learning Framework for Parameterization of Atmospheric Radiative Transfer,
J. Adv. Model. Earth Sy., in review, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx34"><?xmltex \def\ref@label{{Zhong et~al.(2022)Zhong, Ma, Yao, Xue, Wu, and Wang}}?><label>Zhong et al.(2022)Zhong, Ma, Yao, Xue, Wu, and Wang</label><?label code2022?><mixed-citation>Zhong, X., Ma, Z., Yao, Y., Xue, L., Wu, Y., and Wang, Z.: Code and Data for
“A Bridge between WRF and Deep Learning Parameterization and
Application on Deep Learning Parameterization of Atmospheric Radiative
Transfer”, Zenodo [code and data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.7407487" ext-link-type="DOI">10.5281/zenodo.7407487</ext-link>, 2022.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>WRF–ML v1.0: a bridge between WRF v4.3 and machine learning parameterizations and its application to atmospheric radiative transfer</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Belochitski and Krasnopolsky(2021)</label><mixed-citation>
Belochitski, A. and Krasnopolsky, V.: Robustness of neural network emulations of radiative transfer parameterizations in a state-of-the-art general circulation model, Geosci. Model Dev., 14, 7425–7437, <a href="https://doi.org/10.5194/gmd-14-7425-2021" target="_blank">https://doi.org/10.5194/gmd-14-7425-2021</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Bougeault and Lacarrère(1989)</label><mixed-citation>
Bougeault, P. and Lacarrère, P.: Parameterization of orographic
induced turbulence in a mesobeta scale model, Mon. Weather Rev., 117,
1872–1890, <a href="https://doi.org/10.1175/1520-0493(1989)117&lt;1872:POOITI&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0493(1989)117&lt;1872:POOITI&gt;2.0.CO;2</a>, 1989.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Chantry et al.(2021)Chantry, Hatfield, Dueben, Polichtchouk, and
Palmer</label><mixed-citation>
Chantry, M., Hatfield, S., Dueben, P., Polichtchouk, I., and Palmer, T.:
Machine Learning Emulation of Gravity Wave Drag in Numerical Weather
Forecasting, J. Adv. Model. Earth Sy., 13,
e2021MS002477, <a href="https://doi.org/10.1029/2021MS002477" target="_blank">https://doi.org/10.1029/2021MS002477</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Chevallier et al.(1998)Chevallier, Chéruy, Scott, and
Chédin</label><mixed-citation>
Chevallier, F., Chéruy, F., Scott, N., and Chédin, A.: A neural network
approach for a fast and accurate computation of a longwave radiative budget,
J. Appl. Meteorol., 37, 1385–1397, 1998.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Chevallier et al.(2000)Chevallier, Morcrette, Chéruy, and
Scott</label><mixed-citation>
Chevallier, F., Morcrette, J.-J., Chéruy, F., and Scott, N.: Use of a
neural-network-based long-wave radiative-transfer scheme in the ECMWF
atmospheric model, Q. J. Roy. Meteor. Soc.,
126, 761–776, 2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Clough et al.(2005)Clough, Shephard, Mlawer, Delamere, Iacono,
Cady-Pereira, Boukabara, and Brown</label><mixed-citation>
Clough, S., Shephard, M., Mlawer, E., Delamere, J., Iacono, M., Cady-Pereira,
K., Boukabara, S., and Brown, P.: Atmospheric radiative transfer modeling: a
summary of the AER codes, J. Quant. Spectrosc. Ra., 91, 233–244, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Coiffier(2011)</label><mixed-citation>
Coiffier, J.: Fundamentals of Numerical Weather Prediction, 1st edn., Cambridge
University Press, ISBN 9780511734458, <a href="https://doi.org/10.1017/CBO9780511734458" target="_blank">https://doi.org/10.1017/CBO9780511734458</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Grimmond et al.(2010)Grimmond, Blackett, Best, Barlow, Baik, Belcher,
Bohnenstengel, Calmet, Chen, Dandou, Fortuniak, Gouvea, Hamdi, Hendry, Kawai,
Kawamoto, Kondo, Krayenhoff, Lee, Loridan, Martilli, Masson, Miao, Oleson,
Pigeon, Porson, Ryu, Salamanca, Shashua-Bar, Steeneveld, Tombrou, Voogt,
Young, and Zhang</label><mixed-citation>
Grimmond, C. S. B., Blackett, M., Best, M. J., Barlow, J., Baik, J.-J.,
Belcher, S. E., Bohnenstengel, S. I., Calmet, I., Chen, F., Dandou, A.,
Fortuniak, K., Gouvea, M. L., Hamdi, R., Hendry, M., Kawai, T., Kawamoto, Y.,
Kondo, H., Krayenhoff, E. S., Lee, S.-H., Loridan, T., Martilli, A., Masson,
V., Miao, S., Oleson, K., Pigeon, G., Porson, A., Ryu, Y.-H., Salamanca, F.,
Shashua-Bar, L., Steeneveld, G.-J., Tombrou, M., Voogt, J., Young, D., and
Zhang, N.: The international urban energy balance models comparison project:
First results from phase 1, J. Appl. Meteorol. Clim.,
49, 1268–1292, <a href="https://doi.org/10.1175/2010JAMC2354.1" target="_blank">https://doi.org/10.1175/2010JAMC2354.1</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Grimmond et al.(2011)Grimmond, Blackett, Best, Baik, Belcher,
Bohnenstengel, Calmet, Chen, Coutts, Dandou, Fortuniak, Gouvea, Hamdi,
Hendry, , Kanda, Kawai, Kawamoto, Kondo, Krayenhoff, Lee, Loridan, Martilli,
Masson, Miao, Oleson, Ooka, , Pigeon, Porson, Ryu, Salamanca, Steeneveld,
Tombrou, Voogt, Young, and Zhang</label><mixed-citation>
Grimmond, C. S. B., Blackett, M., Best, M. J., Baik, J.-J., Belcher, S. E.,
Bohnenstengel, S. I., Calmet, I., Chen, F., Coutts, A., Dandou, A.,
Fortuniak, K., Gouvea, M. L., Hamdi, R., Hendry, M., Kanda, M., Kawai, T.,
Kawamoto, Y., Kondo, H., Krayenhoff, E. S., Lee, S.-H., Loridan, T.,
Martilli, A., Masson, V., Miao, S., Oleson, K., Ooka, R., Pigeon, G.,
Porson, A., Ryu, Y.-H., Salamanca, F., Steeneveld, G.-J., Tombrou, M., Voogt,
J., Young, D., and Zhang, N.: Initial results from Phase 2 of the
international urban energy balance model comparison, Int. J.
Climatol., 31, 244–272,  <a href="https://doi.org/10.1002/joc.2227" target="_blank">https://doi.org/10.1002/joc.2227</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Grundner et al.(2021)Grundner, Beucler, Iglesias-suarez, Gentine,
Giorgetta, and Eyring</label><mixed-citation>
Grundner, A., Beucler, T., Iglesias-suarez, F., Gentine, P., Giorgetta, M. A.,
and Eyring, V.: Deep Learning Based Cloud Cover Parameterization for ICON,
arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2112.11317" target="_blank">https://doi.org/10.48550/arXiv.2112.11317</a>, 21 December 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Gulli and Pal(2017)</label><mixed-citation>
Gulli, A. and Pal, S.: Deep learning with Keras, 1st edn., Packt Publishing Ltd, ISBN 9781787128422, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Hatfield et al.(2021)Hatfield, Chantry, Dueben, Lopez, Geer, and
Palmer</label><mixed-citation>
Hatfield, S., Chantry, M., Dueben, P., Lopez, P., Geer, A., and Palmer, T.:
Building Tangent-Linear and Adjoint Models for Data Assimilation With Neural
Networks, J. Adv. Model. Earth Sy., 13, e2021MS002521, <a href="https://doi.org/10.1029/2021MS002521" target="_blank">https://doi.org/10.1029/2021MS002521</a>,
2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Iacono et al.(2008)Iacono, Delamere, Mlawer, Shephard, Clough, and
Collins</label><mixed-citation>
Iacono, M. J., Delamere, J. S., Mlawer, E. J., Shephard, M. W., Clough, S. A.,
and Collins, W. D.: Radiative forcing by long-lived greenhouse gases:
Calculations with the AER radiative transfer models, J. Geophys.
Res.-Atmos., 113, D13103, <a href="https://doi.org/10.1029/2008JD009944" target="_blank">https://doi.org/10.1029/2008JD009944</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Janisková et al.(2002)</label><mixed-citation>
Janisková, M., Mahfouf, J. F., Morcrette, J. J., and Chevallier, F.: Linearized radiation and cloud schemes in the ECMWF model: Development and evaluation, Q. J. Roy. Meteor. Soc., 128, 1505–1527, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Krasnopolsky et al.(2010)Krasnopolsky, Fox-Rabinovitz, Hou, Lord, and
Belochitski</label><mixed-citation>
Krasnopolsky, V., Fox-Rabinovitz, M., Hou, Y., Lord, S., and Belochitski, A.:
Accurate and fast neural network emulations of model radiation for the NCEP
coupled climate forecast system: Climate simulations and seasonal
predictions, Mon. Weather Rev., 138, 1822–1842, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Krasnopolsky(2014)</label><mixed-citation>
Krasnopolsky, V. M.: NN-TSV, NCEP neural network training and validation
system; brief description of NN background and training software, NOAA,
<a href="https://doi.org/10.7289/v5qr4v2z" target="_blank">https://doi.org/10.7289/v5qr4v2z</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Mason(1989)</label><mixed-citation>
Mason, P. J.: Large-Eddy Simulation of the Convective Atmospheric Boundary
Layer, J. Atmos. Sci., 46, 1492–1516,
<a href="https://doi.org/10.1175/1520-0469(1989)046&lt;1492:LESOTC&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1989)046&lt;1492:LESOTC&gt;2.0.CO;2</a>, 1989.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Meyer et al.(2022a)Meyer, Grimmond, Dueben, Hogan, and
Reeuwijk</label><mixed-citation>
Meyer, D., Grimmond, S., Dueben, P., Hogan, R., and Reeuwijk, M. V.: Machine
Learning Emulation of Urban Land Surface Processes, J. Adv.
Model. Earth Sy., 14, e2021MS002744, <a href="https://doi.org/10.1029/2021MS002744" target="_blank">https://doi.org/10.1029/2021MS002744</a>, 2022a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Meyer et al.(2022b)Meyer, Hogan, Dueben, and
Mason</label><mixed-citation>
Meyer, D., Hogan, R. J., Dueben, P. D., and Mason, S. L.: Machine Learning
Emulation of 3D Cloud Radiative Effects, J. Adv. Model.
Earth Sy., 14, e2021MS002550, <a href="https://doi.org/10.1029/2021ms002550" target="_blank">https://doi.org/10.1029/2021ms002550</a>, 2022b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Ott et al.(2020)Ott, Pritchard, Best, Linstead, Curcic, and
Baldi</label><mixed-citation>
Ott, J., Pritchard, M., Best, N., Linstead, E., Curcic, M., and Baldi, P.: A
Fortran-Keras Deep Learning Bridge for Scientific Computing, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2004.10652" target="_blank">https://doi.org/10.48550/arXiv.2004.10652</a>, 14 April 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Paszke et al.(2017)Paszke, Gross, Chintala, Chanan, Yang, DeVito,
Lin, Desmaison, Antiga, and Lerer</label><mixed-citation>
Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z.,
Desmaison, A., Antiga, L., and Lerer, A.: Automatic Differentiation in
PyTorch, in: NIPS 2017 Workshop on Autodiff, Long Beach, California, USA and 9 December 2017,
<a href="https://openreview.net/forum?id=BJJsrmfCZ" target="_blank"/> (last access: 23 December 2022), 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Pu and Kalnay(2018)</label><mixed-citation>
Pu, Z. and Kalnay, E.: Numerical Weather Prediction Basics: Models, Numerical
Methods, and Data Assimilation, Springer Berlin Heidelberg, 67–97, <a href="https://doi.org/10.1007/978-3-642-40457-3_11-1" target="_blank">https://doi.org/10.1007/978-3-642-40457-3_11-1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Rasp et al.(2018)Rasp, Pritchard, and Gentine</label><mixed-citation>
Rasp, S., Pritchard, M. S., and Gentine, P.: Deep learning to represent subgrid
processes in climate models, P. Natl. Acad. Sci. USA, 115, 9684–9689, <a href="https://doi.org/10.1073/PNAS.1810286115" target="_blank">https://doi.org/10.1073/PNAS.1810286115</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Schneider et al.(2017)Schneider, Lan, Stuart, and
Teixeira</label><mixed-citation>
Schneider, T., Lan, S., Stuart, A., and Teixeira, J.: Earth System Modeling
2.0: A Blueprint for Models That Learn From Observations and Targeted
High-Resolution Simulations, Geophys. Res. Lett., 44,
12396–12417, <a href="https://doi.org/10.1002/2017GL076101" target="_blank">https://doi.org/10.1002/2017GL076101</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Smirnova et al.(1997)Smirnova, Brown, and Benjamin</label><mixed-citation>
Smirnova, T. G., Brown, J. M., and Benjamin, S. G.: Performance of Different
Soil Model Configurations in Simulating Ground Surface Temperature and
Surface Fluxes, Mon. Weather Rev., 125, 1870–1884,
<a href="https://doi.org/10.1175/1520-0493(1997)125&lt;1870:PODSMC&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0493(1997)125&lt;1870:PODSMC&gt;2.0.CO;2</a>, 1997.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Smirnova et al.(2000)Smirnova, Brown, Benjamin, and
Kim</label><mixed-citation>
Smirnova, T. G., Brown, J. M., Benjamin, S. G., and Kim, D.:
Parameterization of cold-season processes in the MAPS land-surface scheme,
J. Geophys. Res., 105, 4077–4086, <a href="https://doi.org/10.1029/1999JD901047" target="_blank">https://doi.org/10.1029/1999JD901047</a>,
2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Song and Roh(2021)</label><mixed-citation>
Song, H.-J. and Roh, S.: Improved Weather Forecasting Using Neural Network
Emulation for Radiation Parameterization, J. Adv. Model.
Earth Sy., 13, e2021MS002609, <a href="https://doi.org/10.1029/2021MS002609" target="_blank">https://doi.org/10.1029/2021MS002609</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Stensrud(2013)</label><mixed-citation>
Stensrud, D. J.: Parameterization schemes: keys to understanding numerical
weather prediction models, 2nd edn., Cambridge University Press, ISBN 9780511812590,
<a href="https://doi.org/10.1017/CBO9780511812590" target="_blank">https://doi.org/10.1017/CBO9780511812590</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Thompson et al.(2008)Thompson, Field, Rasmussen, and
Hall</label><mixed-citation>
Thompson, G., Field, P. R., Rasmussen, R. M., and Hall, W. D.: Explicit
Forecasts of Winter Precipitation Using an Improved Bulk Microphysics Scheme.
Part II: Implementation of a New Snow Parameterization, Mon. Weather
Rev., 136, 5095–5115, <a href="https://doi.org/10.1175/2008MWR2387.1" target="_blank">https://doi.org/10.1175/2008MWR2387.1</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Ukkonen(2022)</label><mixed-citation>
Ukkonen, P.: Exploring pathways to more accurate machine learning emulation of
atmospheric radiative transfer, J. Adv. Model. Earth
Sy., 14, e2021MS002875, <a href="https://doi.org/10.1029/2021MS002875" target="_blank">https://doi.org/10.1029/2021MS002875</a>, 2022.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Wang et al.(2019)Wang, Balaprakash, and Kotamarthi</label><mixed-citation>
Wang, J., Balaprakash, P., and Kotamarthi, R.: Fast domain-aware neural network emulation of a planetary boundary layer parameterization in a numerical weather forecast model, Geosci. Model Dev., 12, 4261–4274, <a href="https://doi.org/10.5194/gmd-12-4261-2019" target="_blank">https://doi.org/10.5194/gmd-12-4261-2019</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Wang et al.(2022)Wang, Han, Xue, Yang, and Zhang</label><mixed-citation>
Wang, X., Han, Y., Xue, W., Yang, G., and Zhang, G. J.: Stable climate simulations using a realistic general circulation model with neural network parameterizations for atmospheric moist physics and radiation processes, Geosci. Model Dev., 15, 3923–3940, <a href="https://doi.org/10.5194/gmd-15-3923-2022" target="_blank">https://doi.org/10.5194/gmd-15-3923-2022</a>, 2022.

</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Yao et al.(2023)Yao, Zhong, Zheng, and Wang</label><mixed-citation>
Yao, Y., Zhong, X., Zheng, Y., and Wang, Z.: A Physics-Incorporated Deep
Learning Framework for Parameterization of Atmospheric Radiative Transfer,
J. Adv. Model. Earth Sy., in review, 2023.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Zhong et al.(2022)Zhong, Ma, Yao, Xue, Wu, and Wang</label><mixed-citation>
Zhong, X., Ma, Z., Yao, Y., Xue, L., Wu, Y., and Wang, Z.: Code and Data for
“A Bridge between WRF and Deep Learning Parameterization and
Application on Deep Learning Parameterization of Atmospheric Radiative
Transfer”, Zenodo [code and data set], <a href="https://doi.org/10.5281/zenodo.7407487" target="_blank">https://doi.org/10.5281/zenodo.7407487</a>, 2022.
</mixed-citation></ref-html>--></article>
