<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-19-3875-2026</article-id><title-group><article-title>Representing subgrid-scale cloud effects in a radiation parameterization using machine learning: MLe-radiation v1.0</article-title><alt-title>Improved radiation parameterization</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Hafner</surname><given-names>Katharina</given-names></name>
          <email>hafner@iup.physik.uni-bremen.de</email>
        <ext-link>https://orcid.org/0009-0009-5272-0409</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Shamekh</surname><given-names>Sara</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Bertoli</surname><given-names>Guillaume</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Lauer</surname><given-names>Axel</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-9270-1044</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Pincus</surname><given-names>Robert</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Savre</surname><given-names>Julien</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2 aff1">
          <name><surname>Eyring</surname><given-names>Veronika</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-6887-4885</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>University of Bremen, Institute of Environmental Physics (IUP), Bremen, Germany</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Courant Institute of Mathematical Sciences, New York University (NYU), New York, NY, USA</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Department of Earth and Environmental Engineering, Columbia University, New York, NY, USA</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Lamont-Doherty Earth Observatory, Palisades, New York, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Katharina Hafner (hafner@iup.physik.uni-bremen.de)</corresp></author-notes><pub-date><day>13</day><month>May</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>9</issue>
      <fpage>3875</fpage><lpage>3891</lpage>
      <history>
        <date date-type="received"><day>6</day><month>October</month><year>2025</year></date>
           <date date-type="rev-request"><day>16</day><month>October</month><year>2025</year></date>
           <date date-type="rev-recd"><day>6</day><month>March</month><year>2026</year></date>
           <date date-type="accepted"><day>7</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Katharina Hafner et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026.html">This article is available from https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e163">Improvements of Machine Learning (ML)-based radiation emulators remain constrained by the underlying assumptions to represent horizontal and vertical subgrid-scale cloud distributions, which continue to introduce substantial uncertainties. In this study, we introduce a method to represent the impact of subgrid-scale clouds by applying ML to learn processes from high-resolution model output with a horizontal grid spacing of 5 <inline-formula><mml:math id="M1" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. In global storm resolving models, clouds begin to be explicitly resolved. Coarse-graining these high-resolution simulations to the resolution of coarser Earth System Models yields radiative heating rates that implicitly include subgrid-scale cloud effects, without assumptions about their horizontal or vertical distributions. We define the cloud radiative impact as the difference between all-sky and clear-sky radiative fluxes, and train the ML component solely on this cloud-induced contribution to heating rates. The clear-sky tendencies remain being computed with a conventional physics-based radiation scheme. This hybrid design enhances generalization, since the machine-learned part addresses only subgrid-scale cloud effects, while the clear-sky component remains responsive to changes in greenhouse gas or aerosol concentrations. Applied to coarse-grained data offline, the ML-enhanced radiation scheme reduces errors by a factor of 4–10 compared with a conventional coarse-scale radiation scheme. This shows the potential of representing subgrid-scale cloud effects in radiation schemes with ML for the next generation of Earth System Models.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Deutsche Forschungsgemeinschaft</funding-source>
<award-id>EY 22/2-1</award-id>
</award-group>
<award-group id="gs2">
<funding-source>Horizon 2020</funding-source>
<award-id>855187</award-id>
</award-group>
<award-group id="gs3">
<funding-source>Deutsches Klimarechenzentrum</funding-source>
<award-id>bd1179</award-id>
</award-group>
<award-group id="gs4">
<funding-source>National Science Foundation</funding-source>
<award-id>2019625</award-id>
</award-group>
<award-group id="gs5">
<funding-source>Jülich Supercomputing Centre, Forschungszentrum Jülich</funding-source>
<award-id>icon-a-ml</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

      
<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e185">For climate projections, coarse-scale Earth System Models (ESMs) typically have horizontal resolutions of 100–200 <inline-formula><mml:math id="M2" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>
<xref ref-type="bibr" rid="bib1.bibx8" id="paren.1"/>, in which clouds cannot be resolved explicitly. Therefore, these models require parameterizations of fractional cloudiness, particularly for cloud–radiation interactions at subgrid scales. A widely used approach is the Monte Carlo Independent Column Approach (McICA) <xref ref-type="bibr" rid="bib1.bibx34" id="paren.2"/>, where g-points are randomly assigned as cloudy or clear-sky according to the cloud fraction. This stochastic simplification introduces uncertainties in cloud–radiation interactions of up to 100 <inline-formula><mml:math id="M3" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">W</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> in surface fluxes, corresponding to relative errors of 10 % or more. However, these errors are unbiased compared to the Independent Column Approach (ICA) <xref ref-type="bibr" rid="bib1.bibx1" id="paren.3"/>, where entire subcolumns are randomly designed as cloudy or clear, and thus tend to average out in long ESM simulations <xref ref-type="bibr" rid="bib1.bibx34" id="paren.4"/>. Vertical cloud overlap is commonly parameterized using the maximum-random overlap assumption <xref ref-type="bibr" rid="bib1.bibx37" id="paren.5"/>, whereby adjacent layers overlap maximally while distant cloud layers overlap randomly. Alternatively, some models employ an <italic>all-or-nothing</italic> cloud cover scheme, which is a good approximation in high-resolution simulations <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx23" id="paren.6"/>.</p>
      <p id="d2e235">Machine learning (ML)-based radiation emulators have been developed for more than two decades <xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx47 bib1.bibx39 bib1.bibx33 bib1.bibx28 bib1.bibx29" id="paren.7"/>. One potentially appealing aspect of ML-based emulators is their relative speed compared to traditional radiative transfer models, which in principle allows more frequent calls during ESM integration. In practice, however, the speed-up potential has proven limited <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx43 bib1.bibx18" id="paren.8"/>, with faster performance achievable through code optimization <xref ref-type="bibr" rid="bib1.bibx10 bib1.bibx44" id="paren.9"/>. Moreover, improvements from more frequent radiation calls <xref ref-type="bibr" rid="bib1.bibx18" id="paren.10"/> remain marginal. Nonetheless, ML-based radiation emulators are valuable in specific context, such as differentiable ESMs <xref ref-type="bibr" rid="bib1.bibx27 bib1.bibx26" id="paren.11"/> – where the derivative can be calculated for a complete model integration step – or GPU-based ESMs, where they are more energy efficient than physics-based schemes <xref ref-type="bibr" rid="bib1.bibx43" id="paren.12"/>. Despite their high accuracy, emulators still face challenges in cloudy conditions <xref ref-type="bibr" rid="bib1.bibx19" id="paren.13"/>, which may partly explain why they have not yet shown substantial improvements over traditional radiation schemes.</p>
      <p id="d2e260">As more high-resolution Global Storm Resolving Model (GSRM) data become available, they offer increasing opportunities to enhance coarse-scale ESMs with machine learning. High-resolution simulations have been applied to nudge coarse-scale models toward fine-scale states <xref ref-type="bibr" rid="bib1.bibx6" id="paren.14"/>, to learn subgrid tendencies directly <xref ref-type="bibr" rid="bib1.bibx21 bib1.bibx7" id="paren.15"/>, to infer subgrid effects from coarse-scale states <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx40" id="paren.16"/> and to learn all physics parameterizations <xref ref-type="bibr" rid="bib1.bibx45" id="paren.17"/>. Beyond these applications, high-resolution model simulations enable new strategies for representing fractional cloudiness and cloud overlap in coarse-scale ESMs using ML. Specifically, GSRM output can be coarse-grained to the resolution of the target ESM, and a neural network (NN) can then be trained to learn the subgrid-scale distribution of clouds from the underlying statistics. For instance, <xref ref-type="bibr" rid="bib1.bibx20" id="text.18"/> predicted coarse-grained cloud fields to reduce radiative biases; however, the cloud overlap assumption in the radiation scheme remains unchanged, limiting improvements. Leveraging ML to represent subgrid-scale cloud effects on radiation thus provide a promising pathway toward accurate radiation schemes in ESMs for climate projections.</p>
      <p id="d2e278">In this work, we demonstrate how the representation of cloud radiative impacts on heating rates can be improved by separating them from the all-sky heating rates and learning the cloud contribution directly from high-resolution simulations. Because these simulations explicitly resolve cloud systems without relying on assumptions about their horizontal and vertical distributions, the ML model can implicitly account for subgrid-scale cloud effects on the radiative heating rates. A similar separation between clear-sky and cloudy radiative fluxes was previously proposed by <xref ref-type="bibr" rid="bib1.bibx9" id="text.19"/>. Moreover, <xref ref-type="bibr" rid="bib1.bibx32" id="text.20"/> focused on learning 3D cloud radiative effects. However, the explicit separation of cloud radiative impacts from all-sky radiation, especially including subgrid-scale effects, remains largely unexplored.</p>
      <p id="d2e288">We aim to use ML to encode the true variability of vertical and horizontal subgrid-scale clouds and their radiative impacts from high-resolution simulations, rather than relying on statistical schemes. This raises the central question: “Can the representation of cloud–radiation interactions be improved by training on high-resolution simulation data?” To address this, we compare our ML approach against a state-of-the-art physics-based radiative transfer model, namely RTE+RRTMGP <xref ref-type="bibr" rid="bib1.bibx35" id="paren.21"/> with McICA <xref ref-type="bibr" rid="bib1.bibx34" id="paren.22"/> and maximum-random overlap <xref ref-type="bibr" rid="bib1.bibx37" id="paren.23"/> to represent subgrid-scale cloudiness. In particular, we evaluate how a physics-based coarse-scale radiation scheme performs when applied to coarse-grained data. This comparison allows us to disentangle differences arising from changes in the input parameters from those due to different resolutions.</p>
      <p id="d2e300">This paper is structured as follows. Section <xref ref-type="sec" rid="Ch1.S2"/> describes the method used to learn the cloud radiative impact from high-resolution simulation data. Section <xref ref-type="sec" rid="Ch1.S3"/> introduces the main datasets, generated with the ICOsahedral Non-hydrostatic (ICON) model<xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx14" id="paren.24"/> in both high- and coarse-resolution configurations, and includes a comparison of the input and output variables of the radiation parameterization across these resolutions. In Sect. <xref ref-type="sec" rid="Ch1.S4"/>, we present the main results by comparing the physics-based coarse-scale radiation parameterization with the ML-enhanced scheme on coarse-grained test data. Finally, Sect. <xref ref-type="sec" rid="Ch1.S5"/> provides a discussion and conclusion.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e316">Sketch of constructing the cloud radiative impact on heating rates. Radiation schemes calculate fluxes for the same scene once with and once without clouds resulting in all-sky and clear-sky fluxes. The corresponding heating rates can be inferred from the fluxes and the residual yields an approximation of the cloud radiative impact on heating rates for all layers in a column.</p></caption>
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f01.png"/>

      </fig>

</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Learning the Cloud Radiative Impact on Heating Rates</title>
      <p id="d2e333">We define the cloud radiative impact on heating rates in a column as the residual between the all-sky and clear-sky heating rates, where clear-sky represents the same atmospheric conditions as all-sky but without clouds (Fig. <xref ref-type="fig" rid="F1"/>a). In ESMs such as the ICON model <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx14" id="paren.25"/>, radiation parameterizations like RTE+RRTMGP <xref ref-type="bibr" rid="bib1.bibx35" id="paren.26"/> first calculate gas optical properties and then add cloud optical properties. Then, these combined properties are used to calculate all-sky radiative fluxes, and clear-sky fluxes can be obtained omitting the cloud optical properties. Heating rates are then derived from the flux divergence for both all-sky and clear-sky conditions. The residual of these heating rates, which is the cloud radiative impact (CRI) on the heating rates, serves as the training target for the ML-based radiation parameterization:

          <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M4" display="block"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mtext>CRI</mml:mtext></mml:msub></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mtext>all-sky</mml:mtext></mml:msub></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mtext>clear-sky</mml:mtext></mml:msub></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e400">The heating rates <inline-formula><mml:math id="M5" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula> in a layer <inline-formula><mml:math id="M6" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> are calculated from the net flux <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mtext>Net</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> at the layer boundaries <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>±</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>


          <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M9" display="block"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mtext>Net</mml:mtext><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mtext>Net</mml:mtext><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub><mml:msub><mml:mi>m</mml:mi><mml:mtext>air</mml:mtext></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mtext>air</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is the mass of moist air per area, and specific heat at constant volume <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> scales with the tracer mixing ratios.</p>
      <p id="d2e560">The central idea is to train a neural network (NN) that learns only the cloud impact on heating rates (see Fig. <xref ref-type="fig" rid="F1"/>). Cloud–radiation interactions are subject to large uncertainties, since the subgrid-scale horizontal and vertical distributions of clouds are not resolved in coarse-scale ESMs. In the hybrid ML-physics radiation parameterization (Fig. <xref ref-type="fig" rid="F1"/>b), the NN predicts only the cloud radiative impact, while the clear-sky component is still computed by the original radiation parameterization. This design ensures that the ML-enhanced radiation scheme retains sensitivity to changes in GHG and aerosols through the clear-sky part, potentially improving generalization across different climates. The ML-cloud component can respond to GHG and aerosols changes only indirectly, for example through modifications of the cloud distribution. However, this hybrid approach does not capture secondary effects arising from reflected radiation. The validity is further discussed in the Appendix <xref ref-type="sec" rid="App1.Ch1.S4"/>.</p>
      <p id="d2e569">At first glance, a linear decomposition of the clear-sky heating rate and the cloud radiative impacts on heating rates may seem counterintuitive, given the inherently nonlinear interaction among specific humidity, clouds, and trace gases such as ozone <xref ref-type="bibr" rid="bib1.bibx5" id="paren.27"/>. Nevertheless, we adopt a linear decomposition framework in this work, as illustrated on Fig. <xref ref-type="fig" rid="F1"/>. Within this framework, the NN is tasked with learning the nonlinear relationships based on the prevailing atmospheric conditions and cloud-related variables (e.g., cloud liquid water and cloud ice).</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Method</title>
      <p id="d2e586">We use a bidirectional long short term memory (BiLSTM) based on <xref ref-type="bibr" rid="bib1.bibx19" id="text.28"/> to learn the cloud radiative impact in an atmospheric column. Bidirectional architectures are particularly well-suited for radiative transfer problems <xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx47 bib1.bibx43 bib1.bibx2" id="paren.29"/>. Unlike their common usage for temporal sequences, BiLSTMs for radiation scan the vertical dimension in both directions, resembling upward and downward fluxes. The NN consists of one BiLSTM layer with <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>n</mml:mi><mml:mi>h</mml:mi></mml:mrow></mml:math></inline-formula> activation and a hidden dimension of 96, with two LSTMs scanning the input vertical profiles in upward and downward directions. The combined output of the LSTMs is then processed by a dense layer that predicts the heating rate at each level. The training is split between shortwave (SW) and longwave (LW) radiation as shortwave temperature tendencies are only calculated for sun-lit areas and longwave temperature tendencies are always computed, totaling in 82k trainable parameters per NN.</p>
      <p id="d2e609">The input variables are vertical profiles of mass mixing ratios of specific humidity <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, cloud liquid <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, cloud ice <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, snow <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and ozone <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, plus the cloud fraction <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula>, air density <inline-formula><mml:math id="M19" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>, and temperature <inline-formula><mml:math id="M20" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>. For SW, we additionally use surface albedo <inline-formula><mml:math id="M21" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> and incoming solar flux at the top of the atmosphere <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mo>↓</mml:mo><mml:mo>,</mml:mo><mml:mtext>TOA</mml:mtext><mml:mo>,</mml:mo><mml:mtext>SW</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, which is the solar constant weighted by the solar zenith angle and change in Earth-Sun distance to account for daily and seasonal variations. For LW, we additionally use the surface temperature <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mtext>surf</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> as input. As mentioned above, the output is the cloud radiative effect on heating rates derived from all-sky and clear-sky heating rates (Fig.  <xref ref-type="fig" rid="F1"/>).</p>
      <p id="d2e733">Normalization is important for faster convergence of the training <xref ref-type="bibr" rid="bib1.bibx30" id="paren.30"/> and generalization <xref ref-type="bibr" rid="bib1.bibx3" id="paren.31"/>. <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M25" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M26" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mtext>surf</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are normalized using their mean values <inline-formula><mml:math id="M28" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and standard deviation <inline-formula><mml:math id="M29" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, also known as <inline-formula><mml:math id="M30" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>-score normalization (<inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mtext>norm</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>x</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>). The normalization factors are computed using all cells of one time step and are one-dimensional such that the vertical structure of the variables remain. The vertical structure is an important aspect that the BilSTM uses to make a vertically correlated prediction. The water related variables <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are normalized by the ambient total (radiatively active) water (<inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">l</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>). Here, radiatively active means used in the radiation scheme. <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> uses <inline-formula><mml:math id="M37" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>-score normalization providing information that relates to the absolute mass mixing ratios. <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M39" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> are not normalized as they already vary between 0 and 1. <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mo>↓</mml:mo><mml:mo>,</mml:mo><mml:mtext>TOA</mml:mtext><mml:mo>,</mml:mo><mml:mtext>SW</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is normalized by 1360 W m<sup>−2</sup>, which is close to the solar constant. The cloud radiative impact on heating rates is only converted to <inline-formula><mml:math id="M42" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, which is on the order of one.</p>
      <p id="d2e975">The loss we minimize during training consists of the sum of mean absolute error (MAE) and mean squared error (MSE). We use the Adam optimizer <xref ref-type="bibr" rid="bib1.bibx25" id="paren.32"/>, and set a learning rate of 10<sup>−3</sup>, which is reduced by a factor of 2 when the validation loss does not decrease by 0.1 % for 5 epochs. To avoid overfitting, we use early stopping, which stops the training if the validation loss does not decrease for 10 epochs.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Data</title>
      <p id="d2e1002">ICON is a weather and climate model permitting simulations across different resolutions, from a few to hundreds of kilometers. For global and long-term applications, the atmospheric component ICON-A <xref ref-type="bibr" rid="bib1.bibx13" id="paren.33"/> often runs at 80 <inline-formula><mml:math id="M44" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> horizontal resolution, and 47 vertical levels covering the altitude range 0–83 <inline-formula><mml:math id="M45" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>, with parameterizations for radiation, cloud microphysics, turbulence, convection and gravity waves. The ICON model has the option to be run as a GSRM <xref ref-type="bibr" rid="bib1.bibx41 bib1.bibx14 bib1.bibx23" id="paren.34"/>. The high-resolution simulations used here follow the QUBICC (Quasi-Biennial oscillation in a changing climate) protocol from <xref ref-type="bibr" rid="bib1.bibx14" id="text.35"/>. The QUBICC simulations have 5 <inline-formula><mml:math id="M46" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> horizontal resolution, and the vertical dimension spans 83 <inline-formula><mml:math id="M47" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> on 191 levels. The high-resolution allows the model to run without a convection scheme and gravity wave parameterization, as these processes are starting to be resolved.</p>
      <p id="d2e1047">We performed QUBICC simulations that cover a total of 40 <inline-formula><mml:math id="M48" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> evenly distributed across four months: November 2004, January, April, July 2005. The simulations have a physics time step of 40 <inline-formula><mml:math id="M49" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:math></inline-formula> and a radiation time step of 6 <inline-formula><mml:math id="M50" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">min</mml:mi></mml:mrow></mml:math></inline-formula>. These simulations are run with prescribed sea surface temperatures, sea ice concentrations, greenhouse gas concentrations but no aerosols. The outputs are saved every 192 <inline-formula><mml:math id="M51" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">min</mml:mi></mml:mrow></mml:math></inline-formula>. The uneven output interval was chosen to cover a large variability of different solar zenith angles at different locations. The first 6 <inline-formula><mml:math id="M52" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> of each 10 <inline-formula><mml:math id="M53" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> period is used for training, the next 2 <inline-formula><mml:math id="M54" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for validation and the last 2 <inline-formula><mml:math id="M55" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for testing. Using QUBICC data to train our ML model for ICON-A requires coarse-graining the high-resolution QUBICC dataset. All variables are horizontally and vertically coarse-grained from high-resolution simulations as in <xref ref-type="bibr" rid="bib1.bibx15" id="text.36"/>. For horizontal remapping, the high-resolution cells are weighted by their cell area that is contained in each coarse cell. High-resolution cells can be contained fully or partially in a coarse cell, which is represented by a smaller cell area. Similarly, the layer thickness was used for vertical coarse-graining, corresponding to a weighted average. This method is valid for all input and output variables used here, i.e., concentrations and fluxes. Cloud-related variables are expressed as mass fraction, ensuring consistent vertical coarse-graining. Absolute mass variables, such as air mass <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mtext>air</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>), are coarse-grained by the (weighted) vertical sum that are partially or fully contained in a coarse layer. We discarded a small number of coarse-grained cells if their surface height showed inconsistencies, which can occur over complex terrain. Specifically, the coarse-grained surface height was computed from the high-resolution grid file and compared to the surface height from the coarse-scale grid file, which is slightly rotated and shifted. Consequently, some high-resolution cells are only partially contained in a coarse cell, which can lead to a small mismatch in surface height. Cells with deviations of more than 0.5 <inline-formula><mml:math id="M57" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> in surface height were discarded. Then, we randomly sampled 35k and 5k grid points per time step for the training and validation set. For the test set, we randomly selected 75k cells per time step for LW and 35k cells for SW. This yields 2.3 million training samples, 260k validation samples, 1.9 million test samples for shortwave and 4.2 million test samples for longwave radiation.</p>
      <p id="d2e1140">For comparison with the high-resolution data, we also made coarse-scale ICON-A simulations for the same periods and with a similar configuration at 80 <inline-formula><mml:math id="M58" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> horizontal resolution to compare differences in various distributions. The ICON-A simulations are based on the version 2.6.4 described in <xref ref-type="bibr" rid="bib1.bibx13" id="text.37"/>, while the QUBICC simulations are based on the version icon-2024.10 <xref ref-type="bibr" rid="bib1.bibx24" id="paren.38"/>. To make the coarse-scale simulation more comparable, we used the same microphysics scheme <xref ref-type="bibr" rid="bib1.bibx12" id="paren.39"/> and no aerosols. See Appendix for a comparison with the default microphysics scheme. Both simulations use the radiation scheme RTE+RRTMGP <xref ref-type="bibr" rid="bib1.bibx35" id="paren.40"/>. ICON-A uses McICA and maximum-random overlap to represent subgrid-scale cloud-radiative impacts. In the QUBICC simulation, the <italic>all-or-nothing</italic> scheme is used, assuming horizontal homogeneous distribution of clouds. The physics and radiation time step is 6 <inline-formula><mml:math id="M59" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">min</mml:mi></mml:mrow></mml:math></inline-formula> for the ICON-A simulation, matching the radiation time step in QUBICC.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e1178">Distributions of water related input variables for ICON-A and coarse-grained QUBICC data. The bold line shows the mean and the shaded area shows 95 % of the spread between the 2.5 % percentile and the 97.5 % percentile. The boxplot is limited by the minimum and maximum values. The box edges are defined at the 25 % and 75 % percentile of the distribution. The black line illustrates the mean of the distribution and the star is the median.</p></caption>
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f02.png"/>

      </fig>

      <p id="d2e1187">The remaining differences between the coarse-scale ICON-A and high-resolution QUBICC simulation include the horizontal and vertical resolution, the higher temporal resolution of physical process in QUBICC, which is required due to the high spatial resolution. Moreover, QUBICC intends to resolve gravity waves and (deep) convection. Additionally, the radiation scheme RTE+RRTMGP <xref ref-type="bibr" rid="bib1.bibx35" id="paren.41"/> in QUBICC uses snow mixing ratio as an input. Other differences could be due to differences between the code versions and tuning, as we did not retune ICON-A.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Comparison of input and output variables</title>
      <p id="d2e1200">If an ML-based scheme is trained on high-resolution simulations like QUBICC and the goal is to couple it to a coarse-scale model like ICON-A, one needs to ensure that the distributions of input and output variables match. Otherwise, the ML-based scheme could be faced with out-of-distribution samples, which can lead to errors that quickly build-up while the model is integrated <xref ref-type="bibr" rid="bib1.bibx38" id="paren.42"/>. Therefore, we analyze systematic differences between the coarse-scale ICON-A simulations and the high-resolution QUBICC simulations. This analysis is conducted by comparing the spatial and temporal means and spread in the input and output used and produced by the radiation parameterization. Specifically, we focus on <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula>, total cloud cover, <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, as well as longwave and shortwave heating rates. For the comparison, we use all samples in the test set. The samples of both simulations cover the same time period which is November 2004, January, April, and July 2005. Comparing two simulations with different grids enables us to uncover systematic differences, with a focus on identifying larger variations.</p>
      <p id="d2e1261">Distributions of water related input variables are shown in Fig. <xref ref-type="fig" rid="F2"/>. The distributions of specific humidity look similar in the coarse-grained and coarse-scale data set, where the maximum differences between the means is 0.9 <inline-formula><mml:math id="M65" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F2"/>a). The spread in humidity values also overlaps for both simulations. Cloud water has higher values below 3 <inline-formula><mml:math id="M66" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> for the coarse-scale simulation while cloud water is more evenly distributed throughout the troposphere for the QUBICC simulations (Fig. <xref ref-type="fig" rid="F2"/>b). Here, the maximum difference of the mean values is 0.02 <inline-formula><mml:math id="M67" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. The distributions of cloud ice have similar shapes, but the coarse-grained distribution peaks higher by 0.002 <inline-formula><mml:math id="M68" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F2"/>c). The mean cloud area fraction is on average larger by 3 % for the coarse-grained data set below 14 <inline-formula><mml:math id="M69" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F2"/>d). Snow peaks between 5–10 <inline-formula><mml:math id="M70" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F2"/>f). However, the vertical distribution is wider in the coarse-grained simulation. Nevertheless, snow was not used as input for radiation in ICON-A.</p>
      <p id="d2e1353">For comparability, the coarse-grained heating rates from QUBICC had to be rescaled by a factor <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to account for differences in the two model versions used, where <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is specific heat at constant pressure.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e1388">The distribution of shortwave (top row) and longwave (bottom row) heating rates in coarse-scale and scaled coarse-grained data. The bold line shows the mean and the shaded area shows 95 % of the spread, which is defined as the spread between the 2.5 % percentile and the 97.5 % percentile. The left column shows all-sky heating rates as it is used in the ICON model. The middle column shows clear-sky heating rate computed from clear-sky fluxes, which is a diagnostic output in the ICON model. The right column shows the cloud radiative impact on the heating rate which was computed by subtracting the clear-sky heating rate from the all-sky heating rate.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f03.png"/>

        </fig>

      <p id="d2e1397">For SW all-sky heating rates, the mean profiles match very well and the mean difference is 0.18 <inline-formula><mml:math id="M73" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F3"/>a). There are only small differences in the spread of heating rates in the troposphere. The heating rate is decomposed into a clear-sky heating rate – which is calculated from the clear-sky fluxes – and the cloud radiative impact on heating rates. Their distributions are also shown in Fig. <xref ref-type="fig" rid="F3"/>b and c. The SW clear-sky heating rate has a mean difference of 0.11 <inline-formula><mml:math id="M74" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>  for the mean profiles. For the SW cloud radiative impact, the mean difference is also 0.11 <inline-formula><mml:math id="M75" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> . Here, we expect small differences due to mostly resolved clouds in the coarse-grained dataset. However, there is no clear bias between coarse-scale and coarse-grained cloud impacts.</p>
      <p id="d2e1455">For the LW all-sky heating rates, the mean profiles match well and have a mean difference of 0.18 <inline-formula><mml:math id="M76" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F3"/>d). However, the spread in heating rates is slightly different, which co-occurs with the different spread in cloud water at 1 <inline-formula><mml:math id="M77" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F2"/>b). The LW clear-sky heating rates look very similar in their mean values and their spread where the mean difference is 0.14 <inline-formula><mml:math id="M78" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F3"/>e). The LW cloud impact is concentrated in the troposphere (Fig. <xref ref-type="fig" rid="F3"/>f). The mean impact is almost the same between coarse-scale and coarse-grained simulations with a mean difference of 0.09 <inline-formula><mml:math id="M79" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. However, there is a difference in spread, which can be expected, because the coarse-grained data set implicitly includes subgrid-scale cloud effects.</p>
      <p id="d2e1526">In the unscaled comparison, the clear-sky heating rates show the same bias as the all-sky heating rates for both SW and LW (Fig. <xref ref-type="fig" rid="FB1"/>). However, this bias does not directly translate to the cloud radiative impact on heating rates because adding the cloud impact is a highly non-linear process <xref ref-type="bibr" rid="bib1.bibx5" id="paren.43"/>. Additionally, this indicates that the mean cloud effect is similar for a (quasi)-hydrostatic and a non-hydrostatic model.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e1537">All datasets used in this study. Online means that a simulation was run in high or coarse resolution and the output was saved for certain time steps. Offline means that a parameterization (RTE+RRTMGP or MLe-radiation) computed its tendencies based on model output.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="124mm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Name</oasis:entry>
         <oasis:entry colname="col2">Offline/Online</oasis:entry>
         <oasis:entry colname="col3" align="left">Purpose</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">QUBICC</oasis:entry>
         <oasis:entry colname="col2">Online</oasis:entry>
         <oasis:entry colname="col3" align="left">High-resolution simulation that is coarse-grained to serve as training set, compared to ICON-A in Sect. <xref ref-type="sec" rid="Ch1.S3"/> and “ground truth” in Sect. <xref ref-type="sec" rid="Ch1.S4"/>.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ICON-A</oasis:entry>
         <oasis:entry colname="col2">Online</oasis:entry>
         <oasis:entry colname="col3" align="left">Coarse-scale simulation used in Sect. <xref ref-type="sec" rid="Ch1.S3"/> to identify systematic biases between coarse-scale and high-resolution simulations.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Baseline</oasis:entry>
         <oasis:entry colname="col2">Offline</oasis:entry>
         <oasis:entry colname="col3" align="left">Calculated using pyRTE based on coarse-grained samples from QUBICC to provide the coarse-scale reference of radiative tendencies, serves as “baseline” and used to measure improvement.</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MLe-radiation</oasis:entry>
         <oasis:entry colname="col2">Offline</oasis:entry>
         <oasis:entry colname="col3" align="left">New ML-enhanced radiation parameterization trained on coarse-grained samples from QUBICC computes tendencies, the tendencies are compared to coarse-grained QUBICC and baseline tendencies.</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e1626">Comparison of the baseline and the ML-enhanced radiation scheme on coarse-grained QUBICC data. Results are shown for the shortwave (top rows) and longwave (bottom rows) spectral range. The results are shown separately for clear-sky samples (no clouds, left column), fully cloudy sky samples (second column), and samples with partial cloudiness (third column). Additionally, the results are separated by non-precipitating (fourth column) and precipitating clouds (fifth column). The shown metrics are coefficient of determination <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (green), bias (orange) and MAE (blue) with 95 % of the spread, which is defined as the spread between the 2.5 % percentile and the 97.5 % percentiles. The bias and MAE share the <inline-formula><mml:math id="M81" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-axis. The ML-clear-sky panels are gray because the clear-sky fluxes are not calculated by the ML model, but shown as reference.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f04.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
      <p id="d2e1663">As mentioned above, the simulation outputs of QUBICC and ICON-A can't be used for a sample-by-sample comparison. Therefore, a coarse-scale radiation reference is need that is computed (offline) based on coarse grained QUBICC output. All the dataset used in this study are summarized in Table <xref ref-type="table" rid="T1"/>.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e1670">As Fig. <xref ref-type="fig" rid="F4"/>, but for the selected regions shown in Fig. 5 of <xref ref-type="bibr" rid="bib1.bibx4" id="text.44"/>.</p></caption>
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f05.png"/>

      </fig>

      <p id="d2e1684">For the coarse-scale radiation reference, we use the Python front-end (hereafter pyRTE) <xref ref-type="bibr" rid="bib1.bibx36" id="paren.45"/> of the radiation scheme RTE+RRTMGP <xref ref-type="bibr" rid="bib1.bibx35" id="paren.46"/>, which is used in all our simulations. Using pyRTE, we replicate the implementation of radiation in QUBICC as closely as possible. For subgrid-scale variability, we employ McICA <xref ref-type="bibr" rid="bib1.bibx34" id="paren.47"/> together with maximum-random cloud overlap <xref ref-type="bibr" rid="bib1.bibx37" id="paren.48"/>. The procedure is as follows: first we calculate gas optical properties, assign them to the atmospheric state and calculate clear-sky fluxes. Next, we compute cloud optical properties, apply McICA with maximum-random overlap to represent subgrid-scale variability, and add cloud optical properties to the gas optical properties. Because QUBICC also includes snow in its radiation parameterization, we additionally calculate snow optical properties and combine them with the other optical properties, before computing all-sky fluxes. Then, heating rates are obtained applying Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>). We calculate heating rates for all samples in the test dataset. The output of pyRTE is used as baseline as it represents the coarse-scale radiation scheme. This allows a sample-by-sample comparison to the coarse-grained heating rates obtained from QUBICC (first and third row in Figs. <xref ref-type="fig" rid="F4"/> and <xref ref-type="fig" rid="F5"/>). The ML-enhanced radiation scheme predicts the cloud radiative impact, which is added to clear sky heating. The resulting all-sky heating is compared to the coarse-grained QUBICC heating rates (second and third row in Figs. <xref ref-type="fig" rid="F4"/> and <xref ref-type="fig" rid="F5"/>). As evaluation metric, we use mean absolute error (MAE), bias, and the coefficient of determination (<inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). The improvement is measured by comparing MLe-radiation to the baseline. The results are presented in Fig. <xref ref-type="fig" rid="F4"/> with the column-averaged metrics summarized in Table <xref ref-type="table" rid="TA1"/>. We restrict the presentation to the lowest 20 <inline-formula><mml:math id="M83" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> of the atmosphere, as cloud impacts are most relevant in the troposphere (see Fig. <xref ref-type="fig" rid="F3"/>). Nevertheless, the NN predicts the cloud impact on the entire atmospheric column, and the results for the full column can be found in the Appendix <xref ref-type="fig" rid="FC1"/>.</p>
      <p id="d2e1739">The first column of Fig. <xref ref-type="fig" rid="F4"/> shows clear-sky heating rates, for which the cloud distribution plays no role. Accordingly, only small and statistically insignificant differences are expected, arising mainly from variability in water vapor. While coarse-grained QUBICC heating rates reflect subgrid-scale variability of water vapor, coarse-scale radiation schemes assume a horizontal homogeneous distribution of water vapor. Therefore, the assumption of horizontally homogeneous input parameters introduces a small error of 0.367 <inline-formula><mml:math id="M84" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (SW) and 0.571 <inline-formula><mml:math id="M85" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (LW) for the baseline. For comparison with other studies, Table <xref ref-type="table" rid="TA1"/> also reports the root mean squared error (RMSE). For clear-sky heating rates of the coarse-scale baseline, the RMSE is 0.443 <inline-formula><mml:math id="M86" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (SW) and 0.688 <inline-formula><mml:math id="M87" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (LW) compared with the coarse-grained QUBICC heating rates. <xref ref-type="bibr" rid="bib1.bibx22" id="text.49"/> developed a fast tool for computing gas-optical properties and reports an RMSE of 0.18 <inline-formula><mml:math id="M88" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. Although the error source differs (gas optical properties vs. spatial resolution), the magnitudes are comparable.</p>
      <p id="d2e1835">For completeness, we also evaluated the ML-model on clear-sky samples. However, it is not intended for clear-sky scenes as the cloud impact is zero. Therefore, the corresponding panels are grayed out. The MAE is 0.049 <inline-formula><mml:math id="M89" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for SW and 0.028 <inline-formula><mml:math id="M90" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for LW.</p>
      <p id="d2e1872">The second column of Fig. <xref ref-type="fig" rid="F4"/> shows results for fully cloudy samples (total cloud cover of 100 %). For the baseline, the MAE peaks near 10 <inline-formula><mml:math id="M91" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>, exceeding 0.5 <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for SW and 1 <inline-formula><mml:math id="M93" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for LW. The corresponding <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> are low, with averaged values of 0.83 (SW) and 0.66 (LW), compared to 0.98 for the ML-enhanced scheme. The average <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values are computed by averaging over the vertical levels. <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is weighted by variability, and an <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of zero indicates that the error is as large as the variability itself. Since, McICA within a coarse-scale radiation parameterization is supposed to produce unbiased noise, the <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is therefore a less informative metric. In contrast, the bias reveals that the coarse-scale radiation scheme systematically struggles to represent the cloud impact near 10 <inline-formula><mml:math id="M99" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>, particularly for SW. The ML-enhanced scheme produces nearly unbiased heating rates, with MAEs of 0.106 <inline-formula><mml:math id="M100" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (SW) and 0.127 <inline-formula><mml:math id="M101" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (LW), representing errors 4–6 times smaller than those of the baseline.</p>
      <p id="d2e2018">The third column of Fig. <xref ref-type="fig" rid="F4"/> shows results for partially cloudy scenes, defined here as total cloud cover between 10 %–90 %. In these cases, both schemes exhibit smaller errors than in fully cloudy scenes, consistent with the weaker overall cloud radiative impact. For the baseline, the bias is substantially smaller than fully cloudy conditions but still show a pronounced peak near 1 <inline-formula><mml:math id="M102" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> for LW and a double peak at 1 and 10 <inline-formula><mml:math id="M103" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> for SW. In contrast, the ML-enhanced samples produces nearly unbiased heating rates, with an MAE of 0.082 <inline-formula><mml:math id="M104" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (SW) and 0.068 <inline-formula><mml:math id="M105" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (LW), representing errors that are 5–10 times smaller than those of the baseline.</p>
      <p id="d2e2073">To interpret the double-peak error observed in partially cloudy samples, we divided the samples into precipitating and non-precipitating clouds, as a rough proxy for deep and shallow convection. Non-precipitating (warm) clouds were identified using thresholds of maximum 0.01 <inline-formula><mml:math id="M106" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">h</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, total cloud cover larger than 10 %, and vertically integrated ice water path of less than 1 g m<sup>−2</sup>. For reference, drizzle has a precipitation rate of 0.2 <inline-formula><mml:math id="M108" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx46" id="paren.50"/>. Precipitating clouds were identified by total cloud cover <inline-formula><mml:math id="M109" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 10 % and precipitation more than 3 <inline-formula><mml:math id="M110" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">h</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. For reference, <xref ref-type="bibr" rid="bib1.bibx49" id="text.51"/> reports mean precipitation of 3.5 <inline-formula><mml:math id="M111" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">h</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for deep convective cores over the tropical ocean. For non-precipitating clouds, both the coarse-scale radiation scheme and the ML-enhanced scheme show an MAE peak near 1 <inline-formula><mml:math id="M112" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> for SW and LW. However, the ML-enhanced scheme achieves substantially smaller errors of 0.080 <inline-formula><mml:math id="M113" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for SW and 0.069 <inline-formula><mml:math id="M114" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for LW, which are 6–10 times lower than those of the baseline. For the precipitating clouds, the baseline exhibits an MAE peak at 10–12 <inline-formula><mml:math id="M115" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>, while the ML-enhanced scheme shows enhanced error in the upper troposphere but without distinct peak. Instead, the ML-enhanced scheme shows a broader peak in MAE between 12–14 <inline-formula><mml:math id="M116" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. On average, however, the ML-enhanced error remain about 4–5 times smaller than those of the baseline.</p>
      <p id="d2e2229">We further evaluated the performance of the ML-enhanced scheme across five selected regions characterized by different predominant cloud regimes, following the classification of <xref ref-type="bibr" rid="bib1.bibx4" id="text.52"/>. The results are shown in Fig. <xref ref-type="fig" rid="F5"/> and summarized in Table <xref ref-type="table" rid="TA2"/>. In the arctic region (70–90° N), errors remain confined below 10 <inline-formula><mml:math id="M117" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> consistent with the lower tropopause height in this region.  For the baseline, the SW MAE is 0.215 <inline-formula><mml:math id="M118" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, even smaller than in the clear-sky conditions, although the spread in MAE is slightly larger. For LW, the MAE is 0.614 <inline-formula><mml:math id="M119" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, exceeding the clear-sky values. In contrast, the ML-enhanced scheme achieves errors that are 4–8 times smaller than those of the baseline.</p>
      <p id="d2e2283">In the Southern Ocean (30–65° S), the baseline exhibits large errors of 0.417 <inline-formula><mml:math id="M120" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for SW and 0.760 <inline-formula><mml:math id="M121" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for LW, with a characteristic double peak in the MAE at 1–2 and 10 <inline-formula><mml:math id="M122" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. The ML-enhanced scheme also reproduces this double peak structure in the MAE spread but reduces the MAE by a factor of 4–7 relative to the baseline. Over the tropical ocean (30° N–30° S), the baseline shows large errors in the upper troposphere, likely associated with deep convection. In this region, the ML-enhanced scheme again reduces the MAE by a factor of 5–9. A subregion within the tropical region, the Pacific ITCZ region (0–12° N, 135° E–85° W), shows similar behavior but with even larger errors at higher altitudes.</p>
      <p id="d2e2328">The stratocumulus region is represented by three sub-regions: the Southeast Pacific (10–30° N, 75–97° W), Southeast Atlantic (10–30° S, 10° W–10° E), and Northeast Pacific (15–35° N, 120–140° W). In these regions, both models show two peaks in the MAE: one in the lower troposphere at 1–2 <inline-formula><mml:math id="M123" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> and another in the upper troposphere at 10–12 <inline-formula><mml:math id="M124" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. Notably, the upper tropospheric peak is larger in both SW and LW for both models. Nevertheless, the ML-enhanced scheme achieves errors 5–9 times smaller than those of the baseline.</p>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e2355">ESMs struggle to represent subgrid-scale cloudiness and commonly rely on statistical schemes, such as McICA, to account for subgrid-scale cloud radiative effects <xref ref-type="bibr" rid="bib1.bibx34" id="paren.53"/>. Although random, unbiased errors can be mitigated by large-scale atmospheric mixing <xref ref-type="bibr" rid="bib1.bibx34" id="paren.54"/>, the column-by-column error can be large. ML algorithms trained on high-resolution, global storm-resolving simulations now provide an opportunity to represent fractional cloudiness in radiative transfer more accurately. To bridge scales, the high-resolution model output is coarse-grained to the target resolution, such that the coarse-grained variables implicitly retain subgrid-scale effects. Then, the coarse-grained variables implicitly include subgrid-scale effects.</p>
      <p id="d2e2364">We developed a hybrid physics-ML radiation parameterization, where the physics-based component computes clear-sky fluxes, while the ML component predicts cloud impact, implicitly accounting for subgrid-scale variability. This ML-enhanced framework offers a more robust and generalizable radiation scheme: the physics-based parameterization retains its responsiveness to changes in GHGs and aerosols, thereby mitigating potential out-of-distribution issues in climate projections. The ML component is implemented as a BiLSTM neural network, which has previously demonstrated strong performance in radiation applications <xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx47 bib1.bibx43 bib1.bibx2 bib1.bibx19" id="paren.55"/>. For training, we use data from high-resolution QUBICC simulations with a horizontal resolution of <inline-formula><mml:math id="M125" display="inline"><mml:mo>≈</mml:mo></mml:math></inline-formula> 5 <inline-formula><mml:math id="M126" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> and 191 vertical layers expanding up to 83 <inline-formula><mml:math id="M127" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. These fields are coarse-grained to <inline-formula><mml:math id="M128" display="inline"><mml:mo>≈</mml:mo></mml:math></inline-formula> 80 <inline-formula><mml:math id="M129" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> and 47 vertical layers, matching the target resolution for a coarse-scale ESM. For comparison and to assess systematic differences between high-resolution and coarse-scale models, we additionally perform a coarse-scale ICON-A simulation. The distributions of the relevant input variables are found to be comparable between the coarse-scale and coarse-grained simulations.</p>
      <p id="d2e2409">We find that a coarse-scale radiation scheme (baseline) performs well for clear-sky samples, but exhibits large errors in cloudy conditions, reflecting its inability to represent subgrid-scale distributions. In contrast, the ML-enhanced radiation scheme consistently outperforms the baseline, reducing errors from unresolved clouds in the radiative transfer calculations by a factor of 4–10. Although the ML-enhanced radiation scheme does not explicitly resolve subgrid-scale distributions, it learns how specific combinations of grid-scale mean states map to heating rates that implicitly include subgrid effects. THe baseline showed substantial biases of 1–3 <inline-formula><mml:math id="M130" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> in the upper troposphere at 10–15 <inline-formula><mml:math id="M131" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> for precipitating clouds, highlighting the strong influence of subgrid-scale cloud ice on heating rates. In general, both coarse-scale radiation scheme and the ML-enhanced scheme produce larger errors in cloudier conditions, but the ML-enhanced scheme consistently yields smaller errors. These results emphasize the need for a more explicit treatment of subgrid-scale clouds, particularly in the upper troposphere.</p>
      <p id="d2e2437">Therefore, we conclude that high-resolution model data combined with ML can improve the representation of cloud–radiation interactions in coarse-scale radiation parameterizations. Nevertheless, the presented approach has caveats. High-resolution simulations at 5 <inline-formula><mml:math id="M132" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> horizontal resolution cannot resolve shallow convection directly, leaving associated cloud radiative effects on heating rates – particularly within the planetary boundary layer – unresolved <xref ref-type="bibr" rid="bib1.bibx41" id="paren.56"/>. Using finer horizontal resolutions could help reduce these uncertainties. In addition, aerosols and heterogeneous GHG concentrations are typically absent from current high-resolution models. If future simulations include substantial variability in GHG and aerosol concentrations, these could be incorporated as additional NN inputs to capture secondary effects of reflected radiation.</p>
      <p id="d2e2452">One of the next steps is the online implementation of the ML-enhanced radiation scheme in a coarse-resolution model such as ICON-A. Although the online stability remains to be tested, the comparison with the coarse-scale model and results from previous stable hybrid simulations <xref ref-type="bibr" rid="bib1.bibx18" id="paren.57"/> are promising, suggesting potential improvement for climate projections. An additional advantage of the presented scheme is that clear-sky fluxes can be computed less frequently, while the cloud radiative impact can be updated every time step. This provides a pathway to both reducing computational costs and improving the representation of cloud–radiation interactions.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Default microphysics scheme</title>
      <p id="d2e2469">One major difference between the ICON-A and QUBICC versions is the microphysics scheme. ICON-A uses a modified version of the <xref ref-type="bibr" rid="bib1.bibx31" id="text.58"/> scheme and the QUBICC simulation uses the graupel scheme described in <xref ref-type="bibr" rid="bib1.bibx12" id="text.59"/>. While both schemes are single-moment schemes, the latter treats precipitating tracers like snow, rain and graupel as prognostic variables while the former only diagnoses snow and rain. Moreover, it is known that cloud ice is too large in the upper troposphere in ICON-A <xref ref-type="bibr" rid="bib1.bibx11" id="paren.60"/>, which we also see when comparing cloud ice in ICON-A and QUBICC (Fig. <xref ref-type="fig" rid="FA1"/>).</p>

      <fig id="FA1"><label>Figure A1</label><caption><p id="d2e2485">As Fig. <xref ref-type="fig" rid="F2"/> but for the default microphysics scheme based on <xref ref-type="bibr" rid="bib1.bibx31" id="text.61"/>.</p></caption>
        
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f06.png"/>

      </fig>

<table-wrap id="TA1" specific-use="star"><label>Table A1</label><caption><p id="d2e2504">Bulk statistics for heating rate results of the coarse-scale ML-based radiation emulator on coarse-grained QUBICC data. MAE is mean absolute error and <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is coefficient of determination. RMSE is root mean squared error. The percentage values in parentheses denote the relative values of MAE, bias and RMSE.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE [<inline-formula><mml:math id="M134" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>]</oasis:entry>
         <oasis:entry colname="col3">Bias [<inline-formula><mml:math id="M135" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>]</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">RMSE [<inline-formula><mml:math id="M137" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>]</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">pyRTE</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW clear</oasis:entry>
         <oasis:entry colname="col2">0.367 (8.47 %)</oasis:entry>
         <oasis:entry colname="col3">0.234 (4.91 %)</oasis:entry>
         <oasis:entry colname="col4">0.91</oasis:entry>
         <oasis:entry colname="col5">0.443 (10.56 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW cloudy</oasis:entry>
         <oasis:entry colname="col2">0.445 (16.73 %)</oasis:entry>
         <oasis:entry colname="col3">0.204 (4.22 %)</oasis:entry>
         <oasis:entry colname="col4">0.83</oasis:entry>
         <oasis:entry colname="col5">0.789 (41.54 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW partial</oasis:entry>
         <oasis:entry colname="col2">0.470 (12.24 %)</oasis:entry>
         <oasis:entry colname="col3">0.273 (4.65 %)</oasis:entry>
         <oasis:entry colname="col4">0.82</oasis:entry>
         <oasis:entry colname="col5">0.683 (23.55 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW non-precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.493 (12.11 %)</oasis:entry>
         <oasis:entry colname="col3">0.292 (4.79 %)</oasis:entry>
         <oasis:entry colname="col4">0.87</oasis:entry>
         <oasis:entry colname="col5">0.711 (19.98 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.778 (32.62 %)</oasis:entry>
         <oasis:entry colname="col3">0.244 (17.28 %)</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5">1.250 (58.35 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW clear</oasis:entry>
         <oasis:entry colname="col2">0.564 (23.56 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M138" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.349 (<inline-formula><mml:math id="M139" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>9.85 %)</oasis:entry>
         <oasis:entry colname="col4">0.83</oasis:entry>
         <oasis:entry colname="col5">0.677 (30.09 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW cloudy</oasis:entry>
         <oasis:entry colname="col2">0.862 (41.08 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M140" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.296 (<inline-formula><mml:math id="M141" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>8.35 %)</oasis:entry>
         <oasis:entry colname="col4">0.67</oasis:entry>
         <oasis:entry colname="col5">1.478 (81.55 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW partial</oasis:entry>
         <oasis:entry colname="col2">0.694 (25.83 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M142" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.328 (<inline-formula><mml:math id="M143" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>7.74 %)</oasis:entry>
         <oasis:entry colname="col4">0.56</oasis:entry>
         <oasis:entry colname="col5">1.112 (47.24 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW non-precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.725 (48.35 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M144" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.294 (<inline-formula><mml:math id="M145" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>27.53 %)</oasis:entry>
         <oasis:entry colname="col4">0.70</oasis:entry>
         <oasis:entry colname="col5">1.230 (73.66 %)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">LW precip. clouds</oasis:entry>
         <oasis:entry colname="col2">1.109 (77.76 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M146" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.337 (<inline-formula><mml:math id="M147" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>23.85 %)</oasis:entry>
         <oasis:entry colname="col4">0.34</oasis:entry>
         <oasis:entry colname="col5">1.578 (130.42 %)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ML-enhanced</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW clear</oasis:entry>
         <oasis:entry colname="col2">0.049 (0.49 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M148" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.000 (<inline-formula><mml:math id="M149" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.00 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.074 (0.72 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW cloudy</oasis:entry>
         <oasis:entry colname="col2">0.106 (4.46 %)</oasis:entry>
         <oasis:entry colname="col3">0.012 (0.98 %)</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.214 (11.30 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW partial</oasis:entry>
         <oasis:entry colname="col2">0.082 (2.00 %)</oasis:entry>
         <oasis:entry colname="col3">0.005 (0.23 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.150 (5.00 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW non-precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.080 (1.57 %)</oasis:entry>
         <oasis:entry colname="col3">0.006 (0.24 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.144 (3.50 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.188 (9.16 %)</oasis:entry>
         <oasis:entry colname="col3">0.024 (2.87 %)</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">0.341 (17.71 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW clear</oasis:entry>
         <oasis:entry colname="col2">0.028 (2.58 %)</oasis:entry>
         <oasis:entry colname="col3">0.003 (0.52 %)</oasis:entry>
         <oasis:entry colname="col4">1.00</oasis:entry>
         <oasis:entry colname="col5">0.053 (5.50 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW cloudy</oasis:entry>
         <oasis:entry colname="col2">0.127 (7.42 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M150" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.001 (0.00 %)</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.275 (17.70 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW partial</oasis:entry>
         <oasis:entry colname="col2">0.068 (3.33 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M151" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.001 (0.02 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.158 (8.70 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW non-precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.069 (4.43 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M152" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.002 (0.43 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.160 (9.41 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW precip. clouds</oasis:entry>
         <oasis:entry colname="col2">0.197 (19.37 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M153" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.004 (<inline-formula><mml:math id="M154" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.99 %)</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">0.319 (36.18 %)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="TA2" specific-use="star"><label>Table A2</label><caption><p id="d2e3121">Bulk statistics for results of cloud radiative effect on heating rates. MAE is the mean absolute error and <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is the coefficient of determination. RMSE is the root mean squared error. The percentage values in parentheses denote the relative values of MAE, bias and RMSE.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE [<inline-formula><mml:math id="M156" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>]</oasis:entry>
         <oasis:entry colname="col3">Bias [<inline-formula><mml:math id="M157" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>]</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">RMSE [<inline-formula><mml:math id="M159" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>]</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">pyRTE</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Arctic</oasis:entry>
         <oasis:entry colname="col2">0.215 (9.55 %)</oasis:entry>
         <oasis:entry colname="col3">0.114 (3.73 %)</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
         <oasis:entry colname="col5">0.326 (17.99 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Southern Ocean</oasis:entry>
         <oasis:entry colname="col2">0.417 (13.34 %)</oasis:entry>
         <oasis:entry colname="col3">0.218 (3.32 %)</oasis:entry>
         <oasis:entry colname="col4">0.87</oasis:entry>
         <oasis:entry colname="col5">0.665 (27.68 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Tropics</oasis:entry>
         <oasis:entry colname="col2">0.535 (13.86 %)</oasis:entry>
         <oasis:entry colname="col3">0.290 (4.39 %)</oasis:entry>
         <oasis:entry colname="col4">0.80</oasis:entry>
         <oasis:entry colname="col5">0.817 (29.42 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Pacific ITCZ</oasis:entry>
         <oasis:entry colname="col2">0.638 (18.33 %)</oasis:entry>
         <oasis:entry colname="col3">0.271 (6.27 %)</oasis:entry>
         <oasis:entry colname="col4">0.76</oasis:entry>
         <oasis:entry colname="col5">0.964 (33.67 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW stratocumulus</oasis:entry>
         <oasis:entry colname="col2">0.529 (13.66 %)</oasis:entry>
         <oasis:entry colname="col3">0.289 (4.38 %)</oasis:entry>
         <oasis:entry colname="col4">0.80</oasis:entry>
         <oasis:entry colname="col5">0.825 (28.97 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Arctic</oasis:entry>
         <oasis:entry colname="col2">0.634 (30.75 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M160" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.254 (<inline-formula><mml:math id="M161" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>14.89 %)</oasis:entry>
         <oasis:entry colname="col4">0.77</oasis:entry>
         <oasis:entry colname="col5">1.046 (55.94 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Southern Ocean</oasis:entry>
         <oasis:entry colname="col2">0.787 (30.60 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M162" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.318 (<inline-formula><mml:math id="M163" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>8.64 %)</oasis:entry>
         <oasis:entry colname="col4">0.69</oasis:entry>
         <oasis:entry colname="col5">1.307 (56.70 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Tropics</oasis:entry>
         <oasis:entry colname="col2">0.755 (30.96 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M164" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.326 (<inline-formula><mml:math id="M165" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>4.21 %)</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5">1.192 (65.09 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Pacific ITCZ</oasis:entry>
         <oasis:entry colname="col2">0.944 (50.80 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M166" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.311 (<inline-formula><mml:math id="M167" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>2.19 %)</oasis:entry>
         <oasis:entry colname="col4">0.49</oasis:entry>
         <oasis:entry colname="col5">1.326 (99.80 %)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">LW stratocumulus</oasis:entry>
         <oasis:entry colname="col2">0.742 (31.38 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M168" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.327 (<inline-formula><mml:math id="M169" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>6.13 %)</oasis:entry>
         <oasis:entry colname="col4">0.58</oasis:entry>
         <oasis:entry colname="col5">1.234 (62.76 %)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ML-enhanced</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Arctic</oasis:entry>
         <oasis:entry colname="col2">0.056 (2.17 %)</oasis:entry>
         <oasis:entry colname="col3">0.006 (0.58 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.098 (4.84 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Southern Ocean</oasis:entry>
         <oasis:entry colname="col2">0.089 (2.98 %)</oasis:entry>
         <oasis:entry colname="col3">0.005 (0.34 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.174 (7.30 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Tropics</oasis:entry>
         <oasis:entry colname="col2">0.097 (2.61 %)</oasis:entry>
         <oasis:entry colname="col3">0.009 (0.47 %)</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.187 (7.32 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW Pacific ITCZ</oasis:entry>
         <oasis:entry colname="col2">0.142 (4.50 %)</oasis:entry>
         <oasis:entry colname="col3">0.019 (1.17 %)</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.247 (9.73 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW stratocumulus</oasis:entry>
         <oasis:entry colname="col2">0.095 (2.48 %)</oasis:entry>
         <oasis:entry colname="col3">0.008 (0.41 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.179 (6.38 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Arctic</oasis:entry>
         <oasis:entry colname="col2">0.071 (4.96 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M170" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.000 (0.01 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.176 (11.91 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Southern Ocean</oasis:entry>
         <oasis:entry colname="col2">0.103 (4.86 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M171" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.002 (<inline-formula><mml:math id="M172" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.07 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.231 (11.55 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Tropics</oasis:entry>
         <oasis:entry colname="col2">0.079 (4.67 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M173" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.002 (<inline-formula><mml:math id="M174" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.02 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.172 (14.14 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW Pacific ITCZ</oasis:entry>
         <oasis:entry colname="col2">0.132 (10.55 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M175" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.002 (<inline-formula><mml:math id="M176" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.46 %)</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.219 (23.47 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LW stratocumulus</oasis:entry>
         <oasis:entry colname="col2">0.080 (4.75 %)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M177" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.001 (0.11 %)</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.175 (11.59 %)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>


</app>

<app id="App1.Ch1.S2">
  <label>Appendix B</label><title>Calculation of heating rates</title>
      <p id="d2e3751">When comparing the unscaled heating rates between coarse-scale and coarse-grained simulations, we find huge biases of up to 10 <inline-formula><mml:math id="M178" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> between the mean heating rates, especially in the stratosphere. We found that there is a difference in heating rate calculation between the code versions. Usually, the heating rate is calculated from flux divergence, pressure difference and constants:

          <disp-formula id="App1.Ch1.S2.E3" content-type="numbered"><label>B1</label><mml:math id="M179" display="block"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>g</mml:mi><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mtext>Net</mml:mtext><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mtext>Net</mml:mtext><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math id="M180" display="inline"><mml:mi>g</mml:mi></mml:math></inline-formula> is gravitational acceleration, <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> specific heat at constant pressure, <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mtext>Net</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is the difference between downward and upward flux, <inline-formula><mml:math id="M183" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula> is pressure. <inline-formula><mml:math id="M184" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> is defined at the center of a layer (also full levels) while <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>±</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> is defined at the layer boundaries (also half levels). This form of converting fluxes to heating rates is usually found in hydrostatic models but does not work in ICON because pressure is a diagnostic variable. Instead, the density is kept constant and specific heat at constant volume <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> needs to be used for the conversion <xref ref-type="bibr" rid="bib1.bibx48" id="paren.62"/>. This transforms Eq. (<xref ref-type="disp-formula" rid="App1.Ch1.S2.E3"/>) to Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>). In the ICON-A version, they use <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> which is valid for quasi-hydrostatic models because then the hydrostatic pressure holds <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:mi mathvariant="normal">d</mml:mi><mml:mi>P</mml:mi><mml:mo>/</mml:mo><mml:mi>g</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">d</mml:mi><mml:mi>z</mml:mi></mml:mrow></mml:math></inline-formula>. Additionally, the specific heat is scaled only by water vapor, while all tracers are included in the code version used for QUBICC. Therefore, we scaled the coarse-grained heating rates in Fig. <xref ref-type="fig" rid="F3"/> with the ratio <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, which is on average 0.7. The unscaled heating rates are shown in Fig. <xref ref-type="fig" rid="FB1"/>.</p>

      <fig id="FB1"><label>Figure B1</label><caption><p id="d2e4014">As Fig. <xref ref-type="fig" rid="F3"/> but here the coarse-grained heating rates are not scaled.</p></caption>
        
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f07.png"/>

      </fig>


</app>

<app id="App1.Ch1.S3">
  <label>Appendix C</label><title>Results for full vertical column</title>

      <fig id="FC1"><label>Figure C1</label><caption><p id="d2e4039">As Fig. <xref ref-type="fig" rid="F4"/> but for the full column. The large errors in the upper stratosphere for the baseline are related to rounding errors increasing with height.</p></caption>
        
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f08.png"/>

      </fig>

</app>

<app id="App1.Ch1.S4">
  <label>Appendix D</label><title>Linear decomposition under changing <inline-formula><mml:math id="M190" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></title>
      <p id="d2e4071">The linear decomposition assumption of clear-sky heating and cloud radiative impact may be questionable for different greenhouse gas concentrations. To provide some validity, we estimate the error induced by this assumption for <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:mn mathvariant="normal">4</mml:mn><mml:mo>×</mml:mo><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mrow></mml:math></inline-formula>. For a direct estimation, we select 4000 random samples from the test set and calculate the cloud radiative impact for the reference climate (<inline-formula><mml:math id="M192" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> concentration of 2004) offline using pyRTE. Next, we increase <inline-formula><mml:math id="M193" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> by a factor of 4 and repeat the calculation. The mean absolute difference between cloud_effect_reference and cloud_effect_4xCO2 gives an estimate of the error that is induced by the linear decomposition assumption for <inline-formula><mml:math id="M194" display="inline"><mml:mrow><mml:mn mathvariant="normal">4</mml:mn><mml:mo>×</mml:mo><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mrow></mml:math></inline-formula>. The vertical resolved error and bias are shown in Fig. <xref ref-type="fig" rid="FD1"/>. The <inline-formula><mml:math id="M195" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-range is the same as in Figs. <xref ref-type="fig" rid="F4"/>, <xref ref-type="fig" rid="F5"/> and <xref ref-type="fig" rid="FC1"/> for comparison with the errors induced by subgrid-scale clouds and the remaining errors of MLe-radiation. In general, the error is smaller in the stratosphere than in the troposphere. For SW, the error is around 2.5 % and for LW it is around 4 % in the troposphere. This around 10 times smaller than the error from subgrid-scale clouds and around the same magnitude as the errors from MLe-radiation.</p><fig id="FD1"><label>Figure D1</label><caption><p id="d2e4147">Mean absolute error (MAE) and bias induced by the linear decomposition assumption for <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:mn mathvariant="normal">4</mml:mn><mml:mo>×</mml:mo><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mrow></mml:math></inline-formula>. For comparison, the <inline-formula><mml:math id="M197" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-range is the same as in Figs. <xref ref-type="fig" rid="F4"/>, <xref ref-type="fig" rid="F5"/> and <xref ref-type="fig" rid="FC1"/>.</p></caption>
        <graphic xlink:href="https://gmd.copernicus.org/articles/19/3875/2026/gmd-19-3875-2026-f09.png"/>

      </fig>

</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e4189">The software code for the ICON model is available at <ext-link xlink:href="https://doi.org/10.35089/WDCC/IconRelease01" ext-link-type="DOI">10.35089/WDCC/IconRelease01</ext-link>  <xref ref-type="bibr" rid="bib1.bibx24" id="paren.63"/> under the BSD-3C license. The exact version for high-resolution simulations is icon-2024.10 and for coarse-resolution, the exact version (2.6.4) is archived at <ext-link xlink:href="https://doi.org/10.5281/zenodo.18853569" ext-link-type="DOI">10.5281/zenodo.18853569</ext-link> <xref ref-type="bibr" rid="bib1.bibx17" id="paren.64"/>. The ICON-model repository is archived under <ext-link xlink:href="https://doi.org/10.35089/WDCC/IconRelease01" ext-link-type="DOI">10.35089/WDCC/IconRelease01</ext-link> <xref ref-type="bibr" rid="bib1.bibx24" id="paren.65"/>. The software code for pyRTE+RRTMGP is   archived under <ext-link xlink:href="https://doi.org/10.5281/zenodo.16644555" ext-link-type="DOI">10.5281/zenodo.16644555</ext-link> <xref ref-type="bibr" rid="bib1.bibx36" id="paren.66"/>. The code for the network training and plots is   archived under <ext-link xlink:href="https://doi.org/10.5281/zenodo.17280639" ext-link-type="DOI">10.5281/zenodo.17280639</ext-link> <xref ref-type="bibr" rid="bib1.bibx16" id="paren.67"/>. Data used for training and reproducing the plots is archived at <ext-link xlink:href="https://doi.org/10.5281/zenodo.18853569" ext-link-type="DOI">10.5281/zenodo.18853569</ext-link> <xref ref-type="bibr" rid="bib1.bibx17" id="paren.68"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e4234">KH: Conceptualization, Investigation, Formal analysis, Software,  Data Curation, Methodology, Writing – Original Draft. SS: Conceptualization, Methodology, Writing – Review and Editing. GB: Conceptualization, Methodology. AL: Supervision, Writing – Review and Editing. RP: Conceptualization, Software. JS: Data Curation, Writing – Review and Editing. VE: Funding acquisition, Supervision.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e4240">At least one of the (co-)authors is a member of the editorial board of <italic>Geoscientific Model Development</italic>. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e4249">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e4255">Katharina Hafner and Veronika Eyring were supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the Gottfried Wilhelm Leibniz Prize awarded to Veronika Eyring (Reference No. EY 22/2-1). Veronika Eyring additionally acknowledge funding by the European Research Council (ERC) Synergy Grant “Understanding and Modeling the Earth System with Machine Learning” (USMILE) under the Horizon 2020 Research and Innovation program (Grant Agreement No. 855187). This work used resources of the Deutsches Klimarechenzentrum (DKRZ) granted by its Scientific Steering Committee (WLA) under project ID bd1179. Robert Pincus, and Sara Shamekh were supported by the US National Science Foundation through the Learning the Earth with Artificial intelligence and Physics (LEAP) Science and Technology Center (STC) (Award #2019625). Sara Shamekh acknowledges support provided by Schmidt Sciences, LLC. The authors gratefully acknowledge the Earth System Modelling Project (ESM) for funding this work by providing computing time on the ESM partition of the supercomputer JUWELS at the Jülich Supercomputing Centre (JSC). Katharina Hafner acknowledges funding from the Swiss State Secretariat for Education, Research and Innovation (SERI) for the Horizon Europe project AI4PEX (Grant agreement ID: 101137682 and SERI no 23.00546). The authors would like to thank two anonymous reviewers that helped improving this manuscript.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e4261">This research has been supported by the Deutsche Forschungsgemeinschaft (grant no. EY 22/2-1), the European Horizon 2020 (grant no. 855187), the Deutsches Klimarechenzentrum (grant no. bd1179), the National Science Foundation (grant no. 2019625), and the Jülich Supercomputing Centre, Forschungszentrum Jülich (grant no. icon-a-ml).  The article processing charges for this open-access  publication were covered by the University of Bremen.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e4272">This paper was edited by Po-Lun Ma and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Barker et al.(1999)Barker, Stephens, and Fu</label><mixed-citation>Barker, H. W., Stephens, G. L., and Fu, Q.: The sensitivity of domain-averaged solar fluxes to assumptions about cloud geometry, Q. J. Roy. Meteor. Soc., 125, 2127–2152, <ext-link xlink:href="https://doi.org/10.1002/qj.49712555810" ext-link-type="DOI">10.1002/qj.49712555810</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Bertoli et al.(2025)Bertoli, Mohebi, Ozdemir, Jucker, Rüdisühli, Perez-Cruz, Salzmann, and Schemm</label><mixed-citation>Bertoli, G., Mohebi, S., Ozdemir, F., Jucker, J., Rüdisühli, S., Perez-Cruz, F., Salzmann, M., and Schemm, S.: Revisiting Machine Learning Approaches for Short- and Longwave Radiation Inference in Weather and Climate Models, J. Adv. Model. Earth Sy., 17, <ext-link xlink:href="https://doi.org/10.1029/2025ms004956" ext-link-type="DOI">10.1029/2025ms004956</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Beucler et al.(2024)Beucler, Gentine, Yuval, Gupta, Peng, Lin, Yu, Rasp, Ahmed, O’Gorman, Neelin, Lutsko, and Pritchard</label><mixed-citation>Beucler, T., Gentine, P., Yuval, J., Gupta, A., Peng, L., Lin, J., Yu, S., Rasp, S., Ahmed, F., O’Gorman, P. A., Neelin, J. D., Lutsko, N. J., and Pritchard, M.: Climate-invariant machine learning, Science Advances, 10, eadj7250, <ext-link xlink:href="https://doi.org/10.1126/sciadv.adj7250" ext-link-type="DOI">10.1126/sciadv.adj7250</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Bock and Lauer(2024)</label><mixed-citation>Bock, L. and Lauer, A.: Cloud properties and their projected changes in CMIP models with low to high climate sensitivity, Atmos. Chem. Phys., 24, 1587–1605, <ext-link xlink:href="https://doi.org/10.5194/acp-24-1587-2024" ext-link-type="DOI">10.5194/acp-24-1587-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Bony et al.(2015)Bony, Stevens, Frierson, Jakob, Kageyama, Pincus, Shepherd, Sherwood, Siebesma, Sobel, Watanabe, and Webb</label><mixed-citation>Bony, S., Stevens, B., Frierson, D. M. W., Jakob, C., Kageyama, M., Pincus, R., Shepherd, T. G., Sherwood, S. C., Siebesma, A. P., Sobel, A. H., Watanabe, M., and Webb, M. J.: Clouds, circulation and climate sensitivity, Nat. Geosci., 8, 261–268, <ext-link xlink:href="https://doi.org/10.1038/ngeo2398" ext-link-type="DOI">10.1038/ngeo2398</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Bretherton et al.(2022)Bretherton, Henn, Kwa, Brenowitz, Watt-Meyer, McGibbon, Perkins, Clark, and Harris</label><mixed-citation>Bretherton, C. S., Henn, B., Kwa, A., Brenowitz, N. D., Watt-Meyer, O., McGibbon, J., Perkins, W. A., Clark, S. K., and Harris, L.: Correcting Coarse-Grid Weather and Climate Models by Machine Learning From Global Storm-Resolving Simulations, J. Adv. Model. Earth Sy., 14, <ext-link xlink:href="https://doi.org/10.1029/2021ms002794" ext-link-type="DOI">10.1029/2021ms002794</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Busecke et al.(2025)Busecke, Balwada, Martin, Nicholas, Johnson, Nalluri, Stern, and Abernathey</label><mixed-citation>Busecke, J. J. M., Balwada, D., Martin, P. E., Nicholas, T. E. G., Johnson, Z. C. P., Nalluri, P., Stern, C. I., and Abernathey, R. P.: The Impact of Sub-Grid Heterogeneity on Air-Sea Turbulent Heat Flux in Coupled Climate Models, Geophys. Res. Lett., 52, <ext-link xlink:href="https://doi.org/10.1029/2025gl114951" ext-link-type="DOI">10.1029/2025gl114951</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Chen et al.(2021)Chen, Rojas, Samset, Cobb, Diongue Niang, Edwards, Emori, Faria, Hawkins, Hope, Huybrechts, Meinshausen, Mustafa, Plattner, and Tréguier</label><mixed-citation>Chen, D., Rojas, M., Samset, B., Cobb, K., Diongue Niang, A., Edwards, P., Emori, S., Faria, S., Hawkins, E., Hope, P., Huybrechts, P., Meinshausen, M., Mustafa, S., Plattner, G.-K., and Tréguier, A.-M.: Framing, Context, and Methods, in: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S. L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M. I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, O., Yu, R., and Zhou, B., book section 1, pp. 147–286, Cambridge University Press, Cambridge, UK and New York, NY, USA, <ext-link xlink:href="https://doi.org/10.1017/9781009157896.003" ext-link-type="DOI">10.1017/9781009157896.003</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Chevallier et al.(1998)Chevallier, Chéruy, Scott, and Chédin</label><mixed-citation>Chevallier, F., Chéruy, F., Scott, N. A., and Chédin, A.: A Neural Network Approach for a Fast and Accurate Computation of a Longwave Radiative Budget, J. Appl. Meteorol. Clim., 37, 1385–1397, <ext-link xlink:href="https://doi.org/10.1175/1520-0450(1998)037&lt;1385:ANNAFA&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0450(1998)037&lt;1385:ANNAFA&gt;2.0.CO;2</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Cotronei and Slawig(2020)</label><mixed-citation>Cotronei, A. and Slawig, T.: Single-precision arithmetic in ECHAM radiation reduces runtime and energy consumption, Geosci. Model Dev., 13, 2783–2804, <ext-link xlink:href="https://doi.org/10.5194/gmd-13-2783-2020" ext-link-type="DOI">10.5194/gmd-13-2783-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Doktorowski et al.(2024)Doktorowski, Kretzschmar, Quaas, Salzmann, and Sourdeval</label><mixed-citation>Doktorowski, S., Kretzschmar, J., Quaas, J., Salzmann, M., and Sourdeval, O.: Subgrid-scale variability of cloud ice in the ICON-AES 1.3.00, Geosci. Model Dev., 17, 3099–3110, <ext-link xlink:href="https://doi.org/10.5194/gmd-17-3099-2024" ext-link-type="DOI">10.5194/gmd-17-3099-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Doms et al.(2011)Doms, Förstner, Heise, Herzog, Mironov, Raschendorfer, Reinhardt, Ritter, Schrodin, Schulz, and Vogel</label><mixed-citation>Doms, G., Förstner, G., Heise, E., Herzog, H.-J., Mironov, D., Raschendorfer, M., Reinhardt, T., Ritter, B., Schrodin, R., Schulz, J.-P., and Vogel, G.: A Description of the Nonhydrostatic Regional COSMO Model. Part II: Physical Parameterization, Consortium for Small-Scale Modelling, <uri>http://www.cosmo-model.org/</uri> (last access: 3 May 2026), 2011.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Giorgetta et al.(2018)Giorgetta, Brokopf, Crueger, Esch, Fiedler, Helmert, Hohenegger, Kornblueh, Köhler, Manzini, Mauritsen, Nam, Raddatz, Rast, Reinert, Sakradzija, Schmidt, Schneck, Schnur, Silvers, Wan, Zängl, and Stevens</label><mixed-citation>Giorgetta, M. A., Brokopf, R., Crueger, T., Esch, M., Fiedler, S., Helmert, J., Hohenegger, C., Kornblueh, L., Köhler, M., Manzini, E., Mauritsen, T., Nam, C., Raddatz, T., Rast, S., Reinert, D., Sakradzija, M., Schmidt, H., Schneck, R., Schnur, R., Silvers, L., Wan, H., Zängl, G., and Stevens, B.: ICON-A, the Atmosphere Component of the ICON Earth System Model: I. Model Description, J. Adv. Model. Earth Sy., 10, 1613–1637, <ext-link xlink:href="https://doi.org/10.1029/2017MS001242" ext-link-type="DOI">10.1029/2017MS001242</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Giorgetta et al.(2022)Giorgetta, Sawyer, Lapillonne, Adamidis, Alexeev, Clément, Dietlicher, Engels, Esch, Franke, Frauen, Hannah, Hillman, Kornblueh, Marti, Norman, Pincus, Rast, Reinert, Schnur, Schulzweida, and Stevens</label><mixed-citation>Giorgetta, M. A., Sawyer, W., Lapillonne, X., Adamidis, P., Alexeev, D., Clément, V., Dietlicher, R., Engels, J. F., Esch, M., Franke, H., Frauen, C., Hannah, W. M., Hillman, B. R., Kornblueh, L., Marti, P., Norman, M. R., Pincus, R., Rast, S., Reinert, D., Schnur, R., Schulzweida, U., and Stevens, B.: The ICON-A model for direct QBO simulations on GPUs (version icon-cscs:baf28a514), Geosci. Model Dev., 15, 6985–7016, <ext-link xlink:href="https://doi.org/10.5194/gmd-15-6985-2022" ext-link-type="DOI">10.5194/gmd-15-6985-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Grundner et al.(2022)Grundner, Beucler, Gentine, Iglesias-Suarez, Giorgetta, and Eyring</label><mixed-citation>Grundner, A., Beucler, T., Gentine, P., Iglesias-Suarez, F., Giorgetta, M. A., and Eyring, V.: Deep Learning Based Cloud Cover Parameterization for ICON, J. Adv. Model. Earth Sy., 14, e2021MS002959, <ext-link xlink:href="https://doi.org/10.1029/2021MS002959" ext-link-type="DOI">10.1029/2021MS002959</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Hafner(2025)</label><mixed-citation>Hafner, K.: Representing Subgrid-Scale Cloud Effects in a Radiation Parameterization using Machine Learning: MLe-radiation v1.0, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/ZENODO.17280639" ext-link-type="DOI">10.5281/ZENODO.17280639</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Hafner(2026)</label><mixed-citation>Hafner, K.: Code and Data for “Representing Subgrid-Scale Cloud Effects in a Radiation Parameterization using Machine Learning: MLe-radiation v1.0”, Zenodo [code and data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.18853569" ext-link-type="DOI">10.5281/zenodo.18853569</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Hafner et al.(2025a)Hafner, Iglesiaz-Suarez, Shamekh, Gentine, Pincus, Giorgetta, and Eyring</label><mixed-citation>Hafner, K., Iglesiaz-Suarez, F., Shamekh, S., Gentine, P., Pincus, R., Giorgetta, M., and Eyring, V.: Stable Machine Learning based Radiation Emulation for ICON, J. Adv. Model. Earth Sy., <ext-link xlink:href="https://doi.org/10.22541/essoar.174708082.27787580/v1" ext-link-type="DOI">10.22541/essoar.174708082.27787580/v1</ext-link>, in review, 2025a.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Hafner et al.(2025b)Hafner, Iglesiaz-Suarez, Shamekh, Gentine, Pincus, Girgetta, and Eyring</label><mixed-citation>Hafner, K., Iglesiaz-Suarez, F., Shamekh, S., Gentine, P., Pincus, R., Girgetta, M., and Eyring, V.: Interpretable machine learning radiation parameterization for ICON, J. Geophys. Res.-Mach. Learn. Comput., 2, e2024JH000501, <ext-link xlink:href="https://doi.org/10.1029/2024JH000501" ext-link-type="DOI">10.1029/2024JH000501</ext-link>, 2025b.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Henn et al.(2024)Henn, Jauregui, Clark, Brenowitz, McGibbon, Watt-Meyer, Pauling, and Bretherton</label><mixed-citation>Henn, B., Jauregui, Y. R., Clark, S. K., Brenowitz, N. D., McGibbon, J., Watt-Meyer, O., Pauling, A. G., and Bretherton, C. S.: A Machine Learning Parameterization of Clouds in a Coarse-Resolution Climate Model for Unbiased Radiation, J. Adv. Model. Earth Sy., 16, <ext-link xlink:href="https://doi.org/10.1029/2023ms003949" ext-link-type="DOI">10.1029/2023ms003949</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Heuer et al.(2024)Heuer, Schwabe, Gentine, Giorgetta, and Eyring</label><mixed-citation>Heuer, H., Schwabe, M., Gentine, P., Giorgetta, M. A., and Eyring, V.: Interpretable Multiscale Machine Learning-Based Parameterizations of Convection for ICON, J. Adv. Model. Earth Sy., 16, <ext-link xlink:href="https://doi.org/10.1029/2024ms004398" ext-link-type="DOI">10.1029/2024ms004398</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Hogan and Matricardi(2022)</label><mixed-citation>Hogan, R. J. and Matricardi, M.: A Tool for Generating Fast k-Distribution Gas-Optics Models for Weather and Climate Applications, J. Adv. Model. Earth Sy., 14, e2022MS003033, <ext-link xlink:href="https://doi.org/10.1029/2022MS003033" ext-link-type="DOI">10.1029/2022MS003033</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Hohenegger et al.(2023)Hohenegger, Korn, Linardakis, Redler, Schnur, Adamidis, Bao, Bastin, Behravesh, Bergemann, Biercamp, Bockelmann, Brokopf, Brüggemann, Casaroli, Chegini, Datseris, Esch, George, Giorgetta, Gutjahr, Haak, Hanke, Ilyina, Jahns, Jungclaus, Kern, Klocke, Kluft, Kölling, Kornblueh, Kosukhin, Kroll, Lee, Mauritsen, Mehlmann, Mieslinger, Naumann, Paccini, Peinado, Praturi, Putrasahan, Rast, Riddick, Roeber, Schmidt, Schulzweida, Schütte, Segura, Shevchenko, Singh, Specht, Stephan, von Storch, Vogel, Wengel, Winkler, Ziemen, Marotzke, and Stevens</label><mixed-citation>Hohenegger, C., Korn, P., Linardakis, L., Redler, R., Schnur, R., Adamidis, P., Bao, J., Bastin, S., Behravesh, M., Bergemann, M., Biercamp, J., Bockelmann, H., Brokopf, R., Brüggemann, N., Casaroli, L., Chegini, F., Datseris, G., Esch, M., George, G., Giorgetta, M., Gutjahr, O., Haak, H., Hanke, M., Ilyina, T., Jahns, T., Jungclaus, J., Kern, M., Klocke, D., Kluft, L., Kölling, T., Kornblueh, L., Kosukhin, S., Kroll, C., Lee, J., Mauritsen, T., Mehlmann, C., Mieslinger, T., Naumann, A. K., Paccini, L., Peinado, A., Praturi, D. S., Putrasahan, D., Rast, S., Riddick, T., Roeber, N., Schmidt, H., Schulzweida, U., Schütte, F., Segura, H., Shevchenko, R., Singh, V., Specht, M., Stephan, C. C., von Storch, J.-S., Vogel, R., Wengel, C., Winkler, M., Ziemen, F., Marotzke, J., and Stevens, B.: ICON-Sapphire: simulating the components of the Earth system and their interactions at kilometer and subkilometer scales, Geosci. Model Dev., 16, 779–811, <ext-link xlink:href="https://doi.org/10.5194/gmd-16-779-2023" ext-link-type="DOI">10.5194/gmd-16-779-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>ICON Partnership et al.(2024)</label><mixed-citation>ICON Partnership (DWD, MPI-M, DKRZ, KIT, and C2SM): ICON release 2024.01, ICON Partnership [code], <ext-link xlink:href="https://doi.org/10.35089/WDCC/IconRelease01" ext-link-type="DOI">10.35089/WDCC/IconRelease01</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Kingma and Ba(2017)</label><mixed-citation>Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, 3rd International Conference on Learning Representations, ICLR2015, San Diego, CA, USA, 7–9 May  2015, Conference Track Proceedings, <ext-link xlink:href="https://doi.org/10.48550/arXiv.1412.6980" ext-link-type="DOI">10.48550/arXiv.1412.6980</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Klöwer et al.(2024)Klöwer, Gelbrecht, Hotta, Willmert, Silvestri, Wagner, White, Hatfield, Kimpson, Constantinou, and Hill</label><mixed-citation>Klöwer, M., Gelbrecht, M., Hotta, D., Willmert, J., Silvestri, S., Wagner, G. L., White, A., Hatfield, S., Kimpson, T., Constantinou, N. C., and Hill, C.: SpeedyWeather.jl: Reinventing atmospheric general circulation models towards interactivity and extensibility, Journal of Open Source Software, 9, 6323, <ext-link xlink:href="https://doi.org/10.21105/joss.06323" ext-link-type="DOI">10.21105/joss.06323</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Kochkov et al.(2024)Kochkov, Yuval, Langmore, Norgaard, Smith, Mooers, Klöwer, Lottes, Rasp, Düben, Hatfield, Battaglia, Sanchez-Gonzalez, Willson, Brenner, and Hoyer</label><mixed-citation>Kochkov, D., Yuval, J., Langmore, I., Norgaard, P., Smith, J., Mooers, G., Klöwer, M., Lottes, J., Rasp, S., Düben, P., Hatfield, S., Battaglia, P., Sanchez-Gonzalez, A., Willson, M., Brenner, M. P., and Hoyer, S.: Neural general circulation models for weather and climate, Nature, 632, 1060–1066, <ext-link xlink:href="https://doi.org/10.1038/s41586-024-07744-y" ext-link-type="DOI">10.1038/s41586-024-07744-y</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Krasnopolsky et al.(2005)Krasnopolsky, Fox-Rabinovitz, and Chalikov</label><mixed-citation>Krasnopolsky, V. M., Fox-Rabinovitz, M. S., and Chalikov, D. V.: New Approach to Calculation of Atmospheric Model Physics: Accurate and Fast Neural Network Emulation of Longwave Radiation in a Climate Model, Mon. Weather Rev., 133, 1370–1383, <ext-link xlink:href="https://doi.org/10.1175/MWR2923.1" ext-link-type="DOI">10.1175/MWR2923.1</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Lagerquist et al.(2023)Lagerquist, Turner, Ebert-Uphoff, and Stewart</label><mixed-citation>Lagerquist, R., Turner, D. D., Ebert-Uphoff, I., and Stewart, J. Q.: Estimating Full Longwave and Shortwave Radiative Transfer with Neural Networks of Varying Complexity, J. Atmos. Ocean. Tech., 40, 1407–1432, <ext-link xlink:href="https://doi.org/10.1175/JTECH-D-23-0012.1" ext-link-type="DOI">10.1175/JTECH-D-23-0012.1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>LeCun et al.(2012)LeCun, Bottou, Orr, and Müller</label><mixed-citation>LeCun, Y. A., Bottou, L., Orr, G. B., and Müller, K.-R.: Efficient BackProp, Springer Berlin Heidelberg, p. 9–48, <ext-link xlink:href="https://doi.org/10.1007/978-3-642-35289-8_3" ext-link-type="DOI">10.1007/978-3-642-35289-8_3</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Lohmann and Roeckner(1996)</label><mixed-citation>Lohmann, U. and Roeckner, E.: Design and performance of a new cloud microphysics scheme developed for the ECHAM general circulation model, Clim. Dynam., 12, 557–572, <ext-link xlink:href="https://doi.org/10.1007/bf00207939" ext-link-type="DOI">10.1007/bf00207939</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Meyer et al.(2022)Meyer, Hogan, Dueben, and Mason</label><mixed-citation>Meyer, D., Hogan, R. J., Dueben, P. D., and Mason, S. L.: Machine Learning Emulation of 3D Cloud Radiative Effects, J. Adv. Model. Earth Sy., 14, e2021MS002550, <ext-link xlink:href="https://doi.org/10.1029/2021MS002550" ext-link-type="DOI">10.1029/2021MS002550</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Pal et al.(2019)Pal, Mahajan, and Norman</label><mixed-citation>Pal, A., Mahajan, S., and Norman, M. R.: Using Deep Neural Networks as Cost-Effective Surrogate Models for Super-Parameterized E3SM Radiative Transfer, Geophys. Res. Lett., 46, 6069–6079, <ext-link xlink:href="https://doi.org/10.1029/2018GL081646" ext-link-type="DOI">10.1029/2018GL081646</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Pincus et al.(2003)Pincus, Barker, and Morcrette</label><mixed-citation>Pincus, R., Barker, H. W., and Morcrette, J.: A fast, flexible, approximate technique for computing radiative transfer in inhomogeneous cloud fields, J. Geophys. Res.-Atmos., 108, <ext-link xlink:href="https://doi.org/10.1029/2002jd003322" ext-link-type="DOI">10.1029/2002jd003322</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Pincus et al.(2019)Pincus, Mlawer, and Delamere</label><mixed-citation>Pincus, R., Mlawer, E. J., and Delamere, J. S.: Balancing Accuracy, Efficiency, and Flexibility in Radiation Calculations for Dynamical Models, J. Adv. Model. Earth Sy., 11, 3074–3089, <ext-link xlink:href="https://doi.org/10.1029/2019MS001621" ext-link-type="DOI">10.1029/2019MS001621</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Pincus et al.(2025)Pincus, makepath LLC, and Sehnem</label><mixed-citation>Pincus, R., makepath LLC, and Sehnem, J. M.: pyRTE-RRTMGP, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.16644555" ext-link-type="DOI">10.5281/zenodo.16644555</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Räisänen et al.(2004)Räisänen, Barker, Khairoutdinov, Li, and Randall</label><mixed-citation>Räisänen, P., Barker, H. W., Khairoutdinov, M. F., Li, J., and Randall, D. A.: Stochastic generation of subgrid-scale cloudy columns for large-scale models, Q. J. Roy. Meteor. Soc., 130, 2047–2067, <ext-link xlink:href="https://doi.org/10.1256/qj.03.99" ext-link-type="DOI">10.1256/qj.03.99</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Rasp(2020)</label><mixed-citation>Rasp, S.: Coupled online learning as a way to tackle instabilities and biases in neural network parameterizations: general algorithms and Lorenz 96 case study (v1.0), Geosci. Model Dev., 13, 2185–2196, <ext-link xlink:href="https://doi.org/10.5194/gmd-13-2185-2020" ext-link-type="DOI">10.5194/gmd-13-2185-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Roh and Song(2020)</label><mixed-citation>Roh, S. and Song, H.-J.: Evaluation of Neural Network Emulations for Radiation Parameterization in Cloud Resolving Model, Geophys. Res. Lett., 47, e2020GL089444, <ext-link xlink:href="https://doi.org/10.1029/2020GL089444" ext-link-type="DOI">10.1029/2020GL089444</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Shamekh et al.(2023)Shamekh, Lamb, Huang, and Gentine</label><mixed-citation>Shamekh, S., Lamb, K. D., Huang, Y., and Gentine, P.: Implicit learning of convective organization explains precipitation stochasticity, P. Natl. Acad. Sci. USA, 120, <ext-link xlink:href="https://doi.org/10.1073/pnas.2216158120" ext-link-type="DOI">10.1073/pnas.2216158120</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Stevens et al.(2019)Stevens, Satoh, Auger, Biercamp, Bretherton, Chen, Düben, Judt, Khairoutdinov, Klocke, Kodama, Kornblueh, Lin, Neumann, Putman, Röber, Shibuya, Vanniere, Vidale, Wedi, and Zhou</label><mixed-citation>Stevens, B., Satoh, M., Auger, L., Biercamp, J., Bretherton, C. S., Chen, X., Düben, P., Judt, F., Khairoutdinov, M., Klocke, D., Kodama, C., Kornblueh, L., Lin, S.-J., Neumann, P., Putman, W. M., Röber, N., Shibuya, R., Vanniere, B., Vidale, P. L., Wedi, N., and Zhou, L.: DYAMOND: the DYnamics of the Atmospheric general circulation Modeled On Non-hydrostatic Domains, Progress in Earth and Planetary Science, 6, <ext-link xlink:href="https://doi.org/10.1186/s40645-019-0304-z" ext-link-type="DOI">10.1186/s40645-019-0304-z</ext-link>, 2019. </mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Ukkonen(2022)</label><mixed-citation>Ukkonen, P.: Exploring Pathways to More Accurate Machine Learning Emulation of Atmospheric Radiative Transfer, J. Adv. Model. Earth Syst., 14, e2021MS002875, <ext-link xlink:href="https://doi.org/10.1029/2021MS002875" ext-link-type="DOI">10.1029/2021MS002875</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Ukkonen and Chantry(2025)</label><mixed-citation>Ukkonen, P. and Chantry, M.: Vertically Recurrent Neural Networks for Sub-Grid Parameterization, J. Adv. Model. Earth Sy., 17, e2024MS004833, <ext-link xlink:href="https://doi.org/10.1029/2024MS004833" ext-link-type="DOI">10.1029/2024MS004833</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Ukkonen and Hogan(2024)</label><mixed-citation>Ukkonen, P. and Hogan, R. J.: Twelve Times Faster yet Accurate: A New State-Of-The-Art in Radiation Schemes via Performance and Spectral Optimization, J. Adv. Model. Earth Sy., 16, <ext-link xlink:href="https://doi.org/10.1029/2023ms003932" ext-link-type="DOI">10.1029/2023ms003932</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Watt-Meyer et al.(2024)Watt-Meyer, Brenowitz, Clark, Henn, Kwa, McGibbon, Perkins, Harris, and Bretherton</label><mixed-citation>Watt-Meyer, O., Brenowitz, N. D., Clark, S. K., Henn, B., Kwa, A., McGibbon, J., Perkins, W. A., Harris, L., and Bretherton, C. S.: Neural Network Parameterization of Subgrid-Scale Physics From a Realistic Geography Global Storm-Resolving Simulation, J. Adv. Model. Earth Sy., 16, <ext-link xlink:href="https://doi.org/10.1029/2023ms003668" ext-link-type="DOI">10.1029/2023ms003668</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Wood(2012)</label><mixed-citation>Wood, R.: Stratocumulus Clouds, Mon. Weather Rev., 140, 2373–2423, <ext-link xlink:href="https://doi.org/10.1175/mwr-d-11-00121.1" ext-link-type="DOI">10.1175/mwr-d-11-00121.1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Yao et al.(2023)Yao, Zhong, Zheng, and Wang</label><mixed-citation>Yao, Y., Zhong, X., Zheng, Y., and Wang, Z.: A Physics-Incorporated Deep Learning Framework for Parameterization of Atmospheric Radiative Transfer, J. Adv. Model. Earth Sy., 15, e2022MS003445, <ext-link xlink:href="https://doi.org/10.1029/2022MS003445" ext-link-type="DOI">10.1029/2022MS003445</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Zängl et al.(2014)Zängl, Reinert, Rípodas, and Baldauf</label><mixed-citation>Zängl, G., Reinert, D., Rípodas, P., and Baldauf, M.: The ICON (ICOsahedral Non-hydrostatic) modelling framework of DWD and MPI-M: Description of the non-hydrostatic dynamical core, Q. J. Roy. Meteor. Soc., 141, 563–579, <ext-link xlink:href="https://doi.org/10.1002/qj.2378" ext-link-type="DOI">10.1002/qj.2378</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Zhao et al.(2024)Zhao, Li, Wen, Li, Wang, and Huang</label><mixed-citation>Zhao, Y., Li, J., Wen, D., Li, Y., Wang, Y., and Huang, J.: Distinct structure, radiative effects, and precipitation characteristics of deep convection systems in the Tibetan Plateau compared to the tropical Indian Ocean, Atmos. Chem. Phys., 24, 9435–9457, <ext-link xlink:href="https://doi.org/10.5194/acp-24-9435-2024" ext-link-type="DOI">10.5194/acp-24-9435-2024</ext-link>, 2024.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Representing subgrid-scale cloud effects in a radiation parameterization using machine learning: MLe-radiation v1.0</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Barker et al.(1999)Barker, Stephens, and Fu</label><mixed-citation>
      
Barker, H. W., Stephens, G. L., and Fu, Q.:
The sensitivity of domain-averaged solar fluxes to assumptions about cloud geometry, Q. J. Roy. Meteor. Soc., 125, 2127–2152, <a href="https://doi.org/10.1002/qj.49712555810" target="_blank">https://doi.org/10.1002/qj.49712555810</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Bertoli et al.(2025)Bertoli, Mohebi, Ozdemir, Jucker, Rüdisühli, Perez-Cruz, Salzmann, and Schemm</label><mixed-citation>
      
Bertoli, G., Mohebi, S., Ozdemir, F., Jucker, J., Rüdisühli, S., Perez-Cruz, F., Salzmann, M., and Schemm, S.:
Revisiting Machine Learning Approaches for Short- and Longwave Radiation Inference in Weather and Climate Models, J. Adv. Model. Earth Sy., 17, <a href="https://doi.org/10.1029/2025ms004956" target="_blank">https://doi.org/10.1029/2025ms004956</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Beucler et al.(2024)Beucler, Gentine, Yuval, Gupta, Peng, Lin, Yu, Rasp, Ahmed, O’Gorman, Neelin, Lutsko, and Pritchard</label><mixed-citation>
      
Beucler, T., Gentine, P., Yuval, J., Gupta, A., Peng, L., Lin, J., Yu, S., Rasp, S., Ahmed, F., O’Gorman, P. A., Neelin, J. D., Lutsko, N. J., and Pritchard, M.:
Climate-invariant machine learning, Science Advances, 10, eadj7250, <a href="https://doi.org/10.1126/sciadv.adj7250" target="_blank">https://doi.org/10.1126/sciadv.adj7250</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Bock and Lauer(2024)</label><mixed-citation>
      
Bock, L. and Lauer, A.:
Cloud properties and their projected changes in CMIP models with low to high climate sensitivity, Atmos. Chem. Phys., 24, 1587–1605, <a href="https://doi.org/10.5194/acp-24-1587-2024" target="_blank">https://doi.org/10.5194/acp-24-1587-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Bony et al.(2015)Bony, Stevens, Frierson, Jakob, Kageyama, Pincus, Shepherd, Sherwood, Siebesma, Sobel, Watanabe, and Webb</label><mixed-citation>
      
Bony, S., Stevens, B., Frierson, D. M. W., Jakob, C., Kageyama, M., Pincus, R., Shepherd, T. G., Sherwood, S. C., Siebesma, A. P., Sobel, A. H., Watanabe, M., and Webb, M. J.:
Clouds, circulation and climate sensitivity, Nat. Geosci., 8, 261–268, <a href="https://doi.org/10.1038/ngeo2398" target="_blank">https://doi.org/10.1038/ngeo2398</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Bretherton et al.(2022)Bretherton, Henn, Kwa, Brenowitz, Watt-Meyer, McGibbon, Perkins, Clark, and Harris</label><mixed-citation>
      
Bretherton, C. S., Henn, B., Kwa, A., Brenowitz, N. D., Watt-Meyer, O., McGibbon, J., Perkins, W. A., Clark, S. K., and Harris, L.:
Correcting Coarse-Grid Weather and Climate Models by Machine Learning From Global Storm-Resolving Simulations, J. Adv. Model. Earth Sy., 14, <a href="https://doi.org/10.1029/2021ms002794" target="_blank">https://doi.org/10.1029/2021ms002794</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Busecke et al.(2025)Busecke, Balwada, Martin, Nicholas, Johnson, Nalluri, Stern, and Abernathey</label><mixed-citation>
      
Busecke, J. J. M., Balwada, D., Martin, P. E., Nicholas, T. E. G., Johnson, Z. C. P., Nalluri, P., Stern, C. I., and Abernathey, R. P.:
The Impact of Sub-Grid Heterogeneity on Air-Sea Turbulent Heat Flux in Coupled Climate Models, Geophys. Res. Lett., 52, <a href="https://doi.org/10.1029/2025gl114951" target="_blank">https://doi.org/10.1029/2025gl114951</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Chen et al.(2021)Chen, Rojas, Samset, Cobb, Diongue Niang, Edwards, Emori, Faria, Hawkins, Hope, Huybrechts, Meinshausen, Mustafa, Plattner, and Tréguier</label><mixed-citation>
      
Chen, D., Rojas, M., Samset, B., Cobb, K., Diongue Niang, A., Edwards, P., Emori, S., Faria, S., Hawkins, E., Hope, P., Huybrechts, P., Meinshausen, M., Mustafa, S., Plattner, G.-K., and Tréguier, A.-M.:
Framing, Context, and Methods, in: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S. L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M. I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, O., Yu, R., and Zhou, B., book section 1, pp. 147–286, Cambridge University Press, Cambridge, UK and New York, NY, USA, <a href="https://doi.org/10.1017/9781009157896.003" target="_blank">https://doi.org/10.1017/9781009157896.003</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Chevallier et al.(1998)Chevallier, Chéruy, Scott, and Chédin</label><mixed-citation>
      
Chevallier, F., Chéruy, F., Scott, N. A., and Chédin, A.:
A Neural Network Approach for a Fast and Accurate Computation of a Longwave Radiative Budget, J. Appl. Meteorol. Clim., 37, 1385–1397, <a href="https://doi.org/10.1175/1520-0450(1998)037&lt;1385:ANNAFA&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0450(1998)037&lt;1385:ANNAFA&gt;2.0.CO;2</a>, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Cotronei and Slawig(2020)</label><mixed-citation>
      
Cotronei, A. and Slawig, T.:
Single-precision arithmetic in ECHAM radiation reduces runtime and energy consumption, Geosci. Model Dev., 13, 2783–2804, <a href="https://doi.org/10.5194/gmd-13-2783-2020" target="_blank">https://doi.org/10.5194/gmd-13-2783-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Doktorowski et al.(2024)Doktorowski, Kretzschmar, Quaas, Salzmann, and Sourdeval</label><mixed-citation>
      
Doktorowski, S., Kretzschmar, J., Quaas, J., Salzmann, M., and Sourdeval, O.:
Subgrid-scale variability of cloud ice in the ICON-AES 1.3.00, Geosci. Model Dev., 17, 3099–3110, <a href="https://doi.org/10.5194/gmd-17-3099-2024" target="_blank">https://doi.org/10.5194/gmd-17-3099-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Doms et al.(2011)Doms, Förstner, Heise, Herzog, Mironov, Raschendorfer, Reinhardt, Ritter, Schrodin, Schulz, and Vogel</label><mixed-citation>
      
Doms, G., Förstner, G., Heise, E., Herzog, H.-J., Mironov, D., Raschendorfer, M., Reinhardt, T., Ritter, B., Schrodin, R., Schulz, J.-P., and Vogel, G.:
A Description of the Nonhydrostatic Regional COSMO Model. Part II: Physical Parameterization, Consortium for Small-Scale Modelling, <a href="http://www.cosmo-model.org/" target="_blank"/> (last access: 3 May 2026), 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Giorgetta et al.(2018)Giorgetta, Brokopf, Crueger, Esch, Fiedler, Helmert, Hohenegger, Kornblueh, Köhler, Manzini, Mauritsen, Nam, Raddatz, Rast, Reinert, Sakradzija, Schmidt, Schneck, Schnur, Silvers, Wan, Zängl, and Stevens</label><mixed-citation>
      
Giorgetta, M. A., Brokopf, R., Crueger, T., Esch, M., Fiedler, S., Helmert, J., Hohenegger, C., Kornblueh, L., Köhler, M., Manzini, E., Mauritsen, T., Nam, C., Raddatz, T., Rast, S., Reinert, D., Sakradzija, M., Schmidt, H., Schneck, R., Schnur, R., Silvers, L., Wan, H., Zängl, G., and Stevens, B.:
ICON-A, the Atmosphere Component of the ICON Earth System Model: I. Model Description, J. Adv. Model. Earth Sy., 10, 1613–1637, <a href="https://doi.org/10.1029/2017MS001242" target="_blank">https://doi.org/10.1029/2017MS001242</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Giorgetta et al.(2022)Giorgetta, Sawyer, Lapillonne, Adamidis, Alexeev, Clément, Dietlicher, Engels, Esch, Franke, Frauen, Hannah, Hillman, Kornblueh, Marti, Norman, Pincus, Rast, Reinert, Schnur, Schulzweida, and Stevens</label><mixed-citation>
      
Giorgetta, M. A., Sawyer, W., Lapillonne, X., Adamidis, P., Alexeev, D., Clément, V., Dietlicher, R., Engels, J. F., Esch, M., Franke, H., Frauen, C., Hannah, W. M., Hillman, B. R., Kornblueh, L., Marti, P., Norman, M. R., Pincus, R., Rast, S., Reinert, D., Schnur, R., Schulzweida, U., and Stevens, B.:
The ICON-A model for direct QBO simulations on GPUs (version icon-cscs:baf28a514), Geosci. Model Dev., 15, 6985–7016, <a href="https://doi.org/10.5194/gmd-15-6985-2022" target="_blank">https://doi.org/10.5194/gmd-15-6985-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Grundner et al.(2022)Grundner, Beucler, Gentine, Iglesias-Suarez, Giorgetta, and Eyring</label><mixed-citation>
      
Grundner, A., Beucler, T., Gentine, P., Iglesias-Suarez, F., Giorgetta, M. A., and Eyring, V.:
Deep Learning Based Cloud Cover Parameterization for ICON, J. Adv. Model. Earth Sy., 14, e2021MS002959, <a href="https://doi.org/10.1029/2021MS002959" target="_blank">https://doi.org/10.1029/2021MS002959</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Hafner(2025)</label><mixed-citation>
      
Hafner, K.: Representing Subgrid-Scale Cloud Effects in a Radiation Parameterization using Machine Learning: MLe-radiation v1.0, Zenodo [code], <a href="https://doi.org/10.5281/ZENODO.17280639" target="_blank">https://doi.org/10.5281/ZENODO.17280639</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Hafner(2026)</label><mixed-citation>
      
Hafner, K.: Code and Data for “Representing Subgrid-Scale Cloud Effects in a Radiation Parameterization using Machine Learning: MLe-radiation v1.0”, Zenodo [code and data set], <a href="https://doi.org/10.5281/zenodo.18853569" target="_blank">https://doi.org/10.5281/zenodo.18853569</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Hafner et al.(2025a)Hafner, Iglesiaz-Suarez, Shamekh, Gentine, Pincus, Giorgetta, and Eyring</label><mixed-citation>
      
Hafner, K., Iglesiaz-Suarez, F., Shamekh, S., Gentine, P., Pincus, R., Giorgetta, M., and Eyring, V.:
Stable Machine Learning based Radiation Emulation for ICON, J. Adv. Model. Earth Sy., <a href="https://doi.org/10.22541/essoar.174708082.27787580/v1" target="_blank">https://doi.org/10.22541/essoar.174708082.27787580/v1</a>, in review, 2025a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Hafner et al.(2025b)Hafner, Iglesiaz-Suarez, Shamekh, Gentine, Pincus, Girgetta, and Eyring</label><mixed-citation>
      
Hafner, K., Iglesiaz-Suarez, F., Shamekh, S., Gentine, P., Pincus, R., Girgetta, M., and Eyring, V.: Interpretable machine learning radiation parameterization for ICON, J. Geophys. Res.-Mach. Learn. Comput., 2, e2024JH000501, <a href="https://doi.org/10.1029/2024JH000501" target="_blank">https://doi.org/10.1029/2024JH000501</a>, 2025b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Henn et al.(2024)Henn, Jauregui, Clark, Brenowitz, McGibbon, Watt-Meyer, Pauling, and Bretherton</label><mixed-citation>
      
Henn, B., Jauregui, Y. R., Clark, S. K., Brenowitz, N. D., McGibbon, J., Watt-Meyer, O., Pauling, A. G., and Bretherton, C. S.:
A Machine Learning Parameterization of Clouds in a Coarse-Resolution Climate Model for Unbiased Radiation, J. Adv. Model. Earth Sy., 16, <a href="https://doi.org/10.1029/2023ms003949" target="_blank">https://doi.org/10.1029/2023ms003949</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Heuer et al.(2024)Heuer, Schwabe, Gentine, Giorgetta, and Eyring</label><mixed-citation>
      
Heuer, H., Schwabe, M., Gentine, P., Giorgetta, M. A., and Eyring, V.:
Interpretable Multiscale Machine Learning-Based Parameterizations of Convection for ICON, J. Adv. Model. Earth Sy., 16, <a href="https://doi.org/10.1029/2024ms004398" target="_blank">https://doi.org/10.1029/2024ms004398</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Hogan and Matricardi(2022)</label><mixed-citation>
      
Hogan, R. J. and Matricardi, M.:
A Tool for Generating Fast k-Distribution Gas-Optics Models for Weather and Climate Applications, J. Adv. Model. Earth Sy., 14, e2022MS003033, <a href="https://doi.org/10.1029/2022MS003033" target="_blank">https://doi.org/10.1029/2022MS003033</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Hohenegger et al.(2023)Hohenegger, Korn, Linardakis, Redler, Schnur, Adamidis, Bao, Bastin, Behravesh, Bergemann, Biercamp, Bockelmann, Brokopf, Brüggemann, Casaroli, Chegini, Datseris, Esch, George, Giorgetta, Gutjahr, Haak, Hanke, Ilyina, Jahns, Jungclaus, Kern, Klocke, Kluft, Kölling, Kornblueh, Kosukhin, Kroll, Lee, Mauritsen, Mehlmann, Mieslinger, Naumann, Paccini, Peinado, Praturi, Putrasahan, Rast, Riddick, Roeber, Schmidt, Schulzweida, Schütte, Segura, Shevchenko, Singh, Specht, Stephan, von Storch, Vogel, Wengel, Winkler, Ziemen, Marotzke, and Stevens</label><mixed-citation>
      
Hohenegger, C., Korn, P., Linardakis, L., Redler, R., Schnur, R., Adamidis, P., Bao, J., Bastin, S., Behravesh, M., Bergemann, M., Biercamp, J., Bockelmann, H., Brokopf, R., Brüggemann, N., Casaroli, L., Chegini, F., Datseris, G., Esch, M., George, G., Giorgetta, M., Gutjahr, O., Haak, H., Hanke, M., Ilyina, T., Jahns, T., Jungclaus, J., Kern, M., Klocke, D., Kluft, L., Kölling, T., Kornblueh, L., Kosukhin, S., Kroll, C., Lee, J., Mauritsen, T., Mehlmann, C., Mieslinger, T., Naumann, A. K., Paccini, L., Peinado, A., Praturi, D. S., Putrasahan, D., Rast, S., Riddick, T., Roeber, N., Schmidt, H., Schulzweida, U., Schütte, F., Segura, H., Shevchenko, R., Singh, V., Specht, M., Stephan, C. C., von Storch, J.-S., Vogel, R., Wengel, C., Winkler, M., Ziemen, F., Marotzke, J., and Stevens, B.:
ICON-Sapphire: simulating the components of the Earth system and their interactions at kilometer and subkilometer scales, Geosci. Model Dev., 16, 779–811, <a href="https://doi.org/10.5194/gmd-16-779-2023" target="_blank">https://doi.org/10.5194/gmd-16-779-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>ICON Partnership et al.(2024)</label><mixed-citation>
      
ICON Partnership (DWD, MPI-M, DKRZ, KIT, and C2SM):
ICON release 2024.01, ICON Partnership [code], <a href="https://doi.org/10.35089/WDCC/IconRelease01" target="_blank">https://doi.org/10.35089/WDCC/IconRelease01</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Kingma and Ba(2017)</label><mixed-citation>
      
Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, 3rd International Conference on Learning Representations, ICLR2015, San
Diego, CA, USA, 7–9 May  2015, Conference Track Proceedings, <a href="https://doi.org/10.48550/arXiv.1412.6980" target="_blank">https://doi.org/10.48550/arXiv.1412.6980</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Klöwer et al.(2024)Klöwer, Gelbrecht, Hotta, Willmert, Silvestri, Wagner, White, Hatfield, Kimpson, Constantinou, and Hill</label><mixed-citation>
      
Klöwer, M., Gelbrecht, M., Hotta, D., Willmert, J., Silvestri, S., Wagner, G. L., White, A., Hatfield, S., Kimpson, T., Constantinou, N. C., and Hill, C.:
SpeedyWeather.jl: Reinventing atmospheric general circulation models towards interactivity and extensibility, Journal of Open Source Software, 9, 6323, <a href="https://doi.org/10.21105/joss.06323" target="_blank">https://doi.org/10.21105/joss.06323</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Kochkov et al.(2024)Kochkov, Yuval, Langmore, Norgaard, Smith, Mooers, Klöwer, Lottes, Rasp, Düben, Hatfield, Battaglia, Sanchez-Gonzalez, Willson, Brenner, and Hoyer</label><mixed-citation>
      
Kochkov, D., Yuval, J., Langmore, I., Norgaard, P., Smith, J., Mooers, G., Klöwer, M., Lottes, J., Rasp, S., Düben, P., Hatfield, S., Battaglia, P., Sanchez-Gonzalez, A., Willson, M., Brenner, M. P., and Hoyer, S.:
Neural general circulation models for weather and climate, Nature, 632, 1060–1066, <a href="https://doi.org/10.1038/s41586-024-07744-y" target="_blank">https://doi.org/10.1038/s41586-024-07744-y</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Krasnopolsky et al.(2005)Krasnopolsky, Fox-Rabinovitz, and Chalikov</label><mixed-citation>
      
Krasnopolsky, V. M., Fox-Rabinovitz, M. S., and Chalikov, D. V.:
New Approach to Calculation of Atmospheric Model Physics: Accurate and Fast Neural Network Emulation of Longwave Radiation in a Climate Model, Mon. Weather Rev., 133, 1370–1383, <a href="https://doi.org/10.1175/MWR2923.1" target="_blank">https://doi.org/10.1175/MWR2923.1</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Lagerquist et al.(2023)Lagerquist, Turner, Ebert-Uphoff, and Stewart</label><mixed-citation>
      
Lagerquist, R., Turner, D. D., Ebert-Uphoff, I., and Stewart, J. Q.:
Estimating Full Longwave and Shortwave Radiative Transfer with Neural Networks of Varying Complexity, J. Atmos. Ocean. Tech., 40, 1407–1432, <a href="https://doi.org/10.1175/JTECH-D-23-0012.1" target="_blank">https://doi.org/10.1175/JTECH-D-23-0012.1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>LeCun et al.(2012)LeCun, Bottou, Orr, and Müller</label><mixed-citation>
      
LeCun, Y. A., Bottou, L., Orr, G. B., and Müller, K.-R.:
Efficient BackProp, Springer Berlin Heidelberg, p. 9–48, <a href="https://doi.org/10.1007/978-3-642-35289-8_3" target="_blank">https://doi.org/10.1007/978-3-642-35289-8_3</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Lohmann and Roeckner(1996)</label><mixed-citation>
      
Lohmann, U. and Roeckner, E.:
Design and performance of a new cloud microphysics scheme developed for the ECHAM general circulation model, Clim. Dynam., 12, 557–572, <a href="https://doi.org/10.1007/bf00207939" target="_blank">https://doi.org/10.1007/bf00207939</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Meyer et al.(2022)Meyer, Hogan, Dueben, and Mason</label><mixed-citation>
      
Meyer, D., Hogan, R. J., Dueben, P. D., and Mason, S. L.:
Machine Learning Emulation of 3D Cloud Radiative Effects, J. Adv. Model. Earth Sy., 14, e2021MS002550, <a href="https://doi.org/10.1029/2021MS002550" target="_blank">https://doi.org/10.1029/2021MS002550</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Pal et al.(2019)Pal, Mahajan, and Norman</label><mixed-citation>
      
Pal, A., Mahajan, S., and Norman, M. R.:
Using Deep Neural Networks as Cost-Effective Surrogate Models for Super-Parameterized E3SM Radiative Transfer, Geophys. Res. Lett., 46, 6069–6079, <a href="https://doi.org/10.1029/2018GL081646" target="_blank">https://doi.org/10.1029/2018GL081646</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Pincus et al.(2003)Pincus, Barker, and Morcrette</label><mixed-citation>
      
Pincus, R., Barker, H. W., and Morcrette, J.:
A fast, flexible, approximate technique for computing radiative transfer in inhomogeneous cloud fields, J. Geophys. Res.-Atmos., 108, <a href="https://doi.org/10.1029/2002jd003322" target="_blank">https://doi.org/10.1029/2002jd003322</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Pincus et al.(2019)Pincus, Mlawer, and Delamere</label><mixed-citation>
      
Pincus, R., Mlawer, E. J., and Delamere, J. S.:
Balancing Accuracy, Efficiency, and Flexibility in Radiation Calculations for Dynamical Models, J. Adv. Model. Earth Sy., 11, 3074–3089, <a href="https://doi.org/10.1029/2019MS001621" target="_blank">https://doi.org/10.1029/2019MS001621</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Pincus et al.(2025)Pincus, makepath LLC, and Sehnem</label><mixed-citation>
      
Pincus, R., makepath LLC, and Sehnem, J. M.: pyRTE-RRTMGP, Zenodo [code], <a href="https://doi.org/10.5281/zenodo.16644555" target="_blank">https://doi.org/10.5281/zenodo.16644555</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Räisänen et al.(2004)Räisänen, Barker, Khairoutdinov, Li, and Randall</label><mixed-citation>
      
Räisänen, P., Barker, H. W., Khairoutdinov, M. F., Li, J., and Randall, D. A.:
Stochastic generation of subgrid-scale cloudy columns for large-scale models, Q. J. Roy. Meteor. Soc., 130, 2047–2067, <a href="https://doi.org/10.1256/qj.03.99" target="_blank">https://doi.org/10.1256/qj.03.99</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Rasp(2020)</label><mixed-citation>
      
Rasp, S.:
Coupled online learning as a way to tackle instabilities and biases in neural network parameterizations: general algorithms and Lorenz 96 case study (v1.0), Geosci. Model Dev., 13, 2185–2196, <a href="https://doi.org/10.5194/gmd-13-2185-2020" target="_blank">https://doi.org/10.5194/gmd-13-2185-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Roh and Song(2020)</label><mixed-citation>
      
Roh, S. and Song, H.-J.:
Evaluation of Neural Network Emulations for Radiation Parameterization in Cloud Resolving Model, Geophys. Res. Lett., 47, e2020GL089444, <a href="https://doi.org/10.1029/2020GL089444" target="_blank">https://doi.org/10.1029/2020GL089444</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Shamekh et al.(2023)Shamekh, Lamb, Huang, and Gentine</label><mixed-citation>
      
Shamekh, S., Lamb, K. D., Huang, Y., and Gentine, P.:
Implicit learning of convective organization explains precipitation stochasticity, P. Natl. Acad. Sci. USA, 120, <a href="https://doi.org/10.1073/pnas.2216158120" target="_blank">https://doi.org/10.1073/pnas.2216158120</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Stevens et al.(2019)Stevens, Satoh, Auger, Biercamp, Bretherton, Chen, Düben, Judt, Khairoutdinov, Klocke, Kodama, Kornblueh, Lin, Neumann, Putman, Röber, Shibuya, Vanniere, Vidale, Wedi, and Zhou</label><mixed-citation>
      
Stevens, B., Satoh, M., Auger, L., Biercamp, J., Bretherton, C. S., Chen, X., Düben, P., Judt, F., Khairoutdinov, M., Klocke, D., Kodama, C., Kornblueh, L., Lin, S.-J., Neumann, P., Putman, W. M., Röber, N., Shibuya, R., Vanniere, B., Vidale, P. L., Wedi, N., and Zhou, L.:
DYAMOND: the DYnamics of the Atmospheric general circulation Modeled On Non-hydrostatic Domains, Progress in Earth and Planetary Science, 6, <a href="https://doi.org/10.1186/s40645-019-0304-z" target="_blank">https://doi.org/10.1186/s40645-019-0304-z</a>, 2019.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Ukkonen(2022)</label><mixed-citation>
      
Ukkonen, P.: Exploring Pathways to More Accurate Machine Learning Emulation of Atmospheric Radiative Transfer, J. Adv. Model. Earth Syst., 14, e2021MS002875, <a href="https://doi.org/10.1029/2021MS002875" target="_blank">https://doi.org/10.1029/2021MS002875</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Ukkonen and Chantry(2025)</label><mixed-citation>
      
Ukkonen, P. and Chantry, M.:
Vertically Recurrent Neural Networks for Sub-Grid Parameterization, J. Adv. Model. Earth Sy., 17, e2024MS004833, <a href="https://doi.org/10.1029/2024MS004833" target="_blank">https://doi.org/10.1029/2024MS004833</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Ukkonen and Hogan(2024)</label><mixed-citation>
      
Ukkonen, P. and Hogan, R. J.:
Twelve Times Faster yet Accurate: A New State-Of-The-Art in Radiation Schemes via Performance and Spectral Optimization, J. Adv. Model. Earth Sy., 16, <a href="https://doi.org/10.1029/2023ms003932" target="_blank">https://doi.org/10.1029/2023ms003932</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Watt-Meyer et al.(2024)Watt-Meyer, Brenowitz, Clark, Henn, Kwa, McGibbon, Perkins, Harris, and Bretherton</label><mixed-citation>
      
Watt-Meyer, O., Brenowitz, N. D., Clark, S. K., Henn, B., Kwa, A., McGibbon, J., Perkins, W. A., Harris, L., and Bretherton, C. S.:
Neural Network Parameterization of Subgrid-Scale Physics From a Realistic Geography Global Storm-Resolving Simulation, J. Adv. Model. Earth Sy., 16, <a href="https://doi.org/10.1029/2023ms003668" target="_blank">https://doi.org/10.1029/2023ms003668</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Wood(2012)</label><mixed-citation>
      
Wood, R.:
Stratocumulus Clouds, Mon. Weather Rev., 140, 2373–2423, <a href="https://doi.org/10.1175/mwr-d-11-00121.1" target="_blank">https://doi.org/10.1175/mwr-d-11-00121.1</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Yao et al.(2023)Yao, Zhong, Zheng, and Wang</label><mixed-citation>
      
Yao, Y., Zhong, X., Zheng, Y., and Wang, Z.:
A Physics-Incorporated Deep Learning Framework for Parameterization of Atmospheric Radiative Transfer, J. Adv. Model. Earth Sy., 15, e2022MS003445, <a href="https://doi.org/10.1029/2022MS003445" target="_blank">https://doi.org/10.1029/2022MS003445</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Zängl et al.(2014)Zängl, Reinert, Rípodas, and Baldauf</label><mixed-citation>
      
Zängl, G., Reinert, D., Rípodas, P., and Baldauf, M.:
The ICON (ICOsahedral Non-hydrostatic) modelling framework of DWD and MPI-M: Description of the non-hydrostatic dynamical core, Q. J. Roy. Meteor. Soc., 141, 563–579, <a href="https://doi.org/10.1002/qj.2378" target="_blank">https://doi.org/10.1002/qj.2378</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Zhao et al.(2024)Zhao, Li, Wen, Li, Wang, and Huang</label><mixed-citation>
      
Zhao, Y., Li, J., Wen, D., Li, Y., Wang, Y., and Huang, J.:
Distinct structure, radiative effects, and precipitation characteristics of deep convection systems in the Tibetan Plateau compared to the tropical Indian Ocean, Atmos. Chem. Phys., 24, 9435–9457, <a href="https://doi.org/10.5194/acp-24-9435-2024" target="_blank">https://doi.org/10.5194/acp-24-9435-2024</a>, 2024.

    </mixed-citation></ref-html>--></article>
