<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-19-2437-2026</article-id><title-group><article-title>Deep learning representation of the aerosol size distribution</article-title><alt-title>Deep learning representation of the aerosol size distribution</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Barahona</surname><given-names>Donifan</given-names></name>
          <email>donifan.o.barahona@nasa.gov</email>
        <ext-link>https://orcid.org/0000-0001-5786-1344</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Breen</surname><given-names>Katherine H.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-3271-1782</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Block</surname><given-names>Karoline</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-4458-2327</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Darmenov</surname><given-names>Anton</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>NASA, Goddard Space Flight Center, Greenbelt, MD, USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Morgan State University, Baltimore, MD, USA</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Leipzig Institute for Meteorology, Faculty of Physics and Earth Sciences, University of Leipzig, Leipzig, Germany</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Donifan Barahona (donifan.o.barahona@nasa.gov)</corresp></author-notes><pub-date><day>26</day><month>March</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>6</issue>
      <fpage>2437</fpage><lpage>2459</lpage>
      <history>
        <date date-type="received"><day>31</day><month>January</month><year>2025</year></date>
           <date date-type="rev-request"><day>17</day><month>March</month><year>2025</year></date>
           <date date-type="rev-recd"><day>11</day><month>February</month><year>2026</year></date>
           <date date-type="accepted"><day>20</day><month>February</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Donifan Barahona et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026.html">This article is available from https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e122">Aerosols influence Earth's radiative balance by scattering and absorbing solar radiation, affect cloud formation, and play important roles in precipitation, ocean seeding and human health. Accurate modeling of these effects requires knowledge of the chemical composition and size distribution of aerosol particles present in the atmosphere. Computationally intensive applications like remote sensing and weather forecasting commonly use simplified representations of aerosol microphysics, prescribing the aerosol size distribution (ASD), which introduces uncertainty in climate predictions and aerosol retrievals. In this work, we develop a neural network model, MAMnet, to predict the ASD and mixing state for seven lognormal modes based on the bulk aerosol mass and the meteorological state. MAMnet is designed to operate with outputs from single-moment, mass-based aerosol schemes, making it compatible with existing models. We demonstrate that MAMnet can accurately reproduce the output of a two-moment modal aerosol scheme, and also agrees well with field measurements when driven by reanalysis data. Our model paves the way to improve the representation of aerosols in atmospheric models while maintaining the versatility and efficiency required in large-scale applications.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Aeronautics and Space Administration</funding-source>
<award-id>NNH20ZDA001N-MAP</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e134">Aerosols play a crucial role in the Earth's system by influencing radiative forcing <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx11" id="paren.1"/>, cloud formation and lifetime <xref ref-type="bibr" rid="bib1.bibx27" id="paren.2"/>, and precipitation patterns <xref ref-type="bibr" rid="bib1.bibx89" id="paren.3"/>. Aerosol particle size and composition determine their atmospheric lifetime <xref ref-type="bibr" rid="bib1.bibx84" id="paren.4"/>, impact on human health <xref ref-type="bibr" rid="bib1.bibx6" id="paren.5"/>, long-range transport <xref ref-type="bibr" rid="bib1.bibx97" id="paren.6"/>, and their ability to become cloud droplets and ice crystals <xref ref-type="bibr" rid="bib1.bibx85" id="paren.7"/>. The size and composition of atmospheric aerosols are critical parameters determining the concentration of cloud condensation nuclei (CCN) in the atmosphere <xref ref-type="bibr" rid="bib1.bibx84" id="paren.8"/>. Understanding the distribution and composition of atmospheric aerosols is thus essential for accurate climate and weather simulations <xref ref-type="bibr" rid="bib1.bibx85" id="paren.9"/>.</p>
      <p id="d2e165">The ASD and mixing state are at the center of the ability of climate models to accurately simulate the transport and chemical evolution of aerosol species <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx12" id="paren.10"/>. Variability in the representation of the ASD among models has been shown to drive large differences in cloud droplet number concentration and aerosol–cloud radiative forcing <xref ref-type="bibr" rid="bib1.bibx99" id="paren.11"/>. Explicitly resolving the ASD improves the representation of nucleation, condensation, and coagulation processes <xref ref-type="bibr" rid="bib1.bibx106" id="paren.12"/>, and it is critical for realistically simulating scavenging within clouds, as smaller particles are less efficiently removed than larger ones, affecting global particle number concentrations by up to <inline-formula><mml:math id="M1" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> % <xref ref-type="bibr" rid="bib1.bibx75" id="paren.13"/>. It has been shown that models that  resolve particle-level mixing state and size better represent CCN activity, aerosol aging, and radiative properties <xref ref-type="bibr" rid="bib1.bibx81" id="paren.14"/>.</p>
      <p id="d2e191">Atmospheric models represent the ASD using approximations with different degrees of sophistication. The bulk mass approach predicts the transport and evolution of aerosols by tracking the mass concentration of individual chemical species <xref ref-type="bibr" rid="bib1.bibx48 bib1.bibx56 bib1.bibx35 bib1.bibx24" id="paren.15"/>. Each particle is assumed to consist of a single chemical component or its surrogate <xref ref-type="bibr" rid="bib1.bibx81" id="paren.16"/>. Because each species is typically represented by a single prognostic variable, the bulk approach is not designed to resolve the ASD or the mixing state, which are often prescribed from climatological data. However, due to their low computational cost, bulk schemes are well suited for data assimilation <xref ref-type="bibr" rid="bib1.bibx76" id="paren.17"/> and are widely used in forecasting systems, satellite retrieval algorithms, and reanalysis products <xref ref-type="bibr" rid="bib1.bibx28 bib1.bibx34 bib1.bibx43" id="paren.18"/>. For example, aerosol transport in the MERRA-2 climate reanalysis <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx76" id="paren.19"/> is based on the Goddard Chemistry, Aerosol, Radiation, and Transport model (GOCART) <xref ref-type="bibr" rid="bib1.bibx30" id="paren.20"/>. GOCART is a bulk aerosol scheme that explicitly calculates the mass of major species, i.e., dust, black carbon, organic material, sea salt, and sulfate, using an externally mixed representation.</p>
      <p id="d2e213">In contrast to bulk methods, modal aerosol schemes estimate both the number concentration and mass of atmospheric aerosol, approximating the ASD as the combination of overlapping populations, termed “modes”, each typically assumed internally-mixed, and following a log-normal distribution with prescribed geometric standard deviation <xref ref-type="bibr" rid="bib1.bibx101 bib1.bibx102 bib1.bibx88 bib1.bibx63 bib1.bibx59" id="paren.21"><named-content content-type="pre">e.g.,</named-content></xref>. Because they predict the number concentration and mass independently, modal schemes can better resolve the composition of aerosol species, particularly when several subpopulations are used  <xref ref-type="bibr" rid="bib1.bibx81" id="paren.22"/>.  More sophisticated aerosol schemes either compute additional moments of the ASD <xref ref-type="bibr" rid="bib1.bibx105" id="paren.23"/>, explicitly resolve it using a binned approach <xref ref-type="bibr" rid="bib1.bibx3" id="paren.24"><named-content content-type="pre">e.g.,</named-content></xref>, or represent it on a particle-by-particle basis <xref ref-type="bibr" rid="bib1.bibx81" id="paren.25"/>. While these models provide the most physically consistent representation of the ASD, they are often too computationally expensive for operational forecasting and long-term climate simulations.</p>
      <p id="d2e236">In computationally intensive applications and satellite retrievals it is desirable to maintain the efficiency and simplicity of the bulk schemes. However, key processes such as nucleation, coagulation, scavenging, and activation, as well as aerosol radiative properties, are highly sensitive to particle size, requiring the explicit representation of the ASD <xref ref-type="bibr" rid="bib1.bibx84" id="paren.26"/>. A common approach to address this is to prescribe a global mean ASD <xref ref-type="bibr" rid="bib1.bibx79 bib1.bibx9 bib1.bibx43 bib1.bibx17" id="paren.27"><named-content content-type="pre">e.g.,</named-content></xref>. Yet, aerosol size and composition vary substantially across time and space, influenced by both meteorological conditions and natural and anthropogenic sources. As a result, a fixed global ASD can only approximate the actual, locally varying distribution, potentially introducing biases in the simulation of aerosol–radiation and aerosol–cloud interactions.</p>
      <p id="d2e247">To address these challenges, there is growing interest in leveraging machine learning (ML) techniques to develop more efficient and accurate aerosol models <xref ref-type="bibr" rid="bib1.bibx77 bib1.bibx38 bib1.bibx86 bib1.bibx41" id="paren.28"><named-content content-type="pre">e.g., </named-content></xref>. ML models can in principle capture complex nonlinear relationships between aerosol properties and environmental variables at reduced computational cost. For example, <xref ref-type="bibr" rid="bib1.bibx41" id="text.29"/> developed a surrogate of the Modal Aerosol Module <xref ref-type="bibr" rid="bib1.bibx59" id="paren.30"><named-content content-type="pre">MAM7;</named-content></xref> to predict the mass and number tendencies of aerosol species, with the aim of improving computational performance. The emulator replaces computationally intensive parts of MAM7 but does not map the mass of the aerosol species to the ASD, a requirement of many assimilation and remote sensing algorithms <xref ref-type="bibr" rid="bib1.bibx76 bib1.bibx19" id="paren.31"/>. Other studies have also sought to use ML to predict the ASD from bulk aerosol mass, focusing on specific species or on the effect of the ASD on cloud formation <xref ref-type="bibr" rid="bib1.bibx69 bib1.bibx107" id="paren.32"/>.</p>
      <p id="d2e269">Here we present a novel ML-based approach for predicting the ASD and aerosol mixing state in atmospheric models that run bulk aerosol schemes. This is accomplished by developing an ML-based parameterization that emulates the ASD predicted by the MAM7 model, using as input the total mass of aerosol species from relatively fast single-moment bulk aerosol models like GOCART. By combining the strengths of machine learning and physical principles, the parameterization maps the output of the bulk mass model into the ASD that would be predicted by the modal approach, enhancing the former. Our method offers a promising avenue for advancing aerosol representation in climate predictions, data assimilation and remote sensing applications.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods and Data</title>
      <p id="d2e280">We developed a neural network (NN), termed “MAMnet”, to estimate the ASD using as input the total mass of aerosol species, air density, and temperature. This minimal set of inputs ensures that the neural network remains independent of the host model, since including additional meteorological inputs like wind speed and humidity would introduce sensitivity to model-specific parameterizations. It also makes MAMnet suitable for applications involving satellite aerosol retrievals, where only a limited set of atmospheric variables is typically available. This approach is supported by previous studies showing that the conversion between aerosol mass and number concentrations can be reasonably approximated using spatially varying, but prescribed, ASDs <xref ref-type="bibr" rid="bib1.bibx79 bib1.bibx43 bib1.bibx17" id="paren.33"><named-content content-type="pre">e.g.,</named-content></xref>, suggesting that such relationships can be effectively learned by a neural network. MAMnet was trained on simulated data from the MAM7 model implemented in NASA's Goddard Earth Observing System (GEOS). This section details the modeling components as well as the development and evaluation approach of the NN.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Modeling components</title>
      <p id="d2e295">The NASA Goddard Earth Observing System (GEOS) consists of a set of components that numerically represent different aspects of the Earth system (atmosphere, ocean, land, sea-ice, and chemistry), coupled following the Earth System Modeling Framework (<uri>https://gmao.gsfc.nasa.gov/GEOS_systems/</uri>, last access: 23 March 2026). In the GEOS-AGCM configuration, the atmospheric transport of water vapor, condensate and other tracers, and the associated land-atmosphere exchanges, are computed explicitly, whereas sea-ice and sea surface temperature (SST) are prescribed as time-dependent boundary conditions <xref ref-type="bibr" rid="bib1.bibx80 bib1.bibx82" id="paren.34"/>. Cloud microphysics in the operational version of GEOS uses a single-moment microphysics scheme for short-term weather forecasting <xref ref-type="bibr" rid="bib1.bibx67" id="paren.35"/>, and a two-moment cloud scheme for subseasonal and seasonal prediction <xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx68" id="paren.36"/>. GEOS constitutes the modeling base of MERRA-2 (Modern Era Retrospective analysis for Research and Applications, version 2), the first multidecadal reanalysis to integrate both aerosol and meteorological observations <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx76" id="paren.37"/>. In MERRA-2, aerosol fields are described using GOCART. Aerosols are interactive and radiatively active, hence MERRA-2 has a representation of the aerosol direct effect. Aerosol assimilation uses the Goddard Aerosol Assimilation System (GAAS), and the overall assimilation cycle is controlled by the meteorology.</p>
<sec id="Ch1.S2.SS1.SSS1">
  <label>2.1.1</label><title>Aerosol transport schemes</title>
      <p id="d2e320">GEOS implements two aerosol schemes to interactively calculate the evolution of aerosol and gaseous tracers. Both include parameterized representations of aerosol formation, growth, aging and wet removal, and differ in their treatment of the ASD and mixing state. GOCART <xref ref-type="bibr" rid="bib1.bibx24 bib1.bibx29" id="paren.38"/> is used operationally in weather forecasting and data assimilation applications. GOCART is a mass-based aerosol model that explicitly calculates the transport and evolution of dust, black carbon, organic material, sea salt, and sulfate. Aerosol species are assumed externally mixed. Dust and sea salt are each represented in five mass bins of different sizes, whereas a single bin is assumed for the other species. The ASD for each bin is prescribed as a lognormal distribution. Both organics (primary and secondary organic matter) and black carbon are split into hydrophilic and hydrophobic components. Dust and sea salt emissions are prognostic whereas sulfate and biomass burning emissions are obtained from the MERRA-2 dataset <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx76" id="paren.39"/>.</p>
      <p id="d2e329">GEOS also implements the MAM7 model <xref ref-type="bibr" rid="bib1.bibx59" id="paren.40"/> as an alternative aerosol scheme for research applications. MAM7 is a modal aerosol scheme that predicts the mass and number concentration of seven aerosol modes: Aitken (AIT), accumulation (ACC), primary carbon (PCM), fine dust (FDU), fine sea salt (FSS), coarse dust (CDU), and coarse sea salt (CSS). The aerosol representation is internally mixed, with aerosol species and modal composition as detailed in Tables <xref ref-type="table" rid="T1"/> and <xref ref-type="table" rid="T2"/>. The total number of simulated tracers in MAM7 is 31: 24 modal mass components and seven aerosol number concentrations. The size distribution of each mode is assumed to be lognormal, with a geometric mean diameter computed diagnostically and a prescribed geometric standard deviation <xref ref-type="bibr" rid="bib1.bibx59" id="paren.41"/>.</p>
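      <p>As an illustration of how a geometric mean diameter can be diagnosed from the mass and number of a mode, the following minimal sketch applies the standard moment relation for a lognormal distribution. It assumes spherical particles and a single bulk density per mode; the function name, the example values, and the use of one density for an internally mixed mode are assumptions made for this example and are not taken from the MAM7 code.</p>
<preformat preformat-type="code">
```python
import math


def geometric_mean_diameter(mass, number, density, sigma_g):
    """Number geometric mean diameter (m) of one lognormal mode.

    mass    : aerosol mass mixing ratio of the mode (kg per kg of air)
    number  : particle number mixing ratio of the mode (particles per kg of air)
    density : assumed bulk particle density (kg m-3)
    sigma_g : prescribed geometric standard deviation of the mode
    """
    # Mean volume of a single particle, assuming spheres of uniform density.
    mean_particle_volume = (mass / density) / number
    # Diameter of a sphere with that mean volume.
    volume_mean_diameter = (6.0 * mean_particle_volume / math.pi) ** (1.0 / 3.0)
    # Lognormal moment relation: D_vol**3 = D_g**3 * exp(4.5 * ln(sigma_g)**2).
    return volume_mean_diameter * math.exp(-1.5 * math.log(sigma_g) ** 2)


# Illustrative (made-up) accumulation-mode values: 1 microgram per kg of air,
# 1e8 particles per kg of air, sulfate density from Table 1, sigma_g from Table 2.
d_g = geometric_mean_diameter(mass=1.0e-9, number=1.0e8, density=1600.0, sigma_g=1.8)
print(f"geometric mean diameter: {d_g * 1e9:.0f} nm")
```
</preformat>
      <p>Because the geometric standard deviation is prescribed, mass and number together fix the full size distribution of the mode in this relation.</p>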

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e345">Aerosol species considered in this work; <inline-formula><mml:math id="M2" display="inline"><mml:mi mathvariant="italic">κ</mml:mi></mml:math></inline-formula> is the hygroscopicity parameter <xref ref-type="bibr" rid="bib1.bibx53" id="paren.42"/>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Abbreviation</oasis:entry>
         <oasis:entry colname="col2">Description</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M3" display="inline"><mml:mi mathvariant="italic">κ</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">Density</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">(kg m<sup>−3</sup>)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">SU</oasis:entry>
         <oasis:entry colname="col2">Sulfates</oasis:entry>
         <oasis:entry colname="col3">0.64</oasis:entry>
         <oasis:entry colname="col4">1600</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AMM</oasis:entry>
         <oasis:entry colname="col2">Ammonium</oasis:entry>
         <oasis:entry colname="col3">0.64</oasis:entry>
         <oasis:entry colname="col4">1600</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SS</oasis:entry>
         <oasis:entry colname="col2">Sea salt</oasis:entry>
         <oasis:entry colname="col3">1.3</oasis:entry>
         <oasis:entry colname="col4">2200</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SOM</oasis:entry>
         <oasis:entry colname="col2">Secondary organic matter</oasis:entry>
         <oasis:entry colname="col3">0.25</oasis:entry>
         <oasis:entry colname="col4">900</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">POM</oasis:entry>
         <oasis:entry colname="col2">Primary Organic Matter</oasis:entry>
         <oasis:entry colname="col3">0.25</oasis:entry>
         <oasis:entry colname="col4">900</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">BC</oasis:entry>
         <oasis:entry colname="col2">Black carbon</oasis:entry>
         <oasis:entry colname="col3">0.01</oasis:entry>
         <oasis:entry colname="col4">1600</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">DU</oasis:entry>
         <oasis:entry colname="col2">Dust</oasis:entry>
         <oasis:entry colname="col3">0.1</oasis:entry>
         <oasis:entry colname="col4">1700</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e532">Aerosol modes predicted by MAM7 and MAMnet; <inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">g</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the geometric standard deviation.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Abbreviation</oasis:entry>
         <oasis:entry colname="col2">Mode</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">g</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">Species in mode</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">ACC</oasis:entry>
         <oasis:entry colname="col2">Accumulation</oasis:entry>
         <oasis:entry colname="col3">1.8</oasis:entry>
         <oasis:entry colname="col4">SU, AMM, SOM, POM, BC, SS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AIT</oasis:entry>
         <oasis:entry colname="col2">Aitken</oasis:entry>
         <oasis:entry colname="col3">1.6</oasis:entry>
         <oasis:entry colname="col4">SU, AMM, SOM, SS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CDU</oasis:entry>
         <oasis:entry colname="col2">Coarse dust</oasis:entry>
         <oasis:entry colname="col3">1.8</oasis:entry>
         <oasis:entry colname="col4">SU, AMM, DU</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CSS</oasis:entry>
         <oasis:entry colname="col2">Coarse sea salt</oasis:entry>
         <oasis:entry colname="col3">2.0</oasis:entry>
         <oasis:entry colname="col4">SU, AMM, SS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FDU</oasis:entry>
         <oasis:entry colname="col2">Fine dust</oasis:entry>
         <oasis:entry colname="col3">1.8</oasis:entry>
         <oasis:entry colname="col4">SU, AMM, DU</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FSS</oasis:entry>
         <oasis:entry colname="col2">Fine sea salt</oasis:entry>
         <oasis:entry colname="col3">2.0</oasis:entry>
         <oasis:entry colname="col4">SU, AMM, SS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">PCM</oasis:entry>
         <oasis:entry colname="col2">Primary carbon matter</oasis:entry>
         <oasis:entry colname="col3">1.6</oasis:entry>
         <oasis:entry colname="col4">POM, BC</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Development of the deep learning model</title>
      <p id="d2e705">We built a neural network, termed “MAMnet”, to estimate the aerosol number concentration and composition emulating the output of the MAM7 model (Table <xref ref-type="table" rid="T2"/>), using as input the total mass mixing ratios for dust, sulfates, organics, black carbon and sea salt, and the atmospheric state (temperature, <inline-formula><mml:math id="M7" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, and air density, <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">air</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), for a total of 31 predicted tracers as shown in Fig. <xref ref-type="fig" rid="F1"/>. MAMnet is intended to map the simulated aerosol mass across species into a 7-modal ASD, rather than to fully emulate MAM7. This is arguably a simpler task than emulating the full range of aerosol processes represented in MAM7, since the aerosol mass fields used as input already encapsulate the integrated effects of meteorology and clouds, as well as trends in aerosol emissions. The mass-number relationship, on the other hand, is not expected to depend strongly on such factors, since in many cases it can be approximated to some degree using prescribed formulations for the ASD <xref ref-type="bibr" rid="bib1.bibx65" id="paren.43"/>. This section describes the development of the NN.</p>

      <fig id="F1"><label>Figure 1</label><caption><p id="d2e735">Neural network development workflow. Blue arrows represent the training steps, while red arrows correspond to the inference process. During training, the “mapping” step aggregates aerosol species across modes from the MAM7 output to construct the input <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi mathvariant="normal">MAM</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The MAM7 output is then used to calculate the MAMnet loss (calculated as the minimum mean square difference between the model prediction and the MAM7 fields). During inference, input from MERRA-2 is used to predict the aerosol size distribution and mixing state. MAMnet consists of a single input layer (black), seven hidden layers (orange), and one output layer (green). <inline-formula><mml:math id="M10" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> and AIRD represent the temperature and air density, respectively. <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the number of samples.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f01.png"/>

        </fig>
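      <p>The layer layout described in the caption of Fig. 1 can be sketched as a plain feed-forward pass. The sketch below is only illustrative: the seven inputs, seven hidden layers and 31 outputs follow the text, whereas the hidden-layer width, the ReLU activation, the weight initialization and all function names are hypothetical choices made for this example.</p>
<preformat preformat-type="code">
```python
import random

N_INPUTS = 7         # five bulk species masses, temperature and air density (from the text)
N_OUTPUTS = 31       # 24 modal masses and seven number concentrations (from the text)
N_HIDDEN_LAYERS = 7  # from the caption of Fig. 1
HIDDEN_WIDTH = 16    # hypothetical; not specified in this section


def init_layer(n_in, n_out, rng):
    """Random weights (one row per output unit) and zero biases."""
    scale = (2.0 / n_in) ** 0.5
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def init_network(rng):
    sizes = [N_INPUTS] + [HIDDEN_WIDTH] * N_HIDDEN_LAYERS + [N_OUTPUTS]
    return [init_layer(sizes[i], sizes[i + 1], rng) for i in range(len(sizes) - 1)]


def forward(network, x):
    """Dense layers with ReLU on the hidden layers and a linear output layer."""
    last = len(network) - 1
    for i, (weights, biases) in enumerate(network):
        x = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
        if i != last:
            x = [max(0.0, v) for v in x]  # ReLU is an assumption of this sketch
    return x


def mean_square_error(prediction, target):
    """Mean square difference between a prediction and the reference fields."""
    return sum((p - t) ** 2 for p, t in zip(prediction, target)) / len(prediction)


network = init_network(random.Random(0))
prediction = forward(network, [0.0] * N_INPUTS)  # one standardized sample
print(len(prediction))
```
</preformat>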

<sec id="Ch1.S2.SS2.SSS1">
  <label>2.2.1</label><title>Data generation</title>
      <p id="d2e780">The AGCM configuration of GEOS, running MAM7 (referred to as “GEOS+MAM7”), was used to develop a robust dataset to train the neural network. We ran a 5-year simulation (2001–2006) at 1° horizontal resolution and 72 vertical levels (from the surface to 0.01 hPa), with twice-daily instantaneous outputs at 09:00 and 21:00 UTC. Temperature and horizontal winds were “replayed” to MERRA-2. The replay technique is a form of nudging that combines analysis increments with the model results every six hours, at each model grid point, to correct the model state <xref ref-type="bibr" rid="bib1.bibx93" id="paren.44"/>. This ensures that the simulation reproduces the observed meteorological state. Aerosol mass, however, evolves free of observational constraints.</p>
      <p id="d2e786">To construct the training dataset, we randomly selected 25 unique output files (without replacement) from the years 2001–2005. An additional 10 unique files were used as validation data to compute the loss during training. Each file represents global instantaneous output from the GEOS+MAM7 model. Although the validation data are not used to update the network parameters, the validation loss informs optimization choices (e.g., early stopping and hyperparameter selection) and is therefore considered part of the training process. For testing, we used 5 separate output files from the year 2006, which was not included in either the training or validation stages, ensuring a fully independent evaluation set.</p>
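      <p>The file selection described above can be sketched as follows. This is a minimal example under stated assumptions: output files are identified here by their timestamps (09:00 and 21:00 UTC each day), the validation files are assumed to be drawn from the same 2001–2005 pool as the training files, and the random seed and function name are hypothetical.</p>
<preformat preformat-type="code">
```python
import random
from datetime import datetime, timedelta


def output_times(first_year, last_year):
    """All 09:00 and 21:00 UTC output times from first_year to last_year inclusive."""
    times = []
    day = datetime(first_year, 1, 1)
    while day.year != last_year + 1:
        times.append(day + timedelta(hours=9))
        times.append(day + timedelta(hours=21))
        day += timedelta(days=1)
    return times


rng = random.Random(42)  # hypothetical seed, for reproducibility only

# random.sample draws without replacement, so no file is selected twice and
# the training and validation sets cannot overlap.
pool = output_times(2001, 2005)
picked = rng.sample(pool, 25 + 10)
train_times, val_times = picked[:25], picked[25:]

# The test files come from 2006, a year never seen during training or validation.
test_times = rng.sample(output_times(2006, 2006), 5)
```
</preformat>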
      <p id="d2e789">Each grid cell in the GEOS+MAM7 output is treated as an independent training example, resulting in a large volume of data: with <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">time</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">25</mml:mn></mml:mrow></mml:math></inline-formula> timestamps, <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">lev</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">72</mml:mn></mml:mrow></mml:math></inline-formula> vertical levels, <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">lat</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">181</mml:mn></mml:mrow></mml:math></inline-formula> latitudes, and <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">lon</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">360</mml:mn></mml:mrow></mml:math></inline-formula> longitudes, the training set contains over 100 million samples. The samples are randomly shuffled in time and space prior to training. This single-cell approach makes the parameterization resolution-independent, facilitating integration into atmospheric models with varying grid resolutions. It also ensures broad coverage of physically plausible combinations of aerosol mass and number. Although this approach omits spatial and vertical correlations, the mass-number relationship depends primarily on the relative abundance of species within a given grid cell. This was tested using a full-column input structure, which resulted in no significant gain in accuracy (not shown).</p>
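      <p>The sample count quoted above follows directly from the grid dimensions. The short sketch below reproduces the arithmetic and shows one way to map a flat sample index back to its grid cell; the row-major index convention, the random draw and the function name are assumptions made for this example.</p>
<preformat preformat-type="code">
```python
import math
import random

GRID_SHAPE = (25, 72, 181, 360)  # (N_time, N_lev, N_lat, N_lon) from the text

# Every grid cell of every selected output file is one training sample.
n_samples = math.prod(GRID_SHAPE)


def unravel(sample_index, shape=GRID_SHAPE):
    """Map a flat sample index to (time, lev, lat, lon) indices, row-major order."""
    indices = []
    for size in reversed(shape):
        sample_index, idx = divmod(sample_index, size)
        indices.append(idx)
    return tuple(reversed(indices))


# Drawing flat indices at random mixes samples across time and space.
rng = random.Random(0)
batch = [unravel(rng.randrange(n_samples)) for _ in range(4)]

print(n_samples)  # 117288000
```
</preformat>
      <p>Since each sample carries no information about its neighbours, the same mapping applies unchanged to a host model with a different grid.</p>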
      <p id="d2e853">To balance data volume and temporal representativeness, we sampled at 12 h intervals, which allows the network to capture differences between day and night while maximizing the number of training samples. Higher-frequency sampling could better resolve the diurnal cycle, but at the cost of fewer training time steps due to memory limitations. This limitation is not expected to be critical, as the relationship between aerosol mass and number should exhibit weaker diurnal variability than mass itself, which is already resolved by the host model.</p>
      <p id="d2e857">We combined the internally mixed modal mass components parameterized by MAM7 into five species groups, namely sulfate plus ammonium, sea salt, dust, primary plus secondary organic matter, and black carbon (Table <xref ref-type="table" rid="T1"/> and Fig. <xref ref-type="fig" rid="F1"/>), to derive the total mass mixing ratios used as input features. Neither MERRA-2 nor the current implementation of MAM7 in GEOS includes nitrate aerosol species. The mass input variables were first log<sub>10</sub>-transformed, and the resulting values were then standardized by computing <inline-formula><mml:math id="M17" display="inline"><mml:mi>Z</mml:mi></mml:math></inline-formula>-scores using the global mean and standard deviation across all levels. Temperature and air density were also standardized using their global mean and standard deviation. Statistics used for normalization were calculated using 100 random instantaneous output files not used during training. Target variables included the mass of each of the MAM7 species and the number concentration for each mode.
Because aerosol mass and number concentration vary over several orders of magnitude, we log<sub>10</sub>-transformed the targets and filtered out values below minimum thresholds of <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M20" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g kg<sup>−1</sup> and <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> mg<sup>−1</sup> for mass and number, respectively, prior to training. Values below these thresholds are held constant and therefore do not contribute to the gradient. During testing, points below the thresholds are masked. All metrics are computed in logarithmic space.</p>
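      <p>The transformations described above can be illustrated with a minimal Python sketch. It is not the preprocessing code used for MAMnet: the function names, the use of NumPy, and the clipping of targets at the threshold (one possible reading of values being held constant) are assumptions made for illustration. The normalization statistics are passed as arguments because their values are not reported here.</p>
      <preformat preformat-type="code">
```python
import numpy as np

# Minimum thresholds quoted in the text (mass in ug/kg, number in 1/mg).
MASS_FLOOR = 1.0e-20
NUMBER_FLOOR = 1.0e-4


def standardize_log10(values, mean_log, std_log):
    """log10-transform strictly positive inputs, then convert to Z-scores.

    mean_log and std_log are the global mean and standard deviation of the
    log10-transformed variable, computed beforehand from separate files.
    """
    logged = np.log10(np.asarray(values, dtype=float))
    return (logged - mean_log) / std_log


def log10_target(values, floor):
    """Return log10 targets and a boolean mask of points at or above floor.

    Assumption: values below the floor are clipped to it, so they stay
    constant during training; the mask can exclude them during testing.
    """
    values = np.asarray(values, dtype=float)
    valid = values >= floor
    return np.log10(np.maximum(values, floor)), valid
```
      </preformat>
      <p>In this sketch the mask returned by <monospace>log10_target</monospace> plays the role of the test-time masking, while the clipped values stand in for the constant sub-threshold targets seen during training.</p>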
      <p id="d2e950">The modal aerosol dry diameter (hereafter <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) was not directly included as a target of MAMnet and is not part of the loss function. Instead, it was used to check for mass conservation: agreement between the predicted and target <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> indicates that the predicted mass and number concentrations remain mutually consistent. <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was derived for each <inline-formula><mml:math id="M27" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th mode in the form <xref ref-type="bibr" rid="bib1.bibx84" id="paren.45"/>,

              <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M28" display="block"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi mathvariant="normal">pg</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">6</mml:mn><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi mathvariant="normal">sp</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mi>exp⁡</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:msup><mml:mi>ln⁡</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">g</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mn 
mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">g</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the geometric standard deviation and number concentration for the <inline-formula><mml:math id="M31" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th mode, respectively. <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> are the mass and density of the <inline-formula><mml:math id="M34" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>th species in the <inline-formula><mml:math id="M35" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th mode, respectively, and <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi mathvariant="normal">sp</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the number of species present in the mode.</p>
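      <p>Eq. (1) can be evaluated as in the following sketch. The function name and argument layout are illustrative assumptions and not part of MAMnet. Any consistent unit system may be used, provided that number and mass are expressed on the same air basis (for example, SI units per unit volume of air).</p>
      <preformat preformat-type="code">
```python
import numpy as np


def modal_dry_diameter(number, masses, densities, sigma_g):
    """Eq. (1): geometric mean dry diameter of one lognormal mode.

    number    : number concentration N_i of the mode
    masses    : masses M_j,i of the species in the mode (same air basis)
    densities : densities rho_j,i of the species
    sigma_g   : geometric standard deviation of the mode
    """
    # Total dry volume of the mode from species masses and densities.
    volume = np.sum(np.asarray(masses, dtype=float)
                    / np.asarray(densities, dtype=float))
    # Diameter of a sphere holding the mean volume per particle.
    mean_volume_diameter = (6.0 * volume / (np.pi * number)) ** (1.0 / 3.0)
    # Lognormal correction from mean-volume to geometric mean diameter.
    return mean_volume_diameter * np.exp(-1.5 * np.log(sigma_g) ** 2)
```
      </preformat>
      <p>For a monodisperse mode (geometric standard deviation of 1) the exponential factor equals 1, and the function returns the diameter of the equal-volume sphere, which gives a quick sanity check.</p>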
</sec>
<sec id="Ch1.S2.SS2.SSS2">
  <label>2.2.2</label><title>Model architecture</title>
      <p id="d2e1216">Several levels of architectural complexity were tested for MAMnet, including multilayer perceptrons (MLPs) and convolutional neural networks (CNNs) <xref ref-type="bibr" rid="bib1.bibx13" id="paren.46"/>. These architectures have been successful in capturing multi-scale behaviors of GCMs for different physical properties <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx77 bib1.bibx10" id="paren.47"><named-content content-type="pre">e.g.,</named-content></xref>. MLPs process the entire input feature vector at once, which results in a larger number of model parameters to optimize. Each prediction is therefore local, in that it applies to a single model level within one grid cell and time step, while the weights are learned from information spanning all grid cells and time steps. In contrast, CNNs extract features from smaller spatiotemporal blocks, so that local decisions are influenced by nearby information, with the receptive field of each sample treated as a hyperparameter. Testing of both architectures showed that the MLP configuration performed better and was easier to optimize. The final architecture is shown in Fig. <xref ref-type="fig" rid="F1"/>. Hyperparameters targeting generalization (dropout rate, activation function, batch size) were tuned so that the optimized model not only minimized the error on the validation data but also minimized the difference between the training and validation losses, as detailed in the Appendix.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Observational data</title>
      <p id="d2e1239">In addition to synthetic data, the neural network was evaluated on its ability to reproduce observations when driven by MERRA-2 reanalysis output. This was important for testing the reliability of MAMnet when applied outside of the purely simulated environment. Near-surface aerosol number concentrations for particle diameters from 30 to 500 nm, compiled by <xref ref-type="bibr" rid="bib1.bibx7" id="text.48"/>, were used for model evaluation. The dataset includes two years (2008–2009) of hourly measurements from 24 sites across Western Europe, as detailed in Table <xref ref-type="table" rid="T3"/>. These measurements were collected from two major monitoring networks: the European Supersites for Atmospheric Aerosol Research (EUSAAR) project, part of the Sixth Framework Programme of the European Commission <xref ref-type="bibr" rid="bib1.bibx74" id="paren.49"/>, and the German Ultrafine Aerosol Network (GUAN) <xref ref-type="bibr" rid="bib1.bibx15" id="paren.50"/>. The data are reported as cumulative number concentrations for four aerosol size ranges, <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mn mathvariant="normal">30</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mn mathvariant="normal">50</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mn mathvariant="normal">100</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mn mathvariant="normal">250</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, defined as,

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M41" display="block"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>X</mml:mi></mml:mrow><mml:mi>Y</mml:mi></mml:munderover><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the aerosol dry diameter and the subscript <inline-formula><mml:math id="M43" display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> is the lower diameter threshold of the size range. The sum runs from <inline-formula><mml:math id="M44" display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> to an upper bound of <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">500</mml:mn></mml:mrow></mml:math></inline-formula> nm for <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M47" display="inline"><mml:mn mathvariant="normal">250</mml:mn></mml:math></inline-formula> nm, and of <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> nm for <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> nm. Equivalently, these can be calculated from the predicted ASD in the form <xref ref-type="bibr" rid="bib1.bibx84" id="paren.51"/>,

            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M50" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>X</mml:mi></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">mod</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mfenced open="[" close=""><mml:mrow><mml:mi mathvariant="normal">erf</mml:mi><mml:mfenced close=")" open="("><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>ln⁡</mml:mi><mml:mi>Y</mml:mi><mml:mo>-</mml:mo><mml:mi>ln⁡</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi mathvariant="normal">pg</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msqrt><mml:mn mathvariant="normal">2</mml:mn></mml:msqrt><mml:mi>ln⁡</mml:mi><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">g</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced open="" close="]"><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="normal">erf</mml:mi><mml:mfenced close=")" open="("><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>ln⁡</mml:mi><mml:mi>X</mml:mi><mml:mo>-</mml:mo><mml:mi>ln⁡</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi 
mathvariant="normal">pg</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msqrt><mml:mn mathvariant="normal">2</mml:mn></mml:msqrt><mml:mi>ln⁡</mml:mi><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">g</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">mod</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">7</mml:mn></mml:mrow></mml:math></inline-formula> is the number of lognormal modes. For each site, MERRA-2-derived aerosol mass concentration, temperature, and air density are interpolated to the location and time of the measurements and then used in MAMnet to predict the ASD. Using Eqs. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) and (<xref ref-type="disp-formula" rid="Ch1.E3"/>), <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>X</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is predicted as the average of the two lowermost model levels (i.e., nearest the surface) and compared against the observations.</p>
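      <p>As an illustration of Eq. (3), the sketch below sums the lognormal contribution of each mode between two diameter bounds. The function name, the argument order, and the choice of nanometres for all diameters are assumptions made for this example; the modal parameters would come from the predicted ASD.</p>
      <preformat preformat-type="code">
```python
import math


def number_in_size_range(x_nm, y_nm, numbers, d_pg_nm, sigma_g):
    """Eq. (3): number concentration between diameters x_nm and y_nm.

    numbers, d_pg_nm and sigma_g hold one entry per lognormal mode.
    Every sigma_g must be greater than 1 so that ln(sigma_g) is positive.
    """
    total = 0.0
    for n_i, d_i, s_i in zip(numbers, d_pg_nm, sigma_g):
        scale = math.sqrt(2.0) * math.log(s_i)
        upper = math.erf((math.log(y_nm) - math.log(d_i)) / scale)
        lower = math.erf((math.log(x_nm) - math.log(d_i)) / scale)
        total += 0.5 * n_i * (upper - lower)
    return total
```
      </preformat>
      <p>With the bounds defined above, the 50 nm threshold would correspond to a call with <monospace>x_nm=50</monospace> and <monospace>y_nm=500</monospace>, and the 30 nm threshold to <monospace>x_nm=30</monospace> and <monospace>y_nm=50</monospace>.</p>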

<table-wrap id="T3" specific-use="star"><label>Table 3</label><caption><p id="d2e1593">Datasets for the period 2008–2009 used for comparison with surface aerosol size distributions predicted by MAMnet. The original data reference is given, although all data sets used in this work were curated by <xref ref-type="bibr" rid="bib1.bibx7" id="text.52"/>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Station Name</oasis:entry>
         <oasis:entry colname="col2">Station</oasis:entry>
         <oasis:entry colname="col3">Altitude</oasis:entry>
         <oasis:entry colname="col4">Reference</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Code</oasis:entry>
         <oasis:entry colname="col3">(m a.s.l.)</oasis:entry>
         <oasis:entry colname="col4"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Aspvreten</oasis:entry>
         <oasis:entry colname="col2">ASP</oasis:entry>
         <oasis:entry colname="col3">30</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx94" id="text.53"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Birkenes</oasis:entry>
         <oasis:entry colname="col2">BIR</oasis:entry>
         <oasis:entry colname="col3">190</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx4" id="text.54"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Pallas</oasis:entry>
         <oasis:entry colname="col2">PAL</oasis:entry>
         <oasis:entry colname="col3">560</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx58" id="text.55"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Preila</oasis:entry>
         <oasis:entry colname="col2">PLA</oasis:entry>
         <oasis:entry colname="col3">5</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx96" id="text.56"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SMEAR II</oasis:entry>
         <oasis:entry colname="col2">SMR</oasis:entry>
         <oasis:entry colname="col3">181</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx42" id="text.57"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Vavihill</oasis:entry>
         <oasis:entry colname="col2">VHL</oasis:entry>
         <oasis:entry colname="col3">172</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx54" id="text.58"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Bösel</oasis:entry>
         <oasis:entry colname="col2">BOE</oasis:entry>
         <oasis:entry colname="col3">16</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx15" id="text.59"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">K-Puszta</oasis:entry>
         <oasis:entry colname="col2">KPO</oasis:entry>
         <oasis:entry colname="col3">125</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx52" id="text.60"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Melpitz</oasis:entry>
         <oasis:entry colname="col2">MPZ</oasis:entry>
         <oasis:entry colname="col3">87</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx31" id="text.61"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Kosetice</oasis:entry>
         <oasis:entry colname="col2">OBK</oasis:entry>
         <oasis:entry colname="col3">534</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx22" id="text.62"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Hohenpeissenberg</oasis:entry>
         <oasis:entry colname="col2">HPB</oasis:entry>
         <oasis:entry colname="col3">988</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx14" id="text.63"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Waldhof</oasis:entry>
         <oasis:entry colname="col2">WAL</oasis:entry>
         <oasis:entry colname="col3">70</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx15" id="text.64"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Cabauw</oasis:entry>
         <oasis:entry colname="col2">CBW</oasis:entry>
         <oasis:entry colname="col3">60</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx83" id="text.65"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Harwell</oasis:entry>
         <oasis:entry colname="col2">HWL</oasis:entry>
         <oasis:entry colname="col3">60</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx23" id="text.66"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Mace Head</oasis:entry>
         <oasis:entry colname="col2">MHD</oasis:entry>
         <oasis:entry colname="col3">5</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx46" id="text.67"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Finokalia</oasis:entry>
         <oasis:entry colname="col2">FKL</oasis:entry>
         <oasis:entry colname="col3">250</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx66" id="text.68"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">JRC-Ispra</oasis:entry>
         <oasis:entry colname="col2">ISP</oasis:entry>
         <oasis:entry colname="col3">209</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx39" id="text.69"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Zeppelin</oasis:entry>
         <oasis:entry colname="col2">ZEP</oasis:entry>
         <oasis:entry colname="col3">474</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx90" id="text.70"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Puy de Dôme</oasis:entry>
         <oasis:entry colname="col2">PDD</oasis:entry>
         <oasis:entry colname="col3">1465</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx98" id="text.71"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Schauinsland</oasis:entry>
         <oasis:entry colname="col2">SCH</oasis:entry>
         <oasis:entry colname="col3">1210</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx15" id="text.72"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Zugspitze</oasis:entry>
         <oasis:entry colname="col2">ZSF</oasis:entry>
         <oasis:entry colname="col3">2650</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx15" id="text.73"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Jungfraujoch</oasis:entry>
         <oasis:entry colname="col2">JFJ</oasis:entry>
         <oasis:entry colname="col3">3580</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx49" id="text.74"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">BEO Moussala</oasis:entry>
         <oasis:entry colname="col2">BEO</oasis:entry>
         <oasis:entry colname="col3">2971</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx71" id="text.75"/>
                  </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Monte Cimone</oasis:entry>
         <oasis:entry colname="col2">CMN</oasis:entry>
         <oasis:entry colname="col3">2165</oasis:entry>
         <oasis:entry colname="col4">
                    <xref ref-type="bibr" rid="bib1.bibx64" id="text.76"/>
                  </oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e2083">We also used estimates of the concentration of cloud condensation nuclei, <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, to place MAMnet in the context of observationally constrained estimates. To calculate <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from MAMnet, we followed the method of <xref ref-type="bibr" rid="bib1.bibx33" id="text.77"/>, which estimates <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from the derived seven-mode size distribution and modal composition, with the MERRA-2 fields as inputs. Hygroscopicity parameters (<inline-formula><mml:math id="M56" display="inline"><mml:mi mathvariant="italic">κ</mml:mi></mml:math></inline-formula>) for each mode were obtained by volume-weighting the values for each aerosol species listed in Table <xref ref-type="table" rid="T1"/>. Because <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is strongly influenced by the ASD and composition, it serves as a useful diagnostic for evaluating the estimation of particle size. CCN concentrations are highly sensitive to aerosol size <xref ref-type="bibr" rid="bib1.bibx57" id="paren.78"/>, as larger and more hygroscopic particles are more likely to activate into cloud droplets.
As a result, <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> tends to be enhanced in populations dominated by such particles. Underestimation of particle size therefore translates into a lower <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
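      <p>The volume-weighting of the hygroscopicity parameter for one mode can be sketched as follows. The function name is hypothetical, and the per-species values must be supplied by the caller (for example, from Table 1); no specific values are assumed here.</p>
      <preformat preformat-type="code">
```python
import numpy as np


def volume_weighted_kappa(masses, densities, kappas):
    """Volume-weighted mean hygroscopicity of one aerosol mode.

    masses, densities and kappas hold one entry per species in the mode.
    """
    # Species volumes follow from mass divided by density.
    volumes = (np.asarray(masses, dtype=float)
               / np.asarray(densities, dtype=float))
    weights = volumes / np.sum(volumes)
    return float(np.sum(weights * np.asarray(kappas, dtype=float)))
```
      </preformat>
      <p>Weighting by volume rather than by mass means that a dense, non-hygroscopic species lowers the modal value less than its mass fraction alone would suggest.</p>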
      <p id="d2e2169">Global <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> datasets are typically derived from bulk aerosol mass using simplified assumptions about the ASD. For example, <xref ref-type="bibr" rid="bib1.bibx26" id="text.79"/> estimated <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from spaceborne CALIOP (Cloud-Aerosol Lidar with Orthogonal Polarization) lidar measurements using pre-computed conversion factors. <xref ref-type="bibr" rid="bib1.bibx17" id="text.80"/> derived <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> based on the latest Copernicus Atmosphere Monitoring Service (CAMS) reanalysis provided by the European Centre for Medium-Range Weather Forecasts (ECMWF) by introducing assumptions on ASD and composition <xref ref-type="bibr" rid="bib1.bibx16" id="paren.81"/>. Similarly, the GiOcean atmosphere–ocean–aerosol reanalysis <xref ref-type="bibr" rid="bib1.bibx87" id="paren.82"/>, derived from the NASA GEOS-S2S system <xref ref-type="bibr" rid="bib1.bibx68" id="paren.83"/>, incorporates a more advanced model framework. Unlike MERRA-2, which only assimilates the atmospheric state, GiOcean is a coupled atmosphere–ocean reanalysis that includes two-moment cloud microphysics, enabling the explicit calculation of <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.
Although GiOcean still relies on assumptions about aerosol size and composition, its CCN fields directly influence cloud processes and, in turn, aerosol evolution. In addition to comparing with reanalysis-based products, we evaluated MAMnet’s CCN predictions against in situ measurements from the Global Aerosol Synthesis and Science Project (GASSP) <xref ref-type="bibr" rid="bib1.bibx100 bib1.bibx78" id="paren.84"/>. The GASSP dataset compiles aerosol and CCN measurements from 37 field campaigns and over 1000 aircraft flights, primarily concentrated over North America and Western Europe.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and Discussion</title>
      <p id="d2e2244">We evaluated the MAMnet model both for its ability to reproduce the GEOS+MAM7 model when driven by the testing data set and for its ability to reproduce observations when driven by aerosol concentrations derived from the MERRA-2 reanalysis. We assessed whether MAMnet reproduces the spatial distribution of aerosol variables in GEOS+MAM7 using the mean Pearson spatial correlation coefficient (<inline-formula><mml:math id="M64" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>). We also calculated the mean log-bias, i.e.,

          <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M65" display="block"><mml:mrow><mml:mi mathvariant="normal">MLB</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">samples</mml:mi></mml:msub></mml:mrow></mml:msubsup><mml:mfenced open="[" close="]"><mml:mrow><mml:msub><mml:mi mathvariant="normal">log</mml:mi><mml:mn mathvariant="normal">10</mml:mn></mml:msub><mml:mfenced open="(" close=")"><mml:mover accent="true"><mml:mi mathvariant="bold">Y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:mfenced><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="normal">log</mml:mi><mml:mn mathvariant="normal">10</mml:mn></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="bold">Y</mml:mi></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">samples</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math id="M66" display="inline"><mml:mover accent="true"><mml:mi mathvariant="bold">Y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> and <inline-formula><mml:math id="M67" display="inline"><mml:mi mathvariant="bold">Y</mml:mi></mml:math></inline-formula> correspond to the variables predicted by MAMnet and GEOS+MAM7, respectively. In general, an MLB in the range <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> indicates a prediction within an order-of-magnitude window around the target value. These metrics are summarized in Fig. <xref ref-type="fig" rid="F3"/> for each pressure level and output variable.</p>
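      <p>As a minimal sketch of how the two metrics can be computed, the following Python fragment evaluates the mean log-bias of Eq. (4) and a Pearson correlation coefficient for a small set of synthetic values. The function names and the sample values are illustrative assumptions and are not taken from the MAMnet code base.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def mean_log_bias(predicted, target):
    """Mean of log10(predicted) - log10(target) over all samples (Eq. 4)."""
    if len(predicted) != len(target):
        raise ValueError("predicted and target must have the same length")
    diffs = [math.log10(p) - math.log10(t) for p, t in zip(predicted, target)]
    return sum(diffs) / len(diffs)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Synthetic example: every prediction is exactly twice the target value.
target = [1.0e2, 2.0e3, 5.0e4]
predicted = [2.0 * t for t in target]

mlb = mean_log_bias(predicted, target)  # equals log10(2), about 0.30
r = pearson_r(predicted, target)        # perfectly correlated despite the bias
```
]]></preformat>
      <p>In this synthetic case the correlation is perfect while the MLB is about 0.3, which illustrates why the two metrics are reported together: the correlation measures agreement in spatial pattern, whereas the MLB measures the multiplicative offset.</p>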
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Evaluation against GEOS+MAM7</title>
      <p id="d2e2364">We first tested whether MAMnet had learned the physical relationships underlying the ASD or simply memorized the training data, and whether the model is able to conserve mass. To investigate this, we ran MAMnet using MERRA-2 inputs and then recovered the total mass of each species by mapping from the seven modes produced by MAMnet, as depicted in Fig. <xref ref-type="fig" rid="F1"/>. Since aerosol concentrations in GEOS+MAM7 are not assimilated, they are expected to differ from MERRA-2, which incorporates observational constraints. If MAMnet had merely memorized the GEOS+MAM7 outputs, the biases of GEOS+MAM7 relative to MERRA-2 would persist in the recovered mass even when MERRA-2 inputs were used. Such a discrepancy would also indicate that MAMnet does not conserve mass, as the total mass of each species would differ from the inputs.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e2371">Comparison of column-integrated bulk aerosol. From top: sulfates plus ammonium (SA), sea salt (SS), dust (DU), black carbon (BC), and primary plus secondary organic matter (OG). Left panels correspond to MERRA-2, middle panels to the trained MAMnet model applied to MERRA-2 inputs, and right panels to the reserved GEOS+MAM7 test data.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f02.png"/>

        </fig>

      <p id="d2e2380">Figure <xref ref-type="fig" rid="F2"/> compares the total aerosol mass column from MERRA-2 (left), MAMnet driven by MERRA-2 inputs (center), and GEOS+MAM7 (right). It is evident that GEOS+MAM7 tends to underestimate black carbon (BC) and sea salt (SS) over the ocean compared to MERRA-2. If MAMnet had merely memorized the GEOS+MAM7 outputs, these same biases would persist when MERRA-2 inputs were used. Instead, MAMnet accurately reproduces the MERRA-2 aerosol concentrations when driven by MERRA-2 inputs, highlighting the internal consistency of the model and its ability to generalize to new, unseen data. This test also demonstrates that mass is conserved, as the total mass of each aerosol species is recovered with very little bias.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e2388">Pearson's spatial correlation (<inline-formula><mml:math id="M69" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>) (top) and mean log-bias (Eq. <xref ref-type="disp-formula" rid="Ch1.E4"/>; bottom) predicted by MAMnet calculated on the reserved test set, against the GEOS+MAM7 simulation. Results are shown for mass (Tables <xref ref-type="table" rid="T1"/> and <xref ref-type="table" rid="T2"/>) and number (NUM) concentration for each mode, as well as for the derived modal diameter (DPG). Label color on the horizontal axis is added for emphasis.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f03.png"/>

        </fig>

      <p id="d2e2410">Figure <xref ref-type="fig" rid="F3"/> summarizes the performance of MAMnet compared to GEOS+MAM7 across all output number and mass variables, as well as modal size, for all model levels. MAMnet is able to reproduce the modal number concentrations (“NUM” variables in Fig. <xref ref-type="fig" rid="F3"/>) from the GEOS+MAM7 simulations, with high spatial correlations (<inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>) and mean log-bias (MLB) within <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn></mml:mrow></mml:math></inline-formula> across most pressure levels. However, performance degrades slightly at <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> hPa, particularly for the Aitken (NUM_AIT) and coarse dust (NUM_CDU) modes, where correlations drop slightly (<inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn></mml:mrow></mml:math></inline-formula>) and MLB increases to <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.3</mml:mn></mml:mrow></mml:math></inline-formula>. The largest discrepancies occur near the surface (<inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">900</mml:mn></mml:mrow></mml:math></inline-formula> hPa) and in the upper troposphere (150–400 hPa).
Specifically, NUM_AIT shows underprediction between 150–400 hPa while it overpredicts from 700–850 hPa, indicating that MAMnet tends to underestimate fine particles near the tropopause and overestimate them in the mid-to-lower troposphere. Similarly, NUM_PCM exhibits negative biases near 150–400 hPa, suggesting underprediction of particle number in the primary carbon mode at higher altitudes. The MLB patterns (Fig. <xref ref-type="fig" rid="F3"/>, bottom panel) reveal localized biases at specific pressure ranges. Positive MLBs (orange shading) occur in NUM_PCM and NUM_ACC, indicating slight overestimation at mid-to-lower pressure levels. In contrast, NUM_CDU displays small negative MLB (blue shading) around 500–700 hPa, suggesting underprediction of coarse dust particles in the mid-troposphere. In summary, MAMnet captures overall modal number patterns well, but errors remain in the Aitken mode, primary carbon and coarse dust.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e2490">Zonal profiles for modal aerosol number concentration. Left: GEOS+MAM7 reserved test set. Middle: MAMnet prediction. Right: Mean Log-Bias. From top: Accumulation (ACC), Aitken (AIT), coarse dust (CDU), coarse sea salt (CSS), fine dust (FDU), fine sea salt (FSS), primary carbon matter (PCM).</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f04.png"/>

        </fig>

      <p id="d2e2499">Figure <xref ref-type="fig" rid="F4"/> shows the zonal mean profiles of modal aerosol number concentration, comparing GEOS+MAM7 outputs (left column) with MAMnet predictions (center column); the right column shows the mean log-bias. Consistent with Fig. <xref ref-type="fig" rid="F3"/>, the Aitken mode exhibits the largest biases, characterized by underestimation above <inline-formula><mml:math id="M76" display="inline"><mml:mn mathvariant="normal">400</mml:mn></mml:math></inline-formula> hPa and overestimation in the lower troposphere, mostly below <inline-formula><mml:math id="M77" display="inline"><mml:mn mathvariant="normal">700</mml:mn></mml:math></inline-formula> hPa. The vertical distribution of Aitken mode particles differs from that of the other modes: concentrations are higher in the upper troposphere and lower stratosphere (<inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">400</mml:mn></mml:mrow></mml:math></inline-formula> hPa) than at lower altitudes, whereas the other modes exhibit their highest concentrations near the surface. The underestimation in the upper troposphere and overestimation in the lower troposphere may be influenced by a “dilution” effect, as Aitken particles contribute relatively little mass compared to other modes. This may be exacerbated in regions with active particle nucleation processes and combustion emissions, which tend to disproportionately enhance number over mass concentration.</p>
      <p id="d2e2532">The accumulation (ACC), coarse sea salt (CSS), and fine sea salt (FSS) modes show generally strong agreement between true and predicted values, with MLB values close to zero across most pressure levels. However, localized biases are apparent for FSS and PCM near the surface, suggesting slight overestimation in these regions. The coarse dust (CDU) and fine dust (FDU) modes exhibit minimal errors overall, with MLB values near zero across most pressure levels. MAMnet thus accurately predicts the aerosol number concentration for most modes. The systematic biases for the Aitken and primary carbon modes, particularly near the tropopause and in the lower troposphere, suggest that smaller particles are more difficult to predict accurately due to their unique vertical distribution and sparse representation in the data.</p>
      <p id="d2e2536">MAMnet accurately reproduces the spatial patterns of the aerosol mass, with accumulation mode mass variables such as SU_ACC, SS_ACC, and SOA_ACC showing high correlations (<inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>) across the entire pressure range (Fig. <xref ref-type="fig" rid="F3"/>). This is also the case for most other variables, with only DU_FDU and AMM_FSS showing a slight reduction in correlation near 1000 hPa, indicating slightly worse performance in the lower atmosphere. Biases shown in Fig. <xref ref-type="fig" rid="F3"/> (bottom) indicate that all but six mass tracers (SOA_ACC, SU_AIT, SOA_AIT, SU_CSS, SS_A_CSS, AMM_CSS) are systematically overpredicted for <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> hPa, where mass values are very small (<inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> kg kg<sup>−1</sup>). These errors tend to be exacerbated in logarithmic space but remain negligible in absolute terms. Negative biases are also notable for SU_FDU and AMM_FDU, which become increasingly negative towards the surface. This is explained by the low mass of sulfate in the fine dust mode, leading to a “dilution” of sulfate in the fine dust mode relative to other aerosol modes. Overall, the model demonstrates robust predictive skill for most aerosol types, with minor discrepancies concentrated near the surface and in sparse aerosol regimes.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e2598">Modal geometric diameter, <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at 950 hPa. Left: GEOS+MAM7 reserved test set. Middle: MAMnet prediction. Right: Residual (log<sub>10</sub>(MAMnet) <inline-formula><mml:math id="M85" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula> log<sub>10</sub>(GEOS+MAM)). From top: Accumulation (ACC), Aitken (AIT), coarse dust (CDU), coarse sea salt (CSS), fine dust (FDU), fine sea salt (FSS), primary carbon matter (PCM).</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f05.png"/>

        </fig>

      <p id="d2e2643">Differences in the zonal distribution between aerosol modes can contribute to errors, especially due to the uneven representation of smaller particles, such as those in the Aitken and organic modes, in the training dataset, typically referred to as “class imbalance” <xref ref-type="bibr" rid="bib1.bibx44 bib1.bibx20" id="paren.85"/>. For instance, the mass of sulfate particles in the accumulation mode is often at least ten times greater than in the Aitken mode, whereas the opposite is true for the number concentration. As a result, the variability in sulfate mass is primarily driven by the accumulation mode, causing the Aitken mode to be underrepresented in the neural network's input data. Despite this, the residual differences between predicted and true values for Aitken mode aerosol number concentrations are small compared to other modes, and the mean global error remains well below an order of magnitude, highlighting the neural network's  accuracy.</p>
      <p id="d2e2649">Figures <xref ref-type="fig" rid="F5"/> and <xref ref-type="fig" rid="F6"/> show the MAMnet-derived <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in agreement with GEOS+MAM7.  It must be noted that <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is buffered against variation in mass and number concentration (Eq. <xref ref-type="disp-formula" rid="Ch1.E1"/>). This buffering effect holds only if mass and number vary coherently. In practice, MAMnet predicts number concentration as a single output per mode, while mass is distributed across multiple species per mode. Unlike in the physical model, where mass and number are dynamically linked through the governing equations, MAMnet treats them as independent outputs. Therefore, accurate reproduction of <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> by MAMnet is not guaranteed and must be learned implicitly. MLB for <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> when tested against the test set is typically below 0.01, consistent with the high correlation coefficients for DPG (<inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>), shown in Fig. <xref ref-type="fig" rid="F3"/>. 
The fact that MAMnet is able to maintain low bias in <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> without being explicitly constrained to do so suggests that the network has successfully learned a physically consistent relationship between mass and number.</p>
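      <p>The buffering argument can be illustrated with a short Python sketch. It assumes that the diameter follows the standard relation for a lognormal mode, in which the geometric mean diameter scales with the cube root of the ratio of mass to number; this is an assumption about the form of Eq. (1), which is not reproduced in this section. The function name, the particle density, the geometric standard deviation, and the concentrations are hypothetical values chosen only for illustration.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def geometric_mean_diameter(mass, number, density, sigma_g):
    """Number-median diameter (m) of a lognormal mode.

    mass:    modal mass concentration (kg m-3), summed over all species
    number:  modal number concentration (m-3)
    density: particle density (kg m-3)
    sigma_g: geometric standard deviation of the mode (dimensionless)
    """
    mean_volume = mass / (density * number)  # mean particle volume (m3)
    d_volume = (6.0 * mean_volume / math.pi) ** (1.0 / 3.0)
    # Convert the diameter of average volume to the number-median diameter.
    return d_volume * math.exp(-1.5 * math.log(sigma_g) ** 2)


# Hypothetical accumulation-mode values, for illustration only.
mass, number, density, sigma_g = 2.0e-9, 1.0e9, 1770.0, 1.8

d_ref = geometric_mean_diameter(mass, number, density, sigma_g)

# Coherent error: mass and number are both doubled.
d_coherent = geometric_mean_diameter(2.0 * mass, 2.0 * number, density, sigma_g)

# Incoherent error: only number is doubled.
d_number_only = geometric_mean_diameter(mass, 2.0 * number, density, sigma_g)

# Log-bias in diameter caused by a factor-of-two error in number alone.
log_bias = math.log10(d_number_only / d_ref)  # equals -log10(2) / 3
```
]]></preformat>
      <p>Under this assumed relation, doubling mass and number together leaves the diameter unchanged, whereas a factor-of-two error in number alone produces a log-bias of about −0.1 in the diameter. The cube root dampens the error but does not remove it, which is why independent mass and number outputs must be mutually consistent for the diameter to be reproduced.</p>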
      <p id="d2e2728">The global distribution of <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for the different aerosol modes closely matches the spatial patterns of the modal number concentrations. Larger residuals are observed near the surface in the tropics and the Southern Hemisphere, particularly for fine dust (FDU). This discrepancy arises primarily due to the very low number concentrations of fine dust over the oceans, making it challenging for the neural network to accurately predict values close to zero.  Residuals for  <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the coarse dust (CDU) mode are also slightly larger compared to other modes. This is evident in the zonal mean profiles (Fig. <xref ref-type="fig" rid="F6"/>), where biases are most prominent in the tropics and near the Arctic. Additionally, MAMnet tends to underestimate <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the Southern Hemisphere around 30° S, particularly in the free troposphere for fine dust. This underestimation likely results from class imbalance, as dust concentrations in this region are very low, making it difficult for the neural network to learn accurate predictions. It is also possible that MAMnet has learned associations biased toward aerosol-rich environments, which are more prevalent in the Northern Hemisphere.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e2769">As in Fig. <xref ref-type="fig" rid="F4"/>, but for geometric mean diameter (<inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">pg</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) per mode.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Explainable machine learning analysis</title>
      <p id="d2e2799">Shapley values <xref ref-type="bibr" rid="bib1.bibx103" id="paren.86"/>, originally developed in cooperative game theory, are now widely used to interpret predictions from neural networks <xref ref-type="bibr" rid="bib1.bibx55 bib1.bibx45 bib1.bibx47 bib1.bibx62 bib1.bibx60" id="paren.87"/>. A Shapley value quantifies the contribution of a single input feature to a specific model prediction by comparing the prediction for a given sample to the average prediction across all samples. This contribution is averaged over all possible combinations of the remaining input features, referred to as coalitions. Because the number of such combinations grows rapidly with the number of features, we approximate Shapley values using 1000 randomly selected coalitions per calculation, facilitated by the SHAP Python library using the kernel explainer method <xref ref-type="bibr" rid="bib1.bibx61" id="paren.88"/>. In this study, Shapley values are used to assess the influence of each input feature on the predicted aerosol number concentrations for each mode.</p>
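      <p>To make the idea of sampling coalitions concrete, the sketch below estimates Shapley values by averaging marginal contributions over random feature orderings. This permutation-sampling estimator is a simpler stand-in and is not the kernel explainer of the SHAP library used in this study. The function name, the toy linear model, and the background sample are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import random


def shapley_sample(model, x, background, n_permutations=200, seed=0):
    """Estimate the Shapley value of each feature of x by permutation sampling.

    model:      callable mapping a list of feature values to a float
    x:          the sample to explain (list of feature values)
    background: list of reference samples; they supply the values of
                features that are not yet part of the coalition
    """
    rng = random.Random(seed)
    n_features = len(x)
    phi = [0.0] * n_features
    for _ in range(n_permutations):
        order = list(range(n_features))
        rng.shuffle(order)
        current = list(rng.choice(background))  # empty coalition
        previous = model(current)
        for j in order:
            current[j] = x[j]                   # add feature j to the coalition
            value = model(current)
            phi[j] += value - previous          # marginal contribution of j
            previous = value
    return [p / n_permutations for p in phi]


def toy_model(features):
    """Hypothetical linear model used only to check the estimator."""
    a, b, c = features
    return 2.0 * a + 3.0 * b - 1.0 * c


background = [[0.0, 0.0, 0.0]]
sample = [1.0, 1.0, 1.0]
phi = shapley_sample(toy_model, sample, background)
```
]]></preformat>
      <p>For a linear model and a single background sample, each estimated value equals the corresponding coefficient times the feature's departure from the background, and the values sum to the difference between the prediction for the sample and the prediction for the background. For a nonlinear model such as a neural network, the estimate depends on the number of sampled orderings and on the choice of background samples.</p>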
      <p id="d2e2811">Figure <xref ref-type="fig" rid="F7"/> shows a summary plot of Shapley values calculated for each input feature relative to predicted targets for aerosol modal number concentration (left) and mass (right). Each row represents a specific feature. The <inline-formula><mml:math id="M97" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-axis represents SHAP values, indicating the impact (positive or negative) of each feature on the model's prediction, so that the features with larger SHAP values contribute more significantly to the model output. Red dots represent high feature values, while blue dots indicate low feature values.</p>
      <p id="d2e2823">Features such as sulfate (SU), sea salt (SS), dust (DU), temperature (<inline-formula><mml:math id="M98" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>), and air density (AIRD) are consistently ranked as dominant contributors, with their relative importance varying across aerosol modes. Fine-mode outputs, such as ACC and AIT, are strongly influenced by sulfate and temperature, where higher feature values positively impact predictions. In contrast, coarse-mode outputs like CDU and FDU are heavily driven by dust, with significant positive contributions observed for high dust concentrations. Intuitively, this makes sense as SU, SS, and DU are the largest components of accumulation mode aerosols. However, the relation is non-linear, as high SU values correspond to a strong positive impact on aerosol number, particularly for dust (CDU) and sea salt (CSS) modes, whereas low values lead to neutral or negative contributions. The SHAP values further highlight the critical role of air density and temperature for CSS and FSS, which may be related to the aerosol activation processes.</p>
      <p id="d2e2833">For aerosol mass, the SHAP plots show significant influence from DU, SS, BC, and SU. The interplay between feature importance and feature values is evident: high sea salt (SS) concentrations are positively correlated with increased mass in CSS, while low values lead to neutral or negative contributions. One significant characteristic of the mass SHAP plots is the broader range of SHAP values compared to number concentration, indicating greater variability in the importance of input features for predicting mass. For instance, black carbon (BC) has a consistently positive influence on ACC and AIT mass predictions, but its impact is less pronounced for other modes. Additionally, temperature at low values negatively impacts aerosol mass, while at higher values it positively influences mass. In some cases, however, a SHAP value for a feature may have no obviously interpretable significance for the target prediction, or the feature may be so strongly correlated with another that its individual contribution is negligible compared with the feedbacks between two or more features <xref ref-type="bibr" rid="bib1.bibx1" id="paren.89"/>.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e2842">SHAP analysis for aerosol number concentration (left) and total mass (right) across different modes in the troposphere, based on 1000 randomly selected samples from the test set. Modes are displayed from top to bottom: Accumulation (ACC), Aitken (AIT), coarse dust (CDU), coarse sea salt (CSS), fine dust (FDU), fine sea salt (FSS), and primary carbon matter (PCM). The color gradient (red for high values, blue for low values) indicates the relative value of each feature, with features ordered top-to-bottom by their importance to the prediction (most sensitive at the top). The <inline-formula><mml:math id="M99" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-axis represents SHAP values, quantifying how much each feature contributes to deviations from the mean prediction.</p></caption>
          <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f07.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Evaluation against observations</title>
      <p id="d2e2866">The ability of a neural network to generalize to new data is a key measure of its effectiveness and reliability in real-world applications. While the neural network may perform well reproducing simulated data, it is important to test whether MAMnet is able to reproduce patterns observed in nature. To accomplish this, we take advantage of the ability of MAMnet to work with reanalysis data, that is, using as input the assimilated fields of MERRA-2.</p>
      <p id="d2e2869">MERRA-2 includes aerosol mass fields that are constrained by satellite observations through data assimilation <xref ref-type="bibr" rid="bib1.bibx19 bib1.bibx92 bib1.bibx95 bib1.bibx40 bib1.bibx91" id="paren.90"/>, and thus provides a more realistic input compared to free-running model simulations. Although GEOS+MAM7, which was used to train MAMnet, does not assimilate aerosols and cannot be directly compared to observations at specific sites, it provides physically consistent mass and number concentrations from which the network learns the relationship between these quantities. When applied to MERRA-2, MAMnet combines this learned relationship with more observation-constrained aerosol mass fields, allowing us to evaluate how well it maintains physical consistency in a more realistic setting. This comparison does not validate MAMnet independently of its training data but serves to assess its performance when driven by the best available mass estimates.</p>
<sec id="Ch1.S3.SS3.SSS1">
  <label>3.3.1</label><title>Comparison against ground observations</title>
      <p id="d2e2882">Figure <xref ref-type="fig" rid="F8"/> compares the cumulative ASD predicted by MAMnet against surface observations from different European sites. These are mostly  coastal sites with composition typical of clean and polluted continental origin, that is, mostly of sulfates, dust, organics and sea salt <xref ref-type="bibr" rid="bib1.bibx7" id="paren.91"/>. Altitude ranges from a few meters to about <inline-formula><mml:math id="M100" display="inline"><mml:mn mathvariant="normal">3</mml:mn></mml:math></inline-formula> km providing a good overview of the lower troposphere. Although representing a limited set, the range of aerosol compositions, sources, and altitudes offers a meaningful assessment of the model's ability to generalize to different atmospheric states. To carry out the comparison, MAMnet was run using collocated aerosol concentrations and meteorological fields obtained from MERRA-2 at each site, and using Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>). MAMnet results represent the average of the two lowermost model layers (roughly <inline-formula><mml:math id="M101" display="inline"><mml:mn mathvariant="normal">200</mml:mn></mml:math></inline-formula> m above the surface).</p>
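      <p>A cumulative number concentration above a size threshold can be obtained from modal parameters as sketched below. The sketch assumes that each mode is lognormal and sums the complementary cumulative distribution of every mode; whether this matches Eq. (3) term by term is an assumption, since that equation is not reproduced in this section. The function name and the modal parameters are hypothetical, and the exact size-bin definitions (for example any upper cut-off for N100 or N250) are those of the cited measurements.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def number_above(diameter, modes):
    """Number concentration of particles larger than `diameter`.

    modes: iterable of (number, d_pg, sigma_g) tuples, one per lognormal
           mode, with d_pg in the same units as `diameter`.
    """
    total = 0.0
    for number, d_pg, sigma_g in modes:
        z = math.log(diameter / d_pg) / (math.sqrt(2.0) * math.log(sigma_g))
        total += 0.5 * number * math.erfc(z)  # fraction of the mode above diameter
    return total


# Hypothetical two-mode population: (number in cm-3, d_pg in nm, sigma_g).
modes = [(1000.0, 100.0, 1.8), (500.0, 30.0, 1.6)]

n100 = number_above(100.0, modes)  # particles larger than 100 nm
n250 = number_above(250.0, modes)  # particles larger than 250 nm
```
]]></preformat>
      <p>Because the threshold for the larger bin lies further into the tail of each mode, the result is more sensitive to errors in the modal diameter and width than the result for smaller thresholds.</p>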

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e2908">Cumulative size distribution comparison of the trained MAMnet model applied to MERRA-2 inputs against surface measurements <xref ref-type="bibr" rid="bib1.bibx7" id="paren.92"/>. The sites in the bottom row (PDD, SCH, ZSF, JFJ, BEO, CMN) are characterized as high altitude sites, with altitudes between 1200 and 3600 m a.s.l.</p></caption>
            <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f08.png"/>

          </fig>

      <p id="d2e2920">Except for high altitude sites (Fig. <xref ref-type="fig" rid="F8"/>, bottom row), MAMnet tends to predict slightly lower median values compared to observations, with this discrepancy becoming more pronounced as particle size increases (N100 and N250). This is particularly noticeable at locations such as PAL, PLA, OBK, MHD, FKL, and JFJ, where the model underestimates values consistently. The pattern is reversed at high-altitude sites (PDD, SCH, ZSF, JFJ, BEO, and CMN), where median N100 and N250 are generally overestimated by MAMnet, although the observations themselves display significant variability. Some locations like SMR, WAL, CBW, and SCH exhibit better agreement, with overlapping medians and interquartile ranges. Additionally, the spread of values for MAMnet is typically narrower than for observations, indicating that the model underestimates variability. Observations also show more outliers, whereas MAMnet predictions are more constrained. There are a few exceptions where MAMnet slightly overestimates values, such as VHL (N250) and SMR (N100). Overall, MAMnet-predicted particle concentrations exhibit a systematic bias, particularly for larger size bins, and capture less variability than the observations.</p>
      <p id="d2e2926">Errors in the estimated ASD may originate from the training data (GEOS+MAM7) or from the MERRA-2 fields used as input to MAMnet. To assess the potential impact of the training data, we collocated the average ASD at each site using the GEOS+MAM7 test dataset. This analysis is intended to evaluate whether GEOS+MAM7 exhibits, on average, the same biases shown in Fig. <xref ref-type="fig" rid="F8"/>, rather than assessing point-by-point predictions as done for MAMnet. Figure <xref ref-type="fig" rid="F9"/> shows that GEOS+MAM7 and MAMnet indeed display similar biases relative to the observations, suggesting that errors in the training data are partially inherited by MAMnet. However, a comparison of Figs. <xref ref-type="fig" rid="F8"/> and <xref ref-type="fig" rid="F9"/> also indicates that the use of MERRA-2 inputs tends to reduce variability in the ASD. For each size bin, the interquartile range is substantially narrower in the MAMnet results. This effect is particularly pronounced at high-altitude sites (bottom row in both figures).</p>
      <p id="d2e2937">MERRA-2 may not resolve local emissions, terrain, or small-scale meteorology, such as boundary layer height and humidity. This can lead to biases, particularly in the larger particle size categories that depend on aerosol growth processes. Moreover, although the training data represent ASD variability better, they lack sufficient diversity: remote and high-altitude sites in particular are underrepresented in the training set. In addition, aerosol evolution involves complex, nonlinear interactions that are not explicitly modeled by MAMnet. These factors likely contribute to the model's challenges in capturing the magnitude and variability of aerosol concentrations observed in the real world. It is also important to note that retrievals of ASD are inherently complex, and experimental errors can be significant, particularly for larger particle sizes <xref ref-type="bibr" rid="bib1.bibx7" id="paren.93"/>. Nevertheless, the consistent results across many sites indicate that MAMnet is capable of reasonably capturing the ASD on regional scales when driven by reanalysis data.</p>
</sec>
<sec id="Ch1.S3.SS3.SSS2">
  <label>3.3.2</label><title>Comparison against global CCN datasets</title>
      <p id="d2e2951">Figure <xref ref-type="fig" rid="F10"/> illustrates the global mean distribution of cloud condensation nuclei (CCN) at <inline-formula><mml:math id="M102" display="inline"><mml:mn mathvariant="normal">0.2</mml:mn></mml:math></inline-formula> % supersaturation at 900 hPa, derived from MAMnet driven by MERRA-2 (shown as MAMnet-MERRA2), GiOcean <xref ref-type="bibr" rid="bib1.bibx87" id="paren.94"/>, the CAMS aerosol reanalysis <xref ref-type="bibr" rid="bib1.bibx17" id="paren.95"/>, and CALIOP satellite retrievals <xref ref-type="bibr" rid="bib1.bibx26" id="paren.96"/>. Data were averaged over the period 2006–2021. On average, all datasets display lower CCN concentrations over oceans, particularly in polar regions, and higher concentrations over land, particularly in central and eastern Asia, Europe, and the Americas. However, large differences in absolute values are evident, with MAMnet-MERRA2 consistently showing the lowest <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and the CALIOP-derived product the highest. These discrepancies likely arise from differing assumptions in estimation methods. GiOcean and CAMS estimate CCN based on aerosol mass, prescribing the ASD and assuming externally-mixed aerosols, which may double-count CCN as organics and sulfates are typically internally mixed <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx51" id="paren.97"/>. MAMnet-MERRA2 avoids this issue but underpredicts <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> over oceanic regions, likely due to low sea salt concentrations in MERRA-2, stemming from uncertainties in the aerosol assimilation system <xref ref-type="bibr" rid="bib1.bibx19" id="paren.98"/>.
In contrast, CALIOP may overestimate <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> because it treats all soluble aerosols larger than <inline-formula><mml:math id="M106" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> nm as CCN, some of which may not activate at <inline-formula><mml:math id="M107" display="inline"><mml:mn mathvariant="normal">0.2</mml:mn></mml:math></inline-formula> % supersaturation.</p>

      <fig id="F9" specific-use="star"><label>Figure 9</label><caption><p id="d2e3029">Comparison of the average cumulative size distribution from the GEOS+MAM7 test data against surface measurements <xref ref-type="bibr" rid="bib1.bibx7" id="paren.99"/>. The sites in the bottom row (PDD, SCH, ZSF, JFJ, BEO, CMN) are characterized as high-altitude sites, with altitudes between 1200 and 3600 m a.s.l.</p></caption>
            <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f09.png"/>

          </fig>

      <p id="d2e3041">Figure <xref ref-type="fig" rid="F11"/> compares global mean vertical profiles of <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from the datasets in Fig. <xref ref-type="fig" rid="F10"/> and from in situ observations <xref ref-type="bibr" rid="bib1.bibx100" id="paren.100"/>. The vertical distributions differ markedly among datasets. GiOcean, CALIOP, and in situ profiles exhibit similar shapes, with peak concentrations around 950 hPa, whereas the MAMnet-MERRA2 and CAMS profiles decrease monotonically with altitude. The peak in <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at 950 hPa may result from more efficient aerosol scavenging near the surface, which is better represented by the two-moment cloud microphysics in GiOcean <xref ref-type="bibr" rid="bib1.bibx87 bib1.bibx9" id="paren.101"/>. In contrast, MAMnet-MERRA2 and CAMS rely on single-moment cloud microphysics, which may explain the smoother decrease in <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> with height; this more gradual decline in <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">CCN</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> relative to the other data sets is most noticeable over the ocean. In the free troposphere, MAMnet-MERRA2 aligns more closely with GiOcean, CALIOP, and the in situ data.</p>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e3102">MAMnet-derived CCN at <inline-formula><mml:math id="M112" display="inline"><mml:mn mathvariant="normal">0.2</mml:mn></mml:math></inline-formula> % supersaturation at 900 hPa using the MERRA-2 reanalysis (MAMnet-MERRA2) against global CCN datasets. Also shown are results from the GiOcean reanalysis <xref ref-type="bibr" rid="bib1.bibx87" id="paren.102"/>, CALIOP- <xref ref-type="bibr" rid="bib1.bibx26" id="paren.103"/>, and CAMS- <xref ref-type="bibr" rid="bib1.bibx17" id="paren.104"/> derived CCN.</p></caption>
            <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f10.png"/>

          </fig>

      <fig id="F11" specific-use="star"><label>Figure 11</label><caption><p id="d2e3129">Annual mean profile of CCN concentration derived from MERRA2 using MAMnet (red). Also shown are CALIOP-derived CCN <xref ref-type="bibr" rid="bib1.bibx26" id="paren.105"><named-content content-type="pre">magenta; </named-content></xref>, the GiOcean reanalysis <xref ref-type="bibr" rid="bib1.bibx87" id="paren.106"><named-content content-type="pre">blue; </named-content></xref>, CCN derived from field campaign data around the globe <xref ref-type="bibr" rid="bib1.bibx100" id="paren.107"><named-content content-type="pre">green; </named-content></xref>, and CAMS-derived CCN <xref ref-type="bibr" rid="bib1.bibx17" id="paren.108"><named-content content-type="pre">black; </named-content></xref>.</p></caption>
            <graphic xlink:href="https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f11.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d2e3168">This study develops a neural network, termed MAMnet, to predict the aerosol size distribution and mixing state using as input the bulk mass of different aerosol species, temperature and density. MAMnet is intended to enable a better estimation of the ASD and the aerosol physicochemical properties in cases where computational cost prevents the use of two- and higher-moment aerosol microphysics schemes, for instance in weather forecasting, or where limited information is available to constrain the ASD, as in remote sensing and data assimilation. The neural network was optimized for performance taking into account the model architecture, training parameters and the rank of the data used as input.</p>
      <p id="d2e3171">MAMnet is designed to reproduce the mapping from aerosol mass to size distribution as generated by the MAM7 scheme within the GEOS system. As such, it inherits the assumptions and meteorological context of the training simulations. However, through a comprehensive evaluation of the NN model against simulated data and observations, we demonstrated that MAMnet is a robust and accurate model over a wide set of conditions. Importantly, MAMnet reproduces MERRA-2 aerosol concentrations when driven by MERRA-2 inputs, demonstrating that it has learned physical relationships, including conservation of the total aerosol mass, rather than memorizing the training data. Explainable machine learning analysis showed that MAMnet identifies key physical drivers and the non-linear behavior governing the aerosol distribution across fine and coarse scales.</p>
      <p id="d2e3174">Comparison of MAMnet predictions against a reference dataset from GEOS+MAM7 simulations showed good agreement, with log-mean residuals typically below <inline-formula><mml:math id="M113" display="inline"><mml:mn mathvariant="normal">0.1</mml:mn></mml:math></inline-formula> and spatial correlation typically exceeding <inline-formula><mml:math id="M114" display="inline"><mml:mn mathvariant="normal">0.9</mml:mn></mml:math></inline-formula> for all aerosol modes. The greatest discrepancies were observed near the surface and in regions with low aerosol concentrations, i.e., fine dust over oceans and coarse dust in the Southern Hemisphere free troposphere. These discrepancies are primarily attributed to challenges in the prediction of concentrations near zero and to class imbalance. Notably, biases in number and mass concentrations do not significantly influence the prediction of geometric mean diameter, indicating that MAMnet captures the physical consistency between aerosol mass and number, inherently conserving mass.</p>
      <p id="d2e3191">We took advantage of the fact that MAMnet can be driven by reanalysis output to evaluate its performance against observations. When driven using collocated MERRA-2 fields, MAMnet reasonably reproduced the measured aerosol size distribution at different ground observation sites, representing a variety of aerosol composition, origin and meteorological conditions. The median values of the predicted number concentrations were generally consistent with observations. However, the range of values predicted by MAMnet is in general smaller than observed, indicating that the model underestimates variability. It is likely that coarse reanalysis inputs, limited training data diversity, and the complexities of aerosol evolution, not modeled explicitly by MAMnet, contributed to the observed discrepancies.</p>
      <p id="d2e3195">CCN concentrations derived from MAMnet using the MERRA-2 dataset were within the range of reported values but showed discrepancies near the surface and in regions with high variability, which may originate from uncertainty in the MERRA-2 aerosol fields. As MAMnet reproduces the training dataset well, it is likely that the biases against observations result from biases in the input and from complex physics not modeled by MAM7. At this point it is difficult to attribute the sources of error in MAMnet explicitly, and this is left for future research. Despite such biases, the comparison against observations indicates that MAMnet is able to capture the aerosol size distribution on regional and global scales.</p>
      <p id="d2e3198">Strategies to address the remaining biases include applying physical constraints via transfer learning, as well as including observational data during the training process <xref ref-type="bibr" rid="bib1.bibx10" id="paren.109"/>. Class imbalance can potentially be addressed by reconfiguring the NN such that each mode is predicted by a separate layer, or even by individual NNs. The latter option is less desirable because it would require developing, constraining, and maintaining multiple NNs as opposed to one. Future work will focus on applying MAMnet to elucidate long-term trends in the ASD as well as on its implementation in GCMs <xref ref-type="bibr" rid="bib1.bibx73" id="paren.110"/>. MAMnet is designed to be resolution-agnostic, but the relationship between aerosol mass and size distribution may vary with model resolution, and this dependence should be explored in future work. Including additional input variables, such as gaseous species or solar radiation, may make MAMnet predictions more physically interpretable. Nevertheless, the model developed here provides a versatile foundation to improve the physical representation of aerosols in weather forecasting, remote sensing and data assimilation, potentially enhancing our understanding of their role in the climate system.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Network Training and Optimization</title>
      <p id="d2e3219">The MAMnet model was trained using the Keras library with the TensorFlow backend <xref ref-type="bibr" rid="bib1.bibx25" id="paren.111"/>. Optimization was carried out with the Adam algorithm <xref ref-type="bibr" rid="bib1.bibx50" id="paren.112"/> using the mean square error (MSE) as the loss function, with no additional constraints. Hyperparameter optimization for MAMnet was performed using the Keras Tuner software <xref ref-type="bibr" rid="bib1.bibx72" id="paren.113"/>. Approximately 1500 optimization trials were performed using random configurations of the hyperparameters in Table <xref ref-type="table" rid="TA1"/>, using a subset of the training data, as in <xref ref-type="bibr" rid="bib1.bibx104" id="text.114"/>. All trials used the same subset of the training/validation data (5 output files for training, 2 for validation). For each parameter set, a new model was built and trained for up to 100 epochs with the same early stopping criteria used during the training of MAMnet. For each trial, a custom metric, the convergence loss (<inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi mathvariant="normal">conv</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), defined as the absolute difference between the training and validation losses, was recorded at the end of each epoch. This custom metric allowed us to select the model that generalizes best across the training and validation data sets. The best set of hyperparameters was selected as the configuration that minimized the MSE on the validation set and had the lowest <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi mathvariant="normal">conv</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
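      <p>The selection rule described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the tuning code used for MAMnet. The <monospace>Trial</monospace> container, the <monospace>select_best</monospace> helper, the tolerance <monospace>rel_tol</monospace> and the loss values are all hypothetical. The text does not specify how the validation MSE and the convergence loss were combined, so the sketch assumes that trials are first shortlisted by validation MSE and then ranked by convergence loss.</p>
      <preformat preformat-type="code">
```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trial:
    """Final losses of one tuning trial (hypothetical container)."""
    name: str
    train_mse: float
    val_mse: float

    @property
    def conv_loss(self):
        # Convergence loss: absolute difference between the training
        # and validation losses, as defined in the text.
        return abs(self.train_mse - self.val_mse)


def select_best(trials, rel_tol=0.05):
    """Return the shortlisted trial with the lowest convergence loss.

    Assumed rule: keep trials whose validation MSE lies within rel_tol
    (relative) of the best validation MSE, then minimize conv_loss.
    """
    best_val = min(t.val_mse for t in trials)
    max_val = best_val * (1.0 + rel_tol)
    shortlist = [t for t in trials if max_val >= t.val_mse]
    return min(shortlist, key=lambda t: t.conv_loss)


# Made-up losses for illustration only.
trials = [
    Trial("trial_a", train_mse=0.098, val_mse=0.100),
    Trial("trial_b", train_mse=0.020, val_mse=0.099),
    Trial("trial_c", train_mse=0.090, val_mse=0.300),
]
print(select_best(trials).name)  # prints "trial_a"
```
      </preformat>
      <p>In this made-up example, <monospace>trial_b</monospace> has a marginally lower validation MSE but a much larger gap between training and validation losses, so <monospace>trial_a</monospace> is preferred. Setting <monospace>rel_tol</monospace> to zero reduces the rule to a plain minimum of the validation MSE.</p>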

<table-wrap id="TA1"><label>Table A1</label><caption><p id="d2e3262">Parameters used during hyperparameter tuning for MAMnet. Optimal hyperparameters are shown in bold.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Hyperparameter</oasis:entry>
         <oasis:entry colname="col2">Values Interrogated</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Number of dense layers</oasis:entry>
         <oasis:entry colname="col2">1, 2, 3, 4, 5, 6, <bold>7</bold>, 8, 9, 10, 15, 20</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Number of nodes per layer</oasis:entry>
         <oasis:entry colname="col2">32, 64, 128, <bold>256</bold>, 512</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Batch normalization</oasis:entry>
         <oasis:entry colname="col2">True, <bold>False</bold></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Dropout</oasis:entry>
         <oasis:entry colname="col2"><bold>True</bold>, False</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Dropout rate</oasis:entry>
         <oasis:entry colname="col2"><bold>0.1</bold>, 0.2, 0.3, 0.5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Initial learning rate</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:mn mathvariant="bold">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="bold">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="bold">5</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>,</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Activation function</oasis:entry>
         <oasis:entry colname="col2">ReLU, ELU, <bold>Leaky ReLU</bold></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Activation <inline-formula><mml:math id="M121" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.1, 0.2, <bold>0.3</bold></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Optimizer</oasis:entry>
         <oasis:entry colname="col2"><bold>Adam</bold>, SGD, RMSprop</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Batch size</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mn mathvariant="bold">64</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="bold">72</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:mn mathvariant="normal">128</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">72</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:mn mathvariant="normal">256</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">72</mml:mn></mml:mrow></mml:math></inline-formula>,</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:mn mathvariant="normal">512</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">72</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>
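
      <p>The search space in Table <xref ref-type="table" rid="TA1"/> can also be written as a small Python dictionary, which makes the size of the search explicit. The sketch below is illustrative only: the key names and the <monospace>sample_configuration</monospace> helper are hypothetical, the random seed is arbitrary, and independent uniform sampling of each hyperparameter is an assumption rather than a description of the internals of Keras Tuner.</p>
      <preformat preformat-type="code">
```python
import math
import random

# Candidate values transcribed from Table A1; the key names are hypothetical.
SEARCH_SPACE = {
    "n_dense_layers": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20],
    "nodes_per_layer": [32, 64, 128, 256, 512],
    "batch_normalization": [True, False],
    "dropout": [True, False],
    "dropout_rate": [0.1, 0.2, 0.3, 0.5],
    "initial_learning_rate": [1e-3, 1e-4, 1e-5, 1e-6],
    "activation": ["ReLU", "ELU", "Leaky ReLU"],
    "activation_alpha": [0.1, 0.2, 0.3],
    "optimizer": ["Adam", "SGD", "RMSprop"],
    "batch_size": [64 * 72, 128 * 72, 256 * 72, 512 * 72],
}

# Values shown in bold in Table A1.
OPTIMAL = {
    "n_dense_layers": 7,
    "nodes_per_layer": 256,
    "batch_normalization": False,
    "dropout": True,
    "dropout_rate": 0.1,
    "initial_learning_rate": 1e-5,
    "activation": "Leaky ReLU",
    "activation_alpha": 0.3,
    "optimizer": "Adam",
    "batch_size": 64 * 72,
}


def sample_configuration(rng):
    """Draw one configuration, sampling each hyperparameter uniformly."""
    return {name: rng.choice(values) for name, values in SEARCH_SPACE.items()}


# Nominal size of the full grid, counting every combination as distinct.
n_grid = math.prod(len(values) for values in SEARCH_SPACE.values())
print(n_grid)  # prints 414720

rng = random.Random(0)  # arbitrary seed, for reproducibility only
configs = [sample_configuration(rng) for _ in range(1500)]
```
      </preformat>
      <p>Under these assumptions, roughly 1500 random trials cover well under 1 % of the 414 720 nominal combinations. The nominal count is an upper bound, because some entries have no effect in certain configurations (for example, the dropout rate when dropout is disabled).</p>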

</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e3539">The MERRA-2 reanalysis is publicly available from <ext-link xlink:href="https://doi.org/10.5067/WWQSXQ8IVFW8" ext-link-type="DOI">10.5067/WWQSXQ8IVFW8</ext-link> <xref ref-type="bibr" rid="bib1.bibx36" id="paren.115"/>. The MAMnet model, the code, and the training and test data used in this work can be downloaded at <ext-link xlink:href="https://doi.org/10.5281/zenodo.15190121" ext-link-type="DOI">10.5281/zenodo.15190121</ext-link> <xref ref-type="bibr" rid="bib1.bibx8" id="paren.116"/>. CAMS data were obtained from <ext-link xlink:href="https://doi.org/10.26050/WDCC/QUAERERE_CCNCAMS_v1" ext-link-type="DOI">10.26050/WDCC/QUAERERE_CCNCAMS_v1</ext-link> <xref ref-type="bibr" rid="bib1.bibx16" id="paren.117"/>. Data from the GiOcean reanalysis were downloaded from <uri>https://portal.nccs.nasa.gov/datashare/gmao/GiOCEAN/</uri> <xref ref-type="bibr" rid="bib1.bibx37" id="paren.118"/>. CALIOP data were obtained from <ext-link xlink:href="https://doi.org/10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20" ext-link-type="DOI">10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20</ext-link> <xref ref-type="bibr" rid="bib1.bibx21" id="paren.119"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e3583">DB conceived and directed the work. KHB co-developed the neural network model. AD implemented the MAM model within GEOS. KB provided CCN data for comparison.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e3590">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e3596">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e3602">Resources supporting this work were provided by the NASA High-End Computing (HEC) Program through the NASA Center for Climate Simulation (NCCS) at Goddard Space Flight Center. Keras and TensorFlow libraries were obtained from <uri>https://keras.io/</uri> (last access: 23 March 2026). Maps were created using the NCAR Command Language <xref ref-type="bibr" rid="bib1.bibx70" id="paren.121"/>. The SHAP Python package was used to conduct the explainable machine learning analysis as described in <uri>https://shap.readthedocs.io/en/latest/</uri> (last access: 23 March 2026). This work was supported by the NASA Modeling, Analysis and Prediction program, Grant NNH20ZDA001N-MAP.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e3616">This research has been supported by the National Aeronautics and Space Administration (grant no. NNH20ZDA001N-MAP).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e3622">This paper was edited by Slimane Bekki and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Aas et al.(2021)Aas, Jullum, and Løland</label><mixed-citation>Aas, K., Jullum, M., and Løland, A.: Explaining individual predictions when features are dependent: More accurate approximations to Shapley values, Artif. Intell., 298, 103502, <ext-link xlink:href="https://doi.org/10.1016/j.artint.2021.103502" ext-link-type="DOI">10.1016/j.artint.2021.103502</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Adachi and Buseck(2008)</label><mixed-citation>Adachi, K. and Buseck, P. R.: Internally mixed soot, sulfates, and organic matter in aerosol particles from Mexico City, Atmos. Chem. Phys., 8, 6469–6481, <ext-link xlink:href="https://doi.org/10.5194/acp-8-6469-2008" ext-link-type="DOI">10.5194/acp-8-6469-2008</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Adams and Seinfeld(2002)</label><mixed-citation>Adams, P. J. and Seinfeld, J. H.: Predicting global aerosol size distributions in general circulation models, J. Geophys. Res.-Atmos., 107, <ext-link xlink:href="https://doi.org/10.1029/2001JD001010" ext-link-type="DOI">10.1029/2001JD001010</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Amunsen et al.(1992)Amunsen, Hanssen, Semb, and Steinnes</label><mixed-citation>Amunsen, C., Hanssen, J., Semb, A., and Steinnes, E.: Long-range atmospheric transport of trace elements to southern Norway, Atmos. Environ. A Gen., 26, 1309–1324, <ext-link xlink:href="https://doi.org/10.1016/0960-1686(92)90391-W" ext-link-type="DOI">10.1016/0960-1686(92)90391-W</ext-link>, 1992.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Aquila et al.(2011)Aquila, Hendricks, Lauer, Riemer, Vogel, Baumgardner, Minikin, Petzold, Schwarz, Spackman et al.</label><mixed-citation>Aquila, V., Hendricks, J., Lauer, A., Riemer, N., Vogel, H., Baumgardner, D., Minikin, A., Petzold, A., Schwarz, J. P., Spackman, J. R., Weinzierl, B., Righi, M., and Dall'Amico, M.: MADE-in: a new aerosol microphysics submodel for global simulation of insoluble particles and their mixing state, Geosci. Model Dev., 4, 325–355, <ext-link xlink:href="https://doi.org/10.5194/gmd-4-325-2011" ext-link-type="DOI">10.5194/gmd-4-325-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Arfin et al.(2023)Arfin, Pillai, Mathew, Tirpude, Bang, and Mondal</label><mixed-citation>Arfin, T., Pillai, A. M., Mathew, N., Tirpude, A., Bang, R., and Mondal, P.: An overview of atmospheric aerosol and their effects on human health, Environ. Sci. Pollut. R., 30, 125347–125369, <ext-link xlink:href="https://doi.org/10.1007/s11356-023-29652-w" ext-link-type="DOI">10.1007/s11356-023-29652-w</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Asmi et al.(2011)Asmi, Wiedensohler, Laj, Fjaeraa, Sellegri, Birmili, Weingartner, Baltensperger, Zdimal, Zikova et al.</label><mixed-citation>Asmi, A., Wiedensohler, A., Laj, P., Fjaeraa, A.-M., Sellegri, K., Birmili, W., Weingartner, E., Baltensperger, U., Zdimal, V., Zikova, N., Putaud, J.-P., Marinoni, A., Tunved, P., Hansson, H.-C., Fiebig, M., Kivekäs, N., Lihavainen, H., Asmi, E., Ulevicius, V., Aalto, P. P., Swietlicki, E., Kristensson, A., Mihalopoulos, N., Kalivitis, N., Kalapov, I., Kiss, G., de Leeuw, G., Henzing, B., Harrison, R. M., Beddows, D., O'Dowd, C., Jennings, S. G., Flentje, H., Weinhold, K., Meinhardt, F., Ries, L., and Kulmala, M.: Number size distributions and seasonality of submicron particles in Europe 2008–2009, Atmos. Chem. Phys., 11, 5505–5538, <ext-link xlink:href="https://doi.org/10.5194/acp-11-5505-2011" ext-link-type="DOI">10.5194/acp-11-5505-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Barahona and Breen(2025)</label><mixed-citation>Barahona, D. and Breen, K.: MAMnet, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.15190121" ext-link-type="DOI">10.5281/zenodo.15190121</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Barahona et al.(2014)Barahona, Molod, Bacmeister, Nenes, Gettelman, Morrison, Phillips, and Eichmann</label><mixed-citation>Barahona, D., Molod, A., Bacmeister, J., Nenes, A., Gettelman, A., Morrison, H., Phillips, V., and Eichmann, A.: Development of two-moment cloud microphysics for liquid and ice within the NASA Goddard Earth Observing System Model (GEOS-5), Geosci. Model Dev., 7, 1733–1766, <ext-link xlink:href="https://doi.org/10.5194/gmd-7-1733-2014" ext-link-type="DOI">10.5194/gmd-7-1733-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Barahona et al.(2024)Barahona, Breen, Kalesse-Los, and Röttenbacher</label><mixed-citation>Barahona, D., Breen, K. H., Kalesse-Los, H., and Röttenbacher, J.: Deep Learning Parameterization of Vertical Wind Velocity Variability via Constrained Adversarial Training, Artificial Intelligence for the Earth Systems, 3, e230025, <ext-link xlink:href="https://doi.org/10.1175/AIES-D-23-0025.1" ext-link-type="DOI">10.1175/AIES-D-23-0025.1</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Bender(2020)</label><mixed-citation>Bender, F. A.-M.: Aerosol forcing: Still uncertain, still relevant, AGU Advances, 1, e2019AV000128, <ext-link xlink:href="https://doi.org/10.1029/2019AV000128" ext-link-type="DOI">10.1029/2019AV000128</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Bender et al.(2019)Bender, Frey, McCoy, Grosvenor, and Mohrmann</label><mixed-citation>Bender, F. A.-M., Frey, L., McCoy, D. T., Grosvenor, D. P., and Mohrmann, J. K.: Assessment of aerosol–cloud–radiation correlations in satellite observations, climate models and reanalysis, Clim. Dynam., 52, 4371–4392, <ext-link xlink:href="https://doi.org/10.1007/s00382-018-4384-z" ext-link-type="DOI">10.1007/s00382-018-4384-z</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Bengio et al.(2017)Bengio, Goodfellow, and Courville</label><mixed-citation> Bengio, Y., Goodfellow, I., and Courville, A.: Deep learning, vol. 1, MIT Press Cambridge, MA, USA, ISBN 978-0262035613, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Birmili et al.(2003)Birmili, Berresheim, Plass-Dülmer, Elste, Gilge, Wiedensohler, and Uhrner</label><mixed-citation>Birmili, W., Berresheim, H., Plass-Dülmer, C., Elste, T., Gilge, S., Wiedensohler, A., and Uhrner, U.: The Hohenpeissenberg aerosol formation experiment (HAFEX): a long-term study including size-resolved aerosol, H<sub>2</sub>SO<sub>4</sub>, OH, and monoterpenes measurements, Atmos. Chem. Phys., 3, 361–376, <ext-link xlink:href="https://doi.org/10.5194/acp-3-361-2003" ext-link-type="DOI">10.5194/acp-3-361-2003</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Birmili et al.(2009)Birmili, Weinhold, Nordmann, Wiedensohler, Spindler, Müller, Herrmann, Gnauk, Pitz, Cyrys et al.</label><mixed-citation> Birmili, W., Weinhold, K., Nordmann, S., Wiedensohler, A., Spindler, G., Müller, K., Herrmann, H., Gnauk, T., Pitz, M., Cyrys, J., and Flentje, H.: Atmospheric aerosol measurements in the German ultrafine aerosol network (GUAN), Gefahrst. Reinhalt. L, 69, 137–145, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Block(2023)</label><mixed-citation>Block, K.: Cloud condensation nuclei (CCN) numbers derived from CAMS reanalysis EAC4 (Version 1), World Data Center for Climate (WDCC) at DKRZ [data set], <ext-link xlink:href="https://doi.org/10.26050/WDCC/QUAERERE_CCNCAMS_v1" ext-link-type="DOI">10.26050/WDCC/QUAERERE_CCNCAMS_v1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Block et al.(2024)Block, Haghighatnasab, Partridge, Stier, and Quaas</label><mixed-citation>Block, K., Haghighatnasab, M., Partridge, D. G., Stier, P., and Quaas, J.: Cloud condensation nuclei concentrations derived from the CAMS reanalysis, Earth Syst. Sci. Data, 16, 443–470, <ext-link xlink:href="https://doi.org/10.5194/essd-16-443-2024" ext-link-type="DOI">10.5194/essd-16-443-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Brenowitz and Bretherton(2019)</label><mixed-citation>Brenowitz, N. D. and Bretherton, C. S.: Spatially extended tests of a neural network parametrization trained by coarse-graining, J. Adv. Model. Earth Sy., 11, 2728–2744, <ext-link xlink:href="https://doi.org/10.1029/2019MS001711" ext-link-type="DOI">10.1029/2019MS001711</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Buchard et al.(2017)Buchard, Randles, Da Silva, Darmenov, Colarco, Govindaraju, Ferrare, Hair, Beyersdorf, Ziemba et al.</label><mixed-citation>Buchard, V., Randles, C., Da Silva, A., Darmenov, A., Colarco, P., Govindaraju, R., Ferrare, R., Hair, J., Beyersdorf, A., Ziemba, L., and Yu, H.: The MERRA-2 aerosol reanalysis, 1980 onward. Part II: Evaluation and case studies, J. Climate, 30, 6851–6872, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-16-0613.1" ext-link-type="DOI">10.1175/JCLI-D-16-0613.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Buda et al.(2018)Buda, Maki, and Mazurowski</label><mixed-citation>Buda, M., Maki, A., and Mazurowski, M. A.: A systematic study of the class imbalance problem in convolutional neural networks, Neural Networks, 106, 249–259, <ext-link xlink:href="https://doi.org/10.1016/j.neunet.2018.07.011" ext-link-type="DOI">10.1016/j.neunet.2018.07.011</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>CALIPSO(2023)</label><mixed-citation>CALIPSO: Cloud–Aerosol Lidar and Infrared Pathfinder Satellite Observation Lidar Level 2 Aerosol Profile V4-20, NASA [data set], <ext-link xlink:href="https://doi.org/10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20" ext-link-type="DOI">10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Červenkova and Váňa(2010)</label><mixed-citation>Červenkova, J. and Váňa, M.: Trend Assessment of deposition, throughfall and runoff water chemistry at the ICP-IM station Kosetice, Czech Republic, IAHS-AISH Publication, 336, 103–108, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Charron et al.(2007)Charron, Birmili, and Harrison</label><mixed-citation>Charron, A., Birmili, W., and Harrison, R. M.: Factors influencing new particle formation at the rural site, Harwell, United Kingdom, J. Geophys. Res.-Atmos., 112, <ext-link xlink:href="https://doi.org/10.1029/2007JD008425" ext-link-type="DOI">10.1029/2007JD008425</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Chin et al.(2000)Chin, Rood, Lin, Müller, and Thompson</label><mixed-citation>Chin, M., Rood, R. B., Lin, S.-J., Müller, J.-F., and Thompson, A. M.: Atmospheric sulfur cycle simulated in the global model GOCART: Model description and global properties, J. Geophys. Res.-Atmos., 105, 24671–24687, <ext-link xlink:href="https://doi.org/10.1029/2000JD900384" ext-link-type="DOI">10.1029/2000JD900384</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Chollet(2015)</label><mixed-citation>Chollet, F.: Keras, GitHub [code], <uri>https://github.com/fchollet/keras</uri> (last access: 23 March 2026), 2015.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Choudhury and Tesche(2022)</label><mixed-citation>Choudhury, G. and Tesche, M.: Estimating cloud condensation nuclei concentrations from CALIPSO lidar measurements, Atmos. Meas. Tech., 15, 639–654, <ext-link xlink:href="https://doi.org/10.5194/amt-15-639-2022" ext-link-type="DOI">10.5194/amt-15-639-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Christensen et al.(2020)Christensen, Jones, and Stier</label><mixed-citation>Christensen, M. W., Jones, W. K., and Stier, P.: Aerosols enhance cloud lifetime and brightness along the stratus-to-cumulus transition, P. Natl. Acad. Sci. USA, 117, 17591–17598, <ext-link xlink:href="https://doi.org/10.1073/pnas.1921231117" ext-link-type="DOI">10.1073/pnas.1921231117</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Chu et al.(2002)Chu, Kaufman, Ichoku, Remer, Tanré, and Holben</label><mixed-citation>Chu, D., Kaufman, Y., Ichoku, C., Remer, L., Tanré, D., and Holben, B.: Validation of MODIS aerosol optical depth retrieval over land, Geophys. Res. Lett., 29, <ext-link xlink:href="https://doi.org/10.1029/2001GL013205" ext-link-type="DOI">10.1029/2001GL013205</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Colarco et al.(2010a)Colarco, da Silva, Chin, and Diehl</label><mixed-citation>Colarco, P., da Silva, A., Chin, M., and Diehl, T.: Online simulations of global aerosol distributions in the NASA GEOS-4 model and comparisons to satellite and ground-based aerosol optical depth, J. Geophys. Res., 115, D14207, <ext-link xlink:href="https://doi.org/10.1029/2009JD012820" ext-link-type="DOI">10.1029/2009JD012820</ext-link>, 2010a.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Colarco et al.(2010b)Colarco, da Silva, Chin, and Diehl</label><mixed-citation>Colarco, P., da Silva, A., Chin, M., and Diehl, T.: Online simulations of global aerosol distributions in the NASA GEOS-4 model and comparisons to satellite and ground-based aerosol optical depth, J. Geophys. Res.-Atmos., 115, <ext-link xlink:href="https://doi.org/10.1029/2009JD012820" ext-link-type="DOI">10.1029/2009JD012820</ext-link>, 2010b.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Engler et al.(2007)Engler, Rose, Wehner, Wiedensohler, Brüggemann, Gnauk, Spindler, Tuch, and Birmili</label><mixed-citation>Engler, C., Rose, D., Wehner, B., Wiedensohler, A., Brüggemann, E., Gnauk, T., Spindler, G., Tuch, T., and Birmili, W.: Size distributions of non-volatile particle residuals (<inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">800</mml:mn></mml:mrow></mml:math></inline-formula> nm) at a rural site in Germany and relation to air mass origin, Atmos. Chem. Phys., 7, 5785–5802, <ext-link xlink:href="https://doi.org/10.5194/acp-7-5785-2007" ext-link-type="DOI">10.5194/acp-7-5785-2007</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Forster et al.(2007)Forster, Ramaswamy, Artaxo, Berntsen, Betts, Fahey, Haywood, Lean, Lowe, Myhre et al.</label><mixed-citation>Forster, P., Ramaswamy, V., Artaxo, P., Berntsen, T., Betts, R., Fahey, D. W., Haywood, J., Lean, J., Lowe, D. C., Myhre, G., Nganga, J., Prinn, R., Raga, G., Schulz, M., and Van Dorland, R.: Changes in Atmospheric Constituents and in Radiative Forcing, in: Climate Change 2007: The Physical Science Basis. Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Solomon, S., Qin, D., Manning, M., Chen, Z., Marquis, M., Averyt, K. B., Tignor, M., and Miller, H. L., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, ISBN 9780521880091, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Fountoukis and Nenes(2005)</label><mixed-citation>Fountoukis, C. and Nenes, A.: Continued development of a cloud droplet formation parameterization for global climate models, J. Geophys. Res.-Atmos., 110, <ext-link xlink:href="https://doi.org/10.1029/2004JD005591" ext-link-type="DOI">10.1029/2004JD005591</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Gelaro et al.(2017)Gelaro, McCarty, Suárez, Todling, Molod, Takacs, Randles, Darmenov, Bosilovich, Reichle, Wargan, Coy, Cullather, Draper, Akella, Buchard, Conaty, da Silva, Gu, Kim, Koster, Lucchesi, Merkova, Nielsen, Partyka, Pawson, Putman, Rienecker, Schubert, Sienkiewicz, and Zhao</label><mixed-citation>Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2), J. Climate, 30, 5419–5454, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-16-0758.1" ext-link-type="DOI">10.1175/JCLI-D-16-0758.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Ginoux et al.(2001)Ginoux, Chin, Tegen, Prospero, Holben, Dubovik, and Lin</label><mixed-citation>Ginoux, P., Chin, M., Tegen, I., Prospero, J. M., Holben, B., Dubovik, O., and Lin, S.-J.: Sources and distributions of dust aerosols simulated with the GOCART model, J. Geophys. Res.-Atmos., 106, 20255–20273, <ext-link xlink:href="https://doi.org/10.1029/2000JD000053" ext-link-type="DOI">10.1029/2000JD000053</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>GMAO(2015)</label><mixed-citation>GMAO: MERRA-2 inst3_3d_asm_Nv: 3d,3-Hourly,Instantaneous,Model-Level,Assimilation,Assimilated Meteorological Fields V5.12.4, Goddard Earth Sciences Data and Information Services Center (GES DISC) [data set], <ext-link xlink:href="https://doi.org/10.5067/WWQSXQ8IVFW8" ext-link-type="DOI">10.5067/WWQSXQ8IVFW8</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>GMAO(2025)</label><mixed-citation>GMAO: GiOcean Coupled Reanalysis, GMAO [data set], <uri>https://portal.nccs.nasa.gov/datashare/gmao/GiOCEAN/</uri> (last access: 23 March 2026), 2025.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Gong et al.(2022)Gong, Wex, Müller, Henning, Voigtländer, Wiedensohler, and Stratmann</label><mixed-citation>Gong, X., Wex, H., Müller, T., Henning, S., Voigtländer, J., Wiedensohler, A., and Stratmann, F.: Understanding aerosol microphysical properties from 10 years of data collected at Cabo Verde based on an unsupervised machine learning classification, Atmos. Chem. Phys., 22, 5175–5194, <ext-link xlink:href="https://doi.org/10.5194/acp-22-5175-2022" ext-link-type="DOI">10.5194/acp-22-5175-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Gruening et al.(2009)Gruening, Adam, Cavalli, Cavalli, Dell’Acqua, Martins Dos Santos, Pagliari, Roux, and Putaud</label><mixed-citation>Gruening, C., Adam, M., Cavalli, F., Cavalli, P., Dell’Acqua, A., Martins Dos Santos, S., Pagliari, V., Roux, D., and Putaud, J.: JRC Ispra EMEP–GAW Regional Station for Atmospheric Research, Tech. Rep. JRC55382, European Commission, <uri>https://publications.jrc.ec.europa.eu/repository/handle/JRC55382</uri> (last access: 23 March 2026), 2009.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Gueymard and Yang(2020)</label><mixed-citation>Gueymard, C. A. and Yang, D.: Worldwide validation of CAMS and MERRA-2 reanalysis aerosol optical depth products using 15 years of AERONET observations, Atmos. Environ., 225, 117216, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2019.117216" ext-link-type="DOI">10.1016/j.atmosenv.2019.117216</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Harder et al.(2022)Harder, Watson-Parris, Stier, Strassel, Gauger, and Keuper</label><mixed-citation>Harder, P., Watson-Parris, D., Stier, P., Strassel, D., Gauger, N. R., and Keuper, J.: Physics-Informed Learning of Aerosol Microphysics, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2207.11786" ext-link-type="DOI">10.48550/arXiv.2207.11786</ext-link>, 24 July 2022.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Hari et al.(2013)Hari, Nikinmaa, Pohja, Siivola, Bäck, Vesala, and Kulmala</label><mixed-citation>Hari, P., Nikinmaa, E., Pohja, T., Siivola, E., Bäck, J., Vesala, T., and Kulmala, M.: Station for measuring ecosystem-atmosphere relations: SMEAR, in: Physical and physiological forest ecology, Springer Nature, 471–487, <ext-link xlink:href="https://doi.org/10.1007/978-94-007-5603-8_9" ext-link-type="DOI">10.1007/978-94-007-5603-8_9</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Inness et al.(2019)Inness, Ades, Agustí-Panareda, Barré, Benedictow, Blechschmidt, Dominguez, Engelen, Eskes, Flemming et al.</label><mixed-citation>Inness, A., Ades, M., Agustí-Panareda, A., Barré, J., Benedictow, A., Blechschmidt, A.-M., Dominguez, J. J., Engelen, R., Eskes, H., Flemming, J., Huijnen, V., Jones, L., Kipling, Z., Massart, S., Parrington, M., Peuch, V.-H., Razinger, M., Remy, S., Schulz, M., and Suttie, M.: The CAMS reanalysis of atmospheric composition, Atmos. Chem. Phys., 19, 3515–3556, <ext-link xlink:href="https://doi.org/10.5194/acp-19-3515-2019" ext-link-type="DOI">10.5194/acp-19-3515-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Japkowicz and Stephen(2002)</label><mixed-citation>Japkowicz, N. and Stephen, S.: The class imbalance problem: A systematic study, Intell. Data Anal., 6, 429–449, <ext-link xlink:href="https://doi.org/10.3233/IDA-2002-6504" ext-link-type="DOI">10.3233/IDA-2002-6504</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Jeggle et al.(2023)Jeggle, Neubauer, Camps-Valls, and Lohmann</label><mixed-citation>Jeggle, K., Neubauer, D., Camps-Valls, G., and Lohmann, U.: Understanding cirrus clouds using explainable machine learning, Environmental Data Science, 2, e19, <ext-link xlink:href="https://doi.org/10.1017/eds.2023.14" ext-link-type="DOI">10.1017/eds.2023.14</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Jennings et al.(1991)Jennings, O'Dowd, O'Connor, and McGovern</label><mixed-citation>Jennings, S., O'Dowd, C., O'Connor, T., and McGovern, F.: Physical characteristics of the ambient aerosol at Mace Head, Atmos. Environ. A Gen., 25, 557–562, <ext-link xlink:href="https://doi.org/10.1016/0960-1686(91)90052-9" ext-link-type="DOI">10.1016/0960-1686(91)90052-9</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Jia et al.(2024)Jia, Andersen, and Cermak</label><mixed-citation>Jia, Y., Andersen, H., and Cermak, J.: Analysis of the cloud fraction adjustment to aerosols and its dependence on meteorological controls using explainable machine learning, Atmos. Chem. Phys., 24, 13025–13045, <ext-link xlink:href="https://doi.org/10.5194/acp-24-13025-2024" ext-link-type="DOI">10.5194/acp-24-13025-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Jones et al.(1994)Jones, Roberts, and Slingo</label><mixed-citation>Jones, A., Roberts, D., and Slingo, A.: A climate model study of indirect radiative forcing by anthropogenic sulphate aerosols, Nature, 370, 450–453, <ext-link xlink:href="https://doi.org/10.1038/370450a0" ext-link-type="DOI">10.1038/370450a0</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Jurányi et al.(2011)Jurányi, Gysel, Weingartner, Bukowiecki, Kammermann, and Baltensperger</label><mixed-citation>Jurányi, Z., Gysel, M., Weingartner, E., Bukowiecki, N., Kammermann, L., and Baltensperger, U.: A 17 month climatology of the cloud condensation nuclei number concentration at the high alpine site Jungfraujoch, J. Geophys. Res.-Atmos., 116, <ext-link xlink:href="https://doi.org/10.1029/2010JD015199" ext-link-type="DOI">10.1029/2010JD015199</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Kingma and Ba(2014)</label><mixed-citation>Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1412.6980" ext-link-type="DOI">10.48550/arXiv.1412.6980</ext-link>, 22 December 2014.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Kirpes et al.(2018)Kirpes, Bondy, Bonanno, Moffet, Wang, Laskin, Ault, and Pratt</label><mixed-citation>Kirpes, R. M., Bondy, A. L., Bonanno, D., Moffet, R. C., Wang, B., Laskin, A., Ault, A. P., and Pratt, K. A.: Secondary sulfate is internally mixed with sea spray aerosol and organic aerosol in the winter Arctic, Atmos. Chem. Phys., 18, 3937–3949, <ext-link xlink:href="https://doi.org/10.5194/acp-18-3937-2018" ext-link-type="DOI">10.5194/acp-18-3937-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Kiss et al.(2002)Kiss, Varga, Galambos, and Ganszky</label><mixed-citation>Kiss, G., Varga, B., Galambos, I., and Ganszky, I.: Characterization of water-soluble organic matter isolated from atmospheric fine aerosol, J. Geophys. Res.-Atmos., 107, <ext-link xlink:href="https://doi.org/10.1029/2001JD000603" ext-link-type="DOI">10.1029/2001JD000603</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Kreidenweis et al.(2005)Kreidenweis, Koehler, DeMott, Prenni, Carrico, and Ervens</label><mixed-citation>Kreidenweis, S. M., Koehler, K., DeMott, P. J., Prenni, A. J., Carrico, C., and Ervens, B.: Water activity and activation diameters from hygroscopicity data - Part I: Theory and application to inorganic salts, Atmos. Chem. Phys., 5, 1357–1370, <ext-link xlink:href="https://doi.org/10.5194/acp-5-1357-2005" ext-link-type="DOI">10.5194/acp-5-1357-2005</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Kristensson et al.(2008)Kristensson, Dal Maso, Swietlicki, Hussein, Zhou, Kerminen, and Kulmala</label><mixed-citation>Kristensson, A., Dal Maso, M., Swietlicki, E., Hussein, T., Zhou, J., Kerminen, V.-M., and Kulmala, M.: Characterization of new particle formation events at a background site in Southern Sweden: relation to air mass history, Tellus B, 60, 330–344, <ext-link xlink:href="https://doi.org/10.1111/j.1600-0889.2008.00345.x" ext-link-type="DOI">10.1111/j.1600-0889.2008.00345.x</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Kwon et al.(2023)Kwon, An, Song, and Sung</label><mixed-citation>Kwon, Y., An, S. A., Song, H.-J., and Sung, K.: Particulate Matter Prediction and Shapley Value Interpretation in Korea through a Deep Learning Model, SOLA, 19, 225–231, <ext-link xlink:href="https://doi.org/10.2151/sola.2023-029" ext-link-type="DOI">10.2151/sola.2023-029</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Langner and Rodhe(1991)</label><mixed-citation>Langner, J. and Rodhe, H.: A global three-dimensional model of the tropospheric sulfur cycle, J. Atmos. Chem., 13, 225–263, <ext-link xlink:href="https://doi.org/10.1007/BF00058134" ext-link-type="DOI">10.1007/BF00058134</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Lee et al.(2013)Lee, Pringle, Reddington, Mann, Stier, Spracklen, Pierce, and Carslaw</label><mixed-citation>Lee, L. A., Pringle, K. J., Reddington, C. L., Mann, G. W., Stier, P., Spracklen, D. V., Pierce, J. R., and Carslaw, K. S.: The magnitude and causes of uncertainty in global model simulations of cloud condensation nuclei, Atmos. Chem. Phys., 13, 8879–8914, <ext-link xlink:href="https://doi.org/10.5194/acp-13-8879-2013" ext-link-type="DOI">10.5194/acp-13-8879-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Lihavainen et al.(2008)Lihavainen, Kerminen, Komppula, Hyvärinen, Laakia, Saarikoski, Makkonen, Kivekäs, Hillamo, Kulmala et al.</label><mixed-citation>Lihavainen, H., Kerminen, V.-M., Komppula, M., Hyvärinen, A.-P., Laakia, J., Saarikoski, S., Makkonen, U., Kivekäs, N., Hillamo, R., Kulmala, M., and Viisanen, Y.: Measurements of the relation between aerosol properties and microphysics and chemistry of low level liquid water clouds in Northern Finland, Atmos. Chem. Phys., 8, 6925–6938, <ext-link xlink:href="https://doi.org/10.5194/acp-8-6925-2008" ext-link-type="DOI">10.5194/acp-8-6925-2008</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Liu et al.(2012)Liu, Easter, Ghan, Zaveri, Rasch, Shi, Lamarque, Gettelman, Morrison, Vitt et al.</label><mixed-citation>Liu, X., Easter, R. C., Ghan, S. J., Zaveri, R., Rasch, P., Shi, X., Lamarque, J.-F., Gettelman, A., Morrison, H., Vitt, F., Conley, A., Park, S., Neale, R., Hannay, C., Ekman, A. M. L., Hess, P., Mahowald, N., Collins, W., Iacono, M. J., Bretherton, C. S., Flanner, M. G., and Mitchell, D.: Toward a minimal representation of aerosols in climate models: description and evaluation in the Community Atmosphere Model CAM5, Geosci. Model Dev., 5, 709–739, <ext-link xlink:href="https://doi.org/10.5194/gmd-5-709-2012" ext-link-type="DOI">10.5194/gmd-5-709-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Lundberg and Lee(2017)</label><mixed-citation>Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model predictions, Advances in neural information processing systems, 30, <ext-link xlink:href="https://doi.org/10.48550/arXiv.1705.07874" ext-link-type="DOI">10.48550/arXiv.1705.07874</ext-link>, 22 May 2017.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Lundberg et al.(2020)Lundberg, Erion, Chen, DeGrave, Prutkin, Nair, Katz, Himmelfarb, Bansal, and Lee</label><mixed-citation>Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I.: From local explanations to global understanding with explainable AI for trees, Nature Machine Intelligence, 2, 56–67, <ext-link xlink:href="https://doi.org/10.1038/s42256-019-0138-9" ext-link-type="DOI">10.1038/s42256-019-0138-9</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Ma and Stinis(2020)</label><mixed-citation>Ma, P. L. and Stinis, P.: Developing a simulator-based satellite dataset for using machine learning techniques to derive aerosol-cloud-precipitation interactions in models and observations in a consistent framework, Tech. rep., Pacific Northwest National Laboratory (PNNL), Richland, WA (United States), <ext-link xlink:href="https://doi.org/10.2172/1984697" ext-link-type="DOI">10.2172/1984697</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Mann et al.(2010)Mann, Carslaw, Spracklen, Ridley, Manktelow, Chipperfield, Pickering, and Johnson</label><mixed-citation>Mann, G. W., Carslaw, K. S., Spracklen, D. V., Ridley, D. A., Manktelow, P. T., Chipperfield, M. P., Pickering, S. J., and Johnson, C. E.: Description and evaluation of GLOMAP-mode: a modal global aerosol microphysics model for the UKCA composition-climate model, Geosci. Model Dev., 3, 519–551, <ext-link xlink:href="https://doi.org/10.5194/gmd-3-519-2010" ext-link-type="DOI">10.5194/gmd-3-519-2010</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Marinoni et al.(2008)Marinoni, Cristofanelli, Calzolari, Roccato, Bonafè, and Bonasoni</label><mixed-citation>Marinoni, A., Cristofanelli, P., Calzolari, F., Roccato, F., Bonafè, U., and Bonasoni, P.: Continuous measurements of aerosol physical parameters at the Mt. Cimone GAW Station (2165 m asl, Italy), Sci. Total Environ., 391, 241–251, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2007.10.004" ext-link-type="DOI">10.1016/j.scitotenv.2007.10.004</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>McCoy et al.(2017)McCoy, Bender, Mohrmann, Hartmann, Wood, and Grosvenor</label><mixed-citation>McCoy, D., Bender, F.-M., Mohrmann, J., Hartmann, D., Wood, R., and Grosvenor, D.: The global aerosol-cloud first indirect effect estimated using MODIS, MERRA, and AeroCom, J. Geophys. Res.-Atmos., 122, 1779–1796, <ext-link xlink:href="https://doi.org/10.1002/2016JD026141" ext-link-type="DOI">10.1002/2016JD026141</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Mihalopoulos et al.(1997)Mihalopoulos, Stephanou, Kanakidou, Pilitsidis, and Bousquet</label><mixed-citation>Mihalopoulos, N., Stephanou, E., Kanakidou, M., Pilitsidis, S., and Bousquet, P.: Tropospheric aerosol ionic composition in the Eastern Mediterranean region, Tellus B, 49, 314–326, <ext-link xlink:href="https://doi.org/10.3402/tellusb.v49i3.15970" ext-link-type="DOI">10.3402/tellusb.v49i3.15970</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Molod et al.(2015)Molod, Takacs, Suarez, and Bacmeister</label><mixed-citation>Molod, A., Takacs, L., Suarez, M., and Bacmeister, J.: Development of the GEOS-5 atmospheric general circulation model: evolution from MERRA to MERRA2, Geosci. Model Dev., 8, 1339–1356, <ext-link xlink:href="https://doi.org/10.5194/gmd-8-1339-2015" ext-link-type="DOI">10.5194/gmd-8-1339-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Molod et al.(2020)Molod, Hackert, Vikhliaev, Zhao, Barahona, Vernieres, Borovikov, Kovach, Marshak, Schubert et al.</label><mixed-citation>Molod, A., Hackert, E., Vikhliaev, Y., Zhao, B., Barahona, D., Vernieres, G., Borovikov, A., Kovach, R. M., Marshak, J., Schubert, S., Li, Z., Lim, Y.-K., Andrews, L. C., Cullather, R., Koster, R., Achuthavarier, D., Carton, J., Coy, L., Friere, J. L. M., Longo, K. M., Nakada, K., and Pawson, S.: GEOS-S2S version 2: The GMAO high-resolution coupled model and assimilation system for seasonal prediction, J. Geophys. Res.-Atmos., 125, e2019JD031767, <ext-link xlink:href="https://doi.org/10.1029/2019JD031767" ext-link-type="DOI">10.1029/2019JD031767</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>Nair et al.(2021)Nair, Yu, Campuzano-Jost, DeMott, Levin, Jimenez, Peischl, Pollack, Fredrickson, Beyersdorf, Nault, Park, Yum, Palm, Xu, Bourgeois, Anderson, Nenes, Ziemba, Moore, Lee, Park, Thompson, Flocke, Huey, Kim, and Peng</label><mixed-citation>Nair, A. A., Yu, F., Campuzano-Jost, P., DeMott, P. J., Levin, E. J. T., Jimenez, J. L., Peischl, J., Pollack, I. B., Fredrickson, C. D., Beyersdorf, A. J., Nault, B. A., Park, M., Yum, S. S., Palm, B. B., Xu, L., Bourgeois, I., Anderson, B. E., Nenes, A., Ziemba, L. D., Moore, R. H., Lee, T., Park, T., Thompson, C. R., Flocke, F., Huey, L. G., Kim, M. J., and Peng, Q.: Machine Learning Uncovers Aerosol Size Information From Chemistry and Meteorology to Quantify Potential Cloud-Forming Particles, Geophys. Res. Lett., 48, e2021GL094133, <ext-link xlink:href="https://doi.org/10.1029/2021GL094133" ext-link-type="DOI">10.1029/2021GL094133</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>NCAR(2019)</label><mixed-citation>NCAR: NCAR Command Language (Version 6.6.2), UCAR/NCAR/CISL/TDD [software], <ext-link xlink:href="https://doi.org/10.5065/D6WD3XH5" ext-link-type="DOI">10.5065/D6WD3XH5</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Nojarov et al.(2009)Nojarov, Ivanov, Kalapov, Penev, and Drenska</label><mixed-citation>Nojarov, P., Ivanov, P., Kalapov, I., Penev, I., and Drenska, M.: Connection between ozone concentration and atmosphere circulation at peak Moussala, Theor. Appl. Climatol., 98, 201–208, <ext-link xlink:href="https://doi.org/10.1007/s00704-009-0173-2" ext-link-type="DOI">10.1007/s00704-009-0173-2</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>O'Malley et al.(2019)O'Malley, Bursztein, Long, Chollet, Jin, Invernizzi et al.</label><mixed-citation>O'Malley, T., Bursztein, E., Long, J., Chollet, F., Jin, H., and Invernizzi, L.: Keras Tuner, <uri>https://github.com/keras-team/keras-tuner</uri> (last access: 23 March 2026), 2019.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Ott et al.(2020)Ott, Pritchard, Best, Linstead, Curcic, and Baldi</label><mixed-citation>Ott, J., Pritchard, M., Best, N., Linstead, E., Curcic, M., and Baldi, P.: A Fortran-Keras deep learning bridge for scientific computing, Scientific Programming, 2020, <ext-link xlink:href="https://doi.org/10.1155/2020/8888811" ext-link-type="DOI">10.1155/2020/8888811</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx74"><label>Philippin et al.(2009)Philippin, Laj, Putaud, Wiedensohler, Leeuw, Fjaeraa, Platt, Baltensperger, and Fiebig</label><mixed-citation>Philippin, S., Laj, P., Putaud, J.-P., Wiedensohler, A., Leeuw, G. D., Fjaeraa, A. M., Platt, U., Baltensperger, U., and Fiebig, M.: EUSAAR-An unprecedented network of aerosol observation in Europe, Journal of Aerosol Research (Earozoru Kenkyu), 24, 78–83, <ext-link xlink:href="https://doi.org/10.11203/jar.24.78" ext-link-type="DOI">10.11203/jar.24.78</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>Pierce et al.(2015)Pierce, Croft, Kodros, D'Andrea, and Martin</label><mixed-citation>Pierce, J. R., Croft, B., Kodros, J. K., D'Andrea, S. D., and Martin, R. V.: The importance of interstitial particle scavenging by cloud droplets in shaping the remote aerosol size distribution and global aerosol-climate effects, Atmos. Chem. Phys., 15, 6147–6158, <ext-link xlink:href="https://doi.org/10.5194/acp-15-6147-2015" ext-link-type="DOI">10.5194/acp-15-6147-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx76"><label>Randles et al.(2017)Randles, da Silva, Buchard, Colarco, Darmenov, Govindaraju, Smirnov, Holben, Ferrare, Hair, Shinozuka, and Flynn</label><mixed-citation>Randles, C. A., da Silva, A. M., Buchard, V., Colarco, P. R., Darmenov, A., Govindaraju, R., Smirnov, A., Holben, B., Ferrare, R., Hair, J., Shinozuka, Y., and Flynn, C. J.: The MERRA-2 Aerosol Reanalysis, 1980 Onward. Part I: System Description and Data Assimilation Evaluation, J. Climate, 30, 6823–6850, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-16-0609.1" ext-link-type="DOI">10.1175/JCLI-D-16-0609.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx77"><label>Rasp et al.(2018)Rasp, Pritchard, and Gentine</label><mixed-citation>Rasp, S., Pritchard, M. S., and Gentine, P.: Deep learning to represent subgrid processes in climate models, P. Natl. Acad. Sci. USA, 115, 9684–9689, <ext-link xlink:href="https://doi.org/10.1073/pnas.1810286115" ext-link-type="DOI">10.1073/pnas.1810286115</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx78"><label>Reddington et al.(2017)Reddington, Carslaw, Stier, Schutgens, Coe, Liu, Allan, Pringle, Lee, Yoshioka et al.</label><mixed-citation>Reddington, C., Carslaw, K., Stier, P., Schutgens, N., Coe, H., Liu, D., Allan, J., Pringle, K., Lee, L., Yoshioka, M., Johnson, J. S., Regayre, L. A., Spracklen, D. V., Mann, G. W., Clarke, A., Hermann, M., Henning, S., Wex, H., Kristensen, T. B., Leaitch, W. R., Pöschl, U., Rose, D., Andreae, M. O., Schmale, J., Kondo, Y., Oshima, N., Schwarz, J. P., Nenes, A., Anderson, B., Roberts, G. C., Snider, J. R., Leck, C., Quinn, P. K., Chi, X., Ding, A., Jimenez, J. L., and Zhang, Q.: The Global Aerosol Synthesis and Science Project (GASSP): measurements and modeling to reduce uncertainty, B. Am. Meteorol. Soc., 98, 1857–1877, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-15-00317.1" ext-link-type="DOI">10.1175/BAMS-D-15-00317.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx79"><label>Remer et al.(2005)Remer, Kaufman, Tanré, Mattoo, Chu, Martins, Li, Ichoku, Levy, Kleidman et al.</label><mixed-citation>Remer, L. A., Kaufman, Y., Tanré, D., Mattoo, S., Chu, D., Martins, J. V., Li, R.-R., Ichoku, C., Levy, R., Kleidman, R., Eck, T. F., Vermote, E., and Holben, B. N.: The MODIS aerosol algorithm, products, and validation, J. Atmos. Sci., 62, 947–973, <ext-link xlink:href="https://doi.org/10.1175/JAS3385.1" ext-link-type="DOI">10.1175/JAS3385.1</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx80"><label>Reynolds et al.(2002)Reynolds, Rayner, Smith, Stokes, and Wang</label><mixed-citation>Reynolds, R. W., Rayner, N. A., Smith, T. M., Stokes, D. C., and Wang, W.: An improved in situ and satellite SST analysis for climate, J. Climate, 15, 1609–1625, <ext-link xlink:href="https://doi.org/10.1175/1520-0442(2002)015&lt;1609:AIISAS&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0442(2002)015&lt;1609:AIISAS&gt;2.0.CO;2</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx81"><label>Riemer et al.(2019)Riemer, Ault, West, Craig, and Curtis</label><mixed-citation>Riemer, N., Ault, A., West, M., Craig, R., and Curtis, J.: Aerosol mixing state: Measurements, modeling, and impacts, Rev. Geophys., 57, 187–249, <ext-link xlink:href="https://doi.org/10.1029/2018RG000615" ext-link-type="DOI">10.1029/2018RG000615</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx82"><label>Rienecker et al.(2008)Rienecker, Suarez, Todling, Bacmeister, Takacs, Liu, Gu, Sienkiewicz, Koster, Gelaro, Stajner, and Nielsen</label><mixed-citation>Rienecker, M., Suarez, M., Todling, R., Bacmeister, J., Takacs, L., Liu, H.-C., Gu, W., Sienkiewicz, M., Koster, R., Gelaro, R., Stajner, I., and Nielsen, J.: The GEOS-5 Data Assimilation System – Documentation of Versions 5.0.1, 5.1.0, and 5.2.0., vol. 27 of Technical Report Series on Global Modeling and Data Assimilation, NASA Goddard Space Flight Center, Greenbelt, MD, USA, <uri>https://ntrs.nasa.gov/citations/20120011955</uri> (last access: 23 March 2026), 2008.</mixed-citation></ref>
      <ref id="bib1.bibx83"><label>Russchenberg et al.(2005)Russchenberg, Bosveld, Swart, ten Brink, de Leeuw, Uijlenhoet, Arbesser-Rastburg, van der Marel, Ligthart, Boers et al.</label><mixed-citation>Russchenberg, H., Bosveld, F., Swart, D., ten Brink, H., de Leeuw, G., Uijlenhoet, R., Arbesser-Rastburg, B., van der Marel, H., Ligthart, L., Boers, R., and Apituley, A.: Ground-based atmospheric remote sensing in the Netherlands: European outlook, IEICE T. Commun., 88, 2252–2258, <ext-link xlink:href="https://doi.org/10.1093/ietcom/e88-b.6.2252" ext-link-type="DOI">10.1093/ietcom/e88-b.6.2252</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx84"><label>Seinfeld and Pandis(2016)</label><mixed-citation>Seinfeld, J. H. and Pandis, S. N.: Atmospheric chemistry and physics: from air pollution to climate change, John Wiley &amp; Sons, ISBN 0471720186, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx85"><label>Seinfeld et al.(2016)Seinfeld, Bretherton, Carslaw, Coe, DeMott, Dunlea, Feingold, Ghan, Guenther, Kahn et al.</label><mixed-citation>Seinfeld, J. H., Bretherton, C., Carslaw, K. S., Coe, H., DeMott, P. J., Dunlea, E. J., Feingold, G., Ghan, S., Guenther, A. B., Kahn, R., Kraucunas, I., Kreidenweis, S. M., Molina, M. J., Nenes, A., Penner, J. E., Prather, K. A., Ramanathan, V., Ramaswamy, V., Rasch, P. J., Ravishankara, A. R., Rosenfeld, D., Stephens, G., and Wood, R.: Improving our fundamental understanding of the role of aerosol-cloud interactions in the climate system, P. Natl. Acad. Sci. USA, 113, 5781–5790, <ext-link xlink:href="https://doi.org/10.1073/pnas.1514043113" ext-link-type="DOI">10.1073/pnas.1514043113</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx86"><label>Silva et al.(2021)Silva, Ma, Hardin, and Rothenberg</label><mixed-citation>Silva, S. J., Ma, P.-L., Hardin, J. C., and Rothenberg, D.: Physically regularized machine learning emulators of aerosol activation, Geosci. Model Dev., 14, 3067–3077, <ext-link xlink:href="https://doi.org/10.5194/gmd-14-3067-2021" ext-link-type="DOI">10.5194/gmd-14-3067-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx87"><label>Song et al.(2025)Song, McCoy, Molod, Aerenson, and Barahona</label><mixed-citation>Song, C., McCoy, D., Molod, A., Aerenson, T., and Barahona, D.: Signatures of aerosol-cloud interactions in GiOcean: a coupled global reanalysis with two-moment cloud microphysics, Atmos. Chem. Phys., 25, 15567–15592, <ext-link xlink:href="https://doi.org/10.5194/acp-25-15567-2025" ext-link-type="DOI">10.5194/acp-25-15567-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx88"><label>Stier et al.(2005)Stier, Feichter, Kinne, Kloster, Vignati, Wilson, Ganzeveld, Tegen, Werner, Balkanski et al.</label><mixed-citation>Stier, P., Feichter, J., Kinne, S., Kloster, S., Vignati, E., Wilson, J., Ganzeveld, L., Tegen, I., Werner, M., Balkanski, Y., Schulz, M., Boucher, O., Minikin, A., and Petzold, A.: The aerosol-climate model ECHAM5-HAM, Atmos. Chem. Phys., 5, 1125–1156, <ext-link xlink:href="https://doi.org/10.5194/acp-5-1125-2005" ext-link-type="DOI">10.5194/acp-5-1125-2005</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx89"><label>Stier et al.(2024)Stier, van den Heever, Christensen, Gryspeerdt, Dagan, Saleeby, Bollasina, Donner, Emanuel, Ekman et al.</label><mixed-citation>Stier, P., van den Heever, S. C., Christensen, M. W., Gryspeerdt, E., Dagan, G., Saleeby, S. M., Bollasina, M., Donner, L., Emanuel, K., Ekman, A. M., Feingold, G., Field, P., Forster, P., Haywood, J., Kahn, R., Koren, I., Kummerow, C., L’Ecuyer, T., Lohmann, U., Ming, Y., Myhre, G., Quaas, J., Rosenfeld, D., Samset, B., Seifert, A., Stephens, G., and Tao, W. K.: Multifaceted aerosol effects on precipitation, Nat. Geosci., 17, 719–732, <ext-link xlink:href="https://doi.org/10.1038/s41561-024-01482-6" ext-link-type="DOI">10.1038/s41561-024-01482-6</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx90"><label>Ström et al.(2003)Ström, Umegård, Tørseth, Tunved, Hansson, Holmén, Wismann, Herber, and König-Langlo</label><mixed-citation>Ström, J., Umegård, J., Tørseth, K., Tunved, P., Hansson, H.-C., Holmén, K., Wismann, V., Herber, A., and König-Langlo, G.: One year of particle size distribution and aerosol chemical composition measurements at the Zeppelin Station, Svalbard, March 2000–March 2001, Phys. Chem. Earth Pt. A/B/C, 28, 1181–1190, <ext-link xlink:href="https://doi.org/10.1016/j.pce.2003.08.058" ext-link-type="DOI">10.1016/j.pce.2003.08.058</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx91"><label>Su et al.(2023)Su, Huang, Wang, Cao, and Feng</label><mixed-citation>Su, X., Huang, Y., Wang, L., Cao, M., and Feng, L.: Validation and diurnal variation evaluation of MERRA-2 multiple aerosol properties on a global scale, Atmos. Environ., 311, 120019, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2023.120019" ext-link-type="DOI">10.1016/j.atmosenv.2023.120019</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx92"><label>Sun et al.(2019)Sun, Che, Xu, Wang, Lu, Gui, Zhao, Zheng, Wang, Wang et al.</label><mixed-citation>Sun, E., Che, H., Xu, X., Wang, Z., Lu, C., Gui, K., Zhao, H., Zheng, Y., Wang, Y., Wang, H., Sun, T., Liang, Y., Li, X., Sheng, Z., An, L., Zhang, X., and Shi, G.: Variation in MERRA-2 aerosol optical depth over the Yangtze River Delta from 1980 to 2016, Theor. Appl. Climatol., 136, 363–375, <ext-link xlink:href="https://doi.org/10.1007/s00704-018-2490-9" ext-link-type="DOI">10.1007/s00704-018-2490-9</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx93"><label>Takacs et al.(2018)Takacs, Suárez, and Todling</label><mixed-citation>Takacs, L. L., Suárez, M. J., and Todling, R.: The stability of incremental analysis update, Mon. Weather Rev., 146, 3259–3275, <ext-link xlink:href="https://doi.org/10.1175/MWR-D-18-0117.1" ext-link-type="DOI">10.1175/MWR-D-18-0117.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx94"><label>Tunved et al.(2004)Tunved, Ström, and Hansson</label><mixed-citation>Tunved, P., Ström, J., and Hansson, H.-C.: An investigation of processes controlling the evolution of the boundary layer aerosol size distribution properties at the Swedish background station Aspvreten, Atmos. Chem. Phys., 4, 2581–2592, <ext-link xlink:href="https://doi.org/10.5194/acp-4-2581-2004" ext-link-type="DOI">10.5194/acp-4-2581-2004</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx95"><label>Ukhov et al.(2020)Ukhov, Mostamandi, Da Silva, Flemming, Alshehri, Shevchenko, and Stenchikov</label><mixed-citation>Ukhov, A., Mostamandi, S., da Silva, A., Flemming, J., Alshehri, Y., Shevchenko, I., and Stenchikov, G.: Assessment of natural and anthropogenic aerosol air pollution in the Middle East using MERRA-2, CAMS data assimilation products, and high-resolution WRF-Chem model simulations, Atmos. Chem. Phys., 20, 9281–9310, <ext-link xlink:href="https://doi.org/10.5194/acp-20-9281-2020" ext-link-type="DOI">10.5194/acp-20-9281-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx96"><label>Ulevicius et al.(2010)Ulevicius, Byčenkienė, Remeikis, Garbaras, Kecorius, Andriejauskienė, Jasinevičienė, and Mocnik</label><mixed-citation>Ulevicius, V., Byčenkienė, S., Remeikis, V., Garbaras, A., Kecorius, S., Andriejauskienė, J., Jasinevičienė, D., and Mocnik, G.: Characterization of pollution events in the East Baltic region affected by regional biomass fire emissions, Atmos. Res., 98, 190–200, <ext-link xlink:href="https://doi.org/10.1016/j.atmosres.2010.03.021" ext-link-type="DOI">10.1016/j.atmosres.2010.03.021</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx97"><label>Uno et al.(2009)Uno, Eguchi, Yumimoto, Takemura, Shimizu, Uematsu, Liu, Wang, Hara, and Sugimoto</label><mixed-citation>Uno, I., Eguchi, K., Yumimoto, K., Takemura, T., Shimizu, A., Uematsu, M., Liu, Z., Wang, Z., Hara, Y., and Sugimoto, N.: Asian dust transported one full circuit around the globe, Nat. Geosci., 2, 557–560, <ext-link xlink:href="https://doi.org/10.1038/ngeo583" ext-link-type="DOI">10.1038/ngeo583</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx98"><label>Venzac et al.(2009)Venzac, Sellegri, Villani, Picard, and Laj</label><mixed-citation>Venzac, H., Sellegri, K., Villani, P., Picard, D., and Laj, P.: Seasonal variation of aerosol size distributions in the free troposphere and residual layer at the puy de Dôme station, France, Atmos. Chem. Phys., 9, 1465–1478, <ext-link xlink:href="https://doi.org/10.5194/acp-9-1465-2009" ext-link-type="DOI">10.5194/acp-9-1465-2009</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx99"><label>Virtanen et al.(2025)Virtanen, Joutsensaari, Kokkola, Partridge, Blichner, Seland, Holopainen, Tovazzi, Lipponen, Mikkonen et al.</label><mixed-citation>Virtanen, A., Joutsensaari, J., Kokkola, H., Partridge, D. G., Blichner, S., Seland, Ø., Holopainen, E., Tovazzi, E., Lipponen, A., Mikkonen, S., Leskinen, A., Hyvärinen, A.-P., Zieger, P., Krejci, R., Ekman, A. M. L., Riipinen, I., Quaas, J., and Romakkaniemi, S.: High sensitivity of cloud formation to aerosol changes, Nat. Geosci., <ext-link xlink:href="https://doi.org/10.1038/s41561-025-01662-y" ext-link-type="DOI">10.1038/s41561-025-01662-y</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx100"><label>Watson-Parris et al.(2019)Watson-Parris, Schutgens, Reddington, Pringle, Liu, Allan, Coe, Carslaw, and Stier</label><mixed-citation>Watson-Parris, D., Schutgens, N., Reddington, C., Pringle, K. J., Liu, D., Allan, J. D., Coe, H., Carslaw, K. S., and Stier, P.: In situ constraints on the vertical distribution of global aerosol, Atmos. Chem. Phys., 19, 11765–11790, <ext-link xlink:href="https://doi.org/10.5194/acp-19-11765-2019" ext-link-type="DOI">10.5194/acp-19-11765-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx101"><label>Whitby and McMurry(1997)</label><mixed-citation>Whitby, E. R. and McMurry, P. H.: Modal aerosol dynamics modeling, Aerosol Sci. Tech., 27, 673–688, <ext-link xlink:href="https://doi.org/10.1080/02786829708965504" ext-link-type="DOI">10.1080/02786829708965504</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx102"><label>Wilson et al.(2001)Wilson, Cuvelier, and Raes</label><mixed-citation>Wilson, J., Cuvelier, C., and Raes, F.: A modeling study of global mixed aerosol fields, J. Geophys. Res.-Atmos., 106, 34081–34108, <ext-link xlink:href="https://doi.org/10.1029/2000JD000198" ext-link-type="DOI">10.1029/2000JD000198</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx103"><label>Winter(2002)</label><mixed-citation>Winter, E.: The Shapley value, Handbook of Game Theory with Economic Applications, 3, 2025–2054, <ext-link xlink:href="https://doi.org/10.1016/S1574-0005(02)03016-3" ext-link-type="DOI">10.1016/S1574-0005(02)03016-3</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx104"><label>Yu et al.(2024)Yu, Ma, Singh, Silva, and Pritchard</label><mixed-citation>Yu, S., Ma, P.-L., Singh, B., Silva, S., and Pritchard, M.: Two-step hyperparameter optimization method: Accelerating hyperparameter search by using a fraction of a training dataset, Artificial Intelligence for the Earth Systems, 3, e230013, <ext-link xlink:href="https://doi.org/10.1175/AIES-D-23-0013.1" ext-link-type="DOI">10.1175/AIES-D-23-0013.1</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx105"><label>Zhang et al.(2020)Zhang, Sharma, Dhawan, Dhanraj, Li, and Biswas</label><mixed-citation>Zhang, H., Sharma, G., Dhawan, S., Dhanraj, D., Li, Z., and Biswas, P.: Comparison of discrete, discrete-sectional, modal and moment models for aerosol dynamics simulations, Aerosol Sci. Tech., 54, 739–760, <ext-link xlink:href="https://doi.org/10.1080/02786826.2020.1723787" ext-link-type="DOI">10.1080/02786826.2020.1723787</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx106"><label>Zhou et al.(2018)Zhou, Shen, Liu, Zhang, and Xin</label><mixed-citation>Zhou, C., Shen, X., Liu, Z., Zhang, Y., and Xin, J.: Simulating aerosol size distribution and mass concentration with simultaneous nucleation, condensation/coagulation, and deposition with the GRAPES–CUACE, Journal of Meteorological Research, 32, 265–278, <ext-link xlink:href="https://doi.org/10.1007/s13351-018-7116-8" ext-link-type="DOI">10.1007/s13351-018-7116-8</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx107"><label>Zhu et al.(2023)Zhu, Martin, Croft, Zhai, Li, Bindle, Pierce, Chang, Anderson, Ziemba, Hair, Ferrare, Hostetler, Singh, Chatterjee, Jimenez, Campuzano-Jost, Nault, Dibb, Schwarz, and Weinheimer</label><mixed-citation>Zhu, H., Martin, R. V., Croft, B., Zhai, S., Li, C., Bindle, L., Pierce, J. R., Chang, R. Y.-W., Anderson, B. E., Ziemba, L. D., Hair, J. W., Ferrare, R. A., Hostetler, C. A., Singh, I., Chatterjee, D., Jimenez, J. L., Campuzano-Jost, P., Nault, B. A., Dibb, J. E., Schwarz, J. S., and Weinheimer, A.: Parameterization of size of organic and secondary inorganic aerosol for efficient representation of global aerosol optical properties, Atmos. Chem. Phys., 23, 5023–5042, <ext-link xlink:href="https://doi.org/10.5194/acp-23-5023-2023" ext-link-type="DOI">10.5194/acp-23-5023-2023</ext-link>, 2023.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Deep learning representation of the aerosol size distribution</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Aas et al.(2021)Aas, Jullum, and Løland</label><mixed-citation>
      
Aas, K., Jullum, M., and Løland, A.: Explaining individual predictions when features are dependent:
More accurate approximations to Shapley values, Artif. Intell., 298, 103502,
<a href="https://doi.org/10.1016/j.artint.2021.103502" target="_blank">https://doi.org/10.1016/j.artint.2021.103502</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Adachi and Buseck(2008)</label><mixed-citation>
      
Adachi, K. and Buseck, P. R.: Internally mixed soot, sulfates, and organic matter in aerosol particles from Mexico City, Atmos. Chem. Phys., 8, 6469–6481, <a href="https://doi.org/10.5194/acp-8-6469-2008" target="_blank">https://doi.org/10.5194/acp-8-6469-2008</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Adams and Seinfeld(2002)</label><mixed-citation>
      
Adams, P. J. and Seinfeld, J. H.: Predicting global aerosol size distributions in general circulation models,
J. Geophys. Res.-Atmos., 107, <a href="https://doi.org/10.1029/2001JD001010" target="_blank">https://doi.org/10.1029/2001JD001010</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Amundsen et al.(1992)Amundsen, Hanssen, Semb, and Steinnes</label><mixed-citation>
      
Amundsen, C., Hanssen, J., Semb, A., and Steinnes, E.: Long-range atmospheric transport of trace
elements to southern Norway, Atmos. Environ. A Gen., 26, 1309–1324,
<a href="https://doi.org/10.1016/0960-1686(92)90391-W" target="_blank">https://doi.org/10.1016/0960-1686(92)90391-W</a>, 1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Aquila et al.(2011)Aquila, Hendricks, Lauer, Riemer, Vogel, Baumgardner, Minikin, Petzold, Schwarz, Spackman et al.</label><mixed-citation>
      
Aquila, V., Hendricks, J., Lauer, A., Riemer, N., Vogel, H., Baumgardner, D., Minikin, A., Petzold, A., Schwarz, J. P., Spackman, J. R., Weinzierl, B., Righi, M., and Dall'Amico, M.: MADE-in: a new aerosol microphysics submodel for global simulation of insoluble particles and their mixing state, Geosci. Model Dev., 4, 325–355, <a href="https://doi.org/10.5194/gmd-4-325-2011" target="_blank">https://doi.org/10.5194/gmd-4-325-2011</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Arfin et al.(2023)Arfin, Pillai, Mathew, Tirpude, Bang, and Mondal</label><mixed-citation>
      
Arfin, T., Pillai, A. M., Mathew, N., Tirpude, A., Bang, R., and Mondal, P.:
An overview of atmospheric aerosol and their effects on human health, Environ. Sci. Pollut. R.,
30, 125347–125369, <a href="https://doi.org/10.1007/s11356-023-29652-w" target="_blank">https://doi.org/10.1007/s11356-023-29652-w</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Asmi et al.(2011)Asmi, Wiedensohler, Laj, Fjaeraa, Sellegri, Birmili, Weingartner, Baltensperger, Zdimal, Zikova et al.</label><mixed-citation>
      
Asmi, A., Wiedensohler, A., Laj, P., Fjaeraa, A.-M., Sellegri, K., Birmili, W., Weingartner, E., Baltensperger, U., Zdimal, V., Zikova, N., Putaud, J.-P., Marinoni, A., Tunved, P., Hansson, H.-C., Fiebig, M., Kivekäs, N., Lihavainen, H., Asmi, E., Ulevicius, V., Aalto, P. P., Swietlicki, E., Kristensson, A., Mihalopoulos, N., Kalivitis, N., Kalapov, I., Kiss, G., de Leeuw, G., Henzing, B., Harrison, R. M., Beddows, D., O'Dowd, C., Jennings, S. G., Flentje, H., Weinhold, K., Meinhardt, F., Ries, L., and Kulmala, M.: Number size distributions and seasonality of submicron particles in Europe 2008–2009, Atmos. Chem. Phys., 11, 5505–5538, <a href="https://doi.org/10.5194/acp-11-5505-2011" target="_blank">https://doi.org/10.5194/acp-11-5505-2011</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Barahona and Breen(2025)</label><mixed-citation>
      
Barahona, D. and Breen, K.: MAMnet, Zenodo [code], <a href="https://doi.org/10.5281/zenodo.15190121" target="_blank">https://doi.org/10.5281/zenodo.15190121</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Barahona et al.(2014)Barahona, Molod, Bacmeister, Nenes, Gettelman, Morrison, Phillips, and Eichmann</label><mixed-citation>
      
Barahona, D., Molod, A., Bacmeister, J., Nenes, A., Gettelman, A., Morrison, H., Phillips, V., and Eichmann, A.: Development of two-moment cloud microphysics for liquid and ice within the NASA Goddard Earth Observing System Model (GEOS-5), Geosci. Model Dev., 7, 1733–1766, <a href="https://doi.org/10.5194/gmd-7-1733-2014" target="_blank">https://doi.org/10.5194/gmd-7-1733-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Barahona et al.(2024)Barahona, Breen, Kalesse-Los, and Röttenbacher</label><mixed-citation>
      
Barahona, D., Breen, K. H., Kalesse-Los, H., and Röttenbacher, J.:
Deep Learning Parameterization of Vertical Wind Velocity Variability via Constrained Adversarial Training,
Artificial Intelligence for the Earth Systems, 3, e230025, <a href="https://doi.org/10.1175/AIES-D-23-0025.1" target="_blank">https://doi.org/10.1175/AIES-D-23-0025.1</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Bender(2020)</label><mixed-citation>
      
Bender, F. A.-M.: Aerosol forcing: Still uncertain, still relevant, AGU Advances, 1, e2019AV000128,
<a href="https://doi.org/10.1029/2019AV000128" target="_blank">https://doi.org/10.1029/2019AV000128</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Bender et al.(2019)Bender, Frey, McCoy, Grosvenor, and Mohrmann</label><mixed-citation>
      
Bender, F. A.-M., Frey, L., McCoy, D. T., Grosvenor, D. P., and Mohrmann, J. K.:
Assessment of aerosol–cloud–radiation correlations in satellite observations, climate models and reanalysis,
Clim. Dynam., 52, 4371–4392, <a href="https://doi.org/10.1007/s00382-018-4384-z" target="_blank">https://doi.org/10.1007/s00382-018-4384-z</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Bengio et al.(2017)Bengio, Goodfellow, and Courville</label><mixed-citation>
      
Bengio, Y., Goodfellow, I., and Courville, A.: Deep learning, vol. 1, MIT Press, Cambridge, MA, USA, ISBN 978-0262035613, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Birmili et al.(2003)Birmili, Berresheim, Plass-Dülmer, Elste, Gilge, Wiedensohler, and Uhrner</label><mixed-citation>
      
Birmili, W., Berresheim, H., Plass-Dülmer, C., Elste, T., Gilge, S., Wiedensohler, A., and Uhrner, U.: The Hohenpeissenberg aerosol formation experiment (HAFEX): a long-term study including size-resolved aerosol, H<sub>2</sub>SO<sub>4</sub>, OH, and monoterpenes measurements, Atmos. Chem. Phys., 3, 361–376, <a href="https://doi.org/10.5194/acp-3-361-2003" target="_blank">https://doi.org/10.5194/acp-3-361-2003</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Birmili et al.(2009)Birmili, Weinhold, Nordmann, Wiedensohler, Spindler, Müller, Herrmann, Gnauk, Pitz, Cyrys et al.</label><mixed-citation>
      
Birmili, W., Weinhold, K., Nordmann, S., Wiedensohler, A., Spindler, G., Müller, K., Herrmann, H.,
Gnauk, T., Pitz, M., Cyrys, J., and Flentje, H.: Atmospheric aerosol measurements in the German ultrafine aerosol network (GUAN),
Gefahrst. Reinhalt. L., 69, 137–145, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Block(2023)</label><mixed-citation>
      
Block, K.: Cloud condensation nuclei (CCN) numbers derived from CAMS reanalysis EAC4 (Version 1), World Data Center for Climate (WDCC) at DKRZ [data set],
<a href="https://doi.org/10.26050/WDCC/QUAERERE_CCNCAMS_v1" target="_blank">https://doi.org/10.26050/WDCC/QUAERERE_CCNCAMS_v1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Block et al.(2024)Block, Haghighatnasab, Partridge, Stier, and Quaas</label><mixed-citation>
      
Block, K., Haghighatnasab, M., Partridge, D. G., Stier, P., and Quaas, J.: Cloud condensation nuclei concentrations derived from the CAMS reanalysis, Earth Syst. Sci. Data, 16, 443–470, <a href="https://doi.org/10.5194/essd-16-443-2024" target="_blank">https://doi.org/10.5194/essd-16-443-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Brenowitz and Bretherton(2019)</label><mixed-citation>
      
Brenowitz, N. D. and Bretherton, C. S.: Spatially extended tests of a neural network parametrization trained by coarse-graining,
J. Adv. Model. Earth Sy., 11, 2728–2744, <a href="https://doi.org/10.1029/2019MS001711" target="_blank">https://doi.org/10.1029/2019MS001711</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Buchard et al.(2017)Buchard, Randles, Da Silva, Darmenov, Colarco, Govindaraju, Ferrare, Hair, Beyersdorf, Ziemba et al.</label><mixed-citation>
      
Buchard, V., Randles, C., Da Silva, A., Darmenov, A., Colarco, P., Govindaraju, R., Ferrare, R., Hair, J.,
Beyersdorf, A., Ziemba, L., and Yu, H.: The MERRA-2 aerosol reanalysis, 1980 onward. Part II: Evaluation and case studies,
J. Climate, 30, 6851–6872, <a href="https://doi.org/10.1175/JCLI-D-16-0613.1" target="_blank">https://doi.org/10.1175/JCLI-D-16-0613.1</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Buda et al.(2018)Buda, Maki, and Mazurowski</label><mixed-citation>
      
Buda, M., Maki, A., and Mazurowski, M. A.: A systematic study of the class imbalance problem in convolutional neural networks,
Neural Networks, 106, 249–259, <a href="https://doi.org/10.1016/j.neunet.2018.07.011" target="_blank">https://doi.org/10.1016/j.neunet.2018.07.011</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>CALIPSO(2023)</label><mixed-citation>
      
CALIPSO: Cloud–Aerosol Lidar and Infrared Pathfinder Satellite Observation Lidar Level 2 Aerosol Profile V4-20, NASA [data set],
<a href="https://doi.org/10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20" target="_blank">https://doi.org/10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Červenková and Váňa(2010)</label><mixed-citation>
      Červenková, J. and Váňa, M.: Trend Assessment of deposition, throughfall and runoff water chemistry at the ICP-IM station Kosetice,
Czech Republic, IAHS-AISH Publication, 336, 103–108, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Charron et al.(2007)Charron, Birmili, and Harrison</label><mixed-citation>
      
Charron, A., Birmili, W., and Harrison, R. M.: Factors influencing new particle formation at the rural site, Harwell, United Kingdom,
J. Geophys. Res.-Atmos., 112, <a href="https://doi.org/10.1029/2007JD008425" target="_blank">https://doi.org/10.1029/2007JD008425</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Chin et al.(2000)Chin, Rood, Lin, Müller, and Thompson</label><mixed-citation>
      
Chin, M., Rood, R. B., Lin, S.-J., Müller, J.-F., and Thompson, A. M.:
Atmospheric sulfur cycle simulated in the global model GOCART: Model description and global properties,
J. Geophys. Res.-Atmos., 105, 24671–24687, <a href="https://doi.org/10.1029/2000JD900384" target="_blank">https://doi.org/10.1029/2000JD900384</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Chollet(2015)</label><mixed-citation>
      
Chollet, F.: Keras, GitHub [code], <a href="https://github.com/fchollet/keras" target="_blank">https://github.com/fchollet/keras</a> (last access: 23 March 2026), 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Choudhury and Tesche(2022)</label><mixed-citation>
      
Choudhury, G. and Tesche, M.: Estimating cloud condensation nuclei concentrations from CALIPSO lidar measurements, Atmos. Meas. Tech., 15, 639–654, <a href="https://doi.org/10.5194/amt-15-639-2022" target="_blank">https://doi.org/10.5194/amt-15-639-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Christensen et al.(2020)Christensen, Jones, and Stier</label><mixed-citation>
      
Christensen, M. W., Jones, W. K., and Stier, P.: Aerosols enhance cloud lifetime and brightness along the stratus-to-cumulus transition,
P. Natl. Acad. Sci. USA, 117, 17591–17598, <a href="https://doi.org/10.1073/pnas.1921231117" target="_blank">https://doi.org/10.1073/pnas.1921231117</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Chu et al.(2002)Chu, Kaufman, Ichoku, Remer, Tanré, and Holben</label><mixed-citation>
      
Chu, D., Kaufman, Y., Ichoku, C., Remer, L., Tanré, D., and Holben, B.:
Validation of MODIS aerosol optical depth retrieval over land, Geophys. Res. Lett., 29,
<a href="https://doi.org/10.1029/2001GL013205" target="_blank">https://doi.org/10.1029/2001GL013205</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Colarco et al.(2010a)Colarco, da Silva, Chin, and Diehl</label><mixed-citation>
      
Colarco, P., da Silva, A., Chin, M., and Diehl, T.: Online simulations of global aerosol distributions in the NASA GEOS-4 model and comparisons to satellite and ground-based aerosol optical depth,
J. Geophys. Res., 115, D14207, <a href="https://doi.org/10.1029/2009JD012820" target="_blank">https://doi.org/10.1029/2009JD012820</a>, 2010a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Colarco et al.(2010b)Colarco, da Silva, Chin, and Diehl</label><mixed-citation>
      
Colarco, P., da Silva, A., Chin, M., and Diehl, T.: Online simulations of global aerosol distributions in the NASA GEOS-4 model and comparisons to satellite and ground-based aerosol optical depth,
J. Geophys. Res.-Atmos., 115, <a href="https://doi.org/10.1029/2009JD012820" target="_blank">https://doi.org/10.1029/2009JD012820</a>, 2010b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Engler et al.(2007)Engler, Rose, Wehner, Wiedensohler, Brüggemann, Gnauk, Spindler, Tuch, and Birmili</label><mixed-citation>
      
Engler, C., Rose, D., Wehner, B., Wiedensohler, A., Brüggemann, E., Gnauk, T., Spindler, G., Tuch, T., and Birmili, W.: Size distributions of non-volatile particle residuals (<i>D</i><sub>p</sub> &lt; 800&thinsp;nm) at a rural site in Germany and relation to air mass origin, Atmos. Chem. Phys., 7, 5785–5802, <a href="https://doi.org/10.5194/acp-7-5785-2007" target="_blank">https://doi.org/10.5194/acp-7-5785-2007</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Forster et al.(2007)Forster, Ramaswamy, Artaxo, Berntsen, Betts, Fahey, Haywood, Lean, Lowe, Myhre et al.</label><mixed-citation>
      
Forster, P., Ramaswamy, V., Artaxo, P., Berntsen, T., Betts, R., Fahey, D. W., Haywood, J., Lean, J., Lowe, D. C., Myhre, G., Nganga, J., Prinn, R., Raga, G., Schulz, M., and Van Dorland, R.: Changes in Atmospheric Constituents and in Radiative Forcing, in: Climate Change 2007: The Physical Science Basis. Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Solomon, S., Qin, D., Manning, M., Chen, Z., Marquis, M., Averyt, K. B., Tignor, M., and Miller, H. L., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, ISBN 9780521880091, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Fountoukis and Nenes(2005)</label><mixed-citation>
      
Fountoukis, C. and Nenes, A.: Continued development of a cloud droplet formation parameterization for global climate models,
J. Geophys. Res.-Atmos., 110, <a href="https://doi.org/10.1029/2004JD005591" target="_blank">https://doi.org/10.1029/2004JD005591</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Gelaro et al.(2017)Gelaro, McCarty, Suárez, Todling, Molod, Takacs, Randles, Darmenov, Bosilovich, Reichle, Wargan, Coy, Cullather, Draper, Akella, Buchard, Conaty, da Silva, Gu, Kim, Koster, Lucchesi, Merkova, Nielsen, Partyka, Pawson, Putman, Rienecker, Schubert, Sienkiewicz, and Zhao</label><mixed-citation>
      
Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.:
The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2), J. Climate, 30, 5419–5454, <a href="https://doi.org/10.1175/JCLI-D-16-0758.1" target="_blank">https://doi.org/10.1175/JCLI-D-16-0758.1</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Ginoux et al.(2001)Ginoux, Chin, Tegen, Prospero, Holben, Dubovik, and Lin</label><mixed-citation>
      
Ginoux, P., Chin, M., Tegen, I., Prospero, J. M., Holben, B., Dubovik, O., and Lin, S.-J.:
Sources and distributions of dust aerosols simulated with the GOCART model, J. Geophys. Res.-Atmos.,
106, 20255–20273, <a href="https://doi.org/10.1029/2000JD000053" target="_blank">https://doi.org/10.1029/2000JD000053</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>GMAO(2015)</label><mixed-citation>
      
GMAO: MERRA-2 inst3_3d_asm_Nv: 3d,3-Hourly,Instantaneous,Model-Level,Assimilation,Assimilated Meteorological Fields V5.12.4, Goddard Earth Sciences Data and Information Services Center (GES DISC) [data set],
<a href="https://doi.org/10.5067/WWQSXQ8IVFW8" target="_blank">https://doi.org/10.5067/WWQSXQ8IVFW8</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>GMAO(2025)</label><mixed-citation>
      
GMAO: GiOcean Coupled Reanalysis, GMAO [data set], <a href="https://portal.nccs.nasa.gov/datashare/gmao/GiOCEAN/" target="_blank">https://portal.nccs.nasa.gov/datashare/gmao/GiOCEAN/</a> (last access: 23 March 2026), 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Gong et al.(2022)Gong, Wex, Müller, Henning, Voigtländer, Wiedensohler, and Stratmann</label><mixed-citation>
      
Gong, X., Wex, H., Müller, T., Henning, S., Voigtländer, J., Wiedensohler, A., and Stratmann, F.: Understanding aerosol microphysical properties from 10 years of data collected at Cabo Verde based on an unsupervised machine learning classification, Atmos. Chem. Phys., 22, 5175–5194, <a href="https://doi.org/10.5194/acp-22-5175-2022" target="_blank">https://doi.org/10.5194/acp-22-5175-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Gruening et al.(2009)Gruening, Adam, Cavalli, Cavalli, Dell’Acqua, Martins Dos Santos, Pagliari, Roux, and Putaud</label><mixed-citation>
      
Gruening, C., Adam, M., Cavalli, F., Cavalli, P., Dell’Acqua, A., Martins Dos Santos, S., Pagliari, V., Roux, D., and Putaud, J.:
JRC Ispra EMEP–GAW Regional Station for Atmospheric Research, Tech. Rep. JRC55382, European Commission,
<a href="https://publications.jrc.ec.europa.eu/repository/handle/JRC55382" target="_blank">https://publications.jrc.ec.europa.eu/repository/handle/JRC55382</a> (last access: 23 March 2026), 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Gueymard and Yang(2020)</label><mixed-citation>
      
Gueymard, C. A. and Yang, D.: Worldwide validation of CAMS and MERRA-2 reanalysis aerosol optical depth products using 15 years of AERONET observations,
Atmos. Environ., 225, 117216, <a href="https://doi.org/10.1016/j.atmosenv.2019.117216" target="_blank">https://doi.org/10.1016/j.atmosenv.2019.117216</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Harder et al.(2022)Harder, Watson-Parris, Stier, Strassel, Gauger, and Keuper</label><mixed-citation>
      
Harder, P., Watson-Parris, D., Stier, P., Strassel, D., Gauger, N. R., and Keuper, J.:
Physics-Informed Learning of Aerosol Microphysics, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2207.11786" target="_blank">https://doi.org/10.48550/arXiv.2207.11786</a>, 24 July 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Hari et al.(2013)Hari, Nikinmaa, Pohja, Siivola, Bäck, Vesala, and Kulmala</label><mixed-citation>
      
Hari, P., Nikinmaa, E., Pohja, T., Siivola, E., Bäck, J., Vesala, T., and Kulmala, M.:
Station for measuring ecosystem-atmosphere relations: SMEAR, in: Physical and physiological forest ecology, Springer Nature, 471–487,
<a href="https://doi.org/10.1007/978-94-007-5603-8_9" target="_blank">https://doi.org/10.1007/978-94-007-5603-8_9</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Inness et al.(2019)Inness, Ades, Agustí-Panareda, Barré, Benedictow, Blechschmidt, Dominguez, Engelen, Eskes, Flemming et al.</label><mixed-citation>
      
Inness, A., Ades, M., Agustí-Panareda, A., Barré, J., Benedictow, A., Blechschmidt, A.-M., Dominguez, J. J., Engelen, R., Eskes, H., Flemming, J., Huijnen, V., Jones, L., Kipling, Z., Massart, S., Parrington, M., Peuch, V.-H., Razinger, M., Remy, S., Schulz, M., and Suttie, M.: The CAMS reanalysis of atmospheric composition, Atmos. Chem. Phys., 19, 3515–3556, <a href="https://doi.org/10.5194/acp-19-3515-2019" target="_blank">https://doi.org/10.5194/acp-19-3515-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Japkowicz and Stephen(2002)</label><mixed-citation>
      
Japkowicz, N. and Stephen, S.: The class imbalance problem: A systematic study, Intell. Data Anal., 6, 429–449,
<a href="https://doi.org/10.3233/IDA-2002-6504" target="_blank">https://doi.org/10.3233/IDA-2002-6504</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Jeggle et al.(2023)Jeggle, Neubauer, Camps-Valls, and Lohmann</label><mixed-citation>
      
Jeggle, K., Neubauer, D., Camps-Valls, G., and Lohmann, U.:
Understanding cirrus clouds using explainable machine learning, Environmental Data Science, 2, e19,
<a href="https://doi.org/10.1017/eds.2023.14" target="_blank">https://doi.org/10.1017/eds.2023.14</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Jennings et al.(1991)Jennings, O'Dowd, O'Connor, and McGovern</label><mixed-citation>
      
Jennings, S., O'Dowd, C., O'Connor, T., and McGovern, F.: Physical characteristics of the ambient aerosol at Mace Head,
Atmos. Environ. A Gen., 25, 557–562, <a href="https://doi.org/10.1016/0960-1686(91)90052-9" target="_blank">https://doi.org/10.1016/0960-1686(91)90052-9</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Jia et al.(2024)Jia, Andersen, and Cermak</label><mixed-citation>
      
Jia, Y., Andersen, H., and Cermak, J.: Analysis of the cloud fraction adjustment to aerosols and its dependence on meteorological controls using explainable machine learning, Atmos. Chem. Phys., 24, 13025–13045, <a href="https://doi.org/10.5194/acp-24-13025-2024" target="_blank">https://doi.org/10.5194/acp-24-13025-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Jones et al.(1994)Jones, Roberts, and Slingo</label><mixed-citation>
      
Jones, A., Roberts, D., and Slingo, A.: A climate model study of indirect radiative forcing by anthropogenic sulphate aerosols,
Nature, 370, 450–453, <a href="https://doi.org/10.1038/370450a0" target="_blank">https://doi.org/10.1038/370450a0</a>, 1994.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Jurányi et al.(2011)Jurányi, Gysel, Weingartner, Bukowiecki, Kammermann, and Baltensperger</label><mixed-citation>
      
Jurányi, Z., Gysel, M., Weingartner, E., Bukowiecki, N., Kammermann, L., and Baltensperger, U.:
A 17 month climatology of the cloud condensation nuclei number concentration at the high alpine site Jungfraujoch,
J. Geophys. Res.-Atmos., 116, <a href="https://doi.org/10.1029/2010JD015199" target="_blank">https://doi.org/10.1029/2010JD015199</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Kingma and Ba(2014)</label><mixed-citation>
      
Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1412.6980" target="_blank">https://doi.org/10.48550/arXiv.1412.6980</a>, 22 December 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Kirpes et al.(2018)Kirpes, Bondy, Bonanno, Moffet, Wang, Laskin, Ault, and Pratt</label><mixed-citation>
      
Kirpes, R. M., Bondy, A. L., Bonanno, D., Moffet, R. C., Wang, B., Laskin, A., Ault, A. P., and Pratt, K. A.: Secondary sulfate is internally mixed with sea spray aerosol and organic aerosol in the winter Arctic, Atmos. Chem. Phys., 18, 3937–3949, <a href="https://doi.org/10.5194/acp-18-3937-2018" target="_blank">https://doi.org/10.5194/acp-18-3937-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Kiss et al.(2002)Kiss, Varga, Galambos, and Ganszky</label><mixed-citation>
      
Kiss, G., Varga, B., Galambos, I., and Ganszky, I.:
Characterization of water-soluble organic matter isolated from atmospheric fine aerosol,
J. Geophys. Res.-Atmos., 107, <a href="https://doi.org/10.1029/2001JD000603" target="_blank">https://doi.org/10.1029/2001JD000603</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Kreidenweis et al.(2005)Kreidenweis, Koehler, DeMott, Prenni, Carrico, and Ervens</label><mixed-citation>
      
Kreidenweis, S. M., Koehler, K., DeMott, P. J., Prenni, A. J., Carrico, C., and Ervens, B.: Water activity and activation diameters from hygroscopicity data - Part I: Theory and application to inorganic salts, Atmos. Chem. Phys., 5, 1357–1370, <a href="https://doi.org/10.5194/acp-5-1357-2005" target="_blank">https://doi.org/10.5194/acp-5-1357-2005</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Kristensson et al.(2008)Kristensson, Dal Maso, Swietlicki, Hussein, Zhou, Kerminen, and Kulmala</label><mixed-citation>
      
Kristensson, A., Dal Maso, M., Swietlicki, E., Hussein, T., Zhou, J., Kerminen, V.-M., and Kulmala, M.:
Characterization of new particle formation events at a background site in Southern Sweden: relation to air mass history,
Tellus B, 60, 330–344, <a href="https://doi.org/10.1111/j.1600-0889.2008.00345.x" target="_blank">https://doi.org/10.1111/j.1600-0889.2008.00345.x</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Kwon et al.(2023)Kwon, An, Song, and Sung</label><mixed-citation>
      
Kwon, Y., An, S. A., Song, H.-J., and Sung, K.:
Particulate Matter Prediction and Shapley Value Interpretation in Korea through a Deep Learning Model,
SOLA, 19, 225–231, <a href="https://doi.org/10.2151/sola.2023-029" target="_blank">https://doi.org/10.2151/sola.2023-029</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Langner and Rodhe(1991)</label><mixed-citation>
      
Langner, J. and Rodhe, H.: A global three-dimensional model of the tropospheric sulfur cycle,
J. Atmos. Chem., 13, 225–263, <a href="https://doi.org/10.1007/BF00058134" target="_blank">https://doi.org/10.1007/BF00058134</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Lee et al.(2013)Lee, Pringle, Reddington, Mann, Stier, Spracklen, Pierce, and Carslaw</label><mixed-citation>
      
Lee, L. A., Pringle, K. J., Reddington, C. L., Mann, G. W., Stier, P., Spracklen, D. V., Pierce, J. R., and Carslaw, K. S.: The magnitude and causes of uncertainty in global model simulations of cloud condensation nuclei, Atmos. Chem. Phys., 13, 8879–8914, <a href="https://doi.org/10.5194/acp-13-8879-2013" target="_blank">https://doi.org/10.5194/acp-13-8879-2013</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Lihavainen et al.(2008)Lihavainen, Kerminen, Komppula, Hyvärinen, Laakia, Saarikoski, Makkonen, Kivekäs, Hillamo, Kulmala et al.</label><mixed-citation>
      
Lihavainen, H., Kerminen, V.-M., Komppula, M., Hyvärinen, A.-P., Laakia, J., Saarikoski, S., Makkonen, U., Kivekäs, N., Hillamo, R., Kulmala, M., and Viisanen, Y.: Measurements of the relation between aerosol properties and microphysics and chemistry of low level liquid water clouds in Northern Finland, Atmos. Chem. Phys., 8, 6925–6938, <a href="https://doi.org/10.5194/acp-8-6925-2008" target="_blank">https://doi.org/10.5194/acp-8-6925-2008</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Liu et al.(2012)Liu, Easter, Ghan, Zaveri, Rasch, Shi, Lamarque, Gettelman, Morrison, Vitt et al.</label><mixed-citation>
      
Liu, X., Easter, R. C., Ghan, S. J., Zaveri, R., Rasch, P., Shi, X., Lamarque, J.-F., Gettelman, A., Morrison, H., Vitt, F., Conley, A., Park, S., Neale, R., Hannay, C., Ekman, A. M. L., Hess, P., Mahowald, N., Collins, W., Iacono, M. J., Bretherton, C. S., Flanner, M. G., and Mitchell, D.: Toward a minimal representation of aerosols in climate models: description and evaluation in the Community Atmosphere Model CAM5, Geosci. Model Dev., 5, 709–739, <a href="https://doi.org/10.5194/gmd-5-709-2012" target="_blank">https://doi.org/10.5194/gmd-5-709-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Lundberg and Lee(2017)</label><mixed-citation>
      
Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model predictions,
Advances in Neural Information Processing Systems, 30, <a href="https://doi.org/10.48550/arXiv.1705.07874" target="_blank">https://doi.org/10.48550/arXiv.1705.07874</a>, 22 May 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Lundberg et al.(2020)Lundberg, Erion, Chen, DeGrave, Prutkin, Nair, Katz, Himmelfarb, Bansal, and Lee</label><mixed-citation>
      
Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I.:
From local explanations to global understanding with explainable AI for trees, Nature Machine Intelligence, 2, 56–67,
<a href="https://doi.org/10.1038/s42256-019-0138-9" target="_blank">https://doi.org/10.1038/s42256-019-0138-9</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Ma and Stinis(2020)</label><mixed-citation>
      
Ma, P. L. and Stinis, P.: Developing a simulator-based satellite dataset for using machine learning techniques to derive aerosol-cloud-precipitation interactions in models and observations in a consistent framework,
Tech. rep., Pacific Northwest National Laboratory (PNNL), Richland, WA (United States),
<a href="https://doi.org/10.2172/1984697" target="_blank">https://doi.org/10.2172/1984697</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Mann et al.(2010)Mann, Carslaw, Spracklen, Ridley, Manktelow, Chipperfield, Pickering, and Johnson</label><mixed-citation>
      
Mann, G. W., Carslaw, K. S., Spracklen, D. V., Ridley, D. A., Manktelow, P. T., Chipperfield, M. P., Pickering, S. J., and Johnson, C. E.: Description and evaluation of GLOMAP-mode: a modal global aerosol microphysics model for the UKCA composition-climate model, Geosci. Model Dev., 3, 519–551, <a href="https://doi.org/10.5194/gmd-3-519-2010" target="_blank">https://doi.org/10.5194/gmd-3-519-2010</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Marinoni et al.(2008)Marinoni, Cristofanelli, Calzolari, Roccato, Bonafè, and Bonasoni</label><mixed-citation>
      
Marinoni, A., Cristofanelli, P., Calzolari, F., Roccato, F., Bonafè, U., and Bonasoni, P.:
Continuous measurements of aerosol physical parameters at the Mt. Cimone GAW Station (2165&thinsp;m asl, Italy),
Sci. Total Environ., 391, 241–251, <a href="https://doi.org/10.1016/j.scitotenv.2007.10.004" target="_blank">https://doi.org/10.1016/j.scitotenv.2007.10.004</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>McCoy et al.(2017)McCoy, Bender, Mohrmann, Hartmann, Wood, and Grosvenor</label><mixed-citation>
      
McCoy, D., Bender, F.-M., Mohrmann, J., Hartmann, D., Wood, R., and Grosvenor, D.:
The global aerosol-cloud first indirect effect estimated using MODIS, MERRA, and AeroCom,
J. Geophys. Res.-Atmos., 122, 1779–1796, <a href="https://doi.org/10.1002/2016JD026141" target="_blank">https://doi.org/10.1002/2016JD026141</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Mihalopoulos et al.(1997)Mihalopoulos, Stephanou, Kanakidou, Pilitsidis, and Bousquet</label><mixed-citation>
      
Mihalopoulos, N., Stephanou, E., Kanakidou, M., Pilitsidis, S., and Bousquet, P.:
Tropospheric aerosol ionic composition in the Eastern Mediterranean region, Tellus B, 49, 314–326,
<a href="https://doi.org/10.3402/tellusb.v49i3.15970" target="_blank">https://doi.org/10.3402/tellusb.v49i3.15970</a>, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Molod et al.(2015)Molod, Takacs, Suarez, and Bacmeister</label><mixed-citation>
      
Molod, A., Takacs, L., Suarez, M., and Bacmeister, J.: Development of the GEOS-5 atmospheric general circulation model: evolution from MERRA to MERRA2, Geosci. Model Dev., 8, 1339–1356, <a href="https://doi.org/10.5194/gmd-8-1339-2015" target="_blank">https://doi.org/10.5194/gmd-8-1339-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Molod et al.(2020)Molod, Hackert, Vikhliaev, Zhao, Barahona, Vernieres, Borovikov, Kovach, Marshak, Schubert et al.</label><mixed-citation>
      
Molod, A., Hackert, E., Vikhliaev, Y., Zhao, B., Barahona, D., Vernieres, G., Borovikov, A., Kovach, R. M., Marshak, J., Schubert, S., Li, Z., Lim, Y.-K., Andrews, L. C., Cullather, R., Koster, R., Achuthavarier, D., Carton, J., Coy, L., Friere, J. L. M., Longo, K. M., Nakada, K., and Pawson, S.:
GEOS-S2S version 2: The GMAO high-resolution coupled model and assimilation system for seasonal prediction,
J. Geophys. Res.-Atmos., 125, e2019JD031767, <a href="https://doi.org/10.1029/2019JD031767" target="_blank">https://doi.org/10.1029/2019JD031767</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Nair et al.(2021)Nair, Yu, Campuzano-Jost, DeMott, Levin, Jimenez, Peischl, Pollack, Fredrickson, Beyersdorf, Nault, Park, Yum, Palm, Xu, Bourgeois, Anderson, Nenes, Ziemba, Moore, Lee, Park, Thompson, Flocke, Huey, Kim, and Peng</label><mixed-citation>
      
Nair, A. A., Yu, F., Campuzano-Jost, P., DeMott, P. J., Levin, E. J. T., Jimenez, J. L., Peischl, J., Pollack, I. B., Fredrickson, C. D.,
Beyersdorf, A. J., Nault, B. A., Park, M., Yum, S. S., Palm, B. B., Xu, L., Bourgeois, I., Anderson, B. E., Nenes, A., Ziemba, L. D.,
Moore, R. H., Lee, T., Park, T., Thompson, C. R., Flocke, F., Huey, L. G., Kim, M. J., and Peng, Q.:
Machine Learning Uncovers Aerosol Size Information From Chemistry and Meteorology to Quantify Potential Cloud-Forming Particles,
Geophys. Res. Lett., 48, e2021GL094133, <a href="https://doi.org/10.1029/2021GL094133" target="_blank">https://doi.org/10.1029/2021GL094133</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>NCAR(2019)</label><mixed-citation>
      
NCAR: NCAR Command Language (Version 6.6.2), UCAR/NCAR/CISL/TDD [software], <a href="https://doi.org/10.5065/D6WD3XH5" target="_blank">https://doi.org/10.5065/D6WD3XH5</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Nojarov et al.(2009)Nojarov, Ivanov, Kalapov, Penev, and Drenska</label><mixed-citation>
      
Nojarov, P., Ivanov, P., Kalapov, I., Penev, I., and Drenska, M.:
Connection between ozone concentration and atmosphere circulation at peak Moussala,
Theor. Appl. Climatol., 98, 201–208, <a href="https://doi.org/10.1007/s00704-009-0173-2" target="_blank">https://doi.org/10.1007/s00704-009-0173-2</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>O'Malley et al.(2019)O'Malley, Bursztein, Long, Chollet, Jin, Invernizzi et al.</label><mixed-citation>
      
O'Malley, T., Bursztein, E., Long, J., Chollet, F., Jin, H., and Invernizzi, L.: Keras Tuner, <a href="https://github.com/keras-team/keras-tuner" target="_blank">https://github.com/keras-team/keras-tuner</a> (last access: 23 March 2026), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Ott et al.(2020)Ott, Pritchard, Best, Linstead, Curcic, and Baldi</label><mixed-citation>
      
Ott, J., Pritchard, M., Best, N., Linstead, E., Curcic, M., and Baldi, P.:
A Fortran-Keras deep learning bridge for scientific computing, Scientific Programming, 2020, <a href="https://doi.org/10.1155/2020/8888811" target="_blank">https://doi.org/10.1155/2020/8888811</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Philippin et al.(2009)Philippin, Laj, Putaud, Wiedensohler, Leeuw, Fjaeraa, Platt, Baltensperger, and Fiebig</label><mixed-citation>
      
Philippin, S., Laj, P., Putaud, J.-P., Wiedensohler, A., Leeuw, G. D., Fjaeraa, A. M., Platt, U., Baltensperger, U., and Fiebig, M.:
EUSAAR-An unprecedented network of aerosol observation in Europe, Journal of Aerosol Research (Earozoru Kenkyu), 24, 78–83, <a href="https://doi.org/10.11203/jar.24.78" target="_blank">https://doi.org/10.11203/jar.24.78</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Pierce et al.(2015)Pierce, Croft, Kodros, D'Andrea, and Martin</label><mixed-citation>
      
Pierce, J. R., Croft, B., Kodros, J. K., D'Andrea, S. D., and Martin, R. V.: The importance of interstitial particle scavenging by cloud droplets in shaping the remote aerosol size distribution and global aerosol-climate effects, Atmos. Chem. Phys., 15, 6147–6158, <a href="https://doi.org/10.5194/acp-15-6147-2015" target="_blank">https://doi.org/10.5194/acp-15-6147-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>Randles et al.(2017)Randles, da Silva, Buchard, Colarco, Darmenov, Govindaraju, Smirnov, Holben, Ferrare, Hair, Shinozuka, and Flynn</label><mixed-citation>
      
Randles, C. A., da Silva, A. M., Buchard, V., Colarco, P. R., Darmenov, A., Govindaraju, R., Smirnov, A., Holben, B., Ferrare, R., Hair, J., Shinozuka, Y., and Flynn, C. J.:
The MERRA-2 Aerosol Reanalysis, 1980 Onward. Part I: System Description and Data Assimilation Evaluation,
J. Climate, 30, 6823–6850, <a href="https://doi.org/10.1175/JCLI-D-16-0609.1" target="_blank">https://doi.org/10.1175/JCLI-D-16-0609.1</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>Rasp et al.(2018)Rasp, Pritchard, and Gentine</label><mixed-citation>
      
Rasp, S., Pritchard, M. S., and Gentine, P.:
Deep learning to represent subgrid processes in climate models, P. Natl. Acad. Sci. USA, 115, 9684–9689,
<a href="https://doi.org/10.1073/pnas.1810286115" target="_blank">https://doi.org/10.1073/pnas.1810286115</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Reddington et al.(2017)Reddington, Carslaw, Stier, Schutgens, Coe, Liu, Allan, Pringle, Lee, Yoshioka et al.</label><mixed-citation>
      
Reddington, C., Carslaw, K., Stier, P., Schutgens, N., Coe, H., Liu, D., Allan, J., Pringle, K., Lee, L., Yoshioka, M., Johnson, J. S., Regayre, L. A., Spracklen, D. V., Mann, G. W., Clarke, A., Hermann, M., Henning, S., Wex, H., Kristensen, T. B., Leaitch, W. R., Pöschl, U., Rose, D., Andreae, M. O.,
Schmale, J., Kondo, Y., Oshima, N., Schwarz, J. P., Nenes, A., Anderson, B.,
Roberts, G. C., Snider, J. R., Leck, C., Quinn, P. K., Chi, X., Ding, A., Jimenez, J. L., and Zhang, Q.:
The Global Aerosol Synthesis and Science Project (GASSP): measurements and modeling to reduce uncertainty,
B. Am. Meteorol. Soc., 98, 1857–1877, <a href="https://doi.org/10.1175/BAMS-D-15-00317.1" target="_blank">https://doi.org/10.1175/BAMS-D-15-00317.1</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>Remer et al.(2005)Remer, Kaufman, Tanré, Mattoo, Chu, Martins, Li, Ichoku, Levy, Kleidman et al.</label><mixed-citation>
      
Remer, L. A., Kaufman, Y., Tanré, D., Mattoo, S., Chu, D., Martins, J. V., Li, R.-R., Ichoku, C., Levy, R., Kleidman, R., Eck, T. F., Vermote, E., and Holben, B. N.:
The MODIS aerosol algorithm, products, and validation, J. Atmos. Sci., 62, 947–973,
<a href="https://doi.org/10.1175/JAS3385.1" target="_blank">https://doi.org/10.1175/JAS3385.1</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>Reynolds et al.(2002)Reynolds, Rayner, Smith, Stokes, and Wang</label><mixed-citation>
      
Reynolds, R. W., Rayner, N. A., Smith, T. M., Stokes, D. C., and Wang, W.:
An improved in situ and satellite SST analysis for climate, J. Climate, 15, 1609–1625,
<a href="https://doi.org/10.1175/1520-0442(2002)015&lt;1609:AIISAS&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0442(2002)015&lt;1609:AIISAS&gt;2.0.CO;2</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>Riemer et al.(2019)Riemer, Ault, West, Craig, and Curtis</label><mixed-citation>
      
Riemer, N., Ault, A., West, M., Craig, R., and Curtis, J.:
Aerosol mixing state: Measurements, modeling, and impacts, Rev. Geophys., 57, 187–249,
<a href="https://doi.org/10.1029/2018RG000615" target="_blank">https://doi.org/10.1029/2018RG000615</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>Rienecker et al.(2008)Rienecker, Suarez, Todling, Bacmeister, Takacs, Liu, Gu, Sienkiewicz, Koster, Gelaro, Stajner, and Nielsen</label><mixed-citation>
      
Rienecker, M., Suarez, M., Todling, R., Bacmeister, J., Takacs, L., Liu, H.-C., Gu, W., Sienkiewicz, M., Koster, R., Gelaro, R., Stajner, I., and Nielsen, J.:
The GEOS-5 Data Assimilation System – Documentation of Versions 5.0.1, 5.1.0, and 5.2.0,
vol. 27 of Technical Report Series on Global Modeling and Data Assimilation, NASA Goddard Space Flight Center, Greenbelt, MD, USA, <a href="https://ntrs.nasa.gov/citations/20120011955" target="_blank">https://ntrs.nasa.gov/citations/20120011955</a> (last access: 23 March 2026), 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>Russchenberg et al.(2005)Russchenberg, Bosveld, Swart, ten Brink, de Leeuw, Uijlenhoet, Arbesser-Rastburg, van der Marel, Ligthart, Boers et al.</label><mixed-citation>
      
Russchenberg, H., Bosveld, F., Swart, D., ten Brink, H., de Leeuw, G., Uijlenhoet, R., Arbesser-Rastburg, B., van der Marel, H., Ligthart, L., Boers, R., and Apituley, A.:
Ground-based atmospheric remote sensing in the Netherlands: European outlook, IEICE T. Commun., 88, 2252–2258,
<a href="https://doi.org/10.1093/ietcom/e88-b.6.2252" target="_blank">https://doi.org/10.1093/ietcom/e88-b.6.2252</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>Seinfeld and Pandis(2016)</label><mixed-citation>
      
Seinfeld, J. H. and Pandis, S. N.: Atmospheric chemistry and physics: from air pollution to climate change, John Wiley &amp; Sons, ISBN 0471720186, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib85"><label>Seinfeld et al.(2016)Seinfeld, Bretherton, Carslaw, Coe, DeMott, Dunlea, Feingold, Ghan, Guenther, Kahn et al.</label><mixed-citation>
      
Seinfeld, J. H., Bretherton, C., Carslaw, K. S., Coe, H., DeMott, P. J., Dunlea, E. J., Feingold, G., Ghan, S., Guenther, A. B., Kahn, R., Kraucunas, I., Kreidenweis, S. M., Molina, M. J., Nenes, A., Penner, J. E., Prather, K. A., Ramanathan, V., Ramaswamy, V., Rasch, P. J., Ravishankara, A. R., Rosenfeld, D., Stephens, G., and Wood, R.:
Improving our fundamental understanding of the role of aerosol-cloud interactions in the climate system,
P. Natl. Acad. Sci. USA, 113, 5781–5790, <a href="https://doi.org/10.1073/pnas.1514043113" target="_blank">https://doi.org/10.1073/pnas.1514043113</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib86"><label>Silva et al.(2021)Silva, Ma, Hardin, and Rothenberg</label><mixed-citation>
      
Silva, S. J., Ma, P.-L., Hardin, J. C., and Rothenberg, D.: Physically regularized machine learning emulators of aerosol activation, Geosci. Model Dev., 14, 3067–3077, <a href="https://doi.org/10.5194/gmd-14-3067-2021" target="_blank">https://doi.org/10.5194/gmd-14-3067-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib87"><label>Song et al.(2025)Song, McCoy, Molod, Aerenson, and Barahona</label><mixed-citation>
      
Song, C., McCoy, D., Molod, A., Aerenson, T., and Barahona, D.: Signatures of aerosol-cloud interactions in GiOcean: a coupled global reanalysis with two-moment cloud microphysics, Atmos. Chem. Phys., 25, 15567–15592, <a href="https://doi.org/10.5194/acp-25-15567-2025" target="_blank">https://doi.org/10.5194/acp-25-15567-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib88"><label>Stier et al.(2005)Stier, Feichter, Kinne, Kloster, Vignati, Wilson, Ganzeveld, Tegen, Werner, Balkanski et al.</label><mixed-citation>
      
Stier, P., Feichter, J., Kinne, S., Kloster, S., Vignati, E., Wilson, J., Ganzeveld, L., Tegen, I., Werner, M., Balkanski, Y., Schulz, M., Boucher, O., Minikin, A., and Petzold, A.: The aerosol-climate model ECHAM5-HAM, Atmos. Chem. Phys., 5, 1125–1156, <a href="https://doi.org/10.5194/acp-5-1125-2005" target="_blank">https://doi.org/10.5194/acp-5-1125-2005</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib89"><label>Stier et al.(2024)Stier, van den Heever, Christensen, Gryspeerdt, Dagan, Saleeby, Bollasina, Donner, Emanuel, Ekman et al.</label><mixed-citation>
      
Stier, P., van den Heever, S. C., Christensen, M. W., Gryspeerdt, E., Dagan, G., Saleeby, S. M., Bollasina, M., Donner, L., Emanuel, K., Ekman, A. M., Feingold, G., Field, P., Forster, P., Haywood, J., Kahn, R., Koren, I., Kummerow, C., L’Ecuyer, T., Lohmann, U., Ming, Y., Myhre, G., Quaas, J., Rosenfeld, D., Samset, B., Seifert, A., Stephens, G., and Tao, W. K.: Multifaceted aerosol effects on precipitation, Nat. Geosci., 17, 719–732, <a href="https://doi.org/10.1038/s41561-024-01482-6" target="_blank">https://doi.org/10.1038/s41561-024-01482-6</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib90"><label>Ström et al.(2003)Ström, Umegård, Tørseth, Tunved, Hansson, Holmén, Wismann, Herber, and König-Langlo</label><mixed-citation>
      
Ström, J., Umegård, J., Tørseth, K., Tunved, P., Hansson, H.-C., Holmén, K., Wismann, V., Herber, A., and König-Langlo, G.:
One year of particle size distribution and aerosol chemical composition measurements at the Zeppelin Station, Svalbard, March 2000–March 2001,
Phys. Chem. Earth Pt. A/B/C, 28, 1181–1190, <a href="https://doi.org/10.1016/j.pce.2003.08.058" target="_blank">https://doi.org/10.1016/j.pce.2003.08.058</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib91"><label>Su et al.(2023)Su, Huang, Wang, Cao, and Feng</label><mixed-citation>
      
Su, X., Huang, Y., Wang, L., Cao, M., and Feng, L.:
Validation and diurnal variation evaluation of MERRA-2 multiple aerosol properties on a global scale,
Atmos. Environ., 311, 120019, <a href="https://doi.org/10.1016/j.atmosenv.2023.120019" target="_blank">https://doi.org/10.1016/j.atmosenv.2023.120019</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib92"><label>Sun et al.(2019)Sun, Che, Xu, Wang, Lu, Gui, Zhao, Zheng, Wang, Wang et al.</label><mixed-citation>
      
Sun, E., Che, H., Xu, X., Wang, Z., Lu, C., Gui, K., Zhao, H., Zheng, Y., Wang, Y., Wang, H., Sun, T., Liang, Y., Li, X., Sheng, Z., An, L., Zhang, X., and Shi, G.:
Variation in MERRA-2 aerosol optical depth over the Yangtze River Delta from 1980 to 2016,
Theor. Appl. Climatol., 136, 363–375, <a href="https://doi.org/10.1007/s00704-018-2490-9" target="_blank">https://doi.org/10.1007/s00704-018-2490-9</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib93"><label>Takacs et al.(2018)Takacs, Suárez, and Todling</label><mixed-citation>
      
Takacs, L. L., Suárez, M. J., and Todling, R.:
The stability of incremental analysis update, Mon. Weather Rev., 146, 3259–3275, <a href="https://doi.org/10.1175/MWR-D-18-0117.1" target="_blank">https://doi.org/10.1175/MWR-D-18-0117.1</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib94"><label>Tunved et al.(2004)Tunved, Ström, and Hansson</label><mixed-citation>
      
Tunved, P., Ström, J., and Hansson, H.-C.: An investigation of processes controlling the evolution of the boundary layer aerosol size distribution properties at the Swedish background station Aspvreten, Atmos. Chem. Phys., 4, 2581–2592, <a href="https://doi.org/10.5194/acp-4-2581-2004" target="_blank">https://doi.org/10.5194/acp-4-2581-2004</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib95"><label>Ukhov et al.(2020)Ukhov, Mostamandi, da Silva, Flemming, Alshehri, Shevchenko, and Stenchikov</label><mixed-citation>
      
Ukhov, A., Mostamandi, S., da Silva, A., Flemming, J., Alshehri, Y., Shevchenko, I., and Stenchikov, G.: Assessment of natural and anthropogenic aerosol air pollution in the Middle East using MERRA-2, CAMS data assimilation products, and high-resolution WRF-Chem model simulations, Atmos. Chem. Phys., 20, 9281–9310, <a href="https://doi.org/10.5194/acp-20-9281-2020" target="_blank">https://doi.org/10.5194/acp-20-9281-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib96"><label>Ulevicius et al.(2010)Ulevicius, Byčenkienė, Remeikis, Garbaras, Kecorius, Andriejauskienė, Jasinevičienė, and Mocnik</label><mixed-citation>
      
Ulevicius, V., Byčenkienė, S., Remeikis, V., Garbaras, A., Kecorius, S., Andriejauskienė, J., Jasinevičienė, D., and Mocnik, G.:
Characterization of pollution events in the East Baltic region affected by regional biomass fire emissions,
Atmos. Res., 98, 190–200, <a href="https://doi.org/10.1016/j.atmosres.2010.03.021" target="_blank">https://doi.org/10.1016/j.atmosres.2010.03.021</a>, 2010.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib97"><label>Uno et al.(2009)Uno, Eguchi, Yumimoto, Takemura, Shimizu, Uematsu, Liu, Wang, Hara, and Sugimoto</label><mixed-citation>
      
Uno, I., Eguchi, K., Yumimoto, K., Takemura, T., Shimizu, A., Uematsu, M., Liu, Z., Wang, Z., Hara, Y., and Sugimoto, N.:
Asian dust transported one full circuit around the globe, Nat. Geosci., 2, 557–560, <a href="https://doi.org/10.1038/ngeo583" target="_blank">https://doi.org/10.1038/ngeo583</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib98"><label>Venzac et al.(2009)Venzac, Sellegri, Villani, Picard, and Laj</label><mixed-citation>
      
Venzac, H., Sellegri, K., Villani, P., Picard, D., and Laj, P.: Seasonal variation of aerosol size distributions in the free troposphere and residual layer at the puy de Dôme station, France, Atmos. Chem. Phys., 9, 1465–1478, <a href="https://doi.org/10.5194/acp-9-1465-2009" target="_blank">https://doi.org/10.5194/acp-9-1465-2009</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib99"><label>Virtanen et al.(2025)Virtanen, Joutsensaari, Kokkola, Partridge, Blichner, Seland, Holopainen, Tovazzi, Lipponen, Mikkonen et al.</label><mixed-citation>
      
Virtanen, A., Joutsensaari, J., Kokkola, H., Partridge, D. G., Blichner, S., Seland, Ø., Holopainen, E., Tovazzi, E., Lipponen, A., Mikkonen, S., Leskinen, A., Hyvärinen, A.-P., Zieger, P., Krejci, R., Ekman, A. M. L., Riipinen, I., Quaas, J., and Romakkaniemi, S.:
High sensitivity of cloud formation to aerosol changes, Nat. Geosci., <a href="https://doi.org/10.1038/s41561-025-01662-y" target="_blank">https://doi.org/10.1038/s41561-025-01662-y</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib100"><label>Watson-Parris et al.(2019)Watson-Parris, Schutgens, Reddington, Pringle, Liu, Allan, Coe, Carslaw, and Stier</label><mixed-citation>
      
Watson-Parris, D., Schutgens, N., Reddington, C., Pringle, K. J., Liu, D., Allan, J. D., Coe, H., Carslaw, K. S., and Stier, P.: In situ constraints on the vertical distribution of global aerosol, Atmos. Chem. Phys., 19, 11765–11790, <a href="https://doi.org/10.5194/acp-19-11765-2019" target="_blank">https://doi.org/10.5194/acp-19-11765-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib101"><label>Whitby and McMurry(1997)</label><mixed-citation>
      
Whitby, E. R. and McMurry, P. H.: Modal aerosol dynamics modeling, Aerosol Sci. Tech., 27, 673–688,
<a href="https://doi.org/10.1080/02786829708965504" target="_blank">https://doi.org/10.1080/02786829708965504</a>, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib102"><label>Wilson et al.(2001)Wilson, Cuvelier, and Raes</label><mixed-citation>
      
Wilson, J., Cuvelier, C., and Raes, F.: A modeling study of global mixed aerosol fields, J. Geophys. Res.-Atmos., 106, 34081–34108,
<a href="https://doi.org/10.1029/2000JD000198" target="_blank">https://doi.org/10.1029/2000JD000198</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib103"><label>Winter(2002)</label><mixed-citation>
      
Winter, E.: The Shapley value, Handbook of Game Theory with Economic Applications, 3, 2025–2054,
<a href="https://doi.org/10.1016/S1574-0005(02)03016-3" target="_blank">https://doi.org/10.1016/S1574-0005(02)03016-3</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib104"><label>Yu et al.(2024)Yu, Ma, Singh, Silva, and Pritchard</label><mixed-citation>
      
Yu, S., Ma, P.-L., Singh, B., Silva, S., and Pritchard, M.:
Two-step hyperparameter optimization method: Accelerating hyperparameter search by using a fraction of a training dataset,
Artificial Intelligence for the Earth Systems, 3, e230013, <a href="https://doi.org/10.1175/AIES-D-23-0013.1" target="_blank">https://doi.org/10.1175/AIES-D-23-0013.1</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib105"><label>Zhang et al.(2020)Zhang, Sharma, Dhawan, Dhanraj, Li, and Biswas</label><mixed-citation>
      
Zhang, H., Sharma, G., Dhawan, S., Dhanraj, D., Li, Z., and Biswas, P.:
Comparison of discrete, discrete-sectional, modal and moment models for aerosol dynamics simulations,
Aerosol Sci. Tech., 54, 739–760, <a href="https://doi.org/10.1080/02786826.2020.1723787" target="_blank">https://doi.org/10.1080/02786826.2020.1723787</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib106"><label>Zhou et al.(2018)Zhou, Shen, Liu, Zhang, and Xin</label><mixed-citation>
      
Zhou, C., Shen, X., Liu, Z., Zhang, Y., and Xin, J.:
Simulating aerosol size distribution and mass concentration with simultaneous nucleation, condensation/coagulation, and deposition with the GRAPES–CUACE,
Journal of Meteorological Research, 32, 265–278, <a href="https://doi.org/10.1007/s13351-018-7116-8" target="_blank">https://doi.org/10.1007/s13351-018-7116-8</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib107"><label>Zhu et al.(2023)Zhu, Martin, Croft, Zhai, Li, Bindle, Pierce, Chang, Anderson, Ziemba, Hair, Ferrare, Hostetler, Singh, Chatterjee, Jimenez, Campuzano-Jost, Nault, Dibb, Schwarz, and Weinheimer</label><mixed-citation>
      
Zhu, H., Martin, R. V., Croft, B., Zhai, S., Li, C., Bindle, L., Pierce, J. R., Chang, R. Y.-W., Anderson, B. E., Ziemba, L. D., Hair, J. W., Ferrare, R. A., Hostetler, C. A., Singh, I., Chatterjee, D., Jimenez, J. L., Campuzano-Jost, P., Nault, B. A., Dibb, J. E., Schwarz, J. S., and Weinheimer, A.: Parameterization of size of organic and secondary inorganic aerosol for efficient representation of global aerosol optical properties, Atmos. Chem. Phys., 23, 5023–5042, <a href="https://doi.org/10.5194/acp-23-5023-2023" target="_blank">https://doi.org/10.5194/acp-23-5023-2023</a>, 2023.

    </mixed-citation></ref-html>--></article>
