Deep learning representation of the aerosol size distribution

Barahona, Donifan; Breen, Katherine H.; Block, Karoline; Darmenov, Anton

doi:10.5194/gmd-19-2437-2026

Articles | Volume 19, issue 6

https://doi.org/10.5194/gmd-19-2437-2026

Articles | Volume 19, issue 6

Model description paper

26 Mar 2026

Model description paper |

| 26 Mar 2026

Deep learning representation of the aerosol size distribution

Donifan Barahona, Katherine H. Breen, Karoline Block, and Anton Darmenov

Abstract

Aerosols influence Earth's radiative balance via the scattering and absorbing of solar radiation, affect cloud formation, and play important roles on precipitation, ocean seeding and human health. Accurate modeling of these effects requires knowledge of the chemical composition and size distribution of aerosol particles present in the atmosphere. Computationally intensive applications like remote sensing and weather forecasting commonly use simplified representations of aerosol microphysics, prescribing the aerosol size distribution (ASD), introducing uncertainty in climate predictions and aerosol retrievals. In this work, we develop a neural network model, MAMnet, to predict the ASD and mixing state for seven lognormal modes based on the bulk aerosol mass and the meteorological state. MAMnet is designed to operate with outputs from single-moment, mass-based aerosol schemes, making it compatible with existing models. We demonstrate that MAMnet can accurately reproduce the output of a two-moment modal aerosol scheme, and also agrees well with field measurements when driven by reanalysis data. Our model paves the way to improve the representation of aerosols in atmospheric models while maintaining the versatility and efficiency required in large scale applications.

Download & links

Article (PDF, 2280 KB)

Download & links

How to cite.

Received: 31 Jan 2025 – Discussion started: 17 Mar 2025 – Revised: 11 Feb 2026 – Accepted: 20 Feb 2026 – Published: 26 Mar 2026

1 Introduction

Aerosols play a crucial role in the Earth's system by influencing radiative forcing (Forster et al., 2007; Bender, 2020), cloud formation and lifetime (Christensen et al., 2020), and precipitation patterns (Stier et al., 2024). Aerosol particle size and composition determine their atmospheric lifetime (Seinfeld and Pandis, 2016), impact on human health (Arfin et al., 2023), long range transport (Uno et al., 2009), and their ability to become cloud droplets and ice crystals (Seinfeld et al., 2016). The size and composition of atmospheric aerosols are critical parameters determining the concentration of cloud condensation nuclei (CCN) in the atmosphere (Seinfeld and Pandis, 2016). Understanding the distribution and composition of atmospheric aerosols is thus essential for accurate climate and weather simulations (Seinfeld et al., 2016).

The ASD and mixing state are at the center of the ability of climate models to accurately simulate the transport and chemical evolution of aerosol species (Aquila et al., 2011; Bender et al., 2019). Variability in the representation of the ASD among models has been shown to drive large differences in cloud droplet number concentration and aerosol–cloud radiative forcing (Virtanen et al., 2025). Explicitly resolving the ASD improves the representation of nucleation, condensation, and coagulation processes (Zhou et al., 2018), and it is critical for realistically simulating scavenging within clouds, as smaller particles are less efficiently removed than larger ones, affecting global particle number concentrations by up to 20 % (Pierce et al., 2015). It has been shown that models that resolve particle-level mixing state and size better represent CCN activity, aerosol aging, and radiative properties (Riemer et al., 2019).

Atmospheric models represent the ASD using approximations with different degrees of sophistication. The bulk mass approach predicts the transport and evolution of aerosols by tracking the mass concentration of individual chemical species (Jones et al., 1994; Langner and Rodhe, 1991; Ginoux et al., 2001; Chin et al., 2000). Each particle is assumed to consist of a single chemical component or their surrogate (Riemer et al., 2019). Because each species is typically represented by a single prognostic variable, the bulk approach is not designed to resolve the ASD or the mixing state, which are often prescribed from climatological data. However, due to their low computational cost, bulk schemes are well suited for data assimilation (Randles et al., 2017) and are widely used in forecasting systems, satellite retrieval algorithms, and reanalysis products (Chu et al., 2002; Gelaro et al., 2017; Inness et al., 2019). For example, aerosol transport in the MERRA-2 climate reanalysis (Gelaro et al., 2017; Randles et al., 2017) is based on the Goddard Chemistry, Aerosol, Radiation, and Transport model (GOCART) (Colarco et al., 2010 b). GOCART is a bulk aerosol scheme that explicitly calculates the mass of major species, i.e., dust, black carbon, organic material, sea salt, and sulfate, using an externally mixed representation.

In contrast to bulk methods, modal aerosol schemes estimate both the number concentration and mass of atmospheric aerosol, approximating the ASD as the combination of overlapping populations, termed “modes”, each typically assumed internally-mixed, and following a log-normal distribution with prescribed geometric standard deviation (e.g., Whitby and McMurry, 1997; Wilson et al., 2001; Stier et al., 2005; Mann et al., 2010; Liu et al., 2012). Because they predict the number concentration and mass independently, modal schemes can better resolve the composition of aerosol species, particularly when several subpopulations are used (Riemer et al., 2019). More sophisticated aerosol schemes either compute additional moments of the ASD (Zhang et al., 2020), explicitly resolve it using a binned approach (e.g., Adams and Seinfeld, 2002), or represent it on a particle-by-particle basis (Riemer et al., 2019). While these models provide the most physically consistent representation of the ASD, they are often too computationally expensive for operational forecasting and long-term climate simulations.

In computationally intensive applications and satellite retrievals it is desirable to maintain the efficiency and simplicity of the bulk schemes. However, key processes such as nucleation, coagulation, scavenging, and activation, as well as aerosol radiative properties, are highly sensitive to particle size, requiring the explicit representation of the ASD (Seinfeld and Pandis, 2016). A common approach to address this is to prescribe a global mean ASD (e.g., Remer et al., 2005; Barahona et al., 2014; Inness et al., 2019; Block et al., 2024). Yet, aerosol size and composition vary substantially across time and space, influenced by both meteorological conditions and natural and anthropogenic sources. As a result, a fixed global ASD can only approximate the actual, locally varying distribution, potentially introducing biases in the simulation of aerosol–radiation and aerosol–cloud interactions.

To address these challenges, there is growing interest in leveraging machine learning (ML) techniques to develop more efficient and accurate aerosol models (e.g., Rasp et al., 2018; Gong et al., 2022; Silva et al., 2021; Harder et al., 2022). ML models, can in principle capture complex nonlinear relationships between aerosol properties and environmental variables with reduced computational costs. For example, Harder et al. (2022) developed a surrogate of the Modal Aerosol Module (MAM7; Liu et al., 2012) to predict the mass and number tendencies of aerosol species, with the aim to improve computational performance. The emulator replaces computationally intensive parts of MAM7, however does not map the ASD to the mass of the aerosol species, a requirement to many assimilation and remote sensing algorithms (Randles et al., 2017; Buchard et al., 2017). Other work also have sought to use ML to predict the ASD from bulk aerosol mass, focusing on specific species or its effect on cloud formation (Nair et al., 2021; Zhu et al., 2023).

Here we present a novel ML-based approach for predicting the ASD and aerosol mixing state in atmospheric models that run bulk aerosol schemes. This is accomplished by developing a ML-based parameterization that emulates the ASD predicted by the MAM7 model, using as input the total mass of aerosol species from relatively fast single-moment bulk aerosol models like GOCART. By combining the strengths of machine learning and physical principles, the parameterization maps the bulk mass model into the ASD that would be predicted by the modal approach, enhancing the former. Our method offers a promising avenue for advancing aerosol representation in climate predictions, data assimilation and remote sensing applications.

2 Methods and Data

We developed a neural network (NN) termed “MAMnet”, to estimate the ASD using as input the total mass of aerosol species, air density and temperature. This minimal set of inputs ensures that the neural network remains independent of the host model, since including additional meteorological inputs like wind speed and humidity, would introduce sensitivity to model-specific parameterizations. It also makes MAMnet suitable for applications involving satellite aerosol retrievals, where only a limited set of atmospheric variables is typically available. This approach is supported by previous studies showing that the conversion between aerosol mass and number concentrations can be reasonably approximated using spatially varying, but prescribed, ASDs (e.g., Remer et al., 2005; Inness et al., 2019; Block et al., 2024), suggesting that such relationships can be effectively learned by a neural network. MAMnet was trained on simulated data using the MAM7 model implemented on the NASA's Global Earth Observing System (GEOS). This section details the modeling components as well as the development and evaluation approach of the NN.

2.1 Modeling components

The NASA Goddard Earth Observing System (GEOS), consists of a set of components that numerically represent different aspects of the Earth system (atmosphere, ocean, land, sea-ice, and chemistry), coupled following the Earth System Modeling Framework (https://gmao.gsfc.nasa.gov/GEOS_systems/, last access: 23 March 2026). In GEOS-AGCM configuration, atmospheric transport of water vapor, condensate and other tracers, and associated land-atmosphere exchanges, is computed explicitly, whereas sea-ice and sea surface temperature (SST) are prescribed as time-dependent boundary conditions (Reynolds et al., 2002; Rienecker et al., 2008). Cloud microphysics in the operational version of GEOS uses a single moment microphysics scheme for short-term weather forecast (Molod et al., 2015), and a two-moment cloud scheme in subseasonal and seasonal prediction (Barahona et al., 2014; Molod et al., 2020). GEOS constitutes the modeling base of MERRA-2 (Modern Era Retrospective analysis for Research and Applications, version 2), the first multidecadal reanalysis to integrate both aerosol and meteorological observations (Gelaro et al., 2017; Randles et al., 2017). In MERRA-2 aerosol fields are described using GOCART. Aerosols are interactive and radiatively active, hence MERRA-2 has a representation of the aerosol direct effect. Aerosol assimilation uses the Goddard Aerosol Assimilation System (GAAS), and the overall assimilation cycle is controlled by the meteorology.

2.1.1 Aerosol transport schemes

GEOS implements two aerosol schemes to interactively calculate the evolution of aerosol and gaseous tracers. Both include parameterized representation of aerosol formation, growth, aging and wet removal, and differ in their treatment of the ASD and mixing state. GOCART (Chin et al., 2000; Colarco et al., 2010 a) is used operationally on weather forecast and data assimilation applications. GOCART is a mass-based aerosol model that explicitly calculates the transport and evolution of dust, black carbon, organic material, sea salt, and sulfate. Aerosol species are assumed externally mixed. Dust and sea salt are represented in five mass bins of different sizes whereas a single bin is assumed for other species. The ASD for each bin is prescribed as a lognormal distribution. Both organics (primary and secondary organic matter) and black carbon are split into hydrophilic and hydrophobic components. Dust and sea salt emissions are prognostic whereas sulfate and biomass burning emissions are obtained from the MERRA-2 dataset (Colarco et al., 2010 a; Randles et al., 2017).

GEOS also implements the MAM7 model (Liu et al., 2012), as an alternative aerosol scheme for research applications. MAM7 is a modal aerosol scheme that predicts the mass and number concentration of Aitken (AIT), accumulation (ACC), primary carbon (PCM), fine dust (FDU) and sea salt (FSS), and coarse dust (CDU) and sea salt (CSS) aerosol modes. The aerosol representation is internally mixed with aerosol species and modal composition as detailed in Tables 1 and 2. The total number of simulated tracers in MAM7 is 31: 24 modal mass components and seven aerosol number concentrations. The size distributions for each mode is assumed to follow a lognormal distribution, with geometric mean diameter computed diagnostically and prescribed geometric standard deviation for each mode (Liu et al., 2012).

Table 1Aerosol species considered in this work; κ is the hygroscopicity parameter (Kreidenweis et al., 2005).

Download Print Version | Download XLSX

Table 2Aerosol modes predicted by MAM7 and MAMnet; σ_g is the geometric standard deviation.

Download Print Version | Download XLSX

2.2 Development of the deep learning model

We built a neural network, termed “MAMnet”, to estimate the aerosol number concentration and composition emulating the output of the MAM7 model (Table 2), using as input the total mass mixing ratios for dust, sulfates, organics, black carbon and sea salt, and the atmospheric state (temperature, T and air density, ρ_air), for a total of 31 predicted tracers as shown in Fig. 1. MAMnet is intended to map the simulated aerosol mass across species into a 7-modal ASD, rather than to fully emulate MAM7. This is arguably a simpler task than emulating the full range of aerosol processes represented in MAM7, since the aerosol mass fields used as input already encapsulate the integrated effects of meteorology, clouds, as well as trends in aerosol emissions. The mass-number relationship on the other hand is not expected to depend strongly on such factors, since it many cases it can be approximated to some degree using prescribed formulations for the ASD (McCoy et al., 2017). This section describes the development of the NN.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f01

Figure 1Neural network development workflow. Blue arrows represent the training steps, while red arrows correspond to the inference process. During training, the “mapping” step aggregates aerosol species across modes from the MAM7 output to construct the input X_MAM. The MAM7 output is then used to calculate the MAMnet loss (calculated as the minimum mean square difference between the model prediction and the MAM7 fields). During inference, input from MERRA-2 is used to predict the aerosol size distribution and mixing state. MAMnet consists of a single input layer (black), seven hidden layers (orange), and one output layer (green). T and AIRD represent the temperature and air density, respectively. N_s is the number of samples.

Download

2.2.1 Data generation

The AGCM configuration of GEOS, running MAM7 (referred to as “GEOS+MAM7”), was used to develop a robust dataset to train the neural network. We ran a 5-year simulation (2001–2006) at 1° horizontal resolution and 72 vertical levels (from the surface to 0.01 hPa), with diurnal, instantaneous outputs at UTC 09:00:00 and 21:00:00. Temperature and horizontal winds were “replayed” to MERRA-2. The replay technique is a form of nudging that combines analysis increments with the model results every six hours, at each model grid point, to correct the model state (Takacs et al., 2018). This ensures that the simulation reproduces the observed meteorological state. Aerosol mass however evolves freely from observational constraints.

To construct the training dataset, we randomly selected 25 unique output files (without replacement) from the years 2001–2005. An additional 10 unique files were used as validation data to compute the loss during training. Each file represents global instantaneous output from the GEOS+MAM7 model. Although the validation data are not used to update the network parameters, the validation loss informs optimization choices (e.g., early stopping and hyperparameter selection) and is therefore considered part of the training process. For testing, we used 5 separate output files from the year 2006, which was not included in either the training or validation stages, ensuring a fully independent evaluation set.

Each grid cell in the GEOS+MAM7 output is treated as an independent training example, resulting in a large volume of data: with N_time=25 timestamps, N_lev=72 vertical levels, N_lat=181 latitudes, and N_lon=360 longitudes, the training set contains over 100 million samples. The samples are randomly shuffled in time and space prior to training. This single-cell approach makes the parameterization resolution-independent, facilitating integration into atmospheric models with varying grid resolutions. It also ensures broad coverage of physically plausible combinations of aerosol mass and number. Although this approach omits spatial or vertical correlations, the mass-number relationship depends primarily on the relative abundance of species within a given grid cell. This was tested by using using a full-column input structure, which resulted in no significant gain in accuracy (not shown).

To balance data volume and temporal representativeness, we sampled at 12 h intervals, which allows the network to capture differences between day and night while maximizing the number of training samples. Higher-frequency sampling could better resolve the diurnal cycle, but this comes at the cost of fewer training time steps due to memory limitations. This is not expected to be critical as the relationship between aerosol mass and number is expected to exhibit weaker diurnal variability than mass itself, which would be already resolved by the host model.

We combined the internally-mixed, modal mass components parameterized by MAM7 across 5 different species including sulfate, ammonium, sea salt, dust, primary and secondary organic matter, and black carbon (Table 1 and Fig. 1) to derive the total mass mixing ratios for the input features. Neither MERRA-2 nor the current implementation of MAM7 in GEOS include nitrate aerosol species. The mass input variables were first log₁₀-transformed, and the resulting values were then standardized by computing Z-scores using the global mean and standard deviation across all levels. Temperature and air density were also standardized using their global mean and standard deviation. Statistics used for normalization were calculated using 100 random instantaneous output files not used during training. Target variables included the mass of each of the MAM7 species and the number concentration for each mode. Because aerosol mass and number concentration vary over several orders of magnitude, we log₁₀-transformed the targets, and filtered out values less than minimum threshold values, 10⁻²⁰ µg kg ⁻¹ and 10⁻⁴ mg⁻¹ for mass and number, respectively, prior to training. Values below these thresholds are held constant and therefore do not contribute to the gradient. During testing, points below the thresholds are masked. All metrics are computed in logarithmic space.

The modal aerosol dry diameter (hereafter D_pg) was not directly included as a target of MAMnet, and it is not part of the loss function. Instead it was used to check for mass conservation, that is, matching the predicted D_pg against the target values indicates that the mass and number concentration remain consistent in the prediction. D_pg was derived for each ith mode in the form (Seinfeld and Pandis, 2016),

\begin{matrix} (1) & D_{pg, i} = {(\frac{6}{π N_{i}} \sum_{j = 1}^{N_{sp, i}} \frac{M_{j, i}}{ρ_{j, i}})}^{1 / 3} \exp (- \frac{3 \ln^{2} σ_{g, i}}{2}) \end{matrix}

where σ_g,i and N_i are the geometric standard deviation and number concentration for the ith mode, respectively. M_j,i and ρ_j,i are the mass and density of the jth species in the ith mode, respectively, and N_sp,i is the number of species present in the mode.

2.2.2 Model architecture

Various levels of complexity were tested for the MAMnet architecture, including Multilayer Perceptrons (MLPs) and Convolutional Neural Networks (CNNs) (Bengio et al., 2017). These architectures have demonstrated success in capturing multi-scale behaviors of GCMs for different physical properties (e.g., Brenowitz and Bretherton, 2019; Rasp et al., 2018; Barahona et al., 2024). MLPs extract global patterns from the entirety of the input feature vector simultaneously, resulting in a greater number of model parameters for optimization. This approach compels the NN to make localized decisions, considering what occurs at an individual model level within each grid cell and time step, utilizing global information encompassing all grid cells and time steps. In contrast, CNNs extract features from smaller spatiotemporal blocks, enabling local decisions to be influenced by nearby information where the receptive field of each sample is a hyperparameter. Testing of both architectures showed that the MLP configuration exhibited superior performance and was easier to optimize. The final architecture is shown in Fig. 1. Hyperparameters targeting generalization (dropout rate, activation function, batch size) were tuned such that the optimized model not only minimized error on the validation data, but minimized the difference between the training and validation losses as detailed in the Appendix.

2.3 Observational data

Besides synthetic data the neural network was evaluated on its ability to reproduce observations when driven by the MERRA-2 reanalysis output. This was important for testing the reliability of MAMnet when applied outside of the purely simulated environment. Near-surface aerosol number concentrations ranging from 30 to 500 nm, compiled by Asmi et al. (2011), were used for model evaluation. The dataset includes two years (2008–2009) of hourly measurements from 24 sites across Western Europe, as detailed in Table 3. These measurements were collected from two major monitoring networks: the European Supersites for Atmospheric Aerosol Research (EUSAAR) project, part of the Sixth Framework Programme of the European Commission (Philippin et al., 2009), and the German Ultrafine Aerosol Network (GUAN) (Birmili et al., 2009). The data are reported as cumulative number concentrations for four aerosol size ranges, N₃₀, N₅₀, N₁₀₀, and N₂₅₀, defined as,

\begin{matrix} (2) & N_{X} = \sum_{D_{p} = X}^{Y} N (D_{p}) \end{matrix}

where D_p is the aerosol dry diameter, and subscript X represents the aerosol number concentration for size range defined by threshold X, and Y=500 nm for $X = 50, 100$ and 250 nm and Y=50 nm for X=30 nm. Equivalently, these can be calculated from the predicted ASD in the form (Seinfeld and Pandis, 2016),

\begin{matrix} (3) & \begin{aligned} N_{X} & = \sum_{i = 1}^{N_{\mod}} \frac{N_{i}}{2} [\erf (\frac{\ln Y - \ln D_{pg, i}}{\sqrt{2} \ln σ_{g, i}}) \\ - \erf (\frac{\ln X - \ln D_{pg, i}}{\sqrt{2} \ln σ_{g, i}})] \end{aligned} \end{matrix}

where N_mod=7, is the number of lognormal modes. For each site, MERRA-2 derived aerosol mass concentration, temperature and air density are interpolated at the location and time of the measurements, then used in MAMnet to predict the ASD. Using Eqs. (1) and (3), N_X is predicted as the average of the two lowermost model levels (i.e., nearest the surface) and compared against the observations.

Tunved et al. (2004)Amunsen et al. (1992)Lihavainen et al. (2008)Ulevicius et al. (2010)Hari et al. (2013)Kristensson et al. (2008)Birmili et al. (2009)Kiss et al. (2002)Engler et al. (2007)C̆ervenkova and Vá n̆a (2010)Birmili et al. (2003)Birmili et al. (2009)Russchenberg et al. (2005)Charron et al. (2007)Jennings et al. (1991)Mihalopoulos et al. (1997)Gruening et al. (2009)Ström et al. (2003)Venzac et al. (2009)Birmili et al. (2009)Birmili et al. (2009)Jurányi et al. (2011)Nojarov et al. (2009)Marinoni et al. (2008)

Table 3Datasets for the period 2008–2009 used for comparison with surface aerosol size distributions predicted by MAMnet. The original data reference is given, although all data sets used in this work were curated by Asmi et al. (2011).

Download Print Version | Download XLSX

We also used estimates of the concentration of cloud condensation nuclei, N_CCN to place MAMnet in the context of observationally-constrained estimates. To calculate N_CCN from MAMnet we folllowed the method of Fountoukis and Nenes (2005) to estimate N_CCN from the derived 7-modal size distribution and modal composition and the MERRA-2 fields as inputs. Hygroscopicity parameters, (κ) for each mode were obtained by volume-weighting the values for each aerosol species as listed in Table 1. Because N_CCN is strongly influenced by the ASD and composition, it serves as a useful diagnostic for evaluating the estimation of particle size. CCN concentrations are highly sensitive to aerosol size (Lee et al., 2013), as larger and more hygroscopic particles are more likely to activate into cloud droplets. As a result, N_CCN tends to be enhanced in populations dominated by such particles. Underestimation of particle size therefore translates into a lower N_CCN.

Global N_CCN datasets are typically derived from bulk aerosol mass using simplified assumptions about the ASD. For example, Choudhury and Tesche (2022) estimated N_CCN from spaceborne CALIOP (Cloud-Aerosol Lidar with Orthogonal Polarization) lidar measurements using pre-computed conversion factors. Block et al. (2024) derived N_CCN based on the latest Copernicus Atmosphere Monitoring Service (CAMS) reanalysis provided by the European Centre for Medium-Range Weather Forecast (ECMWF) by introducing assumptions on ASD and composition (Block, 2023). Similarly, The GiOcean atmosphere–ocean–aerosol reanalysis (Song et al., 2025), derived from the NASA GEOS-S2S system (Molod et al., 2020), incorporates a more advanced model framework. Unlike MERRA-2, which only assimilates the atmospheric state, GiOcean is a coupled atmosphere–ocean reanalysis that includes two-moment cloud microphysics, enabling the explicit calculation of N_CCN. Although GiOcean still relies on assumptions about aerosol size and composition, its CCN fields directly influence cloud processes and, in turn, aerosol evolution. In addition to comparing with reanalysis-based products, we evaluated MAMnet’s CCN predictions against in situ measurements from the Global Aerosol Synthesis and Science Project (GASSP) (Watson-Parris et al., 2019; Reddington et al., 2017). The GASSP dataset compiles aerosol and CCN measurements from 37 field campaigns and over 1000 aircraft flights, primarily concentrated over North America and Western Europe.

3 Results and Discussion

We evaluated the MAMnet model for both its ability to reproduce the GEOS+MAM7 model when driven by the testing data set, and to reproduce observations when driven by aerosol concentrations derived from the MERRA-2 reanalysis. We assessed whether MAMnet reproduces the spatial distribution of aerosol variables in GEOS+MAM7 using the the mean Pearson's spatial correlation coefficient (R). We also calculated the mean log-bias, i.e.,

\begin{matrix} (4) & MLB = \frac{\sum_{n = 1}^{N_{samples}} \log_{10} (\hat{Y}) - \log_{10} (Y)}{N_{samples}}, \end{matrix}

where $\hat{Y}$ and Y correspond to the predicted variables by MAMnet and GEOS+MAM7, respectively. In general MLB in the range $[- 0.5, 0, 5]$ indicates a prediction within an order of magnitude window around the target value. These metrics are summarized in Fig. 3 for each pressure level and output variable.

3.1 Evaluation against GEOS+MAM7

We first tested whether MAMnet had learned the physical relationships underlying the ASD or simply memorized the training data, and whether the model is able to conserve mass. To investigate this, we run MAMnet using MERRA-2 inputs, then recover the total mass of each species by mapping from the seven modes produced by MAMnet, as depicted in Fig. 1. Since aerosol concentrations in GEOS+MAM7 are not assimilated, they are expected to differ from MERRA-2, which incorporates observational constraints. If MAMnet had merely memorized the GEOS+MAM7 outputs, these same biases would persist when MERRA-2 inputs were used. Such a discrepancy would also indicate that MAMnet does not conserve mass as the total mass of each species would differ from the inputs.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f02

Figure 2Comparison of column-integrated bulk aerosol. From top: sulfates plus ammonium (SA), sea salt (SS), dust (DU), black carbon (BC), and primary plus secondary organic matter (OG). Left panels correspond to MERRA-2, middle panels to the trained MAMnet model applied to MERRA-2 inputs, and right panels to the reserved GEOS+MAM7 test data.

Figure 2 compares the total aerosol mass column from MERRA-2 (left), MAMnet driven by MERRA-2 inputs (center), and GEOS+MAM (right). It is evident that GEOS+MAM7 tends to underestimate black carbon (BC) and sea salt (SS) over the ocean compared to MERRA2. If MAMnet had merely memorized the GEOS+MAM7 outputs, these same biases would persist when MERRA-2 inputs were used. Instead, MAMnet accurately reproduces the MERRA-2 aerosol concentrations when driven by MERRA-2 inputs, highlighting the internal consistency of the model as MAMnet is able to generalize to new, unseen data. This test also demonstrates that mass is conserved, as the total mass of each aerosol species is recovered showing very little bias.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f03

Figure 3Pearson's spatial correlation (R) (top) and mean log-bias (Eq. 4; bottom) predicted by MAMnet calculated on the reserved test set, against the GEOS+MAM7 simulation. Results are shown for mass (Tables 1 and 2) and number (NUM) concentration for each mode, as well as for the derived modal diameter (DPG). Label color on the horizontal axis is added for emphasis.

Download

Figure 3 summarizes the performance of MAMnet compared to GEOS+MAM7 across all output number and mass variables, as well modal size, for all model levels. MAMnet is able to reproduce the modal number concentrations (“NUM” variables in Fig. 3) from the GEOS+MAM7 simulations, with high spatial correlations R>0.9 and mean log-bias (MLB) within ±0.1 across most pressure levels. However, performance slightly degrades at pressures below p<100 hPa, particularly for the Aitken (NUM_AIT) and coarse dust modes (NUM_CDU), where correlations drop slightly (R>0.7) and MLB increases to ±0.3. The largest discrepancies occur near the surface (p>900 hPa) and in the upper troposphere (150–400 hPa). Specifically, NUM_AIT shows underprediction between 150–400 hPa while it overpredicts from 700–850 hPa, indicating that MAMnet tends to underestimate fine particles near the tropopause and overestimate them in the mid-to-lower troposphere. Similarly, NUM_PCM exhibits negative biases near 150–400 hPa, suggesting underprediction of particle number in the primary carbon mode at higher altitudes. The MLB patterns (Fig. 3, bottom panel) reveal localized biases at specific pressure ranges. Positive MLBs (orange shading) occur in NUM_PCM and NUM_ACC, indicating slight overestimation at mid-to-lower pressure levels. In contrast, NUM_CDU displays small negative MLB (blue shading) around 500–700 hPa, suggesting underprediction of coarse dust particles in the mid-troposphere. In summary, MAMnet captures overall modal number patterns well, but errors remain in the Aitken mode, primary carbon and coarse dust.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f04

Figure 4Zonal profiles for modal aerosol number concentration. Left: GEOS+MAM7 reserved test set. Middle: MAMnet prediction. Right: Mean Log-Bias. From top: Accumulation (ACC), Aitken (AIT), coarse dust (CDU), coarse sea salt (CSS), fine dust (FDU), fine sea salt (FSS), primary carbon matter (PCM).

Download

Figure 4 shows the zonal mean profiles of modal aerosol number concentration, comparing GEOS+MAM7 outputs (left column), MAMnet predictions (center column). Consistent with Fig. 3, the Aitken mode exhibits the largest biases, characterized by underestimation above 400 hPa and overestimation in the lower troposphere, mostly below 700 hPa. Unlike other aerosol modes, the vertical distribution of Aitken mode particles is unique, with higher concentrations found in the upper troposphere and lower stratosphere (p<400 hPa) compared to lower altitudes, where the other modes exhibit the highest concentrations near the surface. The underestimation in the upper troposphere and overestimation in the lower troposphere may be influenced by a “dilution” effect, as Aitken particles contribute relatively little mass compared to other modes. This may be exacerbated in regions with active particle nucleation processes and combustion emissions which tend to disproportionately enhance number over mass concentration.

The accumulation mode (ACC), coarse sea salt (CSS), and fine sea salt (FSS) modes show generally strong agreement between true and predicted values, with minimal biases across most pressure levels, with MLB values close to zero. However, localized biases are apparent for FSS and PCM near the surface, suggesting slight overestimation in these regions. The coarse dust mode (CDU) and fine dust mode (FDU) exhibit minimal errors overall, with MLB values near zero across most pressure levels. MAMnet accurately predicts the aerosol number concentration for most modes, with systematic biases for the Aitken and primary carbon modes, particularly near the tropopause and in the lower troposphere, suggest that smaller particles are more difficult to predict accurately due to their unique vertical distribution and sparse representation in the data.

MAMnet accuately reproduces the spatial patterns of the aerosol mass, with accumulation mode mass variables such as SU_ACC, SS_ACC, and SOA_ACC showing high correlations (R>0.9) across the entire pressure range (Fig. 3). This is also the case for most other variables with only DU_FDU and AMM_FSS showing slight reduction in correlation near 1000 hPa, indicating slightly worse performance in the lower atmosphere. Biases shown in Fig. 3 (bottom) indicate that all but six mass tracers (SOA_ACC, SU_AIT, SOA_AIT, SU_CSS, SS_A_CSS, AMM_CSS) are systematically overpredicted for p<200 hPa where mass values are very small ( $\sim 10^{- 20}$ kg kg⁻¹). These errors tend to be exacerbated in logarithmic space but remain negligible in absolute terms. Negative biases are also notable for SU_FDU and AMM_FDU, which become increasingly negative towards the surface. This is explained by the low mass of sulfate in the fine dust mode leading to a “dilution” of sulfate in fine dust mode relative to other aerosol modes. Overall, the model demonstrates robust predictive skill for most aerosol types, with minor discrepancies concentrated near the surface and in sparse aerosol regimes.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f05

Figure 5Modal geometric diameter, D_pg at 950 hPa. Left: GEOS+MAM7 reserved test set. Middle: MAMnet prediction. Right: Residual (log₁₀(MAMnet) − log₁₀(GEOS+MAM)). From top: Accumulation (ACC), Aitken (AIT), coarse dust (CDU), coarse sea salt (CSS), fine dust (FDU), fine sea salt (FSS), primary carbon matter (PCM).

Differences in the zonal distribution between aerosol modes can contribute to errors, especially due to the uneven representation of smaller particles, such as those in the Aitken and organic modes, in the training dataset, typically referred to as “class imbalance” (Japkowicz and Stephen, 2002; Buda et al., 2018). For instance, the mass of sulfate particles in the accumulation mode is often at least ten times greater than in the Aitken mode, whereas the opposite is true for the number concentration. As a result, the variability in sulfate mass is primarily driven by the accumulation mode, causing the Aitken mode to be underrepresented in the neural network's input data. Despite this, the residual differences between predicted and true values for Aitken mode aerosol number concentrations are small compared to other modes, and the mean global error remains well below an order of magnitude, highlighting the neural network's accuracy.

Figures 5 and 6 show the MAMnet-derived D_pg in agreement with GEOS+MAM7. It must be noted that D_pg is buffered against variation in mass and number concentration (Eq. 1). This buffering effect holds only if mass and number vary coherently. In practice, MAMnet predicts number concentration as a single output per mode, while mass is distributed across multiple species per mode. Unlike in the physical model, where mass and number are dynamically linked through the governing equations, MAMnet treats them as independent outputs. Therefore, accurate reproduction of D_pg by MAMnet is not guaranteed and must be learned implicitly. MLB for D_pg when tested against the test set is typically below 0.01, consistent with the high correlation coefficients for DPG (R>0.9), shown in Fig. 3. The fact that MAMnet is able to maintain low bias in D_pg without being explicitly constrained to do so suggests that the network has successfully learned a physically consistent relationship between mass and number.

The global distribution of D_pg for the different aerosol modes closely matches the spatial patterns of the modal number concentrations. Larger residuals are observed near the surface in the tropics and the Southern Hemisphere, particularly for fine dust (FDU). This discrepancy arises primarily due to the very low number concentrations of fine dust over the oceans, making it challenging for the neural network to accurately predict values close to zero. Residuals for D_pg in the coarse dust (CDU) mode are also slightly larger compared to other modes. This is evident in the zonal mean profiles (Fig. 6), where biases are most prominent in the tropics and near the Arctic. Additionally, MAMnet tends to underestimate D_pg in the Southern Hemisphere around 30° S, particularly in the free troposphere for fine dust. This underestimation likely results from class imbalance, as dust concentrations in this region are very low, making it difficult for the neural network to learn accurate predictions. It is also possible that MAMnet has learned associations biased toward aerosol-rich environments, which are more prevalent in the Northern Hemisphere.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f06

Figure 6As in Fig. 4, but for geometric mean diameter (D_pg) per mode.

Download

3.2 Explainable machine learning analysis

Shapley values (Winter, 2002), originally developed in cooperative game theory, are now widely used to interpret predictions from neural networks (Kwon et al., 2023; Jeggle et al., 2023; Jia et al., 2024; Ma and Stinis, 2020; Lundberg and Lee, 2017). A Shapley value quantifies the contribution of a single input feature to a specific model prediction by comparing the prediction for a given sample to the average prediction across all samples. This contribution is averaged over all possible combinations of the remaining input features, referred to as coalitions. Because the number of such combinations grows rapidly with the number of features, we approximate Shapley values using 1000 randomly selected coalitions per calculation, facilitated by the SHAP python library using the kernel explainer method (Lundberg et al., 2020). In this study, Shapley values are used to assess the influence of each input feature on the predicted aerosol number concentrations for each mode.

Figure 7 shows a summary plot of Shapley values calculated for each input feature relative to predicted targets for aerosol modal number concentration (left) and mass (right). Each row represents a specific feature. The x-axis represents SHAP values, indicating the impact (positive or negative) of each feature on the model's prediction, so that the features with larger SHAP values contribute more significantly to the model output. Red dots represent high feature values, while blue dots indicate low feature values.

Features such as sulfate (SU), sea salt (SS), dust (DU), temperature (T), and air density (AIRD) are consistently ranked as dominant contributors, with their relative importance varying across aerosol modes. Fine-mode outputs, such as ACC and AIT, are strongly influenced by sulfate and temperature, where higher feature values positively impact predictions. In contrast, coarse-mode outputs like CDU and FDU are heavily driven by dust, with significant positive contributions observed for high dust concentrations. Intuitively, this makes sense as SU, SS, and DU are the largest components of accumulation mode aerosols. However the relation is non-linear as high SU values correspond to a strong positive impact on aerosol number, particularly for dust (CDU) and sea salt (CSS) modes, whereas low values lead to neutral or negative contributions.The SHAP values further highlight the critical role of air density and temperature in for CSS and FSS, which may be related to the aerosol activation processes.

For aerosol mass, the SHAP plots show significant influence from DU, SS, BC, SU. The interplay between feature importance and values is evident as, high sea salt concentrations (SS) are positively correlated with increased mass in CSS, while low values lead to neutral or negative contributions. One significant characteristic of the mass SHAP plots is the broader range of SHAP values compared to number concentration, indicating greater variability in the importance of input features for predicting mass. For instance, black carbon (BC) has a consistently positive influence on ACC and AIT mass predictions, but its impact is less pronounced for other modes. Additionally, temperature at low values negatively impacts aerosol mass, while at higher values, it positively influences mass. In some cases however a SHAP value for a feature may have no obviously interpretable significance to the target prediction, or may be so strongly correlated with another feature that the individual contribution is negligible with respect to the feedbacks between a pair of features or more (Aas et al., 2021).

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f07

Figure 7SHAP analysis for aerosol number concentration (left) and total mass (right) across different modes in the troposphere, based on 1000 randomly selected samples from the test set. Modes are displayed from top to bottom: Accumulation (ACC), Aitken (AIT), coarse dust (CDU), coarse sea salt (CSS), fine dust (FDU), fine sea salt (FSS), and primary carbon matter (PCM). The color gradient (red for high values, blue for low values) indicates the relative value of each feature, with features ordered top-to-bottom by their importance to the prediction (most sensitive at the top). The x-axis represents SHAP values, quantifying how much each feature contributes to deviations from the mean prediction.

Download

3.3 Evaluation against observations

The ability of a neural network to generalize to new data is a key measure of its effectiveness and reliability in real-world applications. While the neural network may perform well reproducing simulated data, it is important to test whether MAMnet is able to reproduce patterns observed in nature. To accomplish this, we take advantage of the ability of MAMnet to work with reanalysis data, that is, using as input the assimilated fields of MERRA-2.

MERRA-2 includes aerosol mass fields that are constrained by satellite observations through data assimilation (Buchard et al., 2017; Sun et al., 2019; Ukhov et al., 2020; Gueymard and Yang, 2020; Su et al., 2023), and thus provides a more realistic input compared to free-running model simulations. Although GEOS+MAM7, which was used to train MAMnet, does not assimilate aerosols and cannot be directly compared to observations at specific sites, it provides physically consistent mass and number concentrations from which the network learns the relationship between these quantities. When applied to MERRA-2, MAMnet combines this learned relationship with more observation-constrained aerosol mass fields, allowing us to evaluate how well it maintains physical consistency in a more realistic setting. This comparison does not validate MAMnet independently of its training data but serves to assess its performance when driven by the best available mass estimates.

3.3.1 Comparison against ground observations

Figure 8 compares the cumulative ASD predicted by MAMnet against surface observations from different European sites. These are mostly coastal sites with composition typical of clean and polluted continental origin, that is, mostly of sulfates, dust, organics and sea salt (Asmi et al., 2011). Altitude ranges from a few meters to about 3 km providing a good overview of the lower troposphere. Although representing a limited set, the range of aerosol compositions, sources, and altitudes offers a meaningful assessment of the model's ability to generalize to different atmospheric states. To carry out the comparison, MAMnet was run using collocated aerosol concentrations and meteorological fields obtained from MERRA-2 at each site, and using Eq. (3). MAMnet results represent the average of the two lowermost model layers (roughly 200 m above the surface).

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f08

Figure 8Cumulative size distribution comparison of the trained MAMnet model applied to MERRA-2 inputs against surface measurements (Asmi et al., 2011). The sites in the bottom row (PDD, SCH, ZSF, JFJ, BEO, CMN) are characterized as high altitude sites, with altitudes between 1200 and 3600 m a.s.l.

Download

Except for high altitude sites (Fig. 8, bottom row), MAMnet tends to predict slightly lower median values compared to observations, with this discrepancy becoming more pronounced as particle size increases (N100 and N250). This is particularly noticeable at locations such as PAL, PLA, OBK, MHD, FKL, and JFJ, where the model underestimates values consistently. The pattern is reversed at high-altitude sites (PDD, SCH, ZSF, JFJ, BEO, and CMN), where median N100 and N250 are generally overestimated by MAMnet, although the observations themselves display significant variability. Some locations like SMR, WAL, CBW, and SCH exhibit better agreement, with overlapping medians and interquartile ranges. Additionally, the spread of values for MAMnet is typically narrower than for observations, indicating that the model underestimates variability. Observations also show more outliers, whereas MAMnet predictions are more constrained. There are a few exceptions where MAMnet slightly overestimates values, such as VHL (N250) and SMR (N100). Overall, systematic bias exists in MAMnet-predicted particle concentrations, particularly for larger size bins, capturing less variability compared to observations.

Errors in the estimated ASD may originate from the training data (GEOS+MAM7) or from the MERRA-2 fields used as input to MAMnet. To assess the potential impact of the training data, we collocated the average ASD at each site using the GEOS+MAM7 test dataset. This analysis is intended to evaluate whether GEOS+MAM7 exhibits, on average, the same biases shown in Fig. 8, rather than assessing point-by-point predictions as done for MAMnet. Figure 9 shows that GEOS+MAM7 and MAMnet indeed display similar biases relative to the observations, suggesting that errors in the training data are partially inherited by MAMnet. However, a comparison of Figs. 8 and 9 also indicates that the use of MERRA-2 inputs tends to reduce variability in the ASD. For each size bin, the interquantile range is substantially narrower in the MAMnet results. This effect is particularly pronounced at high-altitude sites (bottom row in both figures).

MERRA-2 may not resolve local emissions, terrain, or small-scale meteorology, such as boundary layer height and humidity. This can lead to biases, particularly in the larger particle size categories that depend on aerosol growth processes. Moreover, despite representing better ASD variability, the training data lacks sufficient diversity, particularly for remote or high-altitude sites, which are also underrepresented in the training set. Additionally, aerosol evolution involve complex, nonlinear interactions that are not explicitly modeled by MAMnet. These factors likely contribute to the model's challenges in capturing the magnitude and variability of aerosol concentrations observed in the real world. Additionally, it is important to note that retrievals of ASD are inherently complex, and experimental errors can be significant, particularly for larger particle sizes (Asmi et al., 2011). Nevertheless, the consistent results across many sites indicate that MAMnet is capable of reasonably capturing the ASD on regional scales when driven by reanalysis data.

3.3.2 Comparison against global CCN datasets

Figure 10 illustrates the global mean distribution of cloud condensation nuclei (CCN) at 0.2 % supersaturation at 900 hPa, derived from MAMnet driven by MERRA-2 (shown as MAMnet-MERRA2), GiOcean (Song et al., 2025), the CAMS aerosol reanalysis (Block et al., 2024), and CALIOP satellite retrievals (Choudhury and Tesche, 2022). Data were averaged over the period 2006–2021. All datasets display on average lower CCN concentrations over oceans, particularly in polar regions, and higher concentrations over land particularly in central and eastern Asia, Europe, and the Americas. However, large differences in absolute values are evident with MAMnet-MERRA2 consistently showing the lowest N_CCN, and CALIOP-derived the highest. These discrepancies likely arise from differing assumptions in estimation methods. GiOcean and CAMS estimate CCN based on aerosol mass, prescribing the ASD, and assuming externally-mixed aerosols, which may double-count CCN as organics and sulfates are typically internally mixed (Adachi and Buseck, 2008; Kirpes et al., 2018). MAMnet-MERRA2 avoids this issue but underpredicts N_CCN over oceanic regions, likely due to low sea salt concentrations in MERRA-2, stemming from uncertainties in the aerosol assimilation system (Buchard et al., 2017). In contrast, CALIOP may overestimate N_CCN from the assumption of CCN as all soluble aerosols above 50 nm, some of which may not activate as CCN at 0.2 % supersaturation.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f09

Figure 9Average cumulative size distribution comparison of the GEOS+MAM7 test data against surface measurements (Asmi et al., 2011). The sites in the bottom row (PDD, SCH, ZSF, JFJ, BEO, CMN) are characterized as high altitude sites, with altitudes between 1200 and 3600 m a.s.l.

Download

Figure 11 compares global mean vertical profiles of N_CCN from the datasets in Fig. 10 and from in-situ observations (Watson-Parris et al., 2019). Vertical distributions vary significantly. GiOcean, CALIOP, and in-situ profiles exhibit similar shapes with peak concentrations around 950 hPa, while the MAMnet-MERRA2 and CAMS profiles show a monotonic decrease with altitude. The peak in N_CCN at 950 hPa may result from more efficient aerosol scavenging near the surface, better represented by two-moment cloud microphysics in GiOcean (Song et al., 2025; Barahona et al., 2014). In contrast, MAMnet-MERRA2 and CAMS rely on single-moment cloud microphysics, which may explain the smoother decrease in N_CCN with height. The reliance on single-moment microphysics may also explain the more gradual decrease in N_CCN with height in CAMS and MAMnet-MERRA2 than in the other data sets, noticeable over the ocean. In the free troposphere, MAMnet-MERRA2 aligns more closely with GiOcean, CALIOP and the in situ data.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f10

Figure 10MAMnet-derived CCN at 0.2 % supersaturation at 900 hPa using the MERRA2 reanalysis (MAMnet-MERRA2) against global CCN datasets. Also shown are results from the GiOcean reanalysis (Song et al., 2025), CALIOP- (Choudhury and Tesche, 2022), and CAMS- (Block et al., 2024) derived CCN.

https://gmd.copernicus.org/articles/19/2437/2026/gmd-19-2437-2026-f11

Figure 11Annual mean profile of CCN concentration derived from MERRA2 using MAMnet (red). Also shown are CALIOP-derived CCN (magenta; Choudhury and Tesche, 2022), the GiOcean reanalysis (blue; Song et al., 2025), CCN derived from field campaign data around the globe (green; Watson-Parris et al., 2019), and CAMS-derived CCN (black; Block et al., 2024).

Download

4 Conclusions

This study develops a neural network, termed MAMnet, to predict the aerosol size distribution and mixing state using as input the bulk mass of different aerosol species, temperature and density. MAMnet is oriented towards allowing a better estimation of the ASD and the aerosol physicochemical properties in cases where computational cost considerations prevent the usage two and higher moment aerosol microphysics schemes, for instance, weather forecast, or where limited information is available to constraint the ASD as in remote sensing and data assimilation. The neural network was optimized for performance taking into account the model architecture, training parameters and the rank of the data used as input.

MAMnet is designed to reproduce the mapping from aerosol mass to size distribution as generated by the MAM7 scheme within the GEOS system. As such, it inherits the assumptions and meteorological context of the training simulations. However through a comprehensive evaluation of the NN model against simulated data and observations, we demosntrated that MAMnet is a robust and accurate model over a wide set of conditions. Importantly, MAMnet reproduces MERRA-2 aerosol concentrations when driven by MERRA-2 inputs, demonstrating that it has learned physical relationships rather than memorizing the training data, conserving the total aerosol mass. Explainable machine learning analysis showed that MAMnet identifies key physical drivers and the non-linear behavior governing the aerosol distribution across fine and coarse scales.

Comparison of MAMnet predictions against a reference dataset from GEOS+MAM7 simulations resulted in good agreement, with log-mean residuals typically below 0.1 and spatial correlation typically exceeding 0.9 for all aerosol modes. The greatest discrepancies were observed near the surface and in regions with low aerosol concentrations i.e., fine dust over oceans and coarse dust in the Southern Hemisphere free troposphere. These discrepancies are primarily attributed to challenges in the prediction of concentrations near zero and class imbalance. Notably, biases in number and mass concentrations do not significantly influence the prediction of geometric mean diameter, indicating that MAMnet captures the physical consistency between aerosol mass and number, inherently conserving mass.

We took advantage of the fact that MAMnet can be driven by output from reanalysis data to evaluate its performance against observations. When driven using collocated MERRA-2 fields, MAMnet reasonably reproduced the measured aerosol size distribution at different ground observation sites, representing a variety of aerosol composition, origin and meteorological conditions. The median values of the predicted number concentrations were generally consistent with observations. However the range of values predicted by MAMnet is in general smaller than observed, indicating that the model underestimates variability. It is likely that coarse reanalysis inputs, limited training data diversity, and the complexities of aerosol evolution, not modeled explicitly by MAMnet, contributed to the observed discrepancies.

CCN concentrations derived from MAMnet using the MERRA2 dataset were within the range of reported values but showed discrepancy near the surface and in regions with high variability, which may originate from uncertainty in the MERRA-2 aerosol fields. As MAMnet reproduces well the training dataset it is likely that the biases against observations result from biases in the input and complex physics not modelled by MAM7. At this point it is difficult to explicitly attribute sources of error in MAMnet and this is left for future research. Despite such biases, the comparison against observations indicate that MAMnet is able to capture the aerosol size distribution on regional and global scales.

Strategies to address the remaining biases include applying physical constraints via transfer learning, as well as including observational data during the training process (Barahona et al., 2024). Class imbalance can potentially be addressed by reconfiguring the NN such that each mode would be predicted by a separate layer, or even individual NNs. The latter option is less desirable because it would require developing, constraining, and maintaining multiple NNs as opposed to one. Future work would focus on applying MAMnet to elucidate long-term trends in the ASD as well as on its implementation with GCMs (Ott et al., 2020). MAMnet is designed to be resolution-agnostic, but the relationship between aerosol mass and size distribution may vary with model resolution and it is suggested to be explored in future work. Including additional input variables, such as gaseous species or solar radiation, may make MAMnet predictions more physically interpretable. Neverthless, the model developed here provides a versatile foundation to improve the physical representation of aerosols in weather forecasting, remote sensing and data assimilation, potentially enhancing our understanding of their role in the climate system.

Appendix A: Network Training and Optimization

The MAMnet model was trained using the Keras library with Tensorflow backend (Chollet, 2015). Optimization was carried out with the Adam algorithm (Kingma and Ba, 2014) using the minimum mean square error (MSE) as the loss function, with no additional constraints. Hyperparameter optimization for MAMnet was performed using the Keras Tuner software (O'Malley et al., 2019). Approximately 1500 optimization trials were performed using random configurations of the hyperparameters in Table A1, using a subset of the training data as in Yu et al. (2024). All trials used the same subset of the training/validation data (5 output files for training, 2 for validation). For each parameter set, a new model was built and trained for up to 100 epochs with the same early stopping criteria used during the training of MAMnet. For each trial, a custom metric was recorded at the end of each epoch, the convergence loss (ℒ_conv), defined as the absolute difference between the training and validation losses. Using this custom metric allowed us to select the model that generalizes the best over both the training and validation data sets. The best set of hyperparameters was selected by choosing the configuration that minimized the MSE on the validation set and had the lowest ℒ_conv.

Table A1Parameters used during hyperparameter tuning for MAMnet. Optimal hyperparameters are shown in bold.

Download Print Version | Download XLSX

Code and data availability

The MERRA-2 Reanalysis is publicly available from https://doi.org/10.5067/WWQSXQ8IVFW8 (GMAO, 2015). The MAMnet model and training and test data can be downloaded at https://doi.org/10.5281/zenodo.15190121 (Barahona and Breen, 2025). CAMS data was obtained from https://doi.org/10.26050/WDCC/QUAERERE_CCNCAMS_v1 (Block, 2023). Data from the GiOcean reanalysis was downloaded from https://portal.nccs.nasa.gov/datashare/gmao/GiOCEAN/ (GMAO, 2025). CALIOP data was obtained from https://doi.org/10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20 (CALIPSO, 2023). Code and training datasets used in this work can be downloaded at https://doi.org/10.5281/zenodo.15190121 (Barahona and Breen, 2025).

Author contributions

DB conceived and directed the work. KHB co-developed of the neural network model. AD implemented the MAM model within GEOS. KB Provided CCN data for comparison.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

Resources supporting this work were provided by the NASA High-End Computing (HEC) Program through the NASA Center for Climate Simulation (NCCS) at Goddard Space Flight Center. Keras and Tensorflow libraries were obtained from https://keras.io/ (last access: 23 March 2026). Maps were created using the NCAR Command Language (NCAR, 2019). The SHAP python package was used to conduct the explainable machine learning analysis as described in https://shap.readthedocs.io/en/latest/ (last access: 23 March 2026). This work was supported by the NASA Modeling, Analysis and Prediction program, Grant NNH20ZDA001N-MAP.

Financial support

This research has been supported by the National Aeronautics and Space Administration (grant no. NNH20ZDA001N-MAP).

Review statement

This paper was edited by Slimane Bekki and reviewed by three anonymous referees.

References

Aas, K., Jullum, M., and Løland, A.: Explaining individual predictions when features are dependent: More accurate approximations to Shapley values, Artif. Intell., 298, 103502, https://doi.org/10.1016/j.artint.2021.103502, 2021. a

Adachi, K. and Buseck, P. R.: Internally mixed soot, sulfates, and organic matter in aerosol particles from Mexico City, Atmos. Chem. Phys., 8, 6469–6481, https://doi.org/10.5194/acp-8-6469-2008, 2008. a

Adams, P. J. and Seinfeld, J. H.: Predicting global aerosol size distributions in general circulation models, J. Geophys. Res.-Atmos., 107, https://doi.org/10.1029/2001JD001010, 2002. a

Amunsen, C., Hanssen, J., Semb, A., and Steinnes, E.: Long-range atmospheric transport of trace elements to southern Norway, Atmos. Environ. A Gen., 26, 1309–1324, https://doi.org/10.1016/0960-1686(92)90391-W, 1992. a

Aquila, V., Hendricks, J., Lauer, A., Riemer, N., Vogel, H., Baumgardner, D., Minikin, A., Petzold, A., Schwarz, J. P., Spackman, J. R., Weinzierl, B., Righi, M., and Dall'Amico, M.: MADE-in: a new aerosol microphysics submodel for global simulation of insoluble particles and their mixing state, Geosci. Model Dev., 4, 325–355, https://doi.org/10.5194/gmd-4-325-2011, 2011. a

Arfin, T., Pillai, A. M., Mathew, N., Tirpude, A., Bang, R., and Mondal, P.: An overview of atmospheric aerosol and their effects on human health, Environ. Sci. Pollut. R., 30, 125347–125369, https://doi.org/10.1007/s11356-023-29652-w, 2023. a

Asmi, A., Wiedensohler, A., Laj, P., Fjaeraa, A.-M., Sellegri, K., Birmili, W., Weingartner, E., Baltensperger, U., Zdimal, V., Zikova, N., Putaud, J.-P., Marinoni, A., Tunved, P., Hansson, H.-C., Fiebig, M., Kivekäs, N., Lihavainen, H., Asmi, E., Ulevicius, V., Aalto, P. P., Swietlicki, E., Kristensson, A., Mihalopoulos, N., Kalivitis, N., Kalapov, I., Kiss, G., de Leeuw, G., Henzing, B., Harrison, R. M., Beddows, D., O'Dowd, C., Jennings, S. G., Flentje, H., Weinhold, K., Meinhardt, F., Ries, L., and Kulmala, M.: Number size distributions and seasonality of submicron particles in Europe 2008–2009, Atmos. Chem. Phys., 11, 5505–5538, https://doi.org/10.5194/acp-11-5505-2011, 2011. a, b, c, d, e, f

Barahona, D. and Breen, K.: MAMnet, Zenodo [code], https://doi.org/10.5281/zenodo.15190121, 2025. a, b

Barahona, D., Molod, A., Bacmeister, J., Nenes, A., Gettelman, A., Morrison, H., Phillips, V., and Eichmann, A.: Development of two-moment cloud microphysics for liquid and ice within the NASA Goddard Earth Observing System Model (GEOS-5), Geosci. Model Dev., 7, 1733–1766, https://doi.org/10.5194/gmd-7-1733-2014, 2014. a, b, c

Barahona, D., Breen, K. H., Kalesse-Los, H., and Röttenbacher, J.: Deep Learning Parameterization of Vertical Wind Velocity Variability via Constrained Adversarial Training, Artificial Intelligence for the Earth Systems, 3, e230025, https://doi.org/10.1175/AIES-D-23-0025.1, 2024. a, b

Bender, F. A.-M.: Aerosol forcing: Still uncertain, still relevant, AGU Advances, 1, e2019AV000128, https://doi.org/10.1029/2019AV000128, 2020. a

Bender, F. A.-M., Frey, L., McCoy, D. T., Grosvenor, D. P., and Mohrmann, J. K.: Assessment of aerosol–cloud–radiation correlations in satellite observations, climate models and reanalysis, Cli. Dynam., 52, 4371–4392, https://doi.org/10.1007/s00382-018-4384-z, 2019. a

Bengio, Y., Goodfellow, I., and Courville, A.: Deep learning, vol. 1, MIT Press Cambridge, MA, USA, ISBN 978-0262035613, 2017. a

Birmili, W., Berresheim, H., Plass-Dülmer, C., Elste, T., Gilge, S., Wiedensohler, A., and Uhrner, U.: The Hohenpeissenberg aerosol formation experiment (HAFEX): a long-term study including size-resolved aerosol, H₂SO₄, OH, and monoterpenes measurements, Atmos. Chem. Phys., 3, 361–376, https://doi.org/10.5194/acp-3-361-2003, 2003. a

Birmili, W., Weinhold, K., Nordmann, S., Wiedensohler, A., Spindler, G., Müller, K., Herrmann, H., Gnauk, T., Pitz, M., Cyrys, J., and Flentje, H.: Atmospheric aerosol measurements in the German ultrafine aerosol network (GUAN), Gefahrst. Reinhalt. L, 69, 137–145, 2009. a, b, c, d, e

Block, K.: Cloud condensation nuclei (CCN) numbers derived from CAMS reanalysis EAC4 (Version 1), World Data Center for Climate (WDCC) at DKRZ [data set], https://doi.org/10.26050/WDCC/QUAERERE_CCNCAMS_v1, 2023. a, b

Block, K., Haghighatnasab, M., Partridge, D. G., Stier, P., and Quaas, J.: Cloud condensation nuclei concentrations derived from the CAMS reanalysis, Earth Syst. Sci. Data, 16, 443–470, https://doi.org/10.5194/essd-16-443-2024, 2024. a, b, c, d, e, f

Brenowitz, N. D. and Bretherton, C. S.: Spatially extended tests of a neural network parametrization trained by coarse-graining, J. Adv. Model. Earth Sy., 11, 2728–2744, https://doi.org/10.1029/2019MS001711, 2019. a

Buchard, V., Randles, C., Da Silva, A., Darmenov, A., Colarco, P., Govindaraju, R., Ferrare, R., Hair, J., Beyersdorf, A., Ziemba, L., and Yu, H.: The MERRA-2 aerosol reanalysis, 1980 onward. Part II: Evaluation and case studies, J. Climate, 30, 6851–6872, https://doi.org/10.1175/JCLI-D-16-0613.1, 2017. a, b, c

Buda, M., Maki, A., and Mazurowski, M. A.: A systematic study of the class imbalance problem in convolutional neural networks, Neural Networks, 106, 249–259, https://doi.org/10.1016/j.neunet.2018.07.011, 2018. a

CALIPSO: Cloud–Aerosol Lidar and Infrared Pathfinder Satellite Observation Lidar Level 2 Aerosol Profile V4-20, NASA [data set], https://doi.org/10.5067/CALIOP/CALIPSO/LID_L2_05KMAPRO-STANDARD-V4-20, 2023. a

C̆ervenkova, J. and Vá n̆a, M.: Trend Assessment of deposition, throughfall and runoff water chemistry at the ICP-IM station Kosetice, Czech Republic, IAHS-AISH Publication, 336, 103–108, 2010. a

Charron, A., Birmili, W., and Harrison, R. M.: Factors influencing new particle formation at the rural site, Harwell, United Kingdom, J. Geophys. Res.-Atmos., 112, https://doi.org/10.1029/2007JD008425, 2007. a

Chin, M., Rood, R. B., Lin, S.-J., Müller, J.-F., and Thompson, A. M.: Atmospheric sulfur cycle simulated in the global model GOCART: Model description and global properties, J. Geophys. Res.-Atmos., 105, 24671–24687, https://doi.org/10.1029/2000JD900384, 2000. a, b

Chollet, F.: Keras, GitHub [code], https://github.com/fchollet/keras (last access: 23 March 2026), 2015. a

Choudhury, G. and Tesche, M.: Estimating cloud condensation nuclei concentrations from CALIPSO lidar measurements, Atmos. Meas. Tech., 15, 639–654, https://doi.org/10.5194/amt-15-639-2022, 2022. a, b, c, d

Christensen, M. W., Jones, W. K., and Stier, P.: Aerosols enhance cloud lifetime and brightness along the stratus-to-cumulus transition, P. Natl. Acad. Sci. USA, 117, 17591–17598, https://doi.org/10.1073/pnas.1921231117, 2020. a

Chu, D., Kaufman, Y., Ichoku, C., Remer, L., Tanré, D., and Holben, B.: Validation of MODIS aerosol optical depth retrieval over land, Geophys. Res. Lett., 29, https://doi.org/10.1029/2001GL013205, 2002. a

Colarco, P., da Silva, A., Chin, M., and Diehl, T.: Online simulations of global aerosol distributions in the NASA GEOS-4 model and comparisons to satellite and ground-based aerosol optical depth, J. Geophys. Res., 115, D14207, https://doi.org/10.1029/2009JD012820, 2010a. a, b

Colarco, P., da Silva, A., Chin, M., and Diehl, T.: Online simulations of global aerosol distributions in the NASA GEOS-4 model and comparisons to satellite and ground-based aerosol optical depth, J. Geophys. Res.-Atmos., 115, https://doi.org/10.1029/2009JD012820, 2010b. a

Engler, C., Rose, D., Wehner, B., Wiedensohler, A., Brüggemann, E., Gnauk, T., Spindler, G., Tuch, T., and Birmili, W.: Size distributions of non-volatile particle residuals (D_p<800 nm) at a rural site in Germany and relation to air mass origin, Atmos. Chem. Phys., 7, 5785–5802, https://doi.org/10.5194/acp-7-5785-2007, 2007. a

Forster, P., Ramaswamy, V., Artaxo, P., Berntsen, T., Betts, R., Fahey, D. W., Haywood, J., Lean, J., Lowe, D. C., Myhre, G., Nganga, J., Prinn, R., Raga, G., Schulz, M., and Van Dorland, R.: Changes in Atmospheric Constituents and in Radiative Forcing, in: Climate Change 2007: The Physical Science Basis. Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Solomon, S., Qin, D., Manning, M., Chen, Z., Marquis, M., Averyt, K. B., Tignor, M., and Miller, H. L., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, ISBN 9780521880091, 2007. a

Fountoukis, C. and Nenes, A.: Continued development of a cloud droplet formation parameterization for global climate models, J. Geophys. Res.-Atmos., 110, https://doi.org/10.1029/2004JD005591, 2005. a

Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2), J. Climate, 30, 5419–5454, https://doi.org/10.1175/JCLI-D-16-0758.1, 2017. a, b, c

Ginoux, P., Chin, M., Tegen, I., Prospero, J. M., Holben, B., Dubovik, O., and Lin, S.-J.: Sources and distributions of dust aerosols simulated with the GOCART model, J. Geophys. Res.-Atmos., 106, 20255–20273, https://doi.org/10.1029/2000JD000053, 2001. a

GMAO: MERRA-2 inst3_3d_asm_Nv: 3d,3-Hourly,Instantaneous,Model-Level,Assimilation,Assimilated Meteorological Fields V5.12.4, Goddard Earth Sciences Data and Information Services Center (GES DISC) [data set], https://doi.org/10.5067/WWQSXQ8IVFW8, 2015. a

GMAO: GiOcean Coupled Reanalysis, GMAO [data set], https://portal.nccs.nasa.gov/datashare/gmao/GiOCEAN/ (last access: 23 March 2026), 2025. a

Gong, X., Wex, H., Müller, T., Henning, S., Voigtländer, J., Wiedensohler, A., and Stratmann, F.: Understanding aerosol microphysical properties from 10 years of data collected at Cabo Verde based on an unsupervised machine learning classification, Atmos. Chem. Phys., 22, 5175–5194, https://doi.org/10.5194/acp-22-5175-2022, 2022. a

Gruening, C., Adam, M., Cavalli, F., Cavalli, P., Dell’Acqua, A., Martins Dos Santos, S., Pagliari, V., Roux, D., and Putaud, J.: JRC Ispra EMEP–GAW Regional Station for Atmos. Res, Tech. Rep. JRC55382, European Commission, https://publications.jrc.ec.europa.eu/repository/handle/JRC55382 (last access: 23 March 2026), 2009. a

Gueymard, C. A. and Yang, D.: Worldwide validation of CAMS and MERRA-2 reanalysis aerosol optical depth products using 15 years of AERONET observations, Atmos. Environ., 225, 117216, https://doi.org/10.1016/j.atmosenv.2019.117216, 2020. a

Harder, P., Watson-Parris, D., Stier, P., Strassel, D., Gauger, N. R., and Keuper, J.: Physics-Informed Learning of Aerosol Microphysics, arXiv [preprint], https://doi.org/10.48550/arXiv.2207.11786, 24 July 2022. a, b

Hari, P., Nikinmaa, E., Pohja, T., Siivola, E., Bäck, J., Vesala, T., and Kulmala, M.: Station for measuring ecosystem-atmosphere relations: SMEAR, in: Physical and physiological forest ecology, Springer Nature, 471–487, https://doi.org/10.1007/978-94-007-5603-8_9, 2013. a

Inness, A., Ades, M., Agustí-Panareda, A., Barré, J., Benedictow, A., Blechschmidt, A.-M., Dominguez, J. J., Engelen, R., Eskes, H., Flemming, J., Huijnen, V., Jones, L., Kipling, Z., Massart, S., Parrington, M., Peuch, V.-H., Razinger, M., Remy, S., Schulz, M., and Suttie, M.: The CAMS reanalysis of atmospheric composition, Atmos. Chem. Phys., 19, 3515–3556, https://doi.org/10.5194/acp-19-3515-2019, 2019. a, b, c

Japkowicz, N. and Stephen, S.: The class imbalance problem: A systematic study, Intell. Data Anal., 6, 429–449, https://doi.org/10.3233/IDA-2002-6504, 2002. a

Jeggle, K., Neubauer, D., Camps-Valls, G., and Lohmann, U.: Understanding cirrus clouds using explainable machine learning, Environmental Data Science, 2, e19, https://doi.org/10.1017/eds.2023.14, 2023. a

Jennings, S., O'Dowd, C., O'Connor, T., and McGovern, F.: Physical characteristics of the ambient aerosol at Mace Head, Atmos. Environ. A Gen., 25, 557–562, https://doi.org/10.1016/0960-1686(91)90052-9, 1991. a

Jia, Y., Andersen, H., and Cermak, J.: Analysis of the cloud fraction adjustment to aerosols and its dependence on meteorological controls using explainable machine learning, Atmos. Chem. Phys., 24, 13025–13045, https://doi.org/10.5194/acp-24-13025-2024, 2024. a

Jones, A., Roberts, D., and Slingo, A.: A climate model study of indirect radiative forcing by anthropogenic sulphate aerosols, Nature, 370, 450–453, https://doi.org/10.1038/370450a0, 1994. a

Jurányi, Z., Gysel, M., Weingartner, E., Bukowiecki, N., Kammermann, L., and Baltensperger, U.: A 17 month climatology of the cloud condensation nuclei number concentration at the high alpine site Jungfraujoch, J. Geophys. Res.-Atmos., 116, https://doi.org/10.1029/2010JD015199, 2011. a

Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv [preprint], https://doi.org/10.48550/arXiv.1412.6980, 22 December 2014. a

Kirpes, R. M., Bondy, A. L., Bonanno, D., Moffet, R. C., Wang, B., Laskin, A., Ault, A. P., and Pratt, K. A.: Secondary sulfate is internally mixed with sea spray aerosol and organic aerosol in the winter Arctic, Atmos. Chem. Phys., 18, 3937–3949, https://doi.org/10.5194/acp-18-3937-2018, 2018. a

Kiss, G., Varga, B., Galambos, I., and Ganszky, I.: Characterization of water-soluble organic matter isolated from atmospheric fine aerosol, J. Geophys. Res.-Atmos., 107, https://doi.org/10.1029/2001JD000603, 2002. a

Kreidenweis, S. M., Koehler, K., DeMott, P. J., Prenni, A. J., Carrico, C., and Ervens, B.: Water activity and activation diameters from hygroscopicity data - Part I: Theory and application to inorganic salts, Atmos. Chem. Phys., 5, 1357–1370, https://doi.org/10.5194/acp-5-1357-2005, 2005. a

Kristensson, A., Dal Maso, M., Swietlicki, E., Hussein, T., Zhou, J., Kerminen, V.-M., and Kulmala, M.: Characterization of new particle formation events at a background site in Southern Sweden: relation to air mass history, Tellus B, 60, 330–344, https://doi.org/10.1111/j.1600-0889.2008.00345.x, 2008. a

Kwon, Y., An, S. A., Song, H.-J., and Sung, K.: Particulate Matter Prediction and Shapley Value Interpretation in Korea through a Deep Learning Model, SOLA, 19, 225–231, https://doi.org/10.2151/sola.2023-029, 2023. a

Langner, J. and Rodhe, H.: A global three-dimensional model of the tropospheric sulfur cycle, J. Atmos. Chem., 13, 225–263, https://doi.org/10.1007/BF00058134, 1991. a

Lee, L. A., Pringle, K. J., Reddington, C. L., Mann, G. W., Stier, P., Spracklen, D. V., Pierce, J. R., and Carslaw, K. S.: The magnitude and causes of uncertainty in global model simulations of cloud condensation nuclei, Atmos. Chem. Phys., 13, 8879–8914, https://doi.org/10.5194/acp-13-8879-2013, 2013. a

Lihavainen, H., Kerminen, V.-M., Komppula, M., Hyvärinen, A.-P., Laakia, J., Saarikoski, S., Makkonen, U., Kivekäs, N., Hillamo, R., Kulmala, M., and Viisanen, Y.: Measurements of the relation between aerosol properties and microphysics and chemistry of low level liquid water clouds in Northern Finland, Atmos. Chem. Phys., 8, 6925–6938, https://doi.org/10.5194/acp-8-6925-2008, 2008. a

Liu, X., Easter, R. C., Ghan, S. J., Zaveri, R., Rasch, P., Shi, X., Lamarque, J.-F., Gettelman, A., Morrison, H., Vitt, F., Conley, A., Park, S., Neale, R., Hannay, C., Ekman, A. M. L., Hess, P., Mahowald, N., Collins, W., Iacono, M. J., Bretherton, C. S., Flanner, M. G., and Mitchell, D.: Toward a minimal representation of aerosols in climate models: description and evaluation in the Community Atmosphere Model CAM5, Geosci. Model Dev., 5, 709–739, https://doi.org/10.5194/gmd-5-709-2012, 2012. a, b, c, d

Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model predictions, Advances in neural information processing systems, 30, https://doi.org/10.48550/arXiv.1705.07874, 22 May 2017. a

Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I.: From local explanations to global understanding with explainable AI for trees, Nature Machine Intelligence, 2, 56–67, https://doi.org/10.1038/s42256-019-0138-9, 2020. a

Ma, P. L. and Stinis, P.: Developing a simulator-based satellite dataset for using machine learning techniques to derive aerosol-cloud-precipitation interactions in models and observations in a consistent framework, Tech. rep., Pacific Northwest National Laboratory (PNNL), Richland, WA (United States), https://doi.org/10.2172/1984697, 2020. a

Mann, G. W., Carslaw, K. S., Spracklen, D. V., Ridley, D. A., Manktelow, P. T., Chipperfield, M. P., Pickering, S. J., and Johnson, C. E.: Description and evaluation of GLOMAP-mode: a modal global aerosol microphysics model for the UKCA composition-climate model, Geosci. Model Dev., 3, 519–551, https://doi.org/10.5194/gmd-3-519-2010, 2010. a

Marinoni, A., Cristofanelli, P., Calzolari, F., Roccato, F., Bonafè, U., and Bonasoni, P.: Continuous measurements of aerosol physical parameters at the Mt. Cimone GAW Station (2165 m asl, Italy), Sci. Total Environ., 391, 241–251, https://doi.org/10.1016/j.scitotenv.2007.10.004, 2008. a

McCoy, D., Bender, F.-M., Mohrmann, J., Hartmann, D., Wood, R., and Grosvenor, D.: The global aerosol-cloud first indirect effect estimated using MODIS, MERRA, and AeroCom, J. Geophys. Res.-Atmos., 122, 1779–1796, https://doi.org/10.1002/2016JD026141, 2017. a

Mihalopoulos, N., Stephanou, E., Kanakidou, M., Pilitsidis, S., and Bousquet, P.: Tropospheric aerosol ionic composition in the Eastern Mediterranean region, Tellus B, 49, 314–326, https://doi.org/10.3402/tellusb.v49i3.15970, 1997. a

Molod, A., Takacs, L., Suarez, M., and Bacmeister, J.: Development of the GEOS-5 atmospheric general circulation model: evolution from MERRA to MERRA2, Geosci. Model Dev., 8, 1339–1356, https://doi.org/10.5194/gmd-8-1339-2015, 2015. a

Molod, A., Hackert, E., Vikhliaev, Y., Zhao, B., Barahona, D., Vernieres, G., Borovikov, A., Kovach, R. M., Marshak, J., Schubert, S., Li, Z., Lim, Y.-K., Andrews, L. C., Cullather, R., Koster, R., Achuthavarier, D., Carton, J., Coy, L., Friere, J. L. M., Longo, K. M., Nakada, K., and Pawson, S.: GEOS-S2S version 2: The GMAO high-resolution coupled model and assimilation system for seasonal prediction, J. Geophys. Res.-Atmos., 125, e2019JD031767, https://doi.org/10.1029/2019JD031767, 2020. a, b

Nair, A. A., Yu, F., Campuzano-Jost, P., DeMott, P. J., Levin, E. J. T., Jimenez, J. L., Peischl, J., Pollack, I. B., Fredrickson, C. D., Beyersdorf, A. J., Nault, B. A., Park, M., Yum, S. S., Palm, B. B., Xu, L., Bourgeois, I., Anderson, B. E., Nenes, A., Ziemba, L. D., Moore, R. H., Lee, T., Park, T., Thompson, C. R., Flocke, F., Huey, L. G., Kim, M. J., and Peng, Q.: Machine Learning Uncovers Aerosol Size Information From Chemistry and Meteorology to Quantify Potential Cloud-Forming Particles, Geophys. Res. Lett., 48, e2021GL094133, https://doi.org/10.1029/2021GL094133, 2021. a

NCAR: NCAR Command Language (Version 6.6.2), UCAR/NCAR/CISL/TDD [software], https://doi.org/10.5065/D6WD3XH5, 2019. a

Nojarov, P., Ivanov, P., Kalapov, I., Penev, I., and Drenska, M.: Connection between ozone concentration and atmosphere circulation at peak Moussala, Theor. Appl. Climatol., 98, 201–208, https://doi.org/10.1007/s00704-009-0173-2, 2009. a

O'Malley, T., Bursztein, E., Long, J., Chollet, F., Jin, H., and Invernizzi, L.: Keras Tuner, https://github.com/keras-team/keras-tuner (last access: 23 March 2026), 2019. a

Ott, J., Pritchard, M., Best, N., Linstead, E., Curcic, M., and Baldi, P.: A Fortran-Keras deep learning bridge for scientific computing, Scientific Programming, 2020, https://doi.org/10.1155/2020/8888811, 2020. a

Philippin, S., Laj, P., Putaud, J.-P., Wiedensohler, A., Leeuw, G. D., Fjaeraa, A. M., Platt, U., Baltensperger, U., and Fiebig, M.: EUSAAR-An unprecedented network of aerosol observation in Europe, Journal of Aerosol Research (Earozoru Kenkyu), 24, 78–83, https://doi.org/10.11203/jar.24.78, 2009. a

Pierce, J. R., Croft, B., Kodros, J. K., D'Andrea, S. D., and Martin, R. V.: The importance of interstitial particle scavenging by cloud droplets in shaping the remote aerosol size distribution and global aerosol-climate effects, Atmos. Chem. Phys., 15, 6147–6158, https://doi.org/10.5194/acp-15-6147-2015, 2015. a

Randles, C. A., da Silva, A. M., Buchard, V., Colarco, P. R., Darmenov, A., Govindaraju, R., Smirnov, A., Holben, B., Ferrare, R., Hair, J., Shinozuka, Y., and Flynn, C. J.: The MERRA-2 Aerosol Reanalysis, 1980 Onward. Part I: System Description and Data Assimilation Evaluation, J. Climate, 30, 6823–6850, https://doi.org/10.1175/JCLI-D-16-0609.1, 2017. a, b, c, d, e

Rasp, S., Pritchard, M. S., and Gentine, P.: Deep learning to represent subgrid processes in climate models, P. Natl. Acad. Sci. USA, 115, 9684–9689, https://doi.org/10.1073/pnas.1810286115, 2018. a, b

Reddington, C., Carslaw, K., Stier, P., Schutgens, N., Coe, H., Liu, D., Allan, J., Pringle, K., Lee, L., Yoshioka, M., Johnson, J. S., Regayre, L. A., Spracklen, D. V., Mann, G. W., Clarke, A., Hermann, M., Henning, S., Wex, H., Kristensen, T. B., Leaitch, W. R., Pöschl, U., Rose, D., Andreae, M. O., Schmale, J., Kondo, Y., Oshima, N., Schwarz, J. P., Nenes, A., Anderson, B., Roberts, G. C., Snider, J. R., Leck, C., Quinn, P. K., Chi, X., Ding, A., Jimenez, J. L., and Zhang, Q.: The Global Aerosol Synthesis and Science Project (GASSP): measurements and modeling to reduce uncertainty, B. Am. Meteorol. Soc., 98, 1857–1877, https://doi.org/10.1175/BAMS-D-15-00317.1, 2017. a

Remer, L. A., Kaufman, Y., Tanré, D., Mattoo, S., Chu, D., Martins, J. V., Li, R.-R., Ichoku, C., Levy, R., Kleidman, R., Eck, T. F., Vermote, E., and Holben, B. N.: The MODIS aerosol algorithm, products, and validation, J. Atmos. Sci., 62, 947–973, https://doi.org/10.1175/JAS3385.1, 2005. a, b

Reynolds, R. W., Rayner, N. A., Smith, T. M., Stokes, D. C., and Wang, W.: An improved in situ and satellite SST analysis for climate, J. Climate, 15, 1609–1625, https://doi.org/10.1175/1520-0442(2002)015<1609:AIISAS>2.0.CO;2, 2002. a

Riemer, N., Ault, A., West, M., Craig, R., and Curtis, J.: Aerosol mixing state: Measurements, modeling, and impacts, Rev. Geophys., 57, 187–249, https://doi.org/10.1029/2018RG000615, 2019. a, b, c, d

Rienecker, M., Suarez, M., Todling, R., Bacmeister, J., Takacs, L., Liu, H.-C., Gu, W., Sienkiewicz, M., Koster, R., Gelaro, R., Stajner, I., and Nielsen, J.: The GEOS-5 Data Assimilation System – Documentation of Versions 5.0.1, 5.1.0, and 5.2.0., vol. 27 of Technical Report Series on Global Modeling and Data Assimilation, NASA Goddard Space Flight Center, Greenbelt, MD, USA, https://ntrs.nasa.gov/citations/20120011955 (last access: 23 March 2026), 2008. a

Russchenberg, H., Bosveld, F., Swart, D., ten Brink, H., de Leeuw, G., Uijlenhoet, R., Arbesser-Rastburg, B., van der Marel, H., Ligthart, L., Boers, R., and Apituley, A.: Ground-based atmospheric remote sensing in the Netherlands: European outlook, IEICE T. Commun., 88, 2252–2258, https://doi.org/10.1093/ietcom/e88-b.6.2252, 2005. a

Seinfeld, J. H. and Pandis, S. N.: Atmospheric chemistry and physics: from air pollution to climate change, John Wiley & Sons, ISBN 0471720186, 2016. a, b, c, d, e

Seinfeld, J. H., Bretherton, C., Carslaw, K. S., Coe, H., DeMott, P. J., Dunlea, E. J., Feingold, G., Ghan, S., Guenther, A. B., Kahn, R., Kraucunas, I., Kreidenweis, S. M., Molina, M. J., Nenes, A., Penner, J. E., Prather, K. A., Ramanathan, V., Ramaswamy, V., Rasch, P. J., Ravishankara, A. R., Rosenfeld, D., Stephens, G., and Wood, R.: Improving our fundamental understanding of the role of aerosol- cloud interactions in the climate system, P. Natl. Acad. Sci. USA, 113, 5781–5790, https://doi.org/10.1073/pnas.1514043113, 2016. a, b

Silva, S. J., Ma, P.-L., Hardin, J. C., and Rothenberg, D.: Physically regularized machine learning emulators of aerosol activation , Geosci. Model Dev., 14, 3067–3077, https://doi.org/10.5194/gmd-14-3067-2021, 2021. a

Song, C., McCoy, D., Molod, A., Aerenson, T., and Barahona, D.: Signatures of aerosol-cloud interactions in GiOcean: a coupled global reanalysis with two-moment cloud microphysics, Atmos. Chem. Phys., 25, 15567–15592, https://doi.org/10.5194/acp-25-15567-2025, 2025. a, b, c, d, e

Stier, P., Feichter, J., Kinne, S., Kloster, S., Vignati, E., Wilson, J., Ganzeveld, L., Tegen, I., Werner, M., Balkanski, Y., Schulz, M., Boucher, O., Minikin, A., and Petzold, A.: The aerosol-climate model ECHAM5-HAM, Atmos. Chem. Phys., 5, 1125–1156, https://doi.org/10.5194/acp-5-1125-2005, 2005. a

Stier, P., van den Heever, S. C., Christensen, M. W., Gryspeerdt, E., Dagan, G., Saleeby, S. M., Bollasina, M., Donner, L., Emanuel, K., Ekman, A. M., Feingold, G., Field, P., Forster, P., Haywood, J., Kahn, R., Koren, I., Kummerow, C., L’Ecuyer, T., Lohmann, U., Ming, Y., Myhre, G., Quaas, J., Rosenfeld, D., Samset, B., Seifert, A., Stephens, G., and Tao, W. K.: Multifaceted aerosol effects on precipitation, Nat. Geosci., 17, 719–732, https://doi.org/10.1038/s41561-024-01482-6, 2024. a

Ström, J., Umegård, J., Tørseth, K., Tunved, P., Hansson, H.-C., Holmén, K., Wismann, V., Herber, A., and König-Langlo, G.: One year of particle size distribution and aerosol chemical composition measurements at the Zeppelin Station, Svalbard, March 2000–March 2001, Phys. Chem. Earth Pt. A/B/C, 28, 1181–1190, https://doi.org/10.1016/j.pce.2003.08.058, 2003. a

Su, X., Huang, Y., Wang, L., Cao, M., and Feng, L.: Validation and diurnal variation evaluation of MERRA-2 multiple aerosol properties on a global scale, Atmos. Environ., 311, 120019, https://doi.org/10.1016/j.atmosenv.2023.120019, 2023. a

Sun, E., Che, H., Xu, X., Wang, Z., Lu, C., Gui, K., Zhao, H., Zheng, Y., Wang, Y., Wang, H., Sun, T., Liang, Y., Li, X., Sheng, Z., An, L., Zhang, X., and Shi, G.: Variation in MERRA-2 aerosol optical depth over the Yangtze River Delta from 1980 to 2016, Theor. Appl. Climatol., 136, 363–375, https://doi.org/10.1007/s00704-018-2490-9, 2019. a

Takacs, L. L., Suárez, M. J., and Todling, R.: The stability of incremental analysis update, Mon. Weather Rev., 146, 3259–3275, https://doi.org/10.1175/MWR-D-18-0117.1, 2018. a

Tunved, P., Ström, J., and Hansson, H.-C.: An investigation of processes controlling the evolution of the boundary layer aerosol size distribution properties at the Swedish background station Aspvreten, Atmos. Chem. Phys., 4, 2581–2592, https://doi.org/10.5194/acp-4-2581-2004, 2004. a

Ukhov, A., Mostamandi, S., da Silva, A., Flemming, J., Alshehri, Y., Shevchenko, I., and Stenchikov, G.: Assessment of natural and anthropogenic aerosol air pollution in the Middle East using MERRA-2, CAMS data assimilation products, and high-resolution WRF-Chem model simulations, Atmos. Chem. Phys., 20, 9281–9310, https://doi.org/10.5194/acp-20-9281-2020, 2020. a

Ulevicius, V., Byčenkienė, S., Remeikis, V., Garbaras, A., Kecorius, S., Andriejauskienė, J., Jasinevičienė, D., and Mocnik, G.: Characterization of pollution events in the East Baltic region affected by regional biomass fire emissions, Atmos. Res., 98, 190–200, https://doi.org/10.1016/j.atmosres.2010.03.021, 2010. a

Uno, I., Eguchi, K., Yumimoto, K., Takemura, T., Shimizu, A., Uematsu, M., Liu, Z., Wang, Z., Hara, Y., and Sugimoto, N.: Asian dust transported one full circuit around the globe, Nat. Geosci., 2, 557–560, https://doi.org/10.1038/ngeo583, 2009. a

Venzac, H., Sellegri, K., Villani, P., Picard, D., and Laj, P.: Seasonal variation of aerosol size distributions in the free troposphere and residual layer at the puy de Dôme station, France, Atmos. Chem. Phys., 9, 1465–1478, https://doi.org/10.5194/acp-9-1465-2009, 2009. a

Virtanen, A., Joutsensaari, J., Kokkola, H., Partridge, D. G., Blichner, S., Seland, Ø., Holopainen, E., Tovazzi, E., Lipponen, A., Mikkonen, S., Leskinen, A., Hyvärinen, A.-P., Zieger, P., Krejci, R., Ekman, A. M. L., Riipinen, I., Quaas, J., and Romakkaniemi, S.: High sensitivity of cloud formation to aerosol changes, Nat. Geosci., https://doi.org/10.1038/s41561-025-01662-y, 2025. a

Watson-Parris, D., Schutgens, N., Reddington, C., Pringle, K. J., Liu, D., Allan, J. D., Coe, H., Carslaw, K. S., and Stier, P.: In situ constraints on the vertical distribution of global aerosol, Atmos. Chem. Phys., 19, 11765–11790, https://doi.org/10.5194/acp-19-11765-2019, 2019. a, b, c

Whitby, E. R. and McMurry, P. H.: Modal aerosol dynamics modeling, Aerosol Sci. Tech., 27, 673–688, https://doi.org/10.1080/02786829708965504, 1997. a

Wilson, J., Cuvelier, C., and Raes, F.: A modeling study of global mixed aerosol fields, J. Geophys. Res.-Atmos., 106, 34081–34108, https://doi.org/10.1029/2000JD000198, 2001. a

Winter, E.: The shapley value, Handbook of Game Theory with Economic Applications, 3, 2025–2054, https://doi.org/10.1016/S1574-0005(02)03016-3, 2002. a

Yu, S., Ma, P.-L., Singh, B., Silva, S., and Pritchard, M.: Two-step hyperparameter optimization method: Accelerating hyperparameter search by using a fraction of a training dataset, Artificial Intelligence for the Earth Systems, 3, e230013, https://doi.org/10.1175/AIES-D-23-0013.1, 2024. a

Zhang, H., Sharma, G., Dhawan, S., Dhanraj, D., Li, Z., and Biswas, P.: Comparison of discrete, discrete-sectional, modal and moment models for aerosol dynamics simulations, Aerosol Sci. Tech., 54, 739–760, https://doi.org/10.1080/02786826.2020.1723787, 2020. a

Zhou, C., Shen, X., Liu, Z., Zhang, Y., and Xin, J.: Simulating aerosol size distribution and mass concentration with simultaneous nucleation, condensation/coagulation, and deposition with the GRAPES–CUACE, Journal of Meteorological Research, 32, 265–278, https://doi.org/10.1007/s13351-018-7116-8, 2018. a

Zhu, H., Martin, R. V., Croft, B., Zhai, S., Li, C., Bindle, L., Pierce, J. R., Chang, R. Y.-W., Anderson, B. E., Ziemba, L. D., Hair, J. W., Ferrare, R. A., Hostetler, C. A., Singh, I., Chatterjee, D., Jimenez, J. L., Campuzano-Jost, P., Nault, B. A., Dibb, J. E., Schwarz, J. S., and Weinheimer, A.: Parameterization of size of organic and secondary inorganic aerosol for efficient representation of global aerosol optical properties, Atmos. Chem. Phys., 23, 5023–5042, https://doi.org/10.5194/acp-23-5023-2023, 2023. a

Articles

Short summary

Particulate matter impacts Earth's radiation, clouds, and human health, but modeling their size is challenging due to computational and observational limits. We developed a machine learning model to predict aerosol size distributions, which accurately replicates advanced models and field measurements.