<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article"><?xmltex \makeatother\@nolinetrue\makeatletter?><?xmltex \bartext{Development and technical paper}?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-16-4521-2023</article-id><title-group><article-title>Automatic snow type classification of snow micropenetrometer profiles with machine learning algorithms</article-title><alt-title>Automatic snow type classification of snow micropenetrometer profiles</alt-title>
      </title-group><?xmltex \runningtitle{Automatic snow type classification of snow micropenetrometer profiles}?><?xmltex \runningauthor{J. Kaltenborn et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2 aff3 aff4">
          <name><surname>Kaltenborn</surname><given-names>Julia</given-names></name>
          <email>julia.kaltenborn@mail.mcgill.ca</email>
        <ext-link>https://orcid.org/0000-0002-9292-9655</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Macfarlane</surname><given-names>Amy R.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-1638-8885</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2 aff5">
          <name><surname>Clay</surname><given-names>Viviane</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Schneebeli</surname><given-names>Martin</given-names></name>
          <email>schneebeli@slf.ch</email>
        <ext-link>https://orcid.org/0000-0003-2872-4409</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>WSL Institute for Snow and
Avalanche Research SLF, Flüelastrasse 11, 7260 Davos Dorf, Switzerland</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Institute of Cognitive Science, University Osnabrück, Wachsbleiche 27, 49090 Osnabrück, Germany</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Mila – Quebec AI Institute, 6666 Rue Saint-Urbain, QC H2S 3H1, Montréal, Canada</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>School of Computer Science, McGill University, 3480 Rue University, QC H3A 2A7, Montréal, Canada</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Numenta, 889 Winslow Street, Redwood City, CA 94063, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Julia Kaltenborn (julia.kaltenborn@mail.mcgill.ca) and Martin Schneebeli (schneebeli@slf.ch)</corresp></author-notes><pub-date><day>10</day><month>August</month><year>2023</year></pub-date>
      
      <volume>16</volume>
      <issue>15</issue>
      <fpage>4521</fpage><lpage>4550</lpage>
      <history>
        <date date-type="received"><day>28</day><month>September</month><year>2022</year></date>
           <date date-type="rev-request"><day>6</day><month>December</month><year>2022</year></date>
           <date date-type="rev-recd"><day>3</day><month>June</month><year>2023</year></date>
           <date date-type="accepted"><day>16</day><month>June</month><year>2023</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 Julia Kaltenborn et al.</copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023.html">This article is available from https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e139">Snow-layer segmentation and classification are essential diagnostic tasks for various cryospheric applications.
The SnowMicroPen (SMP) measures the snowpack's penetration force at submillimeter intervals in snow depth. The resulting depth–force profile can be parameterized for density and specific surface area. However, no information on traditional snow types is currently extracted automatically. The labeling of snow types is a time-intensive task that requires practice and becomes infeasible for large datasets. Previous work showed that automated segmentation and classification is, in theory, possible but cannot be applied to data straight from the field or needs additional time-costly information, such as from classified snow pits.
We evaluate how well machine learning models can automatically segment and classify SMP profiles to address this gap. We trained 14 models, among them semi-supervised models and artificial neural networks (ANNs), on the MOSAiC SMP dataset, an extensive collection of snow profiles on Arctic sea ice. SMP profiles can be successfully segmented and classified into snow classes based solely on the SMP's signal. The model comparison provided in this study enables SMP users to choose a suitable model for their task and dataset.
The findings presented will facilitate and accelerate snow type identification through SMP profiles. Anyone can access the tools and models needed to automate snow type identification via the software repository “snowdragon”. Overall, snowdragon creates a link between traditional snow classification and high-resolution force–depth profiles. Traditional snow profile observations can be compared to SMP profiles with such a tool.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Swiss Polar Institute</funding-source>
<award-id>DIRCR-2018-003</award-id>
</award-group>
<award-group id="gs2">
<funding-source>Horizon 2020</funding-source>
</award-group>
<award-group id="gs3">
<funding-source>WSL-Institut für Schnee- und Lawinenforschung SLF</funding-source>
<award-id>WSL201812N1678</award-id>
</award-group>
<award-group id="gs4">
<funding-source>Deutsche Forschungsgemeinschaft</funding-source>
<award-id>GRK2340</award-id>
</award-group>
<award-group id="gs5">
<funding-source>Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research</funding-source>
<award-id>AWI PS12200</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e151">The cryosphere covers around 10 % of our Earth and plays a significant role in stabilizing the Earth's climate <xref ref-type="bibr" rid="bib1.bibx50" id="paren.1"/>. Snow cover plays a role in optics, heat, and mass balance and is one of the most significant uncertainties in global climate models <xref ref-type="bibr" rid="bib1.bibx65 bib1.bibx62 bib1.bibx14" id="paren.2"/>. Snow layer segmentation and classification provide knowledge about the atmospheric conditions a snowpack has experienced <xref ref-type="bibr" rid="bib1.bibx8 bib1.bibx15" id="paren.3"/>. This knowledge helps to discern fundamental snow and climate mechanisms in the Arctic and to analyze polar tipping points. Classification of snow types (also referred to as “snow grain type” or “grain type” in the community) is essential to assess the state of our cryosphere. It is thus of interest for polar, cryospheric, and climate change research <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx31 bib1.bibx64" id="paren.4"/>. Snow type is often better reproduced in detailed snow cover models <xref ref-type="bibr" rid="bib1.bibx68" id="paren.5"/> than the snow's effective physical properties, in particular structural anisotropy <xref ref-type="bibr" rid="bib1.bibx31" id="paren.6"/>. This is especially relevant for active and passive microwave sensing, which is essential for mapping the Arctic snowpack during polar night <xref ref-type="bibr" rid="bib1.bibx56" id="paren.7"/>.</p>
      <?pagebreak page4522?><p id="d1e176"><?xmltex \hack{\newpage}?>Traditionally, snow stratigraphy measurements are made in snow pits. These pits are dug manually into snowpacks, requiring trained operators and a substantial time commitment. To accelerate these measurements, the SnowMicroPen (SMP), a portable high-resolution snow penetrometer, can be used <xref ref-type="bibr" rid="bib1.bibx26" id="paren.8"/>. It has been demonstrated that the SMP is a capable tool for rapid snow type classification and layer segmentation. The measurement results are stored in an SMP profile that consists of the penetration force signal of the measurement tip in newtons and the depth signal indicating how far the tip moved.
Afterwards, the SMP profiles must be manually labeled by an expert, which requires time and practice.</p>
      <p id="d1e183">To address these shortcomings, machine learning (ML) algorithms could be used to automate the labeling process. Instead of manually labeling each SMP profile, an ML model can be trained on a few labeled profiles and subsequently reproduce the labeling patterns on other profiles. This would (1) greatly accelerate SMP analysis, (2) enable the analysis of large datasets, and (3) support interdisciplinary scientists who are unfamiliar with snow type categorization.</p>
      <p id="d1e186">Such an automatic classification of SMP profiles helps to find layers with shared properties within a large SMP dataset. By reproducing a trained labeling pattern on new profiles with ML, SMP classification is upscaled. While it is impossible to manually label and analyze a dataset of thousands of SMP profiles, an ML-assisted classification enables us to conduct completely new analyses. Questions like “How does a typical snow layer in the Arctic look?” suddenly move within reach. Statistical analyses of signal and layer types rely on consistent, large, and fully labeled SMP datasets.</p>
      <p id="d1e190">Several previous works have addressed automatically classifying snow types with machine learning algorithms.
The nearest-neighbor method of <xref ref-type="bibr" rid="bib1.bibx57" id="text.9"/> was the first model that automated the segmentation and classification of SMP profiles without needing additional snow pit information. To assign a snow type to an unlabeled data point, the method chooses the most frequent class occurring in the neighborhood of this data point, i.e., among the points most similar to it. Their algorithm predicts five different snow types (“new snow”, “faceted snow”, “depth hoar”, “rounded grains”, “melt–freeze”), with an accuracy ranging from 0.68 to 0.94. However, this high performance is only achieved by integrating specific and inflexible expert rules. For example, one rule ensures that no faceted snow, depth hoar, or rounded grains occur between layers of new snow, although, as the authors point out, precisely this happens under certain circumstances. Hard-coded rules might improve the performance on one dataset, but they cannot capture all phenomena and will not generalize well to other datasets. The performance results are also limited by the fact that their testing set consists of only three SMP profiles; i.e., it is not clear how representative their results are. In addition, their results can hardly be transferred to real-world settings because they explicitly exclude any mixed snow type layers. If an automatic segmentation and classification algorithm is to work with profiles straight from the field, it should be able to handle mixed classes and diverse snow phenomena and be thoroughly tested.</p>
      <p id="d1e196"><xref ref-type="bibr" rid="bib1.bibx21" id="text.10"/> worked with random forests and support vector machines (SVMs) to classify SMP profiles. They used previously segmented SMP profiles and classified the snow type of each layer with the help of a random forest model. They build upon their previous work with single decision trees <xref ref-type="bibr" rid="bib1.bibx20" id="paren.11"/>. They trained the model on three different snow types (new snow, rounded grains, faceted grains), achieving error rates between 16.4 % and 44.4 % (depending on the dataset). Notably, <xref ref-type="bibr" rid="bib1.bibx21" id="text.12"/> requires profiles that have been manually segmented beforehand. Since this is done manually, this takes a considerable amount of time, raising the question as to what extent the task has been “automated”. Only layers larger than 100 mm (sometimes 20 mm) could be considered due to manual segmentation. In the field, particularly for avalanche risk assessment <xref ref-type="bibr" rid="bib1.bibx41" id="paren.13"/>, it is important to detect layers only a few millimeters thick. Improving on the work of <xref ref-type="bibr" rid="bib1.bibx20" id="text.14"/> would thus include more snow types, thinner layers, and no need for manual segmentation.</p>
      <p id="d1e213">More recently, <xref ref-type="bibr" rid="bib1.bibx32" id="text.15"/> trained SVMs on SMP force signals and manual density cutter measurements. Both segmentation and classification are conducted automatically. They distinguish three types of snow grains (“rounded”, “faceted”, and “hoar”) and achieve classification accuracies between 0.76 and 0.83. The profiles were collected on Arctic ice in the same region, which means that the profiles might be more homogeneous than in other datasets. In theory, the model's generalizability could be enhanced by training it on additional, broader datasets. Most importantly, the SVM method by <xref ref-type="bibr" rid="bib1.bibx32" id="text.16"/> relies on additional manual density cutter measurements and time-intensive snow pit measurements that are not always available. Thus, as with the work of <xref ref-type="bibr" rid="bib1.bibx21" id="text.17"/>, covering more snow types and eliminating the need for additional manual density cutter measurements would make the approach more applicable in the field.
In summary, previous work showed that supervised machine learning algorithms are a promising pathway to automatic snow grain categorization.</p>
      <p id="d1e225">While all these works advance the task of automated SMP analysis, SMP users still lack a method that can be used in practice. Users need a model that fully automates their SMP analysis without the need for (1) digging a snow pit, (2) picking layers manually, or (3) constructing specific knowledge rules. Furthermore, SMP users need models that can deal with SMP profiles from the field. This implies that (4) the profiles have multiple snow types (more than three) and that (5) no layers are excluded. This study aims to provide models that fully automate SMP analysis and can be used directly, addressing all five needs mentioned.</p>
      <?pagebreak page4523?><p id="d1e228">To this end, we implemented 14 different machine learning (ML) models and compared their performance on the MOSAiC SMP dataset, consisting of 164 labeled profiles (see Fig. <xref ref-type="fig" rid="Ch1.F1"/>) <xref ref-type="bibr" rid="bib1.bibx29" id="paren.18"/>. We provide the first comparable performance overview of different models classifying and segmenting SMP profiles.
Moreover, we used semi-supervised methods and artificial neural networks (ANNs) for SMP classification.</p>
      <p id="d1e236">Results show that especially artificial neural networks (ANNs), such as the long short-term memory (LSTM) and encoder–decoder, can produce predictions similar to profiles labeled by experts and achieve the best results among all models. However, the choice of the model depends mostly on the individual needs of an SMP user because factors such as explainability, desired sensitivity to rare classes, available time, and computational resources must be considered.</p>
      <p id="d1e240">The work presented here is a methodological contribution. We provide insights into which ML algorithms can be used to automatically and consistently classify large SMP datasets. Our findings can be applied to different SMP datasets or similar data.
The more fine-grained contributions of this study are as follows:
<list list-type="bullet"><list-item>
      <p id="d1e245">demonstration that SMP profiles straight from the field can be automatically segmented and classified without manual preparation of the profiles or additional snow-pit data after training on a smaller set of SMP profiles,</p></list-item><list-item>
      <p id="d1e249">evaluation of semi-supervised models and ANNs for SMP classification,</p></list-item><list-item>
      <p id="d1e253">detailed comparison of different ML models for SMP classification,</p></list-item><list-item>
      <p id="d1e257">use of the snowdragon repository that provides the tools to automate SMP labeling.</p></list-item></list></p>
      <p id="d1e260">In the following section (Sect. <xref ref-type="sec" rid="Ch1.S2"/>) the data, the classification task, and the 14 different models used in this study are described. In Sect. <xref ref-type="sec" rid="Ch1.S3"/>, the models' performances are presented. Subsequently, the results, their limitations, and future work are discussed in Sect. <xref ref-type="sec" rid="Ch1.S4"/>. The impact of this work is addressed in Sect. <xref ref-type="sec" rid="Ch1.S5"/>. The code and data availability is outlined directly after the conclusion, and a detailed guide on how to use snowdragon with your SMP dataset can be found in Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e275">All 164 labeled SnowMicroPen (SMP) profiles used for training, validation (80 %), and testing (20 %). Each bar represents one SMP profile. The colors encode the different snow types. The top of each bar is the air–snow interface, while the bottom is the profile's snow–ground interface. The inset image illustrates the force signal (grey) and the mean force signal (blue) of a single SMP profile (S31H0368). The snow–air interface is on the left, and the bottom of the profile is on the right. The background shading in the inset panel and the colors in the main panel represent the labeling of the profiles.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f01.png"/>

      </fig>

</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Data</title>
      <p id="d1e299">All experiments throughout this study use snow data collected during the MOSAiC expedition (October 2019–September 2020) <xref ref-type="bibr" rid="bib1.bibx48" id="paren.19"/>. The snow pit measurements conducted include SMP profiles, micro-computer tomography (Micro-CT) <xref ref-type="bibr" rid="bib1.bibx9" id="paren.20"/>, and near-infrared (NIR) photographs <xref ref-type="bibr" rid="bib1.bibx44" id="paren.21"/>. Collecting snow profiles on Arctic sea ice was especially challenging because (i) only a few hours were available to perform all measurements within one snow pit and (ii) the measurements had to be conducted at wind velocities of up to <inline-formula><mml:math id="M1" display="inline"><mml:mn mathvariant="normal">25</mml:mn></mml:math></inline-formula> m s<inline-formula><mml:math id="M2" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and temperatures of <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">30</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>C. In addition, changing personnel, i.e., different operators, conducted the snow pit measurements. As a result, traditional stratigraphy analysis and in situ snow grain classification from snow pits carry operator biases. <xref ref-type="bibr" rid="bib1.bibx46" id="text.22"/> could measure only 27 snow pits with stratigraphy under similar conditions.</p>
      <p id="d1e349">In contrast, during the MOSAiC expedition, several thousand (3680) SMP profiles were collected. Out of the 269 snow pit events that included SMP measurements, 102 had NIR measurements and 103 had micro-CT profiles collected simultaneously. A total of 71 snow pit events had all three measurements (SMP, NIR, and micro-CT).
We encountered 8 different snow types. Refer to <xref ref-type="bibr" rid="bib1.bibx15" id="text.23"/> for descriptions of the different snow types referenced here and a classification guideline for snow particles that were visually observed.<fn id="Ch1.Footn1"><p id="d1e355"><xref ref-type="bibr" rid="bib1.bibx15" id="text.24"/> refer only to visually observed snow grains and not to SMP signals.</p></fn></p>
      <p id="d1e360">The main measurements collected were signal profiles from the snow micropenetrometer since it provides profiles quickly, with little physical labor, and independently of the operator. Of the 3680 profiles, 164 from the cold season (January–May 2020) were labeled and evaluated here (see Fig. <xref ref-type="fig" rid="Ch1.F1"/>). The labels, expressed by color in Fig. <xref ref-type="fig" rid="Ch1.F1"/>, indicate which snow type is found at the respective position of the profile. In this study, we focus only on profiles of cold snow that is not experiencing melt, as no standardized interpretation of SMP force profiles exists for wet snow. All profiles collected in the cold season are referred to as “MOSAiC winter data” in the following.
Micro-CT and NIR data were recorded whenever possible to validate the subsequent labeling of the SMP profiles. More details on the collection methods can be found in <xref ref-type="bibr" rid="bib1.bibx42" id="text.25"/>. A comparison of these instruments can be seen in Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/> in Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F9"/>.</p>
      <?pagebreak page4524?><p id="d1e374">The labeling of the SMP profiles was conducted by two snow experts and is based on the properties of the force signal (magnitude, frequency, and gradient) and the signature of the SMP signal. The labeling procedure is described in detail in Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>, building upon the notion and observations of <xref ref-type="bibr" rid="bib1.bibx58" id="paren.26"/>. The first labeling phase was conducted by one expert, and in the second phase, two experts revisited the profiles to ensure consistent labeling. Where possible, Micro-CT samples and NIR photography were used to validate the snow types identified from the force signal. When assigning the labels to the SMP profiles, we lean on the abovementioned international classification guideline for seasonal snow on the ground <xref ref-type="bibr" rid="bib1.bibx15" id="text.27"/>. However, we regard the labels assigned to the SMP signals as mere approximations. During the labeling process, signal types are grouped together, and we infer from Micro-CTs which snow type matches each group best. Since we seek a language that is common to the snow community, we use the labels provided by <xref ref-type="bibr" rid="bib1.bibx15" id="text.28"/> where possible. Since <xref ref-type="bibr" rid="bib1.bibx15" id="text.29"/> focuses on Alpine snow and does not cover all snow types on Arctic sea ice, such as different forms of depth hoar (further details are given in Appendix B), we extend those labels where necessary. The resulting labeled profiles were used during training, testing, and validation, while some unlabeled profiles were used for semi-supervised models and out-of-distribution tests. Upscaling consistent labeling of SMP profiles is exactly the type of task that ML algorithms can tackle.</p>
      <p id="d1e392">We preprocessed each SMP profile and the complete labeled dataset.
The surface and the ground of the profiles were detected automatically by the snowmicropyn package (<uri>https://snowmicropyn.readthedocs.io/en/latest/</uri>, last access: 3 August 2023).
For each SMP profile, we replaced negative force values with 0; summarized the signal into bins (1 mm); and added mean, variance, maximum, and minimum force values for those bins. Those values were also determined for a 4 mm and 12 mm moving window. Moreover, the Poisson shot noise model of <xref ref-type="bibr" rid="bib1.bibx40" id="text.30"/> was used to extract <inline-formula><mml:math id="M4" display="inline"><mml:mi mathvariant="italic">δ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M5" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M6" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula>, and the median force value for a 4 and 12 mm window. We added further depth-dependent information, including the distance from the ground and position within the snowpack for each data point.
Refer to Table <xref ref-type="table" rid="App1.Ch1.S3.T3"/> in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/> for an overview of all features used for each SMP profile and to Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/> to see the feature importance for each snow type.</p>
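      <p>The per-profile preprocessing described above can be illustrated with a minimal sketch. It is not the snowdragon implementation: the function names (<monospace>bin_profile</monospace>, <monospace>moving_window</monospace>, <monospace>add_depth_features</monospace>), the dictionary keys, and the use of a trailing window are assumptions made for illustration only, and the Poisson shot noise features are omitted.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from statistics import mean, pvariance


def bin_profile(depths_mm, forces_n, bin_mm=1.0):
    """Clip negative forces to 0 and summarize the signal per depth bin."""
    bins = {}
    for depth, force in zip(depths_mm, forces_n):
        index = int(depth // bin_mm)
        bins.setdefault(index, []).append(max(force, 0.0))
    rows = []
    for index in sorted(bins):
        values = bins[index]
        rows.append({
            "depth_mm": index * bin_mm,
            "mean": mean(values),
            "var": pvariance(values),
            "max": max(values),
            "min": min(values),
        })
    return rows


def moving_window(rows, key, window_bins, func=mean):
    """Apply func to a trailing window of at most window_bins bins.

    The trailing alignment is an assumption; the text does not state
    how the 4 mm and 12 mm windows are aligned.
    """
    values = [row[key] for row in rows]
    out = []
    for i in range(len(values)):
        start = max(0, i - window_bins + 1)
        out.append(func(values[start:i + 1]))
    return out


def add_depth_features(rows):
    """Add distance from the ground and relative position in the snowpack."""
    bottom = rows[-1]["depth_mm"]
    for row in rows:
        row["dist_from_ground_mm"] = bottom - row["depth_mm"]
        row["rel_position"] = row["depth_mm"] / bottom if bottom else 0.0
    return rows
```
]]></preformat>
      <p>With 1 mm bins, calling <monospace>moving_window(rows, "mean", 4)</monospace> and <monospace>moving_window(rows, "mean", 12)</monospace> would give smoothed counterparts of the 4 mm and 12 mm window features under these assumptions.</p>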
      <p id="d1e429">We preprocessed the complete labeled dataset by normalizing it, removing profiles from the melting season, and merging snow classes. For example, “decomposed and fragmented precipitation particles” are merged with the class “precipitation particles” since they represent a similar type of snow. The few occurring ice formations and surface hoar instances in the MOSAiC dataset are summarized in the class “rare”. While a high classification performance cannot be expected for the rare classes, we still include them to show how the models perform on a “real-world dataset” that in most cases will also include classes with few occurrences.
The data preprocessing ensures that the dataset is clean and that all necessary information, such as depth-dependent information, is available during classification.</p>
      <p id="d1e432">The resulting dataset has the following properties. (1) There are multiple noisy and overlapping classes. (2) There is a between-class imbalance; i.e., some snow types occur much more frequently than others. (3) There is a within-class imbalance; i.e., some grain classes contain different sub-grain-classes, but some of them are more frequent than others. (4) The labeling of classes is afflicted with uncertainty; i.e., snow experts themselves are not sure to which class exactly some data points belong. The complexity of the dataset complicates classification and lowers the maximum achievable accuracy.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Task description</title>
      <p id="d1e443">We compare the capabilities of different models to classify and segment the profiles of the MOSAiC winter SMP dataset. To this end, the models first classify each data point of the signal and then summarize the classified points into distinct snow layers (“first-classify-then-segment”). This task can be solved with different learning and classification techniques.</p>
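      <p>The segmentation half of the “first-classify-then-segment” scheme can be sketched as follows, assuming that point-wise labels are already available. The function name <monospace>segment_layers</monospace> and the tuple layout are hypothetical and serve only to illustrate the idea of merging consecutive identical labels into layers.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from itertools import groupby


def segment_layers(depths_mm, labels):
    """Merge runs of identical point-wise labels into (top, bottom, label) layers."""
    layers = []
    position = 0
    for label, run in groupby(labels):
        length = len(list(run))
        top = depths_mm[position]
        bottom = depths_mm[position + length - 1]
        layers.append((top, bottom, label))
        position += length
    return layers
```
]]></preformat>
      <p>In this sketch, every change in the predicted label opens a new layer, so a single misclassified point produces a layer of its own; any smoothing of such points would be an additional design choice.</p>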
      <p id="d1e446">The task can be addressed via independent classification or sequence labeling. In independent classification, each individual point is classified independently, without looking at other data points. The underlying assumption is that each individual data point carries enough information to be classified solely on that basis. In contrast, sequence labeling assumes that the data are an intra-dependent sequence, where the label of each data point also depends on the preceding labels <xref ref-type="bibr" rid="bib1.bibx47" id="paren.31"/>.</p>
      <?pagebreak page4525?><p id="d1e452">The models can follow the supervised, unsupervised, or semi-supervised learning regime. In supervised learning, labels are provided to learn an input–output mapping function <xref ref-type="bibr" rid="bib1.bibx55" id="paren.32"/>. In unsupervised learning, patterns and structure are found in unlabeled data <xref ref-type="bibr" rid="bib1.bibx18" id="paren.33"/>; however, no classification is possible, which is why no unsupervised models are employed here. Instead, semi-supervised models are used, which are able to find structures in sparsely labeled data and leverage this information during classification. In the following, all models employed in this work are briefly presented and put in the context of their learning and task type.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Models</title>
      <p id="d1e469">The majority vote classifier is used as the baseline for the performance comparison and simply always predicts the majority class (“rounded grains wind packed”). It satisfies the criteria that a baseline should not require much expertise, be easy to build, and be quick to evaluate <xref ref-type="bibr" rid="bib1.bibx36" id="paren.34"/>.</p>
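      <p>A majority vote baseline of this kind can be written in a few lines. The sketch below is illustrative only; the class name <monospace>MajorityVoteBaseline</monospace>, the helper <monospace>accuracy</monospace>, and the short label strings are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import Counter


class MajorityVoteBaseline:
    """Always predicts the most frequent label seen during fitting."""

    def fit(self, labels):
        self.majority_ = Counter(labels).most_common(1)[0][0]
        return self

    def predict(self, n_points):
        return [self.majority_] * n_points


def accuracy(true_labels, predicted_labels):
    """Fraction of points whose predicted label matches the true label."""
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)
```
]]></preformat>
      <p>On an imbalanced dataset, the accuracy of such a baseline equals the share of the majority class in the evaluation data, which is the value that any informative model should exceed.</p>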
      <p id="d1e475">The cluster-then-predict models employed in this study comprise three semi-supervised, independent classification models. Unsupervised methods are used to find clusters in the dataset, and a supervised model is subsequently used to assign labels to the clusters <xref ref-type="bibr" rid="bib1.bibx61 bib1.bibx67" id="paren.35"/>. As unsupervised models, <inline-formula><mml:math id="M7" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-means clustering <xref ref-type="bibr" rid="bib1.bibx17 bib1.bibx39" id="paren.36"/>, Gaussian mixture model clustering (GMM) <xref ref-type="bibr" rid="bib1.bibx3" id="paren.37"/>, and Bayesian Gaussian mixture models (BGMM) <xref ref-type="bibr" rid="bib1.bibx3" id="paren.38"/> were used. The supervised part of each model is a simple majority vote within the clusters, which shows whether the unsupervised model adds enough information to beat the majority vote baseline.</p>
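      <p>The supervised step of a cluster-then-predict model can be sketched as a majority vote within each cluster. The sketch assumes that cluster assignments have already been produced by some clustering method; the function name <monospace>cluster_then_predict</monospace>, the use of <monospace>None</monospace> for unlabeled points, and the fallback to the global majority class for clusters without any labeled point are assumptions for illustration.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import Counter, defaultdict


def cluster_then_predict(cluster_ids, labels):
    """Assign each point the majority label of its cluster.

    labels holds a class label per point, or None for unlabeled points.
    Clusters without labeled points fall back to the global majority
    class (an assumption of this sketch).
    """
    known = [label for label in labels if label is not None]
    fallback = Counter(known).most_common(1)[0][0]

    votes = defaultdict(Counter)
    for cluster_id, label in zip(cluster_ids, labels):
        if label is not None:
            votes[cluster_id][label] += 1

    cluster_label = {
        cluster_id: counter.most_common(1)[0][0]
        for cluster_id, counter in votes.items()
    }
    return [cluster_label.get(cluster_id, fallback) for cluster_id in cluster_ids]
```
]]></preformat>
      <p>If all points fell into a single cluster, this procedure would reduce to the majority vote baseline, which is why the comparison against that baseline isolates the contribution of the clustering.</p>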
      <p id="d1e497">Label propagation is a graph-based, semi-supervised, and independent classification algorithm. It propagates the labels of labeled data points to unlabeled ones <xref ref-type="bibr" rid="bib1.bibx74" id="paren.39"/>. Here, a modified version of this algorithm by <xref ref-type="bibr" rid="bib1.bibx73" id="text.40"/> is used (also known as “label spreading”) <xref ref-type="bibr" rid="bib1.bibx72 bib1.bibx51" id="paren.41"/>.</p>
      <p id="d1e509">Self-training turns a given supervised classifier into a semi-supervised independent classifier. It follows an iterative approach of training a supervised model on labeled data, predicting labels for unlabeled data with the model, and retraining the model with the most confident predictions added to the training set <xref ref-type="bibr" rid="bib1.bibx71" id="paren.42"/>.</p>
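      <p>The self-training loop can be sketched as follows. A nearest-centroid classifier stands in for the supervised base model purely to keep the example self-contained; it is not the base classifier used in this study. The function names, the confidence measure, and the values of <monospace>threshold</monospace> and <monospace>max_rounds</monospace> are likewise hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def fit_centroids(points, labels):
    """Stand-in supervised model: one mean vector (centroid) per class."""
    sums, counts = {}, {}
    for point, label in zip(points, labels):
        acc = sums.setdefault(label, [0.0] * len(point))
        for i, value in enumerate(point):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict_with_confidence(centroids, point):
    """Return (label, confidence) using a softmax over negative distances."""
    distances = {label: math.dist(point, c) for label, c in centroids.items()}
    best = min(distances, key=distances.get)
    weights = [math.exp(distances[best] - d) for d in distances.values()]
    return best, 1.0 / sum(weights)


def self_train(labeled, unlabeled, threshold=0.8, max_rounds=10):
    """labeled: list of (point, label) pairs; unlabeled: list of points."""
    labeled = list(labeled)
    pool = list(unlabeled)
    for _ in range(max_rounds):
        centroids = fit_centroids([p for p, _ in labeled], [l for _, l in labeled])
        confident, remaining = [], []
        for point in pool:
            label, confidence = predict_with_confidence(centroids, point)
            target = confident if confidence >= threshold else remaining
            target.append((point, label))
        if not confident:
            break  # nothing new to learn from
        labeled.extend(confident)  # pseudo-labels join the training set
        pool = [point for point, _ in remaining]
    centroids = fit_centroids([p for p, _ in labeled], [l for _, l in labeled])
    return centroids, labeled
```
]]></preformat>
      <p>Points that never reach the confidence threshold stay unlabeled in this sketch, so ambiguous samples do not feed potentially wrong pseudo-labels back into training.</p>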
      <p id="d1e516">Random forests (RFs) are ensembles of diversified decision trees (supervised and independent classification). The diversification happens via tree and feature bagging, where only subsets of data or features are used during training <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx4" id="paren.43"/>. Decision trees are simple-to-build, explainable, white-box classifiers, and for these reasons they are among the most popular machine learning algorithms <xref ref-type="bibr" rid="bib1.bibx70" id="paren.44"/>. Additionally, a balanced random forest was used with random undersampling to balance the data <xref ref-type="bibr" rid="bib1.bibx6" id="paren.45"/>.</p>
      <p id="d1e528">Support vector machines (SVMs) construct a hyperplane in a high-dimensional space to solve binary classification tasks <xref ref-type="bibr" rid="bib1.bibx10 bib1.bibx19" id="paren.46"/> (supervised and independent). When a problem is nonlinearly separable, the input data can be projected into a higher-dimensional space until the problem becomes linearly separable. The kernel trick can be used to circumvent the computationally expensive data transformation involved here. It directly extracts a nonlinear optimal hyperplane <xref ref-type="bibr" rid="bib1.bibx59" id="paren.47"/>.</p>
      <p id="d1e537"><inline-formula><mml:math id="M8" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>-nearest neighbors (KNN) is a local, non-parametric classification method that compares samples and classifies new samples based on their <inline-formula><mml:math id="M9" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest training data points (supervised and independent). The class of the prediction sample is determined via a majority vote <xref ref-type="bibr" rid="bib1.bibx16 bib1.bibx11" id="paren.48"/>.</p>
      <p id="d1e556">Easy ensemble classifiers are ensembles of balanced adaptive boosting classifiers (supervised and independent). The method is especially helpful for imbalanced datasets since the learners are trained on different bootstrap samples, which are balanced via random undersampling <xref ref-type="bibr" rid="bib1.bibx38" id="paren.49"/>.</p>
      <p id="d1e562">Long short-term memories (LSTMs) are a form of artificial neural networks (ANNs) and can perform supervised sequence labeling tasks. ANNs incrementally update their decision function that describes the decision boundary between classes. ANNs have different nodes, which can be seen as representing different parts of the functions that are weighted differently. During training, the weights of the ANN are optimized by minimizing a loss function via gradient descent. A long short-term memory can handle time series data. It consists of different memory cells so the LSTM can forget information that is no longer needed, remember information that is required for future decisions, and retrieve information that is required for current decisions <xref ref-type="bibr" rid="bib1.bibx24 bib1.bibx27" id="paren.50"/>.</p>
      <p id="d1e568">Bidirectional LSTMs (BLSTMs) connect two independent LSTMs, where the first LSTM processes the inputs forward, and the second one processes the inputs backwards. The outputs of both LSTMs are connected to one output. This architecture is helpful when the dependencies of a time series go in both time directions, which is the case for snow profiles <xref ref-type="bibr" rid="bib1.bibx60 bib1.bibx27" id="paren.51"/>.</p>
      <p id="d1e575">Encoder–decoder networks consist of an ANN encoder that compresses the time-dependent information into a vector and a decoder that uses this information to solve a supervised sequence labeling task. Additionally, the attention mechanism can be used to strengthen the ability to learn long-term dependencies by focusing only on the parts of the input sequence that are relevant for the current time step <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx27" id="paren.52"/>.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Evaluation</title>
      <p id="d1e589">In this work, (1) the performance of different models is compared, (2) differences in the classification of different snow<?pagebreak page4526?> types are analyzed, and (3) the generalization capability of the best-performing model is examined. (1) The performance comparison is done by looking at the metrics of each model and the specific predictions on the test dataset. The metrics used here are accuracy, balanced accuracy, weighted precision, F1 score, area under the receiver operating characteristic curve (AUROC), log loss, fitting time, and scoring time (see Appendix <xref ref-type="sec" rid="App1.Ch1.S4"/> for further explanations). (2) The label-wise performance is analyzed with the help of label-wise accuracy plots and receiver operating characteristic (ROC) curves. ROC curves plot the true-positive rate versus the false-positive rate. The higher the area under the ROC curve, the more clearly the model can separate positive from negative samples. (3) The generalization capability is tested by running the best-performing model on 100 random profiles from different parts of MOSAiC winter data. These profiles are outside of the distribution of the training, validation, and testing data, and we refer to them as “out-of-distribution profiles”. Here, the out-of-distribution profiles contain the same classes as the training data, so the model still has a chance to predict the correct labels. Evaluating these three aspects ensures that users can choose a model and know (1) how it performs compared to other models, (2) what to expect from the snow-type-specific predictions, and (3) how robust a chosen model will be.</p>
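      <p>To illustrate why accuracy and balanced accuracy can diverge on imbalanced data, the following sketch computes both metrics with the standard library only. Balanced accuracy is taken here in its common definition as the unweighted mean of the per-class recall values; the toy labels are hypothetical.</p>
<preformat preformat-type="code">
```python
from collections import defaultdict


def accuracy(y_true, y_pred):
    hits = sum(t == p for t, p in zip(y_true, y_pred))
    return hits / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of the per-class recall values."""
    correct, total = defaultdict(int), defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)
    return sum(correct[c] / total[c] for c in total) / len(total)


# Hypothetical imbalanced toy data with three classes.
y_true = ["a"] * 6 + ["b"] * 3 + ["c"] * 1
y_pred = ["a"] * 10  # a majority-vote style prediction
acc = accuracy(y_true, y_pred)
bal_acc = balanced_accuracy(y_true, y_pred)
print(acc, bal_acc)
```
</preformat>
      <p>In this toy case, a classifier that always predicts the majority class reaches an accuracy of 0.6 but a balanced accuracy of only one-third, i.e., one divided by the number of classes.</p>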
</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Experimental setup</title>
      <p id="d1e603">The experimental setup includes a training, validation, and testing framework: roughly 80 % of the labeled dataset is used for training and validation, while the other 20 % is set aside for testing. Validation is realized as 5-fold cross-validation <xref ref-type="bibr" rid="bib1.bibx63" id="paren.53"/>. The hyperparameters were tuned on the validation data, and the best hyperparameters found were used during testing.</p>
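      <p>A hedged sketch of such a split is given below. It treats the data as a list of generic indices (whether these correspond to whole profiles or to individual samples is an assumption left open here), holds out 20 % for testing, and generates five train and validation partitions from the remainder. The function names and the random seed are hypothetical.</p>
<preformat preformat-type="code">
```python
import random


def train_test_split_indices(n_samples, test_fraction=0.2, seed=0):
    # Shuffle once with a fixed seed so that the split is reproducible.
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]


def k_fold_indices(indices, k=5):
    """Yield (train, validation) index lists for each of the k folds."""
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, folds[i]


train_val, test = train_test_split_indices(100)
splits = list(k_fold_indices(train_val, k=5))
print(len(train_val), len(test), len(splits))
```
</preformat>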
      <p id="d1e609">Hyperparameter tuning is the process of searching for the optimal internal learning settings of an ML model. Hyperparameters control the learning process of the models, whereas parameters are learned by the model. The tuning is performed on the validation data, and the hyperparameters that achieve the highest performance for their model are chosen for subsequent model evaluation. Here, tuning was applied moderately and with a simple grid search. All tuning results can be found in the GitHub repository. Specifications of the machine on which the experiments were run can be found in Appendix <xref ref-type="sec" rid="App1.Ch1.S5"/> and descriptions of the model setup can be found in Appendix <xref ref-type="sec" rid="App1.Ch1.S6"/>.</p>
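      <p>A simple grid search can be sketched as an exhaustive loop over all combinations of candidate values. In the snippet below, the grid, the hyperparameter names, and the scoring function are hypothetical; in a real run, the scoring function would return the mean cross-validation score of a model trained with the given hyperparameters.</p>
<preformat preformat-type="code">
```python
from itertools import product


def grid_search(param_grid, evaluate):
    """Return the best parameter combination and its validation score."""
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[n] for n in names)):
        params = dict(zip(names, values))
        score = evaluate(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Hypothetical grid; the names mimic typical random forest settings.
grid = {"n_estimators": [10, 100, 500], "max_depth": [5, 10]}


def toy_validation_score(params):
    # Stand-in for a cross-validated score; peaks at 100 trees and depth 10.
    tree_penalty = abs(params["n_estimators"] - 100) / 1000
    depth_penalty = abs(params["max_depth"] - 10) / 100
    return 0.7 - tree_penalty - depth_penalty


best_params, best_score = grid_search(grid, toy_validation_score)
print(best_params, best_score)
```
</preformat>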

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e619">Results of different models from the categories baseline, semi-supervised, supervised, and ANN. The best values among all models are given in bold. The second-best values among all models are given in italics. The best values among one category are underlined. The area under the curve (AUC) of the receiver operating characteristic (ROC) and logistic loss (log loss) could not be determined for the baseline and some of the semi-supervised models due to the design of these models.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="11">
     <oasis:colspec colnum="1" colname="col1" align="justify" colwidth="1.9cm"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="1.3cm"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="1.1cm"/>
     <oasis:colspec colnum="5" colname="col5" align="justify" colwidth="0.8cm"/>
     <oasis:colspec colnum="6" colname="col6" align="justify" colwidth="0.8cm"/>
     <oasis:colspec colnum="7" colname="col7" align="justify" colwidth="0.8cm"/>
     <oasis:colspec colnum="8" colname="col8" align="justify" colwidth="0.8cm"/>
     <oasis:colspec colnum="9" colname="col9" align="justify" colwidth="0.8cm"/>
     <oasis:colspec colnum="10" colname="col10" align="justify" colwidth="0.8cm"/>
     <oasis:colspec colnum="11" colname="col11" align="justify" colwidth="1.2cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Category</oasis:entry>

         <oasis:entry colname="col2">Model</oasis:entry>

         <oasis:entry colname="col3">Absolute accuracy</oasis:entry>

         <oasis:entry colname="col4">Balanced accuracy</oasis:entry>

         <oasis:entry colname="col5">Precision</oasis:entry>

         <oasis:entry colname="col6">F1 Score</oasis:entry>

         <oasis:entry colname="col7">ROC AUC</oasis:entry>

         <oasis:entry colname="col8">Log loss</oasis:entry>

         <oasis:entry colname="col9">Fitting time</oasis:entry>

         <oasis:entry colname="col10">Scoring time</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Baseline</oasis:entry>

         <oasis:entry colname="col2">Majority vote</oasis:entry>

         <oasis:entry colname="col3">0.39</oasis:entry>

         <oasis:entry colname="col4">0.14</oasis:entry>

         <oasis:entry colname="col5">0.15</oasis:entry>

         <oasis:entry colname="col6">0.22</oasis:entry>

         <oasis:entry colname="col7">not a number (nan)</oasis:entry>

         <oasis:entry colname="col8">nan</oasis:entry>

         <oasis:entry colname="col9"><inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="bold">1</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col10"><inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msup><mml:mn mathvariant="bold">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="bold">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <?xmltex \mrwidth{1.9cm}?><oasis:entry rowsep="1" colname="col1" morerows="4">Semi-supervised</oasis:entry>

         <oasis:entry colname="col2"><inline-formula><mml:math id="M12" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>-means</oasis:entry>

         <oasis:entry colname="col3">0.62</oasis:entry>

         <oasis:entry colname="col4">0.44</oasis:entry>

         <oasis:entry colname="col5">0.60</oasis:entry>

         <oasis:entry colname="col6">0.61</oasis:entry>

         <oasis:entry colname="col7">nan</oasis:entry>

         <oasis:entry colname="col8">nan</oasis:entry>

         <oasis:entry colname="col9">385</oasis:entry>

         <oasis:entry colname="col10">0.01</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">GMM</oasis:entry>

         <oasis:entry colname="col3">0.65</oasis:entry>

         <oasis:entry colname="col4">0.36</oasis:entry>

         <oasis:entry colname="col5">0.57</oasis:entry>

         <oasis:entry colname="col6">0.61</oasis:entry>

         <oasis:entry colname="col7">nan</oasis:entry>

         <oasis:entry colname="col8">nan</oasis:entry>

         <oasis:entry colname="col9">151</oasis:entry>

         <oasis:entry colname="col10"><italic><underline>0.008</underline></italic></oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">BGMM</oasis:entry>

         <oasis:entry colname="col3">0.65</oasis:entry>

         <oasis:entry colname="col4">0.38</oasis:entry>

         <oasis:entry colname="col5">0.63</oasis:entry>

         <oasis:entry colname="col6">0.63</oasis:entry>

         <oasis:entry colname="col7">nan</oasis:entry>

         <oasis:entry colname="col8">nan</oasis:entry>

         <oasis:entry colname="col9">225</oasis:entry>

         <oasis:entry colname="col10">0.009</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Self-trainer</oasis:entry>

         <oasis:entry colname="col3">0.69</oasis:entry>

         <oasis:entry colname="col4"><underline>0.67</underline></oasis:entry>

         <oasis:entry colname="col5"><underline>0.74</underline></oasis:entry>

         <oasis:entry colname="col6">0.71</oasis:entry>

         <oasis:entry colname="col7">0.92</oasis:entry>

         <oasis:entry colname="col8">0.84</oasis:entry>

         <oasis:entry colname="col9">19</oasis:entry>

         <oasis:entry colname="col10">0.29</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">Label propagation</oasis:entry>

         <oasis:entry colname="col3"><underline>0.71</underline></oasis:entry>

         <oasis:entry colname="col4">0.54</oasis:entry>

         <oasis:entry colname="col5">0.72</oasis:entry>

         <oasis:entry colname="col6"><underline>0.71</underline></oasis:entry>

         <oasis:entry colname="col7">0.92</oasis:entry>

         <oasis:entry colname="col8">1.5</oasis:entry>

         <oasis:entry colname="col9"><underline>10</underline></oasis:entry>

         <oasis:entry colname="col10">3.35</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <?xmltex \mrwidth{1.9cm}?><oasis:entry rowsep="1" colname="col1" morerows="4">Supervised</oasis:entry>

         <oasis:entry colname="col2">Random forest</oasis:entry>

         <oasis:entry colname="col3"><underline>0.73</underline></oasis:entry>

         <oasis:entry colname="col4">0.60</oasis:entry>

         <oasis:entry colname="col5">0.73</oasis:entry>

         <oasis:entry colname="col6"><underline>0.73</underline></oasis:entry>

         <oasis:entry colname="col7">0.93</oasis:entry>

         <oasis:entry colname="col8">0.70</oasis:entry>

         <oasis:entry colname="col9">72</oasis:entry>

         <oasis:entry colname="col10">0.97</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Balanced RF</oasis:entry>

         <oasis:entry colname="col3">0.70</oasis:entry>

         <oasis:entry colname="col4"><bold><underline>0.67</underline></bold></oasis:entry>

         <oasis:entry colname="col5"><underline>0.74</underline></oasis:entry>

         <oasis:entry colname="col6">0.71</oasis:entry>

         <oasis:entry colname="col7">0.92</oasis:entry>

         <oasis:entry colname="col8">0.84</oasis:entry>

         <oasis:entry colname="col9">9.9</oasis:entry>

         <oasis:entry colname="col10"><underline>0.58</underline></oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">SVM</oasis:entry>

         <oasis:entry colname="col3">0.71</oasis:entry>

         <oasis:entry colname="col4">0.66</oasis:entry>

         <oasis:entry colname="col5">0.73</oasis:entry>

         <oasis:entry colname="col6">0.71</oasis:entry>

         <oasis:entry colname="col7"><underline>0.93</underline></oasis:entry>

         <oasis:entry colname="col8"><underline>0.67</underline></oasis:entry>

         <oasis:entry colname="col9">19</oasis:entry>

         <oasis:entry colname="col10">7.45</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">KNN</oasis:entry>

         <oasis:entry colname="col3">0.71</oasis:entry>

         <oasis:entry colname="col4">0.54</oasis:entry>

         <oasis:entry colname="col5">0.71</oasis:entry>

         <oasis:entry colname="col6">0.71</oasis:entry>

         <oasis:entry colname="col7">0.89</oasis:entry>

         <oasis:entry colname="col8">3.58</oasis:entry>

         <oasis:entry colname="col9"><inline-formula><mml:math id="M13" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula><underline><italic>1</italic></underline></oasis:entry>

         <oasis:entry colname="col10">1.84</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">Easy ensemble</oasis:entry>

         <oasis:entry colname="col3">0.62</oasis:entry>

         <oasis:entry colname="col4">0.59</oasis:entry>

         <oasis:entry colname="col5">0.70</oasis:entry>

         <oasis:entry colname="col6">0.64</oasis:entry>

         <oasis:entry colname="col7">0.88</oasis:entry>

         <oasis:entry colname="col8">1.66</oasis:entry>

         <oasis:entry colname="col9">46</oasis:entry>

         <oasis:entry colname="col10">42.5</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <?xmltex \mrwidth{1.9cm}?><oasis:entry colname="col1" morerows="2">ANNs</oasis:entry>

         <oasis:entry colname="col2">LSTM</oasis:entry>

         <oasis:entry colname="col3"><italic>0.75</italic></oasis:entry>

         <oasis:entry colname="col4"><underline>0.58</underline></oasis:entry>

         <oasis:entry colname="col5"><italic>0.75</italic></oasis:entry>

         <oasis:entry colname="col6"><italic>0.75</italic></oasis:entry>

         <oasis:entry colname="col7"><bold><underline>0.94</underline></bold></oasis:entry>

         <oasis:entry colname="col8"><bold><underline>0.63</underline></bold></oasis:entry>

         <oasis:entry colname="col9"><underline>349</underline></oasis:entry>

         <oasis:entry colname="col10"><underline>2.3</underline></oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">BLSTM</oasis:entry>

         <oasis:entry colname="col3">0.74</oasis:entry>

         <oasis:entry colname="col4">0.58</oasis:entry>

         <oasis:entry colname="col5">0.74</oasis:entry>

         <oasis:entry colname="col6">0.73</oasis:entry>

         <oasis:entry colname="col7">0.93</oasis:entry>

         <oasis:entry colname="col8">0.79</oasis:entry>

         <oasis:entry colname="col9">975</oasis:entry>

         <oasis:entry colname="col10">3.4</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Encoder–decoder</oasis:entry>

         <oasis:entry colname="col3"><bold><underline>0.78</underline></bold></oasis:entry>

         <oasis:entry colname="col4">0.54</oasis:entry>

         <oasis:entry colname="col5"><bold><underline>0.78</underline></bold></oasis:entry>

         <oasis:entry colname="col6"><bold><underline>0.77</underline></bold></oasis:entry>

         <oasis:entry colname="col7"><italic>0.94</italic></oasis:entry>

         <oasis:entry colname="col8"><italic>0.64</italic></oasis:entry>

         <oasis:entry colname="col9">2911</oasis:entry>

         <oasis:entry colname="col10">5.8</oasis:entry>

         <oasis:entry colname="col11"/>

       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{1}?></table-wrap>

</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Classification performance of models</title>
      <p id="d1e1254">Overall, the results show that an automatic classification and segmentation of SMP profiles with ML algorithms is possible, even if no further information such as snow pit data or manual segmentation is provided.
Category-wise, none of the semi-supervised models performed particularly well (see Table <xref ref-type="table" rid="Ch1.T1"/>). Only the self-trainer could compete with models from other categories, but this might be the case because the self-trainer is based on a balanced random forest. The supervised models achieved mixed performances. Some models, such as the random forests and the SVM, clearly performed well, whereas others, such as the KNN and the easy ensemble, underperformed. Overall, the random forest was the best model in the supervised category since it achieved the highest absolute accuracy (0.73) and F1 score (0.73). However, considering rare classes, the balanced random forest outperformed the plain random forest. All three ANNs did exceptionally well, and their category was clearly the most successful among all three categories. The encoder–decoder showed the best scores among all models in terms of absolute accuracy, precision, and F1 score, closely followed by the LSTM. We consider the LSTM to be the best model within that category since the encoder–decoder only reached its high performance after extensive hyperparameter tuning and underperformed significantly when not tuned well. In contrast, the LSTM achieved its performance more consistently, even under moderate hyperparameter tuning, and it is thus more suitable for users.
The subsequent analyses compare the three models that performed best within their category: the LSTM performed best among the ANNs, the random forest among the supervised models, and the self-trainer among the semi-supervised models.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e1261">Predictions on the test dataset of the LSTM, random forest, and self-trainer. The upper-left panel shows the labeled data. In the other panels, the correct predictions are shown with more intense colors and the wrong predictions with less intense colors. The LSTM has the highest rate of correct predictions and imitates the smoothness of the labeled data very well. The random forest does well but provides more segmented predictions. The self-trainer immensely overestimates rare classes.
</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f02.png"/>

        </fig>

      <p id="d1e1270">Different ML models exhibited different prediction styles in terms of smoothness and ability to predict rare classes.
In Fig. <xref ref-type="fig" rid="Ch1.F2"/> it becomes visible that the models' predictions are not far off from the labels. In general, the predictions are somewhat similar to the labeled profiles, but the models often had difficulties in determining the precise start and end of a segment. Looking at three random exemplary profiles of the test data in Fig. <xref ref-type="fig" rid="Ch1.F3"/>, one can see that the three main models seem to not only generate similar predictions but also make similar mistakes. In the medium-depth profile (middle column), all three models predicted a longer segment of depth hoar that was not present in the labeled profile. In the shallow profile, all three models predicted some intermediate “depth hoar wind packed” layers in the first third that did not exist. In the deep profile, all three models miss the narrow intermediate depth hoar layer. In summary, it becomes apparent that the different models are producing consistent predictions to a certain degree. There are of course also significant differences among the models. First, the LSTM is closest to the labeled profiles (see Fig. <xref ref-type="fig" rid="Ch1.F3"/>). Second, the LSTM provided much smoother and less fragmented predictions than the other two models. Third, the self-trainer clearly overestimates rare classes, which hurts the overall performance.
To summarize, the LSTM, random forest, and self-trainer show certain prediction similarities among each other; however, the LSTM imitates expert labeling best.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e1282">Model predictions for three randomly chosen SMP profiles. The first row represents the labeled profiles (with force signal). The subsequent rows represent the LSTM, random forest, and self-trainer predictions, with the red bar indicating wrong predictions. Each column shows a different profile randomly chosen from the test data (shallow profile: S31H0276; medium profile: S31H0206; deep profile: S49M1918). All three models seem to make similar mistakes, e.g., they predict a larger portion of depth hoar at the end of the medium SMP profile. The predictions of the LSTM are closest to the labeled profiles.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f03.png"/>

        </fig>

</sec>
<?pagebreak page4527?><sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Classification difficulty of snow types</title>
      <p id="d1e1299">Figure <xref ref-type="fig" rid="Ch1.F4"/> shows that some snow types are easier and others are harder to classify.
The label-wise accuracy seems to be influenced by the following factors: (1) the choice of model, (2) the frequency of the snow type in the dataset, and (3) the snow type itself. Within one snow type category, the models perform differently; however, some snow types seem to be easier, while others are more difficult to classify for all models. For example, rounded grains wind packed achieved a high accuracy among all models, whereas depth hoar wind packed achieved a low accuracy among all models. This could be partially attributed to the fact that there are fewer samples available for depth hoar wind packed. However, the snow types themselves seem to influence the classification difficulty as well: the precipitation particles class achieves high accuracy values among some models, despite the fact that it is the rarest class in the dataset. For some snow types, some models are able to access certain information, enabling a high performance for that particular snow type that is independent of its frequency. This means that the classification difficulty does not solely depend on the number of available samples. Instead, several other underlying characteristics determine the classification difficulty of each snow type as well, most notably (1) the initial classification, which is not always consistent; (2) the underlying micro-mechanical properties, i.e., some snow types have characteristic force signals that separate them more clearly from others; and (3) the training dataset, since it does not cover all types of force signals.</p>
      <p id="d1e1304">Depending on the model, a higher accuracy score could lead to a lower precision score for a label (accuracy–precision trade-off).
The ROC curve in Fig. <xref ref-type="fig" rid="Ch1.F5"/> illustrates this relationship between the true-positive and false-positive rates for the different snow types and their averaged performances. It becomes apparent that both the snow type and the choice of model influence the accuracy–precision trade-off. For example, the rare class seems to be difficult to classify both accurately and precisely for all models, whereas precipitation particles show an almost perfect ROC curve. If one is interested in choosing a model that performs well for a particular snow type, these ROC curves can reveal which model is most suitable. To get even more detailed label- and model-wise insights, refer to the confusion matrices in Appendix <xref ref-type="sec" rid="App1.Ch1.S8"/>. Both the LSTM and the random forest achieve an area under the ROC curve of 0.96. However, on average (see Fig. <xref ref-type="fig" rid="Ch1.F5"/>, dotted pink line), the LSTM outperforms the self-trainer and random forest and is thus most suitable for general classification tasks.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e1315">Label-wise accuracy of all models. Each model is encoded with a different color. The most frequent label is on the left of the <inline-formula><mml:math id="M14" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis (rounded grains wind packed), and the least frequent is on the right (precipitation particles). The rare class was dropped. Each bar represents the accuracy for a single snow type. The dotted lines show the overall accuracy performance of each model. The encoder–decoder, the BLSTM, and the LSTM achieved the highest accuracy values. For all models, some classes are more difficult to classify than others, e.g., depth hoar indurated and depth hoar wind packed. Some classes are easier to classify than others, such as rounded grains wind packed. Some classes can only be classified well by a subset of the models, such as precipitation particles and melted form depth hoar.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f04.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e1334">ROC curves of the LSTM, random forest, and self-trainer for each class. The dotted lines are the micro- and macro-averaged ROC curves. The macro-average calculates the ROC for each class and averages the performances afterwards. The micro-average weights the performance according to class contribution (balanced performance results). The LSTM achieves the highest ROC performance overall. The order of the best-performing snow types is similar among all models. The rare and depth hoar indurated classes have the lowest ROC areas, whereas the precipitation particles class has the highest ROC area for all models.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Generalizability</title>
      <p id="d1e1351">The prediction of the LSTM for 100 random profiles outside of the training and testing distribution is shown in Fig. <xref ref-type="fig" rid="Ch1.F6"/>.
Since the labeled profiles are not yet available for these predictions, the generalization capabilities can only be<?pagebreak page4528?> evaluated on the basis of what seems “reasonable”. Melted form depth hoar appears only at the ground of the profiles, precipitation particles appears only at the top, and rounded grains wind packed appears mostly at the top and rather deep – these are all reasonable predictions. However, there are also some predictions that are not reasonable or at least unexpected: the left profile consists almost entirely of depth hoar wind packed, sometimes depth hoar wind packed appears right before melted form depth hoar, and rounded grains wind packed sometimes appears briefly in the “middle” of a profile (and not at the top).
Overall, the LSTM seems to make mostly reasonable predictions; however, an in-depth expert analysis of the predictions is necessary to validate that further.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e1358">LSTM SMP profile predictions on out-of-distribution data. The SMP profiles used here come from different legs of the MOSAiC expedition than the training, validation, and test data. The profiles used here still stem from the winter season to ensure that the same set of snow types can be used as in the training dataset. The distribution of the predicted profiles looks convincing, with only a few profiles standing out as certainly wrong predictions (e.g., the rightmost profile with <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">90</mml:mn></mml:mrow></mml:math></inline-formula> % depth hoar wind packed).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f06.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
      <p id="d1e1386">The results showed that the automatic classification of SMP profiles is possible with up to 78 % accuracy. In the following, the nature, impact, and limits of these results are discussed.</p>
      <p id="d1e1389">The quantitative results presented are in line with previous findings. <xref ref-type="bibr" rid="bib1.bibx32" id="text.54"/> reported an overall accuracy score of 0.76 when using SVMs and additional snow pit information to classify three snow types. <xref ref-type="bibr" rid="bib1.bibx57" id="text.55"/> achieved an average accuracy of 0.81 when using the nearest-neighbor approach and knowledge rules to classify five snow types. However, these results stem from only three profiles and are not representative. <xref ref-type="bibr" rid="bib1.bibx21" id="text.56"/> achieved a maximum accuracy of 0.76 (global dataset) when using random forests and time-intensive manual layer segmentation to classify three snow types. The major difference from these previous results is that the accuracy results<?pagebreak page4529?> of this study were achieved for seven snow types, without time-intensive layer picking, snow pit digging, or additional knowledge rules. This means that, in contrast to previous work, the models here can be directly employed by users for their own SMP datasets in the field: simply retrain and predict.
For this, they only need to provide a set of training samples for their specific dataset and classification style. The work presented here enables scientists for the first time to rely on fully automated ML SMP profile classification and segmentation.</p>
      <p id="d1e1401">The results were also satisfying to domain experts since the predictions were consistent within themselves and followed the patterns of the training data. In general, the snowpack on sea ice is extremely variable, and the traditional snow types are often a mixture of different features. This becomes visible when comparing the SMP profiles to the<?pagebreak page4530?> micro-CT samples. In the view of the authors, a temporally consistent classification is more relevant to the interpretation of the development of the snowpack, even if there is a certain, but unknown, bias to an expert interpretation. Hence, the models were in practice also helpful when analyzing Arctic snowpack development.</p>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Classification performance of models</title>
      <p id="d1e1411">Each model category performs differently because each model takes different aspects of the data into account.
Semi-supervised models try to take unlabeled data into account to improve their predictions; however, this did not work well in our context. The most likely reason for the overall underperformance of this category is that the unlabeled data contained out-of-distribution data, i.e., the unlabeled data had different underlying mechanisms than the labeled data (different parts of the winter season). Another reason might be that only a small subset of unlabeled data were included in order to limit running times. Moreover, the poor performance of the cluster-then-predict models is most likely also a result of the classifier used after clustering: a more sophisticated method than a majority vote classifier is needed here.</p>
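      <p>The majority vote step of a cluster-then-predict model can be sketched as follows. This is a minimal illustration under stated assumptions, not the snowdragon implementation: the cluster assignments are taken as given, and the function names <monospace>fit_majority_vote</monospace> and <monospace>predict_majority_vote</monospace> as well as the toy data are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import Counter


def fit_majority_vote(cluster_ids, labels):
    """Map each cluster to the most common label among its labeled members.

    Unlabeled points are marked with None and do not vote.
    (Hypothetical helper, for illustration only.)
    """
    votes = {}
    for cid, label in zip(cluster_ids, labels):
        if label is not None:
            votes.setdefault(cid, Counter())[label] += 1
    return {cid: counter.most_common(1)[0][0] for cid, counter in votes.items()}


def predict_majority_vote(cluster_ids, cluster_to_label, default=None):
    """Assign every point the majority label of its cluster."""
    return [cluster_to_label.get(cid, default) for cid in cluster_ids]


# Toy data: eight points in three clusters, two of them unlabeled.
cluster_ids = [0, 0, 0, 1, 1, 1, 2, 2]
labels = ["DH", "DH", "DHid", "RGwp", None, "RGwp", None, "PP"]

mapping = fit_majority_vote(cluster_ids, labels)
predictions = predict_majority_vote(cluster_ids, mapping)
```
]]></preformat>
      <p>In this toy example, the single depth hoar indurated point in cluster 0 is predicted as depth hoar, because every member of a cluster receives the same label. This illustrates why a majority vote cannot separate snow types that share a cluster.</p>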
      <p id="d1e1414">The simple supervised models take one data point after the other into account and do not consider time series structures within the data. The algorithms used in all previous SMP automation studies fall into this category. In contrast, ANNs are supervised models that take the underlying time sequence of the data into account. While the simple supervised models in general performed well, they were still clearly outperformed by the ANNs. A likely reason why the ANNs outperformed all the other models is precisely the ANNs' ability to process time-dependent – or in the case of snow profiles depth-dependent – information. ANNs tackle the classification task as a sequence labeling task, which enables them to include information from the order and position of snow layers. The simple supervised models still have access to time-relevant information (time window features; see Appendix <xref ref-type="sec" rid="App1.Ch1.S3.SS1"/>); however, they cannot learn time-based information (what should be remembered and forgotten). Besides, the ANNs learn to imitate the training set, leading to smooth and expert-like predictions.
In comparison, the time component of SMP signals has not been taken into account<?pagebreak page4531?> in previous methods, and we argue that it adds a major piece of information and boosts the overall prediction performance significantly.</p>
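      <p>To illustrate what time window features give a point-wise model, the following sketch computes a centered rolling mean and variance of a force signal. The window size, the choice of statistics, and the name <monospace>window_features</monospace> are assumptions made for this example; the features actually used are described in the appendix referenced above.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from statistics import mean, pvariance


def window_features(force, window=5):
    """Return (rolling mean, rolling variance) for each sample.

    The window is centered on the sample and shrinks at the profile edges.
    (Hypothetical feature set, for illustration only.)
    """
    half = window // 2
    features = []
    for i in range(len(force)):
        start = max(0, i - half)
        stop = min(len(force), i + half + 1)
        chunk = force[start:stop]
        features.append((mean(chunk), pvariance(chunk)))
    return features


# Toy force signal: a soft layer followed by a harder layer.
force = [0.1, 0.1, 0.2, 1.0, 1.1, 0.9, 1.0]
feats = window_features(force, window=3)
```
]]></preformat>
      <p>Each sample now carries a fixed summary of its neighborhood, but the model still classifies samples one by one. A sequence model, in contrast, can learn which context to keep and which to discard.</p>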
      <p id="d1e1419">Each model exhibits a different prediction style due to the models' intrinsic differences and thus might be suitable for specific tasks.
The following aspects are listed for consideration (user's guide).
<list list-type="custom"><list-item><label>A.</label>
      <p id="d1e1424"><italic>Time and resources for hyperparameter tuning.</italic> The LSTM and the encoder–decoder network are recommended when plenty of tuning time is available. The encoder–decoder network performs especially badly if not tuned well. The SVM and the balanced random forest need little tuning time, whereas the random forest is the go-to model in cases where (almost) no tuning time can be provided.</p></list-item><list-item><label>B.</label>
      <p id="d1e1430"><italic>Need for a simple to handle, off-the-shelf algorithm.</italic> Among the high-performing models, the random forest and the SVM are the easiest off-the-shelf algorithms to handle. The semi-supervised algorithms and especially the ANNs require a somewhat deeper understanding of the models and the ability to implement them.</p></list-item><list-item><label>C.</label>
      <p id="d1e1436"><italic>Desired level of explainability.</italic> The random forests are most explainable since the decision trees can be directly visualized (Appendix <xref ref-type="sec" rid="App1.Ch1.S7"/>). The ANNs are the least explainable models (without further modifications).</p></list-item><list-item><label>D.</label>
      <p id="d1e1444"><italic>Importance of minority classes.</italic> When deciding on a model, the underlying task must be examined as well. In the case of avalanche prediction, it might be essential to predict a buried layer of surface hoar, a very uncommon class, which needs to be detected no matter the cost. In such a case of “minority class prediction,” the balanced random forest or the SVM should be employed. The ANNs and the random forest, in contrast, are more suitable to achieve an overall good classification.</p></list-item><list-item><label>E.</label>
      <p id="d1e1450"><italic>Availability of unlabeled data that are from the same distribution as the labeled data.</italic> In cases where a lot of unlabeled data from the same distribution and time are available, the self-trained classifier can be considered. The weak learner of the self-trained classifier can be chosen according to the criteria listed above. Since in this work we only had a small subset of unlabeled data stemming from the same distribution as the labeled data, further evaluations of the self-trained classifier and label propagation remain open.</p></list-item></list>
This highlights that there is no single best model and that users can instead deliberately choose a model that suits their needs in terms of overall accuracy, ability to predict rare classes, explainability, and training and deployment time.</p>
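      <p>Consideration D can be made concrete with a small sketch that contrasts overall accuracy with per-class recall. The two sets of predictions below are invented toy data, not results from this study, and the helper names are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_recall(y_true, y_pred):
    """Fraction of each true class that was predicted correctly."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


# Toy data: surface hoar (SH) is the rare class.
y_true = ["RGwp"] * 8 + ["SH"] * 2

# Model A never predicts the rare class.
pred_a = ["RGwp"] * 10
# Model B finds all SH samples but mislabels two RGwp samples as SH.
pred_b = ["RGwp"] * 6 + ["SH"] * 2 + ["SH"] * 2

acc_a, acc_b = accuracy(y_true, pred_a), accuracy(y_true, pred_b)
recall_a = per_class_recall(y_true, pred_a)
recall_b = per_class_recall(y_true, pred_b)
```
]]></preformat>
      <p>Both toy models reach the same overall accuracy, yet only the second one detects the rare class. When a minority class matters, per-class metrics are therefore more informative than accuracy alone.</p>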

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e1459">Two-dimensional t-distributed stochastic neighbor embedding (t-SNE) of SnowMicroPen (SMP) dataset. The colors encode the snow types. The figure shows that (1) depth hoar and depth hoar indurated are hardly separable; (2) depth hoar wind packed is similar to several other snow types; and (3) precipitation particles, melted form of depth hoar, and rounded grains wind packed can each be separated more clearly from the other snow types.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f07.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Classification difficulty of snow types</title>
      <?pagebreak page4532?><p id="d1e1477">Snow types are difficult to classify since their categories are continuous rather than discrete.
This was also observed in previous work, and in all previous work performances were reported label-wise to account for those differences <xref ref-type="bibr" rid="bib1.bibx57 bib1.bibx21 bib1.bibx32" id="paren.57"/>.
We performed t-distributed stochastic neighbor embedding (t-SNE) on the SMP dataset to visualize how separable the different classes are (see Fig. <xref ref-type="fig" rid="Ch1.F7"/>). The precipitation particles class, for example, appears as a distinct green grouping, which is in line with our (and previous) findings <xref ref-type="bibr" rid="bib1.bibx57" id="paren.58"/> that it is easier to classify than other snow types. We conclude that some classes have features distinguishing them more strongly from other snow types. The rounded grains wind packed class behaves in a similar way <xref ref-type="bibr" rid="bib1.bibx57" id="paren.59"/>. However, some classes, such as depth hoar and depth hoar indurated, overlap completely in Fig. <xref ref-type="fig" rid="Ch1.F7"/>, and indeed our models had problems differentiating between those two classes. Similarly, depth hoar wind packed seems to overlap largely with rounded grains wind packed and melted form of depth hoar. We theorize that the reason for their non-separability is that those snow types transform into each other during snow metamorphosis. This means many data points cannot be discretized into one single category since they are on a continuous spectrum. <xref ref-type="bibr" rid="bib1.bibx57" id="text.60"/> also pointed out that they often found data points that were in transition between snow classes and attributed this to the fact that the snow is changing continuously.
In conclusion, it is currently impossible to reach 100 % classification accuracy on every snow type since some data points will always lie between categories.</p>
      <p id="d1e1497">Despite these difficulties, the underlying SMP signals are still characteristic enough for specific snow types to be classified successfully. The different micro-mechanical properties of the snow types are reflected in the SMP signal and are thus the driver for the classification. Some classes, such as precipitation particles, can be clearly separated from others since the bonding between the grains is so weak that the force signal is very low. As long as precipitation particles do not share this characteristic with other snow types, they can be easily classified. Refer to Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/> to learn more about the relation between snow types and SMP signal, and refer to Appendix <xref ref-type="sec" rid="App1.Ch1.S7"/> to see which classes have unique signal characteristics and which classes have shared signal characteristics.</p>
      <p id="d1e1504">The classification difficulties also extend to the expert labeling process itself. The continuous nature of the snow types makes it particularly difficult for domain experts to agree on labeling, i.e., two different snow experts will produce two different labeled and segmented profiles for the same SMP measurement <xref ref-type="bibr" rid="bib1.bibx22" id="paren.61"/>. This is another reason why a classification accuracy of 100 % cannot be reached. One might suggest supplementing the classification process with additional observational data to make the process more “objective” (as we also do here). However, each classification and segmentation of a snowpack is “subjective” in nature right now, no matter which observational data are used as the basis for the classification. When requesting a segmentation and classification of a snowpack, one is always requesting the classification of a specific expert. While the operator bias can be mitigated by using NIR, Micro-CTs, or the SMP, the classification of those measurements remains subjective. It is neither this study's goal nor its task to provide an objective classification; instead, we aim for a consistent classification.</p>
      <p id="d1e1510">However, difficulties in reaching 100 % accuracy do not preclude overall good performance. While experts may end up with different segmentations and classifications, they can still agree that two different analyses are both valid analyses of the same profile. Similarly, the algorithms provided here output predictions that may not always align with the expert labeling but are sensible and directly usable. Hence, we cannot evaluate the models solely based on numerical metrics, such as accuracy, but must also evaluate the performance from a qualitative perspective. This is the reason why we evaluated whether an SMP user, who also labeled the training data, would (1) accept the predictions of the ML algorithms on an out-of-distribution dataset, (2) find them<?pagebreak page4533?> consistent with their own labeling, and (3) subsequently work with those predictions. In the case of the MOSAiC dataset, all those aspects were fulfilled. We find such a qualitative assessment important since these questions decide whether or not the tools provided will be used in practice.</p>
      <p id="d1e1514">We also want to point out that the algorithms themselves are entirely agnostic to the question of “subjectivity”. The algorithms are merely reproducing what they have been trained on. If we can provide the algorithms with a dataset that can be considered “fully objective”, and the community agrees on that as ground truth data, the algorithms could reproduce those hypothetical objective labels. Alternatively, signals could also be grouped first, and some abstract classes could be assigned to them. Nevertheless, even this would rely on human expertise since the parameters to separate those groups would be subject to discussion (see Fig. <xref ref-type="fig" rid="Ch1.F7"/>; the groups are not simply separable from each other, and the clustering would depend on parameter choices). In general, we provide a methodological framework here to classify and segment SMP profiles. Which classification patterns are reproduced depends on the user's choice.</p>
      <p id="d1e1519">The benefits of using an automatic classification are that the SMP user can (1) save valuable time, (2) receive consistent labeling, and (3) perform statistical analysis on their SMP dataset. In the case of the MOSAiC dataset, manual labeling would have meant labeling over 3000 profiles, which can easily take up to a year (next to other obligations of domain experts). In terms of consistency, we already experienced how some of the models' predictions helped us – to our surprise – to detect human mistakes and inconsistencies during the first labeling round. Furthermore, such an upscaled classification enables, for the first time, the statistical analysis of an SMP dataset. One of the initial research questions for MOSAiC was “Is depth hoar in Arctic snowpacks mostly present at the bottom and rounded grains wind packed at the top?”. With the help of snowdragon, the MOSAiC dataset could be labeled consistently and accurately enough to answer such a question with “Yes, this is indeed the case”.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Generalizability</title>
      <p id="d1e1530">The LSTM can generalize to other winter profiles with the same snow types since the underlying classification and segmentation rules stay the same.
However, the LSTM's generalization capability does not extend to other seasons or regions when and where other snow types are found, such as melted forms or regional snow types. As mentioned before, the models do not generalize regarding the different classification styles of experts. The models used in this work are still generalizable in that they can be used on any desired dataset as long as they are retrained on the chosen dataset. This would not have been possible in previous works, such as <xref ref-type="bibr" rid="bib1.bibx57" id="text.62"/>, since knowledge rules for one snow region and season do not transfer to other regions or seasons.
For greater generalization capability, the LSTM – or any other model – must be either trained with a more general dataset or specifically retrained for an individual dataset.</p>
</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Limitations and future work</title>
      <p id="d1e1544">As previously discussed, the uncertainty in expert labeling is a general limitation of this particular study. While this uncertainty might be partially mitigated by using a dataset for which many additional in situ observations exist, it would still remain an issue. One approach for future work would be to quantify the uncertainty inherent in the labeled profiles. Subsequently, a machine learning model could be trained to not only classify snow types but also provide a probabilistic classification.</p>
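      <p>One possible reading of a probabilistic classification is that the model outputs a probability for every snow type instead of a single label, so that ambiguous samples can be flagged. The following sketch assumes hypothetical raw class scores and uses the softmax function and the Shannon entropy; it is not part of the models presented here.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def softmax(scores):
    """Turn raw class scores into probabilities that sum to one."""
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; higher values mean a less certain prediction."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical scores for the classes DH, DHid, and RGwp at two depths.
confident = softmax([4.0, 0.5, 0.2])
ambiguous = softmax([2.0, 1.9, 0.1])

uncertainty_confident = entropy(confident)
uncertainty_ambiguous = entropy(ambiguous)
```
]]></preformat>
      <p>The second sample, with nearly equal scores for depth hoar and depth hoar indurated, has the higher entropy. Such a measure could mark samples in transition between snow types.</p>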
      <p id="d1e1547">This work does not address the task setting of a first-segment-then-classify algorithm because this would require a completely different set of methods.
In a first-segment-then-classify setting, the SMP signal could first be segmented with techniques used in audio segmentation <xref ref-type="bibr" rid="bib1.bibx66" id="paren.63"/>. The resulting time series pieces could subsequently be classified as a whole <xref ref-type="bibr" rid="bib1.bibx25" id="paren.64"/>.
Future work could experiment with this problem formulation and analyze if performance further increases in this setting.</p>
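      <p>The structure of a first-segment-then-classify pipeline can be sketched with a deliberately simple stand-in for both steps. The jump threshold, the abstract labels “soft” and “hard”, and the function names are hypothetical; audio segmentation techniques and time series classifiers would replace these toy rules in practice.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from statistics import mean


def segment(signal, jump=0.5):
    """Split a signal into (start, stop) index pairs at large jumps.

    A new segment starts wherever consecutive samples differ by more
    than `jump`. (Toy rule, for illustration only.)
    """
    bounds = [0]
    for i in range(1, len(signal)):
        if abs(signal[i] - signal[i - 1]) > jump:
            bounds.append(i)
    bounds.append(len(signal))
    return list(zip(bounds[:-1], bounds[1:]))


def classify_segment(values, soft_limit=0.5):
    """Label a whole segment from its mean force (abstract toy classes)."""
    return "soft" if mean(values) < soft_limit else "hard"


# Toy force signal with a hard layer between two soft layers.
signal = [0.1, 0.2, 0.1, 1.2, 1.3, 1.1, 0.2, 0.1]
segments = segment(signal)
segment_labels = [classify_segment(signal[a:b]) for a, b in segments]
```
]]></preformat>
      <p>In contrast to the sequence labeling used in this work, each segment here receives exactly one label, so segmentation errors cannot be corrected by the classifier afterwards.</p>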
      <p id="d1e1556">The ANNs used here are off-the-shelf networks and are not adapted to the specific underlying task in order to ensure a fair comparison between the different models.
However, one could look into adapting the loss functions to include similarity measurements between snow samples. Results from clustering, performed on t-SNE data, could then be leveraged during classification to increase classification performance.
Adapting the loss function of the ANNs could increase prediction performance greatly; however, such a loss function must be carefully constructed and evaluated on different datasets.</p>
      <p id="d1e1559">As mentioned in Sect. <xref ref-type="sec" rid="Ch1.S4.SS3"/>, the models cannot generalize to completely different settings in terms of seasons and regions.
To ensure generalization capability one could train a large model on a dataset that includes snow types from different regions and seasons. Such a dataset would need to be newly compiled because common SMP datasets are usually limited to one region <xref ref-type="bibr" rid="bib1.bibx45 bib1.bibx5" id="paren.65"/>. In theory, a large enough model trained on a large enough dataset could be able to produce direct predictions for any SMP users. Thus, it would be interesting to train an ML model on a generalized dataset and validate its performance on the specialized MOSAiC SMP dataset. This would shed new light on the spatiotemporal transferability of the ML models presented here.</p>
      <p id="d1e1568">Alternatively, SMP users can simply retrain a chosen model for their particular dataset. They need to provide a set of SMP profiles for their region, season, and classification style, but the overall time savings are still immense.
To summarize, the generalization capabilities may be enhanced by using a more general dataset, or one can bypass this problem<?pagebreak page4534?> by retraining to specific datasets. The snowdragon repository addresses the needs of the latter.</p>
      <p id="d1e1571">An immediate consequence of this study is the further analysis of the unlabeled part of the MOSAiC dataset.
Domain experts can use the LSTM or other models to create predictions for the remaining 3516 profiles. The previously almost impossible task of classifying and segmenting those thousands of profiles became feasible by providing just a set of 164 labeled profiles.
The results of these predictions and their impacts on the cryospheric analysis of snow coverage in the Arctic will become apparent in future publications.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d1e1584">Snowdragon provides SMP users with a way to upscale manual SMP labeling and provide large statistically consistent datasets.
We showed for the first time that SMP profiles straight from the field can be automatically segmented and classified (up to <inline-formula><mml:math id="M16" display="inline"><mml:mn mathvariant="normal">0.78</mml:mn></mml:math></inline-formula> accuracy). A total of 14 different models were trained here to classify seven snow types without providing any additional manual information. We also showed for the first time how ANNs and semi-supervised models can be used for the task of SMP classification and segmentation. Among all models, the LSTM and the encoder–decoder models perform the best. The resulting predicted profiles show smooth segmentations and expert-like classification patterns that were satisfying to domain experts.</p>
      <p id="d1e1594">These findings will enable SMP users to automatically analyze their SMP measurements. To that end, an SMP user must simply decide on 1 of the 14 models provided by the snowdragon repository given the considerations listed in this paper and retrain the model for their particular dataset. Afterwards, the SMP user can simply predict SMP classifications for the remaining unlabeled profiles.</p>
      <p id="d1e1597">The models presented here, in particular the LSTM model, could be trained on a broad dataset from different regions and seasons so that automatic SMP classification becomes even more accessible. Such a model could even be integrated into the snowmicropyn package. The resulting tool would make knowledge about snowpacks easier and faster to access for all scientists. This is of particular interest (1) for interdisciplinary scientists that rely on snow type information but do not have the tools to classify them themselves (remote sensing); (2) for scientists that require fast analysis of SMP profiles, such as in avalanche prediction; and (3) for SMP users facing large datasets.</p>
      <p id="d1e1600">Snowdragon enables the analysis of the SMP MOSAiC dataset, a dataset containing detailed information about snow on Arctic sea ice. In times of climate change, this information is crucial. We need to understand the state of the sea ice in order to understand which state the Arctic system is in. For the first time, MOSAiC enables the scientific community to have access to such a detailed and large dataset. Snowdragon is one example of how ML can help us to actually access the knowledge behind all the data.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <?xmltex \currentcnt{A}?><label>Appendix A</label><title>User's guide</title>
      <p id="d1e1614">Here, we provide a walk-through of how to use snowdragon with SMP profiles collected in the field.
<list list-type="order"><list-item>
      <p id="d1e1619"><italic>Data collection.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1626">Collect the desired SMP profiles.</p></list-item><list-item>
      <p id="d1e1630">If you are familiar with snow stratigraphy measurements, you should consider collecting additional in situ observations such as Micro-CTs, NIR photography, or similar to inform your labeling procedure (see also points listed under “labeling”).</p></list-item><list-item>
      <p id="d1e1634">If you are not familiar with snow stratigraphy measurements, you should ask experts if a labeled dataset for your snow conditions exists (<xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx69 bib1.bibx33" id="altparen.66"/>) or if you need to onboard an expert to conduct a few in situ observations and label some of your profiles.</p></list-item></list></p></list-item><list-item>
      <p id="d1e1641"><italic>Labeling.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1648">Evaluate the following questions before you start the data collection.</p></list-item><list-item>
      <p id="d1e1652">If you conduct your own labeling, proceed through the following steps:
<list list-type="bullet"><list-item>
      <p id="d1e1657">use additional in situ observations to fine-tune your labeling where possible;</p></list-item><list-item>
      <p id="d1e1661">ask a fellow researcher for their opinion on a few profiles (before you label all of them);</p></list-item><list-item>
      <p id="d1e1665">note down your labeling criteria – this way you can ensure consistency in your labeling;</p></list-item><list-item>
      <p id="d1e1669">revisit your labeled profiles (all of them!) at least a second time because you can catch mistakes and ensure once more that there is consistency in your labeling.</p></list-item></list></p></list-item><list-item>
      <p id="d1e1673">If a labeled dataset exists for a specific location, proceed through the following questions and analyze carefully whether the labeled data transfer to your snow conditions. Can you expect the same snow types? Were the data collected in the same location or a similar location? Is it the same season? Might changing climatic conditions have also changed the nature of the snowpacks? Has the environment of the location gone through other types of changes?</p></list-item><list-item>
      <p id="d1e1677">If labeled datasets exist capturing SMP profiles in general, proceed through the following questions and analyze carefully if you can work with a general dataset or need a specialized labeled dataset. Does the general dataset reflect the profiles you<?pagebreak page4535?> have collected well? Do you have snow types dominating your dataset that are a minority in the general dataset? Do you have a particular season dominating your dataset that is underrepresented in the general dataset? Does the general dataset contain all snow types that you have encountered in your dataset?</p></list-item></list></p></list-item><list-item>
      <p id="d1e1681"><italic>Setup.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1688">Preprocess your raw SMP profiles and labels if necessary; data must be provided in <monospace>.pnt</monospace> format.</p></list-item><list-item>
      <p id="d1e1695">Establish a consistent naming convention for your profiles. The labeling files (in <monospace>.ini</monospace> format) should have the same file name as the SMP profile that belongs to that labeling file. For example, you can have a <monospace>S31H0370.ini</monospace> containing the label markers for the force file <monospace>S31H0370.pnt</monospace>.</p></list-item><list-item>
      <p id="d1e1708">Clone or fork the snowdragon repository (<uri>https://github.com/liellnima/snowdragon</uri>).</p></list-item><list-item>
      <p id="d1e1715">Follow the setup guide in the GitHub repository.</p></list-item><list-item>
      <p id="d1e1719">Tell the repository where your raw data lives. Change the <monospace>SMP_LOC</monospace> in <monospace>data_handling/data_parameters.py</monospace> to the right path as described online.</p></list-item><list-item>
      <p id="d1e1729">Preprocess all the SMP profiles (follow online guidelines).</p></list-item></list></p></list-item><list-item>
      <p id="d1e1733"><italic>Model selection.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1740">Select the right model for your use case. Refer to Sect. <xref ref-type="sec" rid="Ch1.S4.SS1"/> for further information.</p></list-item></list></p></list-item><list-item>
      <p id="d1e1746"><italic>Training and evaluation.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1753">Refer to the online guide of the repository.</p></list-item></list></p></list-item><list-item>
      <p id="d1e1757"><italic>Tuning.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1764">Refer to the online guide of the repository.</p></list-item></list></p></list-item><list-item>
      <p id="d1e1768"><italic>Inference.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1775">Use the <monospace>predict_profile()</monospace> or <monospace>predict_all()</monospace> functions from the <monospace>predict.py</monospace> file (provide path to data again). The functions can either be directly used or further adapted to your particular needs. The model you choose for inference must be stored somewhere, meaning you either need to train it beforehand or download the pre-trained models we provide.</p></list-item></list></p></list-item><list-item>
      <p id="d1e1788"><italic>Analysis.</italic>
<list list-type="bullet"><list-item>
      <p id="d1e1795">Conduct your specific analysis on the labeled profiles. Run visualizations if desired, as explained in the online guide.</p></list-item></list></p></list-item></list></p>
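      <p>The naming convention from the setup step can be checked before preprocessing. The following sketch lists profiles without a label file and label files without a profile; the function name <monospace>unpaired_profiles</monospace> and the temporary directory are assumptions for this example and are not part of the snowdragon repository.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import tempfile
from pathlib import Path


def unpaired_profiles(data_dir):
    """Compare .pnt and .ini file stems in a directory.

    Returns (profiles without labels, labels without profiles).
    (Hypothetical helper, for illustration only.)
    """
    data_dir = Path(data_dir)
    pnt_stems = {path.stem for path in data_dir.glob("*.pnt")}
    ini_stems = {path.stem for path in data_dir.glob("*.ini")}
    return sorted(pnt_stems - ini_stems), sorted(ini_stems - pnt_stems)


# Demonstration on empty placeholder files in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("S31H0370.pnt", "S31H0370.ini", "S31H0371.pnt"):
        (Path(tmp) / name).touch()
    missing_labels, missing_profiles = unpaired_profiles(tmp)
```
]]></preformat>
      <p>Profiles without a label file are expected for the unlabeled part of a dataset. A label file without a matching profile, however, usually points to a naming mistake.</p>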
</app>

<app id="App1.Ch1.S2">
  <?xmltex \currentcnt{B}?><label>Appendix B</label><title>Labeling</title>
      <p id="d1e1806">A snow micropenetrometer (SMP) is a device used to determine the bond strength between snow grains within a snowpack. The micro-structural and micro-mechanical properties of the snow, for example, density and specific surface area (SSA), directly influence the bond strength. When an SMP penetrates the snowpack and breaks these bonds between the snow grains, we are able to directly infer these micro-structural properties, as shown in the existing method by <xref ref-type="bibr" rid="bib1.bibx53" id="text.67"/>. For example, snow with high density has a higher bond strength and therefore a higher penetration resistance force (measurable with the SMP) in comparison to low-density snow.</p>
      <p id="d1e1812">Different types of snow <xref ref-type="bibr" rid="bib1.bibx15" id="paren.68"/> are known to have different densities and SSAs, so the extraction of these data from the SMP force signal already allows us to draw pivotal conclusions about the snow type. However, the characteristics (magnitude, frequency, and gradient) and the signature of the penetration force signal can provide more information about the internal snow type. This document outlines the process of classification of a snow type found on sea ice in the High Arctic using the SMP penetration resistance force signal.</p>
      <p id="d1e1818">Typical grains observed as part of the MOSAiC expedition on sea ice in the High Arctic are listed below:
<list list-type="bullet"><list-item>
      <p id="d1e1823">precipitation particles (PP) or decomposing and fragmented precipitation particles (DF);</p></list-item><list-item>
      <p id="d1e1827">ice formations (IF);</p></list-item><list-item>
      <p id="d1e1831">surface hoar (SH);</p></list-item><list-item>
      <p id="d1e1835">rounded grains wind packed (RGwp);</p></list-item><list-item>
      <p id="d1e1839">depth hoar (DH);</p></list-item><list-item>
      <p id="d1e1843">depth hoar indurated (DHid);</p></list-item><list-item>
      <p id="d1e1847">depth hoar wind packed (DHwp);</p></list-item><list-item>
      <p id="d1e1851">melt form depth hoar (MFdh).</p></list-item></list></p>
      <p id="d1e1854">It is important to mention that the melt season is not included in this study due to liquid water influencing the interpretation of the SMP signal. For more information on the environmental and meteorological conditions under which the dataset has been collected refer to <xref ref-type="bibr" rid="bib1.bibx54" id="text.69"/>.</p>
      <p id="d1e1861">For the majority of snow types, we follow the classification of <xref ref-type="bibr" rid="bib1.bibx15" id="text.70"/>. However, <xref ref-type="bibr" rid="bib1.bibx15" id="text.71"/> was adapted for Alpine snow, meaning some of the snow types listed above are either not included in the classification or differ from the ones encountered in Alpine snow.</p>
      <?pagebreak page4536?><p id="d1e1870"><italic>Melt form, depth hoar.</italic> When working on sea ice, we identified one additional snow grain class (melt form, depth hoar, MFdh) that does not exist in the <xref ref-type="bibr" rid="bib1.bibx15" id="text.72"/> classification. This snow type is known in the sea ice community as a surface scattering layer <xref ref-type="bibr" rid="bib1.bibx37" id="paren.73"/>. It is typically found in the summer season when sea ice melts; however, we identified it as a persistent layer during the transition into winter. In the field, this was an extremely dense layer at the snow–sea ice interface, and the penetration resistance force of this layer varied throughout the season. The melt form depth hoar label was chosen because this is a feature of melting sea ice that has persisted into the winter and has undergone metamorphism while buried under snow.</p>
      <p id="d1e1881"><italic>Depth hoar, wind packed.</italic> Grains initially classified as rounded grains wind packed (RGwp) metamorphose under large temperature gradients into a very hard, dense depth hoar, which we call depth hoar wind packed (DHwp) <xref ref-type="bibr" rid="bib1.bibx52" id="paren.74"/>.</p>
      <p id="d1e1889">All other classifications are listed in <xref ref-type="bibr" rid="bib1.bibx15" id="text.75"/>.</p>
<sec id="App1.Ch1.S2.SS1">
  <label>B1</label><title>Classification details</title>

<?xmltex \floatpos{h!}?><table-wrap id="App1.Ch1.S2.T2"><?xmltex \hack{\hsize\textwidth}?><?xmltex \currentcnt{B1}?><label>Table B1</label><caption><p id="d1e1906">Features used to identify snow types visually from the SMP signal.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="justify" colwidth="1cm"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="3.5cm"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="2.5cm"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="6cm"/>
     <oasis:colspec colnum="5" colname="col5" align="justify" colwidth="2.3cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Snow type</oasis:entry>
         <oasis:entry colname="col2">Location in snow profile</oasis:entry>
         <oasis:entry colname="col3">Typical thickness</oasis:entry>
         <oasis:entry colname="col4">Signal description</oasis:entry>
         <oasis:entry colname="col5">Force range</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">DF</oasis:entry>
         <oasis:entry colname="col2">Predominantly at the surface of the profile</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> cm</oasis:entry>
         <oasis:entry colname="col4">Very low force signal</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> N</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">IF</oasis:entry>
         <oasis:entry colname="col2">Anywhere</oasis:entry>
         <oasis:entry colname="col3">0.1–5 mm</oasis:entry>
         <oasis:entry colname="col4">Sharp singular peak, no intermediate peaks</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> N</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">SH</oasis:entry>
         <oasis:entry colname="col2">Surface of profile</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> mm</oasis:entry>
         <oasis:entry colname="col4">Tooth-like structure similar to depth hoar</oasis:entry>
         <oasis:entry colname="col5">0–0.2 N</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">RGwp</oasis:entry>
         <oasis:entry colname="col2">Anywhere (not necessarily on the surface and can<?xmltex \hack{\hfill\break}?>sometimes be buried)</oasis:entry>
         <oasis:entry colname="col3">10 mm to <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> cm</oasis:entry>
         <oasis:entry colname="col4">Wavy force signal; when density is around<?xmltex \hack{\hfill\break}?>500 kg m<inline-formula><mml:math id="M22" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, the signal can also have a tooth-like structure similar to depth hoar (a density of <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">400</mml:mn></mml:mrow></mml:math></inline-formula> kg m<inline-formula><mml:math id="M24" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> is typical of Arctic wind crust)</oasis:entry>
         <oasis:entry colname="col5">Varying but in the 2–20 N range</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">DH</oasis:entry>
         <oasis:entry colname="col2">Often found in the middle to the bottom of the profile</oasis:entry>
         <oasis:entry colname="col3">Complete range</oasis:entry>
         <oasis:entry colname="col4">Classic teeth signal, increasing in force, then a sudden drop in force due to hitting an air pocket</oasis:entry>
         <oasis:entry colname="col5">0–2 N</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">DHid</oasis:entry>
         <oasis:entry colname="col2">Often at the middle to the bottom of the profile</oasis:entry>
         <oasis:entry colname="col3">Complete range</oasis:entry>
         <oasis:entry colname="col4">Classic teeth signal, does not drop to 0 N like DH would</oasis:entry>
         <oasis:entry colname="col5">2–6 N (<inline-formula><mml:math id="M25" display="inline"><mml:mo lspace="0mm">±</mml:mo></mml:math></inline-formula>2 N)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">DHwp</oasis:entry>
         <oasis:entry colname="col2">Very hard layer at the <?xmltex \hack{\hfill\break}?>surface</oasis:entry>
         <oasis:entry colname="col3">4 mm–10 cm</oasis:entry>
         <oasis:entry colname="col4">High force signal caused by wind-packed snow grains that have metamorphosed into an icy layer</oasis:entry>
         <oasis:entry colname="col5">5–30 N</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MFdh</oasis:entry>
         <oasis:entry colname="col2">Very hard layer at the<?xmltex \hack{\hfill\break}?>snow–sea ice interface</oasis:entry>
         <oasis:entry colname="col3">1–10 mm</oasis:entry>
         <oasis:entry colname="col4">High force signal caused by a metamorphosed surface scattering layer buried under the snowpack</oasis:entry>
         <oasis:entry colname="col5">5–30 N</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{B1}?></table-wrap>
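      <p>The force ranges in Table B1 overlap considerably, which is one reason why force magnitude alone cannot identify a snow type. The following minimal Python sketch encodes only the "Force range" column of the table as a lookup and returns every snow type whose range contains a given force value. It is an illustration, not the labeling procedure used in this study: the names <monospace>FORCE_RANGES_N</monospace> and <monospace>candidate_types</monospace> are hypothetical, open-ended ranges are represented with infinity, interval boundaries are treated as inclusive, and the <inline-formula><mml:math display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>2 N tolerance for DHid is ignored.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math

# Force ranges in newtons, transcribed from the "Force range" column of Table B1.
# Assumption: boundaries are treated as inclusive for simplicity.
FORCE_RANGES_N = {
    "DF": (0.0, 1.0),        # < 1 N
    "IF": (1.0, math.inf),   # > 1 N
    "SH": (0.0, 0.2),        # 0-0.2 N
    "RGwp": (2.0, 20.0),     # varying, but in the 2-20 N range
    "DH": (0.0, 2.0),        # 0-2 N
    "DHid": (2.0, 6.0),      # 2-6 N (the +/- 2 N tolerance is ignored here)
    "DHwp": (5.0, 30.0),     # 5-30 N
    "MFdh": (5.0, 30.0),     # 5-30 N
}


def candidate_types(force_n):
    """Return all snow types whose Table B1 force range contains force_n."""
    return [
        snow_type
        for snow_type, (low, high) in FORCE_RANGES_N.items()
        if low <= force_n <= high
    ]


if __name__ == "__main__":
    for force in (0.1, 4.0, 10.0):
        print(force, candidate_types(force))
```
]]></preformat>
      <p>Most force values match several snow types in this lookup, so the location in the profile, the layer thickness, and the signal shape listed in Table B1 are needed to separate them.</p>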

<?xmltex \hack{\clearpage}?>
</sec>
<?pagebreak page4537?><sec id="App1.Ch1.S2.SS2">
  <label>B2</label><title>Examples of snow types' SMP signals</title>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F8"><?xmltex \currentcnt{B1}?><?xmltex \def\figurename{Figure}?><label>Figure B1</label><caption><p id="d1e2203">SMP profiles with typical SMP signals for the following snow types: <bold>(a)</bold> a typical signal for decomposing and fragmented precipitation particles (DF) with a force remaining under 0.1 N between approximately 111 and 121 mm; <bold>(b)</bold> a typical signal for ice formations (IF) with a sharp singular peak at a maximum of 4 N between approximately 98.6 and 99.3 mm; <bold>(c)</bold> a typical increase in force at the snow–sea ice interface, characteristic of a remnant surface scattering layer, named melt form, depth hoar (MFdh) in this study, with a typical force range of 5–30 N; <bold>(d)</bold> a typical signal for surface hoar (SH) at the surface of the profile with a tooth-like structure and a low force signal; <bold>(e)</bold> a typical tooth-like signal for indurated depth hoar (DHid) with a force of 2–6 N; <bold>(f)</bold> a typical wavy force signal for rounded grains, wind packed (RGwp); <bold>(g)</bold> a typical tooth-like signal for depth hoar (DH); and <bold>(h)</bold> a typical wavy and tooth-like signal for depth hoar, wind packed (DHwp) with a force of 5–30 N at snow depths of 208 to 215 mm.</p></caption>
          <?xmltex \hack{\hsize\textwidth}?>
          <?xmltex \igopts{width=312.980315pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f08.png"/>

        </fig>

<?xmltex \hack{\clearpage}?>
</sec>
<?pagebreak page4538?><sec id="App1.Ch1.S2.SS3">
  <label>B3</label><title>Complementary parallel measurements</title>
      <p id="d1e2250">When measuring the snow properties, we had access to numerous instruments, each of which proved beneficial for interpreting the snow grain type. For example, the near-infrared (NIR) camera provided overview images of the cross section of the snow pit wall (see examples in Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F9"/>a and c), and micro-computed tomography (micro-CT) measured the snow's micro-structure in high resolution (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F9"/>d). The metadata section in the dataset by <xref ref-type="bibr" rid="bib1.bibx43" id="text.76"/> gives additional information about how many micro-CT and NIR images were used in parallel.</p>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F9"><?xmltex \currentcnt{B2}?><?xmltex \def\figurename{Figure}?><label>Figure B2</label><caption><p id="d1e2262">Overview of how a library of complementary datasets assists in labeling the SMP signal. <bold>(a)</bold> An NIR image from the event PS122-2_23-105 giving a horizontal cross section of the snowpack where the five SMP measurements in <bold>(b)</bold> were taken. The rounded grains, wind packed (rgwp); indurated depth hoar (dhid); and depth hoar (dh) regions are identified. <bold>(b)</bold> Five SMP profiles measured approximately 20 cm apart in the same snow pit during event PS122-2_23-105. <bold>(c)</bold> An NIR image from event PS122-2_21-53 giving a horizontal cross section of the snowpack where the five SMP measurements in <bold>(e)</bold> were taken. <bold>(d)</bold> A 3-D reconstruction of the snow microstructure measured using micro-computed tomography. <bold>(e)</bold> Five SMP profiles measured approximately 20 cm apart in the same snow pit during event PS122-2_21-53.</p></caption>
          <?xmltex \hack{\hsize\textwidth}?>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f09.png"/>

        </fig>

<?xmltex \hack{\clearpage}?>
</sec>
</app>

<?pagebreak page4539?><app id="App1.Ch1.S3">
  <?xmltex \currentcnt{C}?><label>Appendix C</label><title>Features</title>
<sec id="App1.Ch1.S3.SS1">
  <label>C1</label><title>Features included in data</title>
      <p id="d1e2313">Table <xref ref-type="table" rid="App1.Ch1.S3.T3"/> lists all features that were included in the training, validation, and testing data of this study. The importance of each feature depends on the specific snow type to be classified (see Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>). For example, rounded grains wind packed shows a high correlation with micro-mechanical features such as <inline-formula><mml:math id="M26" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> (4 mm window), whereas melt form depth hoar is mainly correlated with the force values of the SMP profile. Further feature importance analyses (ANOVA and decision tree importance) can be found online in the snowdragon GitHub repository.</p>

<?xmltex \floatpos{h!}?><table-wrap id="App1.Ch1.S3.T3"><?xmltex \hack{\hsize\textwidth}?><?xmltex \currentcnt{C1}?><label>Table C1</label><caption><p id="d1e2331">Names and description of the features included in the training, validation, and testing data.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="9.7cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Feature name</oasis:entry>
         <oasis:entry colname="col2">Abbreviation</oasis:entry>
         <oasis:entry colname="col3">Explanation</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">distance</oasis:entry>
         <oasis:entry colname="col2">dist</oasis:entry>
         <oasis:entry colname="col3">Distance from the snowpack's surface</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">dist_ground</oasis:entry>
         <oasis:entry colname="col2">dist_gro</oasis:entry>
         <oasis:entry colname="col3">Distance from the ground</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">pos_rel</oasis:entry>
         <oasis:entry colname="col2">pos_rel</oasis:entry>
         <oasis:entry colname="col3">Relative position in the snowpack</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">gradient</oasis:entry>
         <oasis:entry colname="col2">gradient</oasis:entry>
         <oasis:entry colname="col3">Gradient (slope) of the force signal</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">mean_force</oasis:entry>
         <oasis:entry colname="col2">mean</oasis:entry>
         <oasis:entry colname="col3">Mean force signal (1 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">mean_force_4</oasis:entry>
         <oasis:entry colname="col2">mean_4</oasis:entry>
         <oasis:entry colname="col3">Mean force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">mean_force_12</oasis:entry>
         <oasis:entry colname="col2">mean_12</oasis:entry>
         <oasis:entry colname="col3">Mean force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">var_force</oasis:entry>
         <oasis:entry colname="col2">var</oasis:entry>
         <oasis:entry colname="col3">Variance of the force signal (1 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">var_force_4</oasis:entry>
         <oasis:entry colname="col2">var_4</oasis:entry>
         <oasis:entry colname="col3">Variance of the force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">var_force_12</oasis:entry>
         <oasis:entry colname="col2">var_12</oasis:entry>
         <oasis:entry colname="col3">Variance of the force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">max_force</oasis:entry>
         <oasis:entry colname="col2">max</oasis:entry>
         <oasis:entry colname="col3">Maximum of the force signal (1 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">max_force_4</oasis:entry>
         <oasis:entry colname="col2">max_4</oasis:entry>
         <oasis:entry colname="col3">Maximum of the force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">max_force_12</oasis:entry>
         <oasis:entry colname="col2">max_12</oasis:entry>
         <oasis:entry colname="col3">Maximum of the force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">min_force</oasis:entry>
         <oasis:entry colname="col2">min</oasis:entry>
         <oasis:entry colname="col3">Minimum of the force signal (1 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">min_force_4</oasis:entry>
         <oasis:entry colname="col2">min_4</oasis:entry>
         <oasis:entry colname="col3">Minimum of the force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">min_force_12</oasis:entry>
         <oasis:entry colname="col2">min_12</oasis:entry>
         <oasis:entry colname="col3">Minimum of the force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">median_force_4</oasis:entry>
         <oasis:entry colname="col2">med_4</oasis:entry>
         <oasis:entry colname="col3">Median of the force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">median_force_12</oasis:entry>
         <oasis:entry colname="col2">med_12</oasis:entry>
         <oasis:entry colname="col3">Median of the force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">delta_4</oasis:entry>
         <oasis:entry colname="col2">delta_4</oasis:entry>
         <oasis:entry colname="col3">Width of peaks in the force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">delta_12</oasis:entry>
         <oasis:entry colname="col2">delta_12</oasis:entry>
         <oasis:entry colname="col3">Width of peaks in the force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">L_4</oasis:entry>
         <oasis:entry colname="col2">L_4</oasis:entry>
         <oasis:entry colname="col3">Distance between neighboring peaks in the force signal (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">L_12</oasis:entry>
         <oasis:entry colname="col2">L_12</oasis:entry>
         <oasis:entry colname="col3">Distance between neighboring peaks in the force signal (12 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">lambda_4</oasis:entry>
         <oasis:entry colname="col2">lambda_4</oasis:entry>
         <oasis:entry colname="col3">Parameter regulating the Poisson shot noise (4 mm window)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">lambda_12</oasis:entry>
         <oasis:entry colname="col2">lambda_12</oasis:entry>
         <oasis:entry colname="col3">Parameter regulating the Poisson shot noise (12 mm window)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{C1}?></table-wrap>
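      <p>To illustrate how the windowed statistics in Table C1 (mean, variance, maximum, minimum, and median of the force signal) could be derived, the following minimal Python sketch aggregates a force signal over a sliding window. It is a sketch under stated assumptions and not the preprocessing code of this study: the function name <monospace>window_features</monospace>, the parameter <monospace>samples_per_mm</monospace>, the use of centered windows that are truncated at the profile edges, and the use of the population variance are all assumptions made for illustration.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from statistics import mean, median, pvariance


def window_features(force, samples_per_mm, window_mm):
    """Aggregate a force signal (in N) over a centered sliding window.

    Assumptions (not taken from the paper): the window is centered on each
    sample, truncated at the profile edges, and uses the population variance.
    """
    window_samples = max(1, int(window_mm * samples_per_mm))
    half = window_samples // 2
    rows = []
    for i in range(len(force)):
        # Slice the neighborhood around sample i, clipped to the signal bounds.
        chunk = force[max(0, i - half): i + half + 1]
        rows.append({
            "mean": mean(chunk),
            "var": pvariance(chunk),
            "max": max(chunk),
            "min": min(chunk),
            "median": median(chunk),
        })
    return rows


if __name__ == "__main__":
    signal = [0.0, 1.0, 2.0, 3.0, 4.0]  # toy force values in N
    for row in window_features(signal, samples_per_mm=1, window_mm=4):
        print(row)
```
]]></preformat>
      <p>Calling the function with window sizes of 1, 4, and 12 mm would yield one set of columns per window size, analogous to the feature names with the suffixes <monospace>_4</monospace> and <monospace>_12</monospace> in Table C1.</p>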

<?xmltex \hack{\clearpage}?>
</sec>
<?pagebreak page4540?><sec id="App1.Ch1.S3.SS2">
  <label>C2</label><title>Label-wise feature correlation</title>
      <p id="d1e2665">Figure <xref ref-type="fig" rid="App1.Ch1.S3.F10"/> illustrates why classification on this dataset is difficult. Some labels show only low correlations with all features, making it unclear how correct predictions can be achieved on this basis. This suggests that more predictive features are missing: if a feature is discovered that shows a high correlation within this plot, it might boost the overall classification capabilities of the models. The figure also shows that interaction effects might arise, since some snow types show very similar correlations (for example, melt form depth hoar and depth hoar wind packed). In summary, the label-wise feature correlation reveals the classification difficulty of the dataset and can be used to discover new predictive features.</p>
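      <p>A label-wise feature correlation of this kind can be sketched as follows. The example assumes that each snow type is encoded as a binary one-versus-rest indicator and that the Pearson correlation coefficient is computed between that indicator and each feature; both choices, as well as the function names <monospace>pearson</monospace> and <monospace>label_feature_correlation</monospace>, are assumptions made for illustration and may differ from the implementation used to produce the figure.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        return 0.0  # a constant sequence carries no correlation information
    return cov / (norm_x * norm_y)


def label_feature_correlation(labels, features):
    """Correlate a one-versus-rest indicator of each label with each feature.

    labels:   list of snow type labels, one per sample
    features: dict mapping a feature name to a list of values, one per sample
    """
    result = {}
    for label in sorted(set(labels)):
        indicator = [1.0 if item == label else 0.0 for item in labels]
        result[label] = {
            name: pearson(indicator, values)
            for name, values in features.items()
        }
    return result


if __name__ == "__main__":
    toy_labels = ["dh", "dh", "mfdh", "mfdh"]
    toy_features = {"mean_force": [0.0, 0.0, 1.0, 1.0]}
    print(label_feature_correlation(toy_labels, toy_features))
```
]]></preformat>
      <p>In this toy example the high-force samples coincide exactly with the mfdh label, so the correlation is perfectly positive for mfdh and perfectly negative for dh; real SMP data yield much weaker values.</p>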

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F10"><?xmltex \currentcnt{C1}?><?xmltex \def\figurename{Figure}?><label>Figure C1</label><caption><p id="d1e2672">Label–feature correlation between snow types and aggregated features of the SMP profiles. The numbers in the feature names stand for the window size used during aggregation. Depth hoar (dh), depth hoar indurated (dhid), and rounded grains wind packed (rgwp) show some negative correlations with a subset of the features. Melt form depth hoar (mfdh), depth hoar wind packed (dhwp), and rounded grains wind packed (rgwp) show a strong positive correlation with at least one feature. Precipitation particles (pp) do not show strong correlations with any feature; however, a correlation with distance (dist), variance, and force features was expected by experts. The low correlations could be caused by the data-preprocessing step in which decomposing and fragmented precipitation particles were also categorized as precipitation particles. The rare class shows no correlations with the features since it consists of very different sub-classes (ice formation and surface hoar).</p></caption>
          <?xmltex \hack{\hsize\textwidth}?>
          <?xmltex \igopts{width=455.244094pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f10.png"/>

        </fig>

<?xmltex \hack{\clearpage}?>
</sec>
</app>

<?pagebreak page4541?><app id="App1.Ch1.S4">
  <?xmltex \currentcnt{D}?><label>Appendix D</label><title>Metrics</title>
      <p id="d1e2694">The metrics used for validation and testing are listed and explained in Table <xref ref-type="table" rid="App1.Ch1.S4.T4"/>. It might be helpful to familiarize oneself with a binary confusion matrix beforehand.</p>

<?xmltex \floatpos{h!}?><table-wrap id="App1.Ch1.S4.T4"><?xmltex \hack{\hsize\textwidth}?><?xmltex \currentcnt{D1}?><label>Table D1</label><caption><p id="d1e2703">List of metrics employed during validation and testing. The given formulas are only simplified versions for a binary classification case where no weighting takes place. The formula for the AUROC is not given here, since it is not a one-line formula and actually involves calculating an area under the ROC curve. Implementation and explanations of the metrics are from <xref ref-type="bibr" rid="bib1.bibx51" id="text.77"/>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="227.622047pt"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Metric name</oasis:entry>
         <oasis:entry colname="col2">Formula for binary case</oasis:entry>
         <oasis:entry colname="col3">Description</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Balanced accuracy</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mfenced close=")" open="("><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi mathvariant="normal">TP</mml:mi><mml:mrow><mml:mi mathvariant="normal">TP</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">FN</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>+</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi mathvariant="normal">TN</mml:mi><mml:mrow><mml:mi mathvariant="normal">TN</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">FP</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">Macro-average of recall scores per class. For balanced datasets, the score is equal to accuracy.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Weighted recall</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M28" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi mathvariant="normal">TP</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="normal">TP</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">FN</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">Calculates the recall for each class and computes the mean, weighted by the class's presence in the target data.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Weighted precision</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M29" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi mathvariant="normal">TP</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="normal">TP</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">FP</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">Calculates the precision for each class and computes the mean, weighted by the class's presence in the target data.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">F1 score</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>⋅</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mtext>precision</mml:mtext><mml:mo>⋅</mml:mo><mml:mtext>recall</mml:mtext></mml:mrow><mml:mrow><mml:mtext>precision</mml:mtext><mml:mo>+</mml:mo><mml:mtext>recall</mml:mtext></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">Harmonic mean of precision and recall. In the multiclass case, F1 computes the class mean, weighted by the class's presence in the target data.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">AUROC</oasis:entry>
         <oasis:entry colname="col2">–</oasis:entry>
         <oasis:entry colname="col3">Computes the area under the receiver operating characteristic curve from the prediction scores. The ROC curve plots the true-positive rate versus the false-positive rate. The scores are calculated for each class against all other classes (one versus rest) and weighted.</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Log loss</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>⋅</mml:mo><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>p</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">Negative log-likelihood value of a logistic model that returns prediction probabilities <inline-formula><mml:math id="M32" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> for the true data <inline-formula><mml:math id="M33" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>.</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{D1}?></table-wrap>

      <p id="d1e2970">Intuitively speaking, accuracy expresses how many samples were predicted correctly relative to all predictions; recall expresses how many positive samples were predicted correctly relative to all positive samples; precision expresses how many positive samples were predicted correctly relative to all positive predictions; the F1 score combines recall and precision in one score; the ROC (receiver operating characteristic) curve plots the true-positive rate versus the false-positive rate; the AUROC is the area under the ROC curve, and the higher it is, the better the model can separate positive from negative samples; and log loss expresses how good or bad the prediction probabilities of each sample are compared to the target predictions.
All of these metrics are better the larger they are, except for log loss, which should be as low as possible. Some of the metrics from Table <xref ref-type="table" rid="App1.Ch1.S4.T4"/> cannot be computed for all models because the AUROC and the log loss operate on prediction probabilities for the different classes, which not every model can provide. In these cases, the missing metric is marked with “–” in the result tables.</p><?xmltex \hack{\clearpage}?>
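      <p>The simplified binary formulas in Table D1 can be written out as a short Python sketch. The functions below compute balanced accuracy, recall, precision, and the F1 score from the four entries of a binary confusion matrix, and the mean log loss from true labels and predicted probabilities. The function names are hypothetical, the class weighting used in the multiclass case is deliberately left out, and the clipping constant <monospace>eps</monospace> is an assumption added to avoid taking the logarithm of zero.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from math import log


def binary_metrics(tp, fp, tn, fn):
    """Unweighted binary metrics from confusion matrix counts (Table D1).

    Zero denominators (e.g. no positive samples) are not handled here.
    """
    recall = tp / (tp + fn)          # true-positive rate
    specificity = tn / (tn + fp)     # true-negative rate
    precision = tp / (tp + fp)
    return {
        "balanced_accuracy": 0.5 * (recall + specificity),
        "recall": recall,
        "precision": precision,
        "f1": 2 * precision * recall / (precision + recall),
    }


def log_loss(y_true, p_pred, eps=1e-15):
    """Mean binary log loss; p_pred holds probabilities of the positive class."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)  # clip to keep log() finite
        total += -(y * log(p) + (1 - y) * log(1 - p))
    return total / len(y_true)


if __name__ == "__main__":
    print(binary_metrics(tp=8, fp=2, tn=6, fn=4))
    print(log_loss([1, 0], [0.5, 0.5]))
```
]]></preformat>
      <p>For the example counts (TP = 8, FP = 2, TN = 6, FN = 4), recall is 8/12, precision is 8/10, and the F1 score equals 16/22, which matches the closed form 2TP/(2TP + FP + FN).</p>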
</app>

<?pagebreak page4542?><app id="App1.Ch1.S5">
  <?xmltex \currentcnt{E}?><label>Appendix E</label><title>Machine specifications</title>
      <p id="d1e2984">The evaluation and hyperparameter tuning experiments were run on two different machines. The complete evaluation was conducted on a 64-bit system with an Ubuntu 18.04.5 (Bionic Beaver) operating system. The machine has 16 GB RAM and an Intel<sup>®</sup> Core<sup>™</sup> i7-6700HQ CPU @ 2.60 GHz<inline-formula><mml:math id="M34" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula>8 (the GPU was not used). The machine on which the first hyperparameter tuning, training, and validation experiments were run has the following specifications: a 64-bit system with an Ubuntu 20.04.1 (Focal Fossa) operating system, an Intel<sup>®</sup> Core<sup>™</sup> i7-4510U CPU @ 2.00 GHz<inline-formula><mml:math id="M35" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula>4, and 12 GB RAM (the GPU was not used). Final hyperparameter tuning, training, and validation (results presented here) were run on an Azure virtual machine of the Dsv3-series, namely on a Standard_D4s_v3 (<uri>https://docs.microsoft.com/en-us/azure/virtual-machines/dv3-dsv3-series</uri>, last access: 3 August 2023) machine with Ubuntu 18.04 (Bionic Beaver) as an operating system, 16 GB RAM, and four vCPUs.</p>
</app>

<app id="App1.Ch1.S6">
  <?xmltex \currentcnt{F}?><label>Appendix F</label><title>Model setup</title>
      <p id="d1e3019">The project was implemented in Python 3.6, and all packages used are listed in the “requirements.txt” file on GitHub. Principal component analysis, t-SNE, <inline-formula><mml:math id="M36" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-means clustering, Gaussian mixture models, Bayesian Gaussian mixture models, random forests, SVMs, and the <inline-formula><mml:math id="M37" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest-neighbor algorithm were used as made available through scikit-learn by <xref ref-type="bibr" rid="bib1.bibx51" id="text.78"/> (<uri>https://scikit-learn.org/stable/</uri>, last access: 3 August 2023). The easy ensemble for imbalanced datasets and a balanced variant of the random forest were imported from imbalanced-learn by <xref ref-type="bibr" rid="bib1.bibx35" id="text.79"/> (<uri>https://imbalanced-learn.org/stable/</uri>, last access: 3 August 2023). All ANN architectures were created with TensorFlow <xref ref-type="bibr" rid="bib1.bibx1" id="paren.80"/> (<uri>https://www.tensorflow.org/</uri>, last access: 3 August 2023) and Keras <xref ref-type="bibr" rid="bib1.bibx7" id="paren.81"/> (<uri>https://keras.io/</uri>, last access: 3 August 2023). The attention model within the encoder–decoder network was used as provided in the keras-attention-mechanism package by <xref ref-type="bibr" rid="bib1.bibx12" id="text.82"/>.</p><?xmltex \hack{\clearpage}?>
</app>

<?pagebreak page4543?><app id="App1.Ch1.S7">
  <?xmltex \currentcnt{G}?><label>Appendix G</label><title>Pruned decision tree</title>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S7.F11"><?xmltex \currentcnt{G1}?><?xmltex \def\figurename{Figure}?><label>Figure G1</label><caption><p id="d1e3075">Pruned decision tree extracted from the random forest. See Appendix <xref ref-type="sec" rid="App1.Ch1.S3.SS1"/> for an explanation of the features that the nodes represent. Decision trees encode the decision rules for predicting snow type labels. This approach helps explain the model's decisions, a property often asked for by domain experts. At each leaf node, a labeling decision is made. All other nodes encode the labeling rules used to classify each point. Take the root node as an example: if the variance of the force is smaller than or equal to zero, the point is labeled as precipitation particles; in all other cases, it must be one of the other labels. The Gini index encodes how mixed the labels within each subset of data points are (the smaller the number, the better the separation; zero means that the subset contains a single label), and the samples value shows what percentage of the complete data can be found in this subset.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f11.png"/>

      </fig>
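      <p>As a minimal sketch of how the Gini value at a node can be read, the following Python example computes the standard Gini impurity, one minus the sum of the squared label proportions, for two hypothetical nodes. It assumes that the Gini index shown in the tree is this standard impurity measure; the function name <monospace>gini_impurity</monospace> and the node contents are illustrative and are not taken from the snowdragon code.</p>
      <preformat preformat-type="code" xml:space="preserve">
```python
from collections import Counter


def gini_impurity(labels):
    """Return the Gini impurity (1 minus the sum of squared label shares)."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((n / total) ** 2 for n in counts.values())


# Hypothetical node contents; the label names are for illustration only.
pure_node = ["precipitation particles"] * 4
mixed_node = ["rounded grains", "rounded grains", "depth hoar", "depth hoar"]

print(gini_impurity(pure_node))   # 0.0
print(gini_impurity(mixed_node))  # 0.5
```
</preformat>
      <p>Under this assumption, a node that contains a single snow type has a Gini value of zero, and a node split evenly between two snow types has a value of 0.5.</p>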

<?xmltex \hack{\clearpage}?>
</app>

<?pagebreak page4544?><app id="App1.Ch1.S8">
  <?xmltex \currentcnt{H}?><label>Appendix H</label><title>Confusion matrices</title>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S8.F12"><?xmltex \currentcnt{H1}?><?xmltex \def\figurename{Figure}?><label>Figure H1</label><caption><p id="d1e3100"> </p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f12-part01.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S8.F13"><?xmltex \currentcnt{H1}?><?xmltex \def\figurename{Figure}?><label>Figure H1</label><caption><p id="d1e3114"> </p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f12-part02.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S8.F14" specific-use="star"><?xmltex \currentcnt{H1}?><?xmltex \def\figurename{Figure}?><label>Figure H1</label><caption><p id="d1e3128">Confusion matrices of all models displaying the predicted and the observed snow types. The number in each cell is the relative prediction frequency of a label within the observed class. The numbers on the diagonal (upper left to lower right) represent the prediction accuracy of each label. The more pronounced the diagonal and the less pronounced the upper and lower triangles are, the better the predictions are. The confusion matrices support an in-depth analysis of the label-specific performances. This is useful when users want to choose a model that is suitable for a specific snow classification task. <bold>(a)</bold> Confusion matrices of majority vote, Gaussian mixture model, <inline-formula><mml:math id="M38" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-means, easy ensemble, and <inline-formula><mml:math id="M39" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest neighbor approaches. <bold>(b)</bold> Confusion matrices of random forest, support vector machine, LSTM, BLSTM, and encoder approaches. <bold>(c)</bold> Confusion matrices of self-trainer and label propagation approaches.</p></caption>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/4521/2023/gmd-16-4521-2023-f12-part03.png"/>

      </fig>
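      <p>The normalization described in the caption, in which each cell holds the relative prediction frequency within the observed class, can be sketched as follows. This is an illustrative example using only the Python standard library, not the code used to produce the figure; the function name <monospace>row_normalized_confusion</monospace> and the toy labels are assumptions made for this sketch.</p>
      <preformat preformat-type="code" xml:space="preserve">
```python
from collections import Counter


def row_normalized_confusion(observed, predicted, labels):
    """Return rows of relative prediction frequencies per observed class."""
    counts = Counter(zip(observed, predicted))
    matrix = []
    for obs in labels:
        row_total = sum(counts[(obs, pred)] for pred in labels)
        # Avoid dividing by zero for classes without observations.
        row = [counts[(obs, pred)] / row_total if row_total else 0.0
               for pred in labels]
        matrix.append(row)
    return matrix


# Hypothetical labels and predictions, for illustration only.
labels = ["rounded grains", "depth hoar"]
observed = ["rounded grains"] * 4 + ["depth hoar"] * 2
predicted = ["rounded grains", "rounded grains", "rounded grains", "depth hoar",
             "depth hoar", "rounded grains"]

matrix = row_normalized_confusion(observed, predicted, labels)
print(matrix)  # [[0.75, 0.25], [0.5, 0.5]]
```
</preformat>
      <p>In this toy case, each row sums to one, and the diagonal entries (0.75 and 0.5) are the fractions of correctly predicted points within each observed snow type.</p>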

<?xmltex \hack{\clearpage}?>
</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e3166">The current version of snowdragon is available on GitHub under the MIT License: <uri>https://github.com/liellnima/snowdragon</uri> (last access: 3 August 2023).
To run the code version used in this paper, please refer to v1.0.0 on GitHub or Zenodo: <ext-link xlink:href="https://doi.org/10.5281/zenodo.7335813" ext-link-type="DOI">10.5281/zenodo.7335813</ext-link> <xref ref-type="bibr" rid="bib1.bibx28" id="paren.83"/>.
The exact versions of the models used to produce the results presented in this paper are also archived on Zenodo: <ext-link xlink:href="https://doi.org/10.5281/zenodo.7063520" ext-link-type="DOI">10.5281/zenodo.7063520</ext-link> <xref ref-type="bibr" rid="bib1.bibx30" id="paren.84"/>. The MOSAiC SMP data used as input and training data are available on PANGAEA: <ext-link xlink:href="https://doi.org/10.1594/PANGAEA.935554" ext-link-type="DOI">10.1594/PANGAEA.935554</ext-link> <xref ref-type="bibr" rid="bib1.bibx43" id="paren.85"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e3194">ARM and MS collected and curated the data. ARM and MS labeled the data. ARM and JK preprocessed the data. JK developed the methodological framework. JK implemented, compared, tuned, and validated the models. JK and VC visualized the results. JK wrote the manuscript draft. VC, ARM, and MS reviewed and edited the manuscript. VC supervised the ML part of the study. MS supervised the cryospheric part of the study.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e3200">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e3206">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e3212">Data used in this paper were produced as part of the international Multidisciplinary drifting Observatory for the Study of the Arctic Climate (MOSAiC) with the tag MOSAiC20192020. The data were collected during the Polarstern expedition AWI_PS122_00. We acknowledge the contribution of the MOSAiC expedition <xref ref-type="bibr" rid="bib1.bibx49" id="paren.86"/>.
We especially thank the crew of RV <italic>Polarstern</italic> <xref ref-type="bibr" rid="bib1.bibx34" id="paren.87"/> and the participants of legs 1 to 3 for their help in the field. We would also like to thank the late Joshua M. L. King for insightful discussions and comments.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e3226">This research has been supported by the Swiss Polar Institute (grant no. DIRCR-2018-003), the European Union's Horizon 2020 research and innovation programme project ARICE (grant no. 730965) for berth fees associated with the participation of the DEARice project, the WSL-Institut für Schnee- und Lawinenforschung (grant no. WSL201812N1678), the Deutsche Forschungsgemeinschaft (grant no. GRK2340), and the Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research (grant no. AWI_PS122_00). Additional funding was provided for the research training group by the Deutsche Forschungsgemeinschaft (DFG) (grant no. GRK2340).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e3232">This paper was edited by Fabien Maussion and reviewed by Pascal Hagenmuller and two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Abadi et~al.(2016)Abadi, Agarwal, Barham, Brevdo, Chen, Citro,
Corrado, Davis, Dean, Devin, Ghemawat, Goodfellow, Harp, Irving, Isard, Jia,
Jozefowicz, Kaiser, Kudlur, Levenberg, Mane, Monga, Moore, Murray, Olah,
Schuster, Shlens, Steiner, Sutskever, Talwar, Tucker, Vanhoucke, Vasudevan,
Viegas, Vinyals, Warden, Wattenberg, Wicke, Yu, and
Zheng}}?><label>Abadi et al.(2016)Abadi, Agarwal, Barham, Brevdo, Chen, Citro,
Corrado, Davis, Dean, Devin, Ghemawat, Goodfellow, Harp, Irving, Isard, Jia,
Jozefowicz, Kaiser, Kudlur, Levenberg, Mane, Monga, Moore, Murray, Olah,
Schuster, Shlens, Steiner, Sutskever, Talwar, Tucker, Vanhoucke, Vasudevan,
Viegas, Vinyals, Warden, Wattenberg, Wicke, Yu, and
Zheng</label><?label tensorflow2015-whitepaper?><mixed-citation>Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado,
G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp,
A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M.,
Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C.,
Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P.,
Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., Warden, P.,
Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow:
Large-Scale Machine Learning on Heterogeneous Distributed
Systems, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1603.04467" ext-link-type="DOI">10.48550/arXiv.1603.04467</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx2"><?xmltex \def\ref@label{{Bahdanau et~al.(2014)Bahdanau, Cho, and Bengio}}?><label>Bahdanau et al.(2014)Bahdanau, Cho, and Bengio</label><?label bahdanau2014neural?><mixed-citation>Bahdanau, D., Cho, K., and Bengio, Y.: Neural Machine Translation by Jointly
Learning to Align and Translate, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/ARXIV.1409.0473" ext-link-type="DOI">10.48550/ARXIV.1409.0473</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx3"><?xmltex \def\ref@label{{Bishop(2006)}}?><label>Bishop(2006)</label><?label bishop2006pattern?><mixed-citation>
Bishop, C. M.: Pattern recognition and machine learning, Information science
and statistics, Springer, New York, 738 pp., ISBN 978-0-387-31073-2, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx4"><?xmltex \def\ref@label{{Breiman(2001)}}?><label>Breiman(2001)</label><?label breiman2001random?><mixed-citation>Breiman, L.: Random forests, Mach. Learn., 45, 5–32,
<ext-link xlink:href="https://doi.org/10.1023/A:1010933404324" ext-link-type="DOI">10.1023/A:1010933404324</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx5"><?xmltex \def\ref@label{{Calonne et~al.(2020)Calonne, Richter, Löwe, Cetti, Ter~Schure,
Van~Herwijnen, Fierz, Jaggi, and Schneebeli}}?><label>Calonne et al.(2020)Calonne, Richter, Löwe, Cetti, Ter Schure,
Van Herwijnen, Fierz, Jaggi, and Schneebeli</label><?label calonne2020rhossa?><mixed-citation>Calonne, N., Richter, B., Löwe, H., Cetti, C., ter Schure, J., Van Herwijnen, A., Fierz, C., Jaggi, M., and Schneebeli, M.: The RHOSSA campaign: multi-resolution monitoring of the seasonal evolution of the structure and mechanical stability of an alpine snowpack, The Cryosphere, 14, 1829–1848, <ext-link xlink:href="https://doi.org/10.5194/tc-14-1829-2020" ext-link-type="DOI">10.5194/tc-14-1829-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx6"><?xmltex \def\ref@label{{Chen et~al.(2004)Chen, Liaw, and Breiman}}?><label>Chen et al.(2004)Chen, Liaw, and Breiman</label><?label chen2004using?><mixed-citation>Chen, C., Liaw, A., and Breiman, L.: Using random forest to learn imbalanced
data, Tech. Rep. 666, University of California, Department of Statistics,
Berkeley, <uri>https://statistics.berkeley.edu/tech-reports/666</uri> (last access: 3 August 2023),
2004.</mixed-citation></ref>
      <ref id="bib1.bibx7"><?xmltex \def\ref@label{{Chollet et~al.(2015)}}?><label>Chollet et al.(2015)</label><?label chollet2015keras?><mixed-citation>Chollet, F. et al.: Keras, GitHub, <uri>https://github.com/fchollet/keras</uri> (last access: 3 August 2023), 2015.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Colbeck(1987)}}?><label>Colbeck(1987)</label><?label colbeck1987review?><mixed-citation>Colbeck, S.: A review of the metamorphism and classification of seasonal snow
cover crystals, IAHS Publication, 162, 3–24,
<uri>https://iahs.info/uploads/dms/6807.3-34-162-Colbeck.pdf</uri>,
1987.</mixed-citation></ref>
      <ref id="bib1.bibx9"><?xmltex \def\ref@label{{Coléou et~al.(2001)Coléou, Lesaffre, Brzoska, Ludwig, and
Boller}}?><label>Coléou et al.(2001)Coléou, Lesaffre, Brzoska, Ludwig, and
Boller</label><?label coleou2001three?><mixed-citation>Coléou, C., Lesaffre, B., Brzoska, J.-B., Ludwig, W., and Boller, E.:
Three-dimensional snow images by X-ray microtomography, Ann.
Glaciol., 32, 75–81, <ext-link xlink:href="https://doi.org/10.3189/172756401781819418" ext-link-type="DOI">10.3189/172756401781819418</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx10"><?xmltex \def\ref@label{{Cortes and Vapnik(1995)}}?><label>Cortes and Vapnik(1995)</label><?label cortes1995support?><mixed-citation>Cortes, C. and Vapnik, V.: Support-vector networks, Mach. Learn., 20,
273–297, <ext-link xlink:href="https://doi.org/10.1007/BF00994018" ext-link-type="DOI">10.1007/BF00994018</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx11"><?xmltex \def\ref@label{{{Cover} and {Hart}(1967)}}?><label>Cover and Hart(1967)</label><?label cover1967nearest?><mixed-citation>Cover, T. and Hart, P.: Nearest neighbor pattern classification, IEEE
Transactions on Information Theory, 13, 21–27,
<ext-link xlink:href="https://doi.org/10.1109/TIT.1967.1053964" ext-link-type="DOI">10.1109/TIT.1967.1053964</ext-link>, 1967.</mixed-citation></ref>
      <ref id="bib1.bibx12"><?xmltex \def\ref@label{{CyberZHG(2020)}}?><label>CyberZHG(2020)</label><?label cyberZHG2020attention?><mixed-citation>CyberZHG: Keras Self-Attention, GitHub,
<uri>https://github.com/CyberZHG/keras-self-attention</uri> (last access: 3 August 2023), 2020.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Domine et~al.(2019)Domine, Picard, Morin, Barrere, Madore, and
Langlois}}?><label>Domine et al.(2019)Domine, Picard, Morin, Barrere, Madore, and
Langlois</label><?label domine_major_2019?><mixed-citation>Domine, F., Picard, G., Morin, S., Barrere, M., Madore, J.-B., and Langlois,
A.: Major Issues in Simulating Some Arctic Snowpack Properties
Using Current Detailed Snow Physics Models: Consequences for
the Thermal Regime and Water Budget of Permafrost, J.
Adv. Model. Earth Syst., 11, 34–44, <ext-link xlink:href="https://doi.org/10.1029/2018MS001445" ext-link-type="DOI">10.1029/2018MS001445</ext-link>,
2019.</mixed-citation></ref>
      <ref id="bib1.bibx14"><?xmltex \def\ref@label{{Douville et~al.(1995)Douville, Royer, and Mahfouf}}?><label>Douville et al.(1995)Douville, Royer, and Mahfouf</label><?label douville1995new?><mixed-citation>Douville, H., Royer, J. F., and Mahfouf, J. F.: A new snow parameterization for
the Météo-France climate model, Clim. Dynam., 12, 21–35,
<ext-link xlink:href="https://doi.org/10.1007/BF00208760" ext-link-type="DOI">10.1007/BF00208760</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx15"><?xmltex \def\ref@label{{Fierz et~al.(2009)Fierz, Armstrong, Durand, Etchevers, Greene,
McClung, Nishimura, Satyawali, and Sokratov}}?><label>Fierz et al.(2009)Fierz, Armstrong, Durand, Etchevers, Greene,
McClung, Nishimura, Satyawali, and Sokratov</label><?label fierz2009international?><mixed-citation>
Fierz, C., Armstrong, R., Durand, Y., Etchevers,<?pagebreak page4548?> P., Greene, E., McClung,
D. M., Nishimura, K., Satyawali, P., and Sokratov, S.: The international
classification for seasonal snow on the ground, IHP-VII Technical Documents
in Hydrology No. 83, IACS Contribution No. 1, UNESCO-IHP,
Paris, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx16"><?xmltex \def\ref@label{{Fix and Hodges(1952)}}?><label>Fix and Hodges(1952)</label><?label fix1952discriminatory?><mixed-citation>Fix, E. and Hodges, J. L.: Discriminatory Analysis - Nonparametric
Discrimination: Small Sample Performance, Tech. rep., University of
California, Berkeley,
<uri>https://apps.dtic.mil/sti/citations/ADA800391</uri> (last access: 3 August 2023),
1952.</mixed-citation></ref>
      <ref id="bib1.bibx17"><?xmltex \def\ref@label{{Forgy(1965)}}?><label>Forgy(1965)</label><?label forgy1965cluster?><mixed-citation>
Forgy, E. W.: Cluster analysis of multivariate data: efficiency versus
interpretability of classifications, Biometrics, 21, 768–769, 1965.</mixed-citation></ref>
      <ref id="bib1.bibx18"><?xmltex \def\ref@label{{Ghahramani(2004)}}?><label>Ghahramani(2004)</label><?label ghahramani2004Unsupervised?><mixed-citation>Ghahramani, Z.: Unsupervised Learning, pp. 72–112, Springer Berlin Heidelberg,
Berlin, Heidelberg, <ext-link xlink:href="https://doi.org/10.1007/978-3-540-28650-9_5" ext-link-type="DOI">10.1007/978-3-540-28650-9_5</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{Han et~al.(2012)Han, Kamber, and Pei}}?><label>Han et al.(2012)Han, Kamber, and Pei</label><?label han2012classification?><mixed-citation>Han, J., Kamber, M., and Pei, J.: 9 – Classification: Advanced Methods, in:
Data Mining (Third Edition), edited by: Han, J., Kamber, M., and Pei, J., The
Morgan Kaufmann Series in Data Management Systems, pp. 393–442, Morgan
Kaufmann, Boston, 3rd edn.,
<ext-link xlink:href="https://doi.org/10.1016/B978-0-12-381479-1.00009-5" ext-link-type="DOI">10.1016/B978-0-12-381479-1.00009-5</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx20"><?xmltex \def\ref@label{{Havens et~al.(2010)Havens, Marshall, Steiner, and
Tedesco}}?><label>Havens et al.(2010)Havens, Marshall, Steiner, and
Tedesco</label><?label havens2010singleCT?><mixed-citation>Havens, S., Marshall, H.-P., Steiner, N., and Tedesco, M.: Snow micro
penetrometer and near infrared photography for grain type classification, in:
2010 International Snow Science Workshop, pp. 465–469,
<uri>https://arc.lib.montana.edu/snow-science/objects/ISSW_P-029.pdf</uri>  (last access: 3 August 2023),
2010.</mixed-citation></ref>
      <ref id="bib1.bibx21"><?xmltex \def\ref@label{{Havens et~al.(2013)Havens, Marshall, Pielmeier, and
Elder}}?><label>Havens et al.(2013)Havens, Marshall, Pielmeier, and
Elder</label><?label havens2012automatic?><mixed-citation>Havens, S., Marshall, H.-P., Pielmeier, C., and Elder, K.: Automatic Grain
Type Classification of Snow Micro Penetrometer Signals With
Random Forests, IEEE Transactions on Geoscience and Remote Sensing, 51,
3328–3335, <ext-link xlink:href="https://doi.org/10.1109/TGRS.2012.2220549" ext-link-type="DOI">10.1109/TGRS.2012.2220549</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx22"><?xmltex \def\ref@label{{Herla et~al.(2021)Herla, Horton, Mair, and Haegeli}}?><label>Herla et al.(2021)Herla, Horton, Mair, and Haegeli</label><?label herla2021snow?><mixed-citation>Herla, F., Horton, S., Mair, P., and Haegeli, P.: Snow profile alignment and similarity assessment for aggregating, clustering, and evaluating snowpack model output for avalanche forecasting, Geosci. Model Dev., 14, 239–258, <ext-link xlink:href="https://doi.org/10.5194/gmd-14-239-2021" ext-link-type="DOI">10.5194/gmd-14-239-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx23"><?xmltex \def\ref@label{{Ho(1995)}}?><label>Ho(1995)</label><?label ho1995random?><mixed-citation>
Ho, T. K.: Random decision forests, in: Proceedings of 3rd international
conference on document analysis and recognition, vol. 1, pp. 278–282, IEEE,
1995.</mixed-citation></ref>
      <ref id="bib1.bibx24"><?xmltex \def\ref@label{{Hochreiter and Schmidhuber(1997)}}?><label>Hochreiter and Schmidhuber(1997)</label><?label hochreiter1997long?><mixed-citation>
Hochreiter, S. and Schmidhuber, J.: Long short-term memory,
Neural Comput.,
9, 1735–1780, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx25"><?xmltex \def\ref@label{{Ismail~Fawaz et~al.(2019)Ismail~Fawaz, Forestier, Weber, Idoumghar,
and Muller}}?><label>Ismail Fawaz et al.(2019)Ismail Fawaz, Forestier, Weber, Idoumghar,
and Muller</label><?label ismail2019deep?><mixed-citation>Ismail Fawaz, H., Forestier, G., Weber, J., Idoumghar, L., and Muller, P.-A.:
Deep learning for time series classification: a review, Data Min. Knowl. Disc., 33, 917–963, <ext-link xlink:href="https://doi.org/10.1007/s10618-019-00619-1" ext-link-type="DOI">10.1007/s10618-019-00619-1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx26"><?xmltex \def\ref@label{{Johnson and Schneebeli(1998)}}?><label>Johnson and Schneebeli(1998)</label><?label johnson1998patent?><mixed-citation>Johnson, J. B. and Schneebeli, M.: Snow strength penetrometer,
<uri>https://patents.google.com/patent/US5831161/en</uri>  (last access: 3 August 2023), 1998.</mixed-citation></ref>
      <ref id="bib1.bibx27"><?xmltex \def\ref@label{{Jurafsky and Martin(2021)}}?><label>Jurafsky and Martin(2021)</label><?label jurafsky2021speech?><mixed-citation>Jurafsky, D. and Martin, J. H.: Speech and Language Processing: An Introduction
to Natural Language Processing, Computational Linguistics, and Speech
Recognition, 3rd ed. draft, in
progress,
<uri>https://web.stanford.edu/~jurafsky/slp3/</uri>  (last access: 3 August 2023), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{Kaltenborn and vclay(2022)}?><label>Kaltenborn and vclay(2022)</label><?label Kaltenborncode?><mixed-citation>Kaltenborn, J. and vclay: liellnima/snowdragon: Snowdragon Release 1.0.0 (v1.0.0), Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.7335813" ext-link-type="DOI">10.5281/zenodo.7335813</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx29"><?xmltex \def\ref@label{{Kaltenborn et~al.(2021)Kaltenborn, Clay, Macfarlane, and
Schneebeli}}?><label>Kaltenborn et al.(2021)Kaltenborn, Clay, Macfarlane, and
Schneebeli</label><?label kaltenborn2021machine?><mixed-citation>Kaltenborn, J., Clay, V., Macfarlane, A. R., and Schneebeli, M.: Machine
Learning for Snow Stratigraphy Classification, in: NeurIPS 2021 Workshop on
Tackling Climate Change with Machine Learning,
<uri>https://www.climatechange.ai/papers/neurips2021/48</uri> (last access: 3 August 2023), 2021.</mixed-citation></ref>
      <ref id="bib1.bibx30"><?xmltex \def\ref@label{{Kaltenborn et~al.(2022)Kaltenborn, Macfarlane, Clay, and
Schneebeli}}?><label>Kaltenborn et al.(2022)Kaltenborn, Macfarlane, Clay, and
Schneebeli</label><?label kaltenborn2022pretrained?><mixed-citation>Kaltenborn, J., Macfarlane, A. R., Clay, V., and Schneebeli, M.: Pre-trained
Models for SMP Classification and Segmentation, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.7063521" ext-link-type="DOI">10.5281/zenodo.7063521</ext-link>,
2022.</mixed-citation></ref>
      <ref id="bib1.bibx31"><?xmltex \def\ref@label{{King et~al.(2015)King, Kelly, Kasurak, Duguay, Gunn, Rutter, Watts,
and Derksen}}?><label>King et al.(2015)King, Kelly, Kasurak, Duguay, Gunn, Rutter, Watts,
and Derksen</label><?label King2015a?><mixed-citation>King, J., Kelly, R., Kasurak, A., Duguay, C., Gunn, G., Rutter, N., Watts, T.,
and Derksen, C.: Spatio-temporal influence of tundra snow properties on
Ku-band (17.2 GHz) backscatter, J. Glaciol., 61, 267–279,
<ext-link xlink:href="https://doi.org/10.3189/2015JoG14J020" ext-link-type="DOI">10.3189/2015JoG14J020</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx32"><?xmltex \def\ref@label{{King et~al.(2020{\natexlab{a}})King, Howell, Brady, Toose, Derksen,
Haas, and Beckers}}?><label>King et al.(2020a)King, Howell, Brady, Toose, Derksen,
Haas, and Beckers</label><?label king2020local?><mixed-citation>King, J., Howell, S., Brady, M., Toose, P., Derksen, C., Haas, C., and Beckers, J.: Local-scale variability of snow density on Arctic sea ice, The Cryosphere, 14, 4323–4339, <ext-link xlink:href="https://doi.org/10.5194/tc-14-4323-2020" ext-link-type="DOI">10.5194/tc-14-4323-2020</ext-link>, 2020a.</mixed-citation></ref>
      <ref id="bib1.bibx33"><?xmltex \def\ref@label{{King et~al.(2020{\natexlab{b}})King, Howell, Brady, Toose, Derksen,
Haas, and Beckers}}?><label>King et al.(2020b)King, Howell, Brady, Toose, Derksen,
Haas, and Beckers</label><?label king2020snowmicropen?><mixed-citation>King, J., Howell, S., Brady, M., Toose, P., Derksen, C., Haas, C., and Beckers,
J.: SnowMicroPen Measurements on Sea Ice 2016–2017, Zenodo,
<ext-link xlink:href="https://doi.org/10.5281/zenodo.4068349" ext-link-type="DOI">10.5281/zenodo.4068349</ext-link>, 2020b.</mixed-citation></ref>
      <ref id="bib1.bibx34"><?xmltex \def\ref@label{{Knust(2017)}}?><label>Knust(2017)</label><?label knust2017polar?><mixed-citation>Knust, R.: Polar Research and Supply Vessel POLARSTERN Operated by
the Alfred-Wegener-Institute, Journal of Large-Scale Research
Facilities, 3, A119–A119, <ext-link xlink:href="https://doi.org/10.17815/jlsrf-3-163" ext-link-type="DOI">10.17815/jlsrf-3-163</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx35"><?xmltex \def\ref@label{{Lema{\^{\i}}tre et~al.(2017)Lema{\^{\i}}tre, Nogueira, and
Aridas}}?><label>Lemaître et al.(2017)Lemaître, Nogueira, and
Aridas</label><?label lemaitre2017imbalanced?><mixed-citation>Lemaître, G., Nogueira, F., and Aridas, C. K.: Imbalanced-learn: A python
toolbox to tackle the curse of imbalanced datasets in machine learning,
J. Mach. Learn. Res., 18, 559–563,
<uri>https://www.jmlr.org/papers/volume18/16-365/16-365.pdf</uri>  (last access: 3 August 2023), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx36"><?xmltex \def\ref@label{{Li et~al.(2020)Li, Hasanaj, and Li}}?><label>Li et al.(2020)Li, Hasanaj, and Li</label><?label li2020Baseline?><mixed-citation>Li, D., Hasanaj, E., and Li, S.: 3 – Baselines,
<uri>https://blog.ml.cmu.edu/2020/08/31/3-baselines/</uri> (last access:
4 March 2021), 2020.</mixed-citation></ref>
      <ref id="bib1.bibx37"><?xmltex \def\ref@label{{Light et~al.(2015)Light, Perovich, Webster, Polashenski, and
Dadic}}?><label>Light et al.(2015)Light, Perovich, Webster, Polashenski, and
Dadic</label><?label light2015optical?><mixed-citation>Light, B., Perovich, D. K., Webster, M. A., Polashenski, C., and Dadic, R.:
Optical properties of melting first-year Arctic sea ice, J.
Geophys. Res.-Oceans, 120, 7657–7675, <ext-link xlink:href="https://doi.org/10.1002/2015JC011163" ext-link-type="DOI">10.1002/2015JC011163</ext-link>,
2015.</mixed-citation></ref>
      <ref id="bib1.bibx38"><?xmltex \def\ref@label{{Liu et~al.(2008)Liu, Wu, and Zhou}}?><label>Liu et al.(2008)Liu, Wu, and Zhou</label><?label liu2008exploratory?><mixed-citation>Liu, X.-Y., Wu, J., and Zhou, Z.-H.: Exploratory undersampling for
class-imbalance learning, IEEE Transactions on Systems, Man, and Cybernetics,
Part B (Cybernetics), 39, 539–550, <ext-link xlink:href="https://doi.org/10.1109/TSMCB.2008.2007853" ext-link-type="DOI">10.1109/TSMCB.2008.2007853</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx39"><?xmltex \def\ref@label{{Lloyd(1982)}}?><label>Lloyd(1982)</label><?label lloyd1982least?><mixed-citation>Lloyd, S.: Least squares quantization in PCM, IEEE Transactions on Information
Theory, 28, 129–137, <ext-link xlink:href="https://doi.org/10.1109/TIT.1982.1056489" ext-link-type="DOI">10.1109/TIT.1982.1056489</ext-link>, 1982.</mixed-citation></ref>
      <ref id="bib1.bibx40"><?xmltex \def\ref@label{{L{\"{o}}we and Van~Herwijnen(2012)}}?><label>Löwe and Van Herwijnen(2012)</label><?label lowe2012poisson?><mixed-citation>Löwe, H. and Van Herwijnen, A.: A Poisson shot noise model for
micro-penetration of snow, Cold Reg. Sci. Technol., 70, 62–70,
<ext-link xlink:href="https://doi.org/10.1016/j.coldregions.2011.09.001" ext-link-type="DOI">10.1016/j.coldregions.2011.09.001</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx41"><?xmltex \def\ref@label{{Lutz et~al.(2007)Lutz, Birkeland, Kronholm, Hansen, and
Aspinall}}?><label>Lutz et al.(2007)Lutz, Birkeland, Kronholm, Hansen, and
Aspinall</label><?label lutz2007segmentation_moving_window?><mixed-citation>Lutz, E., Birkeland, K. W., Kronholm, K., Hansen, K., and Aspinall, R.: Surface
hoar characteristics derived from a snow micropenetrometer using moving
window statistical operations, Cold Reg. Sci. Technol., 47,
118–133, <ext-link xlink:href="https://doi.org/10.1016/j.coldregions.2006.08.021" ext-link-type="DOI">10.1016/j.coldregions.2006.08.021</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx42"><?xmltex \def\ref@label{{Macfarlane et~al.(2023)Macfarlane, Schneebeli, Dadic, Tavri,
Immerz, Polashenski, Krampe, Clemens-Sewall, Wagner, Perovich, Henna-Reetta,
Raphael, Matero, Regnery, Smith, Nicolaus, Jaggi, Oggier, Webster, Lehning,
Kolabutin, Itkin, Naderpour, Pirazzini, Hammerle, Arndt, and
Fons}}?><label>Macfarlane et al.(2023)Macfarlane, Schneebeli, Dadic, Tavri,
Immerz, Polashenski, Krampe, Clemens-Sewall, Wagner, Perovich, Henna-Reetta,
Raphael, Matero, Regnery, Smith, Nicolaus, Jaggi, Oggier, Webster, Lehning,
Kolabutin, Itkin, Naderpour, Pirazzini, Hammerle, Arndt, and
Fons</label><?label macfarlane2023database?><mixed-citation>Macfarlane, A., Schneebeli, M., Dadic, R., Tavri, A., Immerz, A., Polashenski,
C., Krampe, D., Clemens-Sewall, D., Wagner, D., Perovich, D., Henna-Reetta,
H., Raphael, I., Matero, I., Regnery, J., Smith, M., Nicolaus, M., Jaggi, M.,
Oggier, M., Webster, M., Lehning, M., Kolabutin, N., Itkin, P., Naderpour,
R., Pirazzini, R., Hammerle, S., Arndt, S., and Fons, S.: A Database of Snow
on Sea Ice in the Central Arctic Collected during the MOSAiC expedition,
Scientific Data, 10, 398, <ext-link xlink:href="https://doi.org/10.1038/s41597-023-02273-1" ext-link-type="DOI">10.1038/s41597-023-02273-1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx43"><?xmltex \def\ref@label{{{Macfarlane} et~al.(2021){Macfarlane}, {Schneebeli}, {Dadic},
{Wagner}, {Arndt}, {Clemens-Sewall}, {H\"{a}mmerle}, {Hannula}, {Jaggi},
{Kolabutin}, {Krampe}, {Lehning}, {Matero}, {Nicolaus}, {Oggier},
{Pirazzini}, {Polashenski}, {Raphael}, {Regnery}, {Shimanchuck}, {Smith}, and
{Tavri}}}?><label>Macfarlane et al.(2021)Macfarlane, Schneebeli, Dadic,
Wagner, Arndt, Clemens-Sewall, Hämmerle, Hannula, Jaggi,
Kolabutin, Krampe, Lehning, Matero, Nicolaus, Oggier,
Pirazzini, Polashenski, Raphael, Regnery, Shimanchuck, Smith, and
Tavri</label><?label macfarlane2021sssf?><mixed-citation>Macfarlane, A. R., Schneebeli, M.,<?pagebreak page4549?> Dadic, R., Wagner, D. N., Arndt,
S., Clemens-Sewall, D., Hämmerle, S., Hannula, H.-R., Jaggi, M.,
Kolabutin, N., Krampe, D., Lehning, M., Matero, I., Nicolaus, M.,
Oggier, M., Pirazzini, R., Polashenski, C., Raphael, I., Regnery,
J., Shimanchuck, E., Smith, M. M., and Tavri, A.: Snowpit SnowMicroPen
(SMP) force profiles collected during the MOSAiC expedition, PANGAEA [data set],
<ext-link xlink:href="https://doi.org/10.1594/PANGAEA.935554" ext-link-type="DOI">10.1594/PANGAEA.935554</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx44"><?xmltex \def\ref@label{{Matzl and Schneebeli(2006)}}?><label>Matzl and Schneebeli(2006)</label><?label matzl2006measuring?><mixed-citation>Matzl, M. and Schneebeli, M.: Measuring specific surface area of snow by
near-infrared photography, J. Glaciol., 52, 558–564,
<ext-link xlink:href="https://doi.org/10.3189/172756506781828412" ext-link-type="DOI">10.3189/172756506781828412</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx45"><?xmltex \def\ref@label{{M{\'{e}}nard et~al.(2019)M{\'{e}}nard, Essery, Barr, Bartlett, Derry,
Dumont, Fierz, Kim, Kontu, Lejeune et~al.}}?><label>Ménard et al.(2019)Ménard, Essery, Barr, Bartlett, Derry,
Dumont, Fierz, Kim, Kontu, Lejeune et al.</label><?label menard2019meteorological?><mixed-citation>Ménard, C. B., Essery, R., Barr, A., Bartlett, P., Derry, J., Dumont, M., Fierz, C., Kim, H., Kontu, A., Lejeune, Y., Marks, D., Niwano, M., Raleigh, M., Wang, L., and Wever, N.: Meteorological and evaluation datasets for snow modelling at 10 reference sites: description of in situ and bias-corrected reanalysis data, Earth Syst. Sci. Data, 11, 865–880, <ext-link xlink:href="https://doi.org/10.5194/essd-11-865-2019" ext-link-type="DOI">10.5194/essd-11-865-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx46"><?xmltex \def\ref@label{{Merkouriadi et~al.(2017)Merkouriadi, Gallet, Graham, Liston,
Polashenski, R{\"{o}}sel, and Gerland}}?><label>Merkouriadi et al.(2017)Merkouriadi, Gallet, Graham, Liston,
Polashenski, Rösel, and Gerland</label><?label merkouriadi2017winter?><mixed-citation>Merkouriadi, I., Gallet, J.-C., Graham, R. M., Liston, G. E., Polashenski, C.,
Rösel, A., and Gerland, S.: Winter snow conditions on Arctic sea ice
north of Svalbard during the Norwegian young sea ICE (N-ICE2015) expedition,
J. Geophys. Res.-Atmos., 122, 10837–10854,
<ext-link xlink:href="https://doi.org/10.1002/2016JD026035" ext-link-type="DOI">10.1002/2016JD026035</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx47"><?xmltex \def\ref@label{{Nguyen and Guo(2007)}}?><label>Nguyen and Guo(2007)</label><?label nguyen2007comparisons?><mixed-citation>Nguyen, N. and Guo, Y.: Comparisons of sequence labeling algorithms and
extensions, in: Proceedings of the 24th International Conference on Machine
Learning, pp. 681–688, <ext-link xlink:href="https://doi.org/10.1145/1273496.1273582" ext-link-type="DOI">10.1145/1273496.1273582</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx48"><?xmltex \def\ref@label{{Nicolaus et~al.(2022)Nicolaus, Perovich, Spreen, Granskog, von
Albedyll, Angelopoulos, Anhaus, Arndt, Belter, Bessonov, Birnbaum, Brauchle,
Calmer, Cardellach, Cheng, Clemens-Sewall, Dadic, Damm, de~Boer, Demir,
Dethloff, Divine, Fong, Fons, Frey, Fuchs, Gabarró, Gerland, Goessling,
Gradinger, Haapala, Haas, Hamilton, Hannula, Hendricks, Herber, Heuzé,
Hoppmann, Høyland, Huntemann, Hutchings, Hwang, Itkin, Jacobi, Jaggi,
Jutila, Kaleschke, Katlein, Kolabutin, Krampe, Kristensen, Krumpen, Kurtz,
Lampert, Lange, Lei, Light, Linhardt, Liston, Loose, Macfarlane, Mahmud,
Matero, Maus, Morgenstern, Naderpour, Nandan, Niubom, Oggier, Oppelt,
Pätzold, Perron, Petrovsky, Pirazzini, Polashenski, Rabe, Raphael, Regnery,
Rex, Ricker, Riemann-Campe, Rinke, Rohde, Salganik, Scharien, Schiller,
Schneebeli, Semmling, Shimanchuk, Shupe, Smith, Smolyanitsky, Sokolov,
Stanton, Stroeve, Thielke, Timofeeva, Tonboe, Tavri, Tsamados, Wagner,
Watkins, Webster, and Wendisch}}?><label>Nicolaus et al.(2022)Nicolaus, Perovich, Spreen, Granskog, von
Albedyll, Angelopoulos, Anhaus, Arndt, Belter, Bessonov, Birnbaum, Brauchle,
Calmer, Cardellach, Cheng, Clemens-Sewall, Dadic, Damm, de Boer, Demir,
Dethloff, Divine, Fong, Fons, Frey, Fuchs, Gabarró, Gerland, Goessling,
Gradinger, Haapala, Haas, Hamilton, Hannula, Hendricks, Herber, Heuzé,
Hoppmann, Høyland, Huntemann, Hutchings, Hwang, Itkin, Jacobi, Jaggi,
Jutila, Kaleschke, Katlein, Kolabutin, Krampe, Kristensen, Krumpen, Kurtz,
Lampert, Lange, Lei, Light, Linhardt, Liston, Loose, Macfarlane, Mahmud,
Matero, Maus, Morgenstern, Naderpour, Nandan, Niubom, Oggier, Oppelt,
Pätzold, Perron, Petrovsky, Pirazzini, Polashenski, Rabe, Raphael, Regnery,
Rex, Ricker, Riemann-Campe, Rinke, Rohde, Salganik, Scharien, Schiller,
Schneebeli, Semmling, Shimanchuk, Shupe, Smith, Smolyanitsky, Sokolov,
Stanton, Stroeve, Thielke, Timofeeva, Tonboe, Tavri, Tsamados, Wagner,
Watkins, Webster, and Wendisch</label><?label nicolaus2022overview?><mixed-citation>Nicolaus, M., Perovich, D. K., Spreen, G., Granskog, M. A., von Albedyll, L.,
Angelopoulos, M., Anhaus, P., Arndt, S., Belter, H. J., Bessonov, V.,
Birnbaum, G., Brauchle, J., Calmer, R., Cardellach, E., Cheng, B.,
Clemens-Sewall, D., Dadic, R., Damm, E., de Boer, G., Demir, O., Dethloff,
K., Divine, D. V., Fong, A. A., Fons, S., Frey, M. M., Fuchs, N., Gabarró,
C., Gerland, S., Goessling, H. F., Gradinger, R., Haapala, J., Haas, C.,
Hamilton, J., Hannula, H.-R., Hendricks, S., Herber, A., Heuzé, C.,
Hoppmann, M., Høyland, K. V., Huntemann, M., Hutchings, J. K., Hwang, B.,
Itkin, P., Jacobi, H.-W., Jaggi, M., Jutila, A., Kaleschke, L., Katlein, C.,
Kolabutin, N., Krampe, D., Kristensen, S. S., Krumpen, T., Kurtz, N.,
Lampert, A., Lange, B. A., Lei, R., Light, B., Linhardt, F., Liston, G. E.,
Loose, B., Macfarlane, A. R., Mahmud, M., Matero, I. O., Maus, S.,
Morgenstern, A., Naderpour, R., Nandan, V., Niubom, A., Oggier, M., Oppelt,
N., Pätzold, F., Perron, C., Petrovsky, T., Pirazzini, R., Polashenski, C.,
Rabe, B., Raphael, I. A., Regnery, J., Rex, M., Ricker, R., Riemann-Campe,
K., Rinke, A., Rohde, J., Salganik, E., Scharien, R. K., Schiller, M.,
Schneebeli, M., Semmling, M., Shimanchuk, E., Shupe, M. D., Smith, M. M.,
Smolyanitsky, V., Sokolov, V., Stanton, T., Stroeve, J., Thielke, L.,
Timofeeva, A., Tonboe, R. T., Tavri, A., Tsamados, M., Wagner, D. N.,
Watkins, D., Webster, M., and Wendisch, M.: Overview of the MOSAiC
expedition: Snow and sea ice, Elementa: Science of the Anthropocene, 10,
<ext-link xlink:href="https://doi.org/10.1525/elementa.2021.000046" ext-link-type="DOI">10.1525/elementa.2021.000046</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx49"><?xmltex \def\ref@label{{Nixdorf et~al.(2021)Nixdorf, Dethloff, Rex, Shupe, Sommerfeld,
Perovich, Nicolaus, Heuzé, Rabe, Loose, Damm, Gradinger, Fong, Maslowski,
Rinke, Kwok, Spreen, Wendisch, Herber, Hirsekorn, Mohaupt, Frickenhaus,
Immerz, Weiss-Tuider, König, Mengedoht, Regnery, Gerchow, Ransby, Krumpen,
Morgenstern, Haas, Kanzow, Rack, Saitzev, Sokolov, Makarov, Schwarze,
Wunderlich, Wurr, and Boetius}}?><label>Nixdorf et al.(2021)Nixdorf, Dethloff, Rex, Shupe, Sommerfeld,
Perovich, Nicolaus, Heuzé, Rabe, Loose, Damm, Gradinger, Fong, Maslowski,
Rinke, Kwok, Spreen, Wendisch, Herber, Hirsekorn, Mohaupt, Frickenhaus,
Immerz, Weiss-Tuider, König, Mengedoht, Regnery, Gerchow, Ransby, Krumpen,
Morgenstern, Haas, Kanzow, Rack, Saitzev, Sokolov, Makarov, Schwarze,
Wunderlich, Wurr, and Boetius</label><?label nixdorf2021MOSAiC?><mixed-citation>Nixdorf, U., Dethloff, K., Rex, M., Shupe, M., Sommerfeld, A., Perovich, D. K.,
Nicolaus, M., Heuzé, C., Rabe, B., Loose, B., Damm, E., Gradinger, R., Fong,
A., Maslowski, W., Rinke, A., Kwok, R., Spreen, G., Wendisch, M., Herber, A.,
Hirsekorn, M., Mohaupt, V., Frickenhaus, S., Immerz, A., Weiss-Tuider, K.,
König, B., Mengedoht, D., Regnery, J., Gerchow, P., Ransby, D., Krumpen, T.,
Morgenstern, A., Haas, C., Kanzow, T., Rack, F. R., Saitzev, V., Sokolov, V.,
Makarov, A., Schwarze, S., Wunderlich, T., Wurr, K., and Boetius, A.: MOSAiC
Extended Acknowledgement, Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.5541624" ext-link-type="DOI">10.5281/zenodo.5541624</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx50"><?xmltex \def\ref@label{{IPCC(2022)}}?><label>IPCC(2022)</label><?label portner2019ocean?><mixed-citation>IPCC: The Ocean and Cryosphere in a Changing
Climate: Special Report of the Intergovernmental Panel on Climate
Change, Cambridge University Press, Cambridge, <ext-link xlink:href="https://doi.org/10.1017/9781009157964" ext-link-type="DOI">10.1017/9781009157964</ext-link>,
2022.</mixed-citation></ref>
      <ref id="bib1.bibx51"><?xmltex \def\ref@label{{Pedregosa et~al.(2011)Pedregosa, Varoquaux, Gramfort, Michel,
Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos,
Cournapeau, Brucher, Perrot, and {{\'{E}}}douard Duchesnay}}?><label>Pedregosa et al.(2011)Pedregosa, Varoquaux, Gramfort, Michel,
Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos,
Cournapeau, Brucher, Perrot, and Édouard Duchesnay</label><?label scikit-learn?><mixed-citation>Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, É.:
Scikit-learn: Machine Learning in Python, J. Mach.
Learn. Res., 12, 2825–2830,
<uri>http://jmlr.org/papers/v12/pedregosa11a.html</uri> (last access: 3 August 2023), 2011.</mixed-citation></ref>
      <ref id="bib1.bibx52"><?xmltex \def\ref@label{{Pfeffer and Mrugala(2002)}}?><label>Pfeffer and Mrugala(2002)</label><?label Pfeffer2002?><mixed-citation>Pfeffer, W. T. and Mrugala, R.: Temperature gradient and initial snow density
as controlling factors in the formation and structure of hard depth hoar,
J. Glaciol., 48, 485–494, <ext-link xlink:href="https://doi.org/10.3189/172756502781831098" ext-link-type="DOI">10.3189/172756502781831098</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx53"><?xmltex \def\ref@label{{Proksch et~al.(2015)Proksch, L{\"{o}}we, and
Schneebeli}}?><label>Proksch et al.(2015)Proksch, Löwe, and
Schneebeli</label><?label proksch2015density?><mixed-citation>Proksch, M., Löwe, H., and Schneebeli, M.: Density, specific surface area,
and correlation length of snow measured by high-resolution penetrometry,
J. Geophys. Res.-Ea. Surf., 120, 346–362,
<ext-link xlink:href="https://doi.org/10.1002/2014JF003266" ext-link-type="DOI">10.1002/2014JF003266</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx54"><?xmltex \def\ref@label{{Rinke et~al.(2021)Rinke, Cassano, Cassano, Jaiser, and
Handorf}}?><label>Rinke et al.(2021)Rinke, Cassano, Cassano, Jaiser, and
Handorf</label><?label Rinke2021meteorological?><mixed-citation>Rinke, A., Cassano, J. J., Cassano, E. N., Jaiser, R., and Handorf, D.:
Meteorological conditions during the MOSAiC expedition: Normal or
anomalous?, Elementa: Science of the Anthropocene, 9, 00023,
<ext-link xlink:href="https://doi.org/10.1525/elementa.2021.00023" ext-link-type="DOI">10.1525/elementa.2021.00023</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx55"><?xmltex \def\ref@label{{Russell and Norvig(2021)}}?><label>Russell and Norvig(2021)</label><?label russell2002artificial?><mixed-citation>
Russell, S. J. and Norvig, P.: Artificial intelligence: a modern approach,
Pearson series in artificial intelligence, Pearson, Hoboken, 4th edn.,
1136 pp., ISBN 978-0-13-461099-3, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx56"><?xmltex \def\ref@label{{Sandells et~al.(2023)Sandells, Rutter, Wivell, Essery, Fox, Harlow,
Picard, Roy, Royer, and Toose}}?><label>Sandells et al.(2023)Sandells, Rutter, Wivell, Essery, Fox, Harlow,
Picard, Roy, Royer, and Toose</label><?label sandells_simulation_2023?><mixed-citation>Sandells, M., Rutter, N., Wivell, K., Essery, R., Fox, S., Harlow, C., Picard, G., Roy, A., Royer, A., and Toose, P.: Simulation of Arctic snow microwave emission in surface-sensitive atmosphere channels, EGUsphere [preprint], <ext-link xlink:href="https://doi.org/10.5194/egusphere-2023-696" ext-link-type="DOI">10.5194/egusphere-2023-696</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx57"><?xmltex \def\ref@label{{Satyawali et~al.(2009)Satyawali, Schneebeli, Pielmeier, Stucki, and
Singh}}?><label>Satyawali et al.(2009)Satyawali, Schneebeli, Pielmeier, Stucki, and
Singh</label><?label satyawali2009preliminary?><mixed-citation>Satyawali, P., Schneebeli, M., Pielmeier, C., Stucki, T., and Singh, A.:
Preliminary characterization of Alpine snow using SnowMicroPen, Cold Reg.
Sci. Technol., 55, 311–320,
<ext-link xlink:href="https://doi.org/10.1016/j.coldregions.2008.09.003" ext-link-type="DOI">10.1016/j.coldregions.2008.09.003</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx58"><?xmltex \def\ref@label{{Schneebeli et~al.(1999)Schneebeli, Pielmeier, and
Johnson}}?><label>Schneebeli et al.(1999)Schneebeli, Pielmeier, and
Johnson</label><?label schneebeli1999measuring?><mixed-citation>Schneebeli, M., Pielmeier, C., and Johnson, J. B.: Measuring snow
microstructure and hardness using a high resolution penetrometer, Cold
Reg. Sci. Technol., 30, 101–114,
<ext-link xlink:href="https://doi.org/10.1016/S0165-232X(99)00030-0" ext-link-type="DOI">10.1016/S0165-232X(99)00030-0</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx59"><?xmltex \def\ref@label{{Sch{\"{o}}lkopf and Smola(2002)Sch{\"{o}}lkopf, Smola, Bach
et~al.}}?><label>Schölkopf and Smola(2002)Schölkopf, Smola, Bach
et al.</label><?label scholkopf2002learning?><mixed-citation>
Schölkopf, B. and Smola, A. J.: Learning with kernels:
support vector machines, regularization, optimization, and beyond, MIT Press, 626 pp.,
ISBN 0-262-19475-9,
2002.</mixed-citation></ref>
      <ref id="bib1.bibx60"><?xmltex \def\ref@label{{Schuster and Paliwal(1997)}}?><label>Schuster and Paliwal(1997)</label><?label schuster1997bidirectional?><mixed-citation>Schuster, M. and Paliwal, K. K.: Bidirectional recurrent neural networks, IEEE
Transactions on Signal Processing, 45, 2673–2681, <ext-link xlink:href="https://doi.org/10.1109/78.650093" ext-link-type="DOI">10.1109/78.650093</ext-link>,
1997.</mixed-citation></ref>
      <ref id="bib1.bibx61"><?xmltex \def\ref@label{{Soni and Mathai(2015)}}?><label>Soni and Mathai(2015)</label><?label soni2015improved?><mixed-citation>Soni, R. and Mathai, K. J.: Improved Twitter Sentiment Prediction through
Cluster-then-Predict Model, arXiv [preprint],
<ext-link xlink:href="https://doi.org/10.48550/arXiv.1509.02437" ext-link-type="DOI">10.48550/arXiv.1509.02437</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx62"><?xmltex \def\ref@label{{Steger et~al.(2013)Steger, Kotlarski, Jonas, and
Sch{\"{a}}r}}?><label>Steger et al.(2013)Steger, Kotlarski, Jonas, and
Schär</label><?label steger2013alpine?><mixed-citation>Steger, C., Kotlarski, S., Jonas, T., and Schär, C.: Alpine snow cover in a
changing climate: a regional climate model perspective, Clim. Dynam., 41,
735–754, <ext-link xlink:href="https://doi.org/10.1007/s00382-012-1545-3" ext-link-type="DOI">10.1007/s00382-012-1545-3</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx63"><?xmltex \def\ref@label{{Stone(1974)}}?><label>Stone(1974)</label><?label stone1974cross?><mixed-citation>Stone, M.: Cross-validatory choice and assessment of statistical predictions,
J. R. Stat. Soc. B, 36,
111–133, <ext-link xlink:href="https://doi.org/10.1111/j.2517-6161.1974.tb00994.x" ext-link-type="DOI">10.1111/j.2517-6161.1974.tb00994.x</ext-link>, 1974.</mixed-citation></ref>
      <ref id="bib1.bibx64"><?xmltex \def\ref@label{{Sturm and Liston(2021)}}?><label>Sturm and Liston(2021)</label><?label sturm_revisiting_2021?><mixed-citation>Sturm, M. and Liston, G. E.: Revisiting the Global Seasonal Snow
Classification: An Updated Dataset for Earth System
Applications, J. Hydrometeorol., 22, 2917–2938,
<ext-link xlink:href="https://doi.org/10.1175/JHM-D-21-0070.1" ext-link-type="DOI">10.1175/JHM-D-21-0070.1</ext-link>, 2021.</mixed-citation></ref>
      <?pagebreak page4550?><ref id="bib1.bibx65"><?xmltex \def\ref@label{{Sturm and Massom(2017)}}?><label>Sturm and Massom(2017)</label><?label sturm2017snow?><mixed-citation>Sturm, M. and Massom, R. A.: Snow in the sea ice system: friend or foe?, in:
Sea Ice, John Wiley &amp; Sons, Ltd, pp. 65–109,
<ext-link xlink:href="https://doi.org/10.1002/9781118778371.ch3" ext-link-type="DOI">10.1002/9781118778371.ch3</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx66"><?xmltex \def\ref@label{{Theodorou et~al.(2014)Theodorou, Mporas, and
Fakotakis}}?><label>Theodorou et al.(2014)Theodorou, Mporas, and
Fakotakis</label><?label theodorou2014overview?><mixed-citation>Theodorou, T., Mporas, I., and Fakotakis, N.: An Overview of Automatic Audio
Segmentation, International Journal of Information Technology and Computer
Science, 6, 1–9, <ext-link xlink:href="https://doi.org/10.5815/ijitcs.2014.11.01" ext-link-type="DOI">10.5815/ijitcs.2014.11.01</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx67"><?xmltex \def\ref@label{{Trivedi et~al.(2015)Trivedi, Pardos, and
Heffernan}}?><label>Trivedi et al.(2015)Trivedi, Pardos, and
Heffernan</label><?label trivedi2015utility?><mixed-citation>Trivedi, S., Pardos, Z. A., and Heffernan, N. T.: The Utility of Clustering
in Prediction Tasks, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1509.06163" ext-link-type="DOI">10.48550/arXiv.1509.06163</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx68"><?xmltex \def\ref@label{{Vionnet et~al.(2012)Vionnet, Brun, Morin, Boone, Faroux, Le~Moigne,
Martin, and Willemet}}?><label>Vionnet et al.(2012)Vionnet, Brun, Morin, Boone, Faroux, Le Moigne,
Martin, and Willemet</label><?label Vionnet2012?><mixed-citation>Vionnet, V., Brun, E., Morin, S., Boone, A., Faroux, S., Le Moigne, P., Martin, E., and Willemet, J.-M.: The detailed snowpack scheme Crocus and its implementation in SURFEX v7.2, Geosci. Model Dev., 5, 773–791, <ext-link xlink:href="https://doi.org/10.5194/gmd-5-773-2012" ext-link-type="DOI">10.5194/gmd-5-773-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx69"><?xmltex \def\ref@label{{Wever et~al.(2022)Wever, Keenan, Kausch, and
Lehning}}?><label>Wever et al.(2022)Wever, Keenan, Kausch, and
Lehning</label><?label wever2022snowmicropen?><mixed-citation>Wever, N., Keenan, E., Kausch, T., and Lehning, M.: SnowMicroPen measurements
and manual snowpits from Dronning Maud Land, East Antarctica, EnviDat,
<ext-link xlink:href="https://doi.org/10.16904/envidat.331" ext-link-type="DOI">10.16904/envidat.331</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx70"><?xmltex \def\ref@label{{Wu et~al.(2008)Wu, Kumar, Quinlan, Ghosh, Yang, Motoda, McLachlan,
Ng, Liu, Philip et~al.}}?><label>Wu et al.(2008)Wu, Kumar, Quinlan, Ghosh, Yang, Motoda, McLachlan,
Ng, Liu, Philip et al.</label><?label wu2008top?><mixed-citation>Wu, X., Kumar, V., Quinlan, J. R., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G. J., Ng, A., Liu, B., Yu, P. S., Zhou, Z.-H., Steinbach, M., Hand, D. J., and Steinberg, D.: Top 10 algorithms in data
mining, Knowl. Inf. Syst., 14, 1–37,
<ext-link xlink:href="https://doi.org/10.1007/s10115-007-0114-2" ext-link-type="DOI">10.1007/s10115-007-0114-2</ext-link>, 2008.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx71"><?xmltex \def\ref@label{{Yarowsky(1995)}}?><label>Yarowsky(1995)</label><?label yarowsky1995unsupervised?><mixed-citation>Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised
methods, in: 33rd Annual Meeting of the Association for Computational
Linguistics, pp. 189–196, <ext-link xlink:href="https://doi.org/10.3115/981658.981684" ext-link-type="DOI">10.3115/981658.981684</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx72"><?xmltex \def\ref@label{{Yoshua et~al.(2006)Yoshua, Olivier, and Nicolas~Le}}?><label>Yoshua et al.(2006)Yoshua, Olivier, and Nicolas Le</label><?label bengio200611?><mixed-citation>Bengio, Y., Delalleau, O., and Le Roux, N.: Label Propagation and
Quadratic Criterion, in: Semi-Supervised Learning, The MIT Press,
<ext-link xlink:href="https://doi.org/10.7551/mitpress/9780262033589.003.0011" ext-link-type="DOI">10.7551/mitpress/9780262033589.003.0011</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx73"><?xmltex \def\ref@label{{Zhou et~al.(2003)Zhou, Bousquet, Lal, Weston, and
Schölkopf}}?><label>Zhou et al.(2003)Zhou, Bousquet, Lal, Weston, and
Schölkopf</label><?label Zhou04learningwith?><mixed-citation>Zhou, D., Bousquet, O., Lal, T., Weston, J., and Schölkopf, B.: Learning with
Local and Global Consistency, in: Advances in Neural Information
Processing Systems, edited by: Thrun, S., Saul, L., and Schölkopf, B.,
vol. 16, pp. 321–328, MIT Press,
<uri>https://proceedings.neurips.cc/paper_files/paper/2003/file/87682805257e619d49b8e0dfdc14affa-Paper.pdf</uri> (last access: 3 August 2023),
2003.</mixed-citation></ref>
      <ref id="bib1.bibx74"><?xmltex \def\ref@label{{Zhu and Ghahramani(2002)}}?><label>Zhu and Ghahramani(2002)</label><?label zhu2002learning?><mixed-citation>Zhu, X. and Ghahramani, Z.: Learning from labeled and unlabeled data with label
propagation, Tech. rep., Carnegie Mellon University,
<uri>https://mlg.eng.cam.ac.uk/zoubin/papers/CMU-CALD-02-107.pdf</uri> (last access: 3 August 2023),
2002.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Automatic snow type classification of snow micropenetrometer profiles with machine learning algorithms</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Abadi et al.(2016)Abadi, Agarwal, Barham, Brevdo, Chen, Citro,
Corrado, Davis, Dean, Devin, Ghemawat, Goodfellow, Harp, Irving, Isard, Jia,
Jozefowicz, Kaiser, Kudlur, Levenberg, Mane, Monga, Moore, Murray, Olah,
Schuster, Shlens, Steiner, Sutskever, Talwar, Tucker, Vanhoucke, Vasudevan,
Viegas, Vinyals, Warden, Wattenberg, Wicke, Yu, and
Zheng</label><mixed-citation>
      
Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado,
G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp,
A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M.,
Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C.,
Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P.,
Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., Warden, P.,
Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow:
Large-Scale Machine Learning on Heterogeneous Distributed
Systems, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1603.04467" target="_blank">https://doi.org/10.48550/arXiv.1603.04467</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Bahdanau et al.(2014)Bahdanau, Cho, and Bengio</label><mixed-citation>
      
Bahdanau, D., Cho, K., and Bengio, Y.: Neural Machine Translation by Jointly
Learning to Align and Translate, arXiv [preprint], <a href="https://doi.org/10.48550/ARXIV.1409.0473" target="_blank">https://doi.org/10.48550/ARXIV.1409.0473</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Bishop(2006)</label><mixed-citation>
      
Bishop, C. M.: Pattern recognition and machine learning, Information science
and statistics, Springer, New York, 738 pp., ISBN 978-0-387-31073-2, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Breiman(2001)</label><mixed-citation>
      
Breiman, L.: Random forests, Mach. Learn., 45, 5–32,
<a href="https://doi.org/10.1023/A:1010933404324" target="_blank">https://doi.org/10.1023/A:1010933404324</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Calonne et al.(2020)Calonne, Richter, Löwe, Cetti, Ter Schure,
Van Herwijnen, Fierz, Jaggi, and Schneebeli</label><mixed-citation>
      
Calonne, N., Richter, B., Löwe, H., Cetti, C., ter Schure, J., Van Herwijnen, A., Fierz, C., Jaggi, M., and Schneebeli, M.: The RHOSSA campaign: multi-resolution monitoring of the seasonal evolution of the structure and mechanical stability of an alpine snowpack, The Cryosphere, 14, 1829–1848, <a href="https://doi.org/10.5194/tc-14-1829-2020" target="_blank">https://doi.org/10.5194/tc-14-1829-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Chao et al.(2004)Chao, Liaw, and Breiman</label><mixed-citation>
      
Chao, C., Liaw, A., and Breiman, L.: Using random forest to learn imbalanced
data, Tech. Reports 666, University of California, Dep. Statistics,
Berkeley, <a href="https://statistics.berkeley.edu/tech-reports/666" target="_blank"/> (last access: 3 August 2023),
2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Chollet et al.(2015)</label><mixed-citation>
      
Chollet, F. et al.: Keras,  GitHub, <a href="https://github.com/fchollet/keras" target="_blank"/>  (last access: 3 August 2023), 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Colbeck(1987)</label><mixed-citation>
      
Colbeck, S.: A review of the metamorphism and classification of seasonal snow
cover crystals, IAHS Publication, 162, 3–24,
<a href="https://iahs.info/uploads/dms/6807.3-34-162-Colbeck.pdf" target="_blank"/>,
1987.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Coléou et al.(2001)Coléou, Lesaffre, Brzoska, Ludwig, and
Boller</label><mixed-citation>
      
Coléou, C., Lesaffre, B., Brzoska, J.-B., Ludwig, W., and Boller, E.:
Three-dimensional snow images by X-ray microtomography, Ann.
Glaciol., 32, 75–81, <a href="https://doi.org/10.3189/172756401781819418" target="_blank">https://doi.org/10.3189/172756401781819418</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Cortes and Vapnik(1995)</label><mixed-citation>
      
Cortes, C. and Vapnik, V.: Support-vector networks, Mach. Learn., 20,
273–297, <a href="https://doi.org/10.1007/BF00994018" target="_blank">https://doi.org/10.1007/BF00994018</a>, 1995.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Cover and Hart(1967)</label><mixed-citation>
      
Cover, T. and Hart, P.: Nearest neighbor pattern classification, IEEE
Transactions on Information Theory, 13, 21–27,
<a href="https://doi.org/10.1109/TIT.1967.1053964" target="_blank">https://doi.org/10.1109/TIT.1967.1053964</a>, 1967.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>CyberZHG(2020)</label><mixed-citation>
      
CyberZHG: Keras Self-Attention, GitHub,
<a href="https://github.com/CyberZHG/keras-self-attention" target="_blank"/> (last access: 3 August 2023), 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Domine et al.(2019)Domine, Picard, Morin, Barrere, Madore, and
Langlois</label><mixed-citation>
      
Domine, F., Picard, G., Morin, S., Barrere, M., Madore, J.-B., and Langlois,
A.: Major Issues in Simulating Some Arctic Snowpack Properties
Using Current Detailed Snow Physics Models: Consequences for
the Thermal Regime and Water Budget of Permafrost, J.
Adv. Model. Earth Syst., 11, 34–44, <a href="https://doi.org/10.1029/2018MS001445" target="_blank">https://doi.org/10.1029/2018MS001445</a>,
2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Douville et al.(1995)Douville, Royer, and Mahfouf</label><mixed-citation>
      
Douville, H., Royer, J. F., and Mahfouf, J. F.: A new snow parameterization for
the Météo-France climate model, Clim. Dynam., 12, 21–35,
<a href="https://doi.org/10.1007/BF00208760" target="_blank">https://doi.org/10.1007/BF00208760</a>, 1995.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Fierz et al.(2009)Fierz, Armstrong, Durand, Etchevers, Greene,
Mcclung, Nishimura, Satyawali, and Sokratov</label><mixed-citation>
      
Fierz, C., Armstrong, R., Durand, Y., Etchevers, P., Greene, E., Mcclung,
D. M., Nishimura, K., Satyawali, P., and Sokratov, S.: The international
classification for seasonal snow on the ground, Tech. rep., UNESCO-IHP,
Paris, Paris, publication Title: IHP-VII Technical Documents in Hydrology
No. 83, IACS Contribution No. 1, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Fix and Hodges(1952)</label><mixed-citation>
      
Fix, E. and Hodges, J. L.: Discriminatory Analysis - Nonparametric
Discrimination: Small Sample Performance, Tech. rep., California
University Berkeley,
<a href="https://apps.dtic.mil/sti/citations/ADA800391" target="_blank"/>  (last access: 3 August 2023), section:
Technical Reports, 1952.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Forgy(1965)</label><mixed-citation>
      
Forgy, E. W.: Cluster analysis of multivariate data: efficiency versus
interpretability of classifications, Biometrics, 21, 768–769, 1965.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Ghahramani(2004)</label><mixed-citation>
      
Ghahramani, Z.: Unsupervised Learning, pp. 72–112, Springer Berlin Heidelberg,
Berlin, Heidelberg, <a href="https://doi.org/10.1007/978-3-540-28650-9_5" target="_blank">https://doi.org/10.1007/978-3-540-28650-9_5</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Han et al.(2012)Han, Kamber, and Pei</label><mixed-citation>
      
Han, J., Kamber, M., and Pei, J.: 9 – Classification: Advanced Methods, in:
Data Mining (Third Edition), edited by: Han, J., Kamber, M., and Pei, J., The
Morgan Kaufmann Series in Data Management Systems, pp. 393–442, Morgan
Kaufmann, Boston, third edition edn.,
<a href="https://doi.org/10.1016/B978-0-12-381479-1.00009-5" target="_blank">https://doi.org/10.1016/B978-0-12-381479-1.00009-5</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Havens et al.(2010)Havens, Marshall, Steiner, and
Tedesco</label><mixed-citation>
      
Havens, S., Marshall, H.-P., Steiner, N., and Tedesco, M.: Snow micro
penetrometer and near infrared photography for grain type classification, in:
2010 International Snow Science Workshop, pp. 465–469,
<a href="https://arc.lib.montana.edu/snow-science/objects/ISSW_P-029.pdf" target="_blank"/>  (last access: 3 August 2023),
2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Havens et al.(2013)Havens, Marshall, Pielmeier, and
Elder</label><mixed-citation>
      
Havens, S., Marshall, H.-P., Pielmeier, C., and Elder, K.: Automatic Grain
Type Classification of Snow Micro Penetrometer Signals With
Random Forests, IEEE Transactions on Geoscience and Remote Sensing, 51,
3328–3335, <a href="https://doi.org/10.1109/TGRS.2012.2220549" target="_blank">https://doi.org/10.1109/TGRS.2012.2220549</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Herla et al.(2021)Herla, Horton, Mair, and Haegeli</label><mixed-citation>
      
Herla, F., Horton, S., Mair, P., and Haegeli, P.: Snow profile alignment and similarity assessment for aggregating, clustering, and evaluating snowpack model output for avalanche forecasting, Geosci. Model Dev., 14, 239–258, <a href="https://doi.org/10.5194/gmd-14-239-2021" target="_blank">https://doi.org/10.5194/gmd-14-239-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Ho(1995)</label><mixed-citation>
      
Ho, T. K.: Random decision forests, in: Proceedings of 3rd international
conference on document analysis and recognition, vol. 1, pp. 278–282, IEEE,
1995.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Hochreiter and Schmidhuber(1997)</label><mixed-citation>
      
Hochreiter, S. and Schmidhuber, J.: Long short-term memory,
Neural Computat.,
9, 1735–1780, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Ismail Fawaz et al.(2019)Ismail Fawaz, Forestier, Weber, Idoumghar,
and Muller</label><mixed-citation>
      
Ismail Fawaz, H., Forestier, G., Weber, J., Idoumghar, L., and Muller, P.-A.:
Deep learning for time series classification: a review, Data Min. Knowl. Disc., 33, 917–963, <a href="https://doi.org/10.1007/s10618-019-00619-1" target="_blank">https://doi.org/10.1007/s10618-019-00619-1</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Johnson and Schneebeli(1998)</label><mixed-citation>
      
Johnson, J. B. and Schneebeli, M.: Snow strength penetrometer,
<a href="https://patents.google.com/patent/US5831161/en" target="_blank"/>  (last access: 3 August 2023), 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Jurafsky and Martin(2021)</label><mixed-citation>
      
Jurafsky, D. and Martin, J. H.: Speech and Language Processing: An Introduction
to Natural Language Processing, Computational Linguistics, and Speech
Recognition, 3rd ed. draft, in
progress,
<a href="https://web.stanford.edu/~jurafsky/slp3/" target="_blank"/>  (last access: 3 August 2023), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Kaltenborn and vclay(2022)</label><mixed-citation>
      
Kaltenborn, J. and vclay: liellnima/snowdragon: Snowdragon Release 1.0.0 (v1.0.0), Zenodo [code], <a href="https://doi.org/10.5281/zenodo.7335813" target="_blank">https://doi.org/10.5281/zenodo.7335813</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Kaltenborn et al.(2021)Kaltenborn, Clay, Macfarlane, and
Schneebeli</label><mixed-citation>
      
Kaltenborn, J., Clay, V., Macfarlane, A. R., and Schneebeli, M.: Machine
Learning for Snow Stratigraphy Classification, in: NeurIPS 2021 Workshop on
Tackling Climate Change with Machine Learning,
<a href="https://www.climatechange.ai/papers/neurips2021/48" target="_blank">https://www.climatechange.ai/papers/neurips2021/48</a> (last access: 3 August 2023), 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Kaltenborn et al.(2022)Kaltenborn, Macfarlane, Clay, and
Schneebeli</label><mixed-citation>
      
Kaltenborn, J., Macfarlane, A. R., Clay, V., and Schneebeli, M.: Pre-trained
Models for SMP Classification and Segmentation, Zenodo [code], <a href="https://doi.org/10.5281/zenodo.7063521" target="_blank">https://doi.org/10.5281/zenodo.7063521</a>,
2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>King et al.(2015)King, Kelly, Kasurak, Duguay, Gunn, Rutter, Watts,
and Derksen</label><mixed-citation>
      
King, J., Kelly, R., Kasurak, A., Duguay, C., Gunn, G., Rutter, N., Watts, T.,
and Derksen, C.: Spatio-temporal influence of tundra snow properties on
Ku-band (17.2 GHz) backscatter, J. Glaciol., 61, 267–279,
<a href="https://doi.org/10.3189/2015JoG14J020" target="_blank">https://doi.org/10.3189/2015JoG14J020</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>King et al.(2020a)King, Howell, Brady, Toose, Derksen,
Haas, and Beckers</label><mixed-citation>
      
King, J., Howell, S., Brady, M., Toose, P., Derksen, C., Haas, C., and Beckers, J.: Local-scale variability of snow density on Arctic sea ice, The Cryosphere, 14, 4323–4339, <a href="https://doi.org/10.5194/tc-14-4323-2020" target="_blank">https://doi.org/10.5194/tc-14-4323-2020</a>, 2020a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>King et al.(2020b)King, Howell, Brady, Toose, Derksen,
Haas, and Beckers</label><mixed-citation>
      
King, J., Howell, S., Brady, M., Toose, P., Derksen, C., Haas, C., and Beckers,
J.: SnowMicroPen Measurements on Sea Ice 2016–2017, Zenodo [data set],
<a href="https://doi.org/10.5281/zenodo.4068349" target="_blank">https://doi.org/10.5281/zenodo.4068349</a>, 2020b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Knust(2017)</label><mixed-citation>
      
Knust, R.: Polar Research and Supply Vessel POLARSTERN Operated by
the Alfred-Wegener-Institute, Journal of Large-Scale Research
Facilities, 3, A119, <a href="https://doi.org/10.17815/jlsrf-3-163" target="_blank">https://doi.org/10.17815/jlsrf-3-163</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Lemaître et al.(2017)Lemaître, Nogueira, and
Aridas</label><mixed-citation>
      
Lemaître, G., Nogueira, F., and Aridas, C. K.: Imbalanced-learn: A python
toolbox to tackle the curse of imbalanced datasets in machine learning,
J. Mach. Learn. Res., 18, 559–563,
<a href="https://www.jmlr.org/papers/volume18/16-365/16-365.pdf" target="_blank">https://www.jmlr.org/papers/volume18/16-365/16-365.pdf</a> (last access: 3 August 2023), 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Li et al.(2020)Li, Hasanaj, and Li</label><mixed-citation>
      
Li, D., Hasanaj, E., and Li, S.: 3 – Baselines,
<a href="https://blog.ml.cmu.edu/2020/08/31/3-baselines/" target="_blank">https://blog.ml.cmu.edu/2020/08/31/3-baselines/</a> (last access:
4 March 2021), 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Light et al.(2015)Light, Perovich, Webster, Polashenski, and
Dadic</label><mixed-citation>
      
Light, B., Perovich, D. K., Webster, M. A., Polashenski, C., and Dadic, R.:
Optical properties of melting first-year Arctic sea ice, J.
Geophys. Res.-Oceans, 120, 7657–7675, <a href="https://doi.org/10.1002/2015JC011163" target="_blank">https://doi.org/10.1002/2015JC011163</a>,
2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Liu et al.(2008)Liu, Wu, and Zhou</label><mixed-citation>
      
Liu, X.-Y., Wu, J., and Zhou, Z.-H.: Exploratory undersampling for
class-imbalance learning, IEEE Transactions on Systems, Man, and Cybernetics,
Part B (Cybernetics), 39, 539–550, <a href="https://doi.org/10.1109/TSMCB.2008.2007853" target="_blank">https://doi.org/10.1109/TSMCB.2008.2007853</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Lloyd(1982)</label><mixed-citation>
      
Lloyd, S.: Least squares quantization in PCM, IEEE Transactions on Information
Theory, 28, 129–137, <a href="https://doi.org/10.1109/TIT.1982.1056489" target="_blank">https://doi.org/10.1109/TIT.1982.1056489</a>, 1982.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Löwe and Van Herwijnen(2012)</label><mixed-citation>
      
Löwe, H. and Van Herwijnen, A.: A Poisson shot noise model for
micro-penetration of snow, Cold Reg. Sci. Technol., 70, 62–70,
<a href="https://doi.org/10.1016/j.coldregions.2011.09.001" target="_blank">https://doi.org/10.1016/j.coldregions.2011.09.001</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Lutz et al.(2007)Lutz, Birkeland, Kronholm, Hansen, and
Aspinall</label><mixed-citation>
      
Lutz, E., Birkeland, K. W., Kronholm, K., Hansen, K., and Aspinall, R.: Surface
hoar characteristics derived from a snow micropenetrometer using moving
window statistical operations, Cold Reg. Sci. Technol., 47,
118–133, <a href="https://doi.org/10.1016/j.coldregions.2006.08.021" target="_blank">https://doi.org/10.1016/j.coldregions.2006.08.021</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Macfarlane et al.(2023)Macfarlane, Schneebeli, Dadic, Tavri,
Immerz, Polashenski, Krampe, Clemens-Sewall, Wagner, Perovich, Hannula,
Raphael, Matero, Regnery, Smith, Nicolaus, Jaggi, Oggier, Webster, Lehning,
Kolabutin, Itkin, Naderpour, Pirazzini, Hammerle, Arndt, and
Fons</label><mixed-citation>
      
Macfarlane, A., Schneebeli, M., Dadic, R., Tavri, A., Immerz, A., Polashenski,
C., Krampe, D., Clemens-Sewall, D., Wagner, D., Perovich, D., Hannula,
H.-R., Raphael, I., Matero, I., Regnery, J., Smith, M., Nicolaus, M., Jaggi, M.,
Oggier, M., Webster, M., Lehning, M., Kolabutin, N., Itkin, P., Naderpour,
R., Pirazzini, R., Hammerle, S., Arndt, S., and Fons, S.: A Database of Snow
on Sea Ice in the Central Arctic Collected during the MOSAiC expedition,
Scientific Data, 10, 398, <a href="https://doi.org/10.1038/s41597-023-02273-1" target="_blank">https://doi.org/10.1038/s41597-023-02273-1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Macfarlane et al.(2021)Macfarlane, Schneebeli, Dadic,
Wagner, Arndt, Clemens-Sewall, Hämmerle, Hannula, Jaggi,
Kolabutin, Krampe, Lehning, Matero, Nicolaus, Oggier,
Pirazzini, Polashenski, Raphael, Regnery, Shimanchuck, Smith, and
Tavri</label><mixed-citation>
      
Macfarlane, A. R., Schneebeli, M., Dadic, R., Wagner, D. N., Arndt,
S., Clemens-Sewall, D., Hämmerle, S., Hannula, H.-R., Jaggi, M.,
Kolabutin, N., Krampe, D., Lehning, M., Matero, I., Nicolaus, M.,
Oggier, M., Pirazzini, R., Polashenski, C., Raphael, I., Regnery,
J., Shimanchuck, E., Smith, M. M., and Tavri, A.: Snowpit SnowMicroPen
(SMP) force profiles collected during the MOSAiC expedition, PANGAEA [data set],
<a href="https://doi.org/10.1594/PANGAEA.935554" target="_blank">https://doi.org/10.1594/PANGAEA.935554</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Matzl and Schneebeli(2006)</label><mixed-citation>
      
Matzl, M. and Schneebeli, M.: Measuring specific surface area of snow by
near-infrared photography, J. Glaciol., 52, 558–564,
<a href="https://doi.org/10.3189/172756506781828412" target="_blank">https://doi.org/10.3189/172756506781828412</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Ménard et al.(2019)Ménard, Essery, Barr, Bartlett, Derry,
Dumont, Fierz, Kim, Kontu, Lejeune et al.</label><mixed-citation>
      
Ménard, C. B., Essery, R., Barr, A., Bartlett, P., Derry, J., Dumont, M., Fierz, C., Kim, H., Kontu, A., Lejeune, Y., Marks, D., Niwano, M., Raleigh, M., Wang, L., and Wever, N.: Meteorological and evaluation datasets for snow modelling at 10 reference sites: description of in situ and bias-corrected reanalysis data, Earth Syst. Sci. Data, 11, 865–880, <a href="https://doi.org/10.5194/essd-11-865-2019" target="_blank">https://doi.org/10.5194/essd-11-865-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Merkouriadi et al.(2017)Merkouriadi, Gallet, Graham, Liston,
Polashenski, Rösel, and Gerland</label><mixed-citation>
      
Merkouriadi, I., Gallet, J.-C., Graham, R. M., Liston, G. E., Polashenski, C.,
Rösel, A., and Gerland, S.: Winter snow conditions on Arctic sea ice
north of Svalbard during the Norwegian young sea ICE (N-ICE2015) expedition,
J. Geophys. Res.-Atmos., 122, 10837–10854,
<a href="https://doi.org/10.1002/2016JD026035" target="_blank">https://doi.org/10.1002/2016JD026035</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Nguyen and Guo(2007)</label><mixed-citation>
      
Nguyen, N. and Guo, Y.: Comparisons of sequence labeling algorithms and
extensions, in: Proceedings of the 24th International Conference on Machine
Learning, pp. 681–688, <a href="https://doi.org/10.1145/1273496.1273582" target="_blank">https://doi.org/10.1145/1273496.1273582</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Nicolaus et al.(2022)Nicolaus, Perovich, Spreen, Granskog, von
Albedyll, Angelopoulos, Anhaus, Arndt, Belter, Bessonov, Birnbaum, Brauchle,
Calmer, Cardellach, Cheng, Clemens-Sewall, Dadic, Damm, de Boer, Demir,
Dethloff, Divine, Fong, Fons, Frey, Fuchs, Gabarró, Gerland, Goessling,
Gradinger, Haapala, Haas, Hamilton, Hannula, Hendricks, Herber, Heuzé,
Hoppmann, Høyland, Huntemann, Hutchings, Hwang, Itkin, Jacobi, Jaggi,
Jutila, Kaleschke, Katlein, Kolabutin, Krampe, Kristensen, Krumpen, Kurtz,
Lampert, Lange, Lei, Light, Linhardt, Liston, Loose, Macfarlane, Mahmud,
Matero, Maus, Morgenstern, Naderpour, Nandan, Niubom, Oggier, Oppelt,
Pätzold, Perron, Petrovsky, Pirazzini, Polashenski, Rabe, Raphael, Regnery,
Rex, Ricker, Riemann-Campe, Rinke, Rohde, Salganik, Scharien, Schiller,
Schneebeli, Semmling, Shimanchuk, Shupe, Smith, Smolyanitsky, Sokolov,
Stanton, Stroeve, Thielke, Timofeeva, Tonboe, Tavri, Tsamados, Wagner,
Watkins, Webster, and Wendisch</label><mixed-citation>
      
Nicolaus, M., Perovich, D. K., Spreen, G., Granskog, M. A., von Albedyll, L.,
Angelopoulos, M., Anhaus, P., Arndt, S., Belter, H. J., Bessonov, V.,
Birnbaum, G., Brauchle, J., Calmer, R., Cardellach, E., Cheng, B.,
Clemens-Sewall, D., Dadic, R., Damm, E., de Boer, G., Demir, O., Dethloff,
K., Divine, D. V., Fong, A. A., Fons, S., Frey, M. M., Fuchs, N., Gabarró,
C., Gerland, S., Goessling, H. F., Gradinger, R., Haapala, J., Haas, C.,
Hamilton, J., Hannula, H.-R., Hendricks, S., Herber, A., Heuzé, C.,
Hoppmann, M., Høyland, K. V., Huntemann, M., Hutchings, J. K., Hwang, B.,
Itkin, P., Jacobi, H.-W., Jaggi, M., Jutila, A., Kaleschke, L., Katlein, C.,
Kolabutin, N., Krampe, D., Kristensen, S. S., Krumpen, T., Kurtz, N.,
Lampert, A., Lange, B. A., Lei, R., Light, B., Linhardt, F., Liston, G. E.,
Loose, B., Macfarlane, A. R., Mahmud, M., Matero, I. O., Maus, S.,
Morgenstern, A., Naderpour, R., Nandan, V., Niubom, A., Oggier, M., Oppelt,
N., Pätzold, F., Perron, C., Petrovsky, T., Pirazzini, R., Polashenski, C.,
Rabe, B., Raphael, I. A., Regnery, J., Rex, M., Ricker, R., Riemann-Campe,
K., Rinke, A., Rohde, J., Salganik, E., Scharien, R. K., Schiller, M.,
Schneebeli, M., Semmling, M., Shimanchuk, E., Shupe, M. D., Smith, M. M.,
Smolyanitsky, V., Sokolov, V., Stanton, T., Stroeve, J., Thielke, L.,
Timofeeva, A., Tonboe, R. T., Tavri, A., Tsamados, M., Wagner, D. N.,
Watkins, D., Webster, M., and Wendisch, M.: Overview of the MOSAiC
expedition: Snow and sea ice, Elementa: Science of the Anthropocene, 10,
<a href="https://doi.org/10.1525/elementa.2021.000046" target="_blank">https://doi.org/10.1525/elementa.2021.000046</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Nixdorf et al.(2021)Nixdorf, Dethloff, Rex, Shupe, Sommerfeld,
Perovich, Nicolaus, Heuzé, Rabe, Loose, Damm, Gradinger, Fong, Maslowski,
Rinke, Kwok, Spreen, Wendisch, Herber, Hirsekorn, Mohaupt, Frickenhaus,
Immerz, Weiss-Tuider, König, Mengedoht, Regnery, Gerchow, Ransby, Krumpen,
Morgenstern, Haas, Kanzow, Rack, Saitzev, Sokolov, Makarov, Schwarze,
Wunderlich, Wurr, and Boetius</label><mixed-citation>
      
Nixdorf, U., Dethloff, K., Rex, M., Shupe, M., Sommerfeld, A., Perovich, D. K.,
Nicolaus, M., Heuzé, C., Rabe, B., Loose, B., Damm, E., Gradinger, R., Fong,
A., Maslowski, W., Rinke, A., Kwok, R., Spreen, G., Wendisch, M., Herber, A.,
Hirsekorn, M., Mohaupt, V., Frickenhaus, S., Immerz, A., Weiss-Tuider, K.,
König, B., Mengedoht, D., Regnery, J., Gerchow, P., Ransby, D., Krumpen, T.,
Morgenstern, A., Haas, C., Kanzow, T., Rack, F. R., Saitzev, V., Sokolov, V.,
Makarov, A., Schwarze, S., Wunderlich, T., Wurr, K., and Boetius, A.: MOSAiC
Extended Acknowledgement, Zenodo, <a href="https://doi.org/10.5281/zenodo.5541624" target="_blank">https://doi.org/10.5281/zenodo.5541624</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>IPCC(2022)</label><mixed-citation>
      
IPCC: The Ocean and Cryosphere in a Changing
Climate: Special Report of the Intergovernmental Panel on Climate
Change, Cambridge University Press, Cambridge, <a href="https://doi.org/10.1017/9781009157964" target="_blank">https://doi.org/10.1017/9781009157964</a>,
2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Pedregosa et al.(2011)Pedregosa, Varoquaux, Gramfort, Michel,
Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos,
Cournapeau, Brucher, Perrot, and Duchesnay</label><mixed-citation>
      
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, É.:
Scikit-learn: Machine Learning in Python, J. Mach.
Learn. Res., 12, 2825–2830,
<a href="http://jmlr.org/papers/v12/pedregosa11a.html" target="_blank">http://jmlr.org/papers/v12/pedregosa11a.html</a> (last access: 3 August 2023), 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Pfeffer and Mrugala(2002)</label><mixed-citation>
      
Pfeffer, W. T. and Mrugala, R.: Temperature gradient and initial snow density
as controlling factors in the formation and structure of hard depth hoar,
J. Glaciol., 48, 485–494, <a href="https://doi.org/10.3189/172756502781831098" target="_blank">https://doi.org/10.3189/172756502781831098</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Proksch et al.(2015)Proksch, Löwe, and
Schneebeli</label><mixed-citation>
      
Proksch, M., Löwe, H., and Schneebeli, M.: Density, specific surface area,
and correlation length of snow measured by high-resolution penetrometry,
J. Geophys. Res.-Ea. Surf., 120, 346–362,
<a href="https://doi.org/10.1002/2014JF003266" target="_blank">https://doi.org/10.1002/2014JF003266</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Rinke et al.(2021)Rinke, Cassano, Cassano, Jaiser, and
Handorf</label><mixed-citation>
      
Rinke, A., Cassano, J. J., Cassano, E. N., Jaiser, R., and Handorf, D.:
Meteorological conditions during the MOSAiC expedition: Normal or
anomalous?, Elementa: Science of the Anthropocene, 9, 00023,
<a href="https://doi.org/10.1525/elementa.2021.00023" target="_blank">https://doi.org/10.1525/elementa.2021.00023</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Russell and Norvig(2021)</label><mixed-citation>
      
Russell, S. J. and Norvig, P.: Artificial intelligence: a modern approach,
Pearson Series in Artificial Intelligence, Pearson, Hoboken, 4th edn.,
1136 pp., ISBN 978-0-13-461099-3, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Sandells et al.(2023)Sandells, Rutter, Wivell, Essery, Fox, Harlow,
Picard, Roy, Royer, and Toose</label><mixed-citation>
      
Sandells, M., Rutter, N., Wivell, K., Essery, R., Fox, S., Harlow, C., Picard, G., Roy, A., Royer, A., and Toose, P.: Simulation of Arctic snow microwave emission in surface-sensitive atmosphere channels, EGUsphere [preprint], <a href="https://doi.org/10.5194/egusphere-2023-696" target="_blank">https://doi.org/10.5194/egusphere-2023-696</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Satyawali et al.(2009)Satyawali, Schneebeli, Pielmeier, Stucki, and
Singh</label><mixed-citation>
      
Satyawali, P., Schneebeli, M., Pielmeier, C., Stucki, T., and Singh, A.:
Preliminary characterization of Alpine snow using SnowMicroPen, Cold Reg.
Sci. Technol., 55, 311–320,
<a href="https://doi.org/10.1016/j.coldregions.2008.09.003" target="_blank">https://doi.org/10.1016/j.coldregions.2008.09.003</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Schneebeli et al.(1999)Schneebeli, Pielmeier, and
Johnson</label><mixed-citation>
      
Schneebeli, M., Pielmeier, C., and Johnson, J. B.: Measuring snow
microstructure and hardness using a high resolution penetrometer, Cold
Reg. Sci. Technol., 30, 101–114,
<a href="https://doi.org/10.1016/S0165-232X(99)00030-0" target="_blank">https://doi.org/10.1016/S0165-232X(99)00030-0</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Schölkopf and Smola(2002)</label><mixed-citation>
      
Schölkopf, B. and Smola, A. J.: Learning with kernels:
support vector machines, regularization, optimization, and beyond, MIT Press, 626 pp.,
ISBN 0-262-19475-9,
2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Schuster and Paliwal(1997)</label><mixed-citation>
      
Schuster, M. and Paliwal, K. K.: Bidirectional recurrent neural networks, IEEE
Transactions on Signal Processing, 45, 2673–2681, <a href="https://doi.org/10.1109/78.650093" target="_blank">https://doi.org/10.1109/78.650093</a>,
1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Soni and Mathai(2015)</label><mixed-citation>
      
Soni, R. and Mathai, K. J.: Improved Twitter Sentiment Prediction through
Cluster-then-Predict Model, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1509.02437" target="_blank">https://doi.org/10.48550/arXiv.1509.02437</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Steger et al.(2013)Steger, Kotlarski, Jonas, and
Schär</label><mixed-citation>
      
Steger, C., Kotlarski, S., Jonas, T., and Schär, C.: Alpine snow cover in a
changing climate: a regional climate model perspective, Clim. Dynam., 41,
735–754, <a href="https://doi.org/10.1007/s00382-012-1545-3" target="_blank">https://doi.org/10.1007/s00382-012-1545-3</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Stone(1974)</label><mixed-citation>
      
Stone, M.: Cross-validatory choice and assessment of statistical predictions,
J. R. Stat. Soc. B, 36,
111–133, <a href="https://doi.org/10.1111/j.2517-6161.1974.tb00994.x" target="_blank">https://doi.org/10.1111/j.2517-6161.1974.tb00994.x</a>, 1974.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Sturm and Liston(2021)</label><mixed-citation>
      
Sturm, M. and Liston, G. E.: Revisiting the Global Seasonal Snow
Classification: An Updated Dataset for Earth System
Applications, J. Hydrometeorol., 22, 2917–2938,
<a href="https://doi.org/10.1175/JHM-D-21-0070.1" target="_blank">https://doi.org/10.1175/JHM-D-21-0070.1</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Sturm and Massom(2017)</label><mixed-citation>
      
Sturm, M. and Massom, R. A.: Snow in the sea ice system: friend or foe?, in:
Sea Ice, John Wiley &amp; Sons, Ltd, chap. 3, pp. 65–109,
<a href="https://doi.org/10.1002/9781118778371.ch3" target="_blank">https://doi.org/10.1002/9781118778371.ch3</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Theodorou et al.(2014)Theodorou, Mporas, and
Fakotakis</label><mixed-citation>
      
Theodorou, T., Mporas, I., and Fakotakis, N.: An Overview of Automatic Audio
Segmentation, International Journal of Information Technology and Computer
Science, 6, 1–9, <a href="https://doi.org/10.5815/ijitcs.2014.11.01" target="_blank">https://doi.org/10.5815/ijitcs.2014.11.01</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Trivedi et al.(2015)Trivedi, Pardos, and
Heffernan</label><mixed-citation>
      
Trivedi, S., Pardos, Z. A., and Heffernan, N. T.: The Utility of Clustering
in Prediction Tasks, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1509.06163" target="_blank">https://doi.org/10.48550/arXiv.1509.06163</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Vionnet et al.(2012)Vionnet, Brun, Morin, Boone, Faroux, Le Moigne,
Martin, and Willemet</label><mixed-citation>
      
Vionnet, V., Brun, E., Morin, S., Boone, A., Faroux, S., Le Moigne, P., Martin, E., and Willemet, J.-M.: The detailed snowpack scheme Crocus and its implementation in SURFEX v7.2, Geosci. Model Dev., 5, 773–791, <a href="https://doi.org/10.5194/gmd-5-773-2012" target="_blank">https://doi.org/10.5194/gmd-5-773-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Wever et al.(2022)Wever, Keenan, Kausch, and
Lehning</label><mixed-citation>
      
Wever, N., Keenan, E., Kausch, T., and Lehning, M.: SnowMicroPen measurements
and manual snowpits from Dronning Maud Land, East Antarctica, EnviDat [data set],
<a href="https://doi.org/10.16904/envidat.331" target="_blank">https://doi.org/10.16904/envidat.331</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Wu et al.(2008)Wu, Kumar, Quinlan, Ghosh, Yang, Motoda, McLachlan,
Ng, Liu, Yu et al.</label><mixed-citation>
      
Wu, X., Kumar, V., Quinlan, J. R., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G. J., Ng, A., Liu, B., Yu, P. S., Zhou, Z.-H., Steinbach, M., Hand, D. J., and Steinberg, D.: Top 10 algorithms in data
mining, Knowl. Inf. Syst., 14, 1–37,
<a href="https://doi.org/10.1007/s10115-007-0114-2" target="_blank">https://doi.org/10.1007/s10115-007-0114-2</a>, 2008.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Yarowsky(1995)</label><mixed-citation>
      
Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised
methods, in: 33rd Annual Meeting of the Association for Computational
Linguistics, pp. 189–196, <a href="https://doi.org/10.3115/981658.981684" target="_blank">https://doi.org/10.3115/981658.981684</a>, 1995.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Bengio et al.(2006)Bengio, Delalleau, and Le Roux</label><mixed-citation>
      
Bengio, Y., Delalleau, O., and Le Roux, N.: Label Propagation and
Quadratic Criterion, in: Semi-Supervised Learning, The MIT Press,
<a href="https://doi.org/10.7551/mitpress/9780262033589.003.0011" target="_blank">https://doi.org/10.7551/mitpress/9780262033589.003.0011</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Zhou et al.(2003)Zhou, Bousquet, Lal, Weston, and
Schölkopf</label><mixed-citation>
      
Zhou, D., Bousquet, O., Lal, T., Weston, J., and Schölkopf, B.: Learning with
Local and Global Consistency, in: Advances in Neural Information
Processing Systems, edited by: Thrun, S., Saul, L., and Schölkopf, B.,
vol. 16, pp. 321–328, MIT Press,
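<a href="https://proceedings.neurips.cc/paper_files/paper/2003/file/87682805257e619d49b8e0dfdc14affa-Paper.pdf" target="_blank">https://proceedings.neurips.cc/paper_files/paper/2003/file/87682805257e619d49b8e0dfdc14affa-Paper.pdf</a> (last access: 3 August 2023),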
2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Zhu and Ghahramani(2002)</label><mixed-citation>
      
Zhu, X. and Ghahramani, Z.: Learning from labeled and unlabeled data with label
propagation, Tech. rep., Carnegie Mellon University,
<a href="https://mlg.eng.cam.ac.uk/zoubin/papers/CMU-CALD-02-107.pdf" target="_blank">https://mlg.eng.cam.ac.uk/zoubin/papers/CMU-CALD-02-107.pdf</a> (last access: 3 August 2023),
2002.

    </mixed-citation></ref-html>--></article>
