<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article"><?xmltex \bartext{Model description paper}?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-16-6479-2023</article-id><title-group><article-title>pyESDv1.0.1: an open-source Python framework for empirical-statistical downscaling of climate information</article-title><alt-title>pyESDv1.0.1</alt-title>
      </title-group><?xmltex \runningtitle{pyESDv1.0.1}?><?xmltex \runningauthor{D.~Boateng and S.~G.~Mutz}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Boateng</surname><given-names>Daniel</given-names></name>
          <email>daniel.boateng@uni-tuebingen.de</email>
        <ext-link>https://orcid.org/0000-0002-3423-9947</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Mutz</surname><given-names>Sebastian G.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-8180-6150</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Geosciences, University of Tübingen, Tübingen, Germany</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>School of Geographical and Earth Sciences, University of Glasgow, Scotland, UK </institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Daniel Boateng (daniel.boateng@uni-tuebingen.de)</corresp></author-notes><pub-date><day>14</day><month>November</month><year>2023</year></pub-date>
      
      <volume>16</volume>
      <issue>22</issue>
      <fpage>6479</fpage><lpage>6514</lpage>
      <history>
        <date date-type="received"><day>26</day><month>March</month><year>2023</year></date>
           <date date-type="rev-request"><day>5</day><month>April</month><year>2023</year></date>
           <date date-type="rev-recd"><day>20</day><month>September</month><year>2023</year></date>
           <date date-type="accepted"><day>21</day><month>September</month><year>2023</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 Daniel Boateng</copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023.html">This article is available from https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e96">The nature and severity of climate change impacts vary significantly from region to region. Consequently, high-resolution climate information is needed for meaningful impact assessments and the design of mitigation strategies. This demand has led to an increase in the application of empirical-statistical downscaling (ESD) models to general circulation model (GCM) simulations of future climate. In contrast to dynamical downscaling, the perfect prognosis ESD (PP-ESD) approach has several benefits, including low computation costs, the prevention of the propagation of GCM-specific errors, and high compatibility with different GCMs. Despite their advantages, the use of ESD models and the resulting data products is hampered by (1) the lack of accessible and user-friendly downscaling software packages that implement the entire downscaling cycle, (2) difficulties reproducing existing data products and assessing their credibility, and (3) difficulties reconciling different ESD-based predictions for the same region. We address these issues with a new open-source Python PP-ESD modeling framework called pyESD. pyESD implements the entire downscaling cycle, i.e., routines for data preparation, predictor selection and construction, model selection and training, evaluation, utility tools for relevant statistical tests, visualization, and more. The package includes a collection of well-established machine learning algorithms and allows the user to choose a variety of estimators, cross-validation schemes, objective function measures, and hyperparameter optimization in relatively few lines of code. The package is well-documented, highly modular, and flexible. It allows quick and reproducible downscaling of any climate information, such as precipitation, temperature, wind speed, or even short-term glacier length and mass changes. We demonstrate the use and effectiveness of the new PP-ESD framework by generating weather-station-based downscaling products for precipitation and temperature in complex mountainous terrain in southwestern Germany. The application example covers all important steps of the downscaling cycle and different levels of experimental complexity. All scripts and datasets used in the case study are publicly available to (1) ensure the reproducibility and replicability of the modeled results and (2) simplify learning to use the software package.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Deutsche Forschungsgemeinschaft</funding-source>
<award-id>MU4188/3-1 and MU4188/1-1</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e108">The impacts of anthropogenic climate change are far-reaching and spatially heterogeneous. Consequently, regional- and local-scale predictions of 21st century climate evolution are needed to help guide the design of adaptation measures, vulnerability assessments, and resilience strategies (Field and Barros, 2014; Weaver et al., 2013). General circulation models (GCMs) are well-established tools for simulating climate trends in response to different anthropogenic and natural forcings, such as atmospheric CO<inline-formula><mml:math id="M1" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> concentrations, land cover, and orbital changes. They are process-driven models based on our understanding of atmospheric physics. They are commonly used to predict future trends of climate change by prescribing predicted future forcings described by the Representative Concentration Pathways (RCPs). RCPs are greenhouse gas concentration scenarios that quantify the radiative forcing of plausible demographic and technological developments, as well as anthropogenic activities (Meinshausen et al., 2011; Pachauri et al., 2014). While GCMs can produce useful estimates of<?pagebreak page6480?> many climate system elements on the global and synoptic scale (such as circulation patterns), mesoscale atmospheric processes, clouds, and specific climate variables like precipitation are still relatively poorly represented (e.g., Steppeler et al., 2003). Moreover, GCM simulations are affected by systematic biases on the local and regional scale due to their coarse resolutions and model parameterization (e.g., Errico et al., 2001). These can lead to inaccurate predictions on the spatial scales that are relevant for regional climate change impact assessments, such as studies investigating the impacts on the hydrological cycle (Boé et al., 2009), mountain glaciers (Mutz et al., 2016; Mutz and Aschauer, 2022), air quality (e.g., Colette et al., 2012), and agriculture (e.g., Shahhosseini et al., 2020). Therefore, GCM-based predictions are downscaled by performing dynamical downscaling or statistical downscaling, with empirical-statistical downscaling (ESD) being one type of  statistical downscaling (Murphy, 2000; Schmidli et al., 2007; Wilby and Dawson, 2013).</p>
      <p id="d1e120">Dynamical downscaling involves the nesting of regional climate models (RCMs) into coarse-resolution GCM simulations to produce higher-resolution regional estimates. While RCMs allow an easy exploration of physical processes leading to the predicted climate, they are computationally costly. Furthermore, slight changes in the model domain and boundary conditions require the repetition of the whole process, thereby limiting their application in many climate impact studies (e.g., Giorgi and Mearns, 1991; Xu et al., 2019). ESD is computationally less costly and implicitly considers local conditions, such as topography and vegetation, without the need to parameterize them explicitly. It is widely used for climate change impact studies and relies on establishing empirical transfer functions to relate large-scale atmospheric variables (predictors) to a local-scale observation (predictand). ESD models can be directly coupled to GCMs (e.g., Mutz et al., 2021) or RCMs (e.g., Sunyer et al., 2015; Laflamme et al., 2016; Jakob Themeßl et al., 2011) in a one-way coupling or pipeline with no feedback into the climate models. ESD can be broadly categorized into perfect prognosis (PP) and model output statistics (MOS) approaches (Maraun and Widmann, 2018; Marzban et al., 2006). MOS uses simulated predictors from the GCM or RCM to find the transfer function and generate a predictand time series with bias corrections (e.g., Sachindra et al., 2014; Wilby et al., 1998). Therefore, the MOS-ESD transfer functions are specific to a particular GCM or RCM and not easily transferable to other models. In contrast, the PP-ESD approach is GCM- and RCM-agnostic: ESD models are obtained from observational data for both the predictand and predictors and can therefore be coupled to any GCM or RCM (e.g., Hertig et al., 2019; Mutz et al., 2021; Ramon et al., 2021; Tatli et al., 2004). Therefore, this paper, and the software package presented in it, focuses primarily on the PP-ESD approach.</p>
      <p id="d1e123">The PP-ESD modeling framework consists of four critical steps to establish and evaluate the empirical transfer functions that constitute an ESD model (e.g., Maraun et al., 2010; Maraun and Widmann, 2018): (1) the first step involves the selection and construction of predictors. The selection of the most informative and relevant predictors generally increases the performance and robustness of  PP-ESD models. Preliminary predictor selection should be guided by knowledge of the atmospheric dynamics that govern a specific regional climate. This selection may be refined using statistical dependency measures such as correlation analysis (e.g., Wilby et al., 2002; Wilby and Wigley, 2002), regularization regression (e.g., Hammami et al., 2012), stepwise multi-linear regression (e.g., Mutz et al., 2021), and decision tree selection (e.g., Nourani et al., 2019). The selected predictors should be able to explain most of the predictand's variability and must be represented well by the GCMs (Maraun and Widmann, 2018; Wilby et al., 2004). (2) The second step involves the selection of the learning algorithms (i.e., the learning model used for training the ESD model). These range from classical regressions and analog models, including parametric and nonparametric models (Gutiérrez et al., 2013; Zorita and Storch, 1999; Lorenz, 1969), to advanced machine learning (ML) algorithms (e.g., Sachindra et al., 2018; Xu et al., 2020). The various techniques vary in complexity, scalability, interpretability, and underlying assumptions. For example, classical regressions and analog models allow better interpretations of the simulated results and are usually simpler to implement. On the other hand, several ML algorithms have the ability to capture more complex links between predictors and predictands and do not require an explicit assumption of the distribution of observational data during the optimization process (Jordan and Mitchell, 2015; Raissi and Karniadakis, 2018). The choice of the optimal PP-ESD training technique depends on the predictand variable (e.g., precipitation and temperature), length of the observational records, spatiotemporal variability, spatial coherence, regional setting, and temporal stationarity of the transfer functions. (3) The third step involves the actual training and validation of the PP-ESD models, and (4) the final step is the PP-ESD model evaluation.</p>
      <p id="d1e126">The high demand for climate change information on the regional and local scale has led to the widespread use of ESD methods and an overwhelming body of research to sort through in order to select the most suitable technique for a specific problem. In the past, generalized linear models (GLMs) (e.g., Fealy and Sweeney, 2007), regularization models (e.g., Li et al., 2020), Bayesian regression models (Das et al., 2014; e.g., Zhang and Yan, 2015), support vector machines (SVMs) (e.g., Chen et al., 2010; Ghosh and Mujumdar, 2008), artificial neural networks (ANNs) (e.g., Sachindra et al., 2018; Vu et al., 2016; Xu et al., 2020), homogeneous (e.g., random forest) and heterogeneous (e.g., stacking) ensemble learning models (e.g., Massaoudi et al., 2021; Pang et al., 2017; Zhang et al., 2021), and others have been used to construct PP-ESD models and downscale climate information. However, there is no universal protocol to<?pagebreak page6481?> help choose a robust model for a specific region and climate variable (Gutiérrez et al., 2019), thus making the selection of the most suitable learning algorithm challenging. Moreover, the recent increase in ML algorithms and platforms (e.g., programming languages and software) exacerbates the problem by creating an even wider range of PP-ESD techniques without well-defined protocols. These have shifted the focus toward the establishment of standardized user-friendly tools that would resolve most of the issues related to the development of PP-ESD models. Such tools exist in various forms and tackle a certain aspect of the inherent ESD modeling complexities to ensure fast and efficient climate-impact-related studies. For example, the R-package <italic>esd</italic>, developed and maintained by the Norwegian Meteorological Institute (MET Norway), comprises many utility functions for data retrieval, manipulation and visualization, commonly used statistical tools, and implementations of GLM and regression techniques for generating ESD models (Benestad et al., 2015b). Moreover, an interactive web-based downscaling tool developed as part of the EU-funded ENSEMBLES project (van der Linden and Mitchell, 2009) provides an end-to-end framework through data access, computing resources, and ESD model alternatives (Gutiérrez et al., 2012). The decision support tool <italic>sdsm</italic> (Wilby et al., 2002) provides auxiliary downscaling routines like predictor screening, regression, model evaluation, and visualization for near-surface weather variables on a daily scale. Most recently, the climate analysis tool Climate4R has been extended with statistical downscaling functionalities (downscaleR) that provide a wide range of MOS and PP techniques (Bedia et al., 2020). While these tools provide specialist solutions, there is no single tool or modeling framework that provides a wide range of contemporary (and commonly used) algorithms and implements all downscaling steps (i.e., predictor selection and construction, learning algorithm selection, training and validation of ESD models, GCM–ESD model coupling, model evaluation, visualization, and relevant statistical tools). Moreover, there is no user-friendly ESD tool written in a widely used programming language like Python, which would remove barriers for the use of ESD techniques in research and teaching. Many of the Python-based tools currently available are primarily designed for bias correction in MOS downscaling, and extending these tools to the PP-ESD framework would diversify the publicly available downscaling tools (e.g., xclim, Bourgault et al., 2023; ibicus, Spuler et al., 2023; CCdowncaling, Polasky et al., 2023). A complete, user-friendly, robust, and efficient open-source downscaling framework would contribute significantly to climate change impact assessment studies by (a) empowering researchers through accessible software and easy switches between alternative methods, (b) allowing for efficient updating of predictions in a consistent modeling framework, (c) increasing the transparency and reproducibility of results, and (d) removing barriers in teaching in order to familiarize future generations of researchers with the ESD approach.</p>
      <p id="d1e136">Here, we introduce a new PP-ESD framework that addresses the gaps highlighted above. It is the thoroughly tested, heavily documented, efficient, and user-friendly open-source Python Empirical-Statistical Downscaling (pyESD) package. pyESD adopts an object-oriented programming (OOP) style and treats the predictand data archives (e.g., the weather station) as objects with many functionalities and attributes relevant to ESD modeling. It is flexible with regards to the training dataset and predictand variable. For example, pyESD's predecessors were successfully applied for the prediction of local temperatures (Mutz et al., 2021) and glacier mass balance (Mutz and Aschauer, 2022) in South America. Here, we additionally demonstrate its capabilities in downscaling precipitation in complex terrain in southwestern Germany. pyESD comprises a collection of utilities and methods for data preparation, predictor selection, data transformation, predictor construction, model selection and training, evaluation, statistical testing, and visualization. Unlike  existing packages, pyESD also includes common machine learning algorithms (i.e., different estimators, cross-validation schemes, objective function measures, hyperparameter optimizers, etc.) that can be experimented with in a few lines of code.</p>
      <p id="d1e139">In the first part of this paper (Sect. 2), we provide detailed descriptions of the model structure and the theoretical background for the implemented methods. In the second part (Sect. 3), we demonstrate the package's functionalities with an illustrative case study for a hydrological subcatchment in mountainous terrain in southwestern Germany. Here, we walk the reader through a typical downscaling process with pyESD. More specifically, we generate station-based downscaling products for precipitation and temperature changes in response to different RCPs. When discussing downscaling-related tasks, we list the corresponding pyESD routines as italicized function names. We only use publicly available data for a set of weather stations to ensure the reproducibility and replicability of the results (see Sect. 3). Moreover, all the scripts used for the case study are provided and can be easily adapted to suit the researcher's focus. We discuss the application example in Sect. 4 and conclude with a summary and important remarks in Sect. 5.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Model structure</title>
      <p id="d1e150">The PP-ESD downscaling cycle involves technical and laborious steps that must be carefully addressed to ensure the robustness and accuracy of local-scale climate predictions. The pyESD package implements all these steps in an efficient modeling pipeline for an easier workflow. In this section, we describe this workflow (Fig. 1) along with the main features of the package.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e155">The main features and workflow of PP-ESD implemented in the pyESD package (highlighted by the dashed red box). The weather station and reanalysis datasets are used to select the robust predictors for model training and validation. The trained PP-ESD model is then coupled to GCM simulations forced with different scenarios to predict the local-scale future estimates that can be used for climate change impact assessment (not included in the pyESD package).</p></caption>
        <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f01.png"/>

      </fig>

<?xmltex \hack{\newpage}?>
<?pagebreak page6482?><sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Data structure and preprocessing</title>
      <p id="d1e174">PP-ESD modeling requires (1) predictand data from weather stations or other observational systems, (2) reanalysis datasets for the construction of predictors, and (3) GCM or RCM output for the construction of simulated predictors if the PP-ESD models are used for downscaling simulated climates. To understand the workflow demonstrated in later sections, the reader needs to be aware of few important package design choices related to data structure and preprocessing. <list list-type="bullet"><list-item>
      <p id="d1e179">The package adopts the OOP paradigm and treats every predictand data archive (e.g., weather station or glacier) as an object. Since the current version of the package focuses only on station-based downscaling, we will henceforth describe it only as the weather station object. The package accepts the (typical for weather stations) comma-separated value (CSV) file format. These files contain the predictand time series, such as a temperature record, as well as weather station attributes like the weather station's name, ID, and location. The <italic>read_station_csv</italic> from the <italic>pyESD.weatherstation</italic> module initiates each weather station as a separate object using the <italic>StationOperator</italic> that features all the other functionalities. The weather station object is associated with at least one predictand dataset (i.e., the values of at least one climate variable recorded at that particular station). Furthermore, the initialized object includes all attributes and methods required for the complete downscaling cycle. For instance, the package adopts the <italic>fit</italic> and <italic>predict</italic> framework of the scikit-learn Python package (Pedregosa et al., 2011) that can be directly applied to the weather station object.</p></list-item><list-item>
      <p id="d1e198">The data needed for predictor construction are read from files in the network Common Data Form (netCDF) format with the Xarray toolkit (Hoyer and Hamman, 2017). Due to the size of these datasets and the computations required to construct the predictors, the memory demand can be very high, and repeating this step every time a new model is trained or applied becomes computationally very costly. This problem is circumvented by storing the constructed predictors for each weather station in pickle files. At the next runtime, these can quickly be read (or unpacked) to reduce the<?pagebreak page6483?> computational costs and facilitate faster experimentation with the package.</p></list-item><list-item>
      <p id="d1e202">Since reanalysis datasets, climate model output, and weather station data are provided by different data centers and have varied structures and attributes, it is well outside the scope of our project to write and include a unified data processing function for all. Instead, the preprocessing functions of the current version of pyESD are written for state-of-the-art, representative, and publicly available datasets. More specifically, they work with weather station data from the German Weather Service (Deutscher Wetterdienst, DWD) and the ERA5 reanalysis product (Hersbach et al., 2020). These preprocessing functions are provided as part of the package utilities (<italic>pyESD.data_preprocess_utils</italic>) and can easily be adapted to work for researchers' preferred datasets. The functions will be expanded in the future to allow experimentation with other popular datasets and assess the sensitivity of ESD model performance to the choice of reanalysis datasets (e.g., Brands et al., 2012).</p></list-item></list></p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Predictor selection and construction</title>
      <p id="d1e216">The PP-ESD approach is highly sensitive to the choice of predictors and learning models (Maraun et al., 2019a; Gutiérrez et al., 2019). Moreover, since PP-ESD models are empirical in nature, the predictors serve as proxies for all the relevant physical processes and must be informative enough to account for the local predictand variability (Huth, 1999, 2004; Maraun and Widmann, 2018). Therefore, the selection of potential predictors should be informed by our knowledge of the atmospheric dynamics that control the climate variability of the study area. For example, synoptic-scale climate features, such as atmospheric teleconnection patterns, control much of the regional-scale climate variability. It is therefore recommended to consider these as potential predictors. Statistical techniques, such as methods for feature selection or dimension reduction, may then be applied to reduce the list of physically relevant potential predictors to a smaller selection of predictors that have a robust statistical relationship with the predictand. These steps contribute to the performance of the models and also resolve some of the issues related to multicollinearity and overfitting (e.g., Mutz et al., 2016). The pyESD package adopts three different wrapper feature selection techniques that can be explored for different models: (1) recursive feature elimination (Chen and Jeong, 2007), (2) tree-based feature selection (Zhou et al., 2021), and (3) sequential feature selection (Ferri et al., 1994). The methods are included in <italic>pyESD.feature_selection</italic> as <italic>RecursiveFeatureElimination</italic>, <italic>TreeBasedSelection</italic>, and <italic>SequetialFeatureSelection</italic>, respectively. Furthermore, classical filter feature selection techniques, such as correlation analyses, are also included as a method of the weather station object.</p>
      <p id="d1e231">Predictors are typically constructed by (1) computing the regional means of a physically relevant climate variable or (2) constructing index time series for relevant synoptic-scale climate phenomena. The package allows the user to consider a few important aspects for each type of predictor. <list list-type="order"><list-item>
      <p id="d1e236">The area over which the climate variable is averaged can significantly affect model performance. In complex terrain with high-frequency topography, for example, choosing a smaller spatial extent may result in the predictor having a higher explanatory power. Therefore, a radius (with a default value of 200 km) around the weather station may be defined by the user to determine the size of the area used for the computation of the regional means.</p></list-item><list-item>
      <p id="d1e240">Empirical orthogonal function (EOF) analysis is a well-established tool for capturing atmospheric teleconnection patterns and reducing high-dimensional climate datasets to index time series that represent the variability of prominent modes of synoptic-scale climate phenomena (Storch and von Zwiers, 2002). The current version of pyESD includes functions for the extraction of EOF-based index time series for dominant extratropical teleconnection patterns in the Northern Hemisphere (<italic>pyESD.teleconnections</italic>). More specifically, it allows the computation of index values for the North Atlantic Oscillation (NAO) as well as the East Atlantic (EA), Scandinavian (SCAN), and East Atlantic–Western Russian (EAWR) oscillation patterns (e.g., Boateng et al., 2022). It will be expanded to consider Southern Hemisphere patterns in future versions.</p></list-item></list> After the selection and construction of predictors, their raw values can be transformed before model training. For instance,  <italic>MonthlyStandardizer</italic> implemented in <italic>pyESD.standardizer</italic> can be used to remove the seasonal trends in each predictor by centering and scaling the data. Such transformation can reduce biases toward high-variance predictors, ensure generalization, and improve the representation of predictors constructed from GCM output (e.g., Bedia et al., 2020; Benestad et al., 2015a). Principal component analysis (PCA) is another transformation tool included in the package (<italic>pyESD.standardizer.PCAScaling</italic>). It can be applied to (a) reduce the raw predictor values to information that is relevant to the predictand and (b) prevent multicollinearity-related problems during model training (e.g., Mutz et al., 2016).</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Learning models</title>
      <p id="d1e264">The empirical relationship between local predictand and large-scale predictors is often complicated due to the complex dynamics in the climate system. However, ML algorithms have been demonstrated to perform well in extracting hidden patterns in climate data that are relevant for<?pagebreak page6484?> building more complex transfer functions (e.g., Raissi and Karniadakis, 2018). Specifically, neural networks have been explored for downscaling climate information due to their ability to establish a complex and nonlinear relationship between predictands and predictors (e.g., Nourani et al., 2019; Gardner and Dorling, 1998; Vu et al., 2016). Moreover, support vector machine (SVM) models have been used to capture the links between predictors and predictands by mapping the low-dimensional data into a high-dimensional feature space with the use of kernel functions (e.g., Anandhi et al., 2008; Tripathi et al., 2006). Previous studies have also applied multi-model ensembles due to their ability to reduce model variance and capture the distribution of the training data (e.g., Xu et al., 2020; Massaoudi et al., 2021; Gu et al., 2022).</p>
      <p id="d1e267">Selecting the most appropriate model or algorithm for a specific location or predictand can be challenging because one needs to consider many case-specific factors like data dimensionality, distribution, temporal resolution, and explainability. This problem is exacerbated by the lack of well-established frameworks for climate information downscaling (Gutiérrez et al., 2019). The pyESD package addresses this challenge with the implementation of many ML models that are different with regard to their theoretical paradigms, assumptions, and model structure. The implementation of commonly used models in the same package allows researchers to experiment with different learning models and to replicate and update their research based on emerging recommendations for specific predictands and geographical locations. The implementation of statistical and ML models in pyESD mainly relies on the open-source scientific framework scikit-learn tool (Pedregosa et al., 2011). In the following subsections, we briefly explain the principles behind the ML methods that are included in the pyESD package.</p>
<sec id="Ch1.S2.SS3.SSS1">
  <label>2.3.1</label><title>Regularization regressors</title>
      <p id="d1e277">Regularization models are penalized regression techniques that shrink the coefficients of uninformative predictors to improve model accuracy and prediction interpretability (Hastie et al., 2001; Tibshirani, 1996; Gareth et al., 2013). The coefficients of non-robust predictors are set to zero by minimizing the absolute values of regression coefficients or minimizing the sum of squares of the coefficients. The former is referred to as L1 regularization and adopted by the least absolute shrinkage and selection operator (LASSO) method. The latter is referred to as L2 regularization and adopted by the ridge regression method. The regularization term (<inline-formula><mml:math id="M2" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>) and the updated cost function for a linear equation of <inline-formula><mml:math id="M3" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> independent variables or predictors, <inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, are defined as
              <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M5" display="block"><mml:mrow><mml:mi>R</mml:mi><mml:mfenced close=")" open="("><mml:mi mathvariant="italic">β</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:munderover><mml:mfenced open="|" close="|"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula>
            for L1 regularization and<?xmltex \hack{\newpage}?>
              <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M6" display="block"><mml:mrow><mml:mi>R</mml:mi><mml:mfenced close=")" open="("><mml:mi mathvariant="italic">β</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:munderover><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></disp-formula>
            for L2 regularization. Therefore, the updated cost function is defined as
              <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M7" display="block"><mml:mrow><mml:mtext>cost</mml:mtext><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:munderover><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>R</mml:mi><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">β</mml:mi></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M8" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> is the tuning parameter that controls the severity of the penalty defined in Eqs. (1) and (2), and <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the coefficients. The package features implementations of the LASSO and ridge regression using a cross-validation (CV) scheme with random bootstrapping to iteratively optimize <inline-formula><mml:math id="M10" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula>. These are included as LassoCV and RidgeCV, respectively. The optimization of the cost function in Eq. (3) is usually based on the coordinate descent algorithm to fit the coefficients (Wu and Lange, 2008). The pyESD package also includes an implementation of LassoCV that uses a less greedy version of the optimizer (LassoLarsCV). It is computationally more efficient by using the least angle regression (Efron et al., 2004) for fitting the coefficients.</p>
</sec>
<sec id="Ch1.S2.SS3.SSS2">
  <label>2.3.2</label><title>Bayesian regression</title>
      <p id="d1e475">Bayesian regression employs a type of conditional modeling to obtain the posterior probability (<inline-formula><mml:math id="M11" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>) of the target variable (<inline-formula><mml:math id="M12" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>), given a combination of predictor variables (<inline-formula><mml:math id="M13" display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula>), regression coefficients (<inline-formula><mml:math id="M14" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula>), and random variables (<inline-formula><mml:math id="M15" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>) estimated from the data (Bishop and Nasrabadi, 2006; Neal, 2012). In its simplest form, the normal linear model, the predictand <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (given the predictors <inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), follows a Gaussian distribution <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Therefore, to estimate the full probabilistic model, <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is assumed to be normally distributed around <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mi>w</mml:mi></mml:mrow></mml:math></inline-formula>:
              <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M21" display="block"><mml:mrow><mml:mi>p</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="normal">|</mml:mi><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>w</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mi>N</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="normal">|</mml:mi><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mi>w</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
            This approach also permits the use of regularizers in the optimization process. The Bayesian ridge regression procedure (BayesianRidge) estimates the regression coefficients from a spherical Gaussian and L2 regularization (Eq. 2). The regularizer parameters (<inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">λ</mml:mi></mml:mrow></mml:math></inline-formula>) are estimated by maximizing the log marginal likelihood under a Gaussian prior over <inline-formula><mml:math id="M23" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula> with a precision of <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (Tipping, 2001; MacKay, 1992):
              <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M25" display="block"><mml:mrow><mml:mi>p</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>w</mml:mi><mml:mi mathvariant="normal">|</mml:mi><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mi>N</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>w</mml:mi><mml:mi mathvariant="normal">|</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mi mathvariant="normal">|</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
            This means that the parameters (<inline-formula><mml:math id="M26" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M27" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M28" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula> in Eqs. 4 and 5) are estimated jointly in the calibration process. Automatic relevance determination regression (ARD) is an alternative model included in the package. It differs from BayesianRidge in estimating sparse regression coefficients and using centered elliptic Gaussian priors over the coefficients <inline-formula><mml:math id="M29" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula> (Wipf and Nagarajan, 2007; Tipping, 2001). Previous studies have<?pagebreak page6485?> used sparse Bayesian learning (relevance vector machine – RVM) for downscaling climate information (e.g., Das et al., 2014; Ghosh and Mujumdar, 2008).</p>
</sec>
<sec id="Ch1.S2.SS3.SSS3">
  <label>2.3.3</label><title>Artificial neural network</title>
      <p id="d1e754">The multilayer perceptron (MLP) is a classical example of a feed-forward ANN, meaning that the flow of data through the neural network is unidirectional without recurrent connections between the layers (Gardner and Dorling, 1998; Pal and Mitra, 1992). MLP is a supervised learning algorithm that consists of three layers (i.e., an input, hidden, and output layer) connected by transformation coefficients (weights) using nonlinear activation such as the hyperbolic function. More specifically, the learning algorithm with one hidden layer for the training sets <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:msup><mml:mi>R</mml:mi><mml:mi>n</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, can be defined as
              <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M35" display="block"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:mi>X</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>W</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mi>T</mml:mi></mml:msubsup><mml:mi>X</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M36" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula> is the activation function, and <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are the model biases added to the hidden and output layer. The weights connecting the layers are optimized with the backpropagation algorithm (Hecht-Nielsen, 1992; Rumelhart et al., 1986) with a mean squared error loss function. Moreover, the L2 regularization (Eq. 2) method is applied to avoid overfitting by shrinking the weights with higher magnitudes. Therefore, the optimized squared error loss function is defined as
              <disp-formula id="Ch1.E7" content-type="numbered"><label>7</label><mml:math id="M39" display="block"><mml:mrow><mml:mtext>Loss</mml:mtext><mml:mfenced close=")" open="("><mml:mrow><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msubsup><mml:mfenced open="∥" close="∥"><mml:mrow><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi>y</mml:mi></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi mathvariant="italic">α</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msubsup><mml:mfenced close="∥" open="∥"><mml:mi>W</mml:mi></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where  <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi mathvariant="italic">α</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msubsup><mml:mfenced close="∥" open="∥"><mml:mi>W</mml:mi></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> is the L2 penalty that shrinks the model complexity. Often, the derivative of the loss function with respect to the weights is determined until the residual error of the model is satisfactory. The stochastic gradient descent algorithm (Bottou, 1991; Kingma and Ba, 2014) is used as a solver for updating the weights (defined in Eq. 6) in a maximum number of iterations until a satisfactory loss (Eq. 7) is achieved. Moreover, the choice of the parameters, such as the size of hidden layers, activation function, and learning algorithm, is relevant to the performance of the model (Diaz et al., 2017). The exhaustive search algorithm with CV bootstrapping is a simple and efficient method for parameter optimization (Pontes et al., 2016) and therefore included in the pyESD package (GridSearchCV).</p>
</sec>
<sec id="Ch1.S2.SS3.SSS4">
  <label>2.3.4</label><title>Support vector machine</title>
      <p id="d1e1026">Support vector regression (SVR) uses the principles of SVM as a regression technique. The learning algorithms are based on Vapnik–Chervonenkis (VC) theory and empirical risk minimization that is designed to solve linear and nonlinear problems. This is achieved by applying kernel functions to map low-dimensional data to higher- or even infinite-dimensional feature space (Vapnik, 2000; Cristianini and Shawe-Taylor, 2000). In principle, the model creates a hyperplane in a vector space containing groups of data points. This hyperplane is a linear classifier that maximizes the group margins. Given finite predictor and predictand data points <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:msup><mml:mi>R</mml:mi><mml:mi>n</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:math></inline-formula>, the regressor can be defined as
              <disp-formula id="Ch1.E8" content-type="numbered"><label>8</label><mml:math id="M46" display="block"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">w</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:msup><mml:mi>w</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mfenced open="(" close=")"><mml:mi>X</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where the support vectors <inline-formula><mml:math id="M47" display="inline"><mml:mi mathvariant="bold-italic">w</mml:mi></mml:math></inline-formula> and model bias <inline-formula><mml:math id="M48" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> are the optimal parameters that minimize the cost function in Eqs. (9):
              <disp-formula id="Ch1.E9" content-type="numbered"><label>9</label><mml:math id="M49" display="block"><mml:mrow><mml:mtext>cost</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mi>w</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi>w</mml:mi><mml:mo>+</mml:mo><mml:mi>C</mml:mi><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            subject to
              <disp-formula id="Ch1.Ex1"><mml:math id="M50" display="block"><mml:mrow><mml:mfenced open="{" close="}"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>w</mml:mi></mml:mrow></mml:mfenced><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>w</mml:mi></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">…</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula> are the slack variables (the upper and lower training errors) subject to the error tolerance of <inline-formula><mml:math id="M54" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> that prevents overfitting. <inline-formula><mml:math id="M55" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula> represents a regularization term that determines the balance between minimal loss and maximal margins. The cost function in Eq. (9) is solved using Lagrange's formula (Balasundaram and Tanveer, 2013) to obtain the optimized function:
              <disp-formula id="Ch1.E10" content-type="numbered"><label>10</label><mml:math id="M56" display="block"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:mi>X</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">α</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">α</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are Lagrange multipliers, and <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> is the kernel function which implicitly maps the training vectors in Eq. (8) into a higher-dimensional space. The SVR method of the pyESD package includes linear, polynomial, sigmoid, and Gaussian radial basis function (RBF) kernels (Hofmann et al., 2008). Moreover, the degree of regularization (<inline-formula><mml:math id="M60" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula>) and the coefficient of the kernels (<inline-formula><mml:math id="M61" display="inline"><mml:mi mathvariant="italic">γ</mml:mi></mml:math></inline-formula>) is given a range of values so that the hyperparameter optimization algorithm can determine the best model. Due to the expensive nature of SVR, the package uses a randomized search algorithm in a CV setting for the hyperparameter optimization (Bergstra and Bengio, 2012). However, hyperparameters optimization algorithms, such as Bayesian and grid search (Snoek et al., 2012; Pontes et al., 2016; Bergstra et al., 2011) methods, are also provided as alternatives. Previous downscaling projects have taken advantage of the SVR method due to its ability to map data into higher-dimensional space and exclude outliers from the training process (Ghosh and Mujumdar, 2008; Chen et al., 2010; Sachindra et al., 2018; Anandhi et al., 2008; Tripathi et al., 2006).</p>
</sec>
<sec id="Ch1.S2.SS3.SSS5">
  <label>2.3.5</label><title>Ensemble machine learning</title>
      <p id="d1e1502">Each ML technique is associated with challenges that arise from the method's limitations and underlying assumptions.<?pagebreak page6486?> These have to be considered carefully in the evaluation of the resulting downscaling product. Some of these challenges can be overcome by an integration of different ML models for a specific task (Dietterich, 2000; Zhang and Ma, 2012). Integrated ML models have been suggested to outperform single ML models in downscaling climate information (e.g., Liu et al., 2015). Ensemble models typically use different ML algorithms (base learners) to extract information from the training data, then use a second set of ML algorithms (meta-learners) that learn from the first and combine the individual predictions into an ensemble. Ensemble models can be categorized by (a) the selection of base learners and (b) the method of combining the individual predictions from the base learners. Here, we summarize the more prominent ensemble models that are included in the pyESD package.</p>
</sec>
<sec id="Ch1.S2.SS3.SSSx1" specific-use="unnumbered">
  <title>Bagging</title>
      <p id="d1e1511">Bagging ensemble models consist of ML algorithms that generate several instances of base learners using random subsets of the training data and then aggregate the information for the final estimates (Breiman, 1996a; Quinlan, 1996). Such algorithms integrate randomization into the learning process and thereby often ensure the reduction of the variance of the individual base learners (e.g., decision trees). Moreover,  bagging techniques constitute a simple way to improve model performance without the need to adapt the underlying base algorithm. Since bagging works well with complex algorithms like decision trees, we also consider tree-based ensembles for the pyESD package. More specifically, we include implementations of the random forest (RandomForest) and extremely randomized tree (ExtraTree) methods in addition to classical bagging.</p>
      <p id="d1e1514">The RandomForest algorithm builds multiple independent tree-based learners. The trees can be constructed with the full set of predictors or a random subset. Each tree is constructed from a random sample of the training data in a bootstrapping process (Breiman, 2001). The algorithm uses the remaining training data (i.e., out-of-bag data) to estimate the error rate and evaluate the model's robustness. In contrast, the ExtraTree algorithm considers the discriminative thresholds from each predictor rather than the subset of predictors (Geurts et al., 2006). This usually adds more weight to the variance reduction and slightly improves the model bias. Tree-based ensembles are particularly suitable for establishing a nonlinear relationship between predictors and predictands (e.g., Pang et al., 2017; He et al., 2016).</p>
</sec>
<sec id="Ch1.S2.SS3.SSSx2" specific-use="unnumbered">
  <title>Boosting</title>
      <p id="d1e1523">In recent years, boosting models have also been applied for the downscaling of climate information (e.g., Fan et al., 2021; Zhang et al., 2021). Boosting models are meta-estimators that are built sequentially from multiple base learners with the primary objective of reducing the model bias and variance. In principle, the method “boosts”  weaker base learners (i.e., estimators that perform only slightly better than random guessing) by converting them into strong ones in an iterative process. The technique assumes that the base learning model is distribution-free (Schapire, 1999) and iteratively improves the weaker base learners by applying weights to the training data through the adjustment of the input points with prediction errors from the previous prediction (Schapire, 2003; Schapire and Freund, 2013). There are many boosting algorithms due to the many possible methods of weighting the training data and tuning the weaker base learners. In the pyESD package, we include (1) adaptive boosting (Adaboost), (2) gradient tree boosting (GradientBoost) with  a gradient boosting algorithm by Friedman (2001), and (3) extreme gradient boosting (XGBoost). A brief summary of each is provided below. <list list-type="order"><list-item>
      <p id="d1e1528">The Adaboost algorithm is a well-established model for improving the accuracy of weak base learners (Freund and Schapire, 1997). The model is adaptive in the sense that the training data are sequentially adjusted based on the previous performance of the weaker model. The model uses a weighted majority vote (or sum) to combine the individual prediction from the weaker learners and produce a robust final prediction. The implemented version uses a decision tree algorithm as the base estimator to develop the boosted ensemble predictions.</p></list-item><list-item>
      <p id="d1e1532">The GradientBoost algorithm considers the boosting process to be a numerical optimization problem that minimizes a loss function in a stage-wise additive model by adding weaker learners using a gradient descent procedure. This generalization allows the tuning of an arbitrary differentiable loss function which can be selected based on a specific problem. Specifically, in pyESD, squared errors are used in the minimization of the loss function.</p></list-item><list-item>
      <p id="d1e1536">XGBoost, a recent extension of the GradientBoost algorithm, is designed to reduce computational time and improve model performance (Chen and Guestrin, 2016). The model uses regularization terms to penalize the final weights and prevent overfitting. The algorithm also uses shrinkage and column subsampling techniques to avoid overfitting. Moreover, the model can handle sparse data by using a sparsity-aware split function.</p></list-item></list></p>
</sec>
<sec id="Ch1.S2.SS3.SSSx3" specific-use="unnumbered">
  <title>Stacked generalization</title>
      <p id="d1e1545">The stacked generalization method (or “stacking”) has previously been used for the downscaling  climate information and has shown improved prediction robustness over singular models (e.g., Massaoudi et al., 2021; Gu et al., 2022). It is designed to enhance prediction accuracy and generality by taking advantage of the mutual complementarity of the base-model predictions. The approach was introduced by<?pagebreak page6487?> Wolpert (1992) and demonstrated for regression tasks and unsupervised learning by Breiman (1996b) and Leblanc and Tibshirani (1996), respectively. In principle, the following process is implemented: in the first step, the training data and base models, referred to as level-0 data and level-0 models by Wolpert (1992), are used to generate the first set of predictions. Then a meta-learning model (level-1 generalizer) is used to optimally combine the previous predictions (level-1 data) into final estimates. Lastly, the method applies a cross-validation technique and generates new “stacked” datasets for a final learning step. Generally, the performance of stacked generalization is constrained by the attributes used to generate the level-1 data and the type of algorithm used for higher-level learning (Ting and Witten, 1999). We consider these limitations by providing a wide range of models that can be used as the level-0 models and the level-l generalizer. The base learners can be selected from the different ML models presented in the previous sections. The reader is advised that previous studies (e.g., Reid and Grudic, 2009) suggest the use of a more restrictive model like LassoCV and ExtraTree as the meta-learner to prevent overfitting.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Model training</title>
      <p id="d1e1557">The process of training and testing the PP-ESD models is the most critical stage in the downscaling procedure, since it determines much of the robustness of the final models, as well as the accuracy of the predictions they generate. The process typically involves the following steps: (1) the observational records are separated into training and testing datasets. (2) The training datasets are used to establish the transfer functions that make up the PP-ESD models. (3) The trained models are then evaluated on the independent testing datasets (Sect. 2.5). In the model training process, hyperparameter optimization techniques (e.g., GridSearchCV) are used to fine-tune the transfer function parameters, such as regression coefficients, to optimize model performance. Cross-validation (CV) techniques are applied to split the whole training dataset into smaller training and validation data sections and allow the assessment and iterative improvement of the model parameters during training while also preventing overfitting (Moore, 2001; Santos et al., 2018). In this category of techniques, the <inline-formula><mml:math id="M62" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-fold framework is the most used for climate information downscaling models. It partitions the training data into <inline-formula><mml:math id="M63" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> equally sized and mutually exclusive subsamples, which are also referred to as folds (Stone, 1976; Markatou et al., 2005). More specifically, for each iteration step, one fold is used for model validation, and the remaining <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> folds are used for model training. The leave-one-out CV technique (Lachenbruch and Mickey, 1968) is an alternative and has been used for the development of ESD models (e.g., Gutiérrez et al., 2013). Cross-validation techniques rely on the fundamental assumption of independent and identically distributed (i.i.d) data. They, therefore, treat the data as a result of a generative process that has no memory of previously generated samples (Arlot and Celisse, 2010). The assumption of i.i.d might not be valid for time series data (e.g., Bergmeir and Benítez, 2012) due to seasonal effects, for example. To circumvent this problem, monthly bootstrapped resampling and time series splitters are included in the pyESD package. The <italic>pyESD.splitter</italic> module contains all CV frameworks available for model training, including the <inline-formula><mml:math id="M65" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-fold, leave-one-out, and other CV schemes. The validation metrics used for optimizing the model parameters include the coefficient of determination (<inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) (Eq. 11), root mean squared error (RMSE) (Eq. 13), mean absolute error (MAE) (Eq. 14), and others that are summarized in Sect. 2.5. The final values for the validation metrics, which reflect the model performance during training, are arithmetic means of the individual values for each iteration. In this paper, we refer to them as CV performance metrics (i.e., CV <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, CV RMSE, and CV MAE).</p>
</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Model evaluation</title>
      <p id="d1e1627">In the process of downscaling climate information, best practice involves the use of stringent model evaluation schemes with independent data outside the training data range (Wilby et al., 2004). Retaining a section of the data as a testing dataset (Sect. 2.4) is recommended if longer records (e.g., <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> years) are available. It allows (a) a completely independent evaluation of the trained model's performance and (b) an assessment of the sensitivity of the model to the chosen training dataset. In the case of time series, the latter can provide insights into the model's sensitivity to the calibration period and the temporal stationarity of the model's transfer functions. If the records are short (e.g., <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> years), the CV metrics (Sect. 2.4) can be used, albeit with caveats, as nonideal estimates for the model's performance (e.g., Mutz et al., 2021). For the remainder of this section, however, we will assume that longer records and completely independent testing datasets are available.</p>
      <p id="d1e1650">The PP-ESD model is evaluated on the basis of the model's predictions <inline-formula><mml:math id="M70" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> and the observed values <inline-formula><mml:math id="M71" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>. In pyESD, the following performance metrics are implemented. <list list-type="order"><list-item>
      <?pagebreak page6488?><p id="d1e1672">The coefficient of determination (<inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) represents the fraction of the predictand's observed variance that can be explained by the predictors. It can be seen as a measure of how well the model predicts the unseen data (Wilks, 2011). The <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> for the predicted values <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in relation to the observed data <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M77" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> samples is defined as<disp-formula id="Ch1.E11" content-type="numbered"><label>11</label><mml:math id="M78" display="block"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>where <inline-formula><mml:math id="M79" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula> is the mean of the observed data, <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/></mml:mrow></mml:math></inline-formula>is the sum of squared residuals (SSR), and <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is the total sum of squares (SST). <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> can range from <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow></mml:math></inline-formula> to 1, where 1 is the best possible score and negative values are indicative of an arbitrary, worse model. An <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> value of 0 is indicative of a model that would always predict the <inline-formula><mml:math id="M85" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula>. In this case, the model represents no improvement over simply using the mean <inline-formula><mml:math id="M86" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula> as a model.</p>
      <p id="d1e1974">Pearson's correlation coefficient (PCC) evaluates the linear correlation between the model predictions <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and observed data <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The PCC of 1 indicates a perfect positive correlation, <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> indicates a perfect anticorrelation, and 0 indicates no correlation between the predicted and observed values. The PCC for <inline-formula><mml:math id="M90" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> samples is defined as<disp-formula id="Ch1.E12" content-type="numbered"><label>12</label><mml:math id="M91" display="block"><mml:mrow><mml:msub><mml:mtext>PCC</mml:mtext><mml:mrow><mml:mi>x</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt><mml:msqrt><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>where the <inline-formula><mml:math id="M92" display="inline"><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula> and <inline-formula><mml:math id="M93" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula> are the means of the <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values, respectively.</p>
      <p id="d1e2183">The root mean squared error (RMSE) estimates the mean magnitude of error between the predictions and observations. The RMSE is given in the physical units of the observed data and not standardized. Smaller values indicate better model performance. The RMSE for predictions <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and observations <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of <inline-formula><mml:math id="M98" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> samples is calculated as<disp-formula id="Ch1.E13" content-type="numbered"><label>13</label><mml:math id="M99" display="block"><mml:mrow><mml:mtext>RMSE</mml:mtext><mml:mfenced close=")" open="("><mml:mrow><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>The mean absolute error (MAE) is a scale-dependent accuracy measure that also provides information on the errors between the predictions and observations. The MAE is estimated as the sum of absolute errors normalized by the sample size (<inline-formula><mml:math id="M100" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>). The MAE is calculated as<disp-formula id="Ch1.E14" content-type="numbered"><label>14</label><mml:math id="M101" display="block"><mml:mrow><mml:mtext>MAE</mml:mtext><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mfenced open="|" close="|"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p></list-item></list> Additional metrics such as the mean squared error (MSE), mean absolute percentage error (MAPE), maximum error, adjusted <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (Miles, 2014), and Nash–Sutcliffe efficiency (NSE) (Nash and Sutcliffe, 1970) are included in pyESD. However, the predicted values from the trained model and their corresponding observed values can be evaluated using other metrics not included in pyESD. For example, additional metrics like the model skill score <inline-formula><mml:math id="M103" display="inline"><mml:mi>E</mml:mi></mml:math></inline-formula> and the revised <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (RRS), which combines correlation, bias measure, and the capacity to capture variability, can be used (Onyutha, 2021). We highlight that the limitations and assumptions underpinning these metrics should be considered when interpreting performance metrics. For example, the RMSE is sensitive to outliers because the squaring of errors assigns more weight to large errors. This implies that a single outlier can bias its estimate and lead to a misinterpretation of extreme data points in the predictand. Although MAE is less sensitive to outliers compared to RMSE, its treatment of all errors with equal weight may not adequately account for the impact of extreme errors on model performance. Consequently, both metrics should be interpreted with respect to the mean of the observed values. On the other hand, the Pearson correlation coefficient (PCC) assumes a linear relationship between the predicted and observed values and a bivariate normal distribution. However, distance correlation (Székely et al., 2007), which is more computationally demanding and makes no assumptions about the relationship or distribution, can be considered. Chaudhuri and Hu (2019) demonstrated a fast algorithm that can be used to compute the distance correlation.</p>
</sec>
<sec id="Ch1.S2.SS6">
  <label>2.6</label><title>GCM–ESD coupling and local-scale predictions</title>
      <p id="d1e2385">The developed and tested PP-ESD model can finally be coupled to coarse-scale climate information. If the PP-ESD model was developed with the intention to downscale predictions of future climate change, the next logical step is to couple it to GCM simulations forced with different greenhouse gas concentration scenarios. Since PP-ESD is the bias-free downscaling alternative to MOS-ESD, PP-ESD models may be coupled to all GCMs, provided that the predictors are adequately represented by the GCMs. This condition may be alleviated to an extent by standardizing the simulated predictor (Bedia et al., 2020). An analysis of the distribution similarity between the observed and simulated predictors can be conducted to test the assumption of representation. For example, the Kolmogorov–Smirnov (KS) test, which is implemented as part of the pyESD package utilities, is a nonparametric statistical hypothesis test that can be used to evaluate the null hypothesis (<inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) that the observation-based predictors and simulated predictors are of the same theoretical distribution.</p>
      <p id="d1e2399">The first step in ESD–GCM coupling is to utilize the GCM output to recreate the predictors used in the training of the ESD model. This may involve anything from constructing simple temperature regional means to reconstructing multivariate indices for more complex climate phenomena. In the case of index-based predictors such as NAO, EA, SCAN, and others, the simulated indices are reconstructed by projecting the pressure anomalies of the GCM onto the EOF loading patterns of the predictors (e.g., Mutz et al., 2016). This ensures that the physical meaning of the index values is maintained. The ESD model then takes these simulated predictors as input and generates local-scale predictions according to the model's transfer functions. The added value of the resulting downscaling product can be evaluated by comparing<?pagebreak page6489?> the downscaled values to the raw outputs of different GCMs and RCMs. Finally, the high-resolution local-scale predictions can be used to drive climate change impact assessment models to predict flood frequency (e.g., Padulano et al., 2021; Hodgkins et al., 2017), agricultural changes (e.g., Mearns et al., 1996), changes in water resources (e.g., Dau et al., 2021), and more.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Illustrative case study: Neckar catchment</title>
      <p id="d1e2411">We demonstrate the complete downscaling workflow and highlight most of the functionalities of the pyESD package in an illustrative case study. The study uses the PP-ESD approach and is set in the Neckar catchment, a hydrological catchment in southwestern Germany that consists of complex mountainous terrain with topographic elevations between 200 and 1000 m above sea level (Fig. 2). The region is climatically complex, since local climates are influenced by atmospheric teleconnection patterns (e.g., NAO, EA, and SCAND), orographic effects (e.g., Kunstmann et al., 2004), and the Mediterranean climate (Bárdossy, 2010; Ludwig et al., 2003). The catchment experiences maximum precipitation (80–120 mm per month) and temperature (15–18 <inline-formula><mml:math id="M106" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) in the summer months (Fig. 3). The catchment serves as a water supply for drinking and agricultural activities (Selle et al., 2013). We use this catchment for our case study because (a) it is a suitable region to test the strengths and limitations of the pyESD downscaling package, and (b) generating 21st century climate change estimates can contribute to regional climate impact assessments and adaptation.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e2425">Weather station locations and elevations in the Neckar catchment. The red circles represent temperature stations (ID corresponds to Table 1b), and the black circles represent precipitation stations (ID corresponds to Table 1a). The color map shows the elevation and delineates the extent of the catchment.</p></caption>
        <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f02.png"/>

      </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e2436">Long-term (1958–2020) monthly means of <bold>(a)</bold> precipitation and <bold>(b)</bold> temperature, averaged over all stations in the catchment. The error bars are the standard deviations that represent inter-station variability. The maximum precipitation and temperature in the catchment are recorded in the summer season (JJA).</p></caption>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f03.png"/>

      </fig>

      <p id="d1e2452">In this case study, we apply pyESD to predict local temperature and precipitation changes for 22 weather stations located in the catchment (Table 1) and demonstrate the package's flexibility by performing experiments with the different modeling alternatives. We show most of the PP-ESD steps required for generating robust downscaling products. These steps include (1) predictor selection and construction; (2) model selection, training, and cross-validation; (3) model evaluation; and (4) generating future predictions through ESD–GCM coupling (see Sect. 3.2 for details). We note that the focus of the case study is more on demonstrating the pyESD workflow and functionality and less on detailed discussions of the downscaled results and their implications. In order to allow readers to reproduce and learn from this application example, we only use public and freely available datasets (see Sect. 3.1 for more details about the data). Moreover, all scripts used in this study (i.e., data preprocessing, modeling, and visualization scripts) are provided in the code and data availability section.</p>

<?xmltex \floatpos{p}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e2458">IDs (specific to this study), names, coordinates, and elevation (m) for weather stations recording <bold>(a)</bold> precipitation and <bold>(b)</bold> temperature.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><bold>(a)</bold> ID</oasis:entry>
         <oasis:entry colname="col2">Name</oasis:entry>
         <oasis:entry colname="col3">Longitude</oasis:entry>
         <oasis:entry colname="col4">Latitude</oasis:entry>
         <oasis:entry colname="col5">Elevation</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Baltmannsweiler–Hohengehren</oasis:entry>
         <oasis:entry colname="col3">9.45</oasis:entry>
         <oasis:entry colname="col4">48.76</oasis:entry>
         <oasis:entry colname="col5">457</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Boll Bad</oasis:entry>
         <oasis:entry colname="col3">9.62</oasis:entry>
         <oasis:entry colname="col4">48.64</oasis:entry>
         <oasis:entry colname="col5">423</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Eschbronn–Mariazell</oasis:entry>
         <oasis:entry colname="col3">8.47</oasis:entry>
         <oasis:entry colname="col4">48.19</oasis:entry>
         <oasis:entry colname="col5">716</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Fellbach</oasis:entry>
         <oasis:entry colname="col3">9.27</oasis:entry>
         <oasis:entry colname="col4">48.81</oasis:entry>
         <oasis:entry colname="col5">280</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Goeppingen–Jebenhausen</oasis:entry>
         <oasis:entry colname="col3">9.63</oasis:entry>
         <oasis:entry colname="col4">48.69</oasis:entry>
         <oasis:entry colname="col5">368</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Haigerloch–Weildorf</oasis:entry>
         <oasis:entry colname="col3">8.77</oasis:entry>
         <oasis:entry colname="col4">48.37</oasis:entry>
         <oasis:entry colname="col5">524</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Hechingen</oasis:entry>
         <oasis:entry colname="col3">8.98</oasis:entry>
         <oasis:entry colname="col4">48.38</oasis:entry>
         <oasis:entry colname="col5">518</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Heubach Ostalb</oasis:entry>
         <oasis:entry colname="col3">9.94</oasis:entry>
         <oasis:entry colname="col4">48.80</oasis:entry>
         <oasis:entry colname="col5">450</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Horb–Betra</oasis:entry>
         <oasis:entry colname="col3">8.66</oasis:entry>
         <oasis:entry colname="col4">48.41</oasis:entry>
         <oasis:entry colname="col5">544</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">Klippeneck</oasis:entry>
         <oasis:entry colname="col3">8.75</oasis:entry>
         <oasis:entry colname="col4">48.11</oasis:entry>
         <oasis:entry colname="col5">973</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2">Lorch Kreis Ostalb–Waldhausen</oasis:entry>
         <oasis:entry colname="col3">9.64</oasis:entry>
         <oasis:entry colname="col4">48.78</oasis:entry>
         <oasis:entry colname="col5">296</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2">Metzingen</oasis:entry>
         <oasis:entry colname="col3">9.27</oasis:entry>
         <oasis:entry colname="col4">48.54</oasis:entry>
         <oasis:entry colname="col5">354</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2">Oberndorf Neckar</oasis:entry>
         <oasis:entry colname="col3">8.58</oasis:entry>
         <oasis:entry colname="col4">48.29</oasis:entry>
         <oasis:entry colname="col5">516</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">Rosenfeld–Bickelsberg</oasis:entry>
         <oasis:entry colname="col3">8.69</oasis:entry>
         <oasis:entry colname="col4">48.29</oasis:entry>
         <oasis:entry colname="col5">676</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15</oasis:entry>
         <oasis:entry colname="col2">Stoetten</oasis:entry>
         <oasis:entry colname="col3">9.86</oasis:entry>
         <oasis:entry colname="col4">48.67</oasis:entry>
         <oasis:entry colname="col5">734</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">16</oasis:entry>
         <oasis:entry colname="col2">Stuttgart–Echterdingen</oasis:entry>
         <oasis:entry colname="col3">9.22</oasis:entry>
         <oasis:entry colname="col4">48.69</oasis:entry>
         <oasis:entry colname="col5">371</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">17</oasis:entry>
         <oasis:entry colname="col2">Stuttgart (Schnarrenberg)</oasis:entry>
         <oasis:entry colname="col3">9.20</oasis:entry>
         <oasis:entry colname="col4">48.83</oasis:entry>
         <oasis:entry colname="col5">314</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">18</oasis:entry>
         <oasis:entry colname="col2">Winterbach Rems–Murr–Kreis</oasis:entry>
         <oasis:entry colname="col3">9.47</oasis:entry>
         <oasis:entry colname="col4">48.80</oasis:entry>
         <oasis:entry colname="col5">240</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><bold>(b)</bold> ID</oasis:entry>
         <oasis:entry colname="col2">Name</oasis:entry>
         <oasis:entry colname="col3">Longitude</oasis:entry>
         <oasis:entry colname="col4">Latitude</oasis:entry>
         <oasis:entry colname="col5">Elevation</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Hechingen</oasis:entry>
         <oasis:entry colname="col3">8.98</oasis:entry>
         <oasis:entry colname="col4">48.38</oasis:entry>
         <oasis:entry colname="col5">518</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Klippeneck</oasis:entry>
         <oasis:entry colname="col3">8.75</oasis:entry>
         <oasis:entry colname="col4">48.11</oasis:entry>
         <oasis:entry colname="col5">973</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Lenningen–Schopfloch</oasis:entry>
         <oasis:entry colname="col3">9.53</oasis:entry>
         <oasis:entry colname="col4">48.54</oasis:entry>
         <oasis:entry colname="col5">758</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Murrhardt</oasis:entry>
         <oasis:entry colname="col3">9.57</oasis:entry>
         <oasis:entry colname="col4">48.97</oasis:entry>
         <oasis:entry colname="col5">344</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Rottweil</oasis:entry>
         <oasis:entry colname="col3">8.64</oasis:entry>
         <oasis:entry colname="col4">48.18</oasis:entry>
         <oasis:entry colname="col5">588</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Schwaebisch Gmuend–Strassdorf</oasis:entry>
         <oasis:entry colname="col3">9.80</oasis:entry>
         <oasis:entry colname="col4">48.78</oasis:entry>
         <oasis:entry colname="col5">415</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Stoetten</oasis:entry>
         <oasis:entry colname="col3">9.86</oasis:entry>
         <oasis:entry colname="col4">48.67</oasis:entry>
         <oasis:entry colname="col5">734</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Stuttgart–Echterdingen</oasis:entry>
         <oasis:entry colname="col3">9.22</oasis:entry>
         <oasis:entry colname="col4">48.69</oasis:entry>
         <oasis:entry colname="col5">371</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Stuttgart (Schnarrenberg)</oasis:entry>
         <oasis:entry colname="col3">9.20</oasis:entry>
         <oasis:entry colname="col4">48.83</oasis:entry>
         <oasis:entry colname="col5">314</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{1}?></table-wrap>

<?xmltex \hack{\newpage}?>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Datasets</title>
<sec id="Ch1.S3.SS1.SSS1">
  <label>3.1.1</label><title>Weather station data</title>
      <p id="d1e3035">Monthly precipitation and temperature station data from the German Weather Service (Deutscher Wetterdienst, DWD accessible from <uri>https://cdc.dwd.de/portal/</uri>, last access: 30 October 2023) served as the predictand time series in this study. We considered all weather station records that (a) originated from measurements in the Quelle–Enz subcatchment, (b) covered the time period of 1958 to 2020, and (c) were at least 30 years in length. Even though there is no well-established and universally valid recommendation for the minimum record length in a PP-ESD approach (e.g., Hewitson et al., 2014), we chose a conservative 30-year threshold to ensure the models can be evaluated with truly independent, retained data (see Sect. 2.5). The remaining weather stations are summarized in Table 1. These were loaded into predictand station objects (SOs) as follows.</p>
      <p id="d1e3041"><?xmltex \hack{\noindent}?><monospace>1 from pyESD.Weatherstation import read_station_csv</monospace><?xmltex \hack{\newline}?>
<monospace>2 variable = "Temperature" #or 'Precipitation'</monospace><?xmltex \hack{\newline}?>
<monospace>3 SO = read_station_csv(filename, variable)</monospace></p>
</sec>
<sec id="Ch1.S3.SS1.SSS2">
  <label>3.1.2</label><title>Reanalysis datasets</title>
      <p id="d1e3062">The ERA5 reanalysis products, produced and managed by the European Centre for Medium-Range Weather Forecasting (ECMWF), were used to construct the predictors in this study. ERA5 is based on historical records from various observational systems (e.g., oceans buoys, aircraft, weather stations) that are dynamically interpolated with numerical forecasting models in a four-dimensional variational (4D-Var) data assimilation scheme to generate global, homogeneous, spatially gridded datasets (Bell et al., 2021). It has a spatial resolution of approximately 31 km (or TL639) and is available as hourly data, covering 1950 to the present day with a 5 d lag of data availability (Hersbach et al., 2020). For this study, however, mean monthly values were used in the construction of potential predictors (Table 2). These are publicly available from the Copernicus Climate Data Store (CDS) (accessible at <uri>https://cds.climate.copernicus.eu</uri>, last access: 30 October 2023).</p>

<?xmltex \floatpos{p}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e3071">Potential predictors considered for PP-ESD models and the frequency of their selection for <bold>(a)</bold> precipitation and <bold>(b)</bold> temperature stations (based on the final predictor selection method).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Name</oasis:entry>
         <oasis:entry colname="col3">Description</oasis:entry>
         <oasis:entry colname="col4"><bold>(a)</bold></oasis:entry>
         <oasis:entry colname="col5"><bold>(b)</bold></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">t2m</oasis:entry>
         <oasis:entry colname="col3">Near-surface temperature</oasis:entry>
         <oasis:entry colname="col4">8</oasis:entry>
         <oasis:entry colname="col5">8</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">tp</oasis:entry>
         <oasis:entry colname="col3">Total precipitation</oasis:entry>
         <oasis:entry colname="col4">18</oasis:entry>
         <oasis:entry colname="col5">9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">msl</oasis:entry>
         <oasis:entry colname="col3">Mean sea level pressure</oasis:entry>
         <oasis:entry colname="col4">4</oasis:entry>
         <oasis:entry colname="col5">6</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">v10</oasis:entry>
         <oasis:entry colname="col3">Near-surface meridional wind</oasis:entry>
         <oasis:entry colname="col4">7</oasis:entry>
         <oasis:entry colname="col5">7</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">u10</oasis:entry>
         <oasis:entry colname="col3">Near-surface zonal wind</oasis:entry>
         <oasis:entry colname="col4">10</oasis:entry>
         <oasis:entry colname="col5">7</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">NAO</oasis:entry>
         <oasis:entry colname="col3">North Atlantic Oscillation index</oasis:entry>
         <oasis:entry colname="col4">9</oasis:entry>
         <oasis:entry colname="col5">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">EAWR</oasis:entry>
         <oasis:entry colname="col3">East Atlantic–Western Russian oscillation index</oasis:entry>
         <oasis:entry colname="col4">11</oasis:entry>
         <oasis:entry colname="col5">3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">SCAN</oasis:entry>
         <oasis:entry colname="col3">Scandinavian oscillation patterns</oasis:entry>
         <oasis:entry colname="col4">11</oasis:entry>
         <oasis:entry colname="col5">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">EA</oasis:entry>
         <oasis:entry colname="col3">East Atlantic patterns</oasis:entry>
         <oasis:entry colname="col4">10</oasis:entry>
         <oasis:entry colname="col5">4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">v_plev</oasis:entry>
         <oasis:entry colname="col3">Meridional wind at pressure levels 250, 500, 700, 850, and 1000 hPa</oasis:entry>
         <oasis:entry colname="col4">9, 7, 7, 10, 8</oasis:entry>
         <oasis:entry colname="col5">7, 3, 8, 5, 7</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2">u_plev</oasis:entry>
         <oasis:entry colname="col3">Zonal wind at pressure levels 250, 500, 700, 850, and 1000 hPa</oasis:entry>
         <oasis:entry colname="col4">4, 9, 7, 6, 11</oasis:entry>
         <oasis:entry colname="col5">7, 5, 5, 5, 8</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2">r_plev</oasis:entry>
         <oasis:entry colname="col3">Relative humidity at pressure levels 250, 500, 700, 850, and 1000 hPa</oasis:entry>
         <oasis:entry colname="col4">7, 8, 15, 7, 11</oasis:entry>
         <oasis:entry colname="col5">7, 4, 5, 5, 6</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2">z_plev</oasis:entry>
         <oasis:entry colname="col3">Geopotential height at pressure levels 250, 500, 700, 850, and 1000 hPa</oasis:entry>
         <oasis:entry colname="col4">3, 6, 4, 6, 5</oasis:entry>
         <oasis:entry colname="col5">4, 6, 5, 7, 5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">t_plev</oasis:entry>
         <oasis:entry colname="col3">Temperature at pressure levels 250, 500, 700, 850, and 1000 hPa</oasis:entry>
         <oasis:entry colname="col4">10, 9, 7, 7, 6</oasis:entry>
         <oasis:entry colname="col5">5, 5, 6, 8, 9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15</oasis:entry>
         <oasis:entry colname="col2">d2m</oasis:entry>
         <oasis:entry colname="col3">Near-surface dew-point temperature</oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5">5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">16</oasis:entry>
         <oasis:entry colname="col2">dtd</oasis:entry>
         <oasis:entry colname="col3">Dew-point temperature depression at pressure levels 250, 500, 700, 850, and 1000 hPa</oasis:entry>
         <oasis:entry colname="col4">7, 6, 13, 7, 11</oasis:entry>
         <oasis:entry colname="col5">4, 2, 2, 3, 1</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{2}?></table-wrap>

</sec>
<sec id="Ch1.S3.SS1.SSS3">
  <label>3.1.3</label><title>GCM simulation datasets</title>
      <?pagebreak page6491?><p id="d1e3419">For the ESD–GCM coupling, the predictors were reconstructed from an MPI-ESM (Max Planck Institute Earth System Model) GCM simulation that follows the protocols of the World Climate Research Programme's (WCRP) Coupled Model Intercomparison Project phase 5 (CMIP5) (Taylor et al., 2012). We highlight that CMIP5 model output was chosen in this illustrative study to enable consistent comparison with previous regional climate models over the region and any GCM outputs (e.g., CMIP6) can be combined with pyESD. For the case study, we consider several simulations (accessible at <uri>https://cds.climate.copernicus.eu</uri>, last access: 30 October 2023) forced with different RCP scenarios (Moss et al., 2010) to predict the local-scale response to the plausible range of forcings. In order to highlight the added value of the downscaled product, the local-scale future estimates are compared to the coarser predictions of several GCMs (i.e., MPI-ESM, CESM1-CAM5 of the National Center for Atmospheric Research – NCAR, Kay et al., 2015, and HadGE2-ES of the Hadley Centre of the UK Met Office, Collins et al., 2008) and RCMs (CORDEX-Europe simulation with MPI-CSC-REMO2009 driven with boundary conditions from MPI-ESM).</p>
</sec>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Methods</title>
<sec id="Ch1.S3.SS2.SSS1">
  <label>3.2.1</label><title>Predictor selection and construction</title>
      <p id="d1e3441">The considered predictors must be large-scale climate elements that are both physically and empirically relevant to predicting the local-scale climate variability in the vicinity of the weather station. The physical relevance of considered predictors (Table 2) is established through previous studies and general climatological merit. We then apply a monthly standardizer transformer to remove the seasonality trends and scale the individual predictors. The empirical relationship with the predictand is then evaluated with PCCs defined in Eq. (12). Finally, first estimates of their predictive skills are obtained through the application of the package's recursive, sequential, and tree-based algorithms in a CV setting. These preliminary experiments are conducted to refine the selection of predictors further. After the predictor selection process, each weather station and predictand is associated with a particular subset of predictors (Table 2) that are later used to train the final ESD model for the station (Sect. 3.2.2).</p>
      <p id="d1e3444">The steps above are implemented with pyESD as follows. <list list-type="order"><list-item>
      <p id="d1e3449">We create a list (predictors) of all considered predictors with physical relevance to the predictand. We then use the <italic>set_predictors</italic> method of the station object (SO) to read the data in the local directory (<italic>predictordir</italic>) and construct regional means with a defined radius of 200 km around the station location. These are regional means of relevant climate variables and serve as the simplest type of predictor. For the<?pagebreak page6492?> construction of indices for atmospheric teleconnection patterns (i.e., NAO, EA, SCAN, and EAWR), which serve as further predictors, the package automatically calls the <italic>pyESD.teleconnections</italic> module if the pattern's acronym is included in the list of predictors.<?xmltex \hack{\newline}?>
<monospace>1 predictors = ["t2m", "tp", "NAO"</monospace><?xmltex \hack{\newline}?>
<monospace>   ,..., "nth predictor"]</monospace><?xmltex \hack{\newline}?>
<monospace>2 SO.set_predictors(variable,</monospace><?xmltex \hack{\newline}?>
<monospace>   predictors, predictordir,</monospace><?xmltex \hack{\newline}?>
<monospace>   radius=200) # radius in km</monospace></p></list-item><list-item>
      <p id="d1e3483">We apply the monthly standardizer and then use the <italic>predictor_correlation</italic> method to estimate the PCC between the predictand and predictors.<?xmltex \hack{\newline}?>
<monospace>1 SO.set_standardizer(variable,</monospace><?xmltex \hack{\newline}?>
<monospace>     standardizer = MonthlyStandardizer</monospace><?xmltex \hack{\newline}?>
<monospace>     (detrending=True, scaling=True))</monospace><?xmltex \hack{\newline}?>
<monospace>2 df_corr = SO.predictor_correlation</monospace><?xmltex \hack{\newline}?>
<monospace>    (variable, predictor_range,</monospace><?xmltex \hack{\newline}?>
<monospace>ERA5Data, fit_predictors=True,</monospace><?xmltex \hack{\newline}?>
<monospace>fit_predictand=True,</monospace><?xmltex \hack{\newline}?>
<monospace>method="pearson")</monospace></p></list-item><list-item>
      <p id="d1e3524">The final refinement of the predictor list is implemented as part of the <italic>fit</italic> method. We use the <italic>set_model</italic> method to define the ARD regressor, TimeSeriesSplitter CV setting, and call the <italic>fit</italic> method in a loop through the three types of selector methods.<?xmltex \hack{\newline}?>
<monospace>1 SO.set_model(variable, method="ARD",</monospace><?xmltex \hack{\newline}?>
<monospace>cv=TimeSeriesSplit(n_splits=10))</monospace><?xmltex \hack{\newline}?>
<monospace>2 selector_methods = ["Recursive",</monospace><?xmltex \hack{\newline}?>
<monospace>     "TreeBased", "Sequential"]</monospace><?xmltex \hack{\newline}?>
<monospace>3 for selector_method in</monospace><?xmltex \hack{\newline}?>
<monospace>    selector_methods:</monospace><?xmltex \hack{\newline}?>
<monospace>4 SO.fit(variable, predictor_range,</monospace><?xmltex \hack{\newline}?>
<monospace>     ERA5Data, fit_predictors=True,</monospace><?xmltex \hack{\newline}?>
<monospace>predictor_selector=True,</monospace><?xmltex \hack{\newline}?>
<monospace>selector_method =</monospace><?xmltex \hack{\newline}?>
<monospace>selector_method, select_regressor)</monospace></p></list-item></list></p>
</sec>
<sec id="Ch1.S3.SS2.SSS2">
  <label>3.2.2</label><title>Model training and validation</title>
      <p id="d1e3591">Model training and validation are performed separately for each predictand and weather station. The models are trained in a CV setting for the period 1958–2010 and then assessed<?pagebreak page6493?> on independent retained data for the period 2011–2020. In the training process, we use seven different methods before deciding on an estimator for the final model. These methods include at least one representative for each of the families of ML algorithms (see Sect. 2.3) except SVR. We exclude SVR due to its high computational demands for optimization and to ensure the easy reproducibility of the illustrative example on less powerful computers. We perform the initial model training and validation with the LassoLarsCV, ARD, MLP, RandomForest, XGBoost, bagging, and stacking regressors using a <italic>KFold(n_splits</italic><inline-formula><mml:math id="M107" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula><italic>10)</italic> validation scheme for hyperparameter optimization. For the stacking regressor, we use all the other regressors as base estimators (i.e., level-0 learners) and ExtraTree as the meta-learner. The final ESD model is then selected based on the CV metrics (i.e., CV <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and CV RMSE) of the individual models.</p>
      <p id="d1e3616">The steps above are implemented with pyESD as follows: the models are trained with the <italic>fit</italic> method as described within Sect. 3.2.2. The <italic>cross_validate_and_predict</italic> method is applied to calculate the CV metrics and generate the predictions for the training period 1958–2010. The <italic>predict</italic> method is then used to generate predictions for the 2011–2020 period from the models trained in the 1958–2010 period. Finally, the <italic>evaluate</italic> method is used to obtain the model performance metrics based on the 2011–2020 predictions and retained data. The <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, RMSE, and MAE (see Sect. 2.5) are used as both CV and evaluation metrics in this study. The ERA5 reanalysis product is specified as the predictor dataset for all these methods.<?xmltex \hack{\newline}?>
<monospace>1 cv_score_1958to2010, predict_1958to2010 = SO.cross_validate_and_predict(variable, from1958to2010, ERA5Data)</monospace><?xmltex \hack{\newline}?>
<monospace>2 predict_2011to2020 = SO.predict(variable, from2011to2020, ERA5Data)</monospace><?xmltex \hack{\newline}?>
<monospace>3 scores_2011to2020 = SO.evaluate(variable, from2011to2020, ERA5Data)</monospace></p>
</sec>
<sec id="Ch1.S3.SS2.SSS3">
  <label>3.2.3</label><title>Future prediction</title>
      <p id="d1e3663">Future predictions are generated by coupling the final ESD models to GCM simulations for the 21st century. In the illustrative example, we use MPI-ESM simulations that were forced with greenhouse gas concentration scenarios RCP2.6, RCP4.5, and RCP8.5. This coupling is achieved as follows: the predictors selected during model training are reconstructed from the GCM output. These simulated predictors are standardized with the <italic>MonthlyStandardizer</italic> parameters obtained from the reanalysis predictors to ensure data homogenization. Prediction anomalies are calculated using the training period 1958–2010 as a reference. The resulting RCP-specific 21st century prediction anomaly time series are then used to calculate the annual means (2020–2100), as well as the seasonal (DJF, MAM, JJA, SON) and annual 30-year climatologies for the mid-century (2040–2070) and the end of the century (2070–2100). The predicted anomalies are then back-transformed to their respective absolute values for all stations and compared to the raw outputs of GCMs (i.e., CESM1-CAM5, HadGE2-ES, EURO-CORDEX, and MPI-ESM; see Sect. 3.1.3) using the nearest grid point. In pyESD, a future prediction can be generated by using the <italic>predict</italic> method (Sect. 3.2.2) and specifying the GCM output as the predictor data source.</p>
      <p id="d1e3672">The PP-ESD approach relies on the assumption that the predictors are well-represented by the GCM. We therefore perform KS tests to evaluate the distribution similarity between GCM and ERA5 predictors for the datasets' temporal overlap. The KS statistic lies within the 0–1 range, with lower values indicating greater distribution similarity. For our two-sided tests, we reject the null hypothesis (<inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> means the datasets have identical underlying distributions) in the case of <inline-formula><mml:math id="M111" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values being smaller than 0.05. We perform the test on the raw monthly time series, monthly anomalies, and standardized anomalies in order to isolate the distributional differences of the first and second moments error propagation (Bedia et al., 2020). The <italic>KS_stat</italic> function implemented in the <italic>pyESD.utils</italic> module is used to test several of the informative predictors (such as tp, t2m, r850, u850, and v850).</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results and discussion</title>
      <p id="d1e3709">In this section, we present and discuss the results of the illustrative case study. The discussion places more emphasis on the functionalities of the package than the climatological implications. Specifically, we discuss the results of the predictor selection step (Sect. 4.1), the training and validation of the model (Sect. 4.2), the final model performance (Sect. 4.3), and the future predictions generated through the ESD–GCM coupling (Sect. 4.4).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e3714">Cross-validation <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and RMSE for the predictor selection methods (recursive  in red, tree-based in green, and sequential in black) for precipitation <bold>(a, c)</bold> and temperature <bold>(b, d)</bold> station records. The individual methods performed similarly well, suggesting that each of the implemented methods may be used to refine the list of potential predictors.</p></caption>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f04.png"/>

      </fig>

<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Predictor selection</title>
      <p id="d1e3747">All implemented predictor selection methods demonstrated merit, and the correlation analyses revealed strong linear dependencies between the predictand variables and potential predictors (Figs. A1 and A2). For example, precipitation records are highly correlated (PCC <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula>) with large-scale total precipitation (tp), atmospheric relative humidity (<inline-formula><mml:math id="M114" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>), and zonal wind velocity (<inline-formula><mml:math id="M115" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula>) up to the mid-tropospheric level (i.e., 500–1000 hPa) (Fig. A1). The temperature records are highly correlated (PPC <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn></mml:mrow></mml:math></inline-formula>) with near-surface temperature (t2m), atmospheric temperature (<inline-formula><mml:math id="M117" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> on all levels), and dew-point temperature depression (dtd) up to the mid-troposphere (Fig. A2). Both predictands also show a good correlation (PCC <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn></mml:mrow></mml:math></inline-formula>) with the indices of the atmospheric teleconnection patterns (i.e., NAO, EA, EAWR, and SCAN). The predictor selection methods (i.e., recursive, tree-based,<?pagebreak page6494?> and sequential) perform similarly for all the precipitation and temperature stations (Fig. 4). More specifically, the three methods yield CV <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values of 0.5 to 0.75 (Fig. 4a), CV RMSE values of <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">25</mml:mn></mml:mrow></mml:math></inline-formula> mm per month (Fig. 4c) for precipitation, CV <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values of <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.95</mml:mn></mml:mrow></mml:math></inline-formula> (Fig. 4b), and CV RMSE values of 0.3 to 0.6 <inline-formula><mml:math id="M123" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C (Fig. 4d) for temperature stations. Since the methods did not show a significant difference in performance, the recursive method was applied for the refinement of the set of predictors, since it allows more flexibility and a stepwise iteration of several combinations of potential predictors (e.g., Mutz et al., 2021; Hammami et al., 2012; Li et al., 2020). The frequencies with which specific predictors were selected using the recursive method are listed in Table 2.</p>
      <p id="d1e3853">The predictors tp and t2m were included for most of the precipitation and temperature station records, respectively. This indicates that variations in the larger-scale precipitation and temperature fields already explain much of the local-scale predictand variability in the vicinity of the weather stations. Many of the refined predictor sets also included indices of the NAO (9 of 18 precipitation stations, 5 of 9 temperature stations), SCAN (11 of 18 precipitation stations, 5 of 9 temperature stations), EA (10 of 18 precipitation stations, 4 of 9 temperature stations), and EAWR (11 of 18 precipitation stations, 3 of 9 temperature stations). This confirms the strong manifestation of Northern Hemisphere atmospheric teleconnection patterns in the local-scale precipitation and temperature in the catchment (e.g., Bárdossy, 2010; Ludwig et al., 2003). Their exclusion from the other stations is likely due to the fact that their variability might already be captured by zonal and meridional wind speeds and synoptic pressure variables like geopotential height (<inline-formula><mml:math id="M124" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>) and mean sea level pressure (slp) (Hurrell and Van Loon, 1997; Hurrell, 1995; Barnston and Livezey, 1987; Maraun and Widmann, 2018). Relative humidity was selected as a predictor for most of the precipitation stations. This is consistent with the results of many other studies (e.g., Gutiérrez et al., 2019; Hammami et al., 2012) and our physical understanding of it as a measure of humidity that takes saturation vapor pressure into consideration.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e3865">Cross-validation <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and RMSE box plots comparing the experimental regressors' performance for all the precipitation <bold>(a, c)</bold> and temperature <bold>(b, d)</bold> stations. The red lines inside the box represent the median, the lower and upper box boundaries indicate the 25th and 75th percentiles, and the lower and upper error lines show the 10th and 90th percentiles, respectively. The black plus marks show the outliers outside the range of the 10th and 90th percentile.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f05.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e3894">Prediction example for the Hechingen station using the final regressor for precipitation <bold>(a, c)</bold> and temperature <bold>(b, d)</bold>. The top panels <bold>(a, b)</bold> show the linear relationship between the predictions and observed values, as well as the PCC (<inline-formula><mml:math id="M126" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> value) for the testing data (blue-colored circles). The bottom panels <bold>(c, d)</bold> show the 1-year moving average of the observed (green, solid) and ERA5-driven predictions for the training period (blue, dash-dotted) and the testing period (red, dashed).</p></caption>
          <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Performance of individual estimators</title>
      <p id="d1e3931">We experimented with seven different regressors before deciding on the regressor that would be used to establish the final ESD models (see Sect.3.2.2). A total of 126<?pagebreak page6495?> precipitation and 63 temperature experimental models were generated with the seven regressors. Overall, most of the experimental models performed reasonably well with a mean CV <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula> for precipitation and <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula> for temperature stations (Fig. 5). The MLP models, on the other hand, performed relatively poorly with CV <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values of <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula> for precipitation and <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula> for temperature. This is due to the fact that MLP model calibration requires longer records and a more complex architecture to capture most of the informative patterns in the training data. This study, however, uses a simplified architecture to make the results reproducible without higher computational requirements. The result can likely be improved with more data (e.g., by using daily values) and an increase in hidden layers (Sect. 2.2.3). The overall performance of the experimental models underlines the methods' suitability for downscaling.</p>
      <p id="d1e3997">Among the better-performing precipitation models, the LassoLarsCV and ARD methods yielded the best results (CV <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.55</mml:mn></mml:mrow></mml:math></inline-formula>–0.75, CV RSME <inline-formula><mml:math id="M134" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 20–23 mm per month), followed by the RandomForest and bagging ensembles (CV <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.48</mml:mn></mml:mrow></mml:math></inline-formula>–0.70, CV RSME <inline-formula><mml:math id="M136" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 21 to 25 mm per month), as well as the XGBoost ensemble regressor (CV <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.39</mml:mn></mml:mrow></mml:math></inline-formula>–0.65, CV RMSE <inline-formula><mml:math id="M138" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 22–27 mm per month). Stacking all experimental models into a meta-regressor also yields good results (CV <inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.45</mml:mn></mml:mrow></mml:math></inline-formula>–0.7, CV RMSE <inline-formula><mml:math id="M140" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 20–26 mm per month) despite the poor performance of the MLP regressors. Based on these results, the LassoLarsCV, ARD, RandomForest, and bagging regressors were selected as the final base learner for the stacking model. ExtaTree was chosen as the final meta-learner to prevent overfitting issues by placing an additional discriminative threshold on all the base regressor's predictions (Geurts et al., 2006).</p>
      <p id="d1e4089">The experimental temperature models showed similar patterns in performance but performed better overall. LassoLarCV and ARD emerge as the best-performing models (CV <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.85</mml:mn></mml:mrow></mml:math></inline-formula>–0.98, CV RMSE <inline-formula><mml:math id="M142" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.2–0.6 <inline-formula><mml:math id="M143" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), followed by the RandomForest and bagging regressors (CV <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.8</mml:mn></mml:mrow></mml:math></inline-formula>–0.96, CV RMSE <inline-formula><mml:math id="M145" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.3–0.7 <inline-formula><mml:math id="M146" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), as well as the XGBoost and stacking ensemble regressors (CV <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.75</mml:mn></mml:mrow></mml:math></inline-formula>–0.96, CV RMSE <inline-formula><mml:math id="M148" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.3–0.8 <inline-formula><mml:math id="M149" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C). Therefore, we also selected stacking (with LassoLarsCV, ARD, RandomForest, bagging) for the final temperature models, too.</p>
</sec>
<?pagebreak page6496?><sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Performance of the final estimator</title>
      <p id="d1e4194">Following the analysis of the seven experimental models (Sect. 4.2), the recursive predictor selection method and stacking learning model (with LassoLarsCV, ARD, RandomForest, and bagging) were selected for the generation of the final ESD models. The models were trained on the 1958–2010 data in a CV setting and evaluated on the retained data in the 2011–2020 period. <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, RMSE, and MAE were used as performance metrics for the CV setting and the final evaluation (Tables 3 and 4). The models' performance was good overall but varied notably between different stations. The prediction skill estimates were higher for temperature than for precipitation. For temperature (Table 4), the explained variance estimates (“Fit <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>”) are in the range of 0.81–0.98 (<inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.94</mml:mn></mml:mrow></mml:math></inline-formula>), and CV <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values are in the range of 0.84 to 0.98 (<inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.93</mml:mn></mml:mrow></mml:math></inline-formula>), whereas for precipitation (Table 3), the explained variance estimates are in the range of 0.58–0.84 (<inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.71</mml:mn></mml:mrow></mml:math></inline-formula>), and CV <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values are in the range of 0.54–0.72 (0.65). The accuracy measures display a similar discrepancy with CV RMSE of 0.3–0.6 <inline-formula><mml:math id="M157" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C (<inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.42</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M159" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) and CV MAE of 0.2–0.50 <inline-formula><mml:math id="M160" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C (<inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.34</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M162" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) for temperature, as well as CV RMSE of 20–24 mm per month (<inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">21</mml:mn></mml:mrow></mml:math></inline-formula> mm per month) and CV MAE of 14–18 mm per month (<inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">16</mml:mn></mml:mrow></mml:math></inline-formula> mm per month) for precipitation.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3" specific-use="star"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e4366">Model performance metrics (i.e., <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, RMSE, and MAE) for all the precipitation stations. The final ESD models were trained in a CV setting on datasets from 1958–2010 and evaluated on independent, retained data from 2011–2020.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ID</oasis:entry>
         <oasis:entry colname="col2">Name</oasis:entry>
         <oasis:entry colname="col3">(Fit) <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">CV <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">CV RMSE</oasis:entry>
         <oasis:entry colname="col6">CV MAE</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">RMSE</oasis:entry>
         <oasis:entry colname="col9">MAE</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Baltmannsweiler–Hohengehren</oasis:entry>
         <oasis:entry colname="col3">0.71</oasis:entry>
         <oasis:entry colname="col4">0.67</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.63</oasis:entry>
         <oasis:entry colname="col8">22</oasis:entry>
         <oasis:entry colname="col9">18</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Boll Bad</oasis:entry>
         <oasis:entry colname="col3">0.70</oasis:entry>
         <oasis:entry colname="col4">0.69</oasis:entry>
         <oasis:entry colname="col5">21</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.60</oasis:entry>
         <oasis:entry colname="col8">24</oasis:entry>
         <oasis:entry colname="col9">19</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Eschbronn–Mariazell</oasis:entry>
         <oasis:entry colname="col3">0.74</oasis:entry>
         <oasis:entry colname="col4">0.69</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">16</oasis:entry>
         <oasis:entry colname="col7">0.59</oasis:entry>
         <oasis:entry colname="col8">23</oasis:entry>
         <oasis:entry colname="col9">18</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Fellbach</oasis:entry>
         <oasis:entry colname="col3">0.61</oasis:entry>
         <oasis:entry colname="col4">0.57</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.59</oasis:entry>
         <oasis:entry colname="col8">20</oasis:entry>
         <oasis:entry colname="col9">15</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Goeppingen–Jebenhausen</oasis:entry>
         <oasis:entry colname="col3">0.71</oasis:entry>
         <oasis:entry colname="col4">0.68</oasis:entry>
         <oasis:entry colname="col5">21</oasis:entry>
         <oasis:entry colname="col6">16</oasis:entry>
         <oasis:entry colname="col7">0.62</oasis:entry>
         <oasis:entry colname="col8">23</oasis:entry>
         <oasis:entry colname="col9">18</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Haigerloch–Weildorf</oasis:entry>
         <oasis:entry colname="col3">0.64</oasis:entry>
         <oasis:entry colname="col4">0.62</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.74</oasis:entry>
         <oasis:entry colname="col8">17</oasis:entry>
         <oasis:entry colname="col9">13</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Hechingen</oasis:entry>
         <oasis:entry colname="col3">0.63</oasis:entry>
         <oasis:entry colname="col4">0.61</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.74</oasis:entry>
         <oasis:entry colname="col8">17</oasis:entry>
         <oasis:entry colname="col9">13</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Heubach Ostalb</oasis:entry>
         <oasis:entry colname="col3">0.78</oasis:entry>
         <oasis:entry colname="col4">0.65</oasis:entry>
         <oasis:entry colname="col5">24</oasis:entry>
         <oasis:entry colname="col6">18</oasis:entry>
         <oasis:entry colname="col7">0.65</oasis:entry>
         <oasis:entry colname="col8">25</oasis:entry>
         <oasis:entry colname="col9">21</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Horb–Betra</oasis:entry>
         <oasis:entry colname="col3">0.84</oasis:entry>
         <oasis:entry colname="col4">0.72</oasis:entry>
         <oasis:entry colname="col5">21</oasis:entry>
         <oasis:entry colname="col6">16</oasis:entry>
         <oasis:entry colname="col7">0.74</oasis:entry>
         <oasis:entry colname="col8">21</oasis:entry>
         <oasis:entry colname="col9">16</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">Klippeneck</oasis:entry>
         <oasis:entry colname="col3">0.67</oasis:entry>
         <oasis:entry colname="col4">0.63</oasis:entry>
         <oasis:entry colname="col5">21</oasis:entry>
         <oasis:entry colname="col6">16</oasis:entry>
         <oasis:entry colname="col7">0.70</oasis:entry>
         <oasis:entry colname="col8">21</oasis:entry>
         <oasis:entry colname="col9">17</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2">Lorch Kreis Ostalb–Waldhausen</oasis:entry>
         <oasis:entry colname="col3">0.79</oasis:entry>
         <oasis:entry colname="col4">0.72</oasis:entry>
         <oasis:entry colname="col5">21</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.64</oasis:entry>
         <oasis:entry colname="col8">24</oasis:entry>
         <oasis:entry colname="col9">20</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2">Metzingen</oasis:entry>
         <oasis:entry colname="col3">0.79</oasis:entry>
         <oasis:entry colname="col4">0.61</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">16</oasis:entry>
         <oasis:entry colname="col7">0.64</oasis:entry>
         <oasis:entry colname="col8">20</oasis:entry>
         <oasis:entry colname="col9">16</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2">Oberndorf Neckar</oasis:entry>
         <oasis:entry colname="col3">0.75</oasis:entry>
         <oasis:entry colname="col4">0.71</oasis:entry>
         <oasis:entry colname="col5">23</oasis:entry>
         <oasis:entry colname="col6">17</oasis:entry>
         <oasis:entry colname="col7">0.66</oasis:entry>
         <oasis:entry colname="col8">28</oasis:entry>
         <oasis:entry colname="col9">22</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">Rosenfeld–Bickelsberg</oasis:entry>
         <oasis:entry colname="col3">0.70</oasis:entry>
         <oasis:entry colname="col4">0.69</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.70</oasis:entry>
         <oasis:entry colname="col8">21</oasis:entry>
         <oasis:entry colname="col9">16</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15</oasis:entry>
         <oasis:entry colname="col2">Stoetten</oasis:entry>
         <oasis:entry colname="col3">0.75</oasis:entry>
         <oasis:entry colname="col4">0.72</oasis:entry>
         <oasis:entry colname="col5">23</oasis:entry>
         <oasis:entry colname="col6">17</oasis:entry>
         <oasis:entry colname="col7">0.68</oasis:entry>
         <oasis:entry colname="col8">25</oasis:entry>
         <oasis:entry colname="col9">20</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">16</oasis:entry>
         <oasis:entry colname="col2">Stuttgart–Echterdingen</oasis:entry>
         <oasis:entry colname="col3">0.61</oasis:entry>
         <oasis:entry colname="col4">0.56</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">14</oasis:entry>
         <oasis:entry colname="col7">0.68</oasis:entry>
         <oasis:entry colname="col8">16</oasis:entry>
         <oasis:entry colname="col9">13</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">17</oasis:entry>
         <oasis:entry colname="col2">Stuttgart (Schnarrenberg)</oasis:entry>
         <oasis:entry colname="col3">0.58</oasis:entry>
         <oasis:entry colname="col4">0.54</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">14</oasis:entry>
         <oasis:entry colname="col7">0.50</oasis:entry>
         <oasis:entry colname="col8">21</oasis:entry>
         <oasis:entry colname="col9">15</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">18</oasis:entry>
         <oasis:entry colname="col2">Winterbach Rems–Murr–Kreis</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">0.66</oasis:entry>
         <oasis:entry colname="col5">20</oasis:entry>
         <oasis:entry colname="col6">15</oasis:entry>
         <oasis:entry colname="col7">0.61</oasis:entry>
         <oasis:entry colname="col8">23</oasis:entry>
         <oasis:entry colname="col9">18</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{3}?></table-wrap>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T4" specific-use="star"><?xmltex \currentcnt{4}?><label>Table 4</label><caption><p id="d1e5035">Model performance metrics (i.e., <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, RMSE, and MAE) for all the temperature stations. The final ESD models were trained in a CV setting on datasets from 1958–2010 and evaluated on independent, retained data from 2011–2020.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ID</oasis:entry>
         <oasis:entry colname="col2">Name</oasis:entry>
         <oasis:entry colname="col3">Train <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">CV <inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">CV RMSE</oasis:entry>
         <oasis:entry colname="col6">CV MAE</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">RMSE</oasis:entry>
         <oasis:entry colname="col9">MAE</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Hechingen</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">0.30</oasis:entry>
         <oasis:entry colname="col6">0.30</oasis:entry>
         <oasis:entry colname="col7">0.93</oasis:entry>
         <oasis:entry colname="col8">1.3</oasis:entry>
         <oasis:entry colname="col9">1.2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Klippeneck</oasis:entry>
         <oasis:entry colname="col3">0.94</oasis:entry>
         <oasis:entry colname="col4">0.94</oasis:entry>
         <oasis:entry colname="col5">0.40</oasis:entry>
         <oasis:entry colname="col6">0.30</oasis:entry>
         <oasis:entry colname="col7">0.94</oasis:entry>
         <oasis:entry colname="col8">1.3</oasis:entry>
         <oasis:entry colname="col9">1.2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Lenningen–Schopfloch</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
         <oasis:entry colname="col5">0.50</oasis:entry>
         <oasis:entry colname="col6">0.40</oasis:entry>
         <oasis:entry colname="col7">0.91</oasis:entry>
         <oasis:entry colname="col8">0.9</oasis:entry>
         <oasis:entry colname="col9">0.7</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Murrhardt</oasis:entry>
         <oasis:entry colname="col3">0.81</oasis:entry>
         <oasis:entry colname="col4">0.84</oasis:entry>
         <oasis:entry colname="col5">0.60</oasis:entry>
         <oasis:entry colname="col6">0.50</oasis:entry>
         <oasis:entry colname="col7">0.77</oasis:entry>
         <oasis:entry colname="col8">1</oasis:entry>
         <oasis:entry colname="col9">0.8</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Rottweil</oasis:entry>
         <oasis:entry colname="col3">0.94</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
         <oasis:entry colname="col5">0.50</oasis:entry>
         <oasis:entry colname="col6">0.40</oasis:entry>
         <oasis:entry colname="col7">0.92</oasis:entry>
         <oasis:entry colname="col8">1.1</oasis:entry>
         <oasis:entry colname="col9">1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Schwaebisch Gmuend–Strassdorf</oasis:entry>
         <oasis:entry colname="col3">0.89</oasis:entry>
         <oasis:entry colname="col4">0.85</oasis:entry>
         <oasis:entry colname="col5">0.60</oasis:entry>
         <oasis:entry colname="col6">0.50</oasis:entry>
         <oasis:entry colname="col7">0.91</oasis:entry>
         <oasis:entry colname="col8">0.5</oasis:entry>
         <oasis:entry colname="col9">0.4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Stoetten</oasis:entry>
         <oasis:entry colname="col3">0.98</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.30</oasis:entry>
         <oasis:entry colname="col6">0.20</oasis:entry>
         <oasis:entry colname="col7">0.94</oasis:entry>
         <oasis:entry colname="col8">1.4</oasis:entry>
         <oasis:entry colname="col9">1.4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Stuttgart–Echterdingen</oasis:entry>
         <oasis:entry colname="col3">0.98</oasis:entry>
         <oasis:entry colname="col4">0.97</oasis:entry>
         <oasis:entry colname="col5">0.30</oasis:entry>
         <oasis:entry colname="col6">0.20</oasis:entry>
         <oasis:entry colname="col7">0.94</oasis:entry>
         <oasis:entry colname="col8">1.5</oasis:entry>
         <oasis:entry colname="col9">1.4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Stuttgart (Schnarrenberg)</oasis:entry>
         <oasis:entry colname="col3">0.98</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">0.30</oasis:entry>
         <oasis:entry colname="col6">0.30</oasis:entry>
         <oasis:entry colname="col7">0.95</oasis:entry>
         <oasis:entry colname="col8">1.6</oasis:entry>
         <oasis:entry colname="col9">1.5</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><?xmltex \gdef\@currentlabel{4}?></table-wrap>

      <p id="d1e5421">The final model evaluation using independent, retained data from 2011–2020 yielded <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values of up to 0.95 as well as average RMSE and MAE of <inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">1.0</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M175" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C for temperature and <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values of up to 0.74, average RMSE of 22 mm per month, and MAE of 17 mm per month for precipitation. The<?pagebreak page6497?> discrepancy in temperature and precipitation model performance is unsurprising, since the thermodynamics and atmospheric dynamics controlling precipitation variability are more difficult to represent (e.g., Shepherd, 2014). Regardless, the overall performance speaks in favor of applying the study's approach to downscale midlatitude climate in complex terrain. Moreover, the models' similar performance during CV and the final evaluation indicates that the models were not overfitted and that the predictand–predictor relationships hold outside the observed period. Finally, it is worth noting that the stacking regressor performed better than the individual base models, even when all the potential regressors of the initial experiments (Sect. 4.2) were stacked into a meta-regressor. Such improvements demonstrate the advantage of the ease of experimentation through a package like pyESD.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e5467">Predicted regional annual means of  precipitation in response to <bold>(a)</bold> RCP2.6 (black), <bold>(b)</bold> RCP4.5 (red), and <bold>(c)</bold> RCP8.5 (blue). The solid lines represent the values averaged over all stations, and the shaded boundaries indicate the corresponding variability range (1 standard deviation). The time series are smoothed with a 1-year moving average with a centered mean.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f07.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><?xmltex \def\figurename{Figure}?><label>Figure 8</label><caption><p id="d1e5487"><bold>(a)</bold> Observed precipitation (1958–2100) as well as seasonal (i.e., spring – MAM, summer – JJA, autumn – SON, and winter – DJF) and annual end-of-century (30-year) precipitation climatologies as a result of RCP2.6 <bold>(b)</bold> and RCP8.5 <bold>(c)</bold> forcing. Brown (green)  indicates a decrease (increase) in precipitation relative to the observed means (1958–2010).</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f08.png"/>

        </fig>

      <p id="d1e5504">We visualize a prediction example (Fig. 6) to (a) provide a less abstract presentation of these results and (b) demonstrate the type of figure generated by the plotting utility functions in the <italic>pyESD.plot</italic> module. The figure depicts the predictions generated by the final ESD model for the Hechingen station, a station that records precipitation and temperature (station ID 7 and 1, respectively). The observed and predicted values for 2011–2020 are highly correlated, with PCCs of 0.85 (Fig. 6a) for precipitation and 0.97 (Fig. 6b) for temperature. The time series comparisons also demonstrate the models' abilities to predict the variability of the observed values in both the training and testing period (Fig. 6a and b). Prior to this study, PP-ESD models had not been directly applied to the weather stations in the catchment. However, our models are among the best performing for temperature and precipitation when we compare them to models from other studies across Europe (e.g., Gutiérrez et al., 2019; Hertig et al., 2019; Schmidli et al., 2007). For instance, Gutiérrez et al. (2019) performed an intercomparison of statistical<?pagebreak page6498?> downscaling model performance for 86 stations across Europe using the MOS, PP, and WG methods. The Spearman correlation of the downscaled and observed values yielded <inline-formula><mml:math id="M177" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> values in the range of <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0.0</mml:mn></mml:mrow></mml:math></inline-formula>–0.7 (with many stations <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula>) for precipitation and 0.3–0.95 for temperature. These comparisons also underline the suitability of the pyESD methods for downscaling climate information even in complex mountainous regions.</p>
</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Prediction of local responses to 21st century climate change</title>
      <p id="d1e5545">The predictions of local precipitation and temperature responses to 21st century climate change were generated by coupling the final ESD models to MPI-ESM simulations forced with greenhouse gas concentration scenarios RCP2.6, RCP4.5, and RCP8.5 (Sect. 3.2.3). The results are presented as deviations from the monthly long-term means of the training period (1958–2010) and referred to as “anomalies” hereafter. The annual mean anomaly time series were computed<?pagebreak page6499?> with a 1-year moving average with a centered mean (Figs. 7 and 9).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9" specific-use="star"><?xmltex \currentcnt{9}?><?xmltex \def\figurename{Figure}?><label>Figure 9</label><caption><p id="d1e5550">Predicted regional annual means of the temperature in response to <bold>(a)</bold> RCP2.6 (black), <bold>(b)</bold> RCP4.5 (red), and <bold>(c)</bold> RCP8.5 (blue). The solid lines represent the values averaged over all stations, and the shaded boundaries indicate the corresponding variability range (1 standard deviation). The time series are smoothed with a 1-year moving average with a centered mean.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f09.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F10"><?xmltex \currentcnt{10}?><?xmltex \def\figurename{Figure}?><label>Figure 10</label><caption><p id="d1e5570"><bold>(a)</bold> Observed temperature (1958–2100) as well as seasonal (i.e., spring – MAM, summer – JJA, autumn – SON, and winter – DJF) and annual end-of-century (30-year) temperature climatologies as a result of RCP2.6 <bold>(b)</bold> and RCP8.5 <bold>(c)</bold> forcing. Blue (red)  indicates a decrease (increase) in temperature relative to the observed means (1958–2010).</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f10.png"/>

        </fig>

      <p id="d1e5588">The precipitation predictions (Fig. 7) for RCP8.5 (RCP4.5) show a strong (weak) positive trend towards the end of the century. This trend is even more pronounced for the predicted temperatures (Fig. 9) in the catchment. The predicted precipitation changes vary greatly between weather stations. Furthermore, the RCPs change the magnitude but not the pattern of the predictions for each station. For instance, stations that show an increase (decrease) in precipitation for the RCP2.6 predict a greater increase (decrease) in response to RCP4.5 and RCP8.5. The annual and seasonal 30-year end-of-century climatologies show an overall increase in precipitation in response to both RCP2.6 and RCP4.5 (Fig. 8) for most of the stations. The annual end-of-century climatologies deviate from the present day (1958–2010) by ca. <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> to 20 mm per month for RCP8.5 and ca. <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> mm per month for RCP2.6. Overall, the ESD models predict a precipitation increase of ca. 10 %–20 % until the end of the century. Furthermore, the seasonal climatologies reveal a shift of maximum precipitation away from the summer season for some stations. Such shifts in seasonality and an overall decrease in summer precipitation have previously been predicted (e.g., Gobiet et al., 2014; Paparrizos et al., 2017; Feldmann et al., 2013). Prior to this study, no ESD–GCM-based predictions of the 21st century precipitation changes had been developed for the weather stations of the catchment. However, the models' predictions of the precipitation response to higher greenhouse gas concentration scenarios are comparable to coarser predictions by other studies using RCMs or ESD models (Feldmann et al., 2013; Kunstmann et al., 2004; Paparrizos et al., 2017; Lau et al., 2013). The precipitation predictions generated in this case study can be used further for climate impact assessments, such as assessments of the probability of flooding and drought across the hydrological catchment. The projected shifts in seasonality across the catchment represents potentially valuable information for agricultural planning.</p>
      <p id="d1e5611">The predicted temperature anomalies (Fig. 9) reveal a strong (weak) positive trend for RCP8.5 (RCP4.5). The<?pagebreak page6500?> end-of-century climatologies reveal only moderate warming of ca. <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula> to 1 <inline-formula><mml:math id="M183" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C for RCP2.6 and significant warming (ca. 2–4 <inline-formula><mml:math id="M184" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) for all seasons in response to RCP8.5 (Fig. 10). More specifically, the investigated region is predicted to experience the most warming (<inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M186" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) in the summer season. There are few differences in predicted warming between the stations of the catchment. Generally, the estimated magnitude of warming towards the end of the century is in agreement with the IPCC report (IPCC, 2021) and other downscaled estimates (e.g., Kunstmann et al., 2004; Gutiérrez et al., 2019). The predicted warming would likely implicate societal and ecological systems and stresses the need for efficient adaptation and mitigation strategies.</p>
      <p id="d1e5661">The case study highlights the efficiency and robustness of the downscaling steps implemented in the pyESD package. However, as noted in previous sections, the accuracy of the predictions generated by a GCM–ESD model coupling relies on the predictors being adequately represented by the GCMs. KS tests were performed to evaluate this for the temporal overlap (1979–2000) between the ERA5 reanalysis<?pagebreak page6501?> product and the MPI-ESM GCM output (Sect. 3.2.3). Results from these tests show significant differences in the distribution of ERA5 and MPI-ESM when the raw monthly time series are considered, thus violating the assumptions of the PP-ESD approach. However, this issue does not persist for monthly standardized anomalies of precipitation and temperature (Fig. 11). Previous studies yielded similar results when using seasonal standardizers (Bedia et al., 2020) and principal component transformations (Benestad et al., 2015a), both of which are included in the pyESD package.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F11" specific-use="star"><?xmltex \currentcnt{11}?><?xmltex \def\figurename{Figure}?><label>Figure 11</label><caption><p id="d1e5666">The KS two-sided statistical testing score maps the ERA5 reanalysis product and MPI-ESM GCM output for precipitation <bold>(a–c)</bold> and temperature <bold>(d–f)</bold>. The KS test was applied to raw values, anomalies (centered with zero means), and standardized anomalies with unit variance values (columns from left to right, respectively). The grid boxes with black cross stippling represent low <inline-formula><mml:math id="M187" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values (<inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>), suggesting statistically significant differences in distribution between the ERA5 and MPI-ESM time series.</p></caption>
          <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f11.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F12" specific-use="star"><?xmltex \currentcnt{12}?><?xmltex \def\figurename{Figure}?><label>Figure 12</label><caption><p id="d1e5702">Comparison of 20-year annual precipitation climatologies predicted by the ESD models of this study (black), GCMs (i.e., MPI-ESM in green, CESM5 in red, HadGEM2 in gold), and RCMs (i.e., and CORDEX in purple) for RCP2.6 <bold>(a, b)</bold> and RCP8.5 <bold>(c, d)</bold>.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f12.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F13" specific-use="star"><?xmltex \currentcnt{13}?><?xmltex \def\figurename{Figure}?><label>Figure 13</label><caption><p id="d1e5720">Comparison of 20-year annual temperature climatologies predicted by the ESD models of this study (black), GCMs (i.e., MPI-ESM in green, CESM5 in  red, HadGEM2  in gold), and RCMs (i.e., and CORDEX in purple) for RCP2.6 <bold>(a, b)</bold> and RCP8.5 <bold>(c, d)</bold>.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f13.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS5">
  <label>4.5</label><title>Comparison of GCM and ESD-based predictions</title>
      <p id="d1e5743">A comparison of the ESD-generated annual 20-year climatologies for the mid-century (2040–2060) and the end of the century (2080–2100) to the model output of GCMs and RCMs (i.e., EURO-CORDEX) reveals several differences. The GCMs (MPI-ESM and HadGEM2) predict <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> mm per month (<inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> %) higher precipitation rates than the ESD models and RCMs. The ESD-based precipitation predictions of this study are closest to the RCM estimates but <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> mm per month higher in magnitude for most of the stations (Fig. 12). The closeness of the ESD-based and RCM-based estimates underlines the added value of our ESD approach for downscaling precipitation. However, there are significant (<inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M193" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) differences between the ESD-based and RCM-based temperature estimates (Fig. 13). The ESD-based temperature predictions were higher than those of the RCM but lower than those of the GCM. Both the RCM and ESD models used boundary conditions from the same GCM (MPI-ESM). The RCM reduced the GCM temperatures by more (<inline-formula><mml:math id="M194" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M195" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C) than the ESD models (<inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M197" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C or less). This may be a reflection of both (a) the selection of GCM near-surface temperatures as predictors in the ESD models and (b) the shrinking of regression coefficients when the ESD transfer functions are determined.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Summary and conclusion</title>
      <p id="d1e5846">Contemporary climate change and its impacts increase the demand for high-resolution, regional- and local-scale predictions. These can be generated in a most cost-effective way through the application of the PP-ESD (perfect prognosis empirical-statistical downscaling) approach. The pyESD Python package we introduce here is a well-developed tool and modeling framework for applying and experimenting with PP-ESD for any climate variable (e.g., precipitation, wind speed, and temperature). The package complements existing tools through the following key specialties and strengths. <list list-type="order"><list-item>
      <p id="d1e5851">The package is well-structured and designed in OOP style that treats the weather stations as objects with many functionality attributes that cover all the PP-ESD modeling routines. As a result, all modeling steps can be executed on the initialized station objects with a few lines of code.</p></list-item><list-item>
      <p id="d1e5855">The package is designed in a way that knowing its API (Application Programming Interface), which is introduced in the package's extensive documentation, is sufficient to implement all downscaling steps. In other words, no advanced knowledge of Python (or programming) is required to use the package for research purposes. On the other hand, the package's design is modular and flexible enough to allow advanced users to build on it or adjust it to their needs.</p></list-item><list-item>
      <?pagebreak page6503?><p id="d1e5859">The package implements different predictor selection techniques (i.e., recursive, tree-based, and sequential) that can be manually selected and experimented with. The package allows the user to include a variety of predictors, ranging from regional near-surface temperatures to synoptic-scale teleconnection patterns. The package features many transformation techniques such as <italic>MonthlyStandardizer</italic> and <italic>PCAScalling</italic> that can be used to reduce biases towards specific predictors.</p></list-item><list-item>
      <p id="d1e5869">The package includes a variety of machine learning techniques with different underlying principles and theorems. The package also features many ensemble models (Sect. 2.3), cross-validation schemes, and hyperparameter optimization techniques that can easily be experimented with in a few lines of code.</p></list-item><list-item>
      <p id="d1e5873">The package's core modules are accompanied by utility functions for data preprocessing, post-processing, and serialization to save computational resources, as well as visualization tools and ESD-relevant statistical methods like EOF analysis, correlation, and distribution similarity tests.</p></list-item></list> We demonstrated some of the package's functionalities by developing and applying ESD models to generate precipitation and temperature predictions for a sub-hydrological catchment in complex mountainous terrain in southwestern Germany. The models were evaluated with different metrics and were found to perform well (e.g., <inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn></mml:mrow></mml:math></inline-formula> for precipitation and <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula> for temperature). In order to ensure the reproducibility of the results and allow  easy practical entry for potential users, the application example uses publicly available datasets, and all the scripts used for this study are made available.</p>
      <p id="d1e5907">Despite the promising results of the illustrative case study, the reader is informed of the following important limitations: generally, the PP-ESD approach to predictions relies on the assumption that the empirical relationships between predictor and predictand remain valid through time. While statistical downscaling models have successfully been used for the past climate of the pre-industrial era (Reichert et al., 1999) and Last Glacial Maximum (Vrac et al., 2007), the merit of this assumption must be evaluated on a case-by-case basis. For example, geographical boundary conditions that affect the local climate, such as topography or vegetation cover, are only implicitly considered in the empirical transfer functions. The empirical relationship between predictors and predictands may break down if these boundary conditions change significantly (e.g., Mutz and Aschauer, 2022). Furthermore, the performance of PP-ESD models also depends on the accuracy of the GCMs they are coupled to. In our case study, the developed ESD models were coupled to a single, albeit<?pagebreak page6504?> well-established, GCM (MPI-ESM). However, we generally recommend the use of GCM ensembles to prevent biases towards a specific GCM.</p>
      <p id="d1e5910">The current version of the package includes all functions needed to develop, evaluate, and apply station-based ESD models and generate predictions of local-scale climate change. Nevertheless, the package remains under active development to expand upon its functionality. Planned improvements include an extension of functions to make pyESD suitable for downscaling gridded datasets or satellite observations. The grid-based analysis would contribute to the design of spatial downscaling models (e.g., Chen et al., 2012; Jia et al., 2011). Moreover, we intend to expand the selection of machine learning techniques by including deep learning models that have been proven useful in downscaling (e.g., Baño-Medina et al., 2020; Quesada-Chacón et al., 2022). Finally, we intend to build a graphical, web-based interface to make the package more accessible and easy to use for researchers, students, and people outside the scientific community.</p><?xmltex \hack{\clearpage}?>
</sec>

      
      </body>
    <back><app-group>

<?pagebreak page6505?><app id="App1.Ch1.S1">
  <?xmltex \currentcnt{A}?><label>Appendix A</label><title>Supplementary results of the illustrative case study</title>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F14"><?xmltex \currentcnt{A1}?><?xmltex \def\figurename{Figure}?><label>Figure A1</label><caption><p id="d1e5927">Correlation between the precipitation predictand and the potential predictors listed in Table 2, expressed as PCCs.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f14.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F15"><?xmltex \currentcnt{A2}?><?xmltex \def\figurename{Figure}?><label>Figure A2</label><caption><p id="d1e5940">Correlation between the temperature predictand and the potential predictors listed in Table 2, expressed as PCCs.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f15.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F16"><?xmltex \currentcnt{A3}?><?xmltex \def\figurename{Figure}?><label>Figure A3</label><caption><p id="d1e5955"><bold>(a)</bold> Observed precipitation (1958–2010) as well as seasonal (i.e., spring – MAM, summer – JJA, autumn – SON, and winter – DJF) and annual mid-century (30-year) precipitation climatologies as a result of RCP2.6 <bold>(b)</bold> and RCP8.5 <bold>(c)</bold> forcing. Brown (green)  indicates a decrease (increase) in precipitation relative to the observed means (1958–2010).</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=284.527559pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f16.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F17"><?xmltex \currentcnt{A4}?><?xmltex \def\figurename{Figure}?><label>Figure A4</label><caption><p id="d1e5976">Observed temperature (1958–2010) as well as seasonal (i.e., spring – MAM, summer – JJA, autumn – SON, and winter – DJF) and annual mid-century (30-year) temperature climatologies as a result of RCP2.6 <bold>(b)</bold> and RCP8.5 <bold>(c)</bold> forcing. Blue (red)  indicates a decrease (increase) in temperature relative to the observed means (1958–2010).</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=156.490157pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f17.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F18"><?xmltex \currentcnt{A5}?><?xmltex \def\figurename{Figure}?><label>Figure A5</label><caption><p id="d1e5996">The KS two-sided statistical testing score maps the ERA5 reanalysis product and MPI-ESM GCM output for relative humidity <bold>(a–c)</bold>, zonal wind velocity <bold>(d–f)</bold>, and meridional wind velocity <bold>(g–i)</bold> at 850 hPa. The KS test was applied to raw values, anomalies (centered with zero means), and standardized anomalies with unit variance values (columns from left to right, respectively). The grid boxes with black cross stippling represent low <inline-formula><mml:math id="M200" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values (<inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>), suggesting statistically significant differences in distribution between the ERA5 and MPI-ESM time series.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/6479/2023/gmd-16-6479-2023-f18.png"/>

      </fig>

<?xmltex \hack{\clearpage}?>
</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e6041">The study's illustrative case study relies on publicly available datasets. More specifically, the precipitation and temperature datasets are accessible through the Climate Data Centre of the DWD  (Deutscher Wetterdienst, version V21.3). The subcatchment datasets used in this study are interactively available through <uri>https://cdc.dwd.de/portal/shortlink/425267fe-e4fd-4fff-9969-14c7d3aa25de</uri> (Deutscher Wetterdienst, 2023) and <uri>https://cdc.dwd.de/portal/shortlink/da6f555d-d6f6-426a-a8ba-b96683c76ea9</uri> (last access: 30 October 2023) for precipitation and temperature stations, respectively. The ERA5 reanalysis datasets can also be downloaded through the Copernicus Climate Data Store (CDS)  at <ext-link xlink:href="https://doi.org/10.24381/cds.6860a573" ext-link-type="DOI">10.24381/cds.6860a573</ext-link> (Hersbach et al., 2023) for pressure level and <ext-link xlink:href="https://doi.org/10.24381/cds.68d2bb30" ext-link-type="DOI">10.24381/cds.68d2bb30</ext-link> (Muñoz Sabater, 2019) for surface-level variables. However, the processed weather stations and the serialized pickle files of the regional means of the predictors for all the stations are provided as part of the supporting material (<ext-link xlink:href="https://doi.org/10.5281/zenodo.7767681" ext-link-type="DOI">10.5281/zenodo.7767681</ext-link>, Boateng and Mutz, 2023). The MPI-ESM GCM datasets used as simulated predictors can also be downloaded from the CDS by selecting MPI-ESM-LR as the model for the AMIP as well as the RCP2.6, 4.5, and 8.5 experiments: see <ext-link xlink:href="https://doi.org/10.24381/cds.3b4b5bc9" ext-link-type="DOI">10.24381/cds.3b4b5bc9</ext-link> (Copernicus Climate Change Service, Climate Data Store, 2018a) for pressure-level variables and <ext-link xlink:href="https://doi.org/10.24381/cds.9d44a987" ext-link-type="DOI">10.24381/cds.9d44a987</ext-link> (Copernicus Climate Change Service, Climate Data Store, 2018b) for surface variables. Moreover, the station-based downscaling estimates of future climate scenarios for all the stations are also included in the supporting material (<ext-link xlink:href="https://doi.org/10.5281/zenodo.7767681" ext-link-type="DOI">10.5281/zenodo.7767681</ext-link>, Boateng and Mutz, 2023).</p>

      <p id="d1e6069">The pyESD (version 1.0.1) software, including the documentation website source files, is available through many platforms, including the following. <list list-type="bullet"><list-item>
      <p id="d1e6074">GitHub: <uri>https://github.com/Dan-Boat/PyESD</uri> (last access: 30 October 2023)</p></list-item><list-item>
      <p id="d1e6081">Python package index (PyPI): <uri>https://pypi.org/project/PyESD/</uri> (last access: 30 October 2023)</p></list-item><list-item>
      <p id="d1e6088">Zenodo (v1.0.1 release): <?xmltex \hack{\newline}?> <ext-link xlink:href="https://doi.org/10.5281/zenodo.7767629" ext-link-type="DOI">10.5281/zenodo.7767629</ext-link> (Boateng, 2023)</p></list-item></list></p>

      <p id="d1e6096">Developer: Daniel Boateng, University of Tübingen</p>

      <p id="d1e6099">Hardware requirements: general-purpose computer</p>

      <p id="d1e6102">Programming language: Python (version 3.7 or later)</p>

      <p id="d1e6106">The installation of the package and its required dependencies are highlighted on the documentation website: <uri>https://dan-boat.github.io/PyESD/</uri> (last access: 30 October 2023). The usage of the package and its functionalities are also presented in the documentation. The control scripts of the study's illustrative case study are also provided as part of the supporting material (<ext-link xlink:href="https://doi.org/10.5281/zenodo.7767681" ext-link-type="DOI">10.5281/zenodo.7767681</ext-link>, Boateng and Mutz, 2023) and also presented in the example section of the documentation.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e6118">DB: pyESD software and documentation website development, conceptualization, modeling, data analysis, visualization, and writing of the original paper. SGM: supervision, paper editing, and funding acquisition.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e6124">The contact author has declared that neither of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e6130">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e6136">This study was partially supported by the German Science Foundation (DFG) under grant nos. MU4188/3-1 and MU4188/1-1, awarded to Sebastian G. Mutz. We acknowledge the World Climate Research Programme, which, through its Working Group on Coupled Modelling, coordinated and promoted CMIP5. We thank the climate modeling groups for producing and making available their model output, the Earth System Grid Federation (ESGF) for archiving the data and providing access, and the multiple funding agencies who support CMIP5 and ESGF. We also thank the European Centre for Medium-Range Weather Forecasts for providing the ERA5 data product and the Deutsche Wetterdienst (DWD) for providing the weather station records. W We thank Charles Onyutha and the three anonymous reviewers for their constructive reviews.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e6141">This research has been supported by the Deutsche Forschungsgemeinschaft (grant nos. MU4188/3-1 and MU4188/1-1).<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?>This open-access publication was funded <?xmltex \notforhtml{\newline}?> by the University of Tübingen.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e6152">This paper was edited by Charles Onyutha and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>Anandhi, A., Srinivas, V. V., Nanjundiah, R. S., and Nagesh Kumar, D.: Downscaling precipitation to river basin in India for IPCC SRES scenarios using support vector machine, Int. J. Climatol., 28, 401–420, <ext-link xlink:href="https://doi.org/10.1002/joc.1529" ext-link-type="DOI">10.1002/joc.1529</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><?label 1?><mixed-citation>Arlot, S. and Celisse, A.: A survey of cross-validation procedures for model selection, Stat. Surv., 4, 40–79, <ext-link xlink:href="https://doi.org/10.1214/09-SS054" ext-link-type="DOI">10.1214/09-SS054</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><?label 1?><mixed-citation> Balasundaram, S. and Tanveer, M.: On Lagrangian twin support vector regression, Neural Comput. Appl., 22, 257–267, 2013.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><?label 1?><mixed-citation>Baño-Medina, J., Manzanas, R., and Gutiérrez, J. M.: Configuration and intercomparison of deep learning neural models for statistical downscaling, Geosci. Model Dev., 13, 2109–2124, <ext-link xlink:href="https://doi.org/10.5194/gmd-13-2109-2020" ext-link-type="DOI">10.5194/gmd-13-2109-2020</ext-link>, 2020.</mixed-citation></ref>
      <?pagebreak page6509?><ref id="bib1.bib5"><label>5</label><?label 1?><mixed-citation>Bárdossy, A.: Atmospheric circulation pattern classification for South-West Germany using hydrological variables, Phys. Chem. Earth Parts A/B/C, 35, 498–506, <ext-link xlink:href="https://doi.org/10.1016/j.pce.2010.02.007" ext-link-type="DOI">10.1016/j.pce.2010.02.007</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><?label 1?><mixed-citation>Barnston, A. G. and Livezey, R. E.: Classification, Seasonality and Persistence of Low-Frequency Atmospheric Circulation Patterns, Mon. Weather Rev., 115, 1083–1126, <ext-link xlink:href="https://doi.org/10.1175/1520-0493(1987)115&lt;1083:CSAPOL&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0493(1987)115&lt;1083:CSAPOL&gt;2.0.CO;2</ext-link>, 1987.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><?label 1?><mixed-citation>Bedia, J., Baño-Medina, J., Legasa, M. N., Iturbide, M., Manzanas, R., Herrera, S., Casanueva, A., San-Martín, D., Cofiño, A. S., and Gutiérrez, J. M.: Statistical downscaling with the downscaleR package (v3.1.0): contribution to the VALUE intercomparison experiment, Geosci. Model Dev., 13, 1711–1735, <ext-link xlink:href="https://doi.org/10.5194/gmd-13-1711-2020" ext-link-type="DOI">10.5194/gmd-13-1711-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><?label 1?><mixed-citation>Bell, B., Hersbach, H., Simmons, A., Berrisford, P., Dahlgren, P., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Radu, R., Schepers, D., Soci, C., Villaume, S., Bidlot, J.-R., Haimberger, L., Woollen, J., Buontempo, C., and Thépaut, J.-N.: The ERA5 global reanalysis: Preliminary extension to 1950, Q. J. Roy. Meteor. Soc., 147, 4186–4227, <ext-link xlink:href="https://doi.org/10.1002/qj.4174" ext-link-type="DOI">10.1002/qj.4174</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><?label 1?><mixed-citation>Benestad, R. E., Chen, D., Mezghani, A., Fan, L., and Parding, K.: On using principal components to represent stations in empirical–statistical downscaling, Tellus A, 67, 28326, <ext-link xlink:href="https://doi.org/10.3402/tellusa.v67.28326" ext-link-type="DOI">10.3402/tellusa.v67.28326</ext-link>, 2015a.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><?label 1?><mixed-citation>Benestad, R. E., Mezghani, A., and Parding M. K.: “esd” – The Empirical-Statistical Downscaling tool &amp; its visualisation capabilities, Figshare, <ext-link xlink:href="https://doi.org/10.6084/m9.figshare.1454425.v1" ext-link-type="DOI">10.6084/m9.figshare.1454425.v1</ext-link>, 2015b.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><?label 1?><mixed-citation>Bergmeir, C. and Benítez, J. M.: On the use of cross-validation for time series predictor evaluation, Inform. Sciences, 191, 192–213, <ext-link xlink:href="https://doi.org/10.1016/j.ins.2011.12.028" ext-link-type="DOI">10.1016/j.ins.2011.12.028</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><?label 1?><mixed-citation> Bergstra, J. and Bengio, Y.: Random search for hyper-parameter optimization, J. Mach. Learn. Res., 13, 281–305, 2012.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><?label 1?><mixed-citation> Bergstra, J., Bardenet, R., Bengio, Y., and Kégl, B.: lgorithms for hyper-parameter optimization, in: Proceedings of the 24th International Conference on Neural Information Processing Systems, 2546–2554,  2011.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><?label 1?><mixed-citation> Bishop, C. M. and Nasrabadi, N. M.: Pattern recognition and machine learning, Springer, ISBN 978-1-4939-3843-8, 2006.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><?label 1?><mixed-citation>Boateng, D.: Dan-Boat/PyESD: PyESDv1.0.1 (v1.0.1), Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.7767629" ext-link-type="DOI">10.5281/zenodo.7767629</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><?label 1?><mixed-citation>Boateng, D. and Mutz, S. G.: Supporting material for PyESDv1.0.1 An open-source Python framework for empirical-statistical downscaling of climate information, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.7767681" ext-link-type="DOI">10.5281/zenodo.7767681</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><?label 1?><mixed-citation>Boateng, D., Mutz, S. G., Ballian, A., Meijers, M. J. M., Methner, K., Botsyun, S., Mulch, A., and Ehlers, T. A.: The effects of diachronous surface uplift of the European Alps on regional climate and the oxygen isotopic composition of precipitation, Earth Syst. Dynam. Discuss. [preprint], <ext-link xlink:href="https://doi.org/10.5194/esd-2022-48" ext-link-type="DOI">10.5194/esd-2022-48</ext-link>, in review, 2022.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><?label 1?><mixed-citation>Boé, J., Terray, L., Martin, E., and Habets, F.: Projected changes in components of the hydrological cycle in French river basins during the 21st century, Water Resour. Res., 45,  W08426,  <ext-link xlink:href="https://doi.org/10.1029/2008WR007437" ext-link-type="DOI">10.1029/2008WR007437</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><?label 1?><mixed-citation>Bottou, L.: Stochastic gradient learning in neural networks, Proc. Neuro-Nımes, 91, <uri>https://leon.bottou.org/publications/pdf/nimes-1991.pdf</uri> (last access 30 October 2023), 1991.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><?label 1?><mixed-citation>Bourgault, P., Huard, D., Smith, T. J., Logan, T., Aoun, A., Lavoie, J., Dupuis, É., Rondeau-Genesse, G., Alegre, R., Barnes, C., Laperrière, A. B., Biner, S., Caron, D., Ehbrecht, C., Fyke, J., Keel, T., Labonté, M.-P., Lierhammer, L., Low, J.-F., Quinn, J., Roy, P., Squire, D., Stephens, A., Tanguy, M., and Whelan, C.: xclim: xarray-based climate data analytics, J. Open Source Softw., 8, 5415, <ext-link xlink:href="https://doi.org/10.21105/joss.05415" ext-link-type="DOI">10.21105/joss.05415</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><?label 1?><mixed-citation>Brands, S., Gutiérrez, J. M., Herrera, S., and Cofiño, A. S.: On the Use of Reanalysis Data for Downscaling, J. Climate, 25, 2517–2526, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-11-00251.1" ext-link-type="DOI">10.1175/JCLI-D-11-00251.1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><?label 1?><mixed-citation> Breiman, L.: Bagging predictors, Mach. Learn., 24, 123–140, 1996a.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><?label 1?><mixed-citation>Breiman, L.: Stacked regressions, Mach. Learn., 24, 49–64, <ext-link xlink:href="https://doi.org/10.1007/BF00117832" ext-link-type="DOI">10.1007/BF00117832</ext-link>, 1996b.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><?label 1?><mixed-citation> Breiman, L.: Random forests, Mach. Learn., 45, 5–32, 2001.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><?label 1?><mixed-citation>Chaudhuri, A. and Hu, W.: A fast algorithm for computing distance correlation, Comput. Stat. Data Anal., 135, 15–24, <ext-link xlink:href="https://doi.org/10.1016/j.csda.2019.01.016" ext-link-type="DOI">10.1016/j.csda.2019.01.016</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><?label 1?><mixed-citation>Chen, J., Brissette, F. P., and Leconte, R.: Coupling statistical and dynamical methods for spatial downscaling of precipitation, Clim. Change, 114, 509–526, <ext-link xlink:href="https://doi.org/10.1007/s10584-012-0452-2" ext-link-type="DOI">10.1007/s10584-012-0452-2</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><?label 1?><mixed-citation>Chen, S.-T., Yu, P.-S., and Tang, Y.-H.: Statistical downscaling of daily precipitation using support vector machines and multivariate analysis, J. Hydrol., 385, 13–22, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2010.01.021" ext-link-type="DOI">10.1016/j.jhydrol.2010.01.021</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><?label 1?><mixed-citation>Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '16: The 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco California USA, 785–794, <ext-link xlink:href="https://doi.org/10.1145/2939672.2939785" ext-link-type="DOI">10.1145/2939672.2939785</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><?label 1?><mixed-citation>Chen, X. and Jeong, J. C.: Enhanced recursive feature elimination, in: Sixth International Conference on Machine Learning and Applications (ICMLA 2007), Sixth International Conference on Machine Learning and Applications (ICMLA 2007), 429–435, <ext-link xlink:href="https://doi.org/10.1109/ICMLA.2007.35" ext-link-type="DOI">10.1109/ICMLA.2007.35</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><?label 1?><mixed-citation>Colette, A., Granier, C., Hodnebrog, Ø., Jakobs, H., Maurizi, A., Nyiri, A., Rao, S., Amann, M., Bessagnet, B., D'Angiola, A., Gauss, M., Heyes, C., Klimont, Z., Meleux, F., Memmesheimer, M., Mieville, A., Rouïl, L., Russo, F., Schucht, S., Simpson, D., Stordal, F., Tampieri, F., and Vrac, M.: Future air quality in Europe: a multi-model assessment of projected exposure to ozone, Atmos. Chem. Phys., 12, 10613–10630, <ext-link xlink:href="https://doi.org/10.5194/acp-12-10613-2012" ext-link-type="DOI">10.5194/acp-12-10613-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><?label 1?><mixed-citation> Collins, W. J., Bellouin, N., Doutriaux-Boucher, M., Gedney, N., Hinton, T., Jones, C. D., Liddicoat, S., Martin, G., O'Connor, F., and Rae, J.: Evaluation of the HadGEM2 model, Met Office Exeter, UK, 2008.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><?label 1?><mixed-citation>Copernicus Climate Change Service, Climate Data Store: CMIP5 monthly data on pressure levels, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], <ext-link xlink:href="https://doi.org/10.24381/cds.3b4b5bc9" ext-link-type="DOI">10.24381/cds.3b4b5bc9</ext-link>, 2018a.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><?label 1?><mixed-citation>Copernicus Climate Change Service, Climate Data Store: CMIP5 monthly data on single levels, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], <ext-link xlink:href="https://doi.org/10.24381/cds.9d44a987" ext-link-type="DOI">10.24381/cds.9d44a987</ext-link>, 2018b.</mixed-citation></ref>
      <?pagebreak page6510?><ref id="bib1.bib34"><label>34</label><?label 1?><mixed-citation>Cristianini, N. and Shawe-Taylor, J.: An introduction to support vector machines and other kernel-based learning methods, Cambridge University Press, <ext-link xlink:href="https://doi.org/10.1017/CBO9780511801389" ext-link-type="DOI">10.1017/CBO9780511801389</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><?label 1?><mixed-citation>Das, D., Dy, J., Ross, J., Obradovic, Z., and Ganguly, A. R.: Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling, Nonlin. Processes Geophys., 21, 1145–1157, <ext-link xlink:href="https://doi.org/10.5194/npg-21-1145-2014" ext-link-type="DOI">10.5194/npg-21-1145-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><?label 1?><mixed-citation>Dau, Q. V., Kuntiyawichai, K., and Adeloye, A. J.: Future Changes in Water Availability Due to Climate Change Projections for Huong Basin, Vietnam, Environ. Process., 8, 77–98, <ext-link xlink:href="https://doi.org/10.1007/s40710-020-00475-y" ext-link-type="DOI">10.1007/s40710-020-00475-y</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><?label 1?><mixed-citation>Deutscher Wetterdienst: Sub-catchment datasets, Deutscher Wetterdienst [data set], <uri>https://cdc.dwd.de/portal/shortlink/da6f555d-d6f6-426a-a8ba-b96683c76ea9</uri>, (last access: 30 October 2023),  2023.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><?label 1?><mixed-citation>Diaz, G. I., Fokoue-Nkoutche, A., Nannicini, G., and Samulowitz, H.: An effective algorithm for hyperparameter optimization of neural networks, IBM J. Res. Develop., 61, 9:1–9:11,  <ext-link xlink:href="https://doi.org/10.1147/JRD.2017.2709578" ext-link-type="DOI">10.1147/JRD.2017.2709578</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><?label 1?><mixed-citation>Dietterich, T. G.: Ensemble Methods in Machine Learning, in: Multiple Classifier Systems, Berlin, Heidelberg, 1–15, <ext-link xlink:href="https://doi.org/10.1007/3-540-45014-9_1" ext-link-type="DOI">10.1007/3-540-45014-9_1</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><?label 1?><mixed-citation>Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R.: Least angle regression, Ann. Stat., 32, 407–499, <ext-link xlink:href="https://doi.org/10.1214/009053604000000067" ext-link-type="DOI">10.1214/009053604000000067</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><?label 1?><mixed-citation> Errico, R. M., Stensrud, D. J., and Raeder, K. D.: Estimation of the error distributions of precipitation produced by convective parametrization schemes, Q. J. Roy. Meteor. Soc., 127, 2495–2512, 2001.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><?label 1?><mixed-citation>Fan, J., Wu, L., Zheng, J., and Zhang, F.: Medium-range forecasting of daily reference evapotranspiration across China using numerical weather prediction outputs downscaled by extreme gradient boosting, J. Hydrol., 601, 126664, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2021.126664" ext-link-type="DOI">10.1016/j.jhydrol.2021.126664</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><?label 1?><mixed-citation>Fealy, R. and Sweeney, J.: Statistical downscaling of precipitation for a selection of sites in Ireland employing a generalised linear modelling approach, Int. J. Climatol., 27, 2083–2094, <ext-link xlink:href="https://doi.org/10.1002/joc.1506" ext-link-type="DOI">10.1002/joc.1506</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><?label 1?><mixed-citation>Feldmann, H., Schädler, G., Panitz, H.-J., and Kottmeier, C.: Near future changes of extreme precipitation over complex terrain in Central Europe derived from high resolution RCM ensemble simulations, Int. J. Climatol., 33, 1964–1977, <ext-link xlink:href="https://doi.org/10.1002/joc.3564" ext-link-type="DOI">10.1002/joc.3564</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><?label 1?><mixed-citation>Ferri, F. J., Pudil, P., Hatef, M., and Kittler, J.: Comparative study of techniques for large-scale feature selection, in: Machine Intelligence and Pattern Recognition, vol. 16, edited by: Gelsema, E. S. and Kanal, L. S., North-Holland, 403–413, <ext-link xlink:href="https://doi.org/10.1016/B978-0-444-81892-8.50040-7" ext-link-type="DOI">10.1016/B978-0-444-81892-8.50040-7</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><?label 1?><mixed-citation> Field, C. B. and Barros, V. R.: Climate Change 2014 – Impacts, Adaptation and Vulnerability: Regional Aspects, Cambridge University Press, 695 pp., 2014.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><?label 1?><mixed-citation>Freund, Y. and Schapire, R. E.: A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, J. Comput. Syst. Sci., 55, 119–139, <ext-link xlink:href="https://doi.org/10.1006/jcss.1997.1504" ext-link-type="DOI">10.1006/jcss.1997.1504</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><?label 1?><mixed-citation> Friedman, J. H.: Greedy function approximation: a gradient boosting machine, Ann. Stat., 29, 1189–1232, 2001.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><?label 1?><mixed-citation> Gardner, M. W. and Dorling, S. R.: Artificial neural networks (the multilayer perceptron) – a review of applications in the atmospheric sciences, Atmos. Environ., 32, 2627–2636, 1998.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><?label 1?><mixed-citation>Gareth, J., Daniela, W., Trevor, H., and Robert, T.: An introduction to statistical learning: with applications in R, Spinger, <ext-link xlink:href="https://doi.org/10.1007/978-1-4614-7138-7" ext-link-type="DOI">10.1007/978-1-4614-7138-7</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><?label 1?><mixed-citation> Geurts, P., Ernst, D., and Wehenkel, L.: Extremely randomized trees, Mach. Learn., 63, 3–42, 2006.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><?label 1?><mixed-citation>Ghosh, S. and Mujumdar, P. P.: Statistical downscaling of GCM simulations to streamflow using relevance vector machine, Adv. Water Resour., 31, 132–146, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2007.07.005" ext-link-type="DOI">10.1016/j.advwatres.2007.07.005</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib53"><label>53</label><?label 1?><mixed-citation>Giorgi, F. and Mearns, L. O.: Approaches to the simulation of regional climate change: A review, Rev. Geophys., 29, 191–216, <ext-link xlink:href="https://doi.org/10.1029/90RG02636" ext-link-type="DOI">10.1029/90RG02636</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><?label 1?><mixed-citation>Gobiet, A., Kotlarski, S., Beniston, M., Heinrich, G., Rajczak, J., and Stoffel, M.: 21st century climate change in the European Alps – A review, Sci. Total Environ., 493, 1138–1151, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2013.07.050" ext-link-type="DOI">10.1016/j.scitotenv.2013.07.050</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><?label 1?><mixed-citation>Gu, J., Liu, S., Zhou, Z., Chalov, S. R., and Zhuang, Q.: A Stacking Ensemble Learning Model for Monthly Rainfall Prediction in the Taihu Basin, China, Water, 14, 492, <ext-link xlink:href="https://doi.org/10.3390/w14030492" ext-link-type="DOI">10.3390/w14030492</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><?label 1?><mixed-citation>Gutiérrez, J. M., San Martín, D., Cofiño, A. S., Herrera, S., Manzanas, R., and Frías, M. D.: User Guide of the ENSEMBLES Downscaling Portal (version 2), Tech. Rep. 2.2011, Santander Meteorology Group, CSIC-UC,  <uri>https://core.ac.uk/download/pdf/36151579.pdf</uri> (last access: 30 October 2023), 2012.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><?label 1?><mixed-citation>Gutiérrez, J. M., San-Martín, D., Brands, S., Manzanas, R., and Herrera, S.: Reassessing Statistical Downscaling Techniques for Their Robust Application under Climate Change Conditions, J. Climate, 26, 171–188, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-11-00687.1" ext-link-type="DOI">10.1175/JCLI-D-11-00687.1</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib58"><label>58</label><?label 1?><mixed-citation>Gutiérrez, J. M., Maraun, D., Widmann, M., Huth, R., Hertig, E., Benestad, R., Roessler, O., Wibig, J., Wilcke, R., Kotlarski, S., San Martín, D., Herrera, S., Bedia, J., Casanueva, A., Manzanas, R., Iturbide, M., Vrac, M., Dubrovsky, M., Ribalaygua, J., Pórtoles, J., Räty, O., Räisänen, J., Hingray, B., Raynaud, D., Casado, M. J., Ramos, P., Zerenner, T., Turco, M., Bosshard, T., Štěpánek, P., Bartholy, J., Pongracz, R., Keller, D. E., Fischer, A. M., Cardoso, R. M., Soares, P. M. M., Czernecki, B., and Pagé, C.: An intercomparison of a large ensemble of statistical downscaling methods over Europe: Results from the VALUE perfect predictor cross-validation experiment, Int. J. Climatol., 39, 3750–3785, <ext-link xlink:href="https://doi.org/10.1002/joc.5462" ext-link-type="DOI">10.1002/joc.5462</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib59"><label>59</label><?label 1?><mixed-citation>Hammami, D., Lee, T. S., Ouarda, T. B. M. J., and Lee, J.: Predictor selection for downscaling GCM data with LASSO, J. Geophys. Res.-Atmos., 117, D17116,  <ext-link xlink:href="https://doi.org/10.1029/2012JD017864" ext-link-type="DOI">10.1029/2012JD017864</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib60"><label>60</label><?label 1?><mixed-citation>Hastie, T., Friedman, J., and Tibshirani, R.: The Elements of Statistical Learning, Springer New York, New York, NY, <ext-link xlink:href="https://doi.org/10.1007/978-0-387-21606-5" ext-link-type="DOI">10.1007/978-0-387-21606-5</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib61"><label>61</label><?label 1?><mixed-citation>He, X., Chaney, N. W., Schleiss, M., and Sheffield, J.: Spatial downscaling of precipitation using adaptable random forests, Water Resour. Res., 52, 8217–8237, <ext-link xlink:href="https://doi.org/10.1002/2016WR019034" ext-link-type="DOI">10.1002/2016WR019034</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib62"><label>62</label><?label 1?><mixed-citation> Hecht-Nielsen, R.: Theory of the backpropagation neural network, in: Neural networks for perception, Elsevier, 65–93, 1992.</mixed-citation></ref>
      <?pagebreak page6511?><ref id="bib1.bib63"><label>63</label><?label 1?><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, <ext-link xlink:href="https://doi.org/10.1002/qj.3803" ext-link-type="DOI">10.1002/qj.3803</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib64"><label>64</label><?label 1?><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 monthly averaged data on pressure levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], <ext-link xlink:href="https://doi.org/10.24381/cds.6860a573" ext-link-type="DOI">10.24381/cds.6860a573</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib65"><label>65</label><?label 1?><mixed-citation>Hertig, E., Maraun, D., Bartholy, J., Pongracz, R., Vrac, M., Mares, I., Gutiérrez, J. M., Wibig, J., Casanueva, A., and Soares, P. M. M.: Comparison of statistical downscaling methods with respect to extreme events over Europe: Validation results from the perfect predictor experiment of the COST Action VALUE, Int. J. Climatol., 39, 3846–3867, <ext-link xlink:href="https://doi.org/10.1002/joc.5469" ext-link-type="DOI">10.1002/joc.5469</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib66"><label>66</label><?label 1?><mixed-citation>Hewitson, B. C., Daron, J., Crane, R. G., Zermoglio, M. F., and Jack, C.: Interrogating empirical-statistical downscaling, Clim. Change, 122, 539–554, <ext-link xlink:href="https://doi.org/10.1007/s10584-013-1021-z" ext-link-type="DOI">10.1007/s10584-013-1021-z</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib67"><label>67</label><?label 1?><mixed-citation>Hodgkins, G. A., Whitfield, P. H., Burn, D. H., Hannaford, J., Renard, B., Stahl, K., Fleig, A. K., Madsen, H., Mediero, L., Korhonen, J., Murphy, C., and Wilson, D.: Climate-driven variability in the occurrence of major floods across North America and Europe, J. Hydrol., 552, 704–717, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2017.07.027" ext-link-type="DOI">10.1016/j.jhydrol.2017.07.027</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib68"><label>68</label><?label 1?><mixed-citation>Hofmann, T., Schölkopf, B., and Smola, A. J.: Kernel methods in machine learning, Ann. Stat., 36, 1171–1220, <ext-link xlink:href="https://doi.org/10.1214/009053607000000677" ext-link-type="DOI">10.1214/009053607000000677</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib69"><label>69</label><?label 1?><mixed-citation>Hoyer, S. and Hamman, J.: xarray: N-D labeled Arrays and Datasets in Python, J. Open Res. Softw., 5, 10, <ext-link xlink:href="https://doi.org/10.5334/jors.148" ext-link-type="DOI">10.5334/jors.148</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib70"><label>70</label><?label 1?><mixed-citation>Hurrell, J. W.: Decadal Trends in the North Atlantic Oscillation: Regional Temperatures and Precipitation, Science, 269, 676–679, <ext-link xlink:href="https://doi.org/10.1126/science.269.5224.676" ext-link-type="DOI">10.1126/science.269.5224.676</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bib71"><label>71</label><?label 1?><mixed-citation>Hurrell, J. W. and Van Loon, H.: Decadal Variations in Climate Associated with the North Atlantic Oscillation, in: Climatic Change at High Elevation Sites, edited by: Diaz, H. F., Beniston, M., and Bradley, R. S., Springer Netherlands, Dordrecht, 69–94, <ext-link xlink:href="https://doi.org/10.1007/978-94-015-8905-5_4" ext-link-type="DOI">10.1007/978-94-015-8905-5_4</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bib72"><label>72</label><?label 1?><mixed-citation>Huth, R.: Statistical downscaling in central Europe: evaluation of methods and potential predictors, Clim. Res., 13, 91–101, <ext-link xlink:href="https://doi.org/10.3354/cr013091" ext-link-type="DOI">10.3354/cr013091</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bib73"><label>73</label><?label 1?><mixed-citation>Huth, R.: Sensitivity of Local Daily Temperature Change Estimates to the Selection of Downscaling Models and Predictors, J. Climate, 17, 640–652, <ext-link xlink:href="https://doi.org/10.1175/1520-0442(2004)017&lt;0640:SOLDTC&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0442(2004)017&lt;0640:SOLDTC&gt;2.0.CO;2</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib74"><label>74</label><?label 1?><mixed-citation>IPCC: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V.,  Zhai, P.,  Pirani, A.,  Connors, S. L.,  Péan, C., Berger, S.,  Caud, N.,  Chen, Y.,  Goldfarb, L., Gomis, M. I.,  Huang, M.,  Leitzell, K.,  Lonnoy, E., Matthews, J. B. R.,  Maycock, T. K.,  Waterfield, T.,  Yelekçi, O.,  Yu, R., and  Zhou, B., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 2391 pp., <ext-link xlink:href="https://doi.org/10.1017/9781009157896" ext-link-type="DOI">10.1017/9781009157896</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib75"><label>75</label><?label 1?><mixed-citation>Jakob Themeßl, M., Gobiet, A., and Leuprecht, A.: Empirical-statistical downscaling and error correction of daily precipitation from regional climate models, Int. J. Climatol., 31, 1530–1544, <ext-link xlink:href="https://doi.org/10.1002/joc.2168" ext-link-type="DOI">10.1002/joc.2168</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib76"><label>76</label><?label 1?><mixed-citation>Jia, S., Zhu, W., Lű, A., and Yan, T.: A statistical spatial downscaling algorithm of TRMM precipitation based on NDVI and DEM in the Qaidam Basin of China, Remote Sens. Environ., 115, 3069–3079, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2011.06.009" ext-link-type="DOI">10.1016/j.rse.2011.06.009</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib77"><label>77</label><?label 1?><mixed-citation>Jordan, M. I. and Mitchell, T. M.: Machine learning: Trends, perspectives, and prospects, Science, 349, 255–260, <ext-link xlink:href="https://doi.org/10.1126/science.aaa8415" ext-link-type="DOI">10.1126/science.aaa8415</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib78"><label>78</label><?label 1?><mixed-citation>Kay, J. E., Deser, C., Phillips, A., Mai, A., Hannay, C., Strand, G., Arblaster, J. M., Bates, S. C., Danabasoglu, G., Edwards, J., Holland, M., Kushner, P., Lamarque, J.-F., Lawrence, D., Lindsay, K., Middleton, A., Munoz, E., Neale, R., Oleson, K., Polvani, L., and Vertenstein, M.: The Community Earth System Model (CESM) Large Ensemble Project: A Community Resource for Studying Climate Change in the Presence of Internal Climate Variability, B. Am. Meteorol. Soc., 96, 1333–1349, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-13-00255.1" ext-link-type="DOI">10.1175/BAMS-D-13-00255.1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib79"><label>79</label><?label 1?><mixed-citation> Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv [preprint], arXiv:1412.6980, 2014.</mixed-citation></ref>
      <ref id="bib1.bib80"><label>80</label><?label 1?><mixed-citation>Kunstmann, H., Schneider, K., Forkel, R., and Knoche, R.: Impact analysis of climate change for an Alpine catchment using high resolution dynamic downscaling of ECHAM4 time slices, Hydrol. Earth Syst. Sci., 8, 1031–1045, <ext-link xlink:href="https://doi.org/10.5194/hess-8-1031-2004" ext-link-type="DOI">10.5194/hess-8-1031-2004</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib81"><label>81</label><?label 1?><mixed-citation>Lachenbruch, P. A. and Mickey, M. R.: Estimation of Error Rates in Discriminant Analysis, Technometrics, 10, 1–11, <ext-link xlink:href="https://doi.org/10.1080/00401706.1968.10490530" ext-link-type="DOI">10.1080/00401706.1968.10490530</ext-link>, 1968.</mixed-citation></ref>
      <ref id="bib1.bib82"><label>82</label><?label 1?><mixed-citation>Laflamme, E. M., Linder, E., and Pan, Y.: Statistical downscaling of regional climate model output to achieve projections of precipitation extremes, Weather Climate Extremes, 12, 15–23, <ext-link xlink:href="https://doi.org/10.1016/j.wace.2015.12.001" ext-link-type="DOI">10.1016/j.wace.2015.12.001</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib83"><label>83</label><?label 1?><mixed-citation>Lau, W. K.-M., Wu, H.-T., and Kim, K.-M.: A canonical response of precipitation characteristics to global warming from CMIP5 models, Geophys. Res. Lett., 40, 3163–3169, <ext-link xlink:href="https://doi.org/10.1002/grl.50420" ext-link-type="DOI">10.1002/grl.50420</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib84"><label>84</label><?label 1?><mixed-citation>Leblanc, M. and Tibshirani, R.: Combining Estimates in Regression and Classification, J. Am. Stat. A., 91, 1641–1650, <ext-link xlink:href="https://doi.org/10.1080/01621459.1996.10476733" ext-link-type="DOI">10.1080/01621459.1996.10476733</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bib85"><label>85</label><?label 1?><mixed-citation>Li, J., Pollinger, F., and Paeth, H.: Comparing the Lasso Predictor-Selection and Regression Method with Classical Approaches of Precipitation Bias Adjustment in Decadal Climate Predictions, Mon. Weather Rev., 148, 4339–4351, <ext-link xlink:href="https://doi.org/10.1175/MWR-D-19-0302.1" ext-link-type="DOI">10.1175/MWR-D-19-0302.1</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib86"><label>86</label><?label 1?><mixed-citation>Liu, J., Yuan, D., Zhang, L., Zou, X., and Song, X.: Comparison of Three Statistical Downscaling Methods and Ensemble Downscaling Method Based on Bayesian Model Averaging in Upper Hanjiang River Basin, China, Adv. Meteorol., 2016, e7463963, <ext-link xlink:href="https://doi.org/10.1155/2016/7463963" ext-link-type="DOI">10.1155/2016/7463963</ext-link>, 2015.</mixed-citation></ref>
      <?pagebreak page6512?><ref id="bib1.bib87"><label>87</label><?label 1?><mixed-citation>Lorenz, E. N.: Atmospheric Predictability as Revealed by Naturally Occurring Analogues, J. Atmos. Sci., 26, 636–646, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1969)26&lt;636:APARBN&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1969)26&lt;636:APARBN&gt;2.0.CO;2</ext-link>, 1969.</mixed-citation></ref>
      <ref id="bib1.bib88"><label>88</label><?label 1?><mixed-citation>Ludwig, R., Taschner, S., and Mauser, W.: Modelling floods in the Ammer catchment: limitations and challenges with a coupled meteo-hydrological model approach, Hydrol. Earth Syst. Sci., 7, 833–847, <ext-link xlink:href="https://doi.org/10.5194/hess-7-833-2003" ext-link-type="DOI">10.5194/hess-7-833-2003</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib89"><label>89</label><?label 1?><mixed-citation> MacKay, D. J.: Bayesian interpolation, Neural Comput., 4, 415–447, 1992.</mixed-citation></ref>
      <ref id="bib1.bib90"><label>90</label><?label 1?><mixed-citation>Maraun, D. and Widmann, M. (Eds.): Structure of Statistical Downscaling Methods, in: Statistical Downscaling and Bias Correction for Climate Research, Cambridge University Press, Cambridge, 135–140, <ext-link xlink:href="https://doi.org/10.1017/9781107588783.011" ext-link-type="DOI">10.1017/9781107588783.011</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib91"><label>91</label><?label 1?><mixed-citation>Maraun, D., Wetterhall, F., Ireson, A. M., Chandler, R. E., Kendon, E. J., Widmann, M., Brienen, S., Rust, H. W., Sauter, T., Themeßl, M., Venema, V. K. C., Chun, K. P., Goodess, C. M., Jones, R. G., Onof, C., Vrac, M., and Thiele-Eich, I.: Precipitation downscaling under climate change: Recent developments to bridge the gap between dynamical models and the end user, Rev. Geophys., 48, RG3003, <ext-link xlink:href="https://doi.org/10.1029/2009RG000314" ext-link-type="DOI">10.1029/2009RG000314</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib92"><label>92</label><?label 1?><mixed-citation>Maraun, D., Huth, R., Gutiérrez, J. M., Martín, D. S., Dubrovsky, M., Fischer, A., Hertig, E., Soares, P. M. M., Bartholy, J., Pongrácz, R., Widmann, M., Casado, M. J., Ramos, P., and Bedia, J.: The VALUE perfect predictor experiment: Evaluation of temporal variability, Int. J. Climatol., 39, 3786–3818, <ext-link xlink:href="https://doi.org/10.1002/joc.5222" ext-link-type="DOI">10.1002/joc.5222</ext-link>, 2019a.</mixed-citation></ref>
      <ref id="bib1.bib93"><label>93</label><?label 1?><mixed-citation>Maraun, D., Widmann, M., and Gutiérrez, J. M.: Statistical downscaling skill under present climate conditions: A synthesis of the VALUE perfect predictor experiment, Int. J. Climatol., 39, 3692–3703, <ext-link xlink:href="https://doi.org/10.1002/joc.5877" ext-link-type="DOI">10.1002/joc.5877</ext-link>, 2019b.</mixed-citation></ref>
      <ref id="bib1.bib94"><label>94</label><?label 1?><mixed-citation> Markatou, M., Tian, H., Biswas, S., and Hripcsak, G. M.: Analysis of variance of cross-validation estimators of the generalization error, J. Mach. Learn. Res.,  6, 1127–1168, 2005.</mixed-citation></ref>
      <ref id="bib1.bib95"><label>95</label><?label 1?><mixed-citation>Marzban, C., Sandgathe, S., and Kalnay, E.: MOS, Perfect Prog, and Reanalysis, Mon. Weather Rev., 134, 657–663, <ext-link xlink:href="https://doi.org/10.1175/MWR3088.1" ext-link-type="DOI">10.1175/MWR3088.1</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib96"><label>96</label><?label 1?><mixed-citation>Massaoudi, M., Refaat, S. S., Chihi, I., Trabelsi, M., Oueslati, F. S., and Abu-Rub, H.: A novel stacked generalization ensemble-based hybrid LGBM-XGB-MLP model for Short-Term Load Forecasting, Energy, 214, 118874, <ext-link xlink:href="https://doi.org/10.1016/j.energy.2020.118874" ext-link-type="DOI">10.1016/j.energy.2020.118874</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib97"><label>97</label><?label 1?><mixed-citation>Mearns, L. O., Rosenzweig, C., and Goldberg, R.: The effect of changes in daily and interannual climatic variability on CERES-Wheat: A sensitivity study, Clim. Change, 32, 257–292, <ext-link xlink:href="https://doi.org/10.1007/BF00142465" ext-link-type="DOI">10.1007/BF00142465</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bib98"><label>98</label><?label 1?><mixed-citation>Meinshausen, M., Smith, S. J., Calvin, K., Daniel, J. S., Kainuma, M. L. T., Lamarque, J.-F., Matsumoto, K., Montzka, S. A., Raper, S. C. B., Riahi, K., Thomson, A., Velders, G. J. M., and van Vuuren, D. P. P.: The RCP greenhouse gas concentrations and their extensions from 1765 to 2300, Clim. Change, 109, 213, <ext-link xlink:href="https://doi.org/10.1007/s10584-011-0156-z" ext-link-type="DOI">10.1007/s10584-011-0156-z</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib99"><label>99</label><?label 1?><mixed-citation>Miles, J.: R Squared, Adjusted R Squared, in: Wiley StatsRef: Statistics Reference Online, John Wiley &amp; Sons, Ltd, <ext-link xlink:href="https://doi.org/10.1002/9781118445112.stat06627" ext-link-type="DOI">10.1002/9781118445112.stat06627</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib100"><label>100</label><?label 1?><mixed-citation> Moore, A. W.: Cross-validation for detecting and preventing overfitting, School of Computer Science Carneigie Mellon University, 133, 2001.</mixed-citation></ref>
      <ref id="bib1.bib101"><label>101</label><?label 1?><mixed-citation>Moss, R. H., Edmonds, J. A., Hibbard, K. A., Manning, M. R., Rose, S. K., van Vuuren, D. P., Carter, T. R., Emori, S., Kainuma, M., Kram, T., Meehl, G. A., Mitchell, J. F. B., Nakicenovic, N., Riahi, K., Smith, S. J., Stouffer, R. J., Thomson, A. M., Weyant, J. P., and Wilbanks, T. J.: The next generation of scenarios for climate change research and assessment, Nature, 463, 747–756, <ext-link xlink:href="https://doi.org/10.1038/nature08823" ext-link-type="DOI">10.1038/nature08823</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib102"><label>102</label><?label 1?><mixed-citation>Muñoz Sabater, J.: ERA5-Land monthly averaged data from 1950 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS), <ext-link xlink:href="https://doi.org/10.24381/cds.68d2bb30" ext-link-type="DOI">10.24381/cds.68d2bb30</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib103"><label>103</label><?label 1?><mixed-citation>Murphy, J.: Predictions of climate change over Europe using statistical and dynamical downscaling techniques, Int. J. Climatol., 20, 489–501, <ext-link xlink:href="https://doi.org/10.1002/(SICI)1097-0088(200004)20:5&lt;489::AID-JOC484&gt;3.0.CO;2-6" ext-link-type="DOI">10.1002/(SICI)1097-0088(200004)20:5&lt;489::AID-JOC484&gt;3.0.CO;2-6</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bib104"><label>104</label><?label 1?><mixed-citation>Mutz, S., Paeth, H., and Winkler, S.: Modelling of future mass balance changes of Norwegian glaciers by application of a dynamical–statistical model, Clim. Dynam., 46, 1581–1597, <ext-link xlink:href="https://doi.org/10.1007/s00382-015-2663-5" ext-link-type="DOI">10.1007/s00382-015-2663-5</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib105"><label>105</label><?label 1?><mixed-citation>Mutz, S. G. and Aschauer, J.: Empirical glacier mass-balance models for South America, J. Glaciol., 68, 912–926, <ext-link xlink:href="https://doi.org/10.1017/jog.2022.6" ext-link-type="DOI">10.1017/jog.2022.6</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib106"><label>106</label><?label 1?><mixed-citation>Mutz, S. G., Scherrer, S., Muceniece, I., and Ehlers, T. A.: Twenty-first century regional temperature response in Chile based on empirical-statistical downscaling, Clim. Dynam., 56, 2881–2894, <ext-link xlink:href="https://doi.org/10.1007/s00382-020-05620-9" ext-link-type="DOI">10.1007/s00382-020-05620-9</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib107"><label>107</label><?label 1?><mixed-citation>Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models part I – A discussion of principles, J. Hydrol., 10, 282–290,  <ext-link xlink:href="https://doi.org/10.1016/0022-1694(70)90255-6" ext-link-type="DOI">10.1016/0022-1694(70)90255-6</ext-link>, 1970.</mixed-citation></ref>
      <ref id="bib1.bib108"><label>108</label><?label 1?><mixed-citation>Neal, R. M.: Bayesian learning for neural networks, Springer Science &amp; Business Media, <ext-link xlink:href="https://doi.org/10.1007/978-1-4612-0745-0" ext-link-type="DOI">10.1007/978-1-4612-0745-0</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib109"><label>109</label><?label 1?><mixed-citation>Nourani, V., Razzaghzadeh, Z., Baghanam, A. H., and Molajou, A.: ANN-based statistical downscaling of climatic parameters using decision tree predictor screening method, Theor. Appl. Climatol., 137, 1729–1746, <ext-link xlink:href="https://doi.org/10.1007/s00704-018-2686-z" ext-link-type="DOI">10.1007/s00704-018-2686-z</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib110"><label>110</label><?label 1?><mixed-citation>Onyutha, C.: A hydrological model skill score and revised R-squared, Hydrol. Res., 53, 51–64, <ext-link xlink:href="https://doi.org/10.2166/nh.2021.071" ext-link-type="DOI">10.2166/nh.2021.071</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib111"><label>111</label><?label 1?><mixed-citation> Pachauri, R. K., Allen, M. R., Barros, V. R., Broome, J., Cramer, W., Christ, R., Church, J. A., Clarke, L., Dahe, Q., Dasgupta, P., Dubash, N. K., Edenhofer, O., Elgizouli, I., Field, C. B., Forster, P., Friedlingstein, P., Fuglestvedt, J., Gomez-Echeverri, L., Hallegatte, S., Hegerl, G., Howden, M., Jiang, K., Jimenez Cisneroz, B., Kattsov, V., Lee, H., Mach, K. J., Marotzke, J., Mastrandrea, M. D., Meyer, L., Minx, J., Mulugetta, Y., O'Brien, K., Oppenheimer, M., Pereira, J. J., Pichs-Madruga, R., Plattner, G.-K., Pörtner, H.-O., Power, S. B., Preston, B., Ravindranath, N. H., Reisinger, A., Riahi, K., Rusticucci, M., Scholes, R., Seyboth, K., Sokona, Y., Stavins, R., Stocker, T. F., Tschakert, P., van Vuuren, D., and van Ypserle, J.-P.: Climate Change 2014: Synthesis Report. Contribution of Working Groups I, II and III to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Pachauri, R. K. and Meyer, L., IPCC, Geneva, Switzerland, 151 pp., 2014.</mixed-citation></ref>
      <ref id="bib1.bib112"><label>112</label><?label 1?><mixed-citation>Padulano, R., Rianna, G., Costabile, P., Costanzo, C., Del Giudice, G., and Mercogliano, P.: Propagation of variability in climate projections within urban flood modelling<?pagebreak page6513?>: A multi-purpose impact analysis, J. Hydrol., 602, 126756, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2021.126756" ext-link-type="DOI">10.1016/j.jhydrol.2021.126756</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib113"><label>113</label><?label 1?><mixed-citation>Pal, S. K. and Mitra, S.: Multilayer perceptron, fuzzy sets, classifiaction, IEEE Trans Neural Netw., 3, 683–397, <ext-link xlink:href="https://doi.org/10.1109/72.159058" ext-link-type="DOI">10.1109/72.159058</ext-link>, 1992.</mixed-citation></ref>
      <ref id="bib1.bib114"><label>114</label><?label 1?><mixed-citation>Pang, B., Yue, J., Zhao, G., and Xu, Z.: Statistical Downscaling of Temperature with the Random Forest Model, Adv. Meteorol., 2017, e7265178, <ext-link xlink:href="https://doi.org/10.1155/2017/7265178" ext-link-type="DOI">10.1155/2017/7265178</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib115"><label>115</label><?label 1?><mixed-citation>Paparrizos, S., Schindler, D., Potouridis, S., and Matzarakis, A.: Spatio-temporal analysis of present and future precipitation responses over South Germany, J. Water Clim. Change, 9, 490–499, <ext-link xlink:href="https://doi.org/10.2166/wcc.2017.009" ext-link-type="DOI">10.2166/wcc.2017.009</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib116"><label>116</label><?label 1?><mixed-citation> Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, É.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011.</mixed-citation></ref>
      <ref id="bib1.bib117"><label>117</label><?label 1?><mixed-citation>Polasky, A. D., Evans, J. L., and Fuentes, J. D.: CCdownscaling: A Python package for multivariable statistical climate model downscaling, Environ. Model. Softw., 165, 105712, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2023.105712" ext-link-type="DOI">10.1016/j.envsoft.2023.105712</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib118"><label>118</label><?label 1?><mixed-citation>Pontes, F. J., Amorim, G. F., Balestrassi, P. P., Paiva, A. P., and Ferreira, J. R.: Design of experiments and focused grid search for neural network parameter optimization, Neurocomputing, 186, 22–34, <ext-link xlink:href="https://doi.org/10.1016/j.neucom.2015.12.061" ext-link-type="DOI">10.1016/j.neucom.2015.12.061</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib119"><label>119</label><?label 1?><mixed-citation>Quesada-Chacón, D., Barfus, K., and Bernhofer, C.: Repeatable high-resolution statistical downscaling through deep learning, Geosci. Model Dev., 15, 7353–7370, <ext-link xlink:href="https://doi.org/10.5194/gmd-15-7353-2022" ext-link-type="DOI">10.5194/gmd-15-7353-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib120"><label>120</label><?label 1?><mixed-citation> Quinlan, J. R.: Bagging, boosting, and C4.S, in: Proceedings of the thirteenth national conference on Artificial intelligence – Volume 1, Portland, Oregon, 725–730, 1996.</mixed-citation></ref>
      <ref id="bib1.bib121"><label>121</label><?label 1?><mixed-citation>Raissi, M. and Karniadakis, G. E.: Hidden physics models: Machine learning of nonlinear partial differential equations, J. Comput. Phys., 357, 125–141, <ext-link xlink:href="https://doi.org/10.1016/j.jcp.2017.11.039" ext-link-type="DOI">10.1016/j.jcp.2017.11.039</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib122"><label>122</label><?label 1?><mixed-citation>Ramon, J., Lledó, L., Bretonnière, P.-A., Samsó, M., and Doblas-Reyes, F. J.: A perfect prognosis downscaling methodology for seasonal prediction of local-scale wind speeds, Environ. Res. Lett., 16, 054010, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/abe491" ext-link-type="DOI">10.1088/1748-9326/abe491</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib123"><label>123</label><?label 1?><mixed-citation>Reichert, B. K., Bengtsson, L., and Åkesson, O.: A statistical modeling approach for the simulation of local paleoclimatic proxy records using general circulation model output, J. Geophys. Res.-Atmos., 104, 19071–19083, <ext-link xlink:href="https://doi.org/10.1029/1999JD900264" ext-link-type="DOI">10.1029/1999JD900264</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bib124"><label>124</label><?label 1?><mixed-citation>Reid, S. and Grudic, G.: Regularized Linear Models in Stacked Generalization, in: Multiple Classifier Systems, Berlin, Heidelberg, 112–121, <ext-link xlink:href="https://doi.org/10.1007/978-3-642-02326-2_12" ext-link-type="DOI">10.1007/978-3-642-02326-2_12</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib125"><label>125</label><?label 1?><mixed-citation> Rumelhart, D. E., Hinton, G. E., and Williams, R. J.: Learning representations by back-propagating errors, Nature, 323, 533–536, 1986.</mixed-citation></ref>
      <ref id="bib1.bib126"><label>126</label><?label 1?><mixed-citation>Sachindra, D. A., Huang, F., Barton, A., and Perera, B. J. C.: Statistical downscaling of general circulation model outputs to precipitation – part 2: bias-correction and future projections, Int. J. Climatol., 34, 3282–3303, <ext-link xlink:href="https://doi.org/10.1002/joc.3915" ext-link-type="DOI">10.1002/joc.3915</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib127"><label>127</label><?label 1?><mixed-citation>Sachindra, D. A., Ahmed, K., Rashid, Md. M., Shahid, S., and Perera, B. J. C.: Statistical downscaling of precipitation using machine learning techniques, Atmos. Res., 212, 240–258, <ext-link xlink:href="https://doi.org/10.1016/j.atmosres.2018.05.022" ext-link-type="DOI">10.1016/j.atmosres.2018.05.022</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib128"><label>128</label><?label 1?><mixed-citation>Santos, M. S., Soares, J. P., Abreu, P. H., Araujo, H., and Santos, J.: Cross-Validation for Imbalanced Datasets: Avoiding Overoptimistic and Overfitting Approaches [Research Frontier], IEEE Comput. Intell. M., 13, 59–76, <ext-link xlink:href="https://doi.org/10.1109/MCI.2018.2866730" ext-link-type="DOI">10.1109/MCI.2018.2866730</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib129"><label>129</label><?label 1?><mixed-citation> Schapire, R. E.: A brief introduction to boosting, in: Ijcai, 1401–1406, 1999.</mixed-citation></ref>
      <ref id="bib1.bib130"><label>130</label><?label 1?><mixed-citation>Schapire, R. E.: The Boosting Approach to Machine Learning: An Overview, in: Nonlinear Estimation and Classification. Lecture Notes in Statistics, edited by: Denison, D. D., Hansen, M. H., Holmes, C. C., Mallick, B., and Yu, B., Springer, New York, NY, vol. 171, <ext-link xlink:href="https://doi.org/10.1007/978-0-387-21579-2_9" ext-link-type="DOI">10.1007/978-0-387-21579-2_9</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib131"><label>131</label><?label 1?><mixed-citation>Schapire, R. E. and Freund, Y.: Boosting: Foundations and algorithms, Kybernetes, 42, 164–166, <ext-link xlink:href="https://doi.org/10.1108/0368492131129554" ext-link-type="DOI">10.1108/0368492131129554</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib132"><label>132</label><?label 1?><mixed-citation>Schmidli, J., Goodess, C. M., Frei, C., Haylock, M. R., Hundecha, Y., Ribalaygua, J., and Schmith, T.: Statistical and dynamical downscaling of precipitation: An evaluation and comparison of scenarios for the European Alps, J. Geophys. Res., 112, D04105, <ext-link xlink:href="https://doi.org/10.1029/2005JD007026" ext-link-type="DOI">10.1029/2005JD007026</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib133"><label>133</label><?label 1?><mixed-citation>Selle, B., Rink, K., and Kolditz, O.: Recharge and discharge controls on groundwater travel times and flow paths to production wells for the Ammer catchment in southwestern Germany, Environ. Earth Sci., 69, 443–452, <ext-link xlink:href="https://doi.org/10.1007/s12665-013-2333-z" ext-link-type="DOI">10.1007/s12665-013-2333-z</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib134"><label>134</label><?label 1?><mixed-citation>Shahhosseini, M., Hu, G., and Archontoulis, S. V.: Forecasting Corn Yield With Machine Learning Ensembles, Front. Plant Sci., 11, 1120, <ext-link xlink:href="https://doi.org/10.3389/fpls.2020.01120" ext-link-type="DOI">10.3389/fpls.2020.01120</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib135"><label>135</label><?label 1?><mixed-citation>Shepherd, T. G.: Atmospheric circulation as a source of uncertainty in climate change projections, Nat. Geosci., 7, 703–708, <ext-link xlink:href="https://doi.org/10.1038/ngeo2253" ext-link-type="DOI">10.1038/ngeo2253</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib136"><label>136</label><?label 1?><mixed-citation> Snoek, J., Larochelle, H., and Adams, R. P.: Practical bayesian optimization of machine learning algorithms, Adv. Neur. In., 25,  2951–2959, 2012.</mixed-citation></ref>
      <ref id="bib1.bib137"><label>137</label><?label 1?><mixed-citation>Spuler, F. R., Wessel, J. B., Comyn-Platt, E., Varndell, J., and Cagnazzo, C.: ibicus: a new open-source Python package and comprehensive interface for statistical bias adjustment and evaluation in climate modelling (v1.0.1), EGUsphere [preprint], <ext-link xlink:href="https://doi.org/10.5194/egusphere-2023-1481" ext-link-type="DOI">10.5194/egusphere-2023-1481</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib138"><label>138</label><?label 1?><mixed-citation>Steppeler, J., Doms, G., Schättler, U., Bitzer, H. W., Gassmann, A., Damrath, U., and Gregoric, G.: Meso-gamma scale forecasts using the nonhydrostatic model LM, Meteorol. Atmos. Phys., 82, 75–96, <ext-link xlink:href="https://doi.org/10.1007/s00703-001-0592-9" ext-link-type="DOI">10.1007/s00703-001-0592-9</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib139"><label>139</label><?label 1?><mixed-citation>Stone, M.: Cross-Validatory Choice and Assessment of Statistical Predictions (With Discussion), J. Roy. Stat. Soc. B, 38, 102–102, <ext-link xlink:href="https://doi.org/10.1111/j.2517-6161.1976.tb01573.x" ext-link-type="DOI">10.1111/j.2517-6161.1976.tb01573.x</ext-link>, 1976.</mixed-citation></ref>
      <ref id="bib1.bib140"><label>140</label><?label 1?><mixed-citation> Storch, H. von and Zwiers, F. W.: Statistical Analysis in Climate Research, Cambridge University Press, 995 pp., 2002.</mixed-citation></ref>
      <ref id="bib1.bib141"><label>141</label><?label 1?><mixed-citation>Sunyer, M. A., Gregersen, I. B., Rosbjerg, D., Madsen, H., Luchner, J., and Arnbjerg-Nielsen, K.: Comparison of different statistical downscaling methods to estimate changes in hourly extreme precipitation using RCM projections from ENSEMBLES, Int. J. Climatol., 35, 2528–2539, <ext-link xlink:href="https://doi.org/10.1002/joc.4138" ext-link-type="DOI">10.1002/joc.4138</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib142"><label>142</label><?label 1?><mixed-citation>Székely, G. J., Rizzo, M. L., and Bakirov, N. K.: Measuring and testing dependence by correlation of distances, Ann. Stat., 35, 2769–2794, <ext-link xlink:href="https://doi.org/10.1214/009053607000000505" ext-link-type="DOI">10.1214/009053607000000505</ext-link>, 2007.</mixed-citation></ref>
      <?pagebreak page6514?><ref id="bib1.bib143"><label>143</label><?label 1?><mixed-citation>Tatli, H., Nüzhet Dalfes, H., and Sibel Menteş, Ş.: A statistical downscaling method for monthly total precipitation over Turkey, Int. J. Climatol., 24, 161–180, <ext-link xlink:href="https://doi.org/10.1002/joc.997" ext-link-type="DOI">10.1002/joc.997</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib144"><label>144</label><?label 1?><mixed-citation>Taylor, K. E., Stouffer, R. J., and Meehl, G. A.: An Overview of CMIP5 and the Experiment Design, B. Am. Meteorol. Soc., 93, 485–498, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-11-00094.1" ext-link-type="DOI">10.1175/BAMS-D-11-00094.1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib145"><label>145</label><?label 1?><mixed-citation>Tibshirani, R.: Regression Shrinkage and Selection Via the Lasso, J. Roy. Stat. Soc. B, 58, 267–288, <ext-link xlink:href="https://doi.org/10.1111/j.2517-6161.1996.tb02080.x" ext-link-type="DOI">10.1111/j.2517-6161.1996.tb02080.x</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bib146"><label>146</label><?label 1?><mixed-citation>Ting, K. M. and Witten, I. H.: Issues in Stacked Generalization, J. Artif. Intell. Res., 10, 271–289, <ext-link xlink:href="https://doi.org/10.1613/jair.594" ext-link-type="DOI">10.1613/jair.594</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bib147"><label>147</label><?label 1?><mixed-citation> Tipping, M. E.: Sparse Bayesian learning and the relevance vector machine, J. Mach. Learn. Res., 1, 211–244, 2001.</mixed-citation></ref>
      <ref id="bib1.bib148"><label>148</label><?label 1?><mixed-citation>Tripathi, S., Srinivas, V. V., and Nanjundiah, R. S.: Downscaling of precipitation for climate change scenarios: A support vector machine approach, J. Hydrol., 330, 621–640, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2006.04.030" ext-link-type="DOI">10.1016/j.jhydrol.2006.04.030</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib149"><label>149</label><?label 1?><mixed-citation>van der Linden, P. and Mitchell, J.: ENSEMBLES: Climate Change and its Impacts: Summary of research and results from the ENSEMBLES project – European Environment Agency (EEA), Tech. rep., Met Office Hadley Centre, FitzRoy Road, Exeter EX1 3PB, UK, <uri>http://www.eea.europa.eu/data-and-maps/indicators/global-and-european-temperature/ensembles-climate-change-and-its</uri> (last access: 30 October 2023), 2009.</mixed-citation></ref>
      <ref id="bib1.bib150"><label>150</label><?label 1?><mixed-citation>Vapnik, V.: The nature of statistical learning theory, Springer, New York, NY, <ext-link xlink:href="https://doi.org/10.1007/978-1-4757-3264-1" ext-link-type="DOI">10.1007/978-1-4757-3264-1</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bib151"><label>151</label><?label 1?><mixed-citation>Vrac, M., Marbaix, P., Paillard, D., and Naveau, P.: Non-linear statistical downscaling of present and LGM precipitation and temperatures over Europe, Clim. Past, 3, 669–682, <ext-link xlink:href="https://doi.org/10.5194/cp-3-669-2007" ext-link-type="DOI">10.5194/cp-3-669-2007</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib152"><label>152</label><?label 1?><mixed-citation>Vu, M. T., Aribarg, T., Supratid, S., Raghavan, S. V., and Liong, S.-Y.: Statistical downscaling rainfall using artificial neural network: significantly wetter Bangkok?, Theor. Appl. Climatol., 126, 453–467, <ext-link xlink:href="https://doi.org/10.1007/s00704-015-1580-1" ext-link-type="DOI">10.1007/s00704-015-1580-1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib153"><label>153</label><?label 1?><mixed-citation>Weaver, C. P., Lempert, R. J., Brown, C., Hall, J. A., Revell, D., and Sarewitz, D.: Improving the contribution of climate model information to decision making: the value and demands of robust decision frameworks, WIREs Clim. Change, 4, 39–60, <ext-link xlink:href="https://doi.org/10.1002/wcc.202" ext-link-type="DOI">10.1002/wcc.202</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib154"><label>154</label><?label 1?><mixed-citation>Wilby, R. L. and Dawson, C. W.: The Statistical DownScaling Model: insights from one decade of application, Int. J. Climatol., 33, 1707–1719, <ext-link xlink:href="https://doi.org/10.1002/joc.3544" ext-link-type="DOI">10.1002/joc.3544</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib155"><label>155</label><?label 1?><mixed-citation>Wilby, R. L. and Wigley, T. M. L.: Future changes in the distribution of daily precipitation totals across North America, Geophys. Res. Lett., 29, 39-1–39–4, <ext-link xlink:href="https://doi.org/10.1029/2001GL013048" ext-link-type="DOI">10.1029/2001GL013048</ext-link>, 2002. </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib156"><label>156</label><?label 1?><mixed-citation>Wilby, R. L., Wigley, T. M. L., Conway, D., Jones, P. D., Hewitson, B. C., Main, J., and Wilks, D. S.: Statistical downscaling of general circulation model output: A comparison of methods, Water Resour. Res., 34, 2995–3008, <ext-link xlink:href="https://doi.org/10.1029/98WR02577" ext-link-type="DOI">10.1029/98WR02577</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bib157"><label>157</label><?label 1?><mixed-citation>Wilby, R. L., Dawson, C. W., and Barrow, E. M.: sdsm – a decision support tool for the assessment of regional climate change impacts, Environ. Model. Softw., 17, 145–157, <ext-link xlink:href="https://doi.org/10.1016/S1364-8152(01)00060-3" ext-link-type="DOI">10.1016/S1364-8152(01)00060-3</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bib158"><label>158</label><?label 1?><mixed-citation> Wilby, R. L., Charles, S. P., Zorita, E., Timbal, B., Whetton, P., and Mearns, L. O.: Guidelines for use of climate scenarios developed from statistical downscaling methods, Supporting material of the Intergovernmental Panel on Climate Change, available from the DDC of IPCC TGCIA, 27, 2004.</mixed-citation></ref>
      <ref id="bib1.bib159"><label>159</label><?label 1?><mixed-citation> Wilks, D. S.: Statistical methods in the atmospheric sciences, Academic press, 698 pp., 2011.</mixed-citation></ref>
      <ref id="bib1.bib160"><label>160</label><?label 1?><mixed-citation> Wipf, D. and Nagarajan, S.: A new view of automatic relevance determination, Adv. Neur. In., 20, 2007.</mixed-citation></ref>
      <ref id="bib1.bib161"><label>161</label><?label 1?><mixed-citation>Wolpert, D. H.: Stacked generalization, Neural Networks, 5, 241–259, <ext-link xlink:href="https://doi.org/10.1016/S0893-6080(05)80023-1" ext-link-type="DOI">10.1016/S0893-6080(05)80023-1</ext-link>, 1992.</mixed-citation></ref>
      <ref id="bib1.bib162"><label>162</label><?label 1?><mixed-citation>Wu, T. T. and Lange, K.: Coordinate descent algorithms for lasso penalized regression, Ann. Appl. Stat., 2, 224–244, <ext-link xlink:href="https://doi.org/10.1214/07-AOAS147" ext-link-type="DOI">10.1214/07-AOAS147</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib163"><label>163</label><?label 1?><mixed-citation>Xu, R., Chen, N., Chen, Y., and Chen, Z.: Downscaling and Projection of Multi-CMIP5 Precipitation Using Machine Learning Methods in the Upper Han River Basin, Adv. Meteorol., 2020, e8680436, <ext-link xlink:href="https://doi.org/10.1155/2020/8680436" ext-link-type="DOI">10.1155/2020/8680436</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib164"><label>164</label><?label 1?><mixed-citation>Xu, Z., Han, Y., and Yang, Z.: Dynamical downscaling of regional climate: A review of methods and limitations, Sci. China Earth Sci., 62, 365–375, <ext-link xlink:href="https://doi.org/10.1007/s11430-018-9261-5" ext-link-type="DOI">10.1007/s11430-018-9261-5</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib165"><label>165</label><?label 1?><mixed-citation>Zhang, C. and Ma, Y. (Eds.): Ensemble Machine Learning: Methods and Applications, Springer, New York, NY, <ext-link xlink:href="https://doi.org/10.1007/978-1-4419-9326-7" ext-link-type="DOI">10.1007/978-1-4419-9326-7</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib166"><label>166</label><?label 1?><mixed-citation>Zhang, J., Liu, K., and Wang, M.: Downscaling Groundwater Storage Data in China to a 1-km Resolution Using Machine Learning Methods, Remote Sens., 13, 523, <ext-link xlink:href="https://doi.org/10.3390/rs13030523" ext-link-type="DOI">10.3390/rs13030523</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib167"><label>167</label><?label 1?><mixed-citation>Zhang, X. and Yan, X.: A new statistical precipitation downscaling method with Bayesian model averaging: a case study in China, Clim. Dynam., 45, 2541–2555, <ext-link xlink:href="https://doi.org/10.1007/s00382-015-2491-7" ext-link-type="DOI">10.1007/s00382-015-2491-7</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib168"><label>168</label><?label 1?><mixed-citation>Zhou, H., Zhang, J., Zhou, Y., Guo, X., and Ma, Y.: A feature selection algorithm of decision tree based on feature weight, Expert Syst. Appl., 164, 113842, <ext-link xlink:href="https://doi.org/10.1016/j.eswa.2020.113842" ext-link-type="DOI">10.1016/j.eswa.2020.113842</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib169"><label>169</label><?label 1?><mixed-citation>Zorita, E. and von Storch, H.: The Analog Method as a Simple Statistical Downscaling Technique: Comparison with More Complicated Methods, J. Climate, 12, 2474–2489, <ext-link xlink:href="https://doi.org/10.1175/1520-0442(1999)012&lt;2474:TAMAAS&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0442(1999)012&lt;2474:TAMAAS&gt;2.0.CO;2</ext-link>, 1999.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>pyESDv1.0.1: an open-source Python framework for empirical-statistical downscaling of climate information</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
      
Anandhi, A., Srinivas, V. V., Nanjundiah, R. S., and Nagesh Kumar, D.:
Downscaling precipitation to river basin in India for IPCC SRES scenarios
using support vector machine, Int. J. Climatol., 28,
401–420, <a href="https://doi.org/10.1002/joc.1529" target="_blank">https://doi.org/10.1002/joc.1529</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
      
Arlot, S. and Celisse, A.: A survey of cross-validation procedures for model
selection, Stat. Surv., 4, 40–79, <a href="https://doi.org/10.1214/09-SS054" target="_blank">https://doi.org/10.1214/09-SS054</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
      
Balasundaram, S. and Tanveer, M.: On Lagrangian twin support vector
regression, Neural Comput. Appl., 22, 257–267, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
      
Baño-Medina, J., Manzanas, R., and Gutiérrez, J. M.: Configuration and intercomparison of deep learning neural models for statistical downscaling, Geosci. Model Dev., 13, 2109–2124, <a href="https://doi.org/10.5194/gmd-13-2109-2020" target="_blank">https://doi.org/10.5194/gmd-13-2109-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
      
Bárdossy, A.: Atmospheric circulation pattern classification for
South-West Germany using hydrological variables, Phys. Chem.
Earth Parts A/B/C, 35, 498–506, <a href="https://doi.org/10.1016/j.pce.2010.02.007" target="_blank">https://doi.org/10.1016/j.pce.2010.02.007</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
      
Barnston, A. G. and Livezey, R. E.: Classification, Seasonality and
Persistence of Low-Frequency Atmospheric Circulation Patterns, Mon.
Weather Rev., 115, 1083–1126, <a href="https://doi.org/10.1175/1520-0493(1987)115&lt;1083:CSAPOL&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0493(1987)115&lt;1083:CSAPOL&gt;2.0.CO;2</a>, 1987.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
      
Bedia, J., Baño-Medina, J., Legasa, M. N., Iturbide, M., Manzanas, R., Herrera, S., Casanueva, A., San-Martín, D., Cofiño, A. S., and Gutiérrez, J. M.: Statistical downscaling with the downscaleR package (v3.1.0): contribution to the VALUE intercomparison experiment, Geosci. Model Dev., 13, 1711–1735, <a href="https://doi.org/10.5194/gmd-13-1711-2020" target="_blank">https://doi.org/10.5194/gmd-13-1711-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
      
Bell, B., Hersbach, H., Simmons, A., Berrisford, P., Dahlgren, P.,
Horányi, A., Muñoz-Sabater, J., Nicolas, J., Radu, R., Schepers, D.,
Soci, C., Villaume, S., Bidlot, J.-R., Haimberger, L., Woollen, J.,
Buontempo, C., and Thépaut, J.-N.: The ERA5 global reanalysis:
Preliminary extension to 1950, Q. J. Roy. Meteor.
Soc., 147, 4186–4227, <a href="https://doi.org/10.1002/qj.4174" target="_blank">https://doi.org/10.1002/qj.4174</a>,
2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
      
Benestad, R. E., Chen, D., Mezghani, A., Fan, L., and Parding, K.: On using
principal components to represent stations in empirical–statistical
downscaling, Tellus A, 67, 28326,
<a href="https://doi.org/10.3402/tellusa.v67.28326" target="_blank">https://doi.org/10.3402/tellusa.v67.28326</a>, 2015a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
      
Benestad, R. E., Mezghani, A., and Parding M. K.: “esd” – The
Empirical-Statistical Downscaling tool &amp; its visualisation
capabilities, Figshare, <a href="https://doi.org/10.6084/m9.figshare.1454425.v1" target="_blank">https://doi.org/10.6084/m9.figshare.1454425.v1</a>,
2015b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
      
Bergmeir, C. and Benítez, J. M.: On the use of cross-validation for
time series predictor evaluation, Inform. Sciences, 191, 192–213,
<a href="https://doi.org/10.1016/j.ins.2011.12.028" target="_blank">https://doi.org/10.1016/j.ins.2011.12.028</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
      
Bergstra, J. and Bengio, Y.: Random search for hyper-parameter
optimization, J. Mach. Learn. Res., 13, 281–305, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
      
Bergstra, J., Bardenet, R., Bengio, Y., and Kégl, B.: lgorithms for hyper-parameter optimization, in: Proceedings of the 24th International Conference on Neural Information Processing Systems, 2546–2554,  2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
      
Bishop, C. M. and Nasrabadi, N. M.: Pattern recognition and machine
learning, Springer, ISBN 978-1-4939-3843-8, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
      
Boateng, D.: Dan-Boat/PyESD: PyESDv1.0.1 (v1.0.1), Zenodo [code], <a href="https://doi.org/10.5281/zenodo.7767629" target="_blank">https://doi.org/10.5281/zenodo.7767629</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
      
Boateng, D. and Mutz, S. G.: Supporting material for PyESDv1.0.1 An open-source Python framework for empirical-statistical downscaling of climate information, Zenodo [data set], <a href="https://doi.org/10.5281/zenodo.7767681" target="_blank">https://doi.org/10.5281/zenodo.7767681</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
      
Boateng, D., Mutz, S. G., Ballian, A., Meijers, M. J. M., Methner, K., Botsyun, S., Mulch, A., and Ehlers, T. A.: The effects of diachronous surface uplift of the European Alps on regional climate and the oxygen isotopic composition of precipitation, Earth Syst. Dynam. Discuss. [preprint], <a href="https://doi.org/10.5194/esd-2022-48" target="_blank">https://doi.org/10.5194/esd-2022-48</a>, in review, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
      
Boé, J., Terray, L., Martin, E., and Habets, F.: Projected changes in
components of the hydrological cycle in French river basins during the 21st
century, Water Resour. Res., 45,  W08426,  <a href="https://doi.org/10.1029/2008WR007437" target="_blank">https://doi.org/10.1029/2008WR007437</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
      
Bottou, L.: Stochastic gradient learning in neural networks, Proc.
Neuro-Nımes, 91, <a href="https://leon.bottou.org/publications/pdf/nimes-1991.pdf" target="_blank"/> (last access 30 October 2023), 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
      
Bourgault, P., Huard, D., Smith, T. J., Logan, T., Aoun, A., Lavoie, J.,
Dupuis, É., Rondeau-Genesse, G., Alegre, R., Barnes, C., Laperrière,
A. B., Biner, S., Caron, D., Ehbrecht, C., Fyke, J., Keel, T., Labonté,
M.-P., Lierhammer, L., Low, J.-F., Quinn, J., Roy, P., Squire, D., Stephens,
A., Tanguy, M., and Whelan, C.: xclim: xarray-based climate data analytics,
J. Open Source Softw., 8, 5415, <a href="https://doi.org/10.21105/joss.05415" target="_blank">https://doi.org/10.21105/joss.05415</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
      
Brands, S., Gutiérrez, J. M., Herrera, S., and Cofiño, A. S.: On the
Use of Reanalysis Data for Downscaling, J. Climate, 25, 2517–2526,
<a href="https://doi.org/10.1175/JCLI-D-11-00251.1" target="_blank">https://doi.org/10.1175/JCLI-D-11-00251.1</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
      
Breiman, L.: Bagging predictors, Mach. Learn., 24, 123–140, 1996a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
      
Breiman, L.: Stacked regressions, Mach. Learn., 24, 49–64, <a href="https://doi.org/10.1007/BF00117832" target="_blank">https://doi.org/10.1007/BF00117832</a>, 1996b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
      
Breiman, L.: Random forests, Mach. Learn., 45, 5–32, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
      
Chaudhuri, A. and Hu, W.: A fast algorithm for computing distance
correlation, Comput. Stat. Data Anal., 135,
15–24, <a href="https://doi.org/10.1016/j.csda.2019.01.016" target="_blank">https://doi.org/10.1016/j.csda.2019.01.016</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
      
Chen, J., Brissette, F. P., and Leconte, R.: Coupling statistical and
dynamical methods for spatial downscaling of precipitation, Clim. Change,
114, 509–526, <a href="https://doi.org/10.1007/s10584-012-0452-2" target="_blank">https://doi.org/10.1007/s10584-012-0452-2</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
      
Chen, S.-T., Yu, P.-S., and Tang, Y.-H.: Statistical downscaling of daily
precipitation using support vector machines and multivariate analysis,
J. Hydrol., 385, 13–22, <a href="https://doi.org/10.1016/j.jhydrol.2010.01.021" target="_blank">https://doi.org/10.1016/j.jhydrol.2010.01.021</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
      
Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, in:
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge
Discovery and Data Mining, KDD '16: The 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, San Francisco California
USA, 785–794, <a href="https://doi.org/10.1145/2939672.2939785" target="_blank">https://doi.org/10.1145/2939672.2939785</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
      
Chen, X. and Jeong, J. C.: Enhanced recursive feature elimination, in: Sixth
International Conference on Machine Learning and Applications (ICMLA 2007),
Sixth International Conference on Machine Learning and Applications (ICMLA
2007), 429–435, <a href="https://doi.org/10.1109/ICMLA.2007.35" target="_blank">https://doi.org/10.1109/ICMLA.2007.35</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
      
Colette, A., Granier, C., Hodnebrog, Ø., Jakobs, H., Maurizi, A., Nyiri, A., Rao, S., Amann, M., Bessagnet, B., D'Angiola, A., Gauss, M., Heyes, C., Klimont, Z., Meleux, F., Memmesheimer, M., Mieville, A., Rouïl, L., Russo, F., Schucht, S., Simpson, D., Stordal, F., Tampieri, F., and Vrac, M.: Future air quality in Europe: a multi-model assessment of projected exposure to ozone, Atmos. Chem. Phys., 12, 10613–10630, <a href="https://doi.org/10.5194/acp-12-10613-2012" target="_blank">https://doi.org/10.5194/acp-12-10613-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
      
Collins, W. J., Bellouin, N., Doutriaux-Boucher, M., Gedney, N., Hinton, T.,
Jones, C. D., Liddicoat, S., Martin, G., O'Connor, F., and Rae, J.:
Evaluation of the HadGEM2 model, Met Office Exeter, UK, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
      
Copernicus Climate Change Service, Climate Data Store: CMIP5 monthly data on pressure levels, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], <a href="https://doi.org/10.24381/cds.3b4b5bc9" target="_blank">https://doi.org/10.24381/cds.3b4b5bc9</a>, 2018a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
      
Copernicus Climate Change Service, Climate Data Store: CMIP5 monthly data on single levels, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], <a href="https://doi.org/10.24381/cds.9d44a987" target="_blank">https://doi.org/10.24381/cds.9d44a987</a>, 2018b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
      
Cristianini, N. and Shawe-Taylor, J.: An introduction to support vector
machines and other kernel-based learning methods, Cambridge University
Press, <a href="https://doi.org/10.1017/CBO9780511801389" target="_blank">https://doi.org/10.1017/CBO9780511801389</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
      
Das, D., Dy, J., Ross, J., Obradovic, Z., and Ganguly, A. R.: Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling, Nonlin. Processes Geophys., 21, 1145–1157, <a href="https://doi.org/10.5194/npg-21-1145-2014" target="_blank">https://doi.org/10.5194/npg-21-1145-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
      
Dau, Q. V., Kuntiyawichai, K., and Adeloye, A. J.: Future Changes in Water
Availability Due to Climate Change Projections for Huong Basin, Vietnam,
Environ. Process., 8, 77–98, <a href="https://doi.org/10.1007/s40710-020-00475-y" target="_blank">https://doi.org/10.1007/s40710-020-00475-y</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
      
Deutscher Wetterdienst: Sub-catchment datasets, Deutscher Wetterdienst [data set], <a href="https://cdc.dwd.de/portal/shortlink/da6f555d-d6f6-426a-a8ba-b96683c76ea9" target="_blank"/>, (last access: 30 October 2023),  2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
      
Diaz, G. I., Fokoue-Nkoutche, A., Nannicini, G., and Samulowitz, H.: An
effective algorithm for hyperparameter optimization of neural networks, IBM
J. Res. Develop., 61, 9:1–9:11,  <a href="https://doi.org/10.1147/JRD.2017.2709578" target="_blank">https://doi.org/10.1147/JRD.2017.2709578</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
      
Dietterich, T. G.: Ensemble Methods in Machine Learning, in: Multiple
Classifier Systems, Berlin, Heidelberg, 1–15, <a href="https://doi.org/10.1007/3-540-45014-9_1" target="_blank">https://doi.org/10.1007/3-540-45014-9_1</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
      
Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R.: Least angle
regression, Ann. Stat., 32, 407–499, <a href="https://doi.org/10.1214/009053604000000067" target="_blank">https://doi.org/10.1214/009053604000000067</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
      
Errico, R. M., Stensrud, D. J., and Raeder, K. D.: Estimation of the error
distributions of precipitation produced by convective parametrization
schemes, Q. J. Roy. Meteor. Soc., 127,
2495–2512, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
      
Fan, J., Wu, L., Zheng, J., and Zhang, F.: Medium-range forecasting of daily
reference evapotranspiration across China using numerical weather prediction
outputs downscaled by extreme gradient boosting, J. Hydrol., 601,
126664, <a href="https://doi.org/10.1016/j.jhydrol.2021.126664" target="_blank">https://doi.org/10.1016/j.jhydrol.2021.126664</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
      
Fealy, R. and Sweeney, J.: Statistical downscaling of precipitation for a
selection of sites in Ireland employing a generalised linear modelling
approach, Int. J. Climatol., 27, 2083–2094, <a href="https://doi.org/10.1002/joc.1506" target="_blank">https://doi.org/10.1002/joc.1506</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
      
Feldmann, H., Schädler, G., Panitz, H.-J., and Kottmeier, C.: Near
future changes of extreme precipitation over complex terrain in Central
Europe derived from high resolution RCM ensemble simulations, Int. J. Climatol., 33, 1964–1977, <a href="https://doi.org/10.1002/joc.3564" target="_blank">https://doi.org/10.1002/joc.3564</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
      
Ferri, F. J., Pudil, P., Hatef, M., and Kittler, J.: Comparative study of
techniques for large-scale feature selection, in: Machine Intelligence and Pattern
Recognition, vol. 16, edited by: Gelsema, E. S. and Kanal, L. S.,
North-Holland, 403–413, <a href="https://doi.org/10.1016/B978-0-444-81892-8.50040-7" target="_blank">https://doi.org/10.1016/B978-0-444-81892-8.50040-7</a>, 1994.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
      
Field, C. B. and Barros, V. R.: Climate Change 2014 – Impacts, Adaptation
and Vulnerability: Regional Aspects, Cambridge University Press, 695 pp.,
2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
      
Freund, Y. and Schapire, R. E.: A Decision-Theoretic Generalization of
On-Line Learning and an Application to Boosting, J. Comput.
Syst. Sci., 55, 119–139, <a href="https://doi.org/10.1006/jcss.1997.1504" target="_blank">https://doi.org/10.1006/jcss.1997.1504</a>, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
      
Friedman, J. H.: Greedy function approximation: a gradient boosting machine,
Ann. Stat., 29, 1189–1232, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
      
Gardner, M. W. and Dorling, S. R.: Artificial neural networks (the
multilayer perceptron) – a review of applications in the atmospheric
sciences, Atmos. Environ., 32, 2627–2636, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
      
Gareth, J., Daniela, W., Trevor, H., and Robert, T.: An introduction to
statistical learning: with applications in R, Spinger, <a href="https://doi.org/10.1007/978-1-4614-7138-7" target="_blank">https://doi.org/10.1007/978-1-4614-7138-7</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
      
Geurts, P., Ernst, D., and Wehenkel, L.: Extremely randomized trees, Mach.
Learn., 63, 3–42, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
      
Ghosh, S. and Mujumdar, P. P.: Statistical downscaling of GCM simulations to
streamflow using relevance vector machine, Adv. Water Resour., 31,
132–146, <a href="https://doi.org/10.1016/j.advwatres.2007.07.005" target="_blank">https://doi.org/10.1016/j.advwatres.2007.07.005</a>,
2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
      
Giorgi, F. and Mearns, L. O.: Approaches to the simulation of regional
climate change: A review, Rev. Geophys., 29, 191–216, <a href="https://doi.org/10.1029/90RG02636" target="_blank">https://doi.org/10.1029/90RG02636</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
      
Gobiet, A., Kotlarski, S., Beniston, M., Heinrich, G., Rajczak, J., and
Stoffel, M.: 21st century climate change in the European Alps – A review,
Sci. Total Environ., 493, 1138–1151, <a href="https://doi.org/10.1016/j.scitotenv.2013.07.050" target="_blank">https://doi.org/10.1016/j.scitotenv.2013.07.050</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
      
Gu, J., Liu, S., Zhou, Z., Chalov, S. R., and Zhuang, Q.: A Stacking
Ensemble Learning Model for Monthly Rainfall Prediction in the Taihu Basin,
China, Water, 14, 492, <a href="https://doi.org/10.3390/w14030492" target="_blank">https://doi.org/10.3390/w14030492</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
      
Gutiérrez, J. M., San Martín, D., Cofiño, A. S., Herrera, S., Manzanas, R., and Frías, M. D.: User Guide of the ENSEMBLES Downscaling Portal (version 2), Tech. Rep. 2.2011, Santander Meteorology Group, CSIC-UC,  <a href="https://core.ac.uk/download/pdf/36151579.pdf" target="_blank"/> (last access: 30 October 2023), 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
      
Gutiérrez, J. M., San-Martín, D., Brands, S., Manzanas, R., and
Herrera, S.: Reassessing Statistical Downscaling Techniques for Their Robust
Application under Climate Change Conditions, J. Climate, 26,
171–188, <a href="https://doi.org/10.1175/JCLI-D-11-00687.1" target="_blank">https://doi.org/10.1175/JCLI-D-11-00687.1</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>58</label><mixed-citation>
      
Gutiérrez, J. M., Maraun, D., Widmann, M., Huth, R., Hertig, E.,
Benestad, R., Roessler, O., Wibig, J., Wilcke, R., Kotlarski, S., San
Martín, D., Herrera, S., Bedia, J., Casanueva, A., Manzanas, R.,
Iturbide, M., Vrac, M., Dubrovsky, M., Ribalaygua, J., Pórtoles, J.,
Räty, O., Räisänen, J., Hingray, B., Raynaud, D., Casado, M. J.,
Ramos, P., Zerenner, T., Turco, M., Bosshard, T., Štěpánek, P.,
Bartholy, J., Pongracz, R., Keller, D. E., Fischer, A. M., Cardoso, R. M.,
Soares, P. M. M., Czernecki, B., and Pagé, C.: An intercomparison of a
large ensemble of statistical downscaling methods over Europe: Results from
the VALUE perfect predictor cross-validation experiment, Int. J. Climatol., 39, 3750–3785, <a href="https://doi.org/10.1002/joc.5462" target="_blank">https://doi.org/10.1002/joc.5462</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>59</label><mixed-citation>
      
Hammami, D., Lee, T. S., Ouarda, T. B. M. J., and Lee, J.: Predictor
selection for downscaling GCM data with LASSO, J. Geophys.
Res.-Atmos., 117, D17116,  <a href="https://doi.org/10.1029/2012JD017864" target="_blank">https://doi.org/10.1029/2012JD017864</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>60</label><mixed-citation>
      
Hastie, T., Friedman, J., and Tibshirani, R.: The Elements of Statistical
Learning, Springer New York, New York, NY, <a href="https://doi.org/10.1007/978-0-387-21606-5" target="_blank">https://doi.org/10.1007/978-0-387-21606-5</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>61</label><mixed-citation>
      
He, X., Chaney, N. W., Schleiss, M., and Sheffield, J.: Spatial downscaling
of precipitation using adaptable random forests, Water Resour. Res.,
52, 8217–8237, <a href="https://doi.org/10.1002/2016WR019034" target="_blank">https://doi.org/10.1002/2016WR019034</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>62</label><mixed-citation>
      
Hecht-Nielsen, R.: Theory of the backpropagation neural network, in: Neural
networks for perception, Elsevier, 65–93, 1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>63</label><mixed-citation>
      
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A.,
Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D.,
Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P.,
Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D.,
Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer,
A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková,
M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay,
P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5
global reanalysis, Q. J. Roy. Meteor. Soc.,
146, 1999–2049, <a href="https://doi.org/10.1002/qj.3803" target="_blank">https://doi.org/10.1002/qj.3803</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>64</label><mixed-citation>
      
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 monthly averaged data on pressure levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], <a href="https://doi.org/10.24381/cds.6860a573" target="_blank">https://doi.org/10.24381/cds.6860a573</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>65</label><mixed-citation>
      
Hertig, E., Maraun, D., Bartholy, J., Pongracz, R., Vrac, M., Mares, I.,
Gutiérrez, J. M., Wibig, J., Casanueva, A., and Soares, P. M. M.:
Comparison of statistical downscaling methods with respect to extreme events
over Europe: Validation results from the perfect predictor experiment of the
COST Action VALUE, Int. J. Climatol., 39, 3846–3867,
<a href="https://doi.org/10.1002/joc.5469" target="_blank">https://doi.org/10.1002/joc.5469</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>66</label><mixed-citation>
      
Hewitson, B. C., Daron, J., Crane, R. G., Zermoglio, M. F., and Jack, C.:
Interrogating empirical-statistical downscaling, Clim. Change, 122,
539–554, <a href="https://doi.org/10.1007/s10584-013-1021-z" target="_blank">https://doi.org/10.1007/s10584-013-1021-z</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>67</label><mixed-citation>
      
Hodgkins, G. A., Whitfield, P. H., Burn, D. H., Hannaford, J., Renard, B.,
Stahl, K., Fleig, A. K., Madsen, H., Mediero, L., Korhonen, J., Murphy, C.,
and Wilson, D.: Climate-driven variability in the occurrence of major floods
across North America and Europe, J. Hydrol., 552, 704–717,
<a href="https://doi.org/10.1016/j.jhydrol.2017.07.027" target="_blank">https://doi.org/10.1016/j.jhydrol.2017.07.027</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>68</label><mixed-citation>
      
Hofmann, T., Schölkopf, B., and Smola, A. J.: Kernel methods in machine
learning, Ann. Stat., 36, 1171–1220, <a href="https://doi.org/10.1214/009053607000000677" target="_blank">https://doi.org/10.1214/009053607000000677</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>69</label><mixed-citation>
      
Hoyer, S. and Hamman, J.: xarray: N-D labeled Arrays and Datasets in Python,
J. Open Res. Softw., 5, 10, <a href="https://doi.org/10.5334/jors.148" target="_blank">https://doi.org/10.5334/jors.148</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>70</label><mixed-citation>
      
Hurrell, J. W.: Decadal Trends in the North Atlantic Oscillation: Regional
Temperatures and Precipitation, Science, 269, 676–679, <a href="https://doi.org/10.1126/science.269.5224.676" target="_blank">https://doi.org/10.1126/science.269.5224.676</a>, 1995.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>71</label><mixed-citation>
      
Hurrell, J. W. and Van Loon, H.: Decadal Variations in Climate Associated
with the North Atlantic Oscillation, in: Climatic Change at High Elevation
Sites, edited by: Diaz, H. F., Beniston, M., and Bradley, R. S., Springer
Netherlands, Dordrecht, 69–94, <a href="https://doi.org/10.1007/978-94-015-8905-5_4" target="_blank">https://doi.org/10.1007/978-94-015-8905-5_4</a>, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>72</label><mixed-citation>
      
Huth, R.: Statistical downscaling in central Europe: evaluation of methods
and potential predictors, Clim. Res., 13, 91–101, <a href="https://doi.org/10.3354/cr013091" target="_blank">https://doi.org/10.3354/cr013091</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>73</label><mixed-citation>
      
Huth, R.: Sensitivity of Local Daily Temperature Change Estimates to the
Selection of Downscaling Models and Predictors, J. Climate, 17,
640–652, <a href="https://doi.org/10.1175/1520-0442(2004)017&lt;0640:SOLDTC&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0442(2004)017&lt;0640:SOLDTC&gt;2.0.CO;2</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>74</label><mixed-citation>
      
IPCC: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on
Climate Change, edited by: Masson-Delmotte, V.,  Zhai, P.,  Pirani, A.,  Connors, S. L.,  Péan, C., Berger, S.,  Caud, N.,  Chen, Y.,  Goldfarb, L., Gomis, M. I.,  Huang, M.,  Leitzell, K.,  Lonnoy, E.,
Matthews, J. B. R.,  Maycock, T. K.,  Waterfield, T.,  Yelekçi, O.,  Yu, R., and  Zhou, B., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 2391 pp.,
<a href="https://doi.org/10.1017/9781009157896" target="_blank">https://doi.org/10.1017/9781009157896</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>75</label><mixed-citation>
      
Jakob Themeßl, M., Gobiet, A., and Leuprecht, A.: Empirical-statistical
downscaling and error correction of daily precipitation from regional
climate models, Int. J. Climatol., 31, 1530–1544,
<a href="https://doi.org/10.1002/joc.2168" target="_blank">https://doi.org/10.1002/joc.2168</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>76</label><mixed-citation>
      
Jia, S., Zhu, W., Lű, A., and Yan, T.: A statistical spatial downscaling
algorithm of TRMM precipitation based on NDVI and DEM in the Qaidam Basin of
China, Remote Sens. Environ., 115, 3069–3079, <a href="https://doi.org/10.1016/j.rse.2011.06.009" target="_blank">https://doi.org/10.1016/j.rse.2011.06.009</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>77</label><mixed-citation>
      
Jordan, M. I. and Mitchell, T. M.: Machine learning: Trends, perspectives,
and prospects, Science, 349, 255–260, <a href="https://doi.org/10.1126/science.aaa8415" target="_blank">https://doi.org/10.1126/science.aaa8415</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>78</label><mixed-citation>
      
Kay, J. E., Deser, C., Phillips, A., Mai, A., Hannay, C., Strand, G.,
Arblaster, J. M., Bates, S. C., Danabasoglu, G., Edwards, J., Holland, M.,
Kushner, P., Lamarque, J.-F., Lawrence, D., Lindsay, K., Middleton, A.,
Munoz, E., Neale, R., Oleson, K., Polvani, L., and Vertenstein, M.: The
Community Earth System Model (CESM) Large Ensemble Project: A Community
Resource for Studying Climate Change in the Presence of Internal Climate
Variability, B. Am. Meteorol. Soc., 96,
1333–1349, <a href="https://doi.org/10.1175/BAMS-D-13-00255.1" target="_blank">https://doi.org/10.1175/BAMS-D-13-00255.1</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>79</label><mixed-citation>
      
Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv
[preprint], arXiv:1412.6980, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>80</label><mixed-citation>
      
Kunstmann, H., Schneider, K., Forkel, R., and Knoche, R.: Impact analysis of climate change for an Alpine catchment using high resolution dynamic downscaling of ECHAM4 time slices, Hydrol. Earth Syst. Sci., 8, 1031–1045, <a href="https://doi.org/10.5194/hess-8-1031-2004" target="_blank">https://doi.org/10.5194/hess-8-1031-2004</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>81</label><mixed-citation>
      
Lachenbruch, P. A. and Mickey, M. R.: Estimation of Error Rates in
Discriminant Analysis, Technometrics, 10, 1–11, <a href="https://doi.org/10.1080/00401706.1968.10490530" target="_blank">https://doi.org/10.1080/00401706.1968.10490530</a>, 1968.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>82</label><mixed-citation>
      
Laflamme, E. M., Linder, E., and Pan, Y.: Statistical downscaling of
regional climate model output to achieve projections of precipitation
extremes, Weather Climate Extremes, 12, 15–23, <a href="https://doi.org/10.1016/j.wace.2015.12.001" target="_blank">https://doi.org/10.1016/j.wace.2015.12.001</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>83</label><mixed-citation>
      
Lau, W. K.-M., Wu, H.-T., and Kim, K.-M.: A canonical response of
precipitation characteristics to global warming from CMIP5 models,
Geophys. Res. Lett., 40, 3163–3169, <a href="https://doi.org/10.1002/grl.50420" target="_blank">https://doi.org/10.1002/grl.50420</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>84</label><mixed-citation>
      
Leblanc, M. and Tibshirani, R.: Combining Estimates in Regression and
Classification, J. Am. Stat. A., 91,
1641–1650, <a href="https://doi.org/10.1080/01621459.1996.10476733" target="_blank">https://doi.org/10.1080/01621459.1996.10476733</a>,
1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib85"><label>85</label><mixed-citation>
      
Li, J., Pollinger, F., and Paeth, H.: Comparing the Lasso
Predictor-Selection and Regression Method with Classical Approaches of
Precipitation Bias Adjustment in Decadal Climate Predictions, Mon.
Weather Rev., 148, 4339–4351, <a href="https://doi.org/10.1175/MWR-D-19-0302.1" target="_blank">https://doi.org/10.1175/MWR-D-19-0302.1</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib86"><label>86</label><mixed-citation>
      
Liu, J., Yuan, D., Zhang, L., Zou, X., and Song, X.: Comparison of Three
Statistical Downscaling Methods and Ensemble Downscaling Method Based on
Bayesian Model Averaging in Upper Hanjiang River Basin, China, Adv. Meteorol., 2016, e7463963, <a href="https://doi.org/10.1155/2016/7463963" target="_blank">https://doi.org/10.1155/2016/7463963</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib87"><label>87</label><mixed-citation>
      
Lorenz, E. N.: Atmospheric Predictability as Revealed by Naturally Occurring
Analogues, J. Atmos. Sci., 26, 636–646, <a href="https://doi.org/10.1175/1520-0469(1969)26&lt;636:APARBN&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1969)26&lt;636:APARBN&gt;2.0.CO;2</a>, 1969.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib88"><label>88</label><mixed-citation>
      
Ludwig, R., Taschner, S., and Mauser, W.: Modelling floods in the Ammer catchment: limitations and challenges with a coupled meteo-hydrological model approach, Hydrol. Earth Syst. Sci., 7, 833–847, <a href="https://doi.org/10.5194/hess-7-833-2003" target="_blank">https://doi.org/10.5194/hess-7-833-2003</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib89"><label>89</label><mixed-citation>
      
MacKay, D. J.: Bayesian interpolation, Neural Comput., 4, 415–447,
1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib90"><label>90</label><mixed-citation>
      
Maraun, D. and Widmann, M. (Eds.): Structure of Statistical Downscaling
Methods, in: Statistical Downscaling and Bias Correction for Climate
Research, Cambridge University Press, Cambridge, 135–140, <a href="https://doi.org/10.1017/9781107588783.011" target="_blank">https://doi.org/10.1017/9781107588783.011</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib91"><label>91</label><mixed-citation>
      
Maraun, D., Wetterhall, F., Ireson, A. M., Chandler, R. E., Kendon, E. J.,
Widmann, M., Brienen, S., Rust, H. W., Sauter, T., Themeßl, M., Venema,
V. K. C., Chun, K. P., Goodess, C. M., Jones, R. G., Onof, C., Vrac, M., and
Thiele-Eich, I.: Precipitation downscaling under climate change: Recent
developments to bridge the gap between dynamical models and the end user,
Rev. Geophys., 48, RG3003, <a href="https://doi.org/10.1029/2009RG000314" target="_blank">https://doi.org/10.1029/2009RG000314</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib92"><label>92</label><mixed-citation>
      
Maraun, D., Huth, R., Gutiérrez, J. M., Martín, D. S., Dubrovsky,
M., Fischer, A., Hertig, E., Soares, P. M. M., Bartholy, J., Pongrácz,
R., Widmann, M., Casado, M. J., Ramos, P., and Bedia, J.: The VALUE perfect
predictor experiment: Evaluation of temporal variability, Int. J. Climatol., 39, 3786–3818, <a href="https://doi.org/10.1002/joc.5222" target="_blank">https://doi.org/10.1002/joc.5222</a>, 2019a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib93"><label>93</label><mixed-citation>
      
Maraun, D., Widmann, M., and Gutiérrez, J. M.: Statistical downscaling
skill under present climate conditions: A synthesis of the VALUE perfect
predictor experiment, Int. J. Climatol., 39, 3692–3703,
<a href="https://doi.org/10.1002/joc.5877" target="_blank">https://doi.org/10.1002/joc.5877</a>, 2019b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib94"><label>94</label><mixed-citation>
      
Markatou, M., Tian, H., Biswas, S., and Hripcsak, G. M.: Analysis of
variance of cross-validation estimators of the generalization error, J. Mach. Learn. Res.,  6, 1127–1168, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib95"><label>95</label><mixed-citation>
      
Marzban, C., Sandgathe, S., and Kalnay, E.: MOS, Perfect Prog, and
Reanalysis, Mon. Weather Rev., 134, 657–663, <a href="https://doi.org/10.1175/MWR3088.1" target="_blank">https://doi.org/10.1175/MWR3088.1</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib96"><label>96</label><mixed-citation>
      
Massaoudi, M., Refaat, S. S., Chihi, I., Trabelsi, M., Oueslati, F. S., and
Abu-Rub, H.: A novel stacked generalization ensemble-based hybrid
LGBM-XGB-MLP model for Short-Term Load Forecasting, Energy, 214, 118874,
<a href="https://doi.org/10.1016/j.energy.2020.118874" target="_blank">https://doi.org/10.1016/j.energy.2020.118874</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib97"><label>97</label><mixed-citation>
      
Mearns, L. O., Rosenzweig, C., and Goldberg, R.: The effect of changes in
daily and interannual climatic variability on CERES-Wheat: A sensitivity
study, Clim. Change, 32, 257–292, <a href="https://doi.org/10.1007/BF00142465" target="_blank">https://doi.org/10.1007/BF00142465</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib98"><label>98</label><mixed-citation>
      
Meinshausen, M., Smith, S. J., Calvin, K., Daniel, J. S., Kainuma, M. L. T.,
Lamarque, J.-F., Matsumoto, K., Montzka, S. A., Raper, S. C. B., Riahi, K.,
Thomson, A., Velders, G. J. M., and van Vuuren, D. P. P.: The RCP greenhouse
gas concentrations and their extensions from 1765 to 2300, Clim. Change,
109, 213, <a href="https://doi.org/10.1007/s10584-011-0156-z" target="_blank">https://doi.org/10.1007/s10584-011-0156-z</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib99"><label>99</label><mixed-citation>
      
Miles, J.: R Squared, Adjusted R Squared, in: Wiley StatsRef: Statistics
Reference Online, John Wiley &amp; Sons, Ltd, <a href="https://doi.org/10.1002/9781118445112.stat06627" target="_blank">https://doi.org/10.1002/9781118445112.stat06627</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib100"><label>100</label><mixed-citation>
      
Moore, A. W.: Cross-validation for detecting and preventing overfitting,
School of Computer Science Carneigie Mellon University, 133, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib101"><label>101</label><mixed-citation>
      
Moss, R. H., Edmonds, J. A., Hibbard, K. A., Manning, M. R., Rose, S. K.,
van Vuuren, D. P., Carter, T. R., Emori, S., Kainuma, M., Kram, T., Meehl,
G. A., Mitchell, J. F. B., Nakicenovic, N., Riahi, K., Smith, S. J.,
Stouffer, R. J., Thomson, A. M., Weyant, J. P., and Wilbanks, T. J.: The
next generation of scenarios for climate change research and assessment,
Nature, 463, 747–756, <a href="https://doi.org/10.1038/nature08823" target="_blank">https://doi.org/10.1038/nature08823</a>,
2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib102"><label>102</label><mixed-citation>
      
Muñoz Sabater, J.: ERA5-Land monthly averaged data from 1950 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS), <a href="https://doi.org/10.24381/cds.68d2bb30" target="_blank">https://doi.org/10.24381/cds.68d2bb30</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib103"><label>103</label><mixed-citation>
      
Murphy, J.: Predictions of climate change over Europe using statistical and
dynamical downscaling techniques, Int. J. Climatol., 20, 489–501,
<a href="https://doi.org/10.1002/(SICI)1097-0088(200004)20:5&lt;489::AID-JOC484&gt;3.0.CO;2-6" target="_blank">https://doi.org/10.1002/(SICI)1097-0088(200004)20:5&lt;489::AID-JOC484&gt;3.0.CO;2-6</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib104"><label>104</label><mixed-citation>
      
Mutz, S., Paeth, H., and Winkler, S.: Modelling of future mass balance
changes of Norwegian glaciers by application of a dynamical–statistical
model, Clim. Dynam., 46, 1581–1597, <a href="https://doi.org/10.1007/s00382-015-2663-5" target="_blank">https://doi.org/10.1007/s00382-015-2663-5</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib105"><label>105</label><mixed-citation>
      
Mutz, S. G. and Aschauer, J.: Empirical glacier mass-balance models for
South America, J. Glaciol., 68, 912–926, <a href="https://doi.org/10.1017/jog.2022.6" target="_blank">https://doi.org/10.1017/jog.2022.6</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib106"><label>106</label><mixed-citation>
      
Mutz, S. G., Scherrer, S., Muceniece, I., and Ehlers, T. A.: Twenty-first
century regional temperature response in Chile based on
empirical-statistical downscaling, Clim. Dynam., 56, 2881–2894, <a href="https://doi.org/10.1007/s00382-020-05620-9" target="_blank">https://doi.org/10.1007/s00382-020-05620-9</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib107"><label>107</label><mixed-citation>
      
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, J. Hydrol., 10,
282–290,  <a href="https://doi.org/10.1016/0022-1694(70)90255-6" target="_blank">https://doi.org/10.1016/0022-1694(70)90255-6</a>, 1970.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib108"><label>108</label><mixed-citation>
      
Neal, R. M.: Bayesian learning for neural networks, Springer Science &amp;
Business Media, <a href="https://doi.org/10.1007/978-1-4612-0745-0" target="_blank">https://doi.org/10.1007/978-1-4612-0745-0</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib109"><label>109</label><mixed-citation>
      
Nourani, V., Razzaghzadeh, Z., Baghanam, A. H., and Molajou, A.: ANN-based
statistical downscaling of climatic parameters using decision tree predictor
screening method, Theor. Appl. Climatol., 137, 1729–1746, <a href="https://doi.org/10.1007/s00704-018-2686-z" target="_blank">https://doi.org/10.1007/s00704-018-2686-z</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib110"><label>110</label><mixed-citation>
      
Onyutha, C.: A hydrological model skill score and revised R-squared,
Hydrol. Res., 53, 51–64, <a href="https://doi.org/10.2166/nh.2021.071" target="_blank">https://doi.org/10.2166/nh.2021.071</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib111"><label>111</label><mixed-citation>
      
Pachauri, R. K., Allen, M. R., Barros, V. R., Broome, J., Cramer, W.,
Christ, R., Church, J. A., Clarke, L., Dahe, Q., Dasgupta, P., Dubash, N.
K., Edenhofer, O., Elgizouli, I., Field, C. B., Forster, P., Friedlingstein,
P., Fuglestvedt, J., Gomez-Echeverri, L., Hallegatte, S., Hegerl, G.,
Howden, M., Jiang, K., Jimenez Cisneroz, B., Kattsov, V., Lee, H., Mach, K.
J., Marotzke, J., Mastrandrea, M. D., Meyer, L., Minx, J., Mulugetta, Y.,
O'Brien, K., Oppenheimer, M., Pereira, J. J., Pichs-Madruga, R., Plattner,
G.-K., Pörtner, H.-O., Power, S. B., Preston, B., Ravindranath, N. H.,
Reisinger, A., Riahi, K., Rusticucci, M., Scholes, R., Seyboth, K., Sokona,
Y., Stavins, R., Stocker, T. F., Tschakert, P., van Vuuren, D., and van
Ypserle, J.-P.: Climate Change 2014: Synthesis Report. Contribution of
Working Groups I, II and III to the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change, edited by: Pachauri, R. K. and
Meyer, L., IPCC, Geneva, Switzerland, 151 pp., 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib112"><label>112</label><mixed-citation>
      
Padulano, R., Rianna, G., Costabile, P., Costanzo, C., Del Giudice, G., and
Mercogliano, P.: Propagation of variability in climate projections within
urban flood modelling: A multi-purpose impact analysis, J. Hydrol., 602, 126756, <a href="https://doi.org/10.1016/j.jhydrol.2021.126756" target="_blank">https://doi.org/10.1016/j.jhydrol.2021.126756</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib113"><label>113</label><mixed-citation>
      
Pal, S. K. and Mitra, S.: Multilayer perceptron, fuzzy sets, classifiaction, IEEE Trans Neural Netw., 3, 683–397, <a href="https://doi.org/10.1109/72.159058" target="_blank">https://doi.org/10.1109/72.159058</a>,
1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib114"><label>114</label><mixed-citation>
      
Pang, B., Yue, J., Zhao, G., and Xu, Z.: Statistical Downscaling of
Temperature with the Random Forest Model, Adv. Meteorol., 2017,
e7265178, <a href="https://doi.org/10.1155/2017/7265178" target="_blank">https://doi.org/10.1155/2017/7265178</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib115"><label>115</label><mixed-citation>
      
Paparrizos, S., Schindler, D., Potouridis, S., and Matzarakis, A.:
Spatio-temporal analysis of present and future precipitation responses over
South Germany, J. Water Clim. Change, 9, 490–499, <a href="https://doi.org/10.2166/wcc.2017.009" target="_blank">https://doi.org/10.2166/wcc.2017.009</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib116"><label>116</label><mixed-citation>
      
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, É.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12,
2825–2830, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib117"><label>117</label><mixed-citation>
      
Polasky, A. D., Evans, J. L., and Fuentes, J. D.: CCdownscaling: A Python
package for multivariable statistical climate model downscaling,
Environ. Model. Softw., 165, 105712, <a href="https://doi.org/10.1016/j.envsoft.2023.105712" target="_blank">https://doi.org/10.1016/j.envsoft.2023.105712</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib118"><label>118</label><mixed-citation>
      
Pontes, F. J., Amorim, G. F., Balestrassi, P. P., Paiva, A. P., and
Ferreira, J. R.: Design of experiments and focused grid search for neural
network parameter optimization, Neurocomputing, 186, 22–34, <a href="https://doi.org/10.1016/j.neucom.2015.12.061" target="_blank">https://doi.org/10.1016/j.neucom.2015.12.061</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib119"><label>119</label><mixed-citation>
      
Quesada-Chacón, D., Barfus, K., and Bernhofer, C.: Repeatable high-resolution statistical downscaling through deep learning, Geosci. Model Dev., 15, 7353–7370, <a href="https://doi.org/10.5194/gmd-15-7353-2022" target="_blank">https://doi.org/10.5194/gmd-15-7353-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib120"><label>120</label><mixed-citation>
      
Quinlan, J. R.: Bagging, boosting, and C4.S, in: Proceedings of the
thirteenth national conference on Artificial intelligence – Volume 1,
Portland, Oregon, 725–730, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib121"><label>121</label><mixed-citation>
      
Raissi, M. and Karniadakis, G. E.: Hidden physics models: Machine learning
of nonlinear partial differential equations, J. Comput.
Phys., 357, 125–141, <a href="https://doi.org/10.1016/j.jcp.2017.11.039" target="_blank">https://doi.org/10.1016/j.jcp.2017.11.039</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib122"><label>122</label><mixed-citation>
      
Ramon, J., Lledó, L., Bretonnière, P.-A., Samsó, M., and
Doblas-Reyes, F. J.: A perfect prognosis downscaling methodology for
seasonal prediction of local-scale wind speeds, Environ. Res. Lett., 16,
054010, <a href="https://doi.org/10.1088/1748-9326/abe491" target="_blank">https://doi.org/10.1088/1748-9326/abe491</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib123"><label>123</label><mixed-citation>
      
Reichert, B. K., Bengtsson, L., and Åkesson, O.: A statistical modeling
approach for the simulation of local paleoclimatic proxy records using
general circulation model output, J. Geophys. Res.-Atmos., 104, 19071–19083, <a href="https://doi.org/10.1029/1999JD900264" target="_blank">https://doi.org/10.1029/1999JD900264</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib124"><label>124</label><mixed-citation>
      
Reid, S. and Grudic, G.: Regularized Linear Models in Stacked
Generalization, in: Multiple Classifier Systems, Berlin, Heidelberg,
112–121, <a href="https://doi.org/10.1007/978-3-642-02326-2_12" target="_blank">https://doi.org/10.1007/978-3-642-02326-2_12</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib125"><label>125</label><mixed-citation>
      
Rumelhart, D. E., Hinton, G. E., and Williams, R. J.: Learning
representations by back-propagating errors, Nature, 323, 533–536, 1986.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib126"><label>126</label><mixed-citation>
      
Sachindra, D. A., Huang, F., Barton, A., and Perera, B. J. C.: Statistical
downscaling of general circulation model outputs to precipitation – part 2:
bias-correction and future projections, Int. J. Climatol., 34, 3282–3303, <a href="https://doi.org/10.1002/joc.3915" target="_blank">https://doi.org/10.1002/joc.3915</a>,
2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib127"><label>127</label><mixed-citation>
      
Sachindra, D. A., Ahmed, K., Rashid, Md. M., Shahid, S., and Perera, B. J.
C.: Statistical downscaling of precipitation using machine learning
techniques, Atmos. Res., 212, 240–258, <a href="https://doi.org/10.1016/j.atmosres.2018.05.022" target="_blank">https://doi.org/10.1016/j.atmosres.2018.05.022</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib128"><label>128</label><mixed-citation>
      
Santos, M. S., Soares, J. P., Abreu, P. H., Araujo, H., and Santos, J.:
Cross-Validation for Imbalanced Datasets: Avoiding Overoptimistic and
Overfitting Approaches [Research Frontier], IEEE Comput. Intell.
M., 13, 59–76, <a href="https://doi.org/10.1109/MCI.2018.2866730" target="_blank">https://doi.org/10.1109/MCI.2018.2866730</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib129"><label>129</label><mixed-citation>
      
Schapire, R. E.: A brief introduction to boosting, in: Ijcai, 1401–1406,
1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib130"><label>130</label><mixed-citation>
      
Schapire, R. E.: The Boosting Approach to Machine Learning: An Overview, in: Nonlinear Estimation and Classification. Lecture Notes in Statistics, edited by: Denison, D. D., Hansen, M. H., Holmes, C. C., Mallick, B., and Yu, B., Springer, New York, NY, vol. 171, <a href="https://doi.org/10.1007/978-0-387-21579-2_9" target="_blank">https://doi.org/10.1007/978-0-387-21579-2_9</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib131"><label>131</label><mixed-citation>
      
Schapire, R. E. and Freund, Y.: Boosting: Foundations and algorithms,
Kybernetes, 42, 164–166, <a href="https://doi.org/10.1108/0368492131129554" target="_blank">https://doi.org/10.1108/0368492131129554</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib132"><label>132</label><mixed-citation>
      
Schmidli, J., Goodess, C. M., Frei, C., Haylock, M. R., Hundecha, Y.,
Ribalaygua, J., and Schmith, T.: Statistical and dynamical downscaling of
precipitation: An evaluation and comparison of scenarios for the European
Alps, J. Geophys. Res., 112, D04105, <a href="https://doi.org/10.1029/2005JD007026" target="_blank">https://doi.org/10.1029/2005JD007026</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib133"><label>133</label><mixed-citation>
      
Selle, B., Rink, K., and Kolditz, O.: Recharge and discharge controls on
groundwater travel times and flow paths to production wells for the Ammer
catchment in southwestern Germany, Environ. Earth Sci., 69, 443–452,
<a href="https://doi.org/10.1007/s12665-013-2333-z" target="_blank">https://doi.org/10.1007/s12665-013-2333-z</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib134"><label>134</label><mixed-citation>
      
Shahhosseini, M., Hu, G., and Archontoulis, S. V.: Forecasting Corn Yield
With Machine Learning Ensembles, Front. Plant Sci., 11, 1120, <a href="https://doi.org/10.3389/fpls.2020.01120" target="_blank">https://doi.org/10.3389/fpls.2020.01120</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib135"><label>135</label><mixed-citation>
      
Shepherd, T. G.: Atmospheric circulation as a source of uncertainty in
climate change projections, Nat. Geosci., 7, 703–708, <a href="https://doi.org/10.1038/ngeo2253" target="_blank">https://doi.org/10.1038/ngeo2253</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib136"><label>136</label><mixed-citation>
      
Snoek, J., Larochelle, H., and Adams, R. P.: Practical bayesian optimization
of machine learning algorithms, Adv. Neur. In., 25,  2951–2959, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib137"><label>137</label><mixed-citation>
      
Spuler, F. R., Wessel, J. B., Comyn-Platt, E., Varndell, J., and Cagnazzo, C.: ibicus: a new open-source Python package and comprehensive interface for statistical bias adjustment and evaluation in climate modelling (v1.0.1), EGUsphere [preprint], <a href="https://doi.org/10.5194/egusphere-2023-1481" target="_blank">https://doi.org/10.5194/egusphere-2023-1481</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib138"><label>138</label><mixed-citation>
      
Steppeler, J., Doms, G., Schättler, U., Bitzer, H. W., Gassmann, A.,
Damrath, U., and Gregoric, G.: Meso-gamma scale forecasts using the
nonhydrostatic model LM, Meteorol. Atmos. Phys., 82, 75–96, <a href="https://doi.org/10.1007/s00703-001-0592-9" target="_blank">https://doi.org/10.1007/s00703-001-0592-9</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib139"><label>139</label><mixed-citation>
      
Stone, M.: Cross-Validatory Choice and Assessment of Statistical Predictions
(With Discussion), J. Roy. Stat. Soc. B, 38, 102–102, <a href="https://doi.org/10.1111/j.2517-6161.1976.tb01573.x" target="_blank">https://doi.org/10.1111/j.2517-6161.1976.tb01573.x</a>, 1976.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib140"><label>140</label><mixed-citation>
      
Storch, H. von and Zwiers, F. W.: Statistical Analysis in Climate Research,
Cambridge University Press, 995 pp., 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib141"><label>141</label><mixed-citation>
      
Sunyer, M. A., Gregersen, I. B., Rosbjerg, D., Madsen, H., Luchner, J., and
Arnbjerg-Nielsen, K.: Comparison of different statistical downscaling
methods to estimate changes in hourly extreme precipitation using RCM
projections from ENSEMBLES, Int. J. Climatol., 35,
2528–2539, <a href="https://doi.org/10.1002/joc.4138" target="_blank">https://doi.org/10.1002/joc.4138</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib142"><label>142</label><mixed-citation>
      
Székely, G. J., Rizzo, M. L., and Bakirov, N. K.: Measuring and testing
dependence by correlation of distances, Ann. Stat., 35,
2769–2794, <a href="https://doi.org/10.1214/009053607000000505" target="_blank">https://doi.org/10.1214/009053607000000505</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib143"><label>143</label><mixed-citation>
      
Tatli, H., Nüzhet Dalfes, H., and Sibel Menteş, Ş.: A
statistical downscaling method for monthly total precipitation over Turkey,
Int. J. Climatol., 24, 161–180, <a href="https://doi.org/10.1002/joc.997" target="_blank">https://doi.org/10.1002/joc.997</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib144"><label>144</label><mixed-citation>
      
Taylor, K. E., Stouffer, R. J., and Meehl, G. A.: An Overview of CMIP5 and
the Experiment Design, B. Am. Meteorol. Soc., 93,
485–498, <a href="https://doi.org/10.1175/BAMS-D-11-00094.1" target="_blank">https://doi.org/10.1175/BAMS-D-11-00094.1</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib145"><label>145</label><mixed-citation>
      
Tibshirani, R.: Regression Shrinkage and Selection Via the Lasso, J. Roy. Stat. Soc. B, 58, 267–288,
<a href="https://doi.org/10.1111/j.2517-6161.1996.tb02080.x" target="_blank">https://doi.org/10.1111/j.2517-6161.1996.tb02080.x</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib146"><label>146</label><mixed-citation>
      
Ting, K. M. and Witten, I. H.: Issues in Stacked Generalization, J.
Artif. Intell. Res., 10, 271–289, <a href="https://doi.org/10.1613/jair.594" target="_blank">https://doi.org/10.1613/jair.594</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib147"><label>147</label><mixed-citation>
      
Tipping, M. E.: Sparse Bayesian learning and the relevance vector machine,
J. Mach. Learn. Res., 1, 211–244, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib148"><label>148</label><mixed-citation>
      
Tripathi, S., Srinivas, V. V., and Nanjundiah, R. S.: Downscaling of
precipitation for climate change scenarios: A support vector machine
approach, J. Hydrol., 330, 621–640, <a href="https://doi.org/10.1016/j.jhydrol.2006.04.030" target="_blank">https://doi.org/10.1016/j.jhydrol.2006.04.030</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib149"><label>149</label><mixed-citation>
      
van der Linden, P. and Mitchell, J.: ENSEMBLES: Climate Change and its Impacts: Summary of research and results from the ENSEMBLES project – European Environment Agency (EEA), Tech. rep., Met Office Hadley Centre, FitzRoy Road, Exeter EX1 3PB, UK, <a href="http://www.eea.europa.eu/data-and-maps/indicators/global-and-european-temperature/ensembles-climate-change-and-its" target="_blank"/> (last access: 30 October 2023), 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib150"><label>150</label><mixed-citation>
      
Vapnik, V.: The nature of statistical learning theory, Springer, New York, NY, <a href="https://doi.org/10.1007/978-1-4757-3264-1" target="_blank">https://doi.org/10.1007/978-1-4757-3264-1</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib151"><label>151</label><mixed-citation>
      
Vrac, M., Marbaix, P., Paillard, D., and Naveau, P.: Non-linear statistical downscaling of present and LGM precipitation and temperatures over Europe, Clim. Past, 3, 669–682, <a href="https://doi.org/10.5194/cp-3-669-2007" target="_blank">https://doi.org/10.5194/cp-3-669-2007</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib152"><label>152</label><mixed-citation>
      
Vu, M. T., Aribarg, T., Supratid, S., Raghavan, S. V., and Liong, S.-Y.:
Statistical downscaling rainfall using artificial neural network:
significantly wetter Bangkok?, Theor. Appl. Climatol., 126, 453–467,
<a href="https://doi.org/10.1007/s00704-015-1580-1" target="_blank">https://doi.org/10.1007/s00704-015-1580-1</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib153"><label>153</label><mixed-citation>
      
Weaver, C. P., Lempert, R. J., Brown, C., Hall, J. A., Revell, D., and
Sarewitz, D.: Improving the contribution of climate model information to
decision making: the value and demands of robust decision frameworks, WIREs
Clim. Change, 4, 39–60, <a href="https://doi.org/10.1002/wcc.202" target="_blank">https://doi.org/10.1002/wcc.202</a>,
2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib154"><label>154</label><mixed-citation>
      
Wilby, R. L. and Dawson, C. W.: The Statistical DownScaling Model: insights
from one decade of application, Int. J. Climatol., 33,
1707–1719, <a href="https://doi.org/10.1002/joc.3544" target="_blank">https://doi.org/10.1002/joc.3544</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib155"><label>155</label><mixed-citation>
      
Wilby, R. L. and Wigley, T. M. L.: Future changes in the distribution of
daily precipitation totals across North America, Geophys. Res.
Lett., 29, 39-1–39–4, <a href="https://doi.org/10.1029/2001GL013048" target="_blank">https://doi.org/10.1029/2001GL013048</a>,
2002.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib156"><label>156</label><mixed-citation>
      
Wilby, R. L., Wigley, T. M. L., Conway, D., Jones, P. D., Hewitson, B. C.,
Main, J., and Wilks, D. S.: Statistical downscaling of general circulation
model output: A comparison of methods, Water Resour. Res., 34,
2995–3008, <a href="https://doi.org/10.1029/98WR02577" target="_blank">https://doi.org/10.1029/98WR02577</a>, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib157"><label>157</label><mixed-citation>
      
Wilby, R. L., Dawson, C. W., and Barrow, E. M.: sdsm – a decision support
tool for the assessment of regional climate change impacts, Environ.
Model. Softw., 17, 145–157, <a href="https://doi.org/10.1016/S1364-8152(01)00060-3" target="_blank">https://doi.org/10.1016/S1364-8152(01)00060-3</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib158"><label>158</label><mixed-citation>
      
Wilby, R. L., Charles, S. P., Zorita, E., Timbal, B., Whetton, P., and
Mearns, L. O.: Guidelines for use of climate scenarios developed from
statistical downscaling methods, Supporting material of the
Intergovernmental Panel on Climate Change, available from the DDC of IPCC
TGCIA, 27, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib159"><label>159</label><mixed-citation>
      
Wilks, D. S.: Statistical methods in the atmospheric sciences, Academic
press, 698 pp., 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib160"><label>160</label><mixed-citation>
      
Wipf, D. and Nagarajan, S.: A new view of automatic relevance determination,
Adv. Neur. In., 20, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib161"><label>161</label><mixed-citation>
      
Wolpert, D. H.: Stacked generalization, Neural Networks, 5, 241–259,
<a href="https://doi.org/10.1016/S0893-6080(05)80023-1" target="_blank">https://doi.org/10.1016/S0893-6080(05)80023-1</a>, 1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib162"><label>162</label><mixed-citation>
      
Wu, T. T. and Lange, K.: Coordinate descent algorithms for lasso penalized
regression, Ann. Appl. Stat., 2, 224–244, <a href="https://doi.org/10.1214/07-AOAS147" target="_blank">https://doi.org/10.1214/07-AOAS147</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib163"><label>163</label><mixed-citation>
      
Xu, R., Chen, N., Chen, Y., and Chen, Z.: Downscaling and Projection of
Multi-CMIP5 Precipitation Using Machine Learning Methods in the Upper Han
River Basin, Adv. Meteorol., 2020, e8680436, <a href="https://doi.org/10.1155/2020/8680436" target="_blank">https://doi.org/10.1155/2020/8680436</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib164"><label>164</label><mixed-citation>
      
Xu, Z., Han, Y., and Yang, Z.: Dynamical downscaling of regional climate: A
review of methods and limitations, Sci. China Earth Sci., 62, 365–375,
<a href="https://doi.org/10.1007/s11430-018-9261-5" target="_blank">https://doi.org/10.1007/s11430-018-9261-5</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib165"><label>165</label><mixed-citation>
      
Zhang, C. and Ma, Y. (Eds.): Ensemble Machine Learning: Methods and Applications, Springer, New York, NY, <a href="https://doi.org/10.1007/978-1-4419-9326-7" target="_blank">https://doi.org/10.1007/978-1-4419-9326-7</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib166"><label>166</label><mixed-citation>
      
Zhang, J., Liu, K., and Wang, M.: Downscaling Groundwater Storage Data in
China to a 1-km Resolution Using Machine Learning Methods, Remote Sens.,
13, 523, <a href="https://doi.org/10.3390/rs13030523" target="_blank">https://doi.org/10.3390/rs13030523</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib167"><label>167</label><mixed-citation>
      
Zhang, X. and Yan, X.: A new statistical precipitation downscaling method
with Bayesian model averaging: a case study in China, Clim. Dynam., 45,
2541–2555, <a href="https://doi.org/10.1007/s00382-015-2491-7" target="_blank">https://doi.org/10.1007/s00382-015-2491-7</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib168"><label>168</label><mixed-citation>
      
Zhou, H., Zhang, J., Zhou, Y., Guo, X., and Ma, Y.: A feature selection
algorithm of decision tree based on feature weight, Expert Syst.
Appl., 164, 113842, <a href="https://doi.org/10.1016/j.eswa.2020.113842" target="_blank">https://doi.org/10.1016/j.eswa.2020.113842</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib169"><label>169</label><mixed-citation>
      
Zorita, E. and von Storch, H.: The Analog Method as a Simple Statistical
Downscaling Technique: Comparison with More Complicated Methods, J.
Climate, 12, 2474–2489, <a href="https://doi.org/10.1175/1520-0442(1999)012&lt;2474:TAMAAS&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0442(1999)012&lt;2474:TAMAAS&gt;2.0.CO;2</a>, 1999.

    </mixed-citation></ref-html>--></article>
