Articles | Volume 18, issue 2
https://doi.org/10.5194/gmd-18-239-2025
https://doi.org/10.5194/gmd-18-239-2025
Methods for assessment of models
 | 
20 Jan 2025
Methods for assessment of models |  | 20 Jan 2025

pathways-ensemble-analysis v1.1.0: an open-source library for systematic and robust analysis of pathway ensembles

Lara Welder, Neil Grant, and Matthew J. Gidden
Abstract

Ensembles of mitigation pathways, produced by multiple different models, are becoming increasingly influential as the world seeks to define climate goals and implement policy to meet them. In this context, a range of open-source codes has been developed to standardise and facilitate the systematic and robust analysis of mitigation pathways. We introduce a new open-source package, pathways-ensemble-analysis, which provides an object-oriented framework for the key steps in analysis, describing its structure and providing an illustrative example of its use. By following the suggested application steps of the tool, a user can conveniently perform a systematic and robust analysis of pathway ensembles. This tool is therefore a further step which can help the community in conducting best practices in pathway ensemble analysis.

1 Introduction

Energy and emissions pathways, such as those produced by integrated assessment models (IAMs), are becoming increasingly influential as the world attempts to address the issue of global warming and reduce emissions rapidly towards net zero in line with the Paris Agreement (Keppo et al., 2021; Krey, 2014; Weyant, 2017).

There are, however, many different future pathways which could comply with the Paris Agreement. Such pathways may vary across demographic, socio-economic, and technological dimensions, meaning that there is a large solution space of possible low-carbon futures which merit consideration. There is therefore a need to understand how to compare and contrast different pathways (Grant et al., 2020) as well as how to draw robust insights from a large number of pathways (Guivarch et al., 2022b). This requires the analysis not of single pathways but of a pathway ensemble – a collection of multiple energy and emission pathways.

The analysis of pathway ensembles has grown rapidly in recent years, largely due to the rise of scenario databases (Byers et al., 2022; Huppmann et al., 2018). These are databases containing a large number of pathways, often produced by a wide range of underlying IAMs. Such ensembles were created to accompany the Intergovernmental Panel on Climate Change (IPCC) Special Report on Global Warming of 1.5 °C and again for the IPCC's Sixth Assessment Report. They have become influential sources of information on what the world needs to do to limit warming to 1.5 °C and have been used by a wide range of actors. The rise of such databases has initiated a discussion about how to derive robust insights from them (Ferrari et al., 2022; Guivarch et al., 2022b).

The development of pathway ensemble analysis has been supported by standardised open data and open-source code. In the context of the submission and analysis process of scenarios for not only IPCC-related activities but also model intercomparison projects, a standardised way of managing data structures with the open-source Python package nomenclature has been developed (Huppmann et al., 2024), which can help standardise the scenario data provided, enabling easier comparison of different pathways using data templates (IAMC, 2024). To analyse, validate, and visualise the scenario data given in this data template, the open-source Python library pyam has also been developed (Huppmann et al., 2021, 2023). The library includes a number of plotting options which enable a side-by-side comparison of models and/or scenarios with only small amounts of additional coding required.

The tools developed so far provide standardised data reporting and analytical tools, which can help when analysing the large number of pathways concurrently. However, there remains space to further develop tools for pathway ensemble analysis. In particular, the ability to filter pathways to select a subset of a broader ensemble, the ability to identify illustrative pathways via a systematic approach, and the ability to visualise and plot key indicators of the ensemble as a whole remain important tasks for which further tools can be developed.

Here we present a new Python-based open-source package, the pathways-ensemble-analysis, or p–e–a, tool (Welder and Grant, 2023). This package provides these functions, improving the ability of the community to conduct systematic and robust analysis of pathway ensembles in a convenient way.

1.1 The use of ensemble analysis in the literature

Different forms of pathway ensemble analysis can be found in the literature.

1.1.1 Model intercomparison exercises

Model intercomparison projects are designed to investigate a specific research question with different models that have harmonised scenario parameter assumptions. In these, the pathways analysis can be performed in situ, allowing for adaptations and iterations of model–scenario combinations. Insights can be obtained from within-ensemble agreement but should be caveated if “structural differences are not systematic and models share approaches or components” (Parker, 2013; Wilson et al., 2021).

Recent model intercomparisons which have produced and analysed pathway ensembles have explored the cost and attainability of meeting climate goals without overshoot (Riahi et al., 2021), the potential for good practice policies to close the emission gap (van Soest et al., 2021), and the temperature implications of current mitigative efforts (van de Ven et al., 2023) and help determine the structural differences between models (Dekker et al., 2023a).

1.1.2 Assessing a pathway ensemble ex situ

As well as in situ pathway ensemble analysis, it is also possible to conduct ex situ analysis. Ex situ refers to analysis of ensembles which have already been created either for a specific research project or by combining pathways from multiple different research projects. The ensemble is now being analysed after its creation to answer a given research question. The most obvious example is the ex situ analysis of scenario databases collated and assessed by the IPCC.

Two examples of how to derive ex situ insights from a pathway ensemble are statistically derived, stand-alone indicators and the analysis of illustrative pathways. Such an ensemble can be “unstructured” in the sense of it not originating from a single model intercomparison exercise but rather being a collection of different, individual projects that can “give an indication of the spread of results in the literature” (van Diemen et al., 2022).

  • Stand-alone indicators. These highlight an individual aspect of a pathway ensemble based on statistical averages. For example, the median level of greenhouse gas (GHG) reductions from 2019–2030 in a pathway ensemble can be calculated as a stand-alone indicator. Such indicators are valuable but represent a statistical property of the ensemble rather than a single, self-consistent pathway that has a particular underlying scenario narrative. Examples of stand-alone indicators include key benchmarks on global emission reductions provided by the IPCC (IPCC, 2023) as well as the expansion rate of global renewable capacities to meet a climate goal (Climate Analytics, 2023) or emission reduction levels needed to keep a country on track with the Paris Agreement (Climate Action Tracker, 2024; Climate Analytics, 2024). We note that stand-alone indicators can also be used for in situ analysis, as seen in Dekker et al. (2023a), and also that scenario ensembles should not generally be seen as statistical ensembles, and thus the interpretation of medians or other quantiles of the distribution requires care (see Sect. 1.1.3 and the Conclusion section for further discussion of this topic).

  • Illustrative pathways. These, on the other hand, are single pathways extracted from the ensemble because they demonstrate particular dynamics which are of interest. They can be used to investigate the “implication of choices on socio-economic development and climate policies, and the associated transformation of the main [greenhouse gas]-emitting sectors” that result from a particular set of assumptions/particular scenario narrative (Riahi et al., 2022). Illustrative pathways have been used to communicate results in a wide range of settings (Grant et al., 2022a; Riahi et al., 2022; Smith et al., 2023).

To determine stand-alone indicators based on statistics or to select illustrative pathways from a pathway ensemble, analyses often start by applying a filtering process which returns a subset of pathways of particular interest for the analysis.

A simple example of a filtering process is the application of filters to ensure that the pathways display correct historical behaviour, which is also known as a “vetting” process, (Guivarch et al., 2022a). This filtered ensemble is then further used to determine stand-alone indicators, such as, for example, emission reduction levels and levels of carbon dioxide removal as well as five illustrative mitigation pathways (Riahi et al., 2022).

The filtering process can also be applied more rigorously, for example by being informed by a political framework, such as the Paris Agreement, or feasibility; sustainability; or ethical concerns, such as, for example, about the technical potential for carbon storage (Grant et al., 2022b), the availability of sustainable biomass (Fuss et al., 2018), or distributive justice concerning negative emissions (Minx et al., 2018). Applying such filters can have a strong impact on the results, which highlights the importance of applying filters in a rigorous and systematic way (Achakulwisut et al., 2023).

1.1.3 Challenges, risks, and good practices

In situ model intercomparison projects strive towards clean comparisons of pathway data for the specific research question they investigate. While they enable a focused exploration of a specific research question, they are, however, labour- and computational-resource-intensive and require access to input data, models, and required hardware.

Performing ex situ analysis on larger pathway ensembles pulls together a larger set of evidence. The potential benefits of using large ensembles include that they may better capture uncertainties; increase the salience, credibility, and legitimacy of the information produced; and be a way of building a comprehensive or representative picture of the knowledge produced by modellers (Guivarch et al., 2022b). Such ensembles are nevertheless not a meaningful, random statistical sample that fully covers a potential solution space. Bias exists, for example, through model fingerprints and/or an over-representation of multiple similar scenarios coming from the same model intercomparison projects (Guivarch et al., 2022b; Peters et al., 2023). This can introduce confounding effects beyond the mechanism that an ex situ analysis attempts to study.

Given the challenges and risks, Guivarch et al. propose a three-step approach for preparing and using ensembles of mitigation scenarios (Guivarch et al., 2022b), which include

  • 1.

    pre-processing the ensemble, including quality control and vetting as well as reporting and potentially correcting bias;

  • 2.

    either

    • a.

      transparently selecting scenarios from the ensemble, for example, based on specific (un)desirable outcomes, plausibility criteria, or seeking to represent the diversity of the ensemble, or

    • b.

      exploring the full ensemble; and

  • 3.

    providing users with efficient access to the information, including decision support and communication tools and transparent and reproducible meta analysis.

In addition to this, we highlight that when communicating statistical properties calculated from a pathway ensemble, it is important to highlight that these describe and parameterise the existing “ensemble of opportunity” of (generally) normative scenarios rather than a full statistical ensemble. As such, interpreting these values as indicative of probabilities, expected values or statistical ranges should be avoided.

1.2 Aim of the p–e–a package

Both the in situ and the ex situ pathway ensemble analysis share a number of common steps. These are

  • the evaluation of criteria based on model results;

  • an optional filtering process to select only a subset of pathways; and

  • a well-laid-out, if desired rated, side-by-side comparison of the remaining pathways with their evaluated criteria, which can then be used for further analysis.

These steps should be guided by the abovementioned good practices (Guivarch et al., 2022b).

This paper introduces a Python-based workflow, pathways-ensemble-analysis, which standardises and automates these steps, building on existing work in the research community, such as the Python library pyam. The workflow can thus support the analysis of model intercomparison projects and pathway ensembles by providing additional, easily obtained insights which provide a fast and, when guided by good practices, well-laid-out and comprehensible overview of the pathway ensemble of interest. This can be used in in situ, ex situ, and blended project setups, in which both elements are present.

The method of this workflow is outlined in the next section and an application is presented in the section that follows it.

2 Method

In the Method section, we first illustrate the workflow of the Python package. Second, we provide a description of how the package is implemented.

2.1 Workflow

This section describes the developed workflow which derives a well-laid-out, comprehensible overview of a pathway ensemble. The workflow is implemented in an object-oriented manner in the open-source Python library pathways-ensemble-analysis.

Figure 1 visualises the following illustrative workflow:

  • 1.

    The analysis starts with extracting pathway data. Typically, these are either obtained from local files in the IAMC data format or downloaded from a pathway database, such as, for example, the ones hosted by the International Institute for Applied Systems Analysis (IIASA) (Huppmann et al., 2018), which can be conveniently accessed using pyam. Typically, external data pre-processing routines are run on such datasets to address missing or faulty data. An example of missing but patchable data is if the total use of bioenergy in the power sector is given along with the use of bioenergy with carbon capture and storage (CCS), but the use of bioenergy without CCS is not provided. An example of faulty data is when the total electricity generation does not add up to the sum of its components, which can be remedied by either recalculating the total or dropping redundant components. Once the data are pre-processed, they are passed on as a pyam.IamDataFrame object.

  • 2.

    The next step is the definition and evaluation of criteria for each pathway. Examples of criteria are the emission reductions in 2030 with respect to a base year; the share of non-biomass renewables in 2050 in the power sector; the mean carbon sequestration via land use, biomass, or fossil fuels over a given number of years; the maximum exceedance probability of a temperature limit; or the magnitude of regional differentiation in a pathway. In this step, pyam's filtering functions and a mixture of algebraic operations with pyam and pandas is being used to evaluate the criteria before finally returning a pandas.DataFrame object.

  • 3.

    The next step is to filter the pathway ensemble to select a subset of the initial ensemble. A filtering process drops pathways with criteria outside a given range from the ensemble. Examples of filters are to avoid overreliance on negative emissions from land use or bioenergy with CCS across a given time period. This optional step is of specific interest for ex situ analysis of pre-existing pathway ensembles and might be of lesser importance for model intercomparison projects, which can partly enforce these filters a priori in their scenario input parameters.

  • 4.

    Having produced a filtered subset of pathways for analysis, the pathways can be rated along a range of criteria defined in step 2. The criteria used to rate pathways can be those which were used to filter the database and/or additional used-defined criteria. The usage of the rating function is twofold. On the one hand, the function can be used to normalise the criteria, for example, by mapping them to values from 0 to 1, and in this way improve the readability of the final output plots. On the other hand, the function can be used to rate the criteria of each pathway based on normative preferences. Simple examples of rating functions are

    • a.

      having a high share of non-biomass renewable electricity generation, x → x;

    • b.

      having a low share of fossil electricity generation, x1-x.

  • 5.

    Criteria rated in such a way are then available for visualisation. Outputs can, for example, be visualised with a heatmap which displays the rated criteria with the filtered pathways are sorted based on their overall rating.

https://gmd.copernicus.org/articles/18/239/2025/gmd-18-239-2025-f01

Figure 1Flowchart setup provided for the pathway ensemble analysis. The data pre-processing process in dashed lines is in theory optional but is advised to be addressed with external programming routines.

Download

2.2 Package description

The Python library containing the object-oriented setup of the workflow is structured as described below.

  • In the evaluation module, the core methods get_values, filter_values, rate, and filter_rating are located, which process the pathway data, user-defined criteria, and other user-defined input data as visualised in Fig. 1.

  • In the criteria module, classes for criteria are implemented, which, at a minimum, contain a criterion_name, rating_function, rating_weight, region, and region_aggregation_weight as class parameters and get_values and rate methods as class functions. The criteria module contains two sub-modules.

    • In the base module, the following criteria classes are currently implemented:

      • Criterion, the basic criterion class which other criteria inherit from;

      • SingleVariableCriterion, which evaluates the value of a variable for a given year and region;

      • AggregateCriterion, which evaluates the aggregate of a variable, for example, the average, minimum, or maximum, for given years and a given region;

      • ChangeOverTimeCriterion, which evaluates the change in a variable for a given year and region with respect to a reference year;

      • ShareCriterion, which evaluates the share of a component on the total for a given year and a given region;

      • CompareRegionCriterion, which takes a pre-defined criterion (for example, the share of renewables in the electricity mix) and two regions and calculates a metric which compares the value of the criterion in each region, and, currently, the comparison can be either a subtract or a divide operation.

    • In the library module, criteria for specific, reappearing use cases are implemented (pre-set parameters can be changed by the user), of which examples are the following:

      • Mean_CarbonSequestration_Fossil, which evaluates the average amount of global fossil CCS across the years 2040 to 2060. The rating function is informed by literature values on the potential of CCS (Budinis et al., 2018; Guivarch et al., 2022a).

      • Mean_CarbonSequestration_Biomass, which evaluates the average amount of globally sequestered carbon via bioenergy with CCS across the years 2040, 2050, and 2060. The rating function is informed by estimates of the global potential of sustainable negative emissions from bioenergy with CCS (Fuss et al., 2018).

      • Mean_CarbonSequestration_LandUse, which evaluates the average global carbon dioxide emissions from afforestation and reforestation across the years 2040, 2050, and 2060. The rating function is informed by estimates of the global potential of sustainable/feasible potential of negative emissions coming from afforestation and reforestation (Fuss et al., 2018; Grant et al., 2021).

      • Mean_Biomass_PrimaryEnergy, which evaluates the average global amount of biomass use in primary energy across the years 2040, 2050, and 2060. The rating function is informed by literature values on the sustainable technical potential of bioenergy (Creutzig et al., 2015; Frank et al., 2021).

  • The plot module is intended for providing plotting methods to the user. Currently, three main plotting methods are provided here. The first, called heatmap, enables the visualisation of the pathway ensemble for the criteria of interest. The second, called compare_ensemble, allows for multiple different pathway ensembles to be compared using box plots. The third one is inspired by recent work (Dekker et al., 2023a) and displays criteria values in form of a polar_chart.

  • The utils module contains a number of utility methods used in other modules.

  • A tests module is provided to ensure the quality of the code and support the continuous integration and development of new code.

3 Application

In this article, we demonstrate with one example how the pathways-ensemble-analysis repository can be used in the analysis of pathway ensembles. In this example, we use the package to identify a filtered subset of pathways from the IPCC AR6 scenario database (Byers et al., 2022); highlight the impact of filtering on ensemble statistics, for example, on stand-alone indicators; and identify an illustrative pathway for further investigation. Additional examples are briefly described in the last subsection, such as, for example, a recreation of the IPCC AR6 vetting process (Guivarch et al., 2022a) and a model fingerprint analysis in the style of recently published work (Dekker et al., 2023a). The code to reproduce the entire presented analysis can be found in notebooks folder in the Git repository of the package.

3.1 Input data to the workflow

The raw data which serve as input to this ensemble are the AR6 scenario database (Byers et al., 2022). This provides 97 pathways compatible with 1.5 °C , which are the starting point for our analysis. This selection is in itself already a filtering step, but one that can easily be achieved with the pyam library.

We conduct an analysis using eight user-defined criteria. We distinguish between primary criteria and secondary criteria. Primary criteria are used to filter the database, directly excluding pathways which have particular behaviour in order to select a subset of pathways for analysis. Secondary criteria are not used directly in the filtering process but are still used for rating and visualising the ensemble and supporting the selection of an illustrative pathway of interest. Generally, it is up to the user to decide which criteria to use for a filtering step and which to use for a rating step. The criteria are described in Table 1.

Table 1Primary and secondary criteria used in the example. A/R stands for afforestation/reforestation.

Download Print Version | Download XLSX

The filters of the primary criteria have been used (alongside others) to identify a Paris Agreement-compatible set of pathways in a recent analysis (Climate Analytics, 2023).

The secondary criteria are not used for filtering but are used to obtain further insights into the pathway ensemble. In this example, the aim is to focus on pathways which rapidly reduce fossil fuel demand based on the deployment of renewables and limited reliance on biomass or fossil CCS. Such a focus could be justified by the precautionary principle (which would suggest faster emissions cuts) or with reference to the potential sustainability/feasibility concerns relating to biomass (Creutzig et al., 2015) and CCS (Grant et al., 2022b).

3.2 Filtering of the ensemble and its impact on stand-alone indicators

Applying this filtering process to the IPCC's AR6 scenario database (Byers et al., 2022) reduces the number of pathways compatible with 1.5 °C from 97 to 30 pathways.

Figure 2 shows the impact that the filtering has on the secondary criteria using the compare_ensemble plotting function.

https://gmd.copernicus.org/articles/18/239/2025/gmd-18-239-2025-f02

Figure 2The impact of filtering on a selection of variables of interest.

Download

In this example, filtering the pathway ensemble to reduce reliance on future CDR leads to greater reductions in fossil fuel production/use by 2030 (a 35 % reduction from 2020 levels rather than a 29 % reduction seen in the unfiltered ensemble). This greater action is driven in part by accelerated deployment of renewables, with renewables making up 71 % of the global electricity mix in 2030, which is up from 67 % in the unfiltered ensemble. Reduced reliance on future carbon dioxide removal (CDR) also corresponds to reduced reliance on biomass as an energy carrier.

The changes in the median of these stand-alone indicators are sometimes minor, but there are nevertheless large changes in the overall ensemble range. This is particularly evident in the share of renewables, biomass demand, and fossil CCS deployment indicators, where the filtering process excludes those pathways with the lowest deployment of renewables and highest reliance on biomass/fossil CCS. As interquartile or total ensemble ranges are often provided alongside the median as influential key statistics (Riahi et al., 2022; Rogelj et al., 2018), this highlights the potential influence of filtering on the results of pathway analysis. Given the key focus at the moment on the role of fossil fuels in mitigation pathways (Achakulwisut et al., 2023), the influence of the filtering on a key benchmark such as fossil fuel reductions also shows the critical importance of considering filtering as part of a pathway ensemble analysis.

The such filtered pathway ensemble can now be used to determine stand-alone indicators, such as the median and ranges visualised in Fig. 2. If more insights into the ensemble are desired, a side-by-side visualisation, coupled with an optional rating step, can be performed.

3.3 Rating and visualising the pathway ensemble

A side-by-side comparison with normalised criteria values ranging from 0 to 1, for example, can support the analysis of how the different pathways achieve a transformation pathway compatible with 1.5 °C. The p–e–a's rate and heatmap plotting function can be used to facilitate this. If it is of additional interest to identify illustrative pathways for further analysis, these can be selected based on the ratings of each criterion.

We rank four main criteria to illustrate the differences between pathways. These criteria, first introduced as secondary criteria in Table 1, are shown in Table 2 with their rating functions. Rating functions have two main dimensions. First is whether the function is selecting for low or high values of the criterion. In this example, we select for low levels of biomass, fossil CCS, and total emissions (negative rating functions), with high levels of renewables (positive rating function). The second is the sensitivity of the rating function to the criterion values. By weighting the value of x more highly (e.g. lambda x: np.clip(2*x - 1, 0, 1)) and applying threshold values, the rating function can increase the selectivity of the analysis to this variable. In the above example, values under 0.5 would score zero, and then every increase of 0.01 above this would increase the score by 0.02. In this way, very tailored filters can be developed that select and highlight particular behaviours.

The developing of rating functions is an inherently normative process, but one which gives a high degree of control over which criteria to rate and the relative importance of each criterion. If this is transparently communicated, this flexibility and control is a key strength of the p–e–a.

Table 2Rating criteria for the analysis.

Download Print Version | Download XLSX

Having rated the pathways across the criterion of interest, we can visualise the pathways using the heatmap function. This function produces a heatmap in which each column represents an individual pathway and each row represents a user-defined criterion of interest. The function then calculates the aggregated rating for each pathway across the criteria and assigns the pathway a total rating. The highest-rated pathways are plotted on the left, with the pathway rating declining from left to right. The heatmap function gives the option to also plot criteria which are of interest but are not used in the overall rating itself.

Figure 3 shows such a heatmap for the filtered set identified using the criteria in Table 1 (the 15 highest-scoring pathways out of the 30 pathways which pass the filters are shown). The pathways are rated and ordered according to the four secondary criteria of interest. Therefore, we are identifying pathways which both

  • a.

    pass the filters which are used as strict exclusion criteria;

  • b.

    have rapid reductions in fossil fuels in the near term driven primarily by the deployment of renewables, with limited reliance on biomass and fossil CCS deployment.

The heatmap also provides further insights into the model dynamics. For example, we can see that a few REMIND-MAgPIE pathways have relatively low fossil CCS deployment and low average biomass demand, pointing at high wind and solar electricity shares in power generation without the need for fossil CCS. The shown COFFEE pathway has the highest share of renewable electricity generation, which is, however, linked to a strong reliance on biomass demand. We can further observe that the displayed WITCH pathways have agriculture, forestry, and other land uses (AFOLU) emissions within the sustainability limits while being more reliant on biomass both in terms of general demand and average bioenergy with carbon capture and storage (BECCS) deployment.

https://gmd.copernicus.org/articles/18/239/2025/gmd-18-239-2025-f03

Figure 3Heatmap that enables identification illustrative pathways.

Download

3.4 Selecting an illustrative pathway from the ensemble

As mentioned in the Introduction section of this work, illustrative pathways can be extracted from the ensemble to demonstrate particular dynamics of interest. They can be used to investigate the “implication of choices on socio-economic development and climate policies, and the associated transformation of the main [greenhouse gas]-emitting sectors” that result from a particular set of assumptions/particular scenario narrative (Riahi et al., 2022).

The process we have applied so far has identified pathways which pass the defined exclusion criteria and promote rapid emission reductions in the near term driven primarily by the deployment of renewables, with limited reliance on biomass and fossil CCS deployment. The first two pathways on the left side of the heatmap comply with these criteria particularly well – with having the highest rating across the ensemble – and are therefore candidates for further analysis. It is of interest to note that these are in fact two of the three illustrative mitigation pathways compatible with 1.5 °C selected by the IPCC AR6 for further analysis (Riahi et al., 2022).

The identification of illustrative pathways with differently chosen socio-economic developments and climate policies can be identified in a similar manner using differently specified criteria.

3.5 Additional examples

The tool can be flexibly applied to investigate different characteristics of pathway ensembles. In the following, we briefly show two such examples. A detailed derivation and description of these examples can be found in the repository of the tool.

3.5.1 Vetting process

The IPCC AR6 vetting process (Guivarch et al., 2022a) can be recreated in a straightforward manner with the tool. In this process, pathways that have historical energy and emission values outside of an acceptable range are being dropped from further analyses. Figure 4 displays the vetted historical criteria, where the legend indicates how many pathways have information on the vetted criteria and how many of these remain in the ensemble after the filtering process.

https://gmd.copernicus.org/articles/18/239/2025/gmd-18-239-2025-f04

Figure 4Filtering based on historical data vetting as done in the IPCC AR6 vetting process (Guivarch et al., 2022a).

3.5.2 Fingerprint analysis

Inspired by recently published work on energy model fingerprints in mitigation scenarios (Dekker et al., 2023a), the polar_chart plotting function can display statistical characteristics of the chosen criteria/indicators; see Fig. 5.

https://gmd.copernicus.org/articles/18/239/2025/gmd-18-239-2025-f05

Figure 5Example of a polar chart plot created with the pathway ensemble analysis tool (inspired by recently published work on energy model fingerprints in mitigation scenarios; Dekker et al., 2023a).

4 Conclusion

The open-source library presented in this work provides the research community with a tool to perform analyses of pathway ensembles. The library utilises and expands the existing work of the community, specifically the pyam library, guaranteeing compatibility with current data standards and coding practices as well as ease of use.

The open-source availability on GitLab provides transparency to the implemented method and aims to encourage the community to contribute and further expand the library. A testing module is integrated to support the continuous integration and development of new code.

The object-oriented implementation of the core code of the library provides the user of the code with the ability to design the analysis in a flexible manner, for example, by setting the parameters of predefined criteria freely or by having the option to easily define new criteria as needed. It furthermore significantly shortens otherwise implemented code, resulting in concise and easy-to-write code blocks, which provides a good overview over the analysis and therefore convenience to the user.

The library has a wide range of applications, including pathway ensemble analysis in model intercomparison exercises or deriving ex situ insights from (unstructured) pathway ensembles, for example, to determine stand-alone statistical indicators or illustrative pathways. For this purpose, the library provides key functionalities commonly used in ensemble analysis. These include the definition of criteria of interest and the evaluation, filtering, and rating of these criteria, as well as visualisation functions which can help demonstrate the impact of filtering and rating.

The impact of the filtering and rating operations are relevant to be cognisant of at almost all steps of such analyses. One example to highlight is the calculation of stand-alone statistical indicators, such as the level of fossil fuel reduction that complies with the Paris Agreement. The simple application provided in this work, which reduces the reliance on future CDR and therefore implies greater levels of ambition in the near term, is already an example of this.

Limitations exist to both the dataset and the method for processing these datasets in such analyses. The scenario data themselves can have missing or faulty data, the solution space is not statistically representative, and therefore the calculation, and interpretation of statistical indicators are challenging. While working with illustrative pathways is not affected by the latter, the selection process of getting to these pathways is always influenced by the user-defined criteria with their filtering and rating functions.

Nevertheless, literature also points out the benefits of using large ensembles ex situ, for example, so that they may better capture uncertainties (Guivarch et al., 2022b). Under the premise that the underlying scenario dataset, with its bias and the choice of criteria, filters, and rating functions, is processed with good practices, for example, that are clearly communicated, the functionalities of the pathways-ensemble-analysis tool provide a foundation for performing a transparent, robust, and systematic analysis of a pathway ensemble. This library could be used in future community endeavours, such as the construction and evaluation of new IPCC scenario databases, model intercomparison projects, and ex situ analysis of IPCC databases to provide key metrics such as CDR and emission reduction requirements. Criteria with predefined rating functions and filters could be discussed and standardised across the community.

Future work can be identified when reviewing recent work on pathway ensemble analysis in literature (Dekker et al., 2023a, b; Guivarch et al., 2022b; Smith, 2022).

While filtering is a key step in determining robust insights into a pathway ensemble, the structure of the ensemble should also be reflected upon critically. Here, one example is whether the calculation of stand-alone indicators should be weighted by the frequency with which a particular model features in the pathway ensemble. This could help avoid models with a specific fingerprint and a high (or low) occurrence from being over-represented (or under-represented) in the insights derived from the ensemble. The pathways-ensemble-analysis library could be extended so that the calculation of stand-alone indicators accounts for their relative representation in the overall ensemble. At the same time, models with a high level of occurrence in the pathway ensemble could still provide a statistically relevant distribution of pathways, in which case weighting by model frequency may be less appropriate.

Literature also provides inspiration for new analysis and plotting routines, such as for identifying model fingerprints by analysing criteria for individual models or by determining cluster of pathways with distinct characteristics (i.e. criteria). The pathways-ensemble-analysis library could be used further in this endeavour, with an illustrative example provided on the repository.

Code and data availability

The general pathways-ensemble-analysis GitLab repository is available under the MIT License at https://gitlab.com/climateanalytics/pathways-ensemble-analysis (GitLab, 2025). Version 1.0.0 of the pathways-ensemble-analysis repository, which is presented in this paper, is available on GitLab and archived on Zenodo (https://doi.org/10.5281/zenodo.10197980, Welder and Grant, 2023). Version 1.1.0, which includes an updated version of the input data and scripts to run the model and produce the plots for all the simulations presented in this paper, is available on GitLab and Zenodo as well (https://doi.org/10.5281/zenodo.11057268, Welder and Grant, 2024).

Author contributions

LW conceptualised the workflow, set up the initial software code basis, and wrote the manuscript draft; NG contributed to the conceptualisation of the workflow, expanded the software code basis, was the lead on the development of the shown application, and wrote the manuscript draft; MJG contributed to the conceptualisation of the workflow and reviewed and edited the manuscript.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Financial support

This research has been supported by the IKEA Foundation (grant no. G-2107-01829).

Review statement

This paper was edited by Christoph Müller and reviewed by Jan Ivar Korsbakken and Mark Dekker.

References

Achakulwisut, P., Erickson, P., Guivarch, C., Schaeffer, R., Brutschin, E., and Pye, S.: Global fossil fuel reduction pathways under different climate mitigation strategies and ambitions, Nat. Commun., 14, 5425, https://doi.org/10.1038/s41467-023-41105-z, 2023. 

Budinis, S., Krevor, S., Dowell, N. M., Brandon, N., and Hawkes, A.: An assessment of CCS costs, barriers and potential, Energy Strateg. Rev., 22, 61–81, https://doi.org/10.1016/j.esr.2018.08.003, 2018. 

Byers, E., Krey, V., Kriegler, E., Riahi, K., Schaeffer, R., Kikstra, J., Lamboll, R., Nicholls, Z., Sandstad, M., Smith, C., van der Wijst, K., Al -Khourdajie, A., Lecocq, F., Portugal-Pereira, J., Saheb, Y., Stromman, A., Winkler, H., Auer, C., Brutschin, E., Gidden, M., Hackstock, P., Harmsen, M., Huppmann, D., Kolp, P., Lepault, C., Lewis, J., Marangoni, G., Müller-Casseres, E., Skeie, R., Werning, M., Calvin, K., Forster, P., Guivarch, C., Hasegawa, T., Meinshausen, M., Peters, G., Rogelj, J., Samset, B., Steinberger, J., Tavoni, M., and van Vuuren, D.: AR6 Scenarios Database (1.1), Zenodo [data set], https://doi.org/10.5281/zenodo.7197970, 2022. 

Climate Action Tracker: Modelled domestic pathways, https://climateactiontracker.org/methodology/cat-rating-methodology/modelled-domestic-pathways/ (last access: 17 January 2024), 2024. 

Climate Analytics: 2030 targets aligned to 1.5 °C: Evidence from the latest global pathways, https://climateanalytics.org/publications/2023/2030-targets-aligned-to-15c-evidence-from-the-latest-global-pathways/ (last access: 15 June 2023), 2023. 

Climate Analytics: 1.5 °C National Pathway Explorer, https://1p5ndc-pathways.climateanalytics.org (last access: 1 October 2024), 2024. 

Creutzig, F., Ravindranath, N. H., Berndes, G., Bolwig, S., Bright, R., Cherubini, F., Chum, H., Corbera, E., Delucchi, M., Faaij, A., Fargione, J., Haberl, H., Heath, G., Lucon, O., Plevin, R., Popp, A., Robledo-Abad, C., Rose, S., Smith, P., Stromman, A., Suh, S., and Masera, O.: Bioenergy and climate change mitigation: An assessment, GCB Bioenergy, 7, 916–944, https://doi.org/10.1111/gcbb.12205, 2015. 

Dekker, M. M., Daioglou, V., Pietzcker, R., Rodrigues, R., de Boer, H.-S., Dalla Longa, F., Drouet, L., Emmerling, J., Fattahi, A., Fotiou, T., Fragkos, P., Fricko, O., Gusheva, E., Harmsen, M., Huppmann, D., Kannavou, M., Krey, V., Lombardi, F., Luderer, G., Pfenninger, S., Tsiropoulos, I., Zakeri, B., van der Zwaan, B., Usher, W., and van Vuuren, D.: Identifying energy model fingerprints in mitigation scenarios, Nature Energy, 8, 1395–1404, https://doi.org/10.1038/s41560-023-01399-1, 2023a. 

Dekker, M. M., Hof, A. F., van den Berg, M., Daioglou, V., van Heerden, R., van der Wijst, K.-I., and van Vuuren, D. P.: Spread in climate policy scenarios unravelled, Nature, 624, 309–316, https://doi.org/10.1038/s41586-023-06738-6, 2023b. 

Ferrari, L., Carlino, A., Gazzotti, P., Tavoni, M., and Castelletti, A.: From optimal to robust climate strategies: expanding integrated assessment model ensembles to manage economic, social, and environmental objectives, Environ. Res. Lett., 17, 084029, https://doi.org/10.1088/1748-9326/ac843b, 2022. 

Frank, S., Gusti, M., Havlík, P., Lauri, P., DiFulvio, F., Forsell, N., Hasegawa, T., Krisztin, T., Palazzo, A., and Valin, H.: Land-based climate change mitigation potentials within the agenda for sustainable development, Environ. Res. Lett., 16, 024006, https://doi.org/10.1088/1748-9326/abc58a, 2021. 

Fuss, S., Lamb, W. F., Callaghan, M. W., Hilaire, J., Creutzig, F., Amann, T., Beringer, T., De Oliveira Garcia, W., Hartmann, J., Khanna, T., Luderer, G., Nemet, G. F., Rogelj, J., Smith, P., Vicente, J. V., Wilcox, J., Del Mar Zamora Dominguez, M., and Minx, J. C.: Negative emissions – Part 2: Costs, potentials and side effects, Environ. Res. Lett., 13, 063002, https://doi.org/10.1088/1748-9326/aabf9f, 2018. 

GitLab: Pathways Ensemble Analysis, GitLab [data set], https://gitlab.com/climateanalytics/pathways-ensemble-analysis (last access: 18 January 2025), 2025. 

Grant, N., Hawkes, A., Napp, T., and Gambhir, A.: The appropriate use of reference scenarios in mitigation analysis, Nat. Clim. Change, 10, 605–610, https://doi.org/10.1038/s41558-020-0826-9, 2020. 

Grant, N., Hawkes, A., Mittal, S., and Gambhir, A.: The policy implications of an uncertain carbon dioxide removal potential, Joule, 5, 2593–2605, https://doi.org/10.1016/j.joule.2021.09.004, 2021. 

Grant, N., Wilson, R., Majid, A., Welder, L., Hörsch, J., Fyson, C., and Hare, B.: 1.5 °C Pathways for the EU27: accelerating climate action to deliver the Paris Agreement, https://climateanalytics.org/publications/15c-pathways-for-the-eu27-accelerating-climate-action-to (last access: 18 January 2025), 2022a. 

Grant, N., Gambhir, A., Mittal, S., Greig, C., and Köberle, A. C.: Enhancing the realism of decarbonisation scenarios with practicable regional constraints on CO2 storage capacity, Int. J. Greenh. Gas Con., 120, 103766, https://doi.org/10.1016/j.ijggc.2022.103766, 2022b. 

Guivarch, C., Kriegler, E., Joana Portugal-Pereira, V. B., Edmonds, J., Fischedick, M., Havlík, P., Jaramillo, P., Krey, V., Lecocq, F., Lucena, A. F. P., Meinshausen, M., Mirasgedis, S., O'Neill, B., Peters, G. P., Rogelj, J., Rose, S., Saheb, Y., Strbac, G., Strømman, A. H., van Vuuren, D. P., and Zhou, N.: Annex III: Scenarios and modelling methods, in: Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Shukla, P. R., Skea, J., Slade, R., Al Khourdajie, A., van Diemen, R., McCollum, D., Pathak, M., Some, S., Vyas, P., Fradera, R., Belkacemi, M., Hasija, A., Lisboa, G., Luz, S., and Malley, J., Cambridge University Press, Cambridge, UK and New York, NY, USA, 1841–1908, https://doi.org/10.1017/9781009157926.022, 2022a. 

Guivarch, C., Le Gallic, T., Bauer, N., Fragkos, P., Huppmann, D., Jaxa-Rozen, M., Keppo, I., Kriegler, E., Krisztin, T., Marangoni, G., Pye, S., Riahi, K., Schaeffer, R., Tavoni, M., Trutnevyte, E., van Vuuren, D., and Wagner, F.: Using large ensembles of climate change mitigation scenarios for robust insights, Nat. Clim. Change, 12, 428–435, https://doi.org/10.1038/s41558-022-01349-x, 2022b. 

Huppmann, D., Rogelj, J., Kriegler, E., Krey, V., and Riahi, K.: A new scenario resource for integrated 1.5 °C research, Nat. Clim. Change, 8, 1027–1030, https://doi.org/10.1038/s41558-018-0317-4, 2018. 

Huppmann, D., Gidden, M., Nicholls, Z., Hörsch, J., Lamboll, R., Kishimoto, P., Burandt, T., Fricko, O., Byers, E., Kikstra, J., Brinkerink, M., Budzinski, M., Maczek, F., Zwickl-Bernhard, S., Welder, L., Alvarez Quispe, E., and Smith, C.: pyam: Analysis and visualisation of integrated assessment and macro-energy scenarios, version 2; peer review: 3 approved, Open Research Europe, 1, 74, https://doi.org/10.12688/openreseurope.13633.2, 2021. 

Huppmann, D., Gidden, M. J., Nicholls, Z., Hörsch, J., Lamboll, R. D., Kishimoto, P. N., Burandt, T., Fricko, O., Byers, E., Kikstra, J. S., Brinkerink, M., Budzinski, M., Maczek, F., Zwickl-Bernhard, S., Welder, L., Alvarez Quispe, E. F., and Smith, C. J.: pyam: analysis and visualization of integrated- assessment and macro-energy scenarios, Zenodo [code], https://doi.org/10.5281/zenodo.10391054, 2023. 

Huppmann, D., Wienpahl, L., Hackstock, P., and Castella, L.: nomenclature, Zenodo [data set], https://doi.org/10.5281/zenodo.10462491, 2024. 

IAMC: IAMC time-series data template: https://www.iamconsortium.org/scientific-working-groups/data-protocols-and-management/iamc-time-series-data-template/ (last access: 19 January 2024), 2024. 

IPCC: Climate Change: Synthesis Report. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Core Writing Team, Lee, H., and Romero, J., IPCC, Geneva, Switzerland, Intergovernmental Panel on Climate Change (IPCC), https://doi.org/10.59327/IPCC/AR6-9789291691647, 2023. 

Keppo, I., Butnar, I., Bauer, N., Caspani, M., Edelenbosch, O., Emmerling, J., Fragkos, P., Guivarch, C., Harmsen, M., Lefevre, J., Le Gallic, T., Leimbach, M., Mcdowall, W., Mercure, J. F., Schaeffer, R., Trutnevyte, E., and Wagner, F.: Exploring the possibility space: taking stock of the diverse capabilities and gaps in integrated assessment models, Environ. Res. Lett., 16, 053006, https://doi.org/10.1088/1748-9326/abe5d8, 2021. 

Krey, V.: Global energy-climate scenarios and models: A review, WIREs Energy Environ., 3, 363–383, https://doi.org/10.1002/wene.98, 2014. 

Minx, J. C., Lamb, W. F., Callaghan, M. W., Fuss, S., Hilaire, J., Creutzig, F., Amann, T., Beringer, T., De Oliveira Garcia, W., Hartmann, J., Khanna, T., Lenzi, D., Luderer, G., Nemet, G. F., Rogelj, J., Smith, P., Vicente Vicente, J. L., Wilcox, J., and Del Mar Zamora Dominguez, M.: Negative emissions - Part 1: Research landscape and synthesis, Environ. Res. Lett., 13, 063001, https://doi.org/10.1088/1748-9326/aabf9b, 2018. 

Parker, W. S.: Ensemble modeling, uncertainty and robust predictions, WIREs Clim. Change, 4, 213–223, https://doi.org/10.1002/wcc.220, 2013. 

Peters, G. P., Al Khourdajie, A., Sognnaes, I., and Sanderson, B. M.: AR6 scenarios database: an assessment of current practices and future recommendations, npj Climate Action, 2, 31, https://doi.org/10.1038/s44168-023-00050-9, 2023. 

Riahi, K., Bertram, C., Huppmann, D., Rogelj, J., Bosetti, V., Cabardos, A., Deppermann, A., Drouet, L., Frank, S., Fricko, O., Fujimori, S., Harmsen, M., Hasegawa, T., Krey, V., Zwaan, B. V. D., Vrontisi, Z., Longa, F. D., Després, J., Keramidas, K., Kishimoto, P., Kriegler, E., Meinshausen, M., Nogueira, L. P., Oshiro, K., Popp, A., Rochedo, P. R. R., Ünlü, G., Ruijven, B. V., Takakura, J., Tavoni, M., and Vuuren, D. V.: Cost and attainability of meeting stringent climate targets without overshoot, Nat. Clim. Change, 11, 1063–1069, https://doi.org/10.1038/s41558-021-01215-2, 2021. 

Riahi, K., Schaeffer, R., Arango, J., Calvin, K., Guivarch, C., Hasegawa, T., Jiang, K., Kriegler, E., Matthews, R., Peters, G. P., Rao, A., Robertson, S., Sebbit, A. M., Steinberger, J., Tavoni, M., and Van Vuuren, D. P.: Mitigation pathways compatible with long-term goals., in: IPCC, 2022: Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Shukla, P. R., Skea, J., Slade, R., Khourdajie, A. A., van Diemen, R., McCollum, D., Pathak, M., Some, S., Vyas, P., Fradera, R., Belkacemi, M., Hasija, A., Lisboa, G., Luz, S., and Malley, J., Cambridge University Press, Cambridge, UK and New York, NY, USA, https://doi.org/10.1017/9781009157926.005, 2022. 

Rogelj, J., Shindell, D., Jiang, K., Fifita, S., Forster, P., Ginzburg, V., Handa, C., Kheshgi, H., Kobayashi, S., Kriegler, E., Mundaca, L., Gomis, M. I., Lonnoy, E., Maycock, T., and Tignor, M.: Mitigation Pathways Compatible with 1.5 °C in the Context of Sustainable Development, Global warming of 1.5 °C. An IPCC Special Report on the impacts of global warming of 1.5 °C above pre-industrial levels and related global greenhouse gas emission pathways, in the context of strengthening the global response to the threat of climate change, IPCC, https://www.ipcc.ch/sr15/chapter/chapter-2/ (last access: 18 January 2025), 2018. 

Smith, C. J.: A framework for downweighting similar scenarios in integrated assessment model ensembles, Scenarios Forum, Laxenburg, Austria, https://scenariosforum.org/wp-content/uploads/sites/3/2022/06/SF22_Agenda.pdf (last access: 18 January 2025), 2022. 

Smith, S. M., Geden, O., Nemet, G. F., Gidden, M. J., Lamb, W. F., Powis, C., Bellamy, R., Callaghan, M. W., Cowie, A., Cox, E., Fuss, S., Gasser, T., Grassi, G., Greene, J., Lück, S., Mohan, A., Müller-Hansen, F., Peters, G. P., Pratama, Y., Repke, T., Riahi, K., Schenuit, F., Steinhauser, J., Strefler, J., Valenzuela, J. M., and Minx, J. C.: The State of Carbon Dioxide Removal – 1st Edition, The State of Carbon Dioxide Removal, https://doi.org/10.17605/OSF.IO/W3B4Z, 2023. 

van Soest, H. L., Aleluia Reis, L., Baptista, L. B., Bertram, C., Després, J., Drouet, L., den Elzen, M., Fragkos, P., Fricko, O., Fujimori, S., Grant, N., Harmsen, M., Iyer, G., Keramidas, K., Köberle, A. C., Kriegler, E., Malik, A., Mittal, S., Oshiro, K., Riahi, K., Roelfsema, M., van Ruijven, B., Schaeffer, R., Silva Herran, D., Tavoni, M., Unlu, G., Vandyck, T., and van Vuuren, D. P.: Global roll-out of comprehensive policy measures may aid in bridging emissions gap, Nat. Commun., 12, 6419, https://doi.org/10.1038/s41467-021-26595-z, 2021. 

van de Ven, D. J., Mittal, S., Gambhir, A., Lamboll, R. D., Doukas, H., Giarola, S., Hawkes, A., Koasidis, K., Köberle, A. C., McJeon, H., Perdana, S., Peters, G. P., Rogelj, J., Sognnaes, I., Vielle, M., and Nikas, A.: A multimodel analysis of post-Glasgow climate targets and feasibility challenges, Nat. Clim. Change, 13.6, 570–578 https://doi.org/10.1038/s41558-023-01661-0, 2023. 

van Diemen, R., Matthews, J. B. R., Möller, V., Fuglestvedt, J. S., Masson-Delmotte, V., Méndez, C., Reisinger, A., and Semenov, S.: Annex I: Glossary, in: Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Shukla, P. R., Skea, J., Slade, R., Al Khourdajie, A., van Diemen, R., McCollum, D., Pathak, M., Some, S., Vyas, P., Fradera, R., Belkacemi, M., Hasija, A., Lisboa, G., Luz, S., and Malley, J., Cambridge University Press, Cambridge, UK and New York, NY, USA, 1793–1820, https://doi.org/10.1017/9781009157926.020, 2022.  

Welder, L. and Grant, N.: pathways-ensemble-analysis, Zenodo [code], https://doi.org/10.5281/zenodo.10197980, 2023. 

Welder, L. and Grant, N.: pathways-ensemble-analysis, Zenodo [code], https://doi.org/10.5281/zenodo.11057268, 2024. 

Weyant, J.: Some Contributions of Integrated Assessment Models of Global Climate Change, Rev. Env. Econ. Policy, 11, 115–137, https://doi.org/10.1093/reep/rew018, 2017. 

Wilson, C., Guivarch, C., Kriegler, E., van Ruijven, B., van Vuuren, D. P., Krey, V., Schwanitz, V. J., and Thompson, E. L.: Evaluating process-based integrated assessment models of climate change mitigation, Climatic Change, 166, 3, https://doi.org/10.1007/s10584-021-03099-9, 2021. 

Download
Short summary
Pathways investigating the link between emissions and global warming have been continuously used to inform climate policy. We have developed a tool that can facilitate the systematic and robust analysis of ensembles of such pathways. We describe the structure of this tool and then show an illustrative application of it. The application indicates the usefulness of the tool to the research community and shows how it can be used to establish best practices.