Abstract

GMD

Geoscientific Model Development

GMD

Geosci. Model Dev.

1991-9603

Copernicus Publications

Göttingen, Germany

10.5194/gmd-19-4633-2026

Spatialize v1.0: a Python/C++ library for ensemble spatial interpolation

Spatialize: a library for ensemble spatial interpolation

Navarro

Felipe

felipe.navarro@amtc.cl

https://orcid.org/0000-0002-6951-3504

Egaña

Alvaro F.

https://orcid.org/0000-0001-8720-4783

Ehrenfeld

Alejandro

Garrido

Felipe

Valenzuela

María Jesús

Sánchez-Pérez

Juan F.

1Advanced Mining Technology Center (AMTC), Universidad de Chile, Avenida Tupper 2007, RM, Santiago, Chile 2Advanced Laboratory for Geostatistical Supercomputing (ALGES), Department of Mining Engineering – AMTC, Faculty of Mathematical and Physical Sciences, Universidad de Chile, Avenida Tupper 2069, RM, Santiago, Chile 3Department of Applied Physics and Naval Technology, Universidad Politécnica de Cartagena (UPCT), Cartagena, Spain

Felipe Navarro (felipe.navarro@amtc.cl)

1June2026

19 10 46334660 8July2025 29August2025 12April2026 29April2026

2026

This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/

This article is available from https://gmd.copernicus.org/articles/19/4633/2026/gmd-19-4633-2026.html

The full text article is available as a PDF file from https://gmd.copernicus.org/articles/19/4633/2026/gmd-19-4633-2026.pdf

Abstract

In this paper, we present Spatialize, an open-source library that implements ensemble spatial interpolation, a novel method that combines the simplicity of basic interpolation methods with the power of classical geostatistical tools, like Kriging. It leverages the richness of stochastic modelling and ensemble learning, making it robust, scalable and suitable for large datasets. In addition, Spatialize provides a powerful framework for uncertainty quantification, offering both point estimates and empirical posterior distributions. It is implemented in Python 3.x, with a C++ core for improved performance, and is designed to be easy to use, requiring minimal user intervention. This library aims to bridge the gap between expert and non-expert users of geostatistics by providing automated tools that rival traditional geostatistical methods. Here, we present a detailed description of Spatialize along with a wealth of examples of its use.

Agencia Nacional de Investigación y Desarrollo

AFB230001

IT23I013

TA24I10053

CIA250010

1Introduction

A significant challenge in the field of geosciences is the issue of sparsity that is often observed in spatial databases, such as soil properties, climate data, or mineral concentrations, which are characterised at limited point locations . The presence of these data gaps hinders a comprehensive understanding of the variable's domain. The central issue, therefore, is estimating values at unmeasured locations. Various spatial interpolation algorithms have been developed for this purpose.

Geostatistics is a field focused on the analysis, estimation, and modelling of spatial variables. Unlike traditional statistics, geostatistics emphasises the spatial dependencies between observations . The Kriging technique, the most relevant exponent of geostatistical interpolation , was initially devised for the estimation of gold reserves . As an unbiased linear estimator that minimises estimation error at each position, Kriging is commonly referred to as BLUE (Best Linear Unbiased Estimator) . Beyond providing robust estimates, Kriging also facilitates the calculation of estimation variance, which is widely used for assessing spatial prediction uncertainty . A variety of user-friendly tools are available for the implementation of Kriging, including PyKrige (last access: 28 May 2026), PySAL (last access: 28 May 2026), gstat (last access: 28 May 2026), automap (last access: 28 May 2026), geoR (last access: 28 May 2026), and fields (last access: 28 May 2026). Nevertheless, it should be noted that the use of Kriging without actual knowledge of the model may result in suboptimal and misleading outcomes . In particular, parameter selection and spatial continuity modelling have a significant effect on the accuracy of Kriging estimates . However, the correct determination of these inputs requires substantial expertise and data , which creates a significant barrier for most potential users. Moreover, the task becomes increasingly complex when the variables under study are of a dynamic, spatio-temporal nature , or are not structurable as a regular grid .

In summary, spatial interpolation tasks, when assessed from the perspective of classical geostatistical analysis, can be time-consuming and require considerable expertise. Consequently, there is a need for more straightforward yet effective spatial interpolation methods that can address highly dynamic spatial problems without necessitating manual spatial analysis tasks.

In contrast to geostatistical approaches, deterministic models employ straightforward calculations; nevertheless, they are only capable of producing estimations , and thus do not offer uncertainty quantifications. The most widely applied of these methods is inverse distance weighting (IDW), a simple yet powerful spatial interpolation method that uses a weighted average of surrounding point values to estimate the unknown value at an unsampled location . In recent years, variants of IDW have been successfully used in a variety of applications, including estimation of air pollution levels , soil moisture and water quality . The main limitation of IDW is that it does not take into account the spatial structure or correlation of the variable being interpolated. This can lead to over-smoothing or under-smoothing of the estimated values, depending on the degree of spatial correlation in the data .

Another promising approach for spatial interpolation is the use of machine learning-based methods, which can learn complex spatial relationships from large datasets without requiring manual spatial modelling . For example, and proposed deep learning-based spatial interpolation methods to estimate solar radiation and interpolate seismic data, respectively. Nevertheless, challenges that arise from deep learning models are, firstly, the need for large amounts of data and computational resources to train them, and secondly, the necessity to measure additional variables other than the one under study. This is especially true for complex spatial-temporal problems, where the number of input variables and temporal observations can be substantial . A further challenge in using deep learning for spatial interpolation is the difficulty of interpreting the results. These models are often referred to as “black boxes”, meaning that the process by which predictions are derived remains uncertain. This can be problematic in situations where transparency and interpretability are important, such as in environmental applications .

In addressing the need for a simple and flexible spatial interpolation technique, able to adapt to highly dynamic phenomena, scalable to big data, interpretable, and most importantly widely accessible to the entire geoscientific community, and independently proposed a new state-of-the-art spatial interpolation method based on ensemble learning. This method combines the simplicity of methods such as IDW with the power of Kriging spatial analysis, which the authors of the latter named Ensemble Spatial Interpolation (ESI). This model is able to provide reliable estimates that are comparable to those of Kriging, while eliminating the need for manual spatial continuity modelling. Its main features are: (a) it is based on a stochastic space partitioning process, which aids in managing large datasets; (b) it is built under an ensemble scheme, which guarantees robustness despite the use of weak local interpolation functions with small subsets of data, and (c) it provides a powerful framework for uncertainty quantification, as it is based on a Bayesian scheme, thus yielding an empirical posterior distribution of the estimate (instead of a single point estimate).

The aim of this article is to present spatialize, a novel software library that facilitates an efficient implementation of ESI. Spatialize has been designed to be easy to use, efficient and flexible. The core of the library is implemented in C++ with a Python 3.x programming API. It is available as an open-source project, making it accessible to researchers and practitioners in industry and academia. The subsequent sections provide a comprehensive overview of the ESI model and the Spatialize library, including its features and capabilities. We also present several examples of how the library can be used in practical applications. Finally, the future directions of the library and its potential impact on spatial interpolation research and practice will be discussed.

2Ensemble spatial interpolation

Ensemble learning is usually regarded as the statistical and computational conception of the “wisdom of the crowd”, whose idea is to collect and combine the points of view of many experts to produce an ensemble result . An ensemble model z^ can be formulated as: 1z^=G(f1(x∗),…,fm(x∗)) where x∗ is a vector of covariances and {f1,…,fm} is a set of weak voter (regression, classification, interpolation, etc.) functions. Function G is an aggregation function that combines the responses from each voter function and it can be as simple as majority voting for classification , averaging for regression, or more sophisticated approaches such as a mixture of experts (MoE) .

Figure 1

Generative process to draw a time-dependent stochastic partition set of size m. Time goes along the x-axis, ending at time given by λ(α), where α controls the average depth for the trees in the forest. Red lines indicate the current time partition cut. Note: Reprinted from “Ensemble Spatial Interpolation: A New Approach to Natural or Anthropogenic Variable Assessment”, by , Natural Resources Research, 30, 3777–3793.

2.1Weak voter function set generation

In the ESI model, the construction of the set of weak voter functions is achieved through the concept of “bootstrapping the space” , which involves generating different spatial configurations through random partitioning of the space where the data are located. These partitions are generated in a way that any combination of data is possible, while preserving their spatial locations and avoiding data clustering (Fig. ). Each partition creates unique data subsets within the partition cells, where any spatial interpolation method can be applied. An unmeasured location is then estimated by combining (aggregating) the estimates derived from all the partition elements across the set of partitions where that location falls .

It is well known that a spatial partition data structure can be represented as a tree , where nodes represent partition spaces and edges indicate containment relationships. This representation enables efficient spatial data querying and operations on the data contained in the spaces, similar to how certain related hierarchical data structures, such as k-d trees and octrees, are used in spatial indexing. Thus, in practice, generating a set of random partitions is equivalent to generating a forest of random tree structures, analogous to how multiple decision trees form a random forest.

The spatialize library uses two methodologies to generate these partitions of space:

(a) Mondrian Forests (MF): As proposed by and introduced by . The latter is a non-parametric Bayesian strategy that is employed for both classification and online regression, which has been demonstrated to be as efficacious as other high-level ensemble learning methods, such as random forests or additive trees .

In the context of ESI, MF is a collection of random tree structures defined as: 2MF(Θ,λ|m)={T1,…,Tm}, Tk∼MT(Θ,λ) where m is the number of random tree structures in the forest and MT(Θ,λ) is the generative process, described in Algorithm 1, which produces random samples of tree structures. The latter represents random space partitions of the target domain Θ that are assumed to be Θ=[a1,b1]×…×[ad,bd]⊂Rd.

Algorithm 1

MT(Θ,λ) Sampling Algorithm

procedure Main()

SampleMT(Θ,λ) {Θ=[a1,b1]×…×[ad,bd]}

end procedure

procedure SampleMT(θ,λ)

SampleMTBranch(θ,λ,0)

end procedure

procedure SampleMTBranch(θ,λ,τ)

E∼Exp(μ(θ)) {μ is a measure on Rd}

if (τ+E)<λ then

dx∼Discrete(p1,…,pd)

x∼U([adx,bdx])

SampleMTBranch(θ>,λ,(τ+E))

SampleMTBranch(θ<,λ,(τ+E))

end if

end procedure

Figure 2

Generative process to draw a topological stochastic partition set of size m. The number of Voronoi nuclei is given by λ(α), where α controls the average depth for the partitions in the forest. Red points indicate the selected locations, among the sampled locations, for the nuclei.

Algorithm 1 implements a temporal stochastic process that generates Mondrian partitions, i.e. nested random partitions aligned with coordinate axes. Parameter λ represents the process's finite lifetime, while τ represents the elapsed time through recursion levels. The recursive partitioning process is as follows: Line 8 samples the “time” until the next cut in the sub-box θ from an exponential distribution, parametrized such that E(E)=1/μ(θ), where μ(θ)=∑i=1d(biθ-aiθ). This ensures smaller sub-boxes are less likely to be partitioned. In line 10, the dimension to be partitioned is selected, where pk determines the probability of selecting dimension k. For a sub-box θ, pk is proportional to (bkθ-akθ), favouring the partition of larger sides. In line 11, the cut point is randomly determined along the selected dimension, creating sub-boxes θ> and θ<. Figure illustrates the partitioning process, with each red line representing a cut at time t for all Tk.

(b) Voronoi Forests (VF): A variation of (a) which employs Voronoi partitions instead of Mondrian trees, in a manner analogous to that described by . However, rather than using a fixed number of nuclei, spatialize employs a random number per partition, calibrated to ensure that the expected number of data points per cell matches that of a Mondrian tree.

Thus, a Voronoi Forest (VF) is a collection of structures defined as: 3VF(Θ,λ|m)={T1,…,Tm}, Tk∼VT(Θ,λ) where VT(Θ,λ) is the generative process that produces a Voronoi partition. The process begins with the selection of K, the number of Voronoi nuclei, which is drawn from a Poisson distribution with parameter λ. Next, a random sample of K nuclei, denoted by ΦK={c1,…,cK}⊆Θ, is randomly generated. Finally, the target domain Θ is partitioned by assigning all contained locations x∈Θ to the nearest nuclei based on the Euclidean distance. A Voronoi cell is thus defined by Li={x∈Θ:‖x-ci‖≤‖x-cj‖∀cj∈ΦK,j≠i}. The process of generating a Voronoi Forest is illustrated in Fig. .

2.1.1Model training

Let us define conditioning data as a set M={zj}j=1Ns of Ns measurements of a variable of interest, obtained at specific spatial locations P={xj}j=1Ns within a particular region of a d-dimensional space. The classical formulation of spatial interpolation can be stated as: find a d-variate function S(P,M) that fulfils the condition S(P,M)(xj)=zj, j={1,…,Ns}

The sub-index (P,M) indicates that the interpolation function is constructed using both the values of the measurements and their locations.

Now, let us assume P⊂Θ. Then, both a Mondrian Forest, MF(Θ,λ|m), and a Voronoi Forest, VF(Θ,λ|m), can be “trained” when a set of d-dimensional data points (the conditioning data) are used to condition the sampling of {T1,…,Tm}. We see, then, that:

Figure 3

Tree structure schema. (a) Decision tree structure. (b) The spatial partition corresponding to the decision tree structure shown in (a). Black points represent conditioning data. Note: Reprinted from “Ensemble Spatial Interpolation: A New Approach to Natural or Anthropogenic Variable Assessment”, by , Natural Resources Research, 30, 3777–3793.

Figure 4

Illustration of θ∗ given a set of data points, which enables Algorithm 1 to be trained using sample data. Note: Reprinted from “Ensemble Spatial Interpolation: A New Approach to Natural or Anthropogenic Variable Assessment”, by , Natural Resources Research, 30, 3777–3793.

(a) When using Mondrian Forests

A trained Mondrian Forest is defined as: 4MF(Θ,λ|P,m)={T1,…,Tm}, Tk∼MT(Θ,λ|P).

A Mondrian tree can be trained by conditioning the partitioning process to the data (Fig. b). Thus, a trained random partition set, MF(Θ,λ|P,m), can be obtained by modifying Algorithm 1 as follows:

For any box θ define θ∗=k(θ,P) as the smallest sub-box containing all conditioning positions in θ (see Fig. ).

The probability of splitting a sub-box (line 8) is replaced by: Sample E∼Exp(μ(θ∗)).

Lines 10 and 11 in Algorithm 1 are adjusted to work on θ∗ instead of θ.

As a result of this modification, sub-boxes containing highly concentrated data are more likely to be partitioned. This offers the advantage of ensuring that most leaf nodes (i.e. the resulting sub-boxes) contain a reasonable amount of conditioning data, thus avoiding spatial clustering.

(b) When using Voronoi Forests

A trained Voronoi Forest (VF) is defined as: 5VF(Θ,λ|P,m)={T1,…,Tm}, Tk∼VT(Θ,λ|P).

A trained random partition set, VF(Θ,λ|P,m), can be obtained by sampling the m sets of Voronoi nuclei from the locations at which measurements of the variable of interest are available, as opposed to being sampled from Θ. For each tree, this results in the set ΦK={c1,…,cK} where K≤Ns and ΦK⊆P.

Thus, the process to obtain samples from VT(Θ,λ|P) is as follows:

Sample K∼Poisson(λ),1≤K≤Ns.

Sample ΦK={c1,…,cK} from the measured locations P

Establish each Voronoi cell as Li={x∈Θ:‖x-ci‖≤‖x-cj‖∀cj∈ΦK,j≠i}.

Sampling from the measured locations P ensures that all partition cells L1,…,LK, generated by sampling VT(Θ,λ|P), will contain at least one measured location.

2.1.2Weak voter function set

For an unmeasured position x∗∈Θ, we define Lk⊂(P,M) as the set of conditioning data points contained within the partition cell where x∗ falls into in tree Tk.

Then, let us consider a base interpolation function S(P,M), which can be any spatial interpolator for which no additional information, other than measurements and their locations, is required to interpolate new positions – such as Kriging or IDW.

Now, for each tree Tk, let SLk be the base interpolation function restricted to Lk. Thus, the kth weak voter function for x∗ is obtained by applying the base interpolator S(P,M) to estimate the value at x∗ using only the points in Lk.

Formally, the weak voter function set is defined as: 6fk(x∗)=SLk(x∗), k=1,…,m.

2.2Interpolation using the trained model

Let us denote xk∗=fk(x∗)=SLk(x∗) as the kth weak voter function for x∗. Then, {xk∗}k=1m corresponds to the set of weak voter functions for x∗ resulting from all m trees.

Thus, the interpolation function Z(P,M) corresponds to the aggregation of these weak voter functions: 7e∗=Z(P,M)(x∗)=G{xk∗}k=1m.

The simplest choice for the aggregation function G is the mean E[⋅]. In this case, the interpolation function becomes: 8eE∗=1m∑k=1mxk∗.

2.3Interpolation precision modelling

A precision model p∗ for Z(P,M)(x∗) can be defined using a loss function L as follows: 9p∗=EP^{xk∗}mLe∗,{xk∗}m.

Equation () represents a generalisation of the mean-variance concept within the context of Bayesian variability. Here, EP^(⋅), represents the expectation over the posterior distribution, averages L over all sampled partitions, weighted by their posterior probability.

When using the mean-based interpolator eE∗, we can define an associated interpolation variance (or error) VeE∗(x∗) as: 10pE∗=VeE∗(x∗)=1m∑k=1mxk∗-eE∗2.

2.4Rule of thumb for parameter choice 2.4.1When using Mondrian Forests

The domain parameter Θ can be considered as any bounding box containing the positions of the conditioning data P. Thus, in practice, the only two parameters of the model are:

The number of partitions (or tree structures) m. A reasonable suggestion for this parameter is that higher is better, keeping in mind that higher values will directly impact time performance. Experiments have shown that certain stability is reached for m≥500 , so this would be a good starting point.

The process lifetime λ. The only restriction for this parameter is that it must be positive. This renders the selection, or any sensitivity analysis, of its value challenging. In order to address this issue, a function of a normalised parameter α∈[0,1) is used to obtain suitable values for λ, which is defined as follows:11λ(α)=1μ(Θ)(1-α).

In practice, α controls the average tree depth in the forest, determining how finely the space is partitioned. In this way, α=0 will generate the coarsest partition, while α→1 will generate finer ones. α must be carefully chosen to ensure that the base interpolation function SLk has sufficient sample data. Once m has been defined, it is recommended to use cross-validation to find the optimal α, typically within [0.7,0.95].

2.4.2When using Voronoi Forests

As seen in Mondrian Forest, the domain parameter Θ may be regarded as any bounding box encompassing P. Consequently, the parameters of the model remain identical, yet their respective roles and practical considerations differ:

The number of partitions (or tree structures) m. A reasonable suggestion for this parameter is that higher is better, keeping in mind that higher values will directly impact time performance. However, the possibility of reaching stability for a certain value of m is yet to be studied in the context of Voronoi Forests.

The process lifetime λ. This parameter determines the expected number of Voronoi nuclei for each partition. In the context of Voronoi Forest, λ is related to the mean of the Poisson distribution. Although the only theoretical restriction for this parameter is that it must be greater than one, in spatialize, it is also constrained to match the expected number of leaves in a Mondrian tree. To this end, λ is calculated by multiplying the parameter α, defined in the context of Mondrian forests, by a factor according to the number of observations Ns, as follows:12λ(α)=12×Ns×α,α∈[0,1).

As in the case of Mondrian partitions, α also controls how coarse or fine the partitions are by affecting the number of Voronoi nuclei. Thus, α=0.25 will generate the coarsest partition, while α→1 will generate finer ones.

In the context of sampling from a trained tree VT(Θ,λ|P), additional considerations arise with respect to m and λ. Firstly, the number of nuclei, K(λ(α)), must not exceed the number of observations, Ns. Secondly, as K(λ(α))→Ns, when sampling from P, the number of possible Voronoi nuclei combinations decreases, and thus a large m value may result in duplicated partitions. To avoid such inefficiencies, it is recommended that m≪NsK, or alternatively, not employing trained trees (also possible on spatialize). Alternatively, spatialize also permits the direct sampling of ΦK from Θ (i.e. not employing trained trees).

3The spatialize library

Spatialize is an open source library for spatial analysis which offers different tools for implementing the Ensemble Spatial Interpolation (ESI) model. The main motivation behind the development of the spatialize library was to provide the scientific and technical community with a robust and automated spatial estimation tool that can be used across different disciplines by researchers and professionals who are not experts in geostatistics. In this sense, spatialize provides automated tools that eliminate the need for manual spatial analysis and extensive domain expertise.

The spatialize library is organized into three functional layers. First, the API is implemented in Python, offering three high-level functions: esi_griddata() performs ESI estimation on regular grids, esi_nongriddata() generates estimates at arbitrary spatial locations, and esi_hparams_search() searches for optimal ESI parameters. The latter employs cross-validation to determine the parameter combination that yields the minimum error out of a previously defined set. Second, the library provides aggregation functions according to Eq. (), as well as precision estimation functions following Eq. (), including a class for implementing custom precision functions. Test datasets are also included. Third, the spatial estimation methods are efficiently implemented in C++ for optimal performance.

The three main functions – esi_hparams_search(), esi_griddata(), and esi_nongriddata() – share three mandatory parameters: points, an n-samples × n-dimensions array specifying sample locations; values, an n-samples × 1 array containing the observed values at these locations; and xi, the query locations where estimates are required (as a standard array for non-gridded data or a meshgrid for gridded data). Optional arguments include ESI-specific parameters (n_partitions, alpha, data_cond, agg_function, seed, local_interpolator, and p_process) and local interpolator-specific parameters. A comprehensive description of these functions and their arguments can be found in the user manual, which is provided as supplementary material.

Spatialize 1.0.2 offers two alternatives for ESI local interpolators: Inverse Distance Weighting (IDW), a deterministic interpolator that assigns greater weighting to points in closer proximity, and an Ordinary Kriging variant with a normalized covariance matrix. Additionally, an implementation of traditional IDW is provided. In a different vein, two space partitioning methods are available, Mondrian Forests and Voronoi Forests

Currently, spatialize offers support for up to two dimensions when using Voronoi partitions and five dimensions when using Mondrian. For the 4D (space-time) and 5D (spatial with two angles, for fault description, for example) case, the implementation includes only IDW as local ESI interpolator.

Table 1

Comparison of spatialize with commonly used spatial interpolation libraries.

Feature Spatialize v1.0.2 SciPy scikit-learn PyKrige GSTools Pyinterpolate gstat geoR Core Interpolation Methods Ensemble Spatial Interpolation ✓ – – – – – – – Simple Kriging – – – – ✓ ✓ ✓ ✓ Ordinary Kriging – – ✓^a ✓ ✓ ✓ ✓ ✓ Universal Kriging – – – ✓ ✓ ✓ ✓ ✓ Inverse distance weighting ✓ ✓^b ✓^c – – ✓ ✓ – Radial basis functions – ✓ – – – – – – Uncertainty Quantification Kriging variance/uncertainty – – ✓^a ✓ ✓ ✓ ✓ ✓ Empirical posterior distributions ✓ – – – – – – – Custom loss functions ✓ – – – – – – – User Input Automatic parameter fitting ✓ – ✓ ✓ ✓ ✓ ✓^d ✓ Manual variogram selection – – – ✓ ✓ ✓ ✓ ✓ Spatial Support Gridded data ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Non-gridded data ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ High-dimensional (> 3D) ✓^e ✓^f ✓ – ✓ – ✓ – Scalability and Performance Parallelization support – – ✓ – ✓ ✓ ✓ – C++/compiled core ✓ – – – – – ✓ – API Language Python ✓ ✓ ✓ ✓ ✓ ✓ – – R – – – – – – ✓ ✓

^a Through Gaussian Process Regression (GaussianProcessRegressor). ^b Must be manually implemented using (scipy.spatial.distance). ^c Through k-nearest neighbors (KNeighborsRegressor). ^d Available using automap. ^e Only available with Mondrian partitions. ^f RBFInterpolator supports arbitrary dimensions.

Table offers a comparative analysis of the capabilities of spatialize and those of libraries commonly utilized for spatial interpolation.

The functionality of the aforementioned tools is demonstrated in the subsequent section.

4Usage examples

The standard usage of spatialize follows a clear roadmap: 1.

definition of inputs (points, values, xi);

hyperparameter optimization using the esi_hparams_search function;

execution of the ESI algorithm using either esi_griddata() or esi_nongriddata();

retrieving the estimation with result.estimation() and the uncertainty quantification with result.precision().

This section presents usage examples demonstrating the core functionality of spatialize, including hyperparameter optimization, gridded and non-gridded data estimation, and uncertainty quantification.

These examples illustrate how automated grid searches over ESI parameters make geostatistical spatial estimation accessible without requiring expert selection of a priori parameters, while achieving performance comparable to or better than other automatic spatial estimation tools, as well as offering uncertainty quantifications. For comparison, examples using SciPy and PyKrige are also presented.

4.1Datasets

The examples are built upon two distinct case studies: a synthetic dataset and a real-world dataset.

4.1.1Synthetic dataset

The synthetic dataset consists of a two-dimensional surface defined by a cubic-type function, adapted from the scipy.inter polate.griddata documentation. This function exhibits non-linear spatial variation with oscillatory features, making it suitable for evaluating interpolation performance under controlled conditions.

The dataset comprises a regular grid of 100 × 200 points spanning the unit square [0,1]×[0,1], with 1000 randomly sampled data points and ground truth values available at all grid locations.

The primary purpose of using synthetic data is to enable thorough quantitative performance evaluation by calculating error metrics (RMSE, MAE, etc.) across the entire estimation grid. This is not possible in real-world applications, where reference values are typically limited to sampled locations.

Figure 5

(a) Ground truth cubic function alongside (b) randomly sampled conditioning data (n=1000) used for spatial interpolation.

Code snippet 1 shows the generation of the grid, sampling points, and corresponding values generated by the cubic type function. Figure illustrates the underlying function alongside the randomly sampled conditioning data used for interpolation.

In this case study, we evaluate spatialize performance on gridded estimation using the esi_griddata() function. We compare ESI implementations with IDW and Kriging as local interpolators (ESI-IDW and ESI-Kriging, respectively) against traditional IDW and SciPy interpolators. We employ the esi_hparams_search() function to automatically optimize ESI hyperparameters through grid search, demonstrating the library's capability for automated tuning without requiring expert knowledge. Additionally, we demonstrate the process of quantifying uncertainty through built-in and custom loss functions.

4.1.2Real-world dataset

The real-world dataset consists of copper grade measurements from drill holes in the Andes, representing a typical mineral exploration scenario with sparse spatial sampling. The irregular sampling pattern reflects realistic exploration drilling strategies where sample locations are constrained by geological, topographical, and economic factors.

The dataset comprises 400 copper grade measurements at irregularly spaced locations over a [0,400]×[0,600] spatial extent. The estimation domain is discretized using a regular two-dimensional grid with uniform 2-unit spacing, resulting in 200 × 300 nodes for a total of 60 000 estimation points.

The dataset employed for this example, as well as corresponding Ordinary Kriging baseline estimates, are available directly through spatialize, as shown in Code snippet 2.

The available samples are displayed in Fig. .

Figure 6

Copper grade measurements used for spatial interpolation.

In this case study, we demonstrate spatialize performance on real-world non-gridded data using the esi_nongriddata() function. We compare ESI-IDW with different partitioning strategies (Mondrian and Voronoi) against ESI-Kriging and Ordinary Kriging. We assess performance via cross-validation, since ground truth values are unavailable at unsampled locations.

4.2Synthetic data case study

This section presents the procedure and results from the synthetic data case study, demonstrating the use of spatialize on gridded data.

4.2.1Spatialize: ESI-IDW implementation

We begin by implementing ESI with IDW as local interpolator.

First, we perform a parameter search using the esi_hparams_search function. As shown in Code snippet 3, this function receives ranges or sets of parameters that produce different combinations for interpolation. We define a comprehensive set of combinations, including different exponent values, alpha values, and aggregation functions. Additionally, we use the Voronoi partitioning method because it generates more regular partition elements, which aligns well with the radial nature of the local IDW interpolator.

Since interpolation methods reproduce reference data values at grid points by construction, the parameter search function employs K-fold cross-validation to calculate the error. K points are removed from the dataset in each iteration, and the estimation error is calculated at these locations. The final error metric is obtained by averaging across all iterations. In this example, we use K=10.

Figure 7

Cross-validation error for grid search using IDW as local interpolator. (a) Histogram of errors; (b) errors in the sequence of scenarios during the search.

Figure 8

(a) ESI grid search best estimation using IDW as local interpolator; (b) Precision obtained with MSE (default) loss function.

In Code snippet 3, the object search_result stores the grid search results. Figure displays the cross-validation errors for the 100+ parameter combinations evaluated during the search. The dual visualization provides complementary insights: the histogram (left panel) shows the distribution of estimation errors, while the sequential plot (right panel) reveals the evolution of error across scenarios. This plot can be quickly obtained using the method search_result.plot_cv_error().

The minimum error of 0.0114 was achieved in scenario number 8, which corresponds to the following parameters:

Finally, Code snippet 4 performs the ESI-IDW estimation using the optimal parameters found in the search. For convenience, the estimation function is designed to accept the dictionary returned by the method search_result.best_result() directly as an argument

The n_partitions parameter can be overwritten in the ESI implementation, which allows for the use of a smaller number of partitions in the grid search for faster parameter optimization.

. The estimates can be then retrieved using the result.estimation() method, while their corresponding precision is obtained through result.precision().

Figure presents the ESI-IDW estimation alongside its corresponding precision map, calculated with the default loss function (MSE). The resulting interpolation partially captures the structural features of the reference function, though some differences remain, particularly in regions characterized by pronounced spatial variation. Accordingly, the precision map reveals elevated uncertainty in these same regions.

4.2.2Spatialize: ESI-Kriging implementation

We now employ Kriging as the local interpolator for ESI, following the same parameter search and estimation procedure described in the ESI-IDW example. The underlying partitioning is carried out using the Mondrian forest process.

Code snippet 5 implements the parameter optimization process. We evaluate four omnidirectional variogram models across a range of nugget and sill values. The function generates and compares all possible parameter combinations to identify the optimal configuration.

Figure 9

Cross-validation error for ESI grid search using Kriging as local interpolator. (a) Histogram of errors; (b) Errors in the sequence of scenarios during the search.

Figure shows the cross-validation error frequency graphs (left) and the different error levels for each scenario (right).

Finally, we perform the estimation based on the parameters of the scenario with the lowest cross-validation error, which in this case has a value of 0.0053 at index 603. The optimal parameters are the following:

Code snippet 6 shows the implementation of ESI, applying the optimized parameters through the best_params_found argument. Just as in the ESI-IDW case, using result.estimation() returns the estimates, while result.precision() computes their precision.

Figure 10

(a) ESI grid search best estimation using Kriging as local interpolator; (b) Precision obtained with MSE (default) loss function.

Figure presents the ESI-Kriging estimation alongside its corresponding precision map, calculated with the default loss function (MSE). The ESI-Kriging interpolation successfully recovers the structural features of the reference function, despite the absence of domain-specific knowledge or formal variographic analysis in the model specification. The precision map reveals generally low uncertainty throughout the domain, with localized increases near boundaries of high-gradient regions, consistent with expected interpolation behaviour in areas of rapid spatial variation.

4.2.3Spatialize: custom precision functions

In this section, we demonstrate the implementation of custom precision metrics for ESI estimates. As a reminder, precision in ESI is calculated by aggregating a loss function that compares each partition-specific ESI estimate with the final aggregated estimation.

The spatialize library provides a modular framework for implementing custom loss functions. For instance, Code Snippet 7 demonstrates a custom implementation of the operational error loss, which is also available as a built-in class in the lossfunction module.

The function architecture consists of two nested components: an outer function that computes necessary parameters, and an inner function decorated with @loss() that specifies both the point-wise error calculation and the aggregation method for combining scenario-specific losses into a single precision layer. The operational error precision can then be obtained by using result.precision(op_error_precision).

Figure 11

Mean squared error and operational error precision for an ESI-IDW interpolation. (a) The original cubic type function; (b) ESI-IDW interpolation; (c) Mean squared error precision; (d) Operational error precision.

Figure 12

Mean squared error and operational error for an ESI-Kriging interpolation. (a) The original cubic type function; (b) ESI-Kriging interpolation; (c) Mean squared error precision; (d) Operational error precision.

Figures and compare the default MSE-based precision (contained in the ESIResult object) with the precision calculated using the custom operational error function for ESI-IDW and ESI-Kriging, respectively.

Both precision functions illustrate that ESI-IDW estimates have a higher uncertainty than those produced by ESI-Kriging in this scenario. However, each loss function highlights different aspects of this uncertainty. The mean squared error precision emphasizes regions of higher uncertainty, whereas the operational error precision clearly delineates the uncertainty in the spatial structure itself as represented by the interpolation.

4.2.4Comparison with traditional IDW

To demonstrate the added value of ensemble spatial interpolation, we now compare the estimates provided by standard IDW interpolation against the previously presented ESI-IDW estimations.

First, Code snippet 8 implements an automated hyperparameter optimization using the idw_hparams_search function, analogous to the esi_hparams_search function.

Figure 13

Cross-validation error for grid search for IDW interpolator without ESI. (a) Histogram of errors; (b) Errors in the sequence of scenarios during the search.

Figure illustrates the cross-validation errors for each scenario in the grid search. The minimum error obtained has a value of 0.0155 at index 0. This scenario used a radius of 0.07 and an exponent of 0.001.

Then, we generate estimates using the best hyperparameters found in the search, as shown in Code snippet 9.

Figure 14

Best estimation using IDW interpolator without ESI. (a) The original cubic type function; (b) Traditional IDW interpolation.

Figure shows the IDW estimates obtained using the optimal parameters. While the interpolator partially recovers the original image structures, ESI-IDW (Fig. ) achieves notably superior resemblance to the reference image.

4.2.5Comparison with SciPy

To benchmark our results against established Python interpolation tools, we employ the griddata function from the scipy.interpolate module using three standard methods: “nearest neighbour”, “linear” and “cubic” interpolation. Code snippet 10 demonstrates the implementation of these three approaches.

Figure 15

Comparison of SciPy estimates for the same gridded dataset. (a) Nearest-neighbour interpolation; (b) Linear interpolation; (c) Piecewise cubic interpolation.

Figure presents the estimates generated by the three SciPy interpolation methods. Since the ground-truth function is cubic, the cubic interpolator unsurprisingly achieves the best visual reconstruction. This illustrates the inductive bias effect: when prior knowledge about the underlying function is available, selecting a matching interpolator yields superior results. However, in practical scenarios lacking such previous knowledge, choosing an appropriate interpolator becomes challenging.

Finally, it should be noted that the interpolators implemented in the SciPy library are deterministic and therefore do not provide uncertainty quantification.

Figure 16

Comparison of ESI and SciPy estimates for the same gridded dataset. (a) ESI-IDW interpolation; (b) ESI-Kriging interpolation; (c) IDW interpolation; (d) SciPy nearest-neighbour interpolation; (e) SciPy linear interpolation; (f) SciPy piecewise cubic interpolation.

4.2.6Performance comparison

This section provides a visual and quantitative comparison of all interpolators employed in the synthetic data case study.

Figure presents the estimates side-by-side. Visually, ESI Kriging, along with the linear and cubic interpolators, appear to offer the best reconstruction of the target image. Notably, ESI achieves acceptable results without prior structural assumptions, as opposed to the SciPy interpolators.

Figure 17

Comparison of ESI and SciPy estimation errors for the same gridded dataset. (a) ESI-IDW interpolation; (b) ESI-Kriging interpolation; (c) IDW interpolation; (d) SciPy nearest-neighbour interpolation; (e) SciPy linear interpolation; (f) SciPy piecewise cubic interpolation.

Figure presents error maps for each method. Notably, the error patterns of methods that failed to fully reproduce the target structure visually – such as ESI-IDW, IDW, and nearest-neighbor interpolation – exhibit the cubic spatial structure itself, indicating their inability to capture it. In contrast, the error maps of ESI-Kriging, linear interpolation, and cubic interpolation show minimal structural patterns, confirming their superior capture of the underlying function. Additionally, all methods exhibit higher errors in areas with more pronounced value gradients.

Finally, Table presents the mean absolute error (MAE), root mean squared error (RMSE), and mean squared error (MSE) for all interpolations. The cubic interpolator achieves the lowest values across all three metrics, which is expected given the cubic nature of the underlying spatial structure. ESI-Kriging achieves the second-best performance, followed by the linear interpolator. Classic IDW demonstrates the highest errors.

Table 2

Performance metrics for ESI and SciPy interpolations for the same gridded dataset.

MAE RMSE MSE ESI IDW 0.011126 0.018622 0.000347 ESI Kriging 0.004882 0.010482 0.000110 IDW 0.018195 0.028064 0.000788 Nearest 0.012256 0.020766 0.000431 Linear 0.005840 0.012281 0.000151 Cubic 0.001769 0.005118 0.000026

Although ESI-IDW did not achieve exceptional performance in this case study, its superior results relative to traditional IDW highlight the value of the ensemble framework in enhancing the performance of a fundamentally deterministic interpolation method.

4.3Real-world case study

This section presents the procedure and results from the real-world data case study, demonstrating the use of spatialize on non-gridded data.

4.3.1Spatialize: ESI-IDW with Mondrian partitioning

We begin by implementing ESI with IDW as local interpolator. As mentioned within Sect. , both Mondrian Forest and Voronoi Forest can be used as the partitioning method in the case of ESI-IDW

In contrast, when using Kriging local interpolation, spatialize currently implements only the Mondrian Forest partitioning approach.

. In this first example, we employ Mondrian partitions.

First, we employ the esi_hparams_search() function to obtain optimal parameters for the estimation. The evaluated set is shown in Code snippet 11.

Figure illustrates the cross-validation errors of the 168 search scenarios.

Figure 18

Cross-validation error for the ESI-IDW non-gridded estimation parameter grid search. (a) Histogram of errors; (b) Errors in the sequence of scenarios during the search.

The best-case scenario has the following parameters:

As observed, the optimal exponent for the IDW interpolator is relatively high. Recall that when the exponent equals zero, the local estimate becomes a simple average of all neighbors, producing a smoothing effect similar to Kriging. Conversely, as the exponent approaches infinity, IDW converges to a nearest neighbor estimator, resulting in more abrupt spatial transitions.

With these optimal parameters, we then perform the corresponding estimation using the esi_nongriddata() function, as shown in Code snippet 12.

Just like in the gridded-data case, the estimates can be retrieved using the result.estimation() method, while their corresponding precision is obtained through result.precision().

Once we obtain the prediction, we use the lf.OperationalErrorLoss() function to implement the “Operational Error” loss function for ESI precision calculations:

This loss function can then be passed to the precision method: result.precision(op_error).

Figure 19

(a) Best parameter non-gridded estimation with ESI-IDW; (b) Precision obtained with operational error loss function.

Figure shows the resulting estimation and the corresponding precision based on the operational error loss function. The figure exhibits a somewhat pixelated texture, which can be attributed to the high optimal exponent value for the local IDW interpolator. The precision map reveals elevated uncertainty near regions corresponding to the maximum estimated values.

4.3.2Spatialize: ESI-IDW with Voronoi partitioning

We now employ ESI-IDW with Voronoi as the partitioning method, following the same parameter search configuration and estimation procedure described in the Mondrian example.

Figure illustrates the cross-validation errors across the 336 evaluated parameter combinations. Note that the Voronoi partitioning method yields twice the number of search scenarios compared to Mondrian partitioning, as the data_cond parameter offers two options: conditioning the partition on sample locations or using an unconditioned partition.

Figure 20

Cross-validation error for the ESI-IDW non-gridded estimation parameter grid search, using Voronoi partitioning method. (a) Histogram of errors; (b) Errors in the sequence of scenarios during the search.

In this case, the best-case scenario has the following parameters:

As in the previous case using Mondrian partitioning, the optimal exponent for the local interpolator is relatively high, implying that the resulting estimates should exhibit abrupt spatial transitions. However, cross-validation selected a coarser partition for Voronoi (alpha=0.5) than for Mondrian (alpha=0.9). Although the alpha parameter controls partition granularity in both methods, its operational mechanism differs between them, explaining this discrepancy.

Figure 21

a) Best parameter non-gridded estimation with ESI-IDW; (b) Precision obtained with operational error loss function, this time with Voronoi partition method.

Next, we perform the corresponding ESI estimation using the esi_nongriddata() function with the optimal parameters for Voronoi partitioning. Figure presents the resulting estimates and their associated precision based on the operational error loss function.

As observed with Mondrian partitioning, the estimates exhibit abrupt spatial transitions. However, the precision map exhibits substantially higher values, indicating greater uncertainty in the Voronoi implementation. Whether this elevated uncertainty reflects geometric differences between the partitioning methods or the contrasting optimal alpha remains unclear and requires further investigation.

4.3.3Spatialize: ESI-Kriging

Analogous to the ESI-IDW case, we present estimates for the ESI-Kriging case, where Kriging serves as the local interpolator for ESI

The Voronoi partitioning method cannot be applied with the local Kriging interpolator, as this functionality has not yet been implemented in spatialize.

The parameter grid, presented in Code snippet 13, includes four omnidirectional covariance function models, each evaluated across multiple values of alpha, nugget, range, and sill parameters.

Figure 22

Cross-validation error for the ESI-Kriging non-gridded estimation parameter grid search. (a) Histogram of errors; (b) Errors in the sequence of scenarios during the search.

Figure illustrates the cross-validation error for all evaluated parameter combinations.

In this case, the optimal parameters are:

Figure 23

(a) Best parameter non-gridded estimation with ESI-Kriging; (b) Precision obtained with operational error loss function.

Using the optimal parameters, we generate the corresponding estimates via the esi_nongriddata() function. We compute the precision using the operational error loss function, following the same procedure as in the ESI-IDW case. Figure presents the resulting estimation and precision maps. Compared to ESI-IDW, the ESI-Kriging estimates exhibit substantially smoother spatial transitions.

4.3.4Comparison with Kriging

To compare the ESI estimates against the gold standard method in the mining industry, we now present Ordinary Kriging estimates.

To produce the Ordinary Kriging estimate, an omnidirectional experimental variogram was computed and fitted to a theoretical variogram model comprising two nested spherical structures (Eq. ). 13γ(h)=sill11.5hrange1-0.5hrange13+sill21.5hrange2-0.5hrange23 where: sill1=0.17, range1=95, sill2=0.14, range2=220.

Figure 24

Experimental (dots) and theoretical (blue line) variograms for the Global Ordinary Kriging example.

Both experimental and theoretical variograms are shown in Fig. .

Additionally, we have implemented an automated workflow that employs scikit-learn to run a parameter grid search for identifying the optimal variogram model, followed by PyKrige to perform Ordinary Kriging using the identified optimal parameters.

This automated approach differs from the process carried out by the expert in three ways: It uses global Kriging instead of domains depending on local stationarities; it determines variogram parameters in a heuristic way rather than through expert judgment; and it assumes that the variogram is the same in all directions (isotropy), ignoring potential anisotropies in the data. The result obtained for the parameter search is as follows:

Figure 25

(a) Global Ordinary Kriging estimation based on non-gridded example data points; (b) Automated Kriging estimation with PyKrige.

Figure compares the manual expert implementation of Global Ordinary Kriging (left) – previously presented in – with the automated implementation (right).

As we can see in Fig. , the estimation obtained with the automated algorithm is comparable to the ordinary Kriging performed manually on the basis of expert judgment. This is because the example is an ideal case of isotropy and stationarity, i.e. it fulfils the basic assumptions of a linear method such as Kriging.

4.3.5Performance comparison

This section provides a visual and quantitative comparison of all interpolators employed in the real-world data case study.

Figure 26

Comparison of ESI and Ordinary Kriging estimates for the same non-gridded dataset. (a) ESI-IDW-Mondrian interpolation; (b) ESI-IDW-Voronoi interpolation; (c) ESI-Kriging interpolation; (d) PyKrige automated Ordinary Kriging interpolation; (e) Ordinary Kriging interpolation.

Figure presents the estimates side-by-side. Both ESI-IDW implementations yield visually similar results characterized by abrupt spatial transitions. In contrast, ESI-Kriging, PyKrige, and Ordinary Kriging produce substantially smoother estimates. Notably, ESI-Kriging exhibits moderately less smoothing than the conventional Kriging approaches, potentially achieving a more favourable balance between spatial continuity and preservation of local variability.

We employed two approaches for quantitative performance evaluation. First, we performed leave-one-out cross-validation with global hyperparameter optimization, as individual optimization for each of the 400 samples would be computationally prohibitive. Second, we conducted 10-fold cross-validation with hyperparameter optimization performed within each fold.

Table presents the mean absolute error (MAE), root mean squared error (RMSE), and mean squared error (MSE) for all automated interpolation methods, obtained using leave-one-out cross-validation. For both ESI and Kriging methods, parameters were determined from the previously conducted grid searches to avoid re-optimization within each fold. Both ESI-IDW implementations achieve comparable performance, although inferior to ESI-Kriging and Ordinary Kriging, with the Mondrian partitioning yielding slightly smaller errors. Ordinary Kriging, implemented with PyKrige, achieves the lowest RMSE and MSE values, while ESI-Kriging achieves the lowest MAE. Since RMSE and MSE penalize larger errors more heavily than MAE, these results suggest that ESI-Kriging produces more accurate estimates on average, but is more susceptible to occasional large prediction errors.

Table 3

Leave-one-out cross-validation performance metrics for ESI and PyKrige interpolations for the same non-gridded dataset.

MAE RMSE MSE ESI IDW Mondrian 0.096676 0.151732 0.023023 ESI IDW Voronoi 0.096933 0.154417 0.023845 ESI Kriging 0.088666 0.150786 0.022736 PyKrige 0.090447 0.148988 0.022197

Table presents the mean absolute error (MAE), root mean squared error (RMSE), and mean squared error (MSE) for all automated interpolation methods, obtained using 10-fold cross-validation. ESI-Kriging achieved the best performance across all metrics, followed by PyKrige's Ordinary Kriging. As in the leave-one-out case, ESI-IDW with Mondrian partitioning outperformed the Voronoi method. For all methods and metrics, errors were higher than in the leave-one-out case.

Table 4

K-fold cross-validation performance metrics for ESI and PyKrige interpolations for the same non-gridded dataset (K=10).

MAE RMSE MSE ESI IDW Mondrian 0.102496 0.161492 0.026080 ESI IDW Voronoi 0.101940 0.162085 0.026271 ESI Kriging 0.091971 0.152885 0.023374 PyKrige 0.094391 0.153786 0.023650

Comparing the two validation approaches reveals two key findings.

First, the higher errors across all methods in 10-fold validation – where training datasets are smaller – confirms that interpolation quality decreases with reduced data availability. Second, while ESI-Kriging outperformed PyKrige under 10-fold validation, it showed higher RMSE and MSE under leave-one-out validation. This suggests the ESI ensemble scheme may provide additional benefits when limited data availability makes both parameter optimization and interpolation more challenging. However, these results may be influenced by the specific fold configurations.

Given the ideal conditions in this dataset – isotropy, stationarity, and linear spatial correlation, as confirmed by the expert variogram analysis – one might expect Ordinary Kriging to perform best. However, as observed with IDW in the synthetic data case study, ESI's non-linear ensemble scheme proves beneficial, particularly under limited data availability. Moreover, ESI-Kriging offers competitive performance with automated parameter selection. This provides significant advantages in more complex scenarios involving spatial heterogeneity, where determining appropriate variogram models is challenging or where available models in automatable tools such as PyKrige may be unsuitable.

5Conclusion and future work

We have introduced spatialize, an open-source Python library that implements ensemble spatial interpolation (ESI), a novel data-driven geostatistical technique grounded in computational statistics and ensemble learning principles. The library addresses a critical gap in geostatistics by providing automated functionality with performance comparable to expert-level methods while quantifying prediction uncertainty.

The primary strength of spatialize lies in its minimal requirement for user intervention, enabling researchers without specialized geostatistical expertise to obtain optimal spatial predictions. In cases where user input is necessary – such as hyperparameter selection – the library provides tools that facilitate the process.

Our vision is for spatialize to become one of the leading open-source geostatistical libraries available in Python. To this end, future work includes the following enhancements:

Implement additional partition generation processes, such as Mondrian processes with random rotations, to increase the expressiveness and diversity of space partitions.

Integrate adaptive local interpolators to improve performance in heterogeneous spatial domains.

Incorporate interpolators compatible with spatial statistical tools, including conditional autoregressive (CAR)-type models.

Extend functionality to accommodate the estimation of categorical variables.

Implement GPU support to enhance computational performance and enable seamless integration with platforms such as Google Colab.

Provide classical geostatistical tools for expert users, including more general Kriging implementations that support nested structures and other advanced configurations.

Code and data availability

Both the source code for spatialize and the usage examples shown above are available in the spatialize project on Github , which can be accessed at https://github.com/alges/spatialize (28 May 2026). The version presented in this paper is archived on Zenodo at 10.5281/zenodo.16782612 , along with a supplementary User Manual.

In particular, the usage examples are available to be run in the examples/scripted_examples folder.

The simulated data estimation example (Sect. ) employs the following scripts:

esi_griddata_mondrian_voronoi.py

esi_griddata_and_scipy.py

idw_and_esi_grid_search.py

esi_custom_precision.py

The real-world data estimation example (Sect. ) employs the following scripts:

esi_nongriddata.py

pykrige_example.py

The data that is used in the real-world example is part of the spatialize library package and is installed with it in spatialize.resources. Besides, the spatialize.data module contains the functions for loading this data.

The supplement related to this article is available online at https://doi.org/10.5194/gmd-19-4633-2026-supplement.

Author contributions

Research conceptualisation, paper preparation, and analysis were performed by AFE, AE and FN. The ESI framework was first developed by FG, FN and AFE. FN, AFE, AE, FG, MJV, and JFS contributed to the paper edits and technical review.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

This work was supported in part by the Chilean National Agency for Research and Development ANID) under Project FONDEF IT23I013, FONDEF TA24I10053, ANID/BASAL AFB230001 and ANID AMTC CIA250010. We acknowledge the computational resources provided, the useful insights, and the support provided by the whole ALGES team.

Financial support

This research has been supported by the Agencia Nacional de Investigación y Desarrollo (grant nos. ANID/BASAL AFB230001, ANID AMTC CIA250010, IT23I013, and TA24I10053).

Review statement

This paper was edited by Klaus Klingmüller and reviewed by two anonymous referees.

References Abdulwadood et al.(2021)Abdulwadood, Al-Hassany, and Mustafa

Abdulwadood, H. W., Al-Hassany, G. S., and Mustafa, R. I.: IDW Interpolation of Soil Moisture Retention Curve Utilizing GIS, IOP Conference Series: Earth and Environmental Science, 856, 012040, 10.1088/1755-1315/856/1/012040, 2021.

Abzalov(2016)

Abzalov, M.: Applied Mining Geology, vol. 12 of Modern Approaches in Solid Earth Sciences, Springer, 10.1007/978-3-319-39264-6, 2016.

Assibey-Bonsu(2017)

Assibey-Bonsu, W.: Professor Danie Krige's First Memorial Lecture: The Basic Tenets of Evaluating the Mineral Resource Assets of Mining Companies, as Observed in Professor Danie Krige's Pioneering Work Over Half a Century, in: Geostatistics Valencia 2016, edited by: Gómez-Hernández, J., Rodrigo-Ilarri, J., Rodrigo-Clavero, M. E., Cassiraga, E., and Vargas-Guzmán, J. A., Quantitative Geology and Geostatistics, Springer, 3–25, 10.1007/978-3-319-46819-8_1, 2017.

Boroh et al.(2022)Boroh, Kouayep Lawou, Mfenjou, and Ngounouno

Boroh, A. W., Kouayep Lawou, S., Mfenjou, M. L., and Ngounouno, I.: Comparison of geostatistical and machine learning models for predicting geochemical concentration of iron: case of the Nkout iron deposit (south Cameroon), J. Afr. Earth Sci., 195, 104662, 10.1016/j.jafrearsci.2022.104662, 2022.

Breiman(2001)

Breiman, L.: Random forests, Mach. Learn., 45, 5–32, 10.1023/A:1010933404324, 2001.

Chilès and Desassis(2018)

Chilès, J.-P. and Desassis, N.: Fifty Years of Kriging, Springer International Publishing, pp. 589–612, 10.1007/978-3-319-78999-6_29, 2018.

Cohen and Intrator(2000)

Cohen, S. and Intrator, N.: A hybrid projection based and Radial Basis Function architecture, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Springer, 10.1007/3-540-45014-9_14, 2000.

Collins et al.(2002)Collins, Schapire, and Singer

Collins, M., Schapire, R. E., and Singer, Y.: Logistic regression, AdaBoost and Bregman distances, Mach. Learn., 48, 253–285, 2002.

de Sousa Mendes et al.(2020)de Sousa Mendes, Demattê, Souza, Urbina Salazar, and Amorim

de Sousa Mendes, W., Demattê, J. A., Souza, A., Urbina Salazar, D., and Amorim, M.: Geostatistics or machine learning for mapping soil attributes and agricultural practices, Revista Ceres, 67, 330–336, 10.1590/0034-737X202067040010, 2020.

Džeroski and Ženko(2004)

Džeroski, S. and Ženko, B.: Is combining classifiers with stacking better than selecting the best one?, Mach. Learn., 54, 255–273, 10.1023/B:MACH.0000015881.36452.6e, 2004.

Egaña et al.(2021)Egaña, Navarro, Maleki, Grandón, Carter, and Soto

Egaña, A., Navarro, F., Maleki, M., Grandón, F., Carter, F., and Soto, F.: Ensemble Spatial Interpolation: A New Approach to Natural or Anthropogenic Variable Assessment, Natural Resources Research, 30, 3777–3793, 10.1007/s11053-021-09860-2, 2021.

Fischer and Getis(2009)

Fischer, M. and Getis, A., eds.: Handbook of Applied Spatial Analysis: Software Tools, Methods and Applications, Springer, 10.1007/978-3-642-03647-7, 2009.

Friedman et al.(2000)Friedman, Hastie, Tibshirani, and others

Friedman, J., Hastie, T., and Tibshirani, R.: Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors), Anna. Stat., 28, 337–407, 10.1214/aos/1016218223, 2000.

Hamdi et al.(2022)Hamdi, Shaban, Erradi, Mohamed, Rumi, and Salim

Hamdi, A., Shaban, K., Erradi, A., Mohamed, A., Rumi, S. K., and Salim, F. D.: Spatiotemporal data mining: a survey on challenges and open problems, Artif. Intell. Rev., 55, 1441–1488, 10.1007/s10462-021-09994-y, 2022.

Hastie et al.(2009)Hastie, Tibshirani, and Friedman

Hastie, T., Tibshirani, R., and Friedman, J.: Elements of Statistical Learning, 2nd edn., Springer, 10.1007/978-0-387-84858-7, 2009.

Hothorn and Lausen(2005)

Hothorn, T. and Lausen, B.: Bundling classifiers by bagging trees, Comput. Stat. Data An., 49, 1068–1078, 10.1016/j.csda.2004.06.019, 2005.

Jacobs(1995)

Jacobs, R. A.: Methods For Combining Experts' Probability Assessments, Neural Comput., 7, 867–888, 10.1162/neco.1995.7.5.867, 1995.

Jacobs et al.(1991)Jacobs, Jordan, Nowlan, and Hinton

Jacobs, R. A., Jordan, M. I., Nowlan, S. J., and Hinton, G. E.: Adaptive Mixtures of Local Experts, Neural Comput., 3, 79–87, 10.1162/neco.1991.3.1.79, 1991.

Jordan and Xu(1995)

Jordan, M. I. and Xu, L.: Convergence results for the EM approach to mixtures of experts architectures, Neural Networks, 8, 1409–1431, 10.1016/0893-6080(95)00014-3, 1995.

Khouni et al.(2021)Khouni, Louhichi, and Ghrabi

Khouni, I., Louhichi, G., and Ghrabi, A.: Use of GIS based Inverse Distance Weighted interpolation to assess surface water quality: Case of Wadi El Bey, Tunisia, Environmental Technology & Innovation, 24, 101892, 10.1016/j.eti.2021.101892, 2021.

Kirkwood et al.(2022)Kirkwood, Economou, Pugeault, and Odbert

Kirkwood, C., Economou, T., Pugeault, N., and Odbert, H.: Bayesian Deep Learning for Spatial Interpolation in the Presence of Auxiliary Information, Math. Geosci., 54, 507–531, 10.1007/s11004-021-09988-0, 2022.

Kleijnen(2017)

Kleijnen, J.: Kriging: Methods and Applications, CentER Discussion Paper Series No. 2017-047, Center for Economic Research, 10.2139/ssrn.3075151, 2017.

Lakshminarayanan et al.(2014)Lakshminarayanan, Roy, and Teh

Lakshminarayanan, B., Roy, D. M., and Teh, Y. W.: Mondrian forests: Efficient online random forests, Adv. Neural Inf. Process. Syst., 4, 3140–3148, 2014.

Leirvik and Yuan(2021)

Leirvik, T. and Yuan, M.: A Machine Learning Technique for Spatial Interpolation of Solar Radiation Observations, Earth Space Science, 8, e2020EA001527, 10.1029/2020EA001527, 2021.

Li and Heap(2014)

Li, J. and Heap, A. D.: Spatial interpolation methods applied in the environmental sciences: A review, 53, 173–189, 10.1016/j.envsoft.2013.12.008, 2014.

Li et al.(2011)Li, Heap, Potter, and Daniell

Li, J., Heap, A., Potter, A., and Daniell, J.: Application of machine learning methods to spatial interpolation of environmental variables, Environ. Modell. Softw., 26, 10.1016/j.envsoft.2011.07.004, 2011.

Li et al.(2017)

Li, J., Fan, Z., and Deng, M.: A Method of Spatial Interpolation of Air Pollution Concentration Considering Wind Direction and Speed, Journal of Geo-information Science, 19, 382, 10.3724/SP.J.1047.2017.00382, 2017.

Li(2021)

Li, Z.: An enhanced dual IDW method for high-quality geospatial interpolation, Sci. Rep., 11, 9903, 10.1038/s41598-021-89172-w, 2021.

Maroufpoor et al.(2020)Maroufpoor, Bozorg-Haddad, and Chu

Maroufpoor, S., Bozorg-Haddad, O., and Chu, X.: Chapter 9 – Geostatistics: principles and methods, in: Handbook of Probabilistic Models, edited by: Samui, P., Tien Bui, D., Chakraborty, S., and Deo, R. C., Butterworth-Heinemann, pp. 229–242, 10.1016/B978-0-12-816514-0.00009-6, 2020.

McKinley and Atkinson(2020)

McKinley, J. and Atkinson, P.: A Special Issue on the Importance of Geostatistics in the Era of Data Science, Math. Geosci., 52, 311–315, 10.1007/s11004-020-09858-1, 2020.

Menafoglio et al.(2018)Menafoglio, Gaetani, and Secchi

Menafoglio, A., Gaetani, G., and Secchi, P.: Random domain decompositions for object-oriented Kriging over complex domains, Stoch. Env. Res. Risk A., 32, 3421–3437, 10.1007/s00477-018-1596-z, 2018.

Meng et al.(2013)

Meng, Q., Liu, Z., and Borders, B. E.: Assessment of regression kriging for spatial interpolation – comparisons of seven GIS interpolation methods, Cartogr. Geogr. Inf. Sci., 40, 28–39, 10.1080/15230406.2013.762138, 2013.

Mitáš and Mitášová(1988)

Mitáš, L. and Mitášová, H.: General variational approach to the interpolation problem, Comput. Math. Appl., 16, 983–992, 10.1016/0898-1221(88)90255-6, 1988.

Navarro et al.(2025a)Navarro, Egaña, Ehrenfeld, Garrido, Valenzuela, and Sanchez Perez

Navarro, F., Egaña, A., Ehrenfeld, A., Garrido, F., Valenzuela, M. J., and Sanchez Perez, J. F.: Spatialize v1.0: A Python/C++ Library for Ensemble Spatial Interpolation, Zenodo [code], 10.5281/zenodo.16782612, 2025a.

Navarro et al.(2025b)Navarro, Egaña, and Garrido

Navarro, F., Egaña, A. F., and Garrido, F.: Spatialize, Github [code], https://github.com/alges/spatialize (last access: 29 January 2026), 2025b.

Oliver and Webster(2014)

Oliver, M. and Webster, R.: A tutorial guide to geostatistics: Computing and modelling variograms and kriging, CATENA, 113, 56–69, 10.1016/j.catena.2013.09.006, 2014.

Pannecoucke et al.(2020)Pannecoucke, Le Coz, Freulon, and de Fouquet

Pannecoucke, L., Le Coz, M., Freulon, X., and de Fouquet, C.: Combining geostatistics and simulations of flow and transport to characterize contamination within the unsaturated zone, Sci. Total Environ., 699, 134216, 10.1016/j.scitotenv.2019.134216, 2020.

Paudel et al.(2023)Paudel, de Wit, Boogaard, Marcos, Osinga, and Athanasiadis

Paudel, D., de Wit, A., Boogaard, H., Marcos, D., Osinga, S., and Athanasiadis, I. N.: Interpretability of deep learning models for crop yield forecasting, Comput. Electron. Agr., 206, 107663, 10.1016/j.compag.2023.107663, 2023.

Reid and Grudic(2009)

Reid, S. and Grudic, G.: Regularized linear models in stacked generalization, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Springer, 10.1007/978-3-642-02326-2_12, 2009.

Samet(1984)

Samet, H.: The quadtree and related hierarchical data structures, ACM Comput. Surv. (CSUR), 16, 187–260, 1984.

Samson and Deutsch(2022)

Samson, M. and Deutsch, C.: A Hybrid Estimation Technique Using Elliptical Radial Basis Neural Networks and Cokriging, Math. Geosci., 54, 573–591, 10.1007/s11004-021-09969-3, 2022.

Susanto et al.(2016)Susanto, De Souza, and He

Susanto, F., De Souza, P., and He, J.: Spatiotemporal Interpolation for Environmental Modelling, Sensors, 16, 10.3390/s16081245, 2016.

Varouchakis et al.(2012)Varouchakis, Hristopulos, and Karatzas

Varouchakis, E., Hristopulos, D., and Karatzas, G.: Improving kriging of groundwater level data using nonlinear normalizing transformations—a field application, Hydrolog. Sci. J., 57, 1404–1419, 10.1080/02626667.2012.717174, 2012.

Virdee and Kottegoda(1984)

Virdee, T. and Kottegoda, N.: A brief review of kriging and its application to optimal interpolation and observation well selection, Hydrolog. Sci. J., 29, 367–387, 10.1080/02626668409490957, 1984.

Wang et al.(2019)Wang, Zhang, Lu, and Wang

Wang, B., Zhang, N., Lu, W., and Wang, J.: Deep-learning-based seismic data interpolation: A preliminary result, GEOPHYSICS, 84, V11–V20, 10.1190/geo2017-0495.1, 2019.

Wang et al.(2017)Wang, Akeju, and Zhao

Wang, Y., Akeju, O. V., and Zhao, T.: Interpolation of spatially varying but sparsely measured geo-data: A comparative study, Eng. Geol., 231, 200–217, 10.1016/j.enggeo.2017.10.019, 2017.