Articles | Volume 17, issue 15
https://doi.org/10.5194/gmd-17-5897-2024
https://doi.org/10.5194/gmd-17-5897-2024
Methods for assessment of models
 | 
07 Aug 2024
Methods for assessment of models |  | 07 Aug 2024

kNNDM CV: k-fold nearest-neighbour distance matching cross-validation for map accuracy estimation

Jan Linnenbrink, Carles Milà, Marvin Ludwig, and Hanna Meyer

Related authors

Investigating Moran’s I Properties for Spatial Machine Learning: A Preliminary Analysis
Jakub Nowosad and Hanna Meyer
AGILE GIScience Ser., 6, 40, https://doi.org/10.5194/agile-giss-6-40-2025,https://doi.org/10.5194/agile-giss-6-40-2025, 2025
Estimation of local training data point densities to support the assessment of spatial prediction uncertainty
Fabian Lukas Schumacher, Christian Knoth, Marvin Ludwig, and Hanna Meyer
EGUsphere, https://doi.org/10.5194/egusphere-2024-2730,https://doi.org/10.5194/egusphere-2024-2730, 2024
Short summary
Random forests with spatial proxies for environmental modelling: opportunities and pitfalls
Carles Milà, Marvin Ludwig, Edzer Pebesma, Cathryn Tonne, and Hanna Meyer
Geosci. Model Dev., 17, 6007–6033, https://doi.org/10.5194/gmd-17-6007-2024,https://doi.org/10.5194/gmd-17-6007-2024, 2024
Short summary
DEVELOPING TRANSFERABLE SPATIAL PREDICTION MODELS: A CASE STUDY OF SATELLITE BASED LANDCOVER MAPPING
M. Ludwig, J. Bahlmann, E. Pebesma, and H. Meyer
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B3-2022, 135–141, https://doi.org/10.5194/isprs-archives-XLIII-B3-2022-135-2022,https://doi.org/10.5194/isprs-archives-XLIII-B3-2022-135-2022, 2022
AntAir: satellite-derived 1 km daily Antarctic air temperatures since 2003
Hanna Meyer, Marwan Katurji, Florian Detsch, Fraser Morgan, Thomas Nauss, Pierre Roudier, and Peyman Zawar-Reza
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2019-215,https://doi.org/10.5194/essd-2019-215, 2019
Preprint withdrawn
Short summary

Related subject area

Earth and space science informatics
DustNet (v1): skilful neural network predictions of dust aerosols over the Saharan desert
Trish E. Nowak, Andy T. Augousti, Benno I. Simmons, and Stefan Siegert
Geosci. Model Dev., 18, 3509–3532, https://doi.org/10.5194/gmd-18-3509-2025,https://doi.org/10.5194/gmd-18-3509-2025, 2025
Short summary
RiverBedDynamics v1.0: a Landlab component for computing two-dimensional sediment transport and river bed evolution
Angel D. Monsalve, Samuel R. Anderson, Nicole M. Gasparini, and Elowyn M. Yager
Geosci. Model Dev., 18, 3427–3451, https://doi.org/10.5194/gmd-18-3427-2025,https://doi.org/10.5194/gmd-18-3427-2025, 2025
Short summary
A GPU parallelization of the neXtSIM-DG dynamical core (v0.3.1)
Robert Jendersie, Christian Lessig, and Thomas Richter
Geosci. Model Dev., 18, 3017–3040, https://doi.org/10.5194/gmd-18-3017-2025,https://doi.org/10.5194/gmd-18-3017-2025, 2025
Short summary
The Earth System Grid Federation (ESGF) Virtual Aggregation (CMIP6 v20240125)
Ezequiel Cimadevilla, Bryan N. Lawrence, and Antonio S. Cofiño
Geosci. Model Dev., 18, 2461–2478, https://doi.org/10.5194/gmd-18-2461-2025,https://doi.org/10.5194/gmd-18-2461-2025, 2025
Short summary
Can AI be enabled to perform dynamical downscaling? A latent diffusion model to mimic kilometer-scale COSMO5.0_CLM9 simulations
Elena Tomasi, Gabriele Franch, and Marco Cristoforetti
Geosci. Model Dev., 18, 2051–2078, https://doi.org/10.5194/gmd-18-2051-2025,https://doi.org/10.5194/gmd-18-2051-2025, 2025
Short summary

Cited articles

Beygelzimer, A., Kakadet, S., Langford, J., Arya, S., Mount, D., and Li, S.: FNN: Fast Nearest Neighbor Search Algorithms and Applications, r package version 1.1.3.1, https://CRAN.R-project.org/package=FNN (last access: 29 July 2024), 2022. a
Brenning, A.: Spatial cross-validation and bootstrap for the assessment of prediction rules in remote sensing: The R package sperrorest, in: 2012 IEEE Int. Geosci. Remote, 5372–5375, https://doi.org/10.1109/IGARSS.2012.6352393, 2012. a, b
Brenning, A.: Spatial machine-learning model diagnostics: a model-agnostic distance-based approach, Int. J. Geograph. Inf. Sci., 37, 584–606, https://doi.org/10.1080/13658816.2022.2131789, 2022. a, b
Conover, W. J.: Practical nonparametric statistics, vol. 350, John wiley & sons, ISBN 978-0-471-16068-7, 1999. a
Corporation, M. and Weston, S.: doParallel: Foreach Parallel Adaptor for the “parallel” Package, r package version 1.0.17, https://CRAN.R-project.org/package=doParallel (last access: 29 July 2024), 2022. a
Download
Short summary
Estimation of map accuracy based on cross-validation (CV) in spatial modelling is pervasive but controversial. Here, we build upon our previous work and propose a novel, prediction-oriented k-fold CV strategy for map accuracy estimation in which the distribution of geographical distances between prediction and training points is taken into account when constructing the CV folds. Our method produces more reliable estimates than other CV methods and can be used for large datasets.
Share