Articles | Volume 17, issue 15
https://doi.org/10.5194/gmd-17-5897-2024
https://doi.org/10.5194/gmd-17-5897-2024
Methods for assessment of models
 | 
07 Aug 2024
Methods for assessment of models |  | 07 Aug 2024

kNNDM CV: k-fold nearest-neighbour distance matching cross-validation for map accuracy estimation

Jan Linnenbrink, Carles Milà, Marvin Ludwig, and Hanna Meyer

Download

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • CC1: 'Comment on egusphere-2023-1308', Nils Tjaden, 07 Jul 2023
  • RC1: 'Comment on egusphere-2023-1308', Italo Goncalves, 23 Aug 2023
  • RC2: 'Comment on egusphere-2023-1308', Anonymous Referee #2, 23 Aug 2023
  • AC1: 'Comment on egusphere-2023-1308', Jan Linnenbrink, 19 Oct 2023

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Jan Linnenbrink on behalf of the Authors (07 Nov 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (03 Dec 2023) by Rohitash Chandra
RR by Italo Goncalves (05 Dec 2023)
RR by Ute Mueller (07 Jan 2024)
RR by Anonymous Referee #4 (07 Jan 2024)
ED: Reconsider after major revisions (25 Jan 2024) by Rohitash Chandra
AR by Jan Linnenbrink on behalf of the Authors (04 Mar 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (08 Apr 2024) by Rohitash Chandra
RR by Ute Mueller (15 Apr 2024)
RR by Wen Luo (17 Apr 2024)
ED: Publish as is (17 Jun 2024) by Rohitash Chandra
AR by Jan Linnenbrink on behalf of the Authors (18 Jun 2024)  Manuscript 
Download
Short summary
Estimation of map accuracy based on cross-validation (CV) in spatial modelling is pervasive but controversial. Here, we build upon our previous work and propose a novel, prediction-oriented k-fold CV strategy for map accuracy estimation in which the distribution of geographical distances between prediction and training points is taken into account when constructing the CV folds. Our method produces more reliable estimates than other CV methods and can be used for large datasets.