Articles | Volume 19, issue 7
https://doi.org/10.5194/gmd-19-2657-2026
https://doi.org/10.5194/gmd-19-2657-2026
Methods for assessment of models
 | 
07 Apr 2026
Methods for assessment of models |  | 07 Apr 2026

Validation strategies for deep learning-based groundwater level time series prediction using exogenous meteorological input features

Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda

Download

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-3539', Anonymous Referee #1, 27 Sep 2025
    • AC1: 'Reply on RC1', Fabienne Doll, 18 Nov 2025
  • RC2: 'Comment on egusphere-2025-3539', Anonymous Referee #2, 04 Oct 2025
    • AC2: 'Reply on RC2', Fabienne Doll, 18 Nov 2025
  • RC3: 'Comment on egusphere-2025-3539', Anonymous Referee #3, 06 Oct 2025
    • AC3: 'Reply on RC3', Fabienne Doll, 18 Nov 2025

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Fabienne Doll on behalf of the Authors (16 Dec 2025)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (30 Dec 2025) by Dan Lu
RR by Anonymous Referee #3 (11 Jan 2026)
ED: Publish as is (27 Jan 2026) by Dan Lu
AR by Fabienne Doll on behalf of the Authors (10 Feb 2026)  Manuscript 

Post-review adjustments

AA – Author's adjustment | EA – Editor approval
AA by Fabienne Doll on behalf of the Authors (10 Feb 2026)   Author's adjustment   Manuscript
EA: Adjustments approved (10 Feb 2026) by Dan Lu
Download
Short summary
With the growing use of machine learning for groundwater level (GWL) prediction, proper performance estimation is crucial. This study compares three validation strategies—blocked cross-validation (bl-CV), repeated out-of-sample (repOOS), and out-of-sample (OOS)—for 1D-CNN and LSTM models using meteorological inputs. Results show that bl-CV offers the most reliable performance estimates, while OOS is the most uncertain, highlighting the need for careful method selection.
Share