Articles | Volume 18, issue 23
https://doi.org/10.5194/gmd-18-9417-2025
https://doi.org/10.5194/gmd-18-9417-2025
Development and technical paper
 | 
03 Dec 2025
Development and technical paper |  | 03 Dec 2025

Data clustering to optimise the representativity of observational data in air quality data assimilation: a case study with EURAD-IM (version 5.9.1 DA)

Alexander Hermanns, Anne Caroline Lange, Julia Kowalski, Hendrik Fuchs, and Philipp Franke

Download

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-450', Anonymous Referee #1, 17 Jun 2025
  • RC2: 'Comment on egusphere-2025-450', Anonymous Referee #2, 18 Jun 2025
  • RC3: 'Comment on egusphere-2025-450', Anonymous Referee #3, 22 Jun 2025

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Alexander Hermanns on behalf of the Authors (08 Aug 2025)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (23 Aug 2025) by Yongze Song
RR by Anonymous Referee #3 (18 Sep 2025)
RR by Anonymous Referee #1 (26 Sep 2025)
ED: Publish subject to technical corrections (15 Oct 2025) by Yongze Song
AR by Alexander Hermanns on behalf of the Authors (27 Oct 2025)  Author's response   Manuscript 
Download
Short summary
For air quality analyses, data assimilation models split available data into assimilation and validation data sets. The former is used to generate the analysis, the latter to verify the simulations. A preprocessor classifying the observations by the data characteristics is developed based on clustering algorithms. The assimilation and validation data sets are compiled by equally allocating data of each cluster. The resulting improvement of the analysis is evaluated with an air quality model.
Share