MeteoSaver v1.0: a machine-learning based software for the transcription of historical weather data

Muheki, Derrick; Vercruysse, Bas; Chandrasekar, Krishna Kumar Thirukokaranam; Verbruggen, Christophe; Birkholz, Julie M.; Hufkens, Koen; Verbeeck, Hans; Boeckx, Pascal; Lampe, Seppe; Hawkins, Ed; Thorne, Peter; Ntumba, Dominique Kankonde; Moulasa, Olivier Kapalay; Thiery, Wim

doi:10.5194/gmd-19-3213-2026

Articles | Volume 19, issue 8

https://doi.org/10.5194/gmd-19-3213-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Model description paper | 23 Apr 2026

Download

Final revised paper (published on 23 Apr 2026)
Supplement to the final revised paper
Preprint (discussion started on 10 Jun 2025)
Supplement to the preprint

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-3779', Anonymous Referee #1, 15 Feb 2026
“MeteoSaver v1.0: a machine-learning based software for the transcription of historical weather data” by Muheki et al.

https://doi.org/10.5194/egusphere-2024-3779
Preprint. Discussion started: 10 June 2025
General Assessment
This manuscript presents MeteoSaver v1.0, an open-source, machine-learning based pipeline for the transcription, quality control, and structuring of historical meteorological records. The work is technically strong, well motivated, and relevant for climate data rescue, particularly in data-scarce regions.
The system is a valuable contribution to the field of historical climate data rescue, and the open-source, modular design is commendable. The paper also contributes at the interface of climate science, machine learning, and data engineering.
To strengthen the manuscript, I recommend expanding or more clearly contextualizing the validation, clarifying accuracy requirements for climate applications, and addressing potential biases introduced by rule-based QC. Addressing these points will significantly enhance the practical usefulness of the software.
The manuscript reports several performance metrics (e.g., transcription match rates, MAE, quality flags). The reported median match rate of approximately 74% between MeteoSaver outputs and manual transcription is relatively low for climate data rescue applications, where accuracy requirements are often stringent. While the authors also report a median MAE of 0.3 °C for temperature, the relationship between these metrics and their implications for downstream climate analyses is not sufficiently discussed. The paper should explain:
What level of transcription error is acceptable for climate or meteorological analysis,

How the reported errors might affect climatologies, trend analyses, or extreme-event detection,

Whether the current performance is adequate for the intended use cases.

The authors should clearly link their validation results to the types of climate analyses for which MeteoSaver outputs are (or are not) suitable, and explicitly discuss limitations.
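To make the distinction between the two headline metrics concrete, here is a minimal sketch of how a match rate and an MAE could be computed for one sheet. The function names, the exact-equality definition of a "match", and the sample values are assumptions for illustration; the manuscript's own definitions may differ.

```python
def match_rate(auto, manual):
    """Percentage of manually transcribed cells reproduced exactly (assumed definition)."""
    pairs = [(a, m) for a, m in zip(auto, manual) if m is not None]
    if not pairs:
        return float("nan")
    hits = sum(1 for a, m in pairs if a is not None and abs(a - m) < 1e-9)
    return 100.0 * hits / len(pairs)


def mean_absolute_error(auto, manual):
    """MAE in degC over cells where both transcriptions provide a value."""
    diffs = [abs(a - m) for a, m in zip(auto, manual)
             if a is not None and m is not None]
    return sum(diffs) / len(diffs) if diffs else float("nan")


# Hypothetical sheet: one misread digit (31.2 instead of 37.2) and one missing cell.
manual = [28.4, 37.2, 30.1, 29.8]
auto = [28.4, 31.2, 30.1, None]

print(match_rate(auto, manual))           # 2 of 4 cells match
print(mean_absolute_error(auto, manual))  # one 6.0 degC error spread over 3 compared cells
```

In this toy case a single misread digit dominates the MAE, while the missing cell lowers the match rate without affecting the MAE at all. This is why the two metrics need to be interpreted together when judging suitability for trend or extreme-event analyses.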

I recommend “minor revision”.

Below are a few minor comments.
The resolution of Figures 13 and 14 should be improved. Some validation figures would benefit from clearer guidance on how they should be interpreted by climate scientists.

The discussion would benefit from a short paragraph situating MeteoSaver within the broader ecosystem of data rescue tools.

The evaluation focuses primarily on temperature variables. Please clarify whether MeteoSaver currently supports other common variables (precipitation, humidity, pressure, radiation) and whether different QC logic would be required.

The reported processing time of approximately 8 minutes per sheet on a laptop raises questions about scalability. Can you provide estimates for batch processing on HPC or cloud infrastructure, and discuss expected performance for thousands of sheets and potential bottlenecks?
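A back-of-the-envelope estimate of the kind requested could look like the sketch below. It assumes that sheets are processed independently, that each worker needs the same 8 minutes per sheet as the laptop, and that there is no I/O or scheduling overhead. None of these assumptions is confirmed by the manuscript, so the numbers are an optimistic lower bound.

```python
import math

MINUTES_PER_SHEET = 8.0  # approximate laptop figure quoted in the comment above


def batch_hours(n_sheets, n_workers=1, minutes_per_sheet=MINUTES_PER_SHEET):
    """Wall-clock hours if sheets are independent and spread evenly over workers."""
    sheets_per_worker = math.ceil(n_sheets / n_workers)
    return sheets_per_worker * minutes_per_sheet / 60.0


# Hypothetical archive of 5000 sheets on 1, 16 and 64 parallel workers.
for workers in (1, 16, 64):
    print(workers, round(batch_hours(5000, workers), 1))
```

Under these assumptions a single machine would need several weeks for 5000 sheets, whereas a modest number of parallel workers brings this down to hours. In practice, image loading and model initialisation may become the bottleneck.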

Either extend the validation to at least one additional archive (different country or table layout), or explicitly acknowledge and discuss the limitations of the current validation, clearly stating that the reported performance metrics may not generalize to other historical datasets.

p.14: “Following the transcription of the data, quality assessment and quality control (QA/QC) is carried out to ensure the final output data is highly accurate with reference to the original handwritten daily temperature records (see Fig. 9).”

>> The phrase “highly accurate” is not operationally defined. It would strengthen the methodology to clarify whether “accuracy” here refers to:
internal consistency (logical constraints, totals, relationships),

agreement with manually transcribed values,

or conformity with physical bounds.

Clarifying this will help readers understand what the QA/QC module is designed to guarantee.

p.16:
“If this condition is not met, a specific adjustment, unique to our sheets, is applied: the first digit is removed from the value, and the cell is flagged to indicate this manipulation (see Fig. 11 a-b, with manipulated values in b shown in orange).”

>> This is a data transformation rule, not only a quality check. It would help to explicitly describe this as a correction operation and to specify its assumptions (e.g., why the first digit is assumed to be erroneous, and under what conditions this may fail).
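As an illustration of why the assumptions matter, the following sketch implements a first-digit removal rule of this general kind. The triggering condition (more than three digits, with values stored as tenths of a degree) and the flag names are hypothetical, since the quoted sentence does not state the actual condition.

```python
MAX_DIGITS = 3  # assumed: e.g. "235" stands for 23.5 degC


def correct_extra_digit(raw):
    """Return (digits, flag); drop the leading digit when the string is too long."""
    digits = "".join(ch for ch in raw if ch.isdigit())
    if len(digits) <= MAX_DIGITS:
        return digits, "unchanged"
    return digits[1:], "first_digit_removed"


# A spurious leading stroke read as "1": the rule recovers the intended value.
print(correct_extra_digit("1235"))  # ('235', 'first_digit_removed')

# A spurious trailing digit instead: the rule yields a plausible but wrong value.
print(correct_extra_digit("2351"))  # ('351', 'first_digit_removed')
```

The second case shows how the correction can fail silently: 35.1 °C lies within a physically plausible range, so a later threshold check would not necessarily catch it.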
“However, if the check is passed, the transcribed temperature values are then adjusted to match the required decimal places, set to one in this case (see Fig. 11 b–c).”

>> This step modifies the data but is not mathematically described. Please clarify:
whether this is rounding, truncation, or scaling,

and how uncertainty introduced by this step is handled.
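The three candidate operations give different results and carry different uncertainty, as the sketch below shows. It uses Python's `decimal` module, and the sample inputs are hypothetical. Which of these operations MeteoSaver actually applies is exactly what the comment asks the authors to state.

```python
from decimal import Decimal, ROUND_DOWN, ROUND_HALF_UP

STEP = Decimal("0.1")  # one decimal place, as in the quoted sentence


def by_rounding(text):
    """Round half up to one decimal place (error of at most 0.05)."""
    return Decimal(text).quantize(STEP, rounding=ROUND_HALF_UP)


def by_truncation(text):
    """Cut off extra decimals (error below 0.1, biased towards zero)."""
    return Decimal(text).quantize(STEP, rounding=ROUND_DOWN)


def by_scaling(text):
    """Treat a decimal-free digit string as tenths: '237' -> 23.7."""
    return Decimal(text).scaleb(-1)


print(by_rounding("23.76"))    # 23.8
print(by_truncation("23.76"))  # 23.7
print(by_scaling("237"))       # 23.7
```

Scaling adds no numerical error but relies on the assumption that the decimal point was simply not detected. Rounding and truncation alter the transcribed value itself and would ideally be reflected in a flag or an uncertainty estimate.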

“For the daily maximum temperature threshold, we use 40°C. For the daily minimum temperature threshold, we use 5°C.”

>> The manuscript would benefit from a brief discussion of how sensitive the results are to these fixed thresholds, and whether they are intended to be region-specific or globally applicable.
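A simple sensitivity test of the kind suggested could be sketched as follows. It assumes that values outside a closed interval between the minimum and maximum thresholds are flagged, and it uses invented daily maxima. The manuscript may apply the two thresholds differently (for example, each to its own variable).

```python
def flagged_fraction(values, lower, upper):
    """Share of values falling outside the closed interval [lower, upper]."""
    if not values:
        return 0.0
    outside = sum(1 for v in values if v < lower or v > upper)
    return outside / len(values)


# Hypothetical daily maxima (degC), including one hot day and one misread value.
tmax = [29.5, 31.0, 33.4, 36.8, 38.9, 41.2, 73.1, 30.2]

# Vary the upper threshold around the 40 degC used in the manuscript.
for upper in (36.0, 38.0, 40.0, 42.0):
    print(upper, flagged_fraction(tmax, lower=5.0, upper=upper))
```

Reporting such a curve for the real data would show whether the fixed 40 °C and 5 °C limits mainly remove transcription errors or also reject genuine extremes, which matters if the software is applied outside the Congo Basin.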

p.19:
“Only the confirmed (green) daily temperature values are passed to the next module, Data Formatting and Upload (sect. 3.6).”

>> This implies that a large portion of transcribed data may be excluded. Please indicate the proportion of discarded values and discuss potential impacts on time series completeness. Here the manuscript transitions from checking to correcting. Explicitly distinguishing these two roles would improve conceptual clarity.
“Two examples … illustrate the sequence of QA/QC checks performed on the initial transcribed values, leading to the final confirmed values (flagged in green).”

>> Figure 11 shows the propagation of flags and value states, but the underlying equations and replacement rules are not visible in the figure. Consider annotating the panels with the rule names (threshold, digit removal, Eq. 1-4, etc.) to make the logic traceable.

p.20:
“At this stage, an additional check is performed, which was not included in the QA/QC module due to the availability of longer temperature series at this point.”

>> This introduces a new methodological step after the main QA/QC description. For structural clarity, it may be preferable to describe this earlier as an optional extension of Module 5.
Citation: https://doi.org/10.5194/egusphere-2024-3779-RC1
- AC1: 'Reply on RC1', Derrick Muheki, 20 Mar 2026
  
  We thank the reviewer for their time and valuable suggestions to improve the manuscript. In the attached PDF document, we respond to the individual comments and illustrate modifications to the manuscript to accommodate the concerns raised. We believe that the manuscript has benefited from these modifications.
  
  Citation: https://doi.org/10.5194/egusphere-2024-3779-AC1
- AC3: 'Reply on RC1', Derrick Muheki, 20 Mar 2026
  
  As described in the attached PDF, we have expanded Sections 3, 4 and 5 to clarify the reported accuracy metrics for climate or meteorological analyses as well as the potential use cases for the transcribed data. Additionally, we have accommodated the additional concerns raised. We believe that the manuscript has benefited from these modifications.
  
  Citation: https://doi.org/10.5194/egusphere-2024-3779-AC3
RC2:
'Comment on egusphere-2024-3779', Chris Lennard, 20 Feb 2026

I commend this work that has developed an automated procedure to transcribe and digitise tabular meteorological observations, particularly in data-sparse regions where weather data are currently stored in archives. If done manually, this is an enormously onerous and thankless task, so an automated procedure, such as the one presented here, is an invaluable tool in the data rescue endeavour.
Technically, each progressive step of the MeteoSaver v1.0 package is logical, building on the previous step and in each step the quality control checks are thorough and account for errors in the transcription as well as common errors found in recorded station data.
The quality control checks of the resultant transcribed data are those generally used to assess the quality of temperature data. This seems to be by design and is understandable as temperature is a relatively homogeneous variable to test the automatic transcription methodology and quality control procedures on.
However, in the title of the paper and elsewhere in the manuscript the phrase “weather data” is used, which implicitly includes rainfall (and other data). In lines 444 - 449 the authors mention variables other than temperature but I note rainfall is missing in this list. I would suggest rainfall is an extremely important variable to transcribe, particularly in currently data-sparse regions, given the large observational uncertainties in these regions as described in the Introduction. The observation data sheets presented in the paper include rainfall so I would like to understand why it does not seem to be a variable being considered for transcription.
I would therefore be grateful for the authors to include a paragraph or two about why rainfall is not considered in the paper, and if lines 444 - 449 are an indication of the variables to be considered in later versions of the software, why rainfall does not feature. I’m not asking the authors to present rainfall in the paper as has been done for temperature, only an explanation of why (it may be more complex a task).
While I do realise this is Version 1 of the software, I also recognise the enormous potential it presents to reduce the huge uncertainties in rainfall variability and trend in data sparse regions, particularly in Africa.
Lastly, given the focus on temperature, perhaps constrain the title of the paper to temperature.
My congratulations and thanks again to the authors for developing this valuable tool that will make it easier to transcribe paper-based weather data for use in digital observational datasets. It has the potential to reduce observational uncertainties in currently data sparse areas, including the DRC.

Citation: https://doi.org/10.5194/egusphere-2024-3779-RC2
- AC2: 'Reply on RC2', Derrick Muheki, 20 Mar 2026
  
  We thank the reviewer for their time, positive assessment of the software, and valuable suggestions to improve the manuscript. We fully agree that precipitation is an important variable to transcribe, particularly in data-scarce regions such as Central Africa, where observational coverage is limited.
  To better illustrate MeteoSaver’s capability to handle precipitation data, we have now incorporated a precipitation pentad consistency check within the QA/QC module of MeteoSaver v1.0 and described it in the revised manuscript. We have clarified this point in Sects. 3.5, 5.3 and 5.4, where we now illustrate the pentad (multi-day) QA/QC check on transcribed precipitation and clarify that the current demonstration focuses on temperature and precipitation variables.
  In the attached PDF document, we illustrate modifications to the manuscript to accommodate the reviewers' comments. We believe that the manuscript has benefited from these modifications, and that the original title is appropriate now that these modifications have been implemented.
  
  Citation: https://doi.org/10.5194/egusphere-2024-3779-AC2
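The pentad consistency check mentioned in the reply to RC2 is not specified on this page. One plausible form, sketched here purely as an assumption, compares the sum of five transcribed daily precipitation totals with a pentad total recorded on the same sheet. The function name, the tolerance and the sample values are hypothetical.

```python
def pentad_consistent(daily_mm, recorded_total_mm, tol_mm=0.05):
    """True when five daily totals add up to the recorded pentad total."""
    if len(daily_mm) != 5:
        raise ValueError("a pentad needs exactly five daily values")
    return abs(sum(daily_mm) - recorded_total_mm) <= tol_mm


# Daily values that add up to the recorded 23.0 mm.
print(pentad_consistent([0.0, 12.4, 3.1, 0.0, 7.5], 23.0))  # True

# One misread digit (8.1 instead of 3.1) breaks the consistency.
print(pentad_consistent([0.0, 12.4, 8.1, 0.0, 7.5], 23.0))  # False
```

A check of this form can detect that at least one value in the pentad is wrong, but not which one, so it is better suited to flagging than to automatic correction.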

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

AR by Derrick Muheki on behalf of the Authors (20 Mar 2026) Author's response Author's tracked changes Manuscript

ED: Publish as is (27 Mar 2026) by Taesam Lee

AR by Derrick Muheki on behalf of the Authors (02 Apr 2026) Manuscript

Short summary

Archives worldwide host vast records of observed weather data crucial for understanding climate variability. However, most of these records are still in paper form, limiting their use. To address this, we developed MeteoSaver, an open-source tool, to transcribe these records to machine-readable format. Applied to ten handwritten temperature sheets, it achieved a median accuracy of 74 %. This tool offers a promising solution to preserve records from archives and unlock historical weather insights.