Preprints
https://doi.org/10.5194/gmd-2020-332
https://doi.org/10.5194/gmd-2020-332

Submitted as: model description paper 23 Oct 2020

Submitted as: model description paper | 23 Oct 2020

Review status: a revised version of this preprint was accepted for the journal GMD and is expected to appear here in due course.

MLAir (v1.0) – a tool to enable fast and flexible machine learning on air data time series

Lukas H. Leufen1,2, Felix Kleinert1,2, and Martin G. Schultz1 Lukas H. Leufen et al.
  • 1Research Centre Jülich, Jülich Supercomputing Centre, Germany
  • 2University of Bonn, Institute of Geosciences, Germany

Abstract. With MLAir (Machine Learning on Air data) we created a software environment that simplifies and accelerates the exploration of new machine learning (ML) models for the analysis and forecasting of meteorological and air quality time series. Thereby MLAir is not developed as an abstract workflow, but hand in hand with actual scientific questions. It thus addresses scientists with either a meteorological or a ML background. Due to their relative ease of use and spectacular results in other application areas, neural networks and other ML methods are gaining enormous momentum also in the weather and air quality research communities. Even though there are already many books and tutorials describing how to conduct a ML experiment, there are many stumbling blocks for a newcomer. In contrast, people familiar with ML concepts and technology often have difficulties understanding the nature of atmospheric data. With MLAir we have addressed a number of these pitfalls so that it becomes easier for scientists of both domains to rapidly start off their ML application. MLAir has been developed in such a way that it is easy to use and is designed from the very beginning as a standalone, fully functional experiment. Due to its flexible, modular code base, code modifications are easy and personal experiment schedules can be quickly derived. The package also includes a set of simple validation tools to facilitate the evaluation of ML results using standard meteorological statistics. MLAir can easily be ported onto different computing environments from desktop workstations to high-end supercomputers with or without graphics processing units (GPU).

Lukas H. Leufen et al.

 
Status: closed
Status: closed
AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment
Printer-friendly Version - Printer-friendly version Supplement - Supplement
 
Status: closed
Status: closed
AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment
Printer-friendly Version - Printer-friendly version Supplement - Supplement

Lukas H. Leufen et al.

Model code and software

MLAir (v1.0.0) - a tool to enable fast and flexible machine learning on air data time series - Source Code Lukas Hubert Leufen, Felix Kleinert, and Martin Georg Schultz https://doi.org/10.34730/fcc6b509d5394dad8cfdfc6e9fff2bec

Lukas H. Leufen et al.

Viewed

Total article views: 506 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
380 125 1 506 5 4
  • HTML: 380
  • PDF: 125
  • XML: 1
  • Total: 506
  • BibTeX: 5
  • EndNote: 4
Views and downloads (calculated since 23 Oct 2020)
Cumulative views and downloads (calculated since 23 Oct 2020)

Viewed (geographical distribution)

Total article views: 417 (including HTML, PDF, and XML) Thereof 413 with geography defined and 4 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 26 Feb 2021
Download
Short summary
MLAir provides a coherent end-to-end structure for a typical time series analysis workflow using machine learning (ML). Yet, it is adaptable to a variety of ML use cases. The user has a free hand with the ML model itself and can select from different methods during preprocessing, training, and postprocessing. MLAir offers tools to track the experiment conduction, documents the necessary ML parameters, and creates a variety of publication-ready plots.