Articles | Volume 15, issue 8
https://doi.org/10.5194/gmd-15-3433-2022
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.
Efficient high-dimensional variational data assimilation with machine-learned reduced-order models
Romit Maulik
CORRESPONDING AUTHOR
240, Argonne National Laboratory, Lemont, IL 60439, USA
Vishwas Rao
240, Argonne National Laboratory, Lemont, IL 60439, USA
Jiali Wang
240, Argonne National Laboratory, Lemont, IL 60439, USA
Gianmarco Mengaldo
Department of Mechanical Engineering, National University of Singapore, Block EA, #07-08, 9 Engineering Drive 1, Singapore
Emil Constantinescu
240, Argonne National Laboratory, Lemont, IL 60439, USA
Bethany Lusch
240, Argonne National Laboratory, Lemont, IL 60439, USA
Prasanna Balaprakash
240, Argonne National Laboratory, Lemont, IL 60439, USA
Ian Foster
240, Argonne National Laboratory, Lemont, IL 60439, USA
Rao Kotamarthi
240, Argonne National Laboratory, Lemont, IL 60439, USA
Related authors
Qiuyi Wu, Julie Bessac, Whitney Huang, Jiali Wang, and Rao Kotamarthi
Adv. Stat. Clim. Meteorol. Oceanogr., 8, 205–224, https://doi.org/10.5194/ascmo-8-205-2022, 2022
Short summary
We study wind conditions and their potential future changes across the U.S. via a statistical conditional framework. We conclude that changes between historical and future wind directions are small, but wind speeds are generally weakened in the projected period, with some locations being intensified. Moreover, winter wind speeds are projected to decrease in the northwest, Colorado, and the northern Great Plains (GP), while summer wind speeds over the southern GP slightly increase in the future.
William J. Shaw, Larry K. Berg, Mithu Debnath, Georgios Deskos, Caroline Draxl, Virendra P. Ghate, Charlotte B. Hasager, Rao Kotamarthi, Jeffrey D. Mirocha, Paytsar Muradyan, William J. Pringle, David D. Turner, and James M. Wilczak
Wind Energ. Sci., 7, 2307–2334, https://doi.org/10.5194/wes-7-2307-2022, 2022
Short summary
This paper provides a review of prominent scientific challenges to characterizing the offshore wind resource using as examples phenomena that occur in the rapidly developing wind energy areas off the United States. The paper also describes the current state of modeling and observations in the marine atmospheric boundary layer and provides specific recommendations for filling key current knowledge gaps.
Chuxuan Li, Alexander L. Handwerger, Jiali Wang, Wei Yu, Xiang Li, Noah J. Finnegan, Yingying Xie, Giuseppe Buscarnera, and Daniel E. Horton
Nat. Hazards Earth Syst. Sci., 22, 2317–2345, https://doi.org/10.5194/nhess-22-2317-2022, 2022
Short summary
In January 2021 a storm triggered numerous debris flows in a wildfire burn scar in California. We use a hydrologic model to assess debris flow susceptibility in pre-fire and postfire scenarios. Compared to pre-fire conditions, postfire conditions yield dramatic increases in peak water discharge, substantially increasing debris flow susceptibility. Our work highlights the hydrologic model's utility in investigating and potentially forecasting postfire debris flows at regional scales.
Caleb Phillips, Lindsay M. Sheridan, Patrick Conry, Dimitrios K. Fytanidis, Dmitry Duplyakin, Sagi Zisman, Nicolas Duboc, Matt Nelson, Rao Kotamarthi, Rod Linn, Marc Broersma, Timo Spijkerboer, and Heidi Tinnesand
Wind Energ. Sci., 7, 1153–1169, https://doi.org/10.5194/wes-7-1153-2022, 2022
Short summary
Adoption of distributed wind turbines for energy generation is hindered by challenges associated with siting and accurate estimation of the wind resource. This study evaluates classic and commonly used methods alongside new state-of-the-art models derived from simulations and machine learning approaches using a large dataset from the Netherlands. We find that data-driven methods are most effective at predicting production at real sites and new models reliably outperform classic methods.
Jiali Wang, Zhengchun Liu, Ian Foster, Won Chang, Rajkumar Kettimuthu, and V. Rao Kotamarthi
Geosci. Model Dev., 14, 6355–6372, https://doi.org/10.5194/gmd-14-6355-2021, 2021
Short summary
Downscaling, the process of generating a dataset with higher spatial or temporal resolution from a coarser observational or model dataset, is a widely used technique. The two common approaches to downscaling are dynamical (physics-based) and statistical (empirical). Here we develop a novel methodology, using a conditional generative adversarial network (CGAN), to downscale a model's precipitation forecasts and describe the advantages of this method compared to the others.
Jaydeep Singh, Narendra Singh, Narendra Ojha, Amit Sharma, Andrea Pozzer, Nadimpally Kiran Kumar, Kunjukrishnapillai Rajeev, Sachin S. Gunthe, and V. Rao Kotamarthi
Geosci. Model Dev., 14, 1427–1443, https://doi.org/10.5194/gmd-14-1427-2021, 2021
Short summary
Atmospheric models often have limitations in simulating the geographically complex and climatically important central Himalayan region. To address this, we performed regional modeling at high resolution to improve the simulation of meteorology and dynamics through a better representation of the topography. The study has implications for further model applications to investigate the effects of anthropogenic pressure over the Himalaya.
Jiali Wang, Prasanna Balaprakash, and Rao Kotamarthi
Geosci. Model Dev., 12, 4261–4274, https://doi.org/10.5194/gmd-12-4261-2019, 2019
Short summary
Parameterizations are frequently used in models representing physical phenomena and are often the computationally expensive portions of the code. Using model output from simulations performed using a weather model, we train deep neural networks to provide an accurate alternative to a physics-based parameterization. We demonstrate that a domain-aware deep neural network can successfully simulate the entire diurnal cycle of the boundary layer physics and the results are transferable.
Jiali Wang, Cheng Wang, Vishwas Rao, Andrew Orr, Eugene Yan, and Rao Kotamarthi
Geosci. Model Dev., 12, 3523–3539, https://doi.org/10.5194/gmd-12-3523-2019, 2019
Short summary
WRF-Hydro needs to be calibrated to optimize its output with respect to observations. However, when applied to a relatively large domain, both WRF-Hydro simulations and calibrations require intensive computing resources and are best performed in parallel. This study ported an independent calibration tool (parameter estimation tool – PEST) to high-performance computing clusters and adapted it to work with WRF-Hydro. The results show significant speedup for model calibration.
Jeffrey D. Mirocha, Matthew J. Churchfield, Domingo Muñoz-Esparza, Raj K. Rai, Yan Feng, Branko Kosović, Sue Ellen Haupt, Barbara Brown, Brandon L. Ennis, Caroline Draxl, Javier Sanz Rodrigo, William J. Shaw, Larry K. Berg, Patrick J. Moriarty, Rodman R. Linn, Veerabhadra R. Kotamarthi, Ramesh Balakrishnan, Joel W. Cline, Michael C. Robinson, and Shreyas Ananthan
Wind Energ. Sci., 3, 589–613, https://doi.org/10.5194/wes-3-589-2018, 2018
Short summary
This paper validates the use of idealized large-eddy simulations with periodic lateral boundary conditions to provide boundary-layer flow quantities of interest for wind energy applications. Sensitivities to model formulation, forcing parameter values, and grid configurations were also examined, both to ascertain the robustness of the technique and to characterize inherent uncertainties, as required for the evaluation of more general wind plant flow simulation approaches under development.
K. K. Shukla, K. Niranjan Kumar, D. V. Phanikumar, R. K. Newsom, V. R. Kotamarthi, T. B. M. J. Ouarda, and M. V. Ratnam
Atmos. Meas. Tech. Discuss., https://doi.org/10.5194/amt-2016-162, 2016
Revised manuscript not accepted
Short summary
Cloud base height was estimated using various ground-based instruments (Doppler lidar and ceilometer) and satellite datasets (MODIS) over the central Himalayan region for the first time. The present study demonstrates the potential of Doppler lidar for precise estimation of cloud base height and updraft velocities. More such deployments will provide invaluable inputs for regional weather prediction models over complex Himalayan terrain.
Y. Feng, V. R. Kotamarthi, R. Coulter, C. Zhao, and M. Cadeddu
Atmos. Chem. Phys., 16, 247–264, https://doi.org/10.5194/acp-16-247-2016, 2016
Short summary
Aerosol radiative effects are of great importance for climate studies over South Asia, such as the weakening of the South Asian monsoon in the 20th century. This study reveals the altitude dependence of commonly underestimated aerosol radiative properties over this region. It further demonstrates the importance of constraining aerosol vertical distributions and partitioning of scattering vs absorbing aerosols in simulating the subsequent regional dynamical and hydrological responses to aerosols.
B. A. Drewniak, U. Mishra, J. Song, J. Prell, and V. R. Kotamarthi
Biogeosciences, 12, 2119–2129, https://doi.org/10.5194/bg-12-2119-2015, 2015
J. Elliott, C. Müller, D. Deryng, J. Chryssanthacopoulos, K. J. Boote, M. Büchner, I. Foster, M. Glotter, J. Heinke, T. Iizumi, R. C. Izaurralde, N. D. Mueller, D. K. Ray, C. Rosenzweig, A. C. Ruane, and J. Sheffield
Geosci. Model Dev., 8, 261–277, https://doi.org/10.5194/gmd-8-261-2015, 2015
Short summary
We present and describe the Global Gridded Crop Model Intercomparison (GGCMI) project, an ongoing international effort to 1) validate global models of crop productivity, 2) improve models through detailed analysis of processes, and 3) assess the impacts of climate change on agriculture and food security. We present analysis of data inputs for the project, detailed protocols for conducting and evaluating simulation outputs, and example results.
V. S. Manoharan, R. Kotamarthi, Y. Feng, and M. P. Cadeddu
Atmos. Chem. Phys., 14, 1159–1165, https://doi.org/10.5194/acp-14-1159-2014, 2014
Y. Feng, V. Ramanathan, and V. R. Kotamarthi
Atmos. Chem. Phys., 13, 8607–8621, https://doi.org/10.5194/acp-13-8607-2013, 2013
B. Drewniak, J. Song, J. Prell, V. R. Kotamarthi, and R. Jacob
Geosci. Model Dev., 6, 495–515, https://doi.org/10.5194/gmd-6-495-2013, 2013
Related subject area
Numerical methods
A comparison of Eulerian and Lagrangian methods for vertical particle transport in the water column
AutoQS v1: automatic parametrization of QuickSampling based on training images analysis
Implementation and application of ensemble optimal interpolation on an operational chemistry weather model for improving PM2.5 and visibility predictions
A dynamical core based on a discontinuous Galerkin method for higher-order finite-element sea ice modeling
GStatSim V1.0: a Python package for geostatistical interpolation and conditional simulation
Leveraging Google's Tensor Processing Units for tsunami-risk mitigation planning in the Pacific Northwest and beyond
An improved subgrid channel model with upwind-form artificial diffusion for river hydrodynamics and floodplain inundation simulation
A model instability issue in the National Centers for Environmental Prediction Global Forecast System version 16 and potential solutions
A comparison of 3-D spherical shell thermal convection results at low to moderate Rayleigh number using ASPECT (version 2.2.0) and CitcomS (version 3.3.1)
LISFLOOD-FP 8.1: new GPU-accelerated solvers for faster fluvial/pluvial flood simulations
Fast approximate Barnes interpolation: illustrated by Python-Numba implementation fast-barnes-py v1.0
Strategies for conservative and non-conservative monotone remapping on the sphere
GeoINR 1.0: an implicit neural representation network for three-dimensional geological modelling
Modeling large‐scale landform evolution with a stream power law for glacial erosion (OpenLEM v37): benchmarking experiments against a more process-based description of ice flow (iSOSIA v3.4.3)
A mixed finite-element discretisation of the shallow-water equations
Multifidelity Monte Carlo estimation for efficient uncertainty quantification in climate-related modeling
Massively parallel modeling and inversion of electrical resistivity tomography data using PFLOTRAN
Parallelized domain decomposition for multi-dimensional Lagrangian random walk mass-transfer particle tracking schemes
The Intelligent Prospector v1.0: geoscientific model development and prediction by sequential data acquisition planning with application to mineral exploration
Predicting peak daily maximum 8 h ozone and linkages to emissions and meteorology in Southern California using machine learning methods (SoCAB-8HR V1.0)
Transfer learning for landslide susceptibility modeling using domain adaptation and case-based reasoning
ISMIP-HOM benchmark experiments using Underworld
spyro: a Firedrake-based wave propagation and full-waveform-inversion finite-element solver
Spatial filtering in a 6D hybrid-Vlasov scheme to alleviate adaptive mesh refinement artifacts: a case study with Vlasiator (versions 5.0, 5.1, and 5.2.1)
A Bayesian data assimilation framework for lake 3D hydrodynamic models with a physics-preserving particle filtering method using SPUX-MITgcm v1
A fast, single-iteration ensemble Kalman smoother for sequential data assimilation
Characterizing uncertainties of Earth system modeling with heterogeneous many-core architecture computing
Metrics for Intercomparison of Remapping Algorithms (MIRA) protocol applied to Earth system models
Impact of the numerical solution approach of a plant hydrodynamic model (v0.1) on vegetation dynamics
Islet: interpolation semi-Lagrangian element-based transport
Multi-dimensional hydrological–hydraulic model with variational data assimilation for river networks and floodplains
Assessing the robustness and scalability of the accelerated pseudo-transient method
Assessment of stochastic weather forecast of precipitation near European cities, based on analogs of circulation
University of Warsaw Lagrangian Cloud Model (UWLCM) 2.0: adaptation of a mixed Eulerian–Lagrangian numerical model for heterogeneous computing clusters
Prediction error growth in a more realistic atmospheric toy model with three spatiotemporal scales
On numerical broadening of particle-size spectra: a condensational growth study using PyMPDATA 1.0
Lossy checkpoint compression in full waveform inversion: a case study with ZFPv0.5.5 and the overthrust model
Blockworlds 0.1.0: a demonstration of anti-aliased geophysics for probabilistic inversions of implicit and kinematic geological models
Improved double Fourier series on a sphere and its application to a semi-implicit semi-Lagrangian shallow-water model
SciKit-GStat 1.0: a SciPy-flavored geostatistical variogram estimation toolbox written in Python
Flow-Py v1.0: a customizable, open-source simulation tool to estimate runout and intensity of gravitational mass flows
Emulation of high-resolution land surface models using sparse Gaussian processes with application to JULES
A three-dimensional variational data assimilation system for aerosol optical properties based on WRF-Chem v4.0: design, development, and application of assimilating Himawari-8 aerosol observations
Implementation of a Gaussian Markov random field sampler for forward uncertainty quantification in the Ice-sheet and Sea-level System Model v4.19
A method for assessment of the general circulation model quality using the K-means clustering algorithm: a case study with GETM v2.5
An explicit GPU-based material point method solver for elastoplastic problems (ep2-3De v1.0)
MagIC v5.10: a two-dimensional message-passing interface (MPI) distribution for pseudo-spectral magnetohydrodynamics simulations in spherical geometry
Machine-learning models to replicate large-eddy simulations of air pollutant concentrations along boulevard-type streets
Recalculation of error growth models' parameters for the ECMWF forecast system
How biased are our models? – a case study of the alpine region
Tor Nordam, Ruben Kristiansen, Raymond Nepstad, Erik van Sebille, and Andy M. Booth
Geosci. Model Dev., 16, 5339–5363, https://doi.org/10.5194/gmd-16-5339-2023, 2023
Short summary
We describe and compare two common methods, Eulerian and Lagrangian models, used to simulate the vertical transport of material in the ocean. They both solve the same transport problems but use different approaches for representing the underlying equations on the computer. The main focus of our study is on the numerical accuracy of the two approaches. Our results should be useful for other researchers creating or using these types of transport models.
Mathieu Gravey and Grégoire Mariethoz
Geosci. Model Dev., 16, 5265–5279, https://doi.org/10.5194/gmd-16-5265-2023, 2023
Short summary
Multiple‐point geostatistics are widely used to simulate complex spatial structures based on a training image. The use of these methods relies on the possibility of finding optimal training images and parametrization of the simulation algorithms. Here, we propose finding an optimal set of parameters using only the training image as input. The main advantage of our approach is to remove the risk of overfitting an objective function.
Siting Li, Ping Wang, Hong Wang, Yue Peng, Zhaodong Liu, Wenjie Zhang, Hongli Liu, Yaqiang Wang, Huizheng Che, and Xiaoye Zhang
Geosci. Model Dev., 16, 4171–4191, https://doi.org/10.5194/gmd-16-4171-2023, 2023
Short summary
Optimizing the initial state of atmospheric chemistry model input is one of the most essential methods to improve forecast accuracy. Considering the large computational load of the model, we introduce an ensemble optimal interpolation scheme (EnOI) for operational use and efficient updating of the initial fields of chemical components. The results suggest that EnOI provides a practical and cost-effective technique for improving the accuracy of chemical weather numerical forecasts.
Thomas Richter, Véronique Dansereau, Christian Lessig, and Piotr Minakowski
Geosci. Model Dev., 16, 3907–3926, https://doi.org/10.5194/gmd-16-3907-2023, 2023
Short summary
Sea ice not only covers the polar regions but also affects the weather and climate globally. For example, its white surface reflects more sunlight than land. The oceans around the poles are therefore kept cool, which affects the circulation in the oceans worldwide. Simulating the behavior and changes in sea ice on a computer is, however, very difficult. We propose a new computer simulation that better models how cracks in the ice change over time and show this by comparing to other simulations.
Emma J. MacKie, Michael Field, Lijing Wang, Zhen Yin, Nathan Schoedl, Matthew Hibbs, and Allan Zhang
Geosci. Model Dev., 16, 3765–3783, https://doi.org/10.5194/gmd-16-3765-2023, 2023
Short summary
Earth scientists often have to fill in spatial gaps in measurements. This gap-filling or interpolation can be accomplished with geostatistical methods, where the statistical relationships between measurements are used to inform how these gaps should be filled. Despite the broad utility of these methods, there are few freely available geostatistical software applications. We present GStatSim, a Python package for performing different geostatistical interpolation methods.
Ian Madden, Simone Marras, and Jenny Suckale
Geosci. Model Dev., 16, 3479–3500, https://doi.org/10.5194/gmd-16-3479-2023, 2023
Short summary
To aid risk managers who may wish to rapidly assess tsunami risk but may lack high-performance computing infrastructure, we provide an accessible software package able to rapidly model tsunami inundation over real topography by leveraging Google's Tensor Processing Unit, a type of high-performance hardware. Minimally trained users can take advantage of the rapid modeling abilities provided by this package via a web browser thanks to the ease of use of Google Cloud Platform.
Youtong Rong, Paul Bates, and Jeffrey Neal
Geosci. Model Dev., 16, 3291–3311, https://doi.org/10.5194/gmd-16-3291-2023, 2023
Short summary
A novel subgrid channel (SGC) model is developed for river–floodplain modelling, allowing utilization of subgrid-scale bathymetric information while performing computations on relatively coarse grids. By including adaptive artificial diffusion, potential numerical instability, which the original SGC solver had, in low-friction regions such as urban areas is addressed. Evaluation of the new SGC model through structured tests confirmed that the accuracy and stability have improved.
Xiaqiong Zhou and Hann-Ming Henry Juang
Geosci. Model Dev., 16, 3263–3274, https://doi.org/10.5194/gmd-16-3263-2023, 2023
Short summary
The National Centers for Environmental Prediction Global Forecast System version 16 experienced model instability failures in real-time runs, which were resolved by increasing the minimum thickness depth parameter. Further investigation revealed that the issue was caused by the advection of geopotential heights at the model's layer interfaces. By replacing high-order boundary conditions with zero-gradient boundary conditions for interface-wind reconstruction, the instability was effectively addressed.
Grant T. Euen, Shangxin Liu, Rene Gassmöller, Timo Heister, and Scott D. King
Geosci. Model Dev., 16, 3221–3239, https://doi.org/10.5194/gmd-16-3221-2023, 2023
Short summary
Due to the increasing availability of high-performance computing over the past few decades, numerical models have become an important tool for research. Here we test two geodynamic codes that produce such models: ASPECT, a newer code, and CitcomS, an older one. We show that they produce solutions that are extremely close. As methods and codes become more complex over time, showing reproducibility allows us to seamlessly link previously known information to modern methodologies.
Mohammad Kazem Sharifian, Georges Kesserwani, Alovya Ahmed Chowdhury, Jeffrey Neal, and Paul Bates
Geosci. Model Dev., 16, 2391–2413, https://doi.org/10.5194/gmd-16-2391-2023, 2023
Short summary
This paper describes a new release of the LISFLOOD-FP model for fast and efficient flood simulations. It features a new non-uniform grid generator that uses multiwavelet analyses to sensibly coarsen the resolution where the local topographic variations are smooth. Moreover, the model is parallelised on graphics processing units (GPUs) to further boost computational efficiency. The performance of the model is assessed for five real-world case studies, noting its potential applications.
Bruno K. Zürcher
Geosci. Model Dev., 16, 1697–1711, https://doi.org/10.5194/gmd-16-1697-2023, 2023
Short summary
We present a novel algorithm to efficiently compute Barnes interpolation, which is a method for transforming data values recorded at irregularly spaced points into a corresponding regular grid. In contrast to naive implementations with an algorithmic complexity that depends on the product of the number of sample points and the number of grid points, our approach reduces this dependency to their sum.
David H. Marsico and Paul A. Ullrich
Geosci. Model Dev., 16, 1537–1551, https://doi.org/10.5194/gmd-16-1537-2023, 2023
Short summary
Climate models involve several different components, such as the atmosphere, ocean, and land models. Information needs to be exchanged, or remapped, between these models, and devising algorithms for performing this exchange is important for ensuring the accuracy of climate simulations. In this paper, we examine the efficacy of several traditional and novel approaches to remapping on the sphere and demonstrate where our approaches offer improvement.
Michael Hillier, Florian Wellmann, Eric de Kemp, Ernst Schetselaar, Boyan Brodaric, and Karine Bédard
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2022-290, 2023
Revised manuscript accepted for GMD
Short summary
Neural networks can be used effectively to model three-dimensional geological structures from point data, sampling geological interfaces, units, and orientations of structural features. Existing neural network approaches for this type of modelling are advanced by the efficient incorporation of unconformities, new knowledge inputs, and new techniques to improve data fitting. These advances permit the modelling of large scale geological structures with low fitting error using noisy datasets.
Moritz Liebl, Jörg Robl, Stefan Hergarten, David Lundbek Egholm, and Kurt Stüwe
Geosci. Model Dev., 16, 1315–1343, https://doi.org/10.5194/gmd-16-1315-2023, 2023
Short summary
In this study, we benchmark a topography-based model for glacier erosion (OpenLEM) with a well-established process-based model (iSOSIA). Our experiments show that large-scale erosion patterns and particularly the transformation of valley length geometry from fluvial to glacial conditions are very similar in both models. This finding enables the application of OpenLEM to study the influence of climate and tectonics on glaciated mountains with reasonable computational effort on standard PCs.
James Kent, Thomas Melvin, and Golo Albert Wimmer
Geosci. Model Dev., 16, 1265–1276, https://doi.org/10.5194/gmd-16-1265-2023, 2023
Short summary
This paper introduces the Met Office's new shallow water model. The shallow water model is a building block towards the Met Office's new atmospheric dynamical core. The shallow water model is tested on a number of standard spherical shallow water test cases, including flow over mountains and unstable jets. Results show that the model produces similar results to other shallow water models in the literature.
Anthony Gruber, Max Gunzburger, Lili Ju, Rihui Lan, and Zhu Wang
Geosci. Model Dev., 16, 1213–1229, https://doi.org/10.5194/gmd-16-1213-2023, 2023
Short summary
This work applies a novel technical tool, multifidelity Monte Carlo (MFMC) estimation, to three climate-related benchmark experiments involving oceanic, atmospheric, and glacial modeling. By considering useful quantities such as maximum sea height and total (kinetic) energy, we show that MFMC leads to predictions which are more accurate and less costly than those obtained by standard methods. This suggests MFMC as a potential drop-in replacement for estimation in realistic climate models.
Piyoosh Jaysaval, Glenn E. Hammond, and Timothy C. Johnson
Geosci. Model Dev., 16, 961–976, https://doi.org/10.5194/gmd-16-961-2023, 2023
Short summary
We present a robust and highly scalable implementation of numerical forward modeling and inversion algorithms for geophysical electrical resistivity tomography data. The implementation is publicly available and developed within the framework of PFLOTRAN (http://www.pflotran.org), an open-source, state-of-the-art massively parallel subsurface flow and transport simulation code. The paper details all the theoretical and implementation aspects of the new capabilities along with test examples.
Lucas Schauer, Michael J. Schmidt, Nicholas B. Engdahl, Stephen D. Pankavich, David A. Benson, and Diogo Bolster
Geosci. Model Dev., 16, 833–849, https://doi.org/10.5194/gmd-16-833-2023, 2023
Short summary
We develop a multi-dimensional, parallelized domain decomposition strategy for mass-transfer particle tracking methods in two and three dimensions, investigate different procedures for decomposing the domain, and prescribe an optimal tiling based on physical problem parameters and the number of available CPU cores. For an optimally subdivided diffusion problem, the parallelized algorithm achieves nearly perfect linear speedup relative to the serial run, up to thousands of cores.
John Mern and Jef Caers
Geosci. Model Dev., 16, 289–313, https://doi.org/10.5194/gmd-16-289-2023, 2023
Short summary
In this work, we formulate the sequential geoscientific data acquisition problem as a problem that is similar to playing chess against nature, except the pieces are not fully observed. Solutions to such problems exist in AI but are rarely used in geoscientific data planning. We illustrate our approach on a simple 2D problem of mineral exploration.
Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead G. Russell
Geosci. Model Dev., 15, 9015–9029, https://doi.org/10.5194/gmd-15-9015-2022, 2022
Short summary
While the national ambient air quality standard of ozone is based on the 3-year average of the fourth highest 8 h maximum (MDA8) ozone concentrations, these predicted extreme values using numerical methods are always biased low. We built four computational models (GAM, MARS, random forest and SVR) to predict the fourth highest MDA8 ozone in Southern California using precursor emissions, meteorology and climatological patterns. All models presented acceptable performance, with GAM being the best.
Zhihao Wang, Jason Goetz, and Alexander Brenning
Geosci. Model Dev., 15, 8765–8784, https://doi.org/10.5194/gmd-15-8765-2022, 2022
Short summary
A lack of inventory data can be a limiting factor in developing landslide predictive models, which are crucial for supporting hazard policy and decision-making. We show how case-based reasoning and domain adaptation (transfer-learning techniques) can effectively retrieve similar landslide modeling situations for prediction in new data-scarce areas. Using cases in Italy, Austria, and Ecuador, our findings support the application of transfer learning for areas that require rapid model development.
Till Sachau, Haibin Yang, Justin Lang, Paul D. Bons, and Louis Moresi
Geosci. Model Dev., 15, 8749–8764, https://doi.org/10.5194/gmd-15-8749-2022, 2022
Short summary
Knowledge of the internal structures of the major continental ice sheets is improving, thanks to new investigative techniques. These structures are an essential indication of the flow behavior and dynamics of ice transport, which in turn is important for understanding the actual impact of the vast amounts of water trapped in continental ice sheets on global sea-level rise. The software studied here is specifically designed to simulate such structures and their evolution.
Keith J. Roberts, Alexandre Olender, Lucas Franceschini, Robert C. Kirby, Rafael S. Gioria, and Bruno S. Carmo
Geosci. Model Dev., 15, 8639–8667, https://doi.org/10.5194/gmd-15-8639-2022, 2022
Short summary
Finite-element methods (FEMs) permit the use of more flexible unstructured meshes but are rarely used in full waveform inversions (FWIs), an iterative process that reconstructs velocity models of earth’s subsurface, due to computational and memory storage costs. To reduce those costs, novel software is presented allowing the use of high-order mass-lumped FEMs on triangular meshes, together with a material-property mesh-adaptation performance-enhancing strategy, enabling its use in FWIs.
Konstantinos Papadakis, Yann Pfau-Kempf, Urs Ganse, Markus Battarbee, Markku Alho, Maxime Grandin, Maxime Dubart, Lucile Turc, Hongyang Zhou, Konstantinos Horaites, Ivan Zaitsev, Giulia Cozzani, Maarja Bussov, Evgeny Gordeev, Fasil Tesema, Harriet George, Jonas Suni, Vertti Tarvus, and Minna Palmroth
Geosci. Model Dev., 15, 7903–7912, https://doi.org/10.5194/gmd-15-7903-2022, 2022
Short summary
Vlasiator is a plasma simulation code that simulates the entire near-Earth space at a global scale. As 6D simulations require enormous amounts of computational resources, Vlasiator uses adaptive mesh refinement (AMR) to lighten the computational burden. However, due to Vlasiator’s grid topology, AMR simulations suffer from grid aliasing artifacts that affect the global results. In this work, we present and evaluate the performance of a mechanism for alleviating those artifacts.
Artur Safin, Damien Bouffard, Firat Ozdemir, Cintia L. Ramón, James Runnalls, Fotis Georgatos, Camille Minaudo, and Jonas Šukys
Geosci. Model Dev., 15, 7715–7730, https://doi.org/10.5194/gmd-15-7715-2022, 2022
Short summary
Reconciling the differences between numerical model predictions and observational data is always a challenge. In this paper, we investigate the viability of a novel approach to the calibration of a three-dimensional hydrodynamic model of Lake Geneva, where the target parameters are inferred in terms of distributions. We employ a filtering technique that generates physically consistent model trajectories and implement a neural network to enable bulk-to-skin temperature conversion.
Colin Grudzien and Marc Bocquet
Geosci. Model Dev., 15, 7641–7681, https://doi.org/10.5194/gmd-15-7641-2022, 2022
Short summary
Iterative optimization techniques, the state of the art in data assimilation, have largely focused on extending forecast accuracy to moderate- to long-range forecast systems. However, current methodology may not be cost-effective in reducing forecast errors in online, short-range forecast systems. We propose a novel optimization of these techniques for online, short-range forecast cycles, simultaneously providing an improvement in forecast accuracy and a reduction in the computational cost.
Yangyang Yu, Shaoqing Zhang, Haohuan Fu, Lixin Wu, Dexun Chen, Yang Gao, Zhiqiang Wei, Dongning Jia, and Xiaopei Lin
Geosci. Model Dev., 15, 6695–6708, https://doi.org/10.5194/gmd-15-6695-2022, 2022
Short summary
To understand the scientific consequence of perturbations caused by slave cores in heterogeneous computing environments, we examine the influence of perturbation amplitudes on the determination of the cloud bottom and cloud top and compute the probability density function (PDF) of generated clouds. A series of comparisons of the PDFs between homogeneous and heterogeneous systems show consistently acceptable error tolerances when using slave cores in heterogeneous computing environments.
Vijay S. Mahadevan, Jorge E. Guerra, Xiangmin Jiao, Paul Kuberry, Yipeng Li, Paul Ullrich, David Marsico, Robert Jacob, Pavel Bochev, and Philip Jones
Geosci. Model Dev., 15, 6601–6635, https://doi.org/10.5194/gmd-15-6601-2022, 2022
Short summary
Coupled Earth system models require transfer of field data between multiple components with varying spatial resolutions to determine the correct climate behavior. We present the Metrics for Intercomparison of Remapping Algorithms (MIRA) protocol to evaluate the accuracy, conservation properties, monotonicity, and local feature preservation of four different remapper algorithms for various unstructured mesh problems of interest. Future extensions to more practical use cases are also discussed.
Yilin Fang, L. Ruby Leung, Ryan Knox, Charlie Koven, and Ben Bond-Lamberty
Geosci. Model Dev., 15, 6385–6398, https://doi.org/10.5194/gmd-15-6385-2022, 2022
Short summary
Accounting for water movement in the soil and water transport within the plant is important for plant growth in Earth system modeling. We implemented different numerical approaches for a plant hydrodynamic model and compared their impacts on the simulated aboveground biomass (AGB) at single points and globally. We found care should be taken when discretizing the number of soil layers for numerical simulations as it can significantly affect AGB if accuracy and computational costs are of concern.
Andrew M. Bradley, Peter A. Bosler, and Oksana Guba
Geosci. Model Dev., 15, 6285–6310, https://doi.org/10.5194/gmd-15-6285-2022, 2022
Short summary
Tracer transport in atmosphere models can be computationally expensive. We describe a flexible and efficient interpolation semi-Lagrangian method, the Islet method. It permits using up to three grids that share an element grid: a dynamics grid for computing quantities such as the wind velocity; a physics parameterizations grid; and a tracer grid. The Islet method performs well on a number of verification problems and achieves high performance in the E3SM Atmosphere Model version 2.
Léo Pujol, Pierre-André Garambois, and Jérôme Monnier
Geosci. Model Dev., 15, 6085–6113, https://doi.org/10.5194/gmd-15-6085-2022, 2022
Short summary
This contribution presents a new numerical model for representing hydraulic–hydrological quantities at the basin scale. It allows modeling large areas at a low computational cost, with fine zooms where needed. It allows the integration of local and satellite measurements, via data assimilation methods, to improve the model's match to observations. Using this capability, good matches to in situ observations are obtained on a model of the complex Adour river network with fine zooms on floodplains.
Ludovic Räss, Ivan Utkin, Thibault Duretz, Samuel Omlin, and Yuri Y. Podladchikov
Geosci. Model Dev., 15, 5757–5786, https://doi.org/10.5194/gmd-15-5757-2022, 2022
Short summary
Continuum mechanics-based modelling of physical processes at large scale requires huge computational resources provided by massively parallel hardware such as graphical processing units. We present a suite of numerical algorithms, implemented using the Julia language, that efficiently leverages the parallelism. We demonstrate that our implementation is efficient, scalable and robust and showcase applications to various geophysical problems.
Meriem Krouma, Pascal Yiou, Céline Déandreis, and Soulivanh Thao
Geosci. Model Dev., 15, 4941–4958, https://doi.org/10.5194/gmd-15-4941-2022, 2022
Short summary
We evaluated the skill of a stochastic weather generator (SWG) to forecast precipitation at different time scales and in different areas of western Europe from analogs of Z500 hPa. The SWG has the skill to simulate precipitation for 5 and 10 d. We found that forecast weaknesses can be associated with specific weather patterns. The comparison with ECMWF forecasts confirms the skill of our model. This work is important because it provides information about weather forecasts over specific areas.
Piotr Dziekan and Piotr Zmijewski
Geosci. Model Dev., 15, 4489–4501, https://doi.org/10.5194/gmd-15-4489-2022, 2022
Short summary
Detailed computer simulations of clouds are important for understanding Earth's atmosphere and climate. The paper describes how the UWLCM has been adapted to work on supercomputers. A distinctive feature of UWLCM is that air flow is calculated by processors at the same time as cloud droplets are modeled by graphics cards. Thanks to this, use of computing resources is maximized and the time to complete simulations of large domains is not affected by communications between supercomputer nodes.
Hynek Bednář and Holger Kantz
Geosci. Model Dev., 15, 4147–4161, https://doi.org/10.5194/gmd-15-4147-2022, 2022
Short summary
A scale-dependent error growth described by a power law or by a quadratic hypothesis is studied in Lorenz’s system with three spatiotemporal levels. The validity of power law is extended by including a saturation effect. The quadratic hypothesis can only serve as a first guess. In addition, we study the initial error growth for the ECMWF forecast system. Fitting the parameters, we conclude that there is an intrinsic limit of predictability after 22 days.
Michael A. Olesik, Jakub Banaśkiewicz, Piotr Bartman, Manuel Baumgartner, Simon Unterstrasser, and Sylwester Arabas
Geosci. Model Dev., 15, 3879–3899, https://doi.org/10.5194/gmd-15-3879-2022, 2022
Short summary
In systems such as atmospheric clouds, droplets undergo growth through condensation of vapor. The broadness of the resultant size spectrum of droplets influences precipitation likelihood and the radiative properties of clouds. One of the inherent limitations of simulations of the problem is the so-called numerical diffusion causing overestimation of the spectrum width, hence the term numerical broadening. In the paper, we take a closer look at one of the algorithms used in this context: MPDATA.
Navjot Kukreja, Jan Hückelheim, Mathias Louboutin, John Washbourne, Paul H. J. Kelly, and Gerard J. Gorman
Geosci. Model Dev., 15, 3815–3829, https://doi.org/10.5194/gmd-15-3815-2022, 2022
Short summary
Full waveform inversion (FWI) is a partial-differential equation (PDE)-constrained optimization problem that is notorious for its high computational load and memory footprint. In this paper we present a method that combines recomputation with lossy compression to accelerate the computation with minimal loss of precision in the results. We show this using experiments running FWI with a variety of compression settings on a popular academic dataset.
Richard Scalzo, Mark Lindsay, Mark Jessell, Guillaume Pirot, Jeremie Giraud, Edward Cripps, and Sally Cripps
Geosci. Model Dev., 15, 3641–3662, https://doi.org/10.5194/gmd-15-3641-2022, 2022
Short summary
This paper addresses numerical challenges in reasoning about geological models constrained by sensor data, especially models that describe the history of an area in terms of a sequence of events. Our method ensures that small changes in simulated geological features, such as the position of a boundary between two rock layers, do not result in unrealistically large changes to resulting sensor measurements, as occur presently using several popular modeling packages.
Hiromasa Yoshimura
Geosci. Model Dev., 15, 2561–2597, https://doi.org/10.5194/gmd-15-2561-2022, 2022
Short summary
This paper proposes a new double Fourier series (DFS) method on a sphere that improves the numerical stability of a model compared with conventional DFS methods. The shallow-water model and the advection model using the new DFS method give stable results without the appearance of high-wavenumber noise near the poles. The model using the new DFS method is faster than the model using spherical harmonics (especially at high resolutions) and gives almost the same results.
Mirko Mälicke
Geosci. Model Dev., 15, 2505–2532, https://doi.org/10.5194/gmd-15-2505-2022, 2022
Short summary
I present SciKit-GStat, a well-documented and tested Python package for variogram estimation. The variogram is the core means of geostatistics, which almost all other methods rely on. Geostatistical interpolation and field generation are widespread in geoscience, e.g., for data assimilation or modeling. While SciKit-GStat focuses on effective and intuitive variogram estimation, it can interface with other prominent packages and make its variograms available for a multitude of methods.
Christopher J. L. D'Amboise, Michael Neuhauser, Michaela Teich, Andreas Huber, Andreas Kofler, Frank Perzl, Reinhard Fromm, Karl Kleemayr, and Jan-Thomas Fischer
Geosci. Model Dev., 15, 2423–2439, https://doi.org/10.5194/gmd-15-2423-2022, 2022
Short summary
The term gravitational mass flow (GMF) covers various natural hazard processes such as snow avalanches, rockfall, landslides, and debris flows. Here we present the open-source GMF simulation tool Flow-Py. The model equations are based on simple geometrical relations in three-dimensional terrain. We show that Flow-Py is an educational, innovative GMF simulation tool with three computational experiments: 1. validation of implementation, 2. performance, and 3. expandability.
Evan Baker, Anna B. Harper, Daniel Williamson, and Peter Challenor
Geosci. Model Dev., 15, 1913–1929, https://doi.org/10.5194/gmd-15-1913-2022, 2022
Short summary
We have adapted machine learning techniques to build a model of the land surface in Great Britain. The model was trained using data from a very complex land surface model called JULES. Our model is faster at producing simulations and predictions and can investigate many different scenarios, which can be used to improve our understanding of the climate and could also be used to help make local decisions.
Daichun Wang, Wei You, Zengliang Zang, Xiaobin Pan, Yiwen Hu, and Yanfei Liang
Geosci. Model Dev., 15, 1821–1840, https://doi.org/10.5194/gmd-15-1821-2022, 2022
Short summary
This paper presents a 3D variational data assimilation system for aerosol optical properties, including aerosol optical thickness (AOT) retrievals and lidar-based aerosol profiles, which was developed for a size-resolved sectional model in WRF-Chem. To directly assimilate aerosol optical properties, an observation operator based on the Mie scattering theory was designed. The results show that Himawari-8 AOT assimilation can significantly improve model aerosol analyses and forecasts.
Kevin Bulthuis and Eric Larour
Geosci. Model Dev., 15, 1195–1217, https://doi.org/10.5194/gmd-15-1195-2022, 2022
Short summary
We present and implement a stochastic solver to sample spatially and temporally varying uncertain input parameters in the Ice-sheet and Sea-level System Model, such as ice thickness or surface mass balance. We represent these sources of uncertainty using Gaussian random fields with a Matérn covariance function. We generate random samples of these random fields using an efficient computational approach based on solving a stochastic partial differential equation.
Urmas Raudsepp and Ilja Maljutenko
Geosci. Model Dev., 15, 535–551, https://doi.org/10.5194/gmd-15-535-2022, 2022
Short summary
A model's ability to reproduce the state of a simulated object is always a subject of discussion. A new method for the multivariate assessment of numerical model skills uses the K-means algorithm for clustering model errors. All available data that fall into the model domain and simulation period are incorporated into the skill assessment. The clustered errors are used for spatial and temporal analysis of the model accuracy. The method can be applied to different types of geoscientific models.
Emmanuel Wyser, Yury Alkhimenkov, Michel Jaboyedoff, and Yury Y. Podladchikov
Geosci. Model Dev., 14, 7749–7774, https://doi.org/10.5194/gmd-14-7749-2021, 2021
Short summary
We propose an implementation of the material point method using graphical processing units (GPUs) to solve elastoplastic problems in three-dimensional configurations, such as the granular collapse or the slumping mechanics, i.e., landslide. The computational power of GPUs promotes fast code executions, compared to a traditional implementation using central processing units (CPUs). This allows us to study complex three-dimensional problems tackling high spatial resolution.
Rafael Lago, Thomas Gastine, Tilman Dannert, Markus Rampp, and Johannes Wicht
Geosci. Model Dev., 14, 7477–7495, https://doi.org/10.5194/gmd-14-7477-2021, 2021
Short summary
In this work we discuss a two-dimensional distributed parallelization of MagIC, an open-source code for the numerical solution of the magnetohydrodynamics equations. Such a parallelization involves several challenges concerning the distribution of work and data. We detail our algorithm and compare it with the established, optimized, one-dimensional distribution in the context of the dynamo benchmark and discuss the merits of both implementations.
Moritz Lange, Henri Suominen, Mona Kurppa, Leena Järvi, Emilia Oikarinen, Rafael Savvides, and Kai Puolamäki
Geosci. Model Dev., 14, 7411–7424, https://doi.org/10.5194/gmd-14-7411-2021, 2021
Short summary
This study aims to replicate computationally expensive high-resolution large-eddy simulations (LESs) with regression models to simulate urban air quality and pollutant dispersion. The model development, including feature selection, model training and cross-validation, and detection of concept drift, has been described in detail. Of the models applied, log-linear regression shows the best performance. A regression model can replace LES unless high accuracy is needed.
Hynek Bednář, Aleš Raidl, and Jiří Mikšovský
Geosci. Model Dev., 14, 7377–7389, https://doi.org/10.5194/gmd-14-7377-2021, 2021
Short summary
Forecast errors in numerical weather prediction systems grow in time. To quantify the impacts of this growth, parametric error growth models may be employed. This study recalculates and newly defines parameters for several statistical models approximating error growth in the ECMWF forecasting system. Accurate parameter values are important because they are used to evaluate improvements of the forecasting systems or to estimate predictability.
Denise Degen, Cameron Spooner, Magdalena Scheck-Wenderoth, and Mauro Cacace
Geosci. Model Dev., 14, 7133–7153, https://doi.org/10.5194/gmd-14-7133-2021, 2021
Short summary
In times of worldwide energy transitions, an understanding of the subsurface is increasingly important to provide renewable energy sources such as geothermal energy. To validate our understanding of the subsurface we require data. However, the data are usually not distributed equally and introduce a potential misinterpretation of the subsurface. Therefore, in this study we investigate the influence of measurements on temperature distribution in the European Alps.
The requested paper has a corresponding corrigendum published. Please read the corrigendum first before downloading the article.
Short summary
In numerical weather prediction, data assimilation is frequently used to improve the accuracy of forecasts from equation-based models. In this work we use a machine learning framework that approximates a complex dynamical system given by the geopotential height. Instead of using an equation-based model, we use this machine-learned alternative to dramatically accelerate both the forecast and the assimilation of data, thereby reducing the need for large computational resources.