Articles | Volume 14, issue 8
https://doi.org/10.5194/gmd-14-5205-2021
https://doi.org/10.5194/gmd-14-5205-2021
Development and technical paper
 | 
18 Aug 2021
Development and technical paper |  | 18 Aug 2021

Copula-based synthetic data augmentation for machine-learning emulators

David Meyer, Thomas Nagler, and Robin J. Hogan

Related authors

Evaluation of downward and upward solar irradiances simulated by the Integrated Forecasting System of ECMWF using airborne observations above Arctic low-level clouds
Hanno Müller, André Ehrlich, Evelyn Jäkel, Johannes Röttenbacher, Benjamin Kirbus, Michael Schäfer, Robin J. Hogan, and Manfred Wendisch
Atmos. Chem. Phys., 24, 4157–4175, https://doi.org/10.5194/acp-24-4157-2024,https://doi.org/10.5194/acp-24-4157-2024, 2024
Short summary
Evaluating the Representation of Arctic Cirrus Solar Radiative Effects in the IFS with Airborne Measurements
Johannes Röttenbacher, André Ehrlich, Hanno Müller, Florian Ewald, Anna E. Luebke, Benjamin Kirbus, Robin J. Hogan, and Manfred Wendisch
EGUsphere, https://doi.org/10.5194/egusphere-2024-281,https://doi.org/10.5194/egusphere-2024-281, 2024
Short summary
An intercomparison of EarthCARE cloud, aerosol, and precipitation retrieval products
Shannon L. Mason, Howard W. Barker, Jason N. S. Cole, Nicole Docter, David P. Donovan, Robin J. Hogan, Anja Hünerbein, Pavlos Kollias, Bernat Puigdomènech Treserras, Zhipeng Qu, Ulla Wandinger, and Gerd-Jan van Zadelhoff
Atmos. Meas. Tech., 17, 875–898, https://doi.org/10.5194/amt-17-875-2024,https://doi.org/10.5194/amt-17-875-2024, 2024
Short summary
Evaluation of vertically resolved longwave radiation in SPARTACUS-Urban 0.7.3 and the sensitivity to urban surface temperatures
Megan A. Stretton, William Morrison, Robin J. Hogan, and Sue Grimmond
Geosci. Model Dev., 16, 5931–5947, https://doi.org/10.5194/gmd-16-5931-2023,https://doi.org/10.5194/gmd-16-5931-2023, 2023
Short summary
A unified synergistic retrieval of clouds, aerosols, and precipitation from EarthCARE: the ACM-CAP product
Shannon L. Mason, Robin J. Hogan, Alessio Bozzo, and Nicola L. Pounder
Atmos. Meas. Tech., 16, 3459–3486, https://doi.org/10.5194/amt-16-3459-2023,https://doi.org/10.5194/amt-16-3459-2023, 2023
Short summary

Related subject area

Earth and space science informatics
Focal-TSMP: deep learning for vegetation health prediction and agricultural drought assessment from a regional climate simulation
Mohamad Hakam Shams Eddin and Juergen Gall
Geosci. Model Dev., 17, 2987–3023, https://doi.org/10.5194/gmd-17-2987-2024,https://doi.org/10.5194/gmd-17-2987-2024, 2024
Short summary
Tomofast-x 2.0: an open-source parallel code for inversion of potential field data with topography using wavelet compression
Vitaliy Ogarko, Kim Frankcombe, Taige Liu, Jeremie Giraud, Roland Martin, and Mark Jessell
Geosci. Model Dev., 17, 2325–2345, https://doi.org/10.5194/gmd-17-2325-2024,https://doi.org/10.5194/gmd-17-2325-2024, 2024
Short summary
Functional analysis of variance (ANOVA) for carbon flux estimates from remote sensing data
Jonathan Hobbs, Matthias Katzfuss, Hai Nguyen, Vineet Yadav, and Junjie Liu
Geosci. Model Dev., 17, 1133–1151, https://doi.org/10.5194/gmd-17-1133-2024,https://doi.org/10.5194/gmd-17-1133-2024, 2024
Short summary
The 4D reconstruction of dynamic geological evolution processes for renowned geological features
Jiateng Guo, Zhibin Liu, Xulei Wang, Lixin Wu, Shanjun Liu, and Yunqiang Li
Geosci. Model Dev., 17, 847–864, https://doi.org/10.5194/gmd-17-847-2024,https://doi.org/10.5194/gmd-17-847-2024, 2024
Short summary
Accelerating Lagrangian transport simulations on graphics processing units: performance optimizations of MPTRAC v2.6
Lars Hoffmann, Kaveh Haghighi Mood, Andreas Herten, Markus Hrywniak, Jiri Kraus, Jan Clemens, and Mingzhao Liu
EGUsphere, https://doi.org/10.5194/egusphere-2023-2547,https://doi.org/10.5194/egusphere-2023-2547, 2024
Short summary

Cited articles

Aas, K., Czado, C., Frigessi, A., and Bakken, H.: Pair-copula constructions of multiple dependence, Insur. Math. Econ., 44, 182–198, https://doi.org/10.1016/j.insmatheco.2007.02.001, 2009. 
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: A System for Large-Scale Machine Learning, in: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, 265–283, 2016. 
Bolton, T. and Zanna, L.: Applications of Deep Learning to Ocean Data Inference and Subgrid Parameterization, J. Adv. Model. Earth Syst., 11, 376–399, https://doi.org/10.1029/2018MS001472, 2019. 
Brenowitz, N. D. and Bretherton, C. S.: Prognostic Validation of a Neural Network Unified Physics Parameterization, Geophys. Res. Lett., 45, 6289–6298, https://doi.org/10.1029/2018GL078510, 2018. 
Cheruy, F., Chevallier, F., Morcrette, J.-J., Scott, N. A., and Chédin, A.: Une méthode utilisant les techniques neuronales pour le calcul rapide de la distribution verticale du bilan radiatif thermique terrestre, Comptes Rendus de l'Academie des Sciences Serie II, 322, 665–672, hal-02954375, 1996. 
Download
Short summary
A major limitation in training machine-learning emulators is often caused by the lack of data. This paper presents a cheap way to increase the size of training datasets using statistical techniques and thereby improve the performance of machine-learning emulators.