Articles | Volume 14, issue 8
https://doi.org/10.5194/gmd-14-5205-2021
https://doi.org/10.5194/gmd-14-5205-2021
Development and technical paper
 | 
18 Aug 2021
Development and technical paper |  | 18 Aug 2021

Copula-based synthetic data augmentation for machine-learning emulators

David Meyer, Thomas Nagler, and Robin J. Hogan

Related authors

Variation in shortwave water vapour continuum and impact on clear-sky shortwave radiative feedback
Kaah P. Menang, Stefan A. Buehler, Lukas Kluft, Robin J. Hogan, and Florian E. Roemer
EGUsphere, https://doi.org/10.5194/egusphere-2024-3051,https://doi.org/10.5194/egusphere-2024-3051, 2024
This preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).
Short summary
Radiative Closure Assessment of Retrieved Cloud and Aerosol Properties for the EarthCARE Mission: The ACMB-DF Product
Howard W. Barker, Jason N. S. Cole, Najda Villefranque, Zhipeng Qu, Almudena Velázquez Blázquez, Carlos Domenech, Shannon L. Mason, and Robin J. Hogan
EGUsphere, https://doi.org/10.5194/egusphere-2024-1651,https://doi.org/10.5194/egusphere-2024-1651, 2024
Short summary
Inclusion of the ECMWF ecRad radiation scheme (v1.5.0) in the MAR model (v3.14), regional evaluation for Belgium and assessment of surface shortwave spectral fluxes at Uccle observatory
Jean-François Grailet, Robin J. Hogan, Nicolas Ghilain, Xavier Fettweis, and Marilaure Grégoire
EGUsphere, https://doi.org/10.5194/egusphere-2024-1858,https://doi.org/10.5194/egusphere-2024-1858, 2024
Short summary
Evaluating the representation of Arctic cirrus solar radiative effects in the Integrated Forecasting System with airborne measurements
Johannes Röttenbacher, André Ehrlich, Hanno Müller, Florian Ewald, Anna E. Luebke, Benjamin Kirbus, Robin J. Hogan, and Manfred Wendisch
Atmos. Chem. Phys., 24, 8085–8104, https://doi.org/10.5194/acp-24-8085-2024,https://doi.org/10.5194/acp-24-8085-2024, 2024
Short summary
Coupling the urban canopy model TEB (SURFEXv9.0) with the radiation model SPARTACUS-Urbanv0.6.1 for more realistic urban radiative exchange calculation
Robert Schoetter, Robin James Hogan, Cyril Caliot, and Valéry Masson
EGUsphere, https://doi.org/10.5194/egusphere-2024-1118,https://doi.org/10.5194/egusphere-2024-1118, 2024
Short summary

Related subject area

Earth and space science informatics
The effect of lossy compression of numerical weather prediction data on data analysis: a case study using enstools-compression 2023.11
Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig
Geosci. Model Dev., 17, 8909–8925, https://doi.org/10.5194/gmd-17-8909-2024,https://doi.org/10.5194/gmd-17-8909-2024, 2024
Short summary
GNNWR: an open-source package of spatiotemporal intelligent regression methods for modeling spatial and temporal nonstationarity
Ziyu Yin, Jiale Ding, Yi Liu, Ruoxu Wang, Yige Wang, Yijun Chen, Jin Qi, Sensen Wu, and Zhenhong Du
Geosci. Model Dev., 17, 8455–8468, https://doi.org/10.5194/gmd-17-8455-2024,https://doi.org/10.5194/gmd-17-8455-2024, 2024
Short summary
Random forests with spatial proxies for environmental modelling: opportunities and pitfalls
Carles Milà, Marvin Ludwig, Edzer Pebesma, Cathryn Tonne, and Hanna Meyer
Geosci. Model Dev., 17, 6007–6033, https://doi.org/10.5194/gmd-17-6007-2024,https://doi.org/10.5194/gmd-17-6007-2024, 2024
Short summary
An improved global pressure and zenith wet delay model with optimized vertical correction considering the spatiotemporal variability in multiple height-scale factors
Chunhua Jiang, Xiang Gao, Huizhong Zhu, Shuaimin Wang, Sixuan Liu, Shaoni Chen, and Guangsheng Liu
Geosci. Model Dev., 17, 5939–5959, https://doi.org/10.5194/gmd-17-5939-2024,https://doi.org/10.5194/gmd-17-5939-2024, 2024
Short summary
kNNDM CV: k-fold nearest-neighbour distance matching cross-validation for map accuracy estimation
Jan Linnenbrink, Carles Milà, Marvin Ludwig, and Hanna Meyer
Geosci. Model Dev., 17, 5897–5912, https://doi.org/10.5194/gmd-17-5897-2024,https://doi.org/10.5194/gmd-17-5897-2024, 2024
Short summary

Cited articles

Aas, K., Czado, C., Frigessi, A., and Bakken, H.: Pair-copula constructions of multiple dependence, Insur. Math. Econ., 44, 182–198, https://doi.org/10.1016/j.insmatheco.2007.02.001, 2009. 
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: A System for Large-Scale Machine Learning, in: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, 265–283, 2016. 
Bolton, T. and Zanna, L.: Applications of Deep Learning to Ocean Data Inference and Subgrid Parameterization, J. Adv. Model. Earth Syst., 11, 376–399, https://doi.org/10.1029/2018MS001472, 2019. 
Brenowitz, N. D. and Bretherton, C. S.: Prognostic Validation of a Neural Network Unified Physics Parameterization, Geophys. Res. Lett., 45, 6289–6298, https://doi.org/10.1029/2018GL078510, 2018. 
Cheruy, F., Chevallier, F., Morcrette, J.-J., Scott, N. A., and Chédin, A.: Une méthode utilisant les techniques neuronales pour le calcul rapide de la distribution verticale du bilan radiatif thermique terrestre, Comptes Rendus de l'Academie des Sciences Serie II, 322, 665–672, hal-02954375, 1996. 
Download
Short summary
A major limitation in training machine-learning emulators is often caused by the lack of data. This paper presents a cheap way to increase the size of training datasets using statistical techniques and thereby improve the performance of machine-learning emulators.