Articles | Volume 15, issue 9
https://doi.org/10.5194/gmd-15-3519-2022
https://doi.org/10.5194/gmd-15-3519-2022
Methods for assessment of models
 | 
05 May 2022
Methods for assessment of models |  | 05 May 2022

Nested leave-two-out cross-validation for the optimal crop yield model selection

Thi Lan Anh Dinh and Filipe Aires

Related authors

Impacts of land-use change on biospheric carbon: an oriented benchmark using the ORCHIDEE land surface model
Thi Lan Anh Dinh, Daniel Goll, Philippe Ciais, and Ronny Lauerwald
Geosci. Model Dev., 17, 6725–6744, https://doi.org/10.5194/gmd-17-6725-2024,https://doi.org/10.5194/gmd-17-6725-2024, 2024
Short summary

Related subject area

Integrated assessment modeling
GCAM–GLORY v1.0: representing global reservoir water storage in a multi-sector human–Earth system model
Mengqi Zhao, Thomas B. Wild, Neal T. Graham, Son H. Kim, Matthew Binsted, A. F. M. Kamal Chowdhury, Siwa Msangi, Pralit L. Patel, Chris R. Vernon, Hassan Niazi, Hong-Yi Li, and Guta W. Abeshu
Geosci. Model Dev., 17, 5587–5617, https://doi.org/10.5194/gmd-17-5587-2024,https://doi.org/10.5194/gmd-17-5587-2024, 2024
Short summary
pathways-ensemble-analysis v1.0.0: an open-source library for systematic and robust analysis of pathways ensembles
Lara Welder, Neil Grant, and Matthew J. Gidden
EGUsphere, https://doi.org/10.5194/egusphere-2024-761,https://doi.org/10.5194/egusphere-2024-761, 2024
Short summary
CLASH – Climate-responsive Land Allocation model with carbon Storage and Harvests
Tommi Ekholm, Nadine-Cyra Freistetter, Aapo Rautiainen, and Laura Thölix
Geosci. Model Dev., 17, 3041–3062, https://doi.org/10.5194/gmd-17-3041-2024,https://doi.org/10.5194/gmd-17-3041-2024, 2024
Short summary
Carbon Monitor Power-Simulators (CMP-SIM v1.0) across countries: a data-driven approach to simulate daily power generation
Léna Gurriaran, Yannig Goude, Katsumasa Tanaka, Biqing Zhu, Zhu Deng, Xuanren Song, and Philippe Ciais
Geosci. Model Dev., 17, 2663–2682, https://doi.org/10.5194/gmd-17-2663-2024,https://doi.org/10.5194/gmd-17-2663-2024, 2024
Short summary
Intercomparison of multiple two-way coupled meteorology and air quality models (WRF v4.1.1–CMAQ v5.3.1, WRF–Chem v4.1.1, and WRF v3.7.1–CHIMERE v2020r1) in eastern China
Chao Gao, Xuelei Zhang, Aijun Xiu, Qingqing Tong, Hongmei Zhao, Shichun Zhang, Guangyi Yang, Mengduo Zhang, and Shengjin Xie
Geosci. Model Dev., 17, 2471–2492, https://doi.org/10.5194/gmd-17-2471-2024,https://doi.org/10.5194/gmd-17-2471-2024, 2024
Short summary

Cited articles

Agri4cast: Crop Calendar, https://agri4cast.jrc.ec.europa.eu/DataPortal/Index.aspx?o=, last access: 20 June 2021. a
Allen, D. M.: The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction, Technometrics, 16, 125–127, https://doi.org/10.1080/00401706.1974.10489157, 1974. a, b
Amarasinghe, U. A., Hoanh, C. T., D'haeze, D., and Hung, T. Q.: Toward sustainable coffee production in Vietnam: More coffee with less water, Agr. Syst., 136, 96–105, https://doi.org/10.1016/j.agsy.2015.02.008, 2015. a
Ambroise, C. and McLachlan, G. J.: Selection bias in gene extraction on the basis of microarray gene-expression data, P. Natl. Acad. Sci. USA, 99, 6562–6566, https://doi.org/10.1073/pnas.102102699, 2002. a
Anh, D. T. L. and Filipe, A.: Code and Data for the Leave-Two-Out Method, Zenodo [code], https://doi.org/10.5281/zenodo.5159363, 2021. a
Download
Short summary
We proposed the leave-two-out method (i.e. one particular implementation of the nested cross-validation) to determine the optimal statistical crop model (using the validation dataset) and estimate its true generalization ability (using the testing dataset). This approach is applied to two examples (robusta coffee in Cu M'gar and grain maize in France). The results suggested that the simple models are more suitable in crop modelling where a limited number of samples is available.