Articles | Volume 15, issue 9
https://doi.org/10.5194/gmd-15-3519-2022
https://doi.org/10.5194/gmd-15-3519-2022
Methods for assessment of models
 | 
05 May 2022
Methods for assessment of models |  | 05 May 2022

Nested leave-two-out cross-validation for the optimal crop yield model selection

Thi Lan Anh Dinh and Filipe Aires

Related authors

Impacts of land-use change on biospheric carbon: an oriented benchmark using ORCHIDEE land surface model
Thi Lan Anh Dinh, Daniel Goll, Philippe Ciais, and Ronny Lauerwald
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2024-42,https://doi.org/10.5194/gmd-2024-42, 2024
Preprint under review for GMD
Short summary

Related subject area

Integrated assessment modeling
Modelling long-term industry energy demand and CO2 emissions in the system context using REMIND (version 3.1.0)
Michaja Pehl, Felix Schreyer, and Gunnar Luderer
Geosci. Model Dev., 17, 2015–2038, https://doi.org/10.5194/gmd-17-2015-2024,https://doi.org/10.5194/gmd-17-2015-2024, 2024
Short summary
Minimal variance-based outlier detection method using forward search model error in a leveling network
Utkan Mustafa Durdağ
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2023-210,https://doi.org/10.5194/gmd-2023-210, 2023
Revised manuscript accepted for GMD
Short summary
Carbon Monitor Power - Simulators (CMP-SIM v1.0) across countries: a data-driven approach to simulate daily power demand
Léna Gurriaran, Yannig Goude, Katsumasa Tanaka, Biqing Zhu, Zhu Deng, Xuanren Song, and Philippe Ciais
EGUsphere, https://doi.org/10.5194/egusphere-2023-1313,https://doi.org/10.5194/egusphere-2023-1313, 2023
Short summary
CLASH – Climate-responsive Land Allocation model with carbon Storage and Harvests
Tommi Ekholm, Nadine-Cyra Freistetter, Aapo Rautiainen, and Laura Thölix
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2023-146,https://doi.org/10.5194/gmd-2023-146, 2023
Revised manuscript accepted for GMD
Short summary
Bidirectional coupling of the long-term integrated assessment model REgional Model of INvestments and Development (REMIND) v3.0.0 with the hourly power sector model Dispatch and Investment Evaluation Tool with Endogenous Renewables (DIETER) v1.0.2
Chen Chris Gong, Falko Ueckerdt, Robert Pietzcker, Adrian Odenweller, Wolf-Peter Schill, Martin Kittel, and Gunnar Luderer
Geosci. Model Dev., 16, 4977–5033, https://doi.org/10.5194/gmd-16-4977-2023,https://doi.org/10.5194/gmd-16-4977-2023, 2023
Short summary

Cited articles

Agri4cast: Crop Calendar, https://agri4cast.jrc.ec.europa.eu/DataPortal/Index.aspx?o=, last access: 20 June 2021. a
Allen, D. M.: The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction, Technometrics, 16, 125–127, https://doi.org/10.1080/00401706.1974.10489157, 1974. a, b
Amarasinghe, U. A., Hoanh, C. T., D'haeze, D., and Hung, T. Q.: Toward sustainable coffee production in Vietnam: More coffee with less water, Agr. Syst., 136, 96–105, https://doi.org/10.1016/j.agsy.2015.02.008, 2015. a
Ambroise, C. and McLachlan, G. J.: Selection bias in gene extraction on the basis of microarray gene-expression data, P. Natl. Acad. Sci. USA, 99, 6562–6566, https://doi.org/10.1073/pnas.102102699, 2002. a
Anh, D. T. L. and Filipe, A.: Code and Data for the Leave-Two-Out Method, Zenodo [code], https://doi.org/10.5281/zenodo.5159363, 2021. a
Download
Short summary
We proposed the leave-two-out method (i.e. one particular implementation of the nested cross-validation) to determine the optimal statistical crop model (using the validation dataset) and estimate its true generalization ability (using the testing dataset). This approach is applied to two examples (robusta coffee in Cu M'gar and grain maize in France). The results suggested that the simple models are more suitable in crop modelling where a limited number of samples is available.