General comments:
This publication presents a highly relevant validation technique for statistical crop models, i.e. a possibility to select the input variables and validate the model independently of the testing data set, which improves the robustness of the model. The LTO validation is presented in two case studies. Results show that the LTO validation leads to more robust results and enables a more realistically assessment of the forecasting performance. Because rigorous validation remains rare in the statistical crop modelling community, this paper is of high interest to other scientists. Even though the proposed approach has not often been applied yet, it is also not a new approach. Laudien et al. (2020 and 2022) and Meroni et al. (2021) present examples in which an independent variable selection has been applied to forecast crop yields. However, the explicit comparison of the influence of a different number of input variables (either inputs or potential predictors) on the model performance is – to our best knowledge - new and interesting for a wider audience. The paper is well-structured and has a clear, but partly colloquial language.
Major comments:
- The comparison of the selection of the best number of predictors and inputs between LOO and LTO does not include the results for fewer than 3 variables. As the RMSE for the validation and the testing increases with the number of inputs, the question arises whether the optimal value is even lower than 3. Also, the paper title suggests that the optimal number of inputs is found. The risk of too simplistic models is not explored in the paper, which would be an interesting addition to the presented results.
- Laudien et al. (2020): “Robustly forecasting maize yields in Tanzania based on climatic predictors”; Meroni et al. (2021): “Yield forecasting with machine learning and small data: What gains for grains?”; Laudien et al. (2022): “A forecast of staple crop production in Burkina Faso to enable early warnings of shortages in domestic food availability” provide examples of an independent variable selection in a statistical crop model to forecast yields. Whereas Laudien et al. call it “level 2 LOOCV”, Meroni et al. (2021) also call it nested oos validation. The statement that the proposed LTO approach has never been used before (Line 50-51) is therefore not correct. Please rephrase this sentence.
- As the authors state, the number of input variables, the number of potential input variables and the model type influence the robustness of the model/forecast and the inclusion of all of these aspects in the study is relevant. The terms model selection, model complexity or “finding the best model” are often used to describe the selection of inputs, potential inputs and model type even though these terms encompass also other aspects, as pointed out in section 2.3. These terms therefore do not adequately describe what the authors are examining, which leads to confusion. Also, the terms are used ambiguously, e.g. in Line 266 it says “Here, the model complexity is considered as a representative example of model selection.” The paper would benefit from a clearer language concerning what is actually investigated, i.e. either input selection, potential input selection or selection of the model type.
- The reference to Dinh et al. (2022) is made about 10 times in the paper – also as a way to justify some methodological decisions – even though this paper is not yet published and has the same first author. Please consider finding alternative literature sources.
Minor comments:
- L5: It is not impossible to split the data set into 3 in statistical modelling. Please rephrase.
- L33-34: “not an easy task” is a rather subjective statement. Sentence should be rephrased.
- L 43: This statement should be supported by references.
- L 46: Instead of “from” – it should be “for”.
- L74, 76, 79: Do you mean reproductive stage?
- L87: Please provide a reference.
- L99: Please reformulate the phrase. “change in time” is misleading. Do you mean changes in the timing of phenological stages?
- L115, L197: There is no comma after “i.e.”. Please, also check at other parts of the paper.
- L116: It should be “at” the equator, not “in” the equator.
- L117-118: How has the data being matched to administrative levels? This should be specified, as it influences the results.
- L148: The error term in the regression equation is missing.
- L179: The regression does not necessarily require an intercept (In case the dependent variable was demeaned beforehand, the intercept is no longer needed.). Therefore, it is not always n_input +1. Please rephrase, e.g. the LIN model “usually” requires n_input +1 inputs.
- L189-192: This statement does not belong to the section on Methods. Findings are presented later in the paper and should not already be mentioned at this point.
- L166-67: Please rephrase. Also many other factors can influence the model performance, not only complexity and potential input variables.
- L219: This is “often” the case in crop modelling studies, however there are also studies with big samples (e.g. Schauberger et al. (2022): French crop yield, area and production data for ten staple crops from 1900 to 2018 at county resolution, Lobell (2008): Prioritizing Climate Change Adaptation Needs for Food Security in 2030 or Renard, Tilmann (2019): National food production stabilized by crop diversity).
- L276: Applicability of a model is not only defined by its skill – please rephrase.
- L296: It should be “models” not “model”.
- L300: Robust statistical models can also be based on smaller samples than 19. Please make the sentence more general by e.g. saying: “when having a limited sample.”
- L306f: It is not illusionary to model complex weather-yield relations with a sample of 19 observations - many papers show that it is possible. The choice of input variables should also account for more complex weather-yield relations (i.e. only studying monthly mean temperature or precipitation sum might not be sufficient). Rather refrain from this statement.
- L312 and L430: Do you mean key phenological phases in plant development by moments of coffee?
- L312-316: This is a very interesting discussion as it explains why the selected variables potentially show a good performance in the model. However, this should be supported by literature.
- L322: Weather is only one factor among other factors. However, the examples are not well-chosen, i.e. by omitting the yield trend you deliberately omit the influence of e.g. agricultural practices (e.g. irrigation) that usually only change gradually over time. Also, one could argue that diseases are indirectly covered in statistical models. Please, refer to literature at this point to support your examples.
- L323-324: As pointed out earlier, even with smaller samples, the model can capture complex and robust weather-yield relationships. The model quality depends on many other factors such as the quality of the input data, the choice of potential predictors, the accuracy of the defined growing season etc. Please delete this sentence or support it with literature.
- L346: Please provide an explanation of why the validation and test errors show so much variability.
- L359 and L84: The reason for the selection of the case study regions should be made explicit (the selection of Cu M’gar as one district in 4 major coffee producing regions is based on a paper that is not yet published and the selection of the 10 maize producing regions in France is not explained at all).
- L425: The remaining variability could also stem from other factors (see comment to L322) and change this sentence accordingly.
- L427: A possibility is also that the input variables do not sufficiently cover crop sensitive climatic drivers as only mean temperature and precipitation sum are considered in this study.
- L434: The sentence does not make sense. Please rephrase.
- L444: What do you mean with “other crops will be investigated”? Afterwards you cite papers that already studied these crops I suppose.
- L446-447: The sentences are not easy to understand in terms of language. Please rephrase. |