Submitted as: methods for assessment of models |
| 04 May 2020
Status: this preprint was under review for the journal GMD but the revision was not accepted.
From R-squared to coefficient of model accuracy for assessing "goodness-of-fits"
Charles Onyutha
Abstract. Modelers tend to focus more on advancing methods of statistical and mathematical modeling than developing novel techniques for comparing modeled results with observations or establishing metrics for model performance assessment. Perhaps solely the most extensively applied "goodness-of-fit" measure especially for assessing performance of regression models is the coefficient of determination R2. Normally, high R2 tends to be associated with an efficient model. Nevertheless, R2 has been cited to have no importance in the classical model of regression. Even in its use in descriptive statistics, R2 is known to have questionable justification. R2 is inadequate in assessing model performance because it does not give any information on the model residuals. Furthermore, R-squared can be low for an effective model. Contrastingly, a very poor model fit can yield high R2. Regressing X on Y yields R2 which is the same as that if Y is regressed on X thereby invalidating its use as a coefficient of determination. Taking into account the drawbacks of using R2, this paper introduces coefficient of model accuracy (CMA) the derivation of which comprises an analogy to the R2. However, instead of simply squaring an ordinary Pearson's product-moment correlation coefficient to obtain R2, CMA comprises the product of nonparametric sample correlation and model bias. Acceptability of the introduced method can be found demonstrated through comparison of results from simulations by hydrological models calibrated using CMA and other existing objective functions.
Received: 14 Feb 2020 – Discussion started: 04 May 2020
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
R2 despite its wide use in assessing model performance has several drawbacks. While taking into account the drawbacks, this paper introduces another metric (coefficient of model accuracy MCA) which is capable of assessing "goodness-of-fits". Stepwise derivation of CMA comprises an analogy to the R2. Suitability of CMA for assessing model performance was demonstrated through comparison of simulations by hydrological models calibrated using CMA and other existing objective functions.
R2 despite its wide use in assessing model performance has several drawbacks. While taking into...