Articles | Volume 17, issue 15
https://doi.org/10.5194/gmd-17-6007-2024
https://doi.org/10.5194/gmd-17-6007-2024
Model evaluation paper
 | 
14 Aug 2024
Model evaluation paper |  | 14 Aug 2024

Random forests with spatial proxies for environmental modelling: opportunities and pitfalls

Carles Milà, Marvin Ludwig, Edzer Pebesma, Cathryn Tonne, and Hanna Meyer

Related authors

kNNDM CV: k-fold nearest-neighbour distance matching cross-validation for map accuracy estimation
Jan Linnenbrink, Carles Milà, Marvin Ludwig, and Hanna Meyer
Geosci. Model Dev., 17, 5897–5912, https://doi.org/10.5194/gmd-17-5897-2024,https://doi.org/10.5194/gmd-17-5897-2024, 2024
Short summary

Related subject area

Earth and space science informatics
A GPU parallelization of the neXtSIM-DG dynamical core (v0.3.1)
Robert Jendersie, Christian Lessig, and Thomas Richter
Geosci. Model Dev., 18, 3017–3040, https://doi.org/10.5194/gmd-18-3017-2025,https://doi.org/10.5194/gmd-18-3017-2025, 2025
Short summary
The Earth System Grid Federation (ESGF) Virtual Aggregation (CMIP6 v20240125)
Ezequiel Cimadevilla, Bryan N. Lawrence, and Antonio S. Cofiño
Geosci. Model Dev., 18, 2461–2478, https://doi.org/10.5194/gmd-18-2461-2025,https://doi.org/10.5194/gmd-18-2461-2025, 2025
Short summary
Can AI be enabled to perform dynamical downscaling? A latent diffusion model to mimic kilometer-scale COSMO5.0_CLM9 simulations
Elena Tomasi, Gabriele Franch, and Marco Cristoforetti
Geosci. Model Dev., 18, 2051–2078, https://doi.org/10.5194/gmd-18-2051-2025,https://doi.org/10.5194/gmd-18-2051-2025, 2025
Short summary
Moving beyond post hoc explainable artificial intelligence: a perspective paper on lessons learned from dynamical climate modeling
Ryan J. O'Loughlin, Dan Li, Richard Neale, and Travis A. O'Brien
Geosci. Model Dev., 18, 787–802, https://doi.org/10.5194/gmd-18-787-2025,https://doi.org/10.5194/gmd-18-787-2025, 2025
Short summary
Remote-sensing-based forest canopy height mapping: some models are useful, but might they provide us with even more insights when combined?
Nikola Besic, Nicolas Picard, Cédric Vega, Jean-Daniel Bontemps, Lionel Hertzog, Jean-Pierre Renaud, Fajwel Fogel, Martin Schwartz, Agnès Pellissier-Tanon, Gabriel Destouet, Frédéric Mortier, Milena Planells-Rodriguez, and Philippe Ciais
Geosci. Model Dev., 18, 337–359, https://doi.org/10.5194/gmd-18-337-2025,https://doi.org/10.5194/gmd-18-337-2025, 2025
Short summary

Cited articles

Baddeley, A., Rubak, E., and Turner, R.: Spatial point patterns: methodology and applications with R, CRC Press, ISBN 9781482210200, 2015. a
Behrens, T. and Viscarra Rossel, R. A.: On the interpretability of predictors in spatial data science: The information horizon, Sci. Rep.-UK, 10, 16737, https://doi.org/10.1038/s41598-020-73773-y, 2020. a, b
Behrens, T., Schmidt, K., Viscarra Rossel, R. A., Gries, P., Scholten, T., and MacMillan, R. A.: Spatial modelling with Euclidean distance fields and machine learning, Eur. J. Soil Sci., 69, 757–770, 2018. a, b, c, d
Breiman, L.: Random forests, Mach. Learn., 45, 5–32, 2001. a
Breiman, L.: Manual on setting up, using, and understanding random forests v3.1, Statistics Department University of California Berkeley, CA, USA, 1, 3–42, https://www.stat.berkeley.edu/~breiman/Using_random_forests_V3.1.pdf (last access: 24 April 2023), 2002. a
Download
Short summary
Spatial proxies, such as coordinates and distances, are often used as predictors in random forest models for predictive mapping. In a simulation and two case studies, we investigated the conditions under which their use is appropriate. We found that spatial proxies are not always beneficial and should not be used as a default approach without careful consideration. We also provide insights into the reasons behind their suitability, how to detect them, and potential alternatives.
Share