Articles | Volume 17, issue 15
https://doi.org/10.5194/gmd-17-6007-2024
https://doi.org/10.5194/gmd-17-6007-2024
Model evaluation paper
 | 
14 Aug 2024
Model evaluation paper |  | 14 Aug 2024

Random forests with spatial proxies for environmental modelling: opportunities and pitfalls

Carles Milà, Marvin Ludwig, Edzer Pebesma, Cathryn Tonne, and Hanna Meyer

Related authors

kNNDM CV: k-fold nearest-neighbour distance matching cross-validation for map accuracy estimation
Jan Linnenbrink, Carles Milà, Marvin Ludwig, and Hanna Meyer
Geosci. Model Dev., 17, 5897–5912, https://doi.org/10.5194/gmd-17-5897-2024,https://doi.org/10.5194/gmd-17-5897-2024, 2024
Short summary

Cited articles

Baddeley, A., Rubak, E., and Turner, R.: Spatial point patterns: methodology and applications with R, CRC Press, ISBN 9781482210200, 2015. a
Behrens, T. and Viscarra Rossel, R. A.: On the interpretability of predictors in spatial data science: The information horizon, Sci. Rep.-UK, 10, 16737, https://doi.org/10.1038/s41598-020-73773-y, 2020. a, b
Behrens, T., Schmidt, K., Viscarra Rossel, R. A., Gries, P., Scholten, T., and MacMillan, R. A.: Spatial modelling with Euclidean distance fields and machine learning, Eur. J. Soil Sci., 69, 757–770, 2018. a, b, c, d
Breiman, L.: Random forests, Mach. Learn., 45, 5–32, 2001. a
Breiman, L.: Manual on setting up, using, and understanding random forests v3.1, Statistics Department University of California Berkeley, CA, USA, 1, 3–42, https://www.stat.berkeley.edu/~breiman/Using_random_forests_V3.1.pdf (last access: 24 April 2023), 2002. a
Download
Short summary
Spatial proxies, such as coordinates and distances, are often used as predictors in random forest models for predictive mapping. In a simulation and two case studies, we investigated the conditions under which their use is appropriate. We found that spatial proxies are not always beneficial and should not be used as a default approach without careful consideration. We also provide insights into the reasons behind their suitability, how to detect them, and potential alternatives.
Share