Articles | Volume 17, issue 15
https://doi.org/10.5194/gmd-17-6007-2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/gmd-17-6007-2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Random forests with spatial proxies for environmental modelling: opportunities and pitfalls
Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
Universitat Pompeu Fabra (UPF), Barcelona, Spain
Marvin Ludwig
Institute of Landscape Ecology, University of Münster, Münster, Germany
Edzer Pebesma
Institute for Geoinformatics, University of Münster, Münster, Germany
Cathryn Tonne
Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
Universitat Pompeu Fabra (UPF), Barcelona, Spain
CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
Hanna Meyer
Institute of Landscape Ecology, University of Münster, Münster, Germany
Viewed
Total article views: 4,029 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 24 Jan 2024)
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 3,328 | 609 | 92 | 4,029 | 98 | 158 |
- HTML: 3,328
- PDF: 609
- XML: 92
- Total: 4,029
- BibTeX: 98
- EndNote: 158
Total article views: 3,409 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 14 Aug 2024)
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 2,883 | 464 | 62 | 3,409 | 74 | 141 |
- HTML: 2,883
- PDF: 464
- XML: 62
- Total: 3,409
- BibTeX: 74
- EndNote: 141
Total article views: 620 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 24 Jan 2024)
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 445 | 145 | 30 | 620 | 24 | 17 |
- HTML: 445
- PDF: 145
- XML: 30
- Total: 620
- BibTeX: 24
- EndNote: 17
Viewed (geographical distribution)
Total article views: 4,029 (including HTML, PDF, and XML)
Thereof 3,927 with geography defined
and 102 with unknown origin.
Total article views: 3,409 (including HTML, PDF, and XML)
Thereof 3,290 with geography defined
and 119 with unknown origin.
Total article views: 620 (including HTML, PDF, and XML)
Thereof 620 with geography defined
and 0 with unknown origin.
| Country | # | Views | % |
|---|
| Country | # | Views | % |
|---|
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
1
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
1
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
1
Cited
22 citations as recorded by crossref.
- Assessing human-caused wildfire ignition likelihood across Europe P. Gelabert et al.
- Multilevel small-area childhood stunting risk estimation: Insights from spatial ensemble learning, agro-ecological and environmentally remotely sensed indicators G. Nduwayezu et al.
- Hawai‘i's pelagic longline fishery demonstrates the need to consider multispecies impacts in bluewater time-area closures J. Van Wert et al.
- Phenotyping maize stay green traits via in situ leaf hyperspectral reflectance sensing H. Elsharawy et al.
- The utility of combining deep learning with metabarcoding to model biodiversity dynamics at a national scale A. Baggström et al.
- An assessment of spatial random forests for environmental mapping: the case of groundwater nitrate concentration J. Frank et al.
- Machine learning for predictive mapping of exceedance probabilities for potentially toxic elements in Czech farmland J. Skála et al.
- Limited microclimatic buffering capacity in boreal forests calls for sustainable management strategies I. Starck et al.
- Summer foraging habitat suitability for highly mobile male beluga whales in the Eastern Beaufort Sea and Arctic Archipelago L. Storrie et al.
- Moving Beyond Temperature Metrics in Coral Bleaching Prediction Using Interpretable Machine Learning M. Cheung et al.
- Disentangling Climatic and Surface-Physical Drivers of the Urban Heat Island Using Explainable AI Across U.S. Cities O. Aljarrah & D. Goulias
- Urban Heat Mapping Strategies for Predicting Near-Surface Air Temperature in Unsampled Cities in Iowa M. Ecker et al.
- Importance ranking of data and model uncertainties in quantile regression forest-based spatial predictions when data are sparse, imprecise and clustered J. Rohmer
- Downscaling local distribution of cattle over Guadeloupe archipelago: An adapted method for disaggregating census data V. Dufleit et al.
- Harnessing vegetation indices and remote sensing to assess the impact of Cyclone Kenneth on banana plantations: Insights from Ngazidja Island (Comoros) A. Abdoussalami et al.
- A geographically weighted XGBoost framework for Pixel-Level modeling of vegetation responses using Multi-Source Earth Observation data Y. Khosravi & T. Ouarda
- An interpretable machine learning approach for alkalinity reconstruction in the Mediterranean Sea T. Tonelli et al.
- Kriging prior regression: A case for kriging-based spatial features with TabPFN in soil mapping J. Schmidinger et al.
- Spatial autocorrelation in machine learning for modelling soil organic carbon A. Kmoch et al.
- Deciphering environmental determinants of soil metals in dust-influenced arid soils through interpretable models across data-availability scenarios Z. Ebrahimi-Khusfi et al.
- Predictive Modeling and Simulation of CO2 Trapping Mechanisms: Insights into Efficiency and Long-Term Sequestration Strategies O. Ejehu et al.
- Predicting thermal trends in smart buildings using machine learning: a case study C. Mejía Rodriguez et al.
22 citations as recorded by crossref.
- Assessing human-caused wildfire ignition likelihood across Europe P. Gelabert et al.
- Multilevel small-area childhood stunting risk estimation: Insights from spatial ensemble learning, agro-ecological and environmentally remotely sensed indicators G. Nduwayezu et al.
- Hawai‘i's pelagic longline fishery demonstrates the need to consider multispecies impacts in bluewater time-area closures J. Van Wert et al.
- Phenotyping maize stay green traits via in situ leaf hyperspectral reflectance sensing H. Elsharawy et al.
- The utility of combining deep learning with metabarcoding to model biodiversity dynamics at a national scale A. Baggström et al.
- An assessment of spatial random forests for environmental mapping: the case of groundwater nitrate concentration J. Frank et al.
- Machine learning for predictive mapping of exceedance probabilities for potentially toxic elements in Czech farmland J. Skála et al.
- Limited microclimatic buffering capacity in boreal forests calls for sustainable management strategies I. Starck et al.
- Summer foraging habitat suitability for highly mobile male beluga whales in the Eastern Beaufort Sea and Arctic Archipelago L. Storrie et al.
- Moving Beyond Temperature Metrics in Coral Bleaching Prediction Using Interpretable Machine Learning M. Cheung et al.
- Disentangling Climatic and Surface-Physical Drivers of the Urban Heat Island Using Explainable AI Across U.S. Cities O. Aljarrah & D. Goulias
- Urban Heat Mapping Strategies for Predicting Near-Surface Air Temperature in Unsampled Cities in Iowa M. Ecker et al.
- Importance ranking of data and model uncertainties in quantile regression forest-based spatial predictions when data are sparse, imprecise and clustered J. Rohmer
- Downscaling local distribution of cattle over Guadeloupe archipelago: An adapted method for disaggregating census data V. Dufleit et al.
- Harnessing vegetation indices and remote sensing to assess the impact of Cyclone Kenneth on banana plantations: Insights from Ngazidja Island (Comoros) A. Abdoussalami et al.
- A geographically weighted XGBoost framework for Pixel-Level modeling of vegetation responses using Multi-Source Earth Observation data Y. Khosravi & T. Ouarda
- An interpretable machine learning approach for alkalinity reconstruction in the Mediterranean Sea T. Tonelli et al.
- Kriging prior regression: A case for kriging-based spatial features with TabPFN in soil mapping J. Schmidinger et al.
- Spatial autocorrelation in machine learning for modelling soil organic carbon A. Kmoch et al.
- Deciphering environmental determinants of soil metals in dust-influenced arid soils through interpretable models across data-availability scenarios Z. Ebrahimi-Khusfi et al.
- Predictive Modeling and Simulation of CO2 Trapping Mechanisms: Insights into Efficiency and Long-Term Sequestration Strategies O. Ejehu et al.
- Predicting thermal trends in smart buildings using machine learning: a case study C. Mejía Rodriguez et al.
Saved (final revised paper)
Latest update: 28 Apr 2026
Short summary
Spatial proxies, such as coordinates and distances, are often used as predictors in random forest models for predictive mapping. In a simulation and two case studies, we investigated the conditions under which their use is appropriate. We found that spatial proxies are not always beneficial and should not be used as a default approach without careful consideration. We also provide insights into the reasons behind their suitability, how to detect them, and potential alternatives.
Spatial proxies, such as coordinates and distances, are often used as predictors in random...