Articles | Volume 15, issue 20
https://doi.org/10.5194/gmd-15-7791-2022
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/gmd-15-7791-2022
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Development of a regional feature selection-based machine learning system (RFSML v1.0) for air pollution forecasting over China
Li Fang
Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, School of Environmental Science and Engineering, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China
Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, School of Environmental Science and Engineering, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China
Arjo Segers
TNO, Department of Climate, Air and Sustainability, Utrecht, the Netherlands
Hai Xiang Lin
Institute of Environmental Sciences, Leiden University, Leiden, the Netherlands
Delft Institute of Applied Mathematics, Delft University of Technology, Delft, the Netherlands
Mijie Pang
Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, School of Environmental Science and Engineering, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China
Cong Xiao
Key Laboratory of Petroleum Engineering, Ministry of Education, China University of Petroleum, Beijing, China
Tuo Deng
Delft Institute of Applied Mathematics, Delft University of Technology, Delft, the Netherlands
Hong Liao
CORRESPONDING AUTHOR
Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, School of Environmental Science and Engineering, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China
Viewed
Total article views: 2,775 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 06 Jul 2022)
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
1,733 | 981 | 61 | 2,775 | 249 | 63 | 73 |
- HTML: 1,733
- PDF: 981
- XML: 61
- Total: 2,775
- Supplement: 249
- BibTeX: 63
- EndNote: 73
Total article views: 2,053 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 24 Oct 2022)
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
1,248 | 762 | 43 | 2,053 | 120 | 55 | 67 |
- HTML: 1,248
- PDF: 762
- XML: 43
- Total: 2,053
- Supplement: 120
- BibTeX: 55
- EndNote: 67
Total article views: 722 (including HTML, PDF, and XML)
Cumulative views and downloads
(calculated since 06 Jul 2022)
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
485 | 219 | 18 | 722 | 129 | 8 | 6 |
- HTML: 485
- PDF: 219
- XML: 18
- Total: 722
- Supplement: 129
- BibTeX: 8
- EndNote: 6
Viewed (geographical distribution)
Total article views: 2,775 (including HTML, PDF, and XML)
Thereof 2,730 with geography defined
and 45 with unknown origin.
Total article views: 2,053 (including HTML, PDF, and XML)
Thereof 2,053 with geography defined
and 0 with unknown origin.
Total article views: 722 (including HTML, PDF, and XML)
Thereof 651 with geography defined
and 71 with unknown origin.
Country | # | Views | % |
---|
Country | # | Views | % |
---|
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
1
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
1
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
1
Cited
12 citations as recorded by crossref.
- AI-driven approaches for air pollution modelling: A comprehensive systematic review L. Garbagna et al. 10.1016/j.envpol.2025.125937
- Application of artificial intelligence in air pollution monitoring and forecasting: A systematic review S. Chadalavada et al. 10.1016/j.envsoft.2024.106312
- A gridded air quality forecast through fusing site-available machine learning predictions from RFSML v1.0 and chemical transport model results from GEOS-Chem v13.1.0 using the ensemble Kalman filter L. Fang et al. 10.5194/gmd-16-4867-2023
- Conditional Feature Selection: Evaluating Model Averaging When Selecting Features with Shapley Values F. Huber & V. Steinhage 10.3390/geomatics4030016
- Entropy-based feature selection for capturing impacts in Earth system models with abrupt forcing J. Watkins et al. 10.1016/j.cam.2025.116724
- Integrating Machine Learning with Causal Inference to Improve Prediction of Ammonium Wet Deposition in the Pearl River Delta R. Lin et al. 10.3390/su17051970
- Observational operator for fair model evaluation with ground NO2 measurements L. Fang et al. 10.5194/gmd-17-8267-2024
- Enhancing PM$$_{2.5}$$ Air Pollution Forecasting with Novel Random Imputation Based on Hybrid RNN-Bidirectional GRU (nRI RNN-BiGRU) Model N. Ahmad & V. Kumar 10.1007/s42979-025-04167-y
- New data-driven estimation of metal element in rocks using a hyperspectral data and geochemical data X. Ma et al. 10.1016/j.oregeorev.2024.105877
- Data-Driven Approaches for Predicting and Forecasting Air Quality in Urban Areas C. Rosca et al. 10.3390/app15084390
- Machine learning-enabled estimation and high-resolution forecasting of atmospheric VOCs B. Lu et al. 10.1016/j.atmosenv.2025.121364
- Feature Ranking on Small Samples: A Bayes-Based Approach A. Vatian et al. 10.3390/e27080773
12 citations as recorded by crossref.
- AI-driven approaches for air pollution modelling: A comprehensive systematic review L. Garbagna et al. 10.1016/j.envpol.2025.125937
- Application of artificial intelligence in air pollution monitoring and forecasting: A systematic review S. Chadalavada et al. 10.1016/j.envsoft.2024.106312
- A gridded air quality forecast through fusing site-available machine learning predictions from RFSML v1.0 and chemical transport model results from GEOS-Chem v13.1.0 using the ensemble Kalman filter L. Fang et al. 10.5194/gmd-16-4867-2023
- Conditional Feature Selection: Evaluating Model Averaging When Selecting Features with Shapley Values F. Huber & V. Steinhage 10.3390/geomatics4030016
- Entropy-based feature selection for capturing impacts in Earth system models with abrupt forcing J. Watkins et al. 10.1016/j.cam.2025.116724
- Integrating Machine Learning with Causal Inference to Improve Prediction of Ammonium Wet Deposition in the Pearl River Delta R. Lin et al. 10.3390/su17051970
- Observational operator for fair model evaluation with ground NO2 measurements L. Fang et al. 10.5194/gmd-17-8267-2024
- Enhancing PM$$_{2.5}$$ Air Pollution Forecasting with Novel Random Imputation Based on Hybrid RNN-Bidirectional GRU (nRI RNN-BiGRU) Model N. Ahmad & V. Kumar 10.1007/s42979-025-04167-y
- New data-driven estimation of metal element in rocks using a hyperspectral data and geochemical data X. Ma et al. 10.1016/j.oregeorev.2024.105877
- Data-Driven Approaches for Predicting and Forecasting Air Quality in Urban Areas C. Rosca et al. 10.3390/app15084390
- Machine learning-enabled estimation and high-resolution forecasting of atmospheric VOCs B. Lu et al. 10.1016/j.atmosenv.2025.121364
- Feature Ranking on Small Samples: A Bayes-Based Approach A. Vatian et al. 10.3390/e27080773
Latest update: 08 Aug 2025
Short summary
This study proposes a regional feature selection-based machine learning system to predict short-term air quality in China. The system has a tool that can figure out the importance of input data for better prediction. It provides large-scale air quality prediction that exhibits improved interpretability, fewer training costs, and higher accuracy compared with a standard machine learning system. It can act as an early warning for citizens and reduce exposure to PM2.5 and other air pollutants.
This study proposes a regional feature selection-based machine learning system to predict...