Articles | Volume 17, issue 9
https://doi.org/10.5194/gmd-17-3617-2024
Development and technical paper | 06 May 2024

Diagnosing drivers of PM2.5 simulation biases in China from meteorology, chemical composition, and emission sources using an efficient machine learning method

Shuai Wang, Mengyuan Zhang, Yueqi Gao, Peng Wang, Qingyan Fu, and Hongliang Zhang

Short summary
Numerical models are widely used in air pollution modeling but suffer from significant biases. The machine learning model designed in this study shows high efficiency in identifying such biases. Meteorology (relative humidity and cloud cover), chemical composition (secondary organic components and dust aerosols), and emission sources (residential activities) are diagnosed as the main drivers of bias in modeling PM2.5, a typical air pollutant. The results will help to improve numerical models.