Articles | Volume 16, issue 4
https://doi.org/10.5194/gmd-16-1179-2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/gmd-16-1179-2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
The impact of altering emission data precision on compression efficiency and accuracy of simulations of the community multiscale air quality model
Michael S. Walters
Atmospheric and Environmental Systems Modeling Division,
Center for Environmental Measurement and Modeling, Office of Research and
Development, US Environmental Protection Agency, Research Triangle Park,
NC, USA
Oak Ridge Associated Universities, Oak Ridge, TN, USA
David C. Wong
CORRESPONDING AUTHOR
Atmospheric and Environmental Systems Modeling Division,
Center for Environmental Measurement and Modeling, Office of Research and
Development, US Environmental Protection Agency, Research Triangle Park,
NC, USA
Related authors
No articles found.
Shang Wu, David C. Wong, Jiandong Wang, Yuzhi Jin, Junjun Li, and Chunsong Lu
EGUsphere, https://doi.org/10.5194/egusphere-2025-4811, https://doi.org/10.5194/egusphere-2025-4811, 2025
This preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).
Short summary
Short summary
Climate simulations create huge amounts of data that are difficult to store and share. In this study, we developed a simple method to reduce file sizes while keeping the scientific information accurate. By carefully shortening numbers before applying compression, we tested different settings on U.S. weather simulations and found ways to save space without losing key results. This approach helps scientists work more efficiently and supports better access to climate data for the wider community.
Qingfang Su, Yifei Chen, Yangjun Wang, David C. Wong, Havala O. T. Pye, Ling Huang, Golam Sarwar, Benjamin Murphy, Bryan Place, and Li Li
EGUsphere, https://doi.org/10.5194/egusphere-2025-3627, https://doi.org/10.5194/egusphere-2025-3627, 2025
This preprint is open for discussion and under review for Geoscientific Model Development (GMD).
Short summary
Short summary
This study evaluated the PM2.5 simulation by the latest CRACMM mechanism coupled with CMAQ, covering different seasons and specific regions over China. Results derived by CRACMM are compared with two well-established chemical mechanisms, Saprc07 and CB6. Differences in PM2.5 and SOA drivers between CRACMM and the two existing mechanisms are further explored. Results provide a solid foundation for the further application of CRACMM in understanding and regulating air pollution globally.
Yuzhi Jin, Jiandong Wang, Chao Liu, David C. Wong, Golam Sarwar, Kathleen M. Fahey, Shang Wu, Jiaping Wang, Jing Cai, Zeyuan Tian, Zhouyang Zhang, Jia Xing, Aijun Ding, and Shuxiao Wang
Atmos. Chem. Phys., 25, 2613–2630, https://doi.org/10.5194/acp-25-2613-2025, https://doi.org/10.5194/acp-25-2613-2025, 2025
Short summary
Short summary
Black carbon (BC) affects climate and the environment, and its aging process alters its properties. Current models, like WRF-CMAQ, lack full accounting for it. We developed the WRF-CMAQ-BCG model to better represent BC aging by introducing bare and coated BC species and their conversion. The WRF-CMAQ-BCG model introduces the capability to simulate BC mixing states and bare and coated BC wet deposition, and it improves the accuracy of BC mass concentration and aerosol optics.
David C. Wong, Jeff Willison, Jonathan E. Pleim, Golam Sarwar, James Beidler, Russ Bullock, Jerold A. Herwehe, Rob Gilliam, Daiwen Kang, Christian Hogrefe, George Pouliot, and Hosein Foroutan
Geosci. Model Dev., 17, 7855–7866, https://doi.org/10.5194/gmd-17-7855-2024, https://doi.org/10.5194/gmd-17-7855-2024, 2024
Short summary
Short summary
This work describe how we linked the meteorological Model for Prediction Across Scales – Atmosphere (MPAS-A) with the Community Multiscale Air Quality (CMAQ) air quality model to form a coupled modelling system. This could be used to study air quality or climate and air quality interaction at a global scale. This new model scales well in high-performance computing environments and performs well with respect to ground surface networks in terms of ozone and PM2.5.
Christos I. Efstathiou, Elizabeth Adams, Carlie J. Coats, Robert Zelt, Mark Reed, John McGee, Kristen M. Foley, Fahim I. Sidi, David C. Wong, Steven Fine, and Saravanan Arunachalam
Geosci. Model Dev., 17, 7001–7027, https://doi.org/10.5194/gmd-17-7001-2024, https://doi.org/10.5194/gmd-17-7001-2024, 2024
Short summary
Short summary
We present a summary of enabling high-performance computing of the Community Multiscale Air Quality Model (CMAQ) – a state-of-the-science community multiscale air quality model – on two cloud computing platforms through documenting the technologies, model performance, scaling and relative merits. This may be a new paradigm for computationally intense future model applications. We initiated this work due to a need to leverage cloud computing advances and to ease the learning curve for new users.
Jiandong Wang, Jia Xing, Shuxiao Wang, Rohit Mathur, Jiaping Wang, Yuqiang Zhang, Chao Liu, Jonathan Pleim, Dian Ding, Xing Chang, Jingkun Jiang, Peng Zhao, Shovan Kumar Sahu, Yuzhi Jin, David C. Wong, and Jiming Hao
Atmos. Chem. Phys., 22, 5147–5156, https://doi.org/10.5194/acp-22-5147-2022, https://doi.org/10.5194/acp-22-5147-2022, 2022
Short summary
Short summary
Aerosols reduce surface solar radiation and change the photolysis rate and planetary boundary layer stability. In this study, the online coupled meteorological and chemistry model was used to explore the detailed pathway of how aerosol direct effects affect secondary inorganic aerosol. The effects through the dynamics pathway act as an equally or even more important route compared with the photolysis pathway in affecting secondary aerosol concentration in both summer and winter.
Amir H. Souri, Kelly Chance, Juseon Bak, Caroline R. Nowlan, Gonzalo González Abad, Yeonjin Jung, David C. Wong, Jingqiu Mao, and Xiong Liu
Atmos. Chem. Phys., 21, 18227–18245, https://doi.org/10.5194/acp-21-18227-2021, https://doi.org/10.5194/acp-21-18227-2021, 2021
Short summary
Short summary
The global pandemic is believed to have an impact on emissions of air pollutants such as nitrogen dioxide (NO2) and formaldehyde (HCHO). This study quantifies the changes in the amount of NOx and VOC emissions via state-of-the-art inverse modeling technique using satellite observations during the lockdown 2020 with respect to a baseline over Europe, which in turn, it permits unraveling atmospheric processes being responsible for ozone formation in a less cloudy month.
Kai Wang, Yang Zhang, Shaocai Yu, David C. Wong, Jonathan Pleim, Rohit Mathur, James T. Kelly, and Michelle Bell
Geosci. Model Dev., 14, 7189–7221, https://doi.org/10.5194/gmd-14-7189-2021, https://doi.org/10.5194/gmd-14-7189-2021, 2021
Short summary
Short summary
The two-way coupled WRF-CMAQ model accounting for complex chemistry–meteorology feedbacks has been applied to the long-term predictions of regional meteorology and air quality over the US. The model results show superior performance and importance of chemistry–meteorology feedbacks when compared to the offline coupled WRF and CMAQ simulations, which suggests that feedbacks should be considered along with other factors in developing future model applications to inform policy making.
K. Wyat Appel, Jesse O. Bash, Kathleen M. Fahey, Kristen M. Foley, Robert C. Gilliam, Christian Hogrefe, William T. Hutzell, Daiwen Kang, Rohit Mathur, Benjamin N. Murphy, Sergey L. Napelenok, Christopher G. Nolte, Jonathan E. Pleim, George A. Pouliot, Havala O. T. Pye, Limei Ran, Shawn J. Roselle, Golam Sarwar, Donna B. Schwede, Fahim I. Sidi, Tanya L. Spero, and David C. Wong
Geosci. Model Dev., 14, 2867–2897, https://doi.org/10.5194/gmd-14-2867-2021, https://doi.org/10.5194/gmd-14-2867-2021, 2021
Short summary
Short summary
This paper details the scientific updates in the recently released CMAQ version 5.3 (and v5.3.1) and also includes operational and diagnostic evaluations of CMAQv5.3.1 against observations and the previous version of the CMAQ (v5.2.1). This work was done to improve the underlying science in CMAQ. This article is used to inform the CMAQ modeling community of the updates to the modeling system and the expected change in model performance from these updates (versus the previous model version).
Cited articles
Appel, K. W., Bash, J. O., Fahey, K. M., Foley, K. M., Gilliam, R. C., Hogrefe, C., Hutzell, W. T., Kang, D., Mathur, R., Murphy, B. N., Napelenok, S. L., Nolte, C. G., Pleim, J. E., Pouliot, G. A., Pye, H. O. T., Ran, L., Roselle, S. J., Sarwar, G., Schwede, D. B., Sidi, F. I., Spero, T. L., and Wong, D. C.: The Community Multiscale Air Quality (CMAQ) model versions 5.3 and 5.3.1: system updates and evaluation, Geosci. Model Dev., 14, 2867–2897, https://doi.org/10.5194/gmd-14-2867-2021, 2021.
Burrows, M. and Wheeler, D. J.: A Block Sorting Data Compression Algorithm,
Tech. report, Digital Systems Research Center, Digital Equipment
Corporation, Palo Alto, CA,
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.37.6774 (last access: 30 September 2021), 1994.
Byun, D. and Schere, K. L.: Review of the governing equations, computational
algorithms, and other components of the Models-3 Community Multiscale Air
Quality (CMAQ) modeling system, Appl. Mech. Rev., 59, 51–77,
https://doi.org/10.1115/1.2128636, 2006.
CMAS: CMAQ Model Version 5.3 Input Data – 1/1/2016–12/31/2016 12 km CONUS, Dataverse [data set], https://doi.org/10.15139/S3/MHNUNE, 2023.
Deutsch, L. P.: DEFLATE compressed data format specification version 1.3,
Tech. Rep. IETF RFC1951, Internet Engineering Task Force, Menlo Park, CA,
USA, https://doi.org/10.17487/RFC1951, 1996.
EPA: Community Multiscale Air Quality Modeling System (CMAQ), Access CMAQ Source Code, EPA [code], https://www.epa.gov/cmaq/access-cmaq-source-code (last access: 31 January 2023), 2022.
Huffman, D. A.: A method for the construction of minimum redundancy codes,
Proceedings of the IRE, Vol. 40, no. 9, Sept. 1952, 1098–1101, https://doi.org/10.1109/JRPROC.1952.273898, 1952.
Kouznetsov, R.: A note on precision-preserving compression of scientific data, Geosci. Model Dev., 14, 377–389, https://doi.org/10.5194/gmd-14-377-2021, 2021.
Kryukov, K., Ueda, M. T., Nakagawa, S., and Imanishi, T.: Sequence
Compression Benchmark (SCB) database – A comprehensive evaluation of
reference-free compressors for FASTA-formatted sequences, GigaScience,
9, giaa072, https://doi.org/10.1093/gigascience/giaa072,
2020.
United States Environmental Protection Agency: The Community Multiscale Air
Quality Model version 5.3.1, Zenodo [Software], https://doi.org/10.5281/zenodo.3585898,
2019.
Wong, D.: Tool and python programs for the paper “The Impact of Altering Emission Data Precision on Compression Efficiency and Accuracy of Simulations of the Community Multiscale Air Quality Model”, Zenodo [data set], https://doi.org/10.5281/zenodo.6620983, 2022a.
Wong, D.: 2016 CMAQ v531 12 kim CONUS domain test dataset for 1/1 – 1/5/2016, Zenodo [data set], https://doi.org/10.5281/zenodo.6624164, 2022b.
Zender, C. S.: Bit Grooming: statistically accurate precision-preserving quantization with compression, evaluated in the netCDF Operators (NCO, v4.4.8+), Geosci. Model Dev., 9, 3199–3211, https://doi.org/10.5194/gmd-9-3199-2016, 2016.
Ziv, J. and Lempel, A.: A universal algorithm for sequential data
compression, IEEE Transactions on Information Theory, 23,
337–343, https://doi.org/10.1109/TIT.1977.1055714, 1977.
Short summary
A typical numerical simulation that associates with a large amount of input and output data, applying popular compression software, gzip or bzip2, on data is one good way to mitigate data storage burden. This article proposes a simple technique to alter input, output, or input and output by keeping a specific number of significant digits in data and demonstrates an enhancement in compression efficiency on the altered data but maintains similar statistical performance of the numerical simulation.
A typical numerical simulation that associates with a large amount of input and output data,...