<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article"><?xmltex \hack{\allowdisplaybreaks}?><?xmltex \bartext{Development and technical paper}?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-16-1179-2023</article-id><title-group><article-title>The impact of altering emission data precision on compression efficiency and accuracy of simulations of the community multiscale air quality model</article-title><alt-title>The impact of altering emission data precision</alt-title>
      </title-group><?xmltex \runningtitle{The impact of altering emission data precision}?><?xmltex \runningauthor{M. S. Walters and D. C. Wong}?>
      <contrib-group>
        <contrib contrib-type="author" equal-contrib="yes" corresp="no" rid="aff1 aff2">
          <name><surname>Walters</surname><given-names>Michael S.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" equal-contrib="yes" corresp="yes" rid="aff1">
          <name><surname>Wong</surname><given-names>David C.</given-names></name>
          <email>wong.david-c@epa.gov</email>
        </contrib>
        <aff id="aff1"><label>1</label><institution>Atmospheric and Environmental Systems Modeling Division,
Center for Environmental Measurement and Modeling,<?xmltex \hack{\break}?> Office of Research and
Development, US Environmental Protection Agency, Research Triangle Park,
NC, USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Oak Ridge Associated Universities, Oak Ridge, TN, USA</institution>
        </aff><author-comment content-type="econtrib"><p>These authors contributed equally to this work.</p></author-comment>
      </contrib-group>
      <author-notes><corresp id="corr1">David C. Wong (wong.david-c@epa.gov)</corresp></author-notes><pub-date><day>20</day><month>February</month><year>2023</year></pub-date>
      
      <volume>16</volume>
      <issue>4</issue>
      <fpage>1179</fpage><lpage>1190</lpage>
      <history>
        <date date-type="received"><day>18</day><month>March</month><year>2022</year></date>
           <date date-type="rev-request"><day>27</day><month>June</month><year>2022</year></date>
           <date date-type="rev-recd"><day>30</day><month>January</month><year>2023</year></date>
           <date date-type="accepted"><day>1</day><month>February</month><year>2023</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 Michael S. Walters</copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023.html">This article is available from https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e102">The Community Multiscale Air Quality (CMAQ) model has been a vital
tool for air quality research and management at the United States
Environmental Protection Agency (US EPA) and at government environmental
agencies and academic institutions worldwide. The CMAQ model requires a significant amount of disk space to store and archive input and output files. For example, an annual simulation over the contiguous United States (CONUS) with horizontal grid-cell spacing of 12 km requires 2–3 TB of input data and can produce anywhere from 7–45 TB of output data, depending on modeling configuration and desired post-processing of the output (e.g., for evaluations or graphics). After a simulation is complete, model data are archived for several years, or even decades, to ensure the replicability of conducted research. As a result, careful disk space management is essential to optimize resources and ensure the uninterrupted progress of ongoing research and applications requiring large-scale, air quality modeling. Proper disk-space management may include applying optimal data-compression techniques that are executed on input and output files for all CMAQ simulations. There are several (not limited to) such utilities that compress files using lossless compression, such as GNU Gzip (gzip) and Basic Leucine Zipper Domain (bzip2). A new approach is proposed in this study that reduces the precision of the emission input for air quality modeling to reduce storage requirements (after a lossless compression utility is applied) and accelerate runtime. The new approach is tested using CMAQ simulations and post-processed CMAQ output to examine the impact on the performance of the air quality model. In total, four simulations were conducted, and nine cases were post-processed from direct simulation output to determine disk-space efficiency, runtime efficiency, and model (predictive) accuracy. Three simulations were run with emission input containing only five, four, or three significant digits. To enhance the analysis of disk-space efficiency, the output from the altered precision emission CMAQ simulations were additionally post-processed to contain five, four, or three significant digits. The fourth, and final, simulation was run using the full precision emission files with no alteration. Thus, in total, 13 gridded products (4 simulations and 9 altered precision output cases) were analyzed in this study.</p>

      <p id="d1e105">Results demonstrate that the altered precision emission files reduced the
disk-space footprint by 6 %, 25 %, and 48 % compared to the
unaltered emission files when using the bzip2 compression utility for files
containing five, four, or three significant digits, respectively. Similarly, the altered output files reduced the required disk space by 19 %, 47 %, and 69 % compared to the unaltered CMAQ output files when using the bzip2 compression utility for files containing five, four, or three significant digits, respectively. For both compressed datasets, bzip2 performed better than gzip, in terms of compression size, by 5 %–27 % for
emission data and 15 %–28 % for CMAQ output for files containing five, four, or three significant digits. Additionally, CMAQ runtime was reduced by 2 %–7 % for simulations using emission files with<?pagebreak page1180?> reduced precision data in a non-dedicated environment. Finally, the model-estimated pollutant concentrations from the four simulations were compared to observed data from the US EPA Air Quality System (AQS) and the Ammonia Monitoring Network (AMoN). Model performance statistics were impacted negligibly. In summary, by reducing the precision of CMAQ emission data to five, four, or three significant digits, the simulation runtime in a non-dedicated environment was slightly reduced, disk-space usage was substantially reduced, and model accuracy remained relatively unchanged compared to the base CMAQ simulation, which suggests that the precision of the emission data could be reduced to more efficiently use computing resources while minimizing the impact on CMAQ simulations.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e117">The Community Multiscale Air Quality (CMAQ) model (Byun and Schere, 2006) is
a sophisticated, 3D Eulerian (gridded) numerical modeling system based on message passing interface (MPI) that uses scientific first
principles to simulate the chemical transformation and transport of ozone,
particulate matter, toxic compounds, and acid deposition. Since the
formation and transformation of chemical species are functions of complex
atmospheric and chemical interactions, two primary input types are required
to initialize CMAQ simulations: meteorology and emissions. First,
meteorological data (such as temperature, wind, cloud formation, and
precipitation rate) provide atmospheric conditions to drive CMAQ. The second
required input field, which is the focal point of this study, is emission
data (i.e., emission rates from emission sources) that characterize
pollutants from both man-made and naturally occurring sources.</p>
      <p id="d1e120">The CMAQ model typically requires multiple emission datasets which occupy a
significant amount of disk space. Although disk space is becoming
progressively cheaper and more affordable, the research and computational
needs are rapidly increasing and becoming more complex. For instance, the
total sizes of emission and meteorological datasets are about 7.0 and 6.8 GB, respectively, for a 1 d CMAQ simulation for the contiguous United States (CONUS) with a horizontal resolution of 12 km. The total disk-space size for 1 d of output is 20 GB (for a typical output configuration considering only surface output and neglecting extra diagnostic output). Including 3D fields and diagnostic output, however, the total output disk-space size can easily be tripled. Most studies with CMAQ on this scale create at least a full year's worth of data, so aggressive disk-space management is justifiable to minimize overall costs associated with running CMAQ. Aggressive disk-space management could be a substantial cost-saving measure, regardless of whether simulations are conducted on-site (such as with a high-performance computing architecture or a Linux cluster) or by using cloud computing, where data retrievals can quickly elevate costs. Here, we propose optimizing disk space by compressing CMAQ emission datasets as one practical
consideration to maximize storage capacity. If successful, this option could
be extended to other input types with large disk-space needs, such as
meteorological data.</p>
      <p id="d1e123">Compression algorithms can be described as either lossless or lossy.
Lossless compression algorithms reduce disk space by replacing repeated
sequences with a smaller, unique identifier. Thus, an entire dataset can be
retrieved, once uncompressed, without alteration of the original dataset
(hence the name, lossless). Lossy algorithms, however, in terms of numeric
arrays, reduce disk space by manipulating the mantissa of individual
floating-point numbers. Typically, trailing, or insignificant bits, are
replaced with a sequence of zeros or ones. As a result, data are compressed
at the cost of numerical inconsistencies between the original dataset and the
compressed dataset.</p>
      <p id="d1e126">The concept of maximizing disk space by altering netCDF datasets has been
examined previously by Zender (2016) and Kouznetsov (2021). Zender (2016)
created a versatile toolset that compresses data based on user
specifications that are applied to the mantissa of floating-point datasets.
The first notable algorithm developed by Zender (2016) is
precision trimming, which is publicly available in the netCDF operators
(NCOs, <uri>http://nco.sourceforge.net/nco.html</uri>, last access: 11 April 2022) utility. Precision trimming sets all non-significant bits to zero (bit shaving) which, based on analysis,
produces an undesirable bias of the compressed data (Zender, 2016). As a
result, Zender (2016) introduced a Bit Grooming algorithm (default algorithm
in the NCO) that shaves (to zero) and sets (to one) the least significant bits of consecutive values. Despite the additional toolset, Kouznetsov (2021) found substantial artifacts, or numerical inconsistencies, in multipoint statistics caused by Bit Grooming. Due to the suboptimal results, Kouznetsov (2021) developed and evaluated multiple lossy compression algorithms with respect to the NCO's available toolsets from Zender (2016). Kouznetsov (2021) created a round and half-shaved lossy compression algorithm which both doubled compression accuracy by rounding the mantissa to the nearest value that has zero tail bits and by setting all tail bits to zero, except for the most significant bit which gets set to one (Kouznetsov, 2021).</p>
      <p id="d1e133">Excluding analyses conducted on datasets via lossy compression algorithms,
the authors are unaware of any studies that have been conducted on the
compression efficiency of floating-point datasets with respect to <inline-formula><mml:math id="M1" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>
significant digits. Additionally, Zender (2016) and Kouznetsov (2021) did
not conduct evaluations regarding the impact of altered precision datasets
on numerical simulations. In this study, the precision of netCDF datasets will be reduced and compressed to explore compression efficiency, and the
resultant reduced precision datasets will be used to run CMAQ simulations to
quantify the impacts on runtime and on model accuracy as a<?pagebreak page1181?> result of dataset
manipulation via a lossy compression algorithm. This study proceeds as
follows: in Sect. 2, a description of the methodology will be provided,
followed by results in Sect. 3 and then the conclusions in Sect. 4.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methodology</title>
      <p id="d1e151">All input and output files in this study are 32-bit, binary, netCDF files
which inherently contain seven or eight significant digits at most. To
perform this study, we created a simple tool written in Fortran to truncate
floating-point data in netCDF files by keeping <inline-formula><mml:math id="M2" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> significant digits which are normalized in scientific notation. Table 1 shows several examples of this
numerical manipulation. We applied this tool to alter the precision of two
different datasets (input emission and CMAQ model output) by keeping <inline-formula><mml:math id="M3" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>
significant digits.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e171">Examples of precision-reducing transformations of floating points from their original forms (first column) to their altered precision forms
(second to fourth column).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Original (orig)</oasis:entry>
         <oasis:entry colname="col2">Altered 5 (A05)</oasis:entry>
         <oasis:entry colname="col3">Altered 4 (A04)</oasis:entry>
         <oasis:entry colname="col4">Altered 3 (A03)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0.005666635</oasis:entry>
         <oasis:entry colname="col2">0.0056666</oasis:entry>
         <oasis:entry colname="col3">0.005667</oasis:entry>
         <oasis:entry colname="col4">0.00567</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.437405</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.4374</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.437</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.44</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">0.0005319762</oasis:entry>
         <oasis:entry colname="col2">0.00053198</oasis:entry>
         <oasis:entry colname="col3">0.000532</oasis:entry>
         <oasis:entry colname="col4">0.000532</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.437</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.437</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.437</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.44</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">100 150.0</oasis:entry>
         <oasis:entry colname="col2">100 150.0</oasis:entry>
         <oasis:entry colname="col3">100 200.0</oasis:entry>
         <oasis:entry colname="col4">100 000.0</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e412">For this study, CMAQ v5.3.1 (USEPA, 2019; Appel et al., 2021) was run with
459 columns, 299 rows, and 35 vertical layers with a horizontal grid-scale
resolution of 12 km (Fig. 1a). Emission input files consist of two area
sources and nine point sources (hourly). The area-source emission files
contain 57 and 62 variables, and the point-source files contain anywhere
from 54 to 58 variables (containing one vertical layer). Ten CMAQ output
files (nine of them are hourly) were generated in this study: three output
files were generated for simulation-restart purposes (SOILOUT, CGRID which
contains only 1 h data, and MEDIA), two files contained average
(APMDIAG and ACONC) and hourly (CONC) species concentrations, three files
held wet deposition (WETDEP1; 140 variables), dry deposition (DRYDEP; 174
variables), and deposition velocity (DEPV; 104 variables) output, and
lastly, the final file contained biogenic emission diagnostic output (B3GTS).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e418">Regions for spatial and temporal stratification <bold>(a)</bold>, AQS stations <bold>(b)</bold>, and AMON stations <bold>(c)</bold> for the proceeding evaluation.</p></caption>
        <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f01.png"/>

      </fig>

      <p id="d1e436">In total, we conducted four annual CMAQ simulations for 2016: one with
unaltered emission data (simulation orig) and three with altered precision emission data by setting <inline-formula><mml:math id="M12" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> to five (A05), four (A04), and three (A03) for all emission input files (gridded_no_rwc, gridded_rwc, ptnonipm, ptegu, ptagfire, ptfire,
ptfire_othna, pt_oilgas, cmv_c3_12, cmv_c1c2_12, and othpt)
utilized by CMAQ for this study. On the output side, direct CMAQ outputs
(ACONC, APMDIAG, DRYDEP, and WETDEP1) from the A05, A04, and A03 (in which A0n signifies an altered simulation which utilized altered precision emission data to <inline-formula><mml:math id="M13" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> significant digits) simulations were similarly altered to possess five, four, or three significant digits (denoted as FX05, FX04, and FX03, respectively, in which FX0n signifies an altered precision case which was post-processed by an A0n simulation's CMAQ output). Emission input and CMAQ output data were then compressed separately by gzip (GNU Gzip, <uri>https://www.gnu.org/software/gzip</uri>, last access: 11 April 2022)
and bzip2 (<uri>https://www.sourceware.org/bzip2</uri>, last access: 11 April 2022) for all simulations and cases to determine compression efficiency in terms of the reduction of disk space. In summary, there are four separate simulations (called orig or abbreviated as A0n) and nine additional, altered precision output cases (abbreviated as FX0n). For example, a CMAQ simulation that was run with emission data that were processed with <inline-formula><mml:math id="M14" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> equals five significant digits, then post-processed to possess three significant digits, is denoted as A05FX03 (see Table 2 for a full list of simulations and cases).</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e469">Setup of all simulations (orig, A05, A04, and A03) and cases analyzed in this study.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left" colsep="1"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left" colsep="1"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:colspec colnum="8" colname="col8" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col2" align="center">Unaltered emission data </oasis:entry>
         <oasis:entry namest="col3" nameend="col8" align="center">Altered precision emission data </oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(a)</oasis:entry>
         <oasis:entry colname="col2">Simulation: orig</oasis:entry>
         <oasis:entry colname="col3">(b)</oasis:entry>
         <oasis:entry colname="col4">Simulation: A05</oasis:entry>
         <oasis:entry colname="col5">(c)</oasis:entry>
         <oasis:entry colname="col6">Simulation: A04</oasis:entry>
         <oasis:entry colname="col7">(d)</oasis:entry>
         <oasis:entry colname="col8">Simulation: A03</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" align="center"/>
         <oasis:entry rowsep="1" namest="col3" nameend="col8" align="center">Altered precision CMAQ output </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" align="center"/>
         <oasis:entry colname="col3">(e)</oasis:entry>
         <oasis:entry colname="col4">Case: A05FX05</oasis:entry>
         <oasis:entry colname="col5">(h)</oasis:entry>
         <oasis:entry colname="col6">Case: A04FX05</oasis:entry>
         <oasis:entry colname="col7">(k)</oasis:entry>
         <oasis:entry colname="col8">Case: A03FX05</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" align="center"/>
         <oasis:entry colname="col3">(f)</oasis:entry>
         <oasis:entry colname="col4">Case: A05FX04</oasis:entry>
         <oasis:entry colname="col5">(i)</oasis:entry>
         <oasis:entry colname="col6">Case: A04FX04</oasis:entry>
         <oasis:entry colname="col7">(l)</oasis:entry>
         <oasis:entry colname="col8">Case: A03FX04</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" align="center"/>
         <oasis:entry colname="col3">(g)</oasis:entry>
         <oasis:entry colname="col4">Case: A05FX03</oasis:entry>
         <oasis:entry colname="col5">(j)</oasis:entry>
         <oasis:entry colname="col6">Case: A04FX03</oasis:entry>
         <oasis:entry colname="col7">(m)</oasis:entry>
         <oasis:entry colname="col8">Case: A03FX03</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e617">Simulated numerical, or predictive, accuracy was analyzed against
concentrations of particulate matter with diameter less than 2.5 <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m (PM<inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, ozone (O<inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, ammonia (NH<inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, the wet-deposition rates of sodium (Na), ammonium (NH<inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, chlorine (Cl), nitrate
(NO<inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, sulfate (SO<inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and the dry-deposition rate of O<inline-formula><mml:math id="M22" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> for
all simulations and cases. PM<inline-formula><mml:math id="M23" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> and O<inline-formula><mml:math id="M24" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> were evaluated at in situ
stations from the dataset of the United States Environmental Protection Agency's (US EPA) Air Quality System (AQS; Fig. 1b). Ammonia (NH<inline-formula><mml:math id="M25" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>) was evaluated at in situ stations utilizing observations from the Ammonia Monitoring Network (AMON; Fig. 1c). Hourly observations of O<inline-formula><mml:math id="M26" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> were processed to calculate the maximum 8 h daily average concentrations (MDA8) and paired in space and time with calculated MDA8 O<inline-formula><mml:math id="M27" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> from post-processed CMAQ output. Likewise, daily averaged PM<inline-formula><mml:math id="M28" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observations and 2-week-averaged NH<inline-formula><mml:math id="M29" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> observations were used to evaluate CMAQ. Observed values are paired with the volume-averaged pollutant estimate from CMAQ's surface layer's grid cell containing the air quality monitoring site (i.e., nearest neighbor). Statistical metrics were also calculated by pairing gridded values from the orig simulation (considered observed values) and the altered precision simulations and cases (considered the predicted values). Tabulated
statistical metrics for grid–grid pairing was computed by taking the mean
of hourly, statistical metrics.</p>
      <?pagebreak page1182?><p id="d1e774">Typical statistical metrics including mean bias (MB), correlation
coefficient (<inline-formula><mml:math id="M30" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>), root mean square error (RMSE), and normalized mean bias
(NMB) are used to evaluate all chemical species in this analysis at
different temporal intervals and for different pairing methodologies (either
grid–point or grid–grid) which includes regional stratification (based on
regions from Fig. 1a) for several figures. The utilized statistical metrics
are denoted below in Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) through Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>):<?xmltex \hack{\newpage}?><?xmltex \hack{\noindent}?>

              <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M31" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E1"><mml:mtd><mml:mtext>1</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="normal">MB</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:mo>⋅</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E2"><mml:mtd><mml:mtext>2</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mi>N</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>⋅</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mfenced open="(" close=")"><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>X</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mo>⋅</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>Y</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:mo>⋅</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>Y</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E3"><mml:mtd><mml:mtext>3</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="normal">RMSE</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle></mml:msqrt><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="normal">NMB</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>⋅</mml:mo><mml:mn mathvariant="normal">100</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">%</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M32" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> is the total number of observed and predicted pairs, <inline-formula><mml:math id="M33" display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> is the
observed value, <inline-formula><mml:math id="M34" display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula> is predicted value, <inline-formula><mml:math id="M35" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> is the sample standard
deviation of a distribution, and the overbars in Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>) refer to the
sample mean of a distribution. Although many compression toolsets exist and
optimization is dependent on multiple factors (Kryukov et al., 2020), gzip
and bzip2 are the most public, reliable, and widely used compressors. Both
utilities are lossless compression algorithms which are available for Linux
users. In terms of functionality, gzip uses a compression algorithm called
Deflate (Deutsch, 1996) which reduces sequences of datasets by incorporating
a combination of LZ77 dictionary coding (Ziv and Lempel, 1977) and Huffman
entropy coding (Huffman, 1952). In comparison, bzip2 uses the
Burrows–Wheeler (Burrows and Wheeler, 1994) algorithm which sorts all
possible rotations of an input lexically and forms an output by
concatenating the last character from the sorted list. In terms of
compression ratio, bzip2 is notably better than gzip, however, with respect
to compression speed, gzip is significantly faster than bzip2. Due to their
availability and efficiency, both gzip and bzip2 are utilized in this study
(default settings).</p>
</sec>
<?pagebreak page1183?><sec id="Ch1.S3">
  <label>3</label><title>Results</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Data storage</title>
      <p id="d1e1093">The CMAQ input and output data are stored for future analyses and to ensure the reproducibility of modeling studies which demands a tremendous amount of
disk space for input and output files. Therefore, we propose easing the
disk-space burden by utilizing efficient compression algorithms. For this
section of the analysis, two popular, reliable, and efficient compression
utilities, gzip and bzip2, were utilized to determine compression efficiency
with respect to emission input (emissions mentioned in Sect. 2.) files and
CMAQ output (mentioned in Sect. 2. including CGRID, CONC, and SOILOUT)
files. Both compression utilities were applied daily to compress emission
input and CMAQ output files throughout the entirety of the 2016 simulation
(Fig. 2).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e1098">Relative compression size of two utilities, gzip (solid line) and
bzip2 (dotted line), on daily emission files (labeled as Emiss.) and direct
CMAQ output (labeled as CMAQ) for 2016 with reduced precision settings: 5,
4, and 3 (labeled as Altered 05, Altered 04, and Altered 03, respectively).
Negative values indicate better compression efficiency.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f02.png"/>

        </fig>

      <p id="d1e1107">The gzip compression utility reduced the file sizes, on average by 1 %, 5 %, and 21 %. This translates into about 5, 26, and 111 GB
actual difference between the compressed orig case and the compressed A05, A04, and A03 emission datasets for the entire year of 2016, respectively. The reduction in file size (using gzip) was more substantial when applied to
reduced precision CMAQ output, with an average reduction in file size of 4 %, 19 %, and 67 %. This means about 167, 839, and 2016 GB
actual difference between the orig case and FX05, FX04, and FX03, respectively for the entire
year. With the bzip2 utility, the reduction in magnitude is much larger than
with gzip, with an average reduction of file size equal to 6 %, 25 %,
and 48 % (actual differences are about 27, 126, and 241 GB,
respectively for A05, A04, and A03 emission files and 19 %, 47 %, and 69 %
(actual differences are about 856, 2142, and 3115 GB, respectively)
for the compressed CMAQ output. Thus, bzip2 is found to be a more effective
tool than gzip by roughly 5 %, 20 %, and 27 % for emission data and
15 %, 28 % and 23 % for CMAQ output, for reduced precision by
keeping 5, 4, and 3 significant digits (reduced precision emissions and
reduced precision output data), respectively.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Runtime</title>
      <p id="d1e1118">We examined daily runtime (captured by an MPI function called
MPI_WTIME) for CMAQ using emission data prepared with
truncations of A05, A04, and A03 compared with running CMAQ with unaltered (<italic>orig</italic>)
emission data (Fig. 3). Even though the simulations were not performed in a
dedicated environment (results are not entirely consistent due to the
allocation of resources when the simulations were initialized), the daily
runtimes for A05, A04, and A03 were lower than the runtime of the orig simulation in most of the days.<?pagebreak page1184?> The total runtimes for the A05, A04, and A03 simulations were 3.13, 2.94, and 12.84 h faster than the orig case (2 %, 2 %, and 7 %, respectively of the relative reduction of runtime). There are two possible explanations for such behavior: first, during the execution of each case, CMAQ competed for <inline-formula><mml:math id="M36" display="inline"><mml:mrow class="chem"><mml:mi mathvariant="normal">I</mml:mi><mml:mo>/</mml:mo><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:math></inline-formula> resources with other tasks on the system. As a result, an <inline-formula><mml:math id="M37" display="inline"><mml:mrow class="chem"><mml:mi mathvariant="normal">I</mml:mi><mml:mo>/</mml:mo><mml:mi mathvariant="normal">O</mml:mi></mml:mrow></mml:math></inline-formula> bottleneck could explain spikes in relative runtime on certain simulation days (Fig. 3). Second, a change in emission input (due to the reduced precision emission data) could alter the pathway for the aerosol dynamics calculation.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e1150">Relative daily runtime with respect to different adjusted
emission input for the A03, A04, and A05 simulations for 2016.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f03.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Accuracy</title>
      <p id="d1e1167">The accuracy of each case is first examined grid-to-point between modeled
output and in situ observations (Fig. 1; AQS and AMON) for all available
model–measurement pairs throughout 2016. In general, to gauge the accuracy
of CMAQ, bulk statistical metrics of bias, NMB, <inline-formula><mml:math id="M38" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>, and RMSE have been
provided in Table 3 for the orig simulation. To compare bulk statistical metrics to the orig simulation, the absolute difference in bias, RMSE, minima (minimum difference between all model and observation pairs), and maxima was
calculated with respect to the altered simulations and cases for daily
PM<inline-formula><mml:math id="M39" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, MDA8 O<inline-formula><mml:math id="M40" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and 2-week-averaged NH<inline-formula><mml:math id="M41" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>. Overall,
negligible differences are apparent (Fig. 4). For example, the maximum
absolute, bulk statistical difference between the orig simulation and the
altered cases and simulations for daily PM<inline-formula><mml:math id="M42" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, MDA8 O<inline-formula><mml:math id="M43" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and
2-week-averaged NH<inline-formula><mml:math id="M44" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> did not exceed <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.4</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.6</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mn mathvariant="normal">5.3</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M49" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M50" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> or <inline-formula><mml:math id="M51" display="inline"><mml:mi mathvariant="normal">ppb</mml:mi></mml:math></inline-formula> for bias,
RMSE, minima, and maxima, respectively. Therefore, differences in terms of
maximum absolute, bulk statistical differences are quite small amongst the
unaltered simulation (orig) and the altered simulations and cases.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3" specific-use="star"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e1335">Annual bulk statistical metrics for all grid–point pairs for the
unaltered simulation (orig) binned by species (row) and statistic (column).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Case</oasis:entry>
         <oasis:entry colname="col2">Bias</oasis:entry>
         <oasis:entry colname="col3">NMB (%)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M52" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">RMSE</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">PM<inline-formula><mml:math id="M53" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> (<inline-formula><mml:math id="M54" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M55" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.02828948</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.37369379</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.53041275</oasis:entry>
         <oasis:entry colname="col5">5.01579136</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MAD8 O<inline-formula><mml:math id="M58" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> (ppb)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.70888518</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.07590175</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.76393761</oasis:entry>
         <oasis:entry colname="col5">7.93497772</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NH<inline-formula><mml:math id="M61" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> (<inline-formula><mml:math id="M62" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M63" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.42796669</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">35.05920328</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.51400293</oasis:entry>
         <oasis:entry colname="col5">1.28576807</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?xmltex \floatpos{p}?><fig id="Ch1.F4"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e1554">Absolute differences in bulk statistical metrics for daily
PM<inline-formula><mml:math id="M66" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, MDA8 O<inline-formula><mml:math id="M67" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and 2-week-averaged NH<inline-formula><mml:math id="M68" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> between the orig
simulation and the altered simulations and cases. Bulk statistical metrics
were calculated for all model–observation pairs at in situ stations.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f04.png"/>

        </fig>

      <?xmltex \floatpos{p}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e1593">Stacked bar plots of RMSE (<inline-formula><mml:math id="M69" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) stratified by region (color),
simulation and case (<inline-formula><mml:math id="M70" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis), and season (subplot) for daily PM<inline-formula><mml:math id="M71" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>,
MDA8 O<inline-formula><mml:math id="M72" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and 2-week-averaged NH<inline-formula><mml:math id="M73" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> calculated from in situ
observation.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f05.png"/>

        </fig>

      <p id="d1e1643">Bulk statistical results with respect to in situ observations and compared
to the orig simulation (Fig. 4) are encouraging; differences are small, ignoring regional or temporal stratification. To determine if statistical results fluctuate spatially (by region) and or temporally (by season), RMSE was computed for nine different subregions (regions are portrayed in Fig. 1)
across the United States for four seasons (winter, spring, summer, and fall)
from the mentioned observation and model pairs. Each region's RMSE was
stacked together, by simulation and case, and plotted as “accumulated RMSE”
by species. Likewise, results are negligible for daily PM<inline-formula><mml:math id="M74" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, MDA8
O<inline-formula><mml:math id="M75" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and 2-week-averaged NH<inline-formula><mml:math id="M76" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, respectively (Fig. 5) for all
regional and temporal stratifications and for all simulations and cases.</p>
      <p id="d1e1673">Results indicate that all simulations and cases have negligible differences
in terms of bulk statistical metrics across the United States and considering
regional and temporal stratifications. Statistical results conducted on in
situ observations were redone (methodologically) at the grid level for
hourly PM<inline-formula><mml:math id="M77" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M78" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> and NH<inline-formula><mml:math id="M79" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, using the orig simulation (as the
observed field) with respect to the altered precision simulations and cases
(predicted fields). The RMSE was first calculated for all hourly grid–grid
pairs for PM<inline-formula><mml:math id="M80" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M81" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M82" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>. Only cells that are within each
region (Fig. 1a), within the contiguous US, were used to calculate hourly
RMSE for all available regional pairs. Next, the average, hourly RMSE was
calculated for each season and region based on spatial and temporal masking
using the regions portrayed in Fig. 1a. All stratifications were grouped
together as accumulative, stacked bar plots for different seasons by
simulation or case. Although differences are evident (Fig. 6), the scale of
such differences is quite small. For example, the total accumulative RMSE
for PM<inline-formula><mml:math id="M83" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M84" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M85" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> (sum of all region's RMSE) did not
exceed 0.04 <inline-formula><mml:math id="M86" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M87" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 0.3 ppbV, and 0.05 ppbV, respectively for all cases and for all seasons.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e1780">Stacked bar plots of changes to RMSE (<inline-formula><mml:math id="M88" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) stratified by region
(color), simulation and case (<inline-formula><mml:math id="M89" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis), and season (subplot) for hourly
PM<inline-formula><mml:math id="M90" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M91" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M92" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> calculated from grid–grid pairs with
respect to the orig simulation.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f06.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e1832">Maximum absolute bias (versus the orig simulation) for PM<inline-formula><mml:math id="M93" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> calculated from hourly output for all simulations and cases.</p></caption>
          <?xmltex \igopts{width=455.244094pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f07.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><?xmltex \def\figurename{Figure}?><label>Figure 8</label><caption><p id="d1e1853">Maximum absolute bias (versus the orig simulation) for O<inline-formula><mml:math id="M94" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> calculated from hourly output for all simulations and cases.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f08.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9" specific-use="star"><?xmltex \currentcnt{9}?><?xmltex \def\figurename{Figure}?><label>Figure 9</label><caption><p id="d1e1873">Maximum absolute bias (versus the orig simulation) for NH<inline-formula><mml:math id="M95" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>
calculated from hourly output for all simulations and cases.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f09.png"/>

        </fig>

      <?pagebreak page1185?><p id="d1e1891">Additionally, the maximum absolute bias for all grid cells was determined
spatially between the orig simulation and the altered simulations and cases
throughout 2016 for PM<inline-formula><mml:math id="M96" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M97" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M98" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> from gridded, hourly
(CMAQ) output. For PM<inline-formula><mml:math id="M99" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, all simulations and cases performed similarly,
in which no visual differences are apparent (Fig. 7). For O<inline-formula><mml:math id="M100" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> (Fig. 8)
and NH<inline-formula><mml:math id="M101" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> (Fig. 9), however, the differences become relatively large for
cases <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>. In fact, for both species, spatial and magnitude error visibly
increase with fewer significant digits (simulations and cases). For
example, the maximum absolute bias is largest for the A03 simulations and even worse for the FX03 altered precision cases, ignoring the artifact of error across the Northeastern United States for O<inline-formula><mml:math id="M103" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> for the A05 simulations and cases (induced by the A05 simulation). The maximum absolute bias ranges, found by taking the range of all altered precision cases, for PM<inline-formula><mml:math id="M104" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M105" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M106" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> are 46.77 <inline-formula><mml:math id="M107" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M108" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 0.4265 ppbV, and 18.78 ppbV, respectively (Table 4). The minimum absolute bias ranges for PM<inline-formula><mml:math id="M109" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M110" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M111" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> are 5.573 <inline-formula><mml:math id="M112" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M113" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 0.5091 ppbV, 9.778 ppbV (Table 4), respectively. Based on range, error can potentially be quite large compared to the statistics provided in Fig. 6, however, large-scale error is not persistent based on the small accumulated RMSE for all regions grouped by CMAQ simulation and case (Fig. 6). For example, for PM<inline-formula><mml:math id="M114" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, the maximum
positive bias was (roughly) between 41 and 51 <inline-formula><mml:math id="M115" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M116" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> for the FX03 cases (Table 4). Upon further investigation, this relatively significant error occurred at one grid cell because of an anomalous wildfire (Pioneer wildfire in Idaho from July to September of 2016). Prior to the onset of the Pioneer wildfire and after the wildfire was extinguished, PM<inline-formula><mml:math id="M117" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> returned to normal levels with respect to the orig simulation for FX03 cases. Regardless, total accumulated values did not exceed 0.04 <inline-formula><mml:math id="M118" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M119" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 0.3 ppbV,
and 0.05 ppbV for PM<inline-formula><mml:math id="M120" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M121" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M122" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> respectively. Since
errors associated with Figs. 7–9 are predominately small (maximum absolute
bias), relatively large error (similar to the discrepancies in bias for
PM<inline-formula><mml:math id="M123" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> for the FX03 cases) is associated with brief spikes of certain
species within and around source regions.</p>

      <?xmltex \floatpos{p}?><fig id="Ch1.F10"><?xmltex \currentcnt{10}?><?xmltex \def\figurename{Figure}?><label>Figure 10</label><caption><p id="d1e2163">Total absolute bias difference between the orig simulation and the
altered cases and simulations by deposition rate (row) throughout 2016
utilizing hourly output.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/16/1179/2023/gmd-16-1179-2023-f10.png"/>

        </fig>

<?xmltex \floatpos{p}?><table-wrap id="Ch1.T4" specific-use="star"><?xmltex \currentcnt{4}?><label>Table 4</label><caption><p id="d1e2176">Maximum and minimum biases (altered – orig) calculated from hourly CMAQ output for all simulations and cases with respect to the orig simulation across all grid cells.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right" colsep="1"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right" colsep="1"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Case</oasis:entry>
         <oasis:entry rowsep="1" namest="col2" nameend="col3" align="center">PM<inline-formula><mml:math id="M124" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> (<inline-formula><mml:math id="M125" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M126" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) </oasis:entry>
         <oasis:entry rowsep="1" namest="col4" nameend="col5" align="center">Ozone (ppbV) </oasis:entry>
         <oasis:entry rowsep="1" namest="col6" nameend="col7" align="center">Ammonia (ppbV) </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Max.</oasis:entry>
         <oasis:entry colname="col3">Min.</oasis:entry>
         <oasis:entry colname="col4">Max.</oasis:entry>
         <oasis:entry colname="col5">Min.</oasis:entry>
         <oasis:entry colname="col6">Max.</oasis:entry>
         <oasis:entry colname="col7">Min.</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">A05FX05</oasis:entry>
         <oasis:entry colname="col2">4.40819836</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.69252777</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.260878</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.08337</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.893507</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.61453</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A05FX04</oasis:entry>
         <oasis:entry colname="col2">4.40777397</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.69240379</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.263882</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.08437</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.893806</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.01074</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A05FX03</oasis:entry>
         <oasis:entry colname="col2">51.17382812</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">9.62011719</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.499962</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.50085</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">19.64355</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.86621</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A05</oasis:entry>
         <oasis:entry colname="col2">4.40821075</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.69258881</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.260483</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.08343</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.893517</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.61455</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A04FX05</oasis:entry>
         <oasis:entry colname="col2">4.99303246</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.70221233</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.136284</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.16548</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.875244</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.14815</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A04FX04</oasis:entry>
         <oasis:entry colname="col2">4.99263382</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.70223236</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.136154</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.16448</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">1.275146</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.01074</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A04FX03</oasis:entry>
         <oasis:entry colname="col2">51.1640625</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">9.51953125</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.503494</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.50282</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">19.64355</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5.02832</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A04</oasis:entry>
         <oasis:entry colname="col2">4.99302673</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4.70224953</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.13604</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.16512</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.867432</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.14854</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A03FX05</oasis:entry>
         <oasis:entry colname="col2">11.09228516</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6.66992188</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.223785</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.22272</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">4.146118</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">7.44141</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A03FX04</oasis:entry>
         <oasis:entry colname="col2">11.54589844</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">10.265625</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.225784</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.22272</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">4.446045</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">7.0415</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A03FX03</oasis:entry>
         <oasis:entry colname="col2">41.18359375</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">9.46972656</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.562561</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.59249</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">19.64355</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">10.3923</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">A03</oasis:entry>
         <oasis:entry colname="col2">11.17675781</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">7.01953125</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">0.224041</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.22235</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">4.187866</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">7.47461</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?pagebreak page1188?><p id="d1e2863">The final aspect of this evaluation explores differences of important
deposition rates using bar plots which depict the sum of hourly absolute
differences (for all cells across the domain) between the orig simulation and
the altered simulations and cases. Bar plots were created for the wet-deposition rates of sodium (Na), ammonium (NH<inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, chlorine (Cl), nitrate
(NO<inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, sulfate (SO<inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and the dry-deposition rate of O<inline-formula><mml:math id="M166" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> for
all altered precision simulations and cases. For all deposition rates, all 3 cases, A05FX03, A04FX03, and A03FX03, performed equally poor, relatively speaking, with respect to the orig simulation. The A05FX04, A04FX04, and A03FX04 cases performed nearly identically to the A05FX05,
A04FX05, and A03FX05 cases for all deposition rates, excluding the wet-deposition rate of sodium and sulfate and the dry-deposition rate of ozone. The altered precision 5 cases ( A05FX05, A04FX05, and A03FX05) and the altered simulations (A05, A04, and A03) performed nearly identically to the orig simulation for all deposition rates. Overall, considering that each bar plot in Fig. 10 represents the sum of all hourly differences across the entire domain, all species, simulations, and cases performed similarly with respect to the orig case, and hence, amongst each other. For comparison purposes, the annual sum, considering all grid cells within the contiguous United States, for the wet-deposition rates of sodium,
ammonium, chlorine, nitrate, and sulfate are <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.42</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:mn mathvariant="normal">6.69</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:mn mathvariant="normal">21.75</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:mn mathvariant="normal">2.58</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, and
<inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.72</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> kg ha<inline-formula><mml:math id="M172" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, respectively for the base
simulation (orig). Similarly, the annual sum for the dry-deposition rate of
ozone (contiguous United States) is <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:mn mathvariant="normal">2.78</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">6</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> kg ha<inline-formula><mml:math id="M174" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>
for the base simulation.</p>
      <p id="d1e3027">No error accumulation due to the non-systematic changes in model inputs
(changing precision introduces both positive and negative changes in a
spatially and temporally random manner) can occur over the course of the
annual simulation for chemical species of interest such as O<inline-formula><mml:math id="M175" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> and
PM<inline-formula><mml:math id="M176" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>. Their lifetimes are much shorter than a year, i.e., their
simulated budgets within the continental-scale modeling domain are
repeatedly exchanged through transport, emissions, and chemical and physical
sinks. All simulations (orig, A05,<?pagebreak page1189?> A04, and A03) are numerically stable (no compounding error over time).</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d1e3058">We have demonstrated that altering data by keeping a specified number of
significant digits in terms of emission input and/or simulated output,
increased compression efficiency based on two different, popular compression
utilities (gzip and bzip2). For emission data, bzip2 performed far better
than gzip and provided compression reduction, on average, by 6 %, 25 %, and 48 %, and 19 %, 47 %, and 69 % for
output data for the A05, A04, and A03 cases, respectively, compared to the orig case. In terms of daily simulation runtime for the entire simulation year, the A05, A04, and A03 simulations were faster than the orig simulation in an undedicated HPC system for most simulation days.</p>
      <p id="d1e3061">As for accuracy, results for all studied simulations, either with
altered precision emission only, or with altered precision emission plus
altered precision output, produced numerically insignificant differences.
For example, the maximum absolute, bulk statistical difference between the
orig simulation and the altered cases and simulations for daily PM<inline-formula><mml:math id="M177" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, MDA8 O<inline-formula><mml:math id="M178" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and 2-week-averaged NH<inline-formula><mml:math id="M179" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> did not exceed <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.4</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.6</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M183" display="inline"><mml:mrow><mml:mn mathvariant="normal">5.3</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M184" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M185" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> or
<inline-formula><mml:math id="M186" display="inline"><mml:mi mathvariant="normal">ppb</mml:mi></mml:math></inline-formula> for bias, RMSE, minima, and maxima, respectively. Similarly,
a small range in values is replicated for all other bulk statistical metrics
such as MB, <inline-formula><mml:math id="M187" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>, and RMSE. Results stratified by region and season are similar to those for bulk statistics. Based on the in situ evaluation, simulation performance is very similar amongst all cases, with visible differences for the A03 simulation and the FX03 cases in which error is spatially detected in Figs. 7–9.</p>
      <p id="d1e3198">Statistical inconsistencies arise when comparing grid–grid values of hourly
PM<inline-formula><mml:math id="M188" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, O<inline-formula><mml:math id="M189" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and NH<inline-formula><mml:math id="M190" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> versus the orig simulation. Results indicate that similarities amongst the orig simulation decreases with fewer significant digit simulations and cases when analyzing the stacked and stratified (region and season) RMSE bar plot (Fig. 6). More specifically, performance with respect to the orig simulation is worse for the A03 simulation and for the FX03 cases as well. Such discrepancies do not occur consistently based on results provided by bar plots of statistical metrics of deposition rates (Fig. 10). Instead, errors appear to be confined to source regions at specific instances based on the maximum absolute (hourly) error spatial plots with respect to the orig simulation (Figs. 7–9).</p>
      <?pagebreak page1190?><p id="d1e3228">In summary, altering datasets by truncation to retain fewer significant
digits significantly improved data compression and slightly improved
runtime. Based on the thorough, yet spatially limited, in situ evaluation,
this study has shown this proposed technique did not compromise model
accuracy based on an evaluation of simulations and cases at in situ
locations compared to current air quality thresholds for daily PM<inline-formula><mml:math id="M191" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>,
MDA8 O<inline-formula><mml:math id="M192" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>, and 2-week-averaged NH<inline-formula><mml:math id="M193" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula>. These results show the optimal
benefit of altering CMAQ input data by keeping three significant digits, then
subsequently keeping four significant digits for CMAQ output data. In
addition, this proposed technique could be beneficial for groups that
perform complex air quality modeling and want to improve disk-space
management while negligibly impacting the accuracy of the simulations. Based
on the success of this study, we propose testing these techniques on the
rest of CMAQ input files such as initial conditions, boundary conditions, and
meteorological data to determine the viability of these techniques to more
adeptly manage disk space without compromising the quality of the CMAQ
simulations used for research and to develop air quality management
strategies.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e3262">The source code of the tool to alter data by keeping a specific number of
significant digits and a run script which includes usage instructions for
this tool, is available from <ext-link xlink:href="https://doi.org/10.5281/zenodo.6620983" ext-link-type="DOI">10.5281/zenodo.6620983</ext-link> (Wong, 2022a). CMAQ 5.3.1 is
available at <uri>https://www.epa.gov/cmaq/access-cmaq-source-code</uri> (last access: 30 September 2021, EPA, 2022).
Original, unaltered CMAQ input data for this study are available at
<ext-link xlink:href="https://doi.org/10.15139/S3/MHNUNE" ext-link-type="DOI">10.15139/S3/MHNUNE</ext-link> (CMAS, 2023). Original, unaltered CMAQ input data for this study
from 1 January to 1 May 2016 are available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.6624164" ext-link-type="DOI">10.5281/zenodo.6624164</ext-link> (Wong, 2022b).</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e3280">MSW conducted the runs, performed data analysis, created graphics, and wrote
the first draft of the manuscript and worked with DCW to improve it. DCW
originated and oversaw this work, coded the tool to alter data by keeping a
specific number of significant digits, created scripts to run the entire
experiment, outlined the first draft of the manuscript, and contributed to
writing and improving the manuscript.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e3286">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e3292">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e3298">This paper was edited by Sergey Gromov and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>Appel, K. W., Bash, J. O., Fahey, K. M., Foley, K. M., Gilliam, R. C., Hogrefe, C., Hutzell, W. T., Kang, D., Mathur, R., Murphy, B. N., Napelenok, S. L., Nolte, C. G., Pleim, J. E., Pouliot, G. A., Pye, H. O. T., Ran, L., Roselle, S. J., Sarwar, G., Schwede, D. B., Sidi, F. I., Spero, T. L., and Wong, D. C.: The Community Multiscale Air Quality (CMAQ) model versions 5.3 and 5.3.1: system updates and evaluation, Geosci. Model Dev., 14, 2867–2897, <ext-link xlink:href="https://doi.org/10.5194/gmd-14-2867-2021" ext-link-type="DOI">10.5194/gmd-14-2867-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><?label 1?><mixed-citation>Burrows, M. and Wheeler, D. J.: A Block Sorting Data Compression Algorithm,
Tech. report, Digital Systems Research Center, Digital Equipment
Corporation, Palo Alto, CA,
<uri>http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.37.6774</uri> (last access: 30 September 2021), 1994.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><?label 1?><mixed-citation>Byun, D. and Schere, K. L.: Review of the governing equations, computational
algorithms, and other components of the Models-3 Community Multiscale Air
Quality (CMAQ) modeling system, Appl. Mech. Rev., 59, 51–77,
<ext-link xlink:href="https://doi.org/10.1115/1.2128636" ext-link-type="DOI">10.1115/1.2128636</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><?label 1?><mixed-citation>CMAS: CMAQ Model Version 5.3 Input Data – 1/1/2016–12/31/2016 12 km CONUS, Dataverse [data set], <ext-link xlink:href="https://doi.org/10.15139/S3/MHNUNE" ext-link-type="DOI">10.15139/S3/MHNUNE</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><?label 1?><mixed-citation>Deutsch, L. P.: DEFLATE compressed data format specification version 1.3,
Tech. Rep. IETF RFC1951, Internet Engineering Task Force, Menlo Park, CA,
USA, <ext-link xlink:href="https://doi.org/10.17487/RFC1951" ext-link-type="DOI">10.17487/RFC1951</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><?label 1?><mixed-citation>EPA: Community Multiscale Air Quality Modeling System (CMAQ), Access CMAQ Source Code, EPA [code], <uri>https://www.epa.gov/cmaq/access-cmaq-source-code</uri> (last access: 31 January 2023), 2022.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><?label 1?><mixed-citation>Huffman, D. A.: A method for the construction of minimum redundancy codes,
Proceedings of the IRE, Vol. 40, no. 9, Sept. 1952,  1098–1101, <ext-link xlink:href="https://doi.org/10.1109/JRPROC.1952.273898" ext-link-type="DOI">10.1109/JRPROC.1952.273898</ext-link>, 1952.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><?label 1?><mixed-citation>Kouznetsov, R.: A note on precision-preserving compression of scientific data, Geosci. Model Dev., 14, 377–389, <ext-link xlink:href="https://doi.org/10.5194/gmd-14-377-2021" ext-link-type="DOI">10.5194/gmd-14-377-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><?label 1?><mixed-citation>Kryukov, K., Ueda, M. T., Nakagawa, S., and Imanishi, T.: Sequence
Compression Benchmark (SCB) database – A comprehensive evaluation of
reference-free compressors for FASTA-formatted sequences, GigaScience,
9,  giaa072, <ext-link xlink:href="https://doi.org/10.1093/gigascience/giaa072" ext-link-type="DOI">10.1093/gigascience/giaa072</ext-link>,
2020.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><?label 1?><mixed-citation>United States Environmental Protection Agency: The Community Multiscale Air
Quality Model version 5.3.1, Zenodo [Software], <ext-link xlink:href="https://doi.org/10.5281/zenodo.3585898" ext-link-type="DOI">10.5281/zenodo.3585898</ext-link>,
2019.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><?label 1?><mixed-citation>Wong, D.: Tool and python programs for the paper “The Impact of Altering Emission Data Precision on Compression Efficiency and Accuracy of Simulations of the Community Multiscale Air Quality Model”, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.6620983" ext-link-type="DOI">10.5281/zenodo.6620983</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><?label 1?><mixed-citation>Wong, D.:   2016 CMAQ v531 12 kim CONUS domain test dataset for 1/1 – 1/5/2016, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.6624164" ext-link-type="DOI">10.5281/zenodo.6624164</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><?label 1?><mixed-citation>Zender, C. S.: Bit Grooming: statistically accurate precision-preserving quantization with compression, evaluated in the netCDF Operators (NCO, v4.4.8<inline-formula><mml:math id="M194" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>), Geosci. Model Dev., 9, 3199–3211, <ext-link xlink:href="https://doi.org/10.5194/gmd-9-3199-2016" ext-link-type="DOI">10.5194/gmd-9-3199-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><?label 1?><mixed-citation>Ziv, J. and Lempel, A.: A universal algorithm for sequential data
compression, IEEE Transactions on Information Theory, 23,
337–343, <ext-link xlink:href="https://doi.org/10.1109/TIT.1977.1055714" ext-link-type="DOI">10.1109/TIT.1977.1055714</ext-link>, 1977.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>The impact of altering emission data precision on compression efficiency and accuracy of simulations of the community multiscale air quality model</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
      
Appel, K. W., Bash, J. O., Fahey, K. M., Foley, K. M., Gilliam, R. C., Hogrefe, C., Hutzell, W. T., Kang, D., Mathur, R., Murphy, B. N., Napelenok, S. L., Nolte, C. G., Pleim, J. E., Pouliot, G. A., Pye, H. O. T., Ran, L., Roselle, S. J., Sarwar, G., Schwede, D. B., Sidi, F. I., Spero, T. L., and Wong, D. C.: The Community Multiscale Air Quality (CMAQ) model versions 5.3 and 5.3.1: system updates and evaluation, Geosci. Model Dev., 14, 2867–2897, <a href="https://doi.org/10.5194/gmd-14-2867-2021" target="_blank">https://doi.org/10.5194/gmd-14-2867-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
      
Burrows, M. and Wheeler, D. J.: A Block Sorting Data Compression Algorithm,
Tech. report, Digital Systems Research Center, Digital Equipment
Corporation, Palo Alto, CA,
<a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.37.6774" target="_blank"/> (last access: 30 September 2021), 1994.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
      
Byun, D. and Schere, K. L.: Review of the governing equations, computational
algorithms, and other components of the Models-3 Community Multiscale Air
Quality (CMAQ) modeling system, Appl. Mech. Rev., 59, 51–77,
<a href="https://doi.org/10.1115/1.2128636" target="_blank">https://doi.org/10.1115/1.2128636</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
      
CMAS: CMAQ Model Version 5.3 Input Data – 1/1/2016–12/31/2016 12&thinsp;km CONUS, Dataverse [data set], <a href="https://doi.org/10.15139/S3/MHNUNE" target="_blank">https://doi.org/10.15139/S3/MHNUNE</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
      
Deutsch, L. P.: DEFLATE compressed data format specification version 1.3,
Tech. Rep. IETF RFC1951, Internet Engineering Task Force, Menlo Park, CA,
USA, <a href="https://doi.org/10.17487/RFC1951" target="_blank">https://doi.org/10.17487/RFC1951</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
      
EPA: Community Multiscale Air Quality Modeling System (CMAQ), Access CMAQ Source Code, EPA [code], <a href="https://www.epa.gov/cmaq/access-cmaq-source-code" target="_blank"/> (last access: 31 January 2023), 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
      
Huffman, D. A.: A method for the construction of minimum redundancy codes,
Proceedings of the IRE, Vol. 40, no. 9, Sept. 1952,  1098–1101, <a href="https://doi.org/10.1109/JRPROC.1952.273898" target="_blank">https://doi.org/10.1109/JRPROC.1952.273898</a>, 1952.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
      
Kouznetsov, R.: A note on precision-preserving compression of scientific data, Geosci. Model Dev., 14, 377–389, <a href="https://doi.org/10.5194/gmd-14-377-2021" target="_blank">https://doi.org/10.5194/gmd-14-377-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
      
Kryukov, K., Ueda, M. T., Nakagawa, S., and Imanishi, T.: Sequence
Compression Benchmark (SCB) database – A comprehensive evaluation of
reference-free compressors for FASTA-formatted sequences, GigaScience,
9,  giaa072, <a href="https://doi.org/10.1093/gigascience/giaa072" target="_blank">https://doi.org/10.1093/gigascience/giaa072</a>,
2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
      
United States Environmental Protection Agency: The Community Multiscale Air
Quality Model version 5.3.1, Zenodo [Software], <a href="https://doi.org/10.5281/zenodo.3585898" target="_blank">https://doi.org/10.5281/zenodo.3585898</a>,
2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
      
Wong, D.: Tool and python programs for the paper “The Impact of Altering Emission Data Precision on Compression Efficiency and Accuracy of Simulations of the Community Multiscale Air Quality Model”, Zenodo [data set], <a href="https://doi.org/10.5281/zenodo.6620983" target="_blank">https://doi.org/10.5281/zenodo.6620983</a>, 2022a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
      
Wong, D.:   2016 CMAQ v531 12 kim CONUS domain test dataset for 1/1 – 1/5/2016, Zenodo [data set], <a href="https://doi.org/10.5281/zenodo.6624164" target="_blank">https://doi.org/10.5281/zenodo.6624164</a>, 2022b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
      
Zender, C. S.: Bit Grooming: statistically accurate precision-preserving quantization with compression, evaluated in the netCDF Operators (NCO, v4.4.8+), Geosci. Model Dev., 9, 3199–3211, <a href="https://doi.org/10.5194/gmd-9-3199-2016" target="_blank">https://doi.org/10.5194/gmd-9-3199-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
      
Ziv, J. and Lempel, A.: A universal algorithm for sequential data
compression, IEEE Transactions on Information Theory, 23,
337–343, <a href="https://doi.org/10.1109/TIT.1977.1055714" target="_blank">https://doi.org/10.1109/TIT.1977.1055714</a>, 1977.

    </mixed-citation></ref-html>--></article>
