<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">GMD</journal-id><journal-title-group>
    <journal-title>Geoscientific Model Development</journal-title>
    <abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1991-9603</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/gmd-13-6237-2020</article-id><title-group><article-title>Using wavelet transform and dynamic time warping to identify the limitations
of the CNN model as an air quality forecasting system</article-title><alt-title>Limitations of the CNN model as an AQF system</alt-title>
      </title-group><?xmltex \runningtitle{Limitations of the CNN model as an AQF system}?><?xmltex \runningauthor{E.~Eslami et~al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Eslami</surname><given-names>Ebrahim</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes">
          <name><surname>Choi</surname><given-names>Yunsoo</given-names></name>
          <email>ychoi6@uh.edu</email>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Lops</surname><given-names>Yannic</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Sayeed</surname><given-names>Alqamah</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Salman</surname><given-names>Ahmed Khan</given-names></name>
          
        </contrib>
        <aff id="aff1"><institution>Department of Earth and Atmospheric Sciences, University of Houston,
Houston, TX 77204, United States</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Yunsoo Choi (ychoi6@uh.edu)</corresp></author-notes><pub-date><day>9</day><month>December</month><year>2020</year></pub-date>
      
      <volume>13</volume>
      <issue>12</issue>
      <fpage>6237</fpage><lpage>6251</lpage>
      <history>
        <date date-type="received"><day>5</day><month>December</month><year>2019</year></date>
           <date date-type="rev-request"><day>9</day><month>March</month><year>2020</year></date>
           <date date-type="rev-recd"><day>12</day><month>October</month><year>2020</year></date>
           <date date-type="accepted"><day>25</day><month>October</month><year>2020</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2020 </copyright-statement>
        <copyright-year>2020</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://gmd.copernicus.org/articles/.html">This article is available from https://gmd.copernicus.org/articles/.html</self-uri><self-uri xlink:href="https://gmd.copernicus.org/articles/.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e114">As the deep learning algorithm has become a popular data analysis technique,
atmospheric scientists should have a balanced perception of its strengths
and limitations so that they can provide a powerful analysis of complex data
with well-established procedures. Despite the enormous success of the
algorithm in numerous applications, certain issues related to its
applications in air quality forecasting (AQF) require further analysis and
discussion. This study addresses significant limitations of an advanced deep
learning algorithm, the convolutional neural network (CNN), in two common
applications: (i) a real-time AQF model and (ii) a post-processing tool in
a dynamical AQF model, the Community Multi-scale Air Quality Model (CMAQ).
In both cases, the CNN model shows promising accuracy for ozone prediction
24 h in advance in both the United States of America and South Korea (with an
overall index of agreement exceeding 0.8). For the first case, we use the
wavelet transform to determine the reasons behind the poor performance of
CNN during the nighttime, cold months, and high-ozone episodes. We find that
when fine wavelet modes (hourly and daily) are relatively weak or when
coarse wavelet modes (weekly) are strong, the CNN model produces less
accurate forecasts. For the second case, we use the dynamic time warping
(DTW) distance analysis to compare post-processed results with their CMAQ
counterparts (as a base model). For CMAQ results that show a consistent DTW
distance from the observation, the post-processing approach properly
addresses the modeling bias with predicted indexes of agreement exceeding 0.85. When the DTW
distance of CMAQ versus observation is irregular, the post-processing approach
is unlikely to perform satisfactorily. Awareness of the limitations in CNN
models will enable scientists to develop more accurate regional or local air
quality forecasting systems by identifying the affecting factors in high-concentration episodes.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e126">Currently, atmospheric scientists have shown significant interest in
applying machine learning (ML) algorithms in their field, specifically for
air quality forecasting, remote sensing data retrieval, and hurricane
tracking. ML is a technique used for developing data-driven algorithms that
learn to mimic human behavior on the basis of a prior example or experience.
It is a tool that allows systems to more effectively deal with
knowledge-intensive problems in complex domains, which occurs via learning
that involves gathering information from a training dataset and using a
certain logic to purposefully detect a pattern of behavior. The fundamental
goal of ML models is to apply the detected patterns to make generalizations
beyond the examples in the training set.</p>
      <p id="d1e129">Generalizations stemming from ML models provide a scope of improvement in a
number of physical applications. Evidence of the growing interest in
applying ML is the rapid increase in the number of scientific publications
in this area, illustrated in Fig. S1 in the Supplement. However, the focus of these studies
was the general performance of the model ML models compared to that of
conventional statistical models rather than identifying the shortcomings of
such models in explaining the uncertainties of prediction models. Such
examples can be found in studies by Eslami et al. (2019, 2020a, b),
Choi et al. (2019), Sayeed et al. (2020), and Lops et al. (2019). To achieve
more reasonable outcomes, we must first explore the current challenges we
face when forecasting ambient air quality and then assess how or even
whether ML<?pagebreak page6238?> models can address these challenges to produce more accurate
forecasting.</p>
      <p id="d1e132">To develop a capable air quality forecasting tool, atmospheric scientists
often turn to chemical transport models (CTMs) and statistical models, both
of which use meteorological parameters and chemical precursors from previous
atmospheric conditions to estimate the following conditions. A brief summary
of these models appears in Zhang et al. (2012). Although CTMs, with their
dynamical implementation of atmospheric chemistry and physics, have shown
promise in forecasting, they are too computationally intensive for real-time
operational forecasts. Thus, computationally efficient statistical models
such as ML have emerged as alternative approaches. Unlike CTMs, however,
these models mainly rely on data from a network of monitoring stations that
are sparsely distributed and measure a limited number of meteorology and air
quality variables (Eslami et al., 2020a). Given the complexity of the
formation and depletion of air pollutants such as ozone, this limitation may be
vital in predicting extreme events (Eslami et al., 2020b).</p>
      <p id="d1e135">Another challenge in predicting ozone concentration is the “external”
relationships among predictors. For instance, as important meteorological
parameters, temperature and solar radiation are synoptic factors, while the
wind field is influenced by regional factors such as geography and
urbanization. Such conditions particularly affect ozone variability since
locally produced <inline-formula><mml:math id="M1" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> emissions under certain meteorological
circumstances lead to the formation of ozone that is later transported by
the wind and detected by monitoring stations (Pan et al., 2015).
Nevertheless, station-specific ML models use such chemical and
meteorological variables as a footprint of local conditions.</p>
      <p id="d1e150">Although local emissions of ozone precursors are the dominant source of
ozone, particularly in urban areas, ozone pollution arising from sources
outside of a target region, such as background ozone, inevitably degrade
local air quality (Camalier et al., 2007). The lack of measurable
environmental variables that indicate the potential long-range transport of
air pollutants poses an unprecedented challenge for an ML model to estimate
ozone concentrations over downwind communities (Eslami et al., 2020a).
Because of the nonlinear spatial relationships between neighboring
monitoring stations, ML models as operational real-time forecasting systems
produce relative uncertainty.</p>
      <p id="d1e153">A number of studies have proposed solutions addressing the above limitations
of ML models. Eslami et al. (2020a) implement a deep convolutional neural
network (CNN) (Krizhevsky et al., 2012) model that uses hourly values of
several meteorological and air pollution variables to predict hourly ozone
concentrations 24 h in advance. Even though the accuracy of the
forecasting system guarantees a reasonable level of accuracy, it fails to
address high-ozone episodes owing to the infrequent occurrences of such
events, which led to the undertraining of the CNN model. In another study,
Eslami et al. (2020b) propose a data ensemble approach that mitigates this
issue by regularizing the training dataset toward capturing high-ozone
episodes. While the authors remove a significant portion of the
underprediction biases of the CNN model, its predictions of ozone during the
nighttime and on rainy days are unreliable. Sayeed et al. (2020) use
historical data covering a longer period within a diverse geographical
domain (Texas) to train a similar CNN model. Their results from stations for
which fewer measurements are available, while more accurate, are prone to
uncertainty. Using the outputs of air quality and meteorological forecast
models to map the hourly ozone concentrations at station locations, Choi et al. (2019) train a similar deep CNN model, a spatially generalized model
that bias-corrects ozone forecasts of the community multi-scale air quality
(CMAQ) model for all monitoring stations in the EPA AirNow network. Even
though the model significantly improved CMAQ forecasts, the bias-correction
process and the unbalanced CMAQ modeling outputs are unclear.</p>
      <p id="d1e156">This paper discusses certain limitations of the machine learning model using
wavelet transform and dynamic time warping (DTW). Wavelet transform is a
powerful technique for analyzing the temporal variation of a time series
(Grinsted et al., 2004). Wavelet analysis uses an adjustable resolution to
translate time series data and then decomposes the data into a certain
frequency level that cannot be achieved by other conventional methods such
as Fourier analysis (Huang et al., 2010). DTW is a nonlinear technique that
measures any alignment between two time series (i.e., model prediction and
observation in this study) by warping them to match their similarities
(Berndt and Clifford, 1994). By introducing two applications of CNN in the
real-time ozone forecasting system, we use these analytical tools to
identify the source of the prediction biases of the CNN model. In this
paper, we do not describe the forecasting results in detail but instead
refer the reader to studies by Eslami et al. (2020a, b), Choi et al.
(2019), and Sayeed et al. (2020).</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Materials and methods</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Deep convolutional neural networks</title>
      <p id="d1e174">The deep CNN model (Krizhevsky et al., 2012) is a common deep learning
architecture that has long been used in numerous applications (Deng and Yu,
2014; Schmidhuber, 2015; Goodfellow et al., 2016; Litjens et al., 2017; Chen
et al., 2018; Kamilaris and Prenafeta-Boldú, 2018; Higham and Higham,
2019). Unlike other methods, the CNN model is capable of analyzing joint
features and attaining greater accuracy on large-scale datasets. Deep CNNs
can be trained to approximate smooth, highly nonlinear functions (LeCun et
al., 2015), rendering them appropriate for analyzing nonlinear processes in
the atmosphere. In addition, feature extraction using deep learning
algorithms is more efficient than<?pagebreak page6239?> using other neural network methods,
particularly when multiple hidden layers are structured (Krizhevsky et al.,
2012).</p>
      <p id="d1e177">A schematic for the deep CNN used in this paper appears in Fig. 1. The
figure shows the input layer of the CNN algorithm, which represents the
normalized time series of all input variables. The normalization process
prevents a steep cost function and averts one feature from overbearing
others. A filter passes through a set of units located in a small
neighborhood in the previous convolutional layer. With local receptive
fields, neurons can extract the elementary features of inputs that are then
combined with those of higher layers. The outputs of such a set of neurons
constitute a feature map (see Fig. 3). At each position, various types of
units in different feature maps compute various types of features. A
sequential implementation of this procedure for each feature map is used for
scanning the input data with a single neuron in a local receptive field and
storing the states of this neuron at corresponding locations in the feature
map. The constrained units in a feature map perform the same operation on
different instances in a time series, and several feature maps (with
different weight vectors) can comprise one convolutional layer. Thus,
multiple features can be extracted in each instance. Once a feature is
detected, its exact “location” becomes less important as long as its
approximate position relative to the other features is preserved (Krizhevsky
et al., 2012; LeCun et al., 2015).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e182">Schematic of the deep CNN model in our approach.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f01.png"/>

        </fig>

      <p id="d1e192">CNN uses a kernel of a given size to capture changes in the temporal
variation of the input data by sweeping through time series. The various
sections of the data are represented by feature maps. An additional layer
performs local averaging, called “pooling,” and subsampling reduces the
resolution of the feature map and the sensitivity of the output to possible
shifts and distortions. This step could potentially discard important
information (e.g., sudden ozone peaks) as explained in Sabour et al. (2017).
Hence, this study uses the convolution layer without pooling. The feature
maps are connected to a fully connected layer, which helps us to map each
feature of multiple inputs to the hourly ozone output (see Fig. 1).</p>
      <p id="d1e195">Compared to fully-connected multilayer perceptrons (MLPs) and recurrent
neural networks (RNNs), which have been extensively used as regression
models, CNNs are attractive for several reasons. MLPs and RNNs are not
explicitly designed to model variance within an estimation that results from
a complex interaction between several inputs and outputs. While MLPs of
sufficient size could indeed capture invariance, they require large networks
with a large training set. Compared to the CNNs proposed in this study, RNNs
are challenging to implement and computationally expensive (Eslami et al.,
2020a; Sayeed et al., 2020; Lops et al., 2019).</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Wavelet transform</title>
      <p id="d1e206">Wavelet transformation decomposes a signal into a scale frequency space,
allowing the determination of the relative contributions of each temporal
scale present within a signal (Mallat, 1989). Wavelet decompositions are
powerful tools for analyzing the variation in signal properties across
different resolutions of geophysical variables (Mallat, 1989; Grinsted et al., 2004; Foufoula-Georgiou and Kumar, 2014). Using a fully scalable
modulated window that shifts along with the signal, the wavelet transform
overcomes the inability of the Fourier transform to represent a signal in
the time and frequency domain at the same time (see Fig. S2 in the
Supplement). The spectrum is calculated for every position.
After repeating the process, each time with a different window size, the
results constitute a collection of time–frequency representations of the
signal, all with different resolutions. The data are separated into
multiresolution components, each of which is studied with a resolution that
matches its scale (Aiazzi et al., 2002). While high-resolution components
capture fine-scale features in the signal, low-resolution components capture
the coarse-scale features.</p>
      <p id="d1e209">As wavelet analysis represents any arbitrary (nonlinear) function by a
linear combination of a set of wavelets or alternative basis functions, they
are highly suitable for use as both an integration kernel for analysis to
extract information about the process and a basis for representation or
characterization of processes (Kaheil et al., 2008). Figure S3 in the
Supplement shows the hourly ozone time series of a monitoring
station in downtown Seoul, South Korea, with a wavelet transform for the
year 2017. Here, the wavelet transform exhibits strong power levels
associated with a period of 24 h and a period of 168 h in the middle of the year,
indicating dominant daily (24 h) and weekly variation (168 h).</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Dynamic time warping</title>
      <p id="d1e220">To assess the similarity between two time series, DTW expands or contracts a
given time series to minimize the difference between the two of them (Berndt
and Clifford, 1994). The advantage it has over Euclidean distance, a
conventional distance analysis method, is that it highlights when a shift
(e.g., a time lag) occurs between two time steps in two time series (see
Fig. S4 in the Supplement). Euclidean distance takes pairs of
data within the time series and compares them. DTW calculates the smallest
distance between all points, matching one time step to many counterpart
steps on the linked time series (see Fig. S4 in the Supplement). Owing to its nonlinear
mapping capability, it is widely used in various domains, from time series
classification (Jeong et al., 2011) to bioinformatics (Giorgino, 2009),
health signal processing (Tormene et al., 2009), and speech recognition
(Berndt and Clifford, 1994).</p>
      <p id="d1e223">One benefit of DTW is that it will classify two time series of the same
shape as similar even if their absolute values differ or if one time series
contains large variability. Figure S5 in the Supplement compares the DTW distance between the
observation time series and two prediction models for an ozone monitoring
station in Texas. DTW detects the differences<?pagebreak page6240?> between CMAQ estimation and
observation with the highest difference in the middle of 2014.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and discussion</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Case 1: CNN as a real-time ozone forecasting system</title>
      <p id="d1e242">In this case, we used the modeling experience reported in Eslami et al.
(2020a). Briefly, the system employs a deep CNN model that uses an hourly
variation of seven meteorological and two air quality parameters from the
day before as inputs to predict hourly ozone concentrations on the following
day for 25 monitoring stations in Seoul, South Korea. Figures S7 and S8 in the Supplement show
the accuracy of the CNN model (using the index of agreement; IOA) and the
time series comparison of average ozone concentrations between the
observation and the CNN prediction, respectively. Note that IOA is a
standardized measure of the degree of model prediction error and varies
between 0 and 1. The agreement value of 1 indicates a perfect match, and 0
indicates no agreement at all. While the model maintained a proper level of
prediction accuracy, it was prone to two main limitations: (i) its
performance at various times of the year varied (see Fig. S6 in the Supplement); and (ii) nighttime predictions showed higher relative bias and lower modeling
performance than daytime predictions (see Fig. S7 in the Supplement). In general, wavelet
transform can explain varying, time-dependent modeling performance;
nevertheless, the significant difference between modeling performance during
daytime and nighttime indicates an undertrained CNN model.</p>
<sec id="Ch1.S3.SS1.SSS1">
  <label>3.1.1</label><title>Time-dependent model performance</title>
      <p id="d1e252">The performance of the CNN model is directly dependent on how well the model
understands the relationship between the inputs (meteorology and ozone
precursors) and output (ozone concentration). Compared with meteorological
variables, emission sources from volatile organic compounds (VOCs) and <inline-formula><mml:math id="M2" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
experience less variability in time. Thus, meteorological variables
play an important role in governing the variation of the ozone at different
times throughout the year (Choi, 2014; Pan et al., 2019). Temperature, wind
speed, and relative humidity (RH) are among the most important
meteorological parameters affecting ozone variation.</p>
      <p id="d1e266">Figure 2 shows the wavelet power transform of the aforementioned
meteorological variables for 2017. Since we used an hourly time series to
calculate the wavelet powers, both the index and the period are in hours.
The figure also locates five time periods, which indicates significant
performance variations. From Fig. S6 in the Supplement, the CNN model under-performed during
weeks 3–9 and 44–51, labeled the “Worst CNN results” in Fig. 2. For weeks 14–22 and 42–44, the CNN model showed the best forecasting results. Between
weeks 29 and 33, the CNN model produced significant underestimations,
labeled “Large under-prediction” in Fig. 2. The figure shows strong
wavelet powers during a 24 h (daily) period for all variables, the
results of strong diurnal variation of these parameters, which are directly
or indirectly controlled by sunlight (e.g., temperature, relative humidity,
etc.). While the wavelet powers for wind speed were generally larger than
RH, the temperature showed lower but more consistent daily modes. This
finding is important since the CNN model can more accurately detect specific
“patterns” in the temperature than those in the wind speed and RH. Thus,
when the daily modes are stronger in temperature, the CNN model likely
performs better. In contrast, when the daily modes of the meteorological
variables are relatively weak, the CNN model performs poorly (see Fig. 2).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e271">Wavelet power transform of <bold>(a)</bold> temperature, <bold>(b)</bold> wind speed, and
<bold>(c)</bold> RH % for 2017 in Seoul, South Korea.</p></caption>
            <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f02.png"/>

          </fig>

      <p id="d1e290">The large coarse modes in the wind speed and RH lead to significant over and
underestimation of the CNN model. Figure S8 shows the polar frequency
(influenced by the wind speed) of the CNN modeling bias in various months.
As the figure shows, while southwesterly winds in August 2017 were
associated with relatively large underpredictions<?pagebreak page6241?> boosted by pollution
transport from the Incheon area, north–northwesterly winds with air coming
from less urbanized regions were allied with notable overpredictions.</p>
      <p id="d1e293">Figure S9 in the Supplement compares the CNN model predictions with observational data for the
seasons with respect to levels of RH. The figure showed the largest
differences in the CNN model predictions (both overpredictions and underpredictions)
when the level of RH was close to the extreme (very high and very low). This
finding was particularly evident for the summer months when the model showed
poor performance at capturing high-ozone episodes. This finding underscores
the importance of coarse models from the wavelet analysis during the warm
months. Directly indicating the overpredictions or underpredictions by the model
through these modes, however, is challenging. For instance, Fig. S10 in the Supplement shows
one high-ozone episode in July 2017, when the daily ozone peak exceeded
90 ppb on 2 continuous days at most stations. Here, the overprediction of
the CNN model was associated with high RH, while the underprediction was
linked to low RH, indicating more complexity among the relationships between
meteorological factors and ozone formation or depletion.</p>
      <p id="d1e296">Another reason for the poor performance of the CNN model during the selected
time period was the relatively large coarse modes (period <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">24</mml:mn></mml:mrow></mml:math></inline-formula> h). The CNN model received information about only the last day; hence,
it was unable to address the bi-daily and weekly trends with the input data.
For instance, for time periods with large underpredictions, coarse modes in
the wind speed were even larger than the daily modes. Thus, employing a
longer history would adequately explain the relationship between wind speed
and ozone. In the comparison of the average wavelet powers in various
periods (from daily to weekly modes) of CNN predictions and observational
data, Fig. 3 shows that the powers for both time series match periods of
approximately 24 h. After 32 h, however, the wavelet power of the
CNN model shrinks to a relatively constant power while that for the
observation reaches local extrema at around 3, 5, and 7 d.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e311">Wavelet power for various time periods (modes) for CNN predictions
and observations.</p></caption>
            <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f03.png"/>

          </fig>

      <?pagebreak page6242?><p id="d1e320">Although wavelet analysis indicates that modes coarser than 24 h are
important components of the ozone time series, their relationship to CNN
model accuracy can be complicated. Figure 4 compares wavelet powers for both
fine and coarse modes with a correlation coefficient (<inline-formula><mml:math id="M4" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>) in 25 ozone
stations in Seoul. For stations closer to the downtown area (i.e., those
with station numbers under 11), the fine modes had fewer wavelet powers than
those for stations in less urbanized areas, indicating that the relationship
between ozone concentrations with local emissions was evident in the less
urbanized areas than it was in the other areas. The coarse modes, however,
varied from station-to-station with relatively high coarse wavelet power
for those in less urbanized areas. Nonetheless, no evidence points to a
clear relationship between either coarse or fine wavelet modes and the
accuracy of the model. Figure 4 shows that the CNN model generally performed
better for stations close to downtown Seoul. Because Seoul has only one
meteorological station, these stations had access to more realistic weather
parameters in their training and prediction process.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e333">Relationship between <bold>(a)</bold> fine and <bold>(b)</bold> coarse wavelet power modes
and correlation coefficients in all stations in Seoul, South Korea.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f04.png"/>

          </fig>

</sec>
<sec id="Ch1.S3.SS1.SSS2">
  <label>3.1.2</label><title>Low modeling performance during the nighttime</title>
      <p id="d1e356">In their discussion of several air quality forecasting models that
incorporated machine learning algorithms, including CNN, deep neural
networks, and decision trees, Eslami et al. (2020a) and Eslami et al.
(2020b) claimed that the algorithms encounter a significant modeling bias
while estimating air quality concentrations during the nighttime. This bias
reduced the prediction accuracy of nighttime ozone concentrations, compared
to daytime concentrations, by more than 20 %. A similar issue is also
encountered by CTMs, even those with complex physical and chemical equations
that explain the diurnal variation of ozone concentrations.</p>
      <p id="d1e359">One reason for this modeling bias was likely the result of variation among
the meteorological inputs during the nighttime. Although their absolute
values were generally higher they were during the daytime, the relative
frequency of variation was more pronounced during the nighttime, causing a
discontinuity in the learning process of the CNN model. Since both daytime
and nighttime hours were inputs, the CNN model minimized the cost function
that contained “normalized” errors during both daytime and nighttime hours
(the cost function was the mean-squared errors or 24 h ozone predictions
at each step). Generally, there are more daytime hours than nighttime hours
(see Fig. S11 in the Supplement). Also, the accumulation of <inline-formula><mml:math id="M5" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> concentrations for these
extreme cases was mainly due to stagnant atmospheric conditions with wind
speeds close to their yearly minimum values (see Fig. S12a in the Supplement for scatterplots
with levels of wind speeds). As a result, the CNN model was vulnerable to
characteristic bias in nighttime ozone estimations. As a customized cost
function could be a potential solution to this limitation, it requires
further investigation.</p>
      <p id="d1e373">The performance of the CNN model in predicting nighttime ozone
concentrations also suffered because of the misinterpretation of extreme
conditions of the input parameters. Figure 5 shows scatterplots that
compare CNN predictions and observations by the levels of two important
ozone precursors (<inline-formula><mml:math id="M6" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> concentrations) and meteorological variables
(RH %) separated into daytime and nighttime. The <inline-formula><mml:math id="M7" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> concentration
was generally higher during the nighttime when the ozone concentration was
near zero for extreme <inline-formula><mml:math id="M8" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> values because of conditions amenable to
ozone depletion with the absence of sunlight. Unable to capture this
relationship, however, the CNN model overestimated these cases (See Fig. 5a).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e412">Scatterplots comparing CNN predictions and observations with
respect to levels of <bold>(a)</bold> <inline-formula><mml:math id="M9" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> concentrations and <bold>(b)</bold> RH %.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f05.png"/>

          </fig>

      <p id="d1e438">In contrast to the above-mentioned overestimated events, Fig. 5b shows an
underestimation of nighttime ozone when the level of RH % was generally
high, primarily during warm days. A similar pattern occurred when the
surface pressure was accounted for (Fig. S12b in the Supplement). Such underestimated events
occurred for two reasons. One is that high (or low) levels of RH % and
surface pressure generally occur at about the same time during the early
morning (or late afternoon) when the planetary boundary layer (PBL) is at
its lowest (or highest) level during the day. In these extreme conditions,
the earlier sunrise (or later sunset) during the summer months established a
condition that elevated ozone concentrations. As these events normally
occurred only during short periods of time, the CNN model was not
sufficiently trained to capture these relationships.</p>
</sec>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Case 2: CNN as a post-processing tool in a real-time ozone forecasting
system</title>
      <p id="d1e450">In this case, a generalized bias-correction CNN model introduced by Choi et al. (2019) was used. Their model is a computationally efficient deep
learning-based model that produces more reliable numerical results. The
authors used a deep CNN model to map ozone precursors from CMAQ and
meteorological parameters from the weather research and forecasting (WRF)
model (as input variables) to observe hourly ozone concentrations at a
monitoring station (as a<?pagebreak page6243?> target). Their model, the CMAQ-CNN model,
significantly improves the performance of the CMAQ model in both accuracy
and bias. Figure S13 in the Supplement shows the statistical improvements (in correlation,
root-mean-squared error, and standard deviation) of the CMAQ-CNN model over
the CMAQ model (as a base model) in different months. Figure S14 in the Supplement compares
the daily maximum ozone estimated by CMAQ and CMAQ-CNN in 48 states for
which the CMAQ-CNN significantly moderated the overpredictions of the CMAQ.</p>
      <p id="d1e453">It was clear that the likelihood of the CMAQ-CNN model producing accurate
results was strongly associated with the quality of CMAQ forecasts; when
CMAQ forecasted hourly ozone concentrations with a station-specific yearly
IOA of more than 0.5, the IOA of the CMAQ-CNN model was more than 0.8 for
most cases. The probability of such accuracy was generally unrelated to that
of the CMAQ model. For instance, the CMAQ-CNN model was unable a reach the
yearly IOA <inline-formula><mml:math id="M10" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.8 even though the CMAQ IOA was more than 0.7 (e.g., EPA
no. 101, Tennessee: CMAQ IOA <inline-formula><mml:math id="M11" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.7; CMAQ-CNN IOA <inline-formula><mml:math id="M12" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.78). In some cases,
however, the yearly IOA following the post-processing approach was less than
0.7 (e.g., EPA no. 1011, California: CMAQ-CNN IOA <inline-formula><mml:math id="M13" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.63). Here, we used the
distance analysis from DTW to explain (i) why CMAQ-CNN produced satisfactory
results at some stations but not others, and (ii) why it performed poorly at
some stations.</p>
<sec id="Ch1.S3.SS2.SSS1">
  <label>3.2.1</label><title>Satisfactory post-processing scenarios</title>
      <p id="d1e491">Figure 6 shows the time-series of CMAQ, CMAQ-CNN, and observed daily ozone
concentrations at three EPA stations. These stations were selected because
the IOA accuracy of the CMAQ-CNN model was either more than 0.9 (Fig. 6a and
b) or 20 % more than that of CMAQ (Fig. 6c). Figure 7 compares the DTW
distance analysis of CMAQ and CMAQ-CNN for the same stations. These are
three typical cases of satisfactory improvement by the CMAQ-CNN
post-processing approach:</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><label>Figure 6</label><caption><p id="d1e496">Comparison of the time series of CMAQ and CMAQ-CNN predictions for
EPA stations <bold>(a)</bold> no. 3001 (California), <bold>(b)</bold> no. 33 (Florida), and <bold>(c)</bold> no. 4
(Vermont).</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f06.png"/>

          </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><label>Figure 7</label><caption><p id="d1e516">Comparison of the distance analysis of CMAQ and CMAQ-CNN
predictions for EPA stations <bold>(a)</bold> no. 3001 (California), <bold>(b)</bold> no. 33
(Florida), and <bold>(c)</bold> no. 4 (Vermont).</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f07.png"/>

          </fig>

      <p id="d1e535">Figures 6a–7a show that observed ozone concentrations in this California location were higher at the
beginning of the ozone season, followed by relatively steady values ranging
between 20–40 ppb. After May, however, CMAQ significantly overestimated daily
ozone concentrations. The overestimation was more pronounced at the end of
the ozone season, resulting in an overall IOA accuracy of 0.73. The DTW
distance analysis showed a consistent distance between CMAQ predictions and
observed values. Because of this consistency, the CMAQ-CNN model recognized
the bias trends in CMAQ, boosting its prediction accuracy by 0.17, even
though the large distance from the CMAQ predictions (mean distance <inline-formula><mml:math id="M14" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.52)
mirrored a relatively significant overestimation in the CMAQ-CNN
post-processed results.</p>
      <p id="d1e545">Figures 6b–7b show that here the trend in ozone concentrations followed a U-shaped curve in the
ozone season because of strong summer winds coming from the large bodies of
water near Florida (the North Atlantic Ocean and the Gulf<?pagebreak page6244?> of Mexico). For
this station, CMAQ accurately predicted this trend throughout the ozone
season with a relatively constant bias from July to September. As a result,
the overall accuracy of the IOA was 0.84 for the CMAQ prediction. The CMAQ
was also consistent with the DTW analysis, with two distance gaps in July
and September (at the beginning and the end of the CMAQ overestimation
period). The CMAQ-CNN model, recognizing the adequate performance of the
base model in its post-processing algorithm, further improved the IOA
accuracy of CMAQ by around 10 %.</p>
      <p id="d1e548">Figures 6c–7c show that the trend of observed ozone had a steady decrease in this northeastern
state because of the significantly cooler summer and fall months. This
trend, along with the fewer ozone emission sources surrounding this station,
resulted in the formation of less ozone during the ozone season. The CMAQ
model overestimated ozone concentrations by more than 50 % during most of
the season with a relatively large mean DTW distance (0.62). The CMAQ-CNN
model was able to address this issue because of the consistency of the bias
trend in CMAQ predictions (see left panel in Fig. 7c for DTW distance). Thus, overall,
the accuracy of IOA improved by 0.2.</p>
      <p id="d1e551">The satisfactory post-processing results using the CMAQ-CNN model were
mainly characterized by the regularity of the bias trend in CMAQ as the base
model for training the CNN model. As shown by the DTW distance analysis,
when the DTW distance of CMAQ predictions from observed values was
consistent throughout the ozone season, the CNN model was able to improve
the CMAQ results to a reliable level (IOA <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.8</mml:mn></mml:mrow></mml:math></inline-formula>). To test this
hypothesis, we used the CMAQ-CNN post-processing approach in typical
unsatisfactory scenarios.</p>
</sec>
<sec id="Ch1.S3.SS2.SSS2">
  <label>3.2.2</label><title>Unsatisfactory post-processing scenarios</title>
      <p id="d1e572">Figure 8 compares the time series of ozone observations with the CMAQ and
CMAQ-CNN models at three selected EPA stations. For all of these stations,
the CMAQ-CNN model failed to reach a reliable IOA accuracy level of 0.8,
while the accuracy of the CMAQ model improved. Figure 9 represents the DTW
distance analysis of the two models and the ozone observation for the same
stations. Unsatisfactory improvement by the CMAQ-CNN model occurred in the
following three cases.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><label>Figure 8</label><caption><p id="d1e577">Comparison of the distance analysis of CMAQ and CMAQ-CNN
predictions for EPA stations <bold>(a)</bold> no. 101 (Tennessee), <bold>(b)</bold> no. 1011
(California), and <bold>(c)</bold> no. 9008 (Oklahoma).</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f08.png"/>

          </fig>

      <p id="d1e595">Figures 8a–9a show that the ozone trend in this station fluctuated throughout the ozone season with
frequent spikes<?pagebreak page6245?> in May, July, and October, primarily the result of biomass
burning (Choi et al., 2016). While the CMAQ model predicted ozone
concentrations with a relatively small bias (IOA <inline-formula><mml:math id="M16" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.7), the bias trend
varied from time to time, i.e., trends of underpredictions and overpredictions
changed frequently. A footprint of these trends, i.e., changes in the
path of the distance trend, is evident in the DTW analysis. This
inconsistency was mirrored in the equivalent DTW analysis for the CMAQ-CNN
model by a consistent distance trend, resulting in an unsatisfactory IOA
accuracy level (IOA <inline-formula><mml:math id="M17" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.78) with an increased mean DTW distance (0.89
compared to 0.74 for the CMAQ time series).</p>
      <p id="d1e613">Figures 8b–9b show that the trend in this California location was a relatively constant
concentration of ozone generally ranging between 10–30 ppb. The CMAQ model
significantly overpredicted ozone concentrations throughout the entire time
period, mostly as a result of the proximity of this station to the Pacific
Ocean (San Diego County), which controls the variation in the daily ozone
concentration (Pan et al., 2017). The DTW distance analysis shows a
significant yet steady spike in the distance between CMAQ and the
observation. Thus, even though the CMAQ-CNN significantly improved the
accuracy of the CMAQ model (IOA <inline-formula><mml:math id="M18" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.63 compared to CMAQ IOA <inline-formula><mml:math id="M19" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.44), the
large distance accounted for the underperformance of the post-processing
approach. This also mirrored the consistent distance in the CMAQ-CNN
distance trend (see the right panel in Fig. 9c).</p>
      <p id="d1e630">Figures 8c–9c show that, in this station, the ozone concentration followed an infrequent trend with
lows and highs spread indiscriminately across the ozone season, the result
of several factors affecting air pollution in this region, including biomass
burning, a strong frontal system, and other<?pagebreak page6246?> conditions. As a result, the
CMAQ model underperformed, with substantial overestimation during most of the
time period (IOA <inline-formula><mml:math id="M20" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.55). In addition, the bias of the CMAQ model did not
follow as clear a trend as the DTW distance analysis. The CMAQ-CNN model
improved the prediction results by more than 10 % with a reduced DTW
distance (0.27m versus 0.35 for the CMAQ time series). Nevertheless, the varying
ozone trend accompanying the inconsistency in the prediction bias trend
resulted in the low overall accuracy of the IOA of the CMAQ-CNN for this
station (IOA <inline-formula><mml:math id="M21" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.67).</p>
      <p id="d1e647">Unlike the satisfactory cases, the unsatisfactory post-processing results
using the CMAQ-CNN model stemmed from the inconsistency in the bias trend
found by the DTW distance analysis. Another influential factor was the
variability of observed ozone concentrations. Because of the frequent
variation in the observational data, it was more complicated to train the
CMAQ-CNN model so that it addressed the bias in the CMAQ model. The
geographical location of a station was also an important factor in the
improvement level of the post-processing approach. Proximity to the large
body of water and/or sources from biomass burning during the ozone season
were among the influential geographical features. Also, as Figs. 8 and 9 show,
the DTW distances of the CMAQ-CNN predictions from the observed ones
followed a consistent trend. Therefore, the information in Figs. 6 and<?pagebreak page6247?> 7
indicate that a secondary post-processing model might be a possible solution
to boosting prediction accuracy.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9" specific-use="star"><?xmltex \currentcnt{9}?><label>Figure 9</label><caption><p id="d1e652">Comparison of the distance analysis of CMAQ and CMAQ-CNN
predictions for EPA stations <bold>(a)</bold> no. 101 (Tennessee), <bold>(b)</bold> no. 1011
(California), and <bold>(c)</bold> no. 9008 (Oklahoma).</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://gmd.copernicus.org/articles/13/6237/2020/gmd-13-6237-2020-f09.png"/>

          </fig>

</sec>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Discussion</title>
      <p id="d1e679">Despite the enormous success of the convolutional neural network (CNN)
algorithm in numerous applications, certain issues related to its
applications in air quality forecasting (AQF) require further analysis and
discussion. Our main goal in this paper was to discuss some of these issues
in a few practical applications. To discuss these issues analytically, we
used wavelet transform and dynamic time warping (DTW) as powerful
mathematical tools for time series analysis and models. Based on the
findings that were presented in the paper, these tools are extremely helpful
not only in understanding the issues with machine learning models but also
in fine-tuning them to improve their performances with a scientific point of
view. Awareness of the limitations in CNN models will enable scientists to
develop more accurate regional or local air quality forecasting systems by
identifying the affecting factors in high concentration episodes.</p>
      <p id="d1e682">Based on our findings in the base studies presenting the aforementioned CNN
models, in both cases, the CNN model shows reasonable accuracy for ozone
prediction, 24 h in advance, in two geographical locations (the United
States and South Korea). However, similar to other data-driven prediction
tools, in a CNN model the out-of-sample prediction error is almost always
greater than the in-sample prediction error. Thus, since both CNN models
were designed as a real-time air quality prediction models, the prediction
error is inevitable, even though (i) both models were configured for optimum
performance (based on the input or training samples),<?pagebreak page6248?> and (ii) in the
development of both models cross-validation processes were followed to
mitigate any systematic biases. However, the underperformance of the CNN
model was dependent on several factors, including modeling configuration
(e.g., the depth of CNN model), arrangements of input variables (e.g.,
number of previous days as inputs), the day of the week (e.g., weekdays
versus weekends), the hour of the day (e.g., daytime versus nighttime) (see
Eslami et al. (2019, 2020a, b), Choi et al. (2019), Sayeed et al.
(2020), and Lops et al. (2019), and the discussion within).</p>
      <p id="d1e685">Here, we discussed the general limitations of the CNN model in two common
applications: (i) a real-time AQF model and (ii) a post-processing tool in
a dynamical AQF model (i.e., CMAQ). These examples are fundamentally
different in terms of execution, one being a raw predictor (statistical
approach), while the other is a post-processor (hybrid approach). Since
both models are commonly used as a real-time air quality prediction system,
we discussed their issues individually to explain specific issues that one
may encounter in executing either of them. Thus, it will provide both
machine learning researchers and atmospheric scientists with multiple
candidate models and analytical tools to develop any specific model of their
choice.</p>
      <?pagebreak page6249?><p id="d1e688">For one case (raw prediction model), we used the wavelet transform to
determine the reasons behind the poor performance of CNN during the
nighttime, cold months, and high ozone episodes. We find that when fine
wavelet modes (hourly and daily) were relatively weak or when coarse wavelet
modes (weekly) were strong, the CNN model produced less accurate forecasts.
Since the CNN model has used only a single precious day of air quality and
meteorological parameters, the coarse patterns (e.g., weekly) were not
used as a prediction feature, and any connection between different
time series windows (as is revealed in a wavelet transform analysis) was not
considered. Thus, the wavelet transform can be helpful as a complementary
tool for filling these gaps in CNN prediction model development. It should
be noted that a long short-term memory (LSTM) model can potentially
incorporate some of the aforementioned time-dependencies (e.g., bi-daily or
weekly). However, the focus of this study is to address such a limitation in
a CNN model as a choice of the ML model.</p>
      <p id="d1e692">For the other case (post-processing model), we used the DTW distance
analysis to compare post-processed results with their CMAQ counterparts (as
a base model). For those CMAQ results with a consistent DTW distance from
the observation, the post-processing approach properly addressed the CMAQ
modeling bias with predicted IOAs exceeding 0.85. When the DTW distance of
CMAQ versus observation is irregular, the post-processing approach is unlikely
to perform satisfactorily. Even though the CMAQ-CNN model has included
several chemical components and meteorological variables as its inputs,
there was no input feature representing CMAQ's own accuracy. By comparing a
history of CMAQ results in different geographical locations with available
observation data, the DTW can provide an “irregularity” index as an
additional input feature.</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d1e704">Various applications of deep learning algorithms, particularly convolutional
neural networks, have universally been applied in the field of atmospheric
sciences, especially in air quality forecasting systems. Although such
applications supported easy-to-use, computationally efficient frameworks, and
flexible capabilities appeared to generate accurate prediction results, the
risk of exaggerated expectations may be a cause for concern. In an effort to
elucidate both the advantages and limitations of deep learning models in air
quality forecasting (AQF) systems, this paper addressed several common
issues raised by the use of these models.</p>
      <p id="d1e707">To explore the limitation, we chose two applications of two similar CNN
models: (i) CNN as an independent real-time AQF and (ii) CNN as a
post-processing model of a state-of-the-art dynamical model, the Community
Multi-scale Air Quality Model (CMAQ). For both cases, the CNN model resulted
in an acceptable 24 h in advance, hourly ozone concentration prediction
with an index of agreement (IOA) of more than 0.8 for two networks of
monitoring stations in South Korea and the United States. We selected two
powerful statistical data analytic techniques, wavelet transform and
dynamic time warping (DTW), to identify the limitations of the proposed
models in both cases. By applying these techniques, researchers find
discrepancies in the input data and their temporal trends and thus gain
awareness of the limitations of deep learning models.</p>
      <p id="d1e710">When the CNN model was used as a real-time AQF system in South Korea, it
underperformed during both cold months and high ozone episodes. In these
scenarios, we found that the fine wavelet modes (daily and hourly) were
relatively weaker than they were in other conditions. Also, when the coarse
modes were strong, the predictions of the CNN model were fraught with a
large number of errors. We also found that the model underperformed during
the nighttime hours, the results of an undertrained model and extreme values
of the input parameters during the nighttime.</p>
      <p id="d1e713">For the post-processing CNN model, the level of improvement depended on the
DTW distance of the CMAQ model to the observations. When the calculated
distance followed a consistent trend, the post-processing model was able to
address the bias of CMAQ, independent from its accuracy level or error
range. When such consistency was absent or when observed ozone varied
frequently, however, the errors in the CMAQ model were mirrored in the
results of the post-processing model.</p>
      <p id="d1e717">Given this discussion of the limitations of deep learning models, we suggest
that researchers configure their deep learning models based on temporal
trends within the input parameters, geographical locations, and variation
frequency of target pollutants. To predict ambient hourly ozone
concentrations, we have restricted our discussions to a multi-output
regression problem in supervised settings. While our study approach might be
valid for other supervised algorithms, we leave a detailed study of other
supervised methods for future work.</p>
</sec>

      
      </body>
    <back><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d1e725">The code for the algorithm development, evaluation, and
statistical analysis is freely available for noncommercial research
purposes by contacting the corresponding author.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e728">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/gmd-13-6237-2020-supplement" xlink:title="pdf">https://doi.org/10.5194/gmd-13-6237-2020-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e737">EE, YC, YL, AS, and AKS contributed to the
design and implementation of the research and to the analysis of the results.
EE took the lead in writing the manuscript with input from YC, YL,
AS, and AKS. YC supervised the project. EE and AS prepared the
modeling input data and optimized the python codes. All authors discussed
the results, commented on the manuscript, and contributed to the final
version of the manuscript.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e743">The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e749">The authors express their gratefulness to Wonbae Jeon and Shuai Pan, who prepared the 4-year CMAQ and SMOKE runs for the TCEQ that were used in this study.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e754">This research has been supported by the High Priority Area Research Seed Grant of the University of Houston (Research Seeds Grants 2019-Choi).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e760">This paper was edited by Adrian Sandu and reviewed by Arash Sarshar and two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>
Aiazzi, B., Alparone, L., Baronti, S., and Garzelli, A.: Context-driven
fusion of high spatial and spectral resolution images based on oversampled
multiresolution analysis, IEEE T. Geosci. Remote, 40, 2300–2312, 2002.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><?label 2?><mixed-citation>
Berndt, D. J. and Clifford, J.: Using dynamic time warping to find patterns
in time series, in: KDD workshop, 10, 359–370, 1994.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><?label 3?><mixed-citation>
Camalier, L., Cox, W., and Dolwick, P.: The effects of meteorology on ozone
in urban areas and their use in assessing ozone trends, Atmos.
Environ., 41, 7127–7137, 2007.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><?label 4?><mixed-citation>
Chen, H., Engkvist, O., Wang, Y., Olivecrona, M., and Blaschke, T.: The rise
of deep learning in drug discovery, Drug Discov. Today, 23, 1241–1250,
2018.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><?label 5?><mixed-citation>Choi, Y.: The impact of satellite-adjusted <inline-formula><mml:math id="M22" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mi mathvariant="normal">mathitx</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> emissions on simulated <inline-formula><mml:math id="M23" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mi mathvariant="normal">mathitx</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> discrepancies in the urban and outflow areas of the Pacific and Lower Middle US, Atmos. Chem. Phys., 14, 675–690, <ext-link xlink:href="https://doi.org/10.5194/acp-14-675-2014" ext-link-type="DOI">10.5194/acp-14-675-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><?label 6?><mixed-citation>Choi, Y., Jeon, W., Roy,  A., Souri,  A. H., Diao, L., Pan, S., and Eslami, E.:
CMAQ Modeling Archive for Exceptional Events Analysis, Texas Commission on
Environmental Quality (TCEQ), available at:
<ext-link xlink:href="https://www.tceq.texas.gov/assets/public/implementation/air/am/contracts/reports/pm/5821554181FY1609-20160829-uh-CMAQModelingArchiveForExceptionalEventsAnalyses.pdf">https://www.tceq.texas.gov/assets/public/implementation/air/am/ contracts/reports/pm/5821554181FY1609-20160829-uh-CMAQModelingArchiveForExceptionalEventsAnalyses.pdf</ext-link> (last access: November 2020),
2016.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><?label 7?><mixed-citation>
Choi, Y., Eslami, E., Sayeed, A., and Lops, Y.: CAMQ-AI: A computationally
efficient deep learning model to improve CMAQ performance over the United
States, in: 2019 AGU Fall Meeting, San Francisco, CA, December 2019, 2019.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><?label 8?><mixed-citation>
Deng, L. and Yu, D: Deep learning: methods and applications, Found. Trends
Signal Process., 7, 197–387, 2014</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><?label 13?><mixed-citation>Eslami, E., Choi, Y., Lops, Y., and Sayeed, A.: A Deep Learning Driven
Improved Ensemble Approach for Hurricane Forecasting, in: 2019 ESIP Annual
Meeting, Bethesda, MD, January 2019, <ext-link xlink:href="https://doi.org/10.6084/m9.figshare.7591775.v1" ext-link-type="DOI">10.6084/m9.figshare.7591775.v1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><?label 9?><mixed-citation>Eslami, E., Choi, Y., Lops, Y., and Sayeed, A.: A real-time hourly ozone prediction system using deep convolutional neural network, Neural Comput. Appl., 32, 8783–8797, <ext-link xlink:href="https://doi.org/10.1007/s00521-019-04282-x" ext-link-type="DOI">10.1007/s00521-019-04282-x</ext-link>, 2020a.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><?label 12?><mixed-citation>Eslami, E., Salman, A. K., Choi, Y., Sayeed, A., and Lops, Y.: A data ensemble approach for real-time air quality forecasting using extremely randomized trees and deep neural networks, Neural Comput. Appl., 32, 7563–7579, <ext-link xlink:href="https://doi.org/10.1007/s00521-019-04287-6" ext-link-type="DOI">10.1007/s00521-019-04287-6</ext-link>, 2020b.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><?label 14?><mixed-citation>
Foufoula-Georgiou, E. and Kumar, P. (Eds.): Wavelets in geophysics, Vol. 4, Elsevier, USA, 2014.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><?label 15?><mixed-citation>
Giorgino, T: Computing and visualizing dynamic time warping alignments in R:
the dtw package, J. Stat. Softw., 31, 1–24, 2009.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><?label 16?><mixed-citation>
Goodfellow, I., Bengio, Y., and Courville, A.: Deep learning, MIT press,
USA, 2016.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><?label 17?><mixed-citation>Grinsted, A., Moore, J. C., and Jevrejeva, S.: Application of the cross wavelet transform and wavelet coherence to geophysical time series, Nonlin. Processes Geophys., 11, 561–566, <ext-link xlink:href="https://doi.org/10.5194/npg-11-561-2004" ext-link-type="DOI">10.5194/npg-11-561-2004</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><?label 18?><mixed-citation>
Higham, C. F. and Higham, D. J.: Deep learning: An introduction for applied
mathematicians, SIAM Rev., 61, 860–891, 2019.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><?label 19?><mixed-citation>
Huang, L., Kemao, Q., Pan, B., and Asundi, A. K.: Comparison of Fourier
transform, windowed Fourier transform, and wavelet transform methods for
phase extraction from a single fringe pattern in fringe projection
profilometry, OPT. Laser. Eng., 48, 141–148, 2010.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><?label 20?><mixed-citation>
Jeong, Y. S., Jeong, M. K., and Omitaomu, O. A.: Weighted dynamic time
warping for time series classification, Pattern Recogn., 44, 2231–2240,
2011.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><?label 21?><mixed-citation>
Kaheil, Y. H., Rosero, E., Gill, M. K., McKee, M., and Bastidas, L. A.:
Downscaling and forecasting of evapotranspiration using a synthetic model of
wavelets and support vector machines, IEEE T. Geosci. Remote, 46,
2692–2707, 2008.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><?label 22?><mixed-citation>
Kamilaris, A. and Prenafeta-Boldú, F. X.: Deep learning in agriculture:
A survey, Comput. Electron. Agr., 147, 70–90, 2018.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><?label 23?><mixed-citation>
Krizhevsky, A., Sutskever, I., and Hinton, G. E.: Imagenet classification
with deep convolutional neural networks, Adv. Neur. In., 25, 1097–1105,
2012.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><?label 24?><mixed-citation>
LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521,
436–444, 2015.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><?label 25?><mixed-citation>
Litjens, G., Kooi, T., Bejnordi, B. E., Setio, A. A. A., Ciompi, F., Ghafoorian, M., van der Laak, J. A. W. M., van Ginneken, B., and Sanchez, C. I.: A survey on deep learning in medical image
analysis, Med. Image Anal., 42, 60–88, 2017.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><?label 26?><mixed-citation>Lops, Y., Choi, Y., Eslami, E., and Sayeed, A.: Real-time 7-day forecast of pollen counts using a deep convolutional neural network, Neural Comput. Appl., 32, 11827–11836, <ext-link xlink:href="https://doi.org/10.1007/s00521-019-04665-0" ext-link-type="DOI">10.1007/s00521-019-04665-0</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><?label 27?><mixed-citation>
Mallat, S. G.: A theory for multiresolution signal decomposition: the
wavelet representation, IEEE T. Pattern Anal., 7, 674–693, 1989.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><?label 28?><mixed-citation>
Pan, S., Choi, Y., Roy, A., Li, X., Jeon, W., and Souri, A. H.: Modeling the
uncertainty of several VOC and its impact on simulated VOC and ozone in
Houston, Texas, Atmos. Environ., 120, 404–416, 2015.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><?label 29?><mixed-citation>
Pan, S., Choi, Y., Jeon, W., Roy, A., Westenbarger, D. A., and Kim, H. C.:
Impact of high-resolution sea surface temperature, emission spikes and wind
on simulated surface ozone in Houston, Texas during a high ozone
episode, Atmos. Environ., 152, 362–376, 2017.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><?label 30?><mixed-citation>
Pan, S., Roy, A., Choi, Y., Eslami, E., Thomas, S., Jiang, X., and Gao, H. O.: Potential impacts of electric vehicles on air quali<?pagebreak page6251?>ty and health
endpoints in the Greater Houston Area in 2040, Atmos. Environ., 207, 38–51,
2019.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><?label 31?><mixed-citation>Sabour, S., Frosst, N., and Hinton, G. E.: Dynamic routing between
capsules, Adv. Neur. In., 3856–3866, ArXiv, available at: <uri>https://arxiv.org/pdf/1710.09829.pdf</uri> (last access: November 2020), 2017.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><?label 32?><mixed-citation>
Sayeed, A., Choi, Y., Eslami, E., Lops, Y., Roy, A., and Jung, J.: Using a Deep
Convolutional Neural Network to Predict 2017 Ozone Concentrations, 24 h
in Advance, Neural Networks, 121, 396–408, 2020.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><?label 33?><mixed-citation>Schmidhuber, J.: Deep learning in neural networks: An overview, Neural
Networks, 61, 85–117, 2015.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib32"><label>32</label><?label 34?><mixed-citation>
Tormene, P., Giorgino, T., Quaglini, S., and Stefanelli, M.: Matching
incomplete time series with dynamic time warping: an algorithm and an
application to post-stroke rehabilitation, Artif. Intell. Med., 45,
11–34, 2009.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><?label 35?><mixed-citation>
Zhang, Y., Bocquet, M., Mallet, V., Seigneur, C., and Baklanov, A.:
Real-time air quality forecasting, part I: History, techniques, and current
status, Atmos. Environ., 60, 632–655, 2012.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Using wavelet transform and dynamic time warping to identify the limitations of the CNN model as an air quality forecasting system</article-title-html>
<abstract-html><p>As the deep learning algorithm has become a popular data analysis technique,
atmospheric scientists should have a balanced perception of its strengths
and limitations so that they can provide a powerful analysis of complex data
with well-established procedures. Despite the enormous success of the
algorithm in numerous applications, certain issues related to its
applications in air quality forecasting (AQF) require further analysis and
discussion. This study addresses significant limitations of an advanced deep
learning algorithm, the convolutional neural network (CNN), in two common
applications: (i) a real-time AQF model and (ii) a post-processing tool in
a dynamical AQF model, the Community Multi-scale Air Quality Model (CMAQ).
In both cases, the CNN model shows promising accuracy for ozone prediction
24&thinsp;h in advance in both the United States of America and South Korea (with an
overall index of agreement exceeding 0.8). For the first case, we use the
wavelet transform to determine the reasons behind the poor performance of
CNN during the nighttime, cold months, and high-ozone episodes. We find that
when fine wavelet modes (hourly and daily) are relatively weak or when
coarse wavelet modes (weekly) are strong, the CNN model produces less
accurate forecasts. For the second case, we use the dynamic time warping
(DTW) distance analysis to compare post-processed results with their CMAQ
counterparts (as a base model). For CMAQ results that show a consistent DTW
distance from the observation, the post-processing approach properly
addresses the modeling bias with predicted indexes of agreement exceeding 0.85. When the DTW
distance of CMAQ versus observation is irregular, the post-processing approach
is unlikely to perform satisfactorily. Awareness of the limitations in CNN
models will enable scientists to develop more accurate regional or local air
quality forecasting systems by identifying the affecting factors in high-concentration episodes.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Aiazzi, B., Alparone, L., Baronti, S., and Garzelli, A.: Context-driven
fusion of high spatial and spectral resolution images based on oversampled
multiresolution analysis, IEEE T. Geosci. Remote, 40, 2300–2312, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Berndt, D. J. and Clifford, J.: Using dynamic time warping to find patterns
in time series, in: KDD workshop, 10, 359–370, 1994.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Camalier, L., Cox, W., and Dolwick, P.: The effects of meteorology on ozone
in urban areas and their use in assessing ozone trends, Atmos.
Environ., 41, 7127–7137, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Chen, H., Engkvist, O., Wang, Y., Olivecrona, M., and Blaschke, T.: The rise
of deep learning in drug discovery, Drug Discov. Today, 23, 1241–1250,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Choi, Y.: The impact of satellite-adjusted NO<sub>mathitx</sub> emissions on simulated NO<sub>mathitx</sub> and O<sub>3</sub> discrepancies in the urban and outflow areas of the Pacific and Lower Middle US, Atmos. Chem. Phys., 14, 675–690, <a href="https://doi.org/10.5194/acp-14-675-2014" target="_blank">https://doi.org/10.5194/acp-14-675-2014</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Choi, Y., Jeon, W., Roy,  A., Souri,  A. H., Diao, L., Pan, S., and Eslami, E.:
CMAQ Modeling Archive for Exceptional Events Analysis, Texas Commission on
Environmental Quality (TCEQ), available at:
<a href="https://www.tceq.texas.gov/assets/public/implementation/air/am/contracts/reports/pm/5821554181FY1609-20160829-uh-CMAQModelingArchiveForExceptionalEventsAnalyses.pdf" target="_blank">https://www.tceq.texas.gov/assets/public/implementation/air/am/ contracts/reports/pm/5821554181FY1609-20160829-uh-CMAQModelingArchiveForExceptionalEventsAnalyses.pdf</a> (last access: November 2020),
2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Choi, Y., Eslami, E., Sayeed, A., and Lops, Y.: CAMQ-AI: A computationally
efficient deep learning model to improve CMAQ performance over the United
States, in: 2019 AGU Fall Meeting, San Francisco, CA, December 2019, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Deng, L. and Yu, D: Deep learning: methods and applications, Found. Trends
Signal Process., 7, 197–387, 2014
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Eslami, E., Choi, Y., Lops, Y., and Sayeed, A.: A Deep Learning Driven
Improved Ensemble Approach for Hurricane Forecasting, in: 2019 ESIP Annual
Meeting, Bethesda, MD, January 2019, <a href="https://doi.org/10.6084/m9.figshare.7591775.v1" target="_blank">https://doi.org/10.6084/m9.figshare.7591775.v1</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Eslami, E., Choi, Y., Lops, Y., and Sayeed, A.: A real-time hourly ozone prediction system using deep convolutional neural network, Neural Comput. Appl., 32, 8783–8797, <a href="https://doi.org/10.1007/s00521-019-04282-x" target="_blank">https://doi.org/10.1007/s00521-019-04282-x</a>, 2020a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Eslami, E., Salman, A. K., Choi, Y., Sayeed, A., and Lops, Y.: A data ensemble approach for real-time air quality forecasting using extremely randomized trees and deep neural networks, Neural Comput. Appl., 32, 7563–7579, <a href="https://doi.org/10.1007/s00521-019-04287-6" target="_blank">https://doi.org/10.1007/s00521-019-04287-6</a>, 2020b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Foufoula-Georgiou, E. and Kumar, P. (Eds.): Wavelets in geophysics, Vol. 4, Elsevier, USA, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
Giorgino, T: Computing and visualizing dynamic time warping alignments in R:
the dtw package, J. Stat. Softw., 31, 1–24, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Goodfellow, I., Bengio, Y., and Courville, A.: Deep learning, MIT press,
USA, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Grinsted, A., Moore, J. C., and Jevrejeva, S.: Application of the cross wavelet transform and wavelet coherence to geophysical time series, Nonlin. Processes Geophys., 11, 561–566, <a href="https://doi.org/10.5194/npg-11-561-2004" target="_blank">https://doi.org/10.5194/npg-11-561-2004</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Higham, C. F. and Higham, D. J.: Deep learning: An introduction for applied
mathematicians, SIAM Rev., 61, 860–891, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Huang, L., Kemao, Q., Pan, B., and Asundi, A. K.: Comparison of Fourier
transform, windowed Fourier transform, and wavelet transform methods for
phase extraction from a single fringe pattern in fringe projection
profilometry, OPT. Laser. Eng., 48, 141–148, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Jeong, Y. S., Jeong, M. K., and Omitaomu, O. A.: Weighted dynamic time
warping for time series classification, Pattern Recogn., 44, 2231–2240,
2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
Kaheil, Y. H., Rosero, E., Gill, M. K., McKee, M., and Bastidas, L. A.:
Downscaling and forecasting of evapotranspiration using a synthetic model of
wavelets and support vector machines, IEEE T. Geosci. Remote, 46,
2692–2707, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Kamilaris, A. and Prenafeta-Boldú, F. X.: Deep learning in agriculture:
A survey, Comput. Electron. Agr., 147, 70–90, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Krizhevsky, A., Sutskever, I., and Hinton, G. E.: Imagenet classification
with deep convolutional neural networks, Adv. Neur. In., 25, 1097–1105,
2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521,
436–444, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Litjens, G., Kooi, T., Bejnordi, B. E., Setio, A. A. A., Ciompi, F., Ghafoorian, M., van der Laak, J. A. W. M., van Ginneken, B., and Sanchez, C. I.: A survey on deep learning in medical image
analysis, Med. Image Anal., 42, 60–88, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Lops, Y., Choi, Y., Eslami, E., and Sayeed, A.: Real-time 7-day forecast of pollen counts using a deep convolutional neural network, Neural Comput. Appl., 32, 11827–11836, <a href="https://doi.org/10.1007/s00521-019-04665-0" target="_blank">https://doi.org/10.1007/s00521-019-04665-0</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Mallat, S. G.: A theory for multiresolution signal decomposition: the
wavelet representation, IEEE T. Pattern Anal., 7, 674–693, 1989.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Pan, S., Choi, Y., Roy, A., Li, X., Jeon, W., and Souri, A. H.: Modeling the
uncertainty of several VOC and its impact on simulated VOC and ozone in
Houston, Texas, Atmos. Environ., 120, 404–416, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Pan, S., Choi, Y., Jeon, W., Roy, A., Westenbarger, D. A., and Kim, H. C.:
Impact of high-resolution sea surface temperature, emission spikes and wind
on simulated surface ozone in Houston, Texas during a high ozone
episode, Atmos. Environ., 152, 362–376, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
Pan, S., Roy, A., Choi, Y., Eslami, E., Thomas, S., Jiang, X., and Gao, H. O.: Potential impacts of electric vehicles on air quality and health
endpoints in the Greater Houston Area in 2040, Atmos. Environ., 207, 38–51,
2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
Sabour, S., Frosst, N., and Hinton, G. E.: Dynamic routing between
capsules, Adv. Neur. In., 3856–3866, ArXiv, available at: <a href="https://arxiv.org/pdf/1710.09829.pdf" target="_blank"/> (last access: November 2020), 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
Sayeed, A., Choi, Y., Eslami, E., Lops, Y., Roy, A., and Jung, J.: Using a Deep
Convolutional Neural Network to Predict 2017 Ozone Concentrations, 24&thinsp;h
in Advance, Neural Networks, 121, 396–408, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
Schmidhuber, J.: Deep learning in neural networks: An overview, Neural
Networks, 61, 85–117, 2015.

</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Tormene, P., Giorgino, T., Quaglini, S., and Stefanelli, M.: Matching
incomplete time series with dynamic time warping: an algorithm and an
application to post-stroke rehabilitation, Artif. Intell. Med., 45,
11–34, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Zhang, Y., Bocquet, M., Mallet, V., Seigneur, C., and Baklanov, A.:
Real-time air quality forecasting, part I: History, techniques, and current
status, Atmos. Environ., 60, 632–655, 2012.
</mixed-citation></ref-html>--></article>
