<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">GMD</journal-id>
<journal-title-group>
<journal-title>Geoscientific Model Development</journal-title>
<abbrev-journal-title abbrev-type="publisher">GMD</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Geosci. Model Dev.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1991-9603</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/gmd-6-2087-2013</article-id>
<title-group>
<article-title>Correction of approximation errors with Random Forests applied to modelling  of cloud droplet formation</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Lipponen</surname>
<given-names>A.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Kolehmainen</surname>
<given-names>V.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Romakkaniemi</surname>
<given-names>S.</given-names>
<ext-link>https://orcid.org/0000-0001-9414-3093</ext-link>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Kokkola</surname>
<given-names>H.</given-names>
<ext-link>https://orcid.org/0000-0002-1404-6670</ext-link>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Department of Applied Physics, University of Eastern Finland, P.O. Box 1627, 70211 Kuopio, Finland</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Finnish Meteorological Institute, Kuopio Unit, P.O. Box 1627, 70211 Kuopio, Finland</addr-line>
</aff>
<pub-date pub-type="epub">
<day>16</day>
<month>12</month>
<year>2013</year>
</pub-date>
<volume>6</volume>
<issue>6</issue>
<fpage>2087</fpage>
<lpage>2098</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2013 A. Lipponen et al.</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 3.0 Unported License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/3.0/">https://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://gmd.copernicus.org/articles/6/2087/2013/gmd-6-2087-2013.html">This article is available from https://gmd.copernicus.org/articles/6/2087/2013/gmd-6-2087-2013.html</self-uri>
<self-uri xlink:href="https://gmd.copernicus.org/articles/6/2087/2013/gmd-6-2087-2013.pdf">The full text article is available as a PDF file from https://gmd.copernicus.org/articles/6/2087/2013/gmd-6-2087-2013.pdf</self-uri>
<abstract>
<p>In atmospheric models, due to their computational time or resource
limitations, physical processes have to be simulated using reduced (i.e.
simplified) models. The use of a reduced model, however, induces errors to
the simulation results. These errors are referred to as approximation errors.
In this paper, we propose a novel approach to correct these approximation
errors. We model the approximation error as an additive noise process in the
simulation model and employ the Random Forest (RF) regression algorithm for
constructing a computationally low cost predictor for the approximation
error. In this way, the overall simulation problem is decomposed into two
separate and computationally efficient simulation problems: solution of the
reduced model and prediction of the approximation error realisation. The
approach is tested for handling approximation errors due to a reduced coarse
sectional representation of aerosol size distribution in a cloud droplet
formation calculation as well as for compensating the uncertainty caused by
the aerosol activation parameterization itself. The results show a
significant improvement in the accuracy of the simulation compared to the
conventional simulation with a reduced model. The proposed approach is rather
general and extension of it to different parameterizations or reduced process
models that are coupled to geoscientific models is a straightforward task.
Another major benefit of this method is that it can be applied to physical
processes that are dependent on a large number of variables making them
difficult to be parameterized by traditional methods.</p>
</abstract>
<counts><page-count count="12"/></counts>
</article-meta>
</front>
<body/>
<back>
<ref-list>
<title>References</title>
<ref id="ref1">
<label>1</label><mixed-citation publication-type="other" xlink:type="simple">Abdul-Razzak, H., Ghan, S. J., and Rivera-Carpio, C.: A parameterization of aerosol activation 1. single aerosol type, J. Geophys. Res., 103, 6123–6131, &lt;a href=&quot;http://dx.doi.org/10.1029/97JD03735&quot;&gt;https://doi.org/10.1029/97JD03735&lt;/a&gt;, 1998.</mixed-citation>
</ref>
<ref id="ref2">
<label>2</label><mixed-citation publication-type="other" xlink:type="simple">Abdul-Razzak, H. and Ghan, S. J.: A parameterization of aerosol activation 2. Multiple aerosol types, J. Geophys. Res., 105, 6837–6844, &lt;a href=&quot;http://dx.doi.org/10.1029/1999JD901161&quot;&gt;https://doi.org/10.1029/1999JD901161&lt;/a&gt;, 2000.</mixed-citation>
</ref>
<ref id="ref3">
<label>3</label><mixed-citation publication-type="other" xlink:type="simple">Abdul-Razzak, H. and Ghan, S. J.: A parameterization of aerosol activation 3. Sectional representation, J. Geophys. Res., 107, AAC 1-1–AAC 1-6, &lt;a href=&quot;http://dx.doi.org/10.1029/2001JD000483&quot;&gt;https://doi.org/10.1029/2001JD000483&lt;/a&gt;, 2002.</mixed-citation>
</ref>
<ref id="ref4">
<label>4</label><mixed-citation publication-type="other" xlink:type="simple">Arridge, S., Kaipio, J., Kolehmainen, V., Schweiger, M., Somersalo, E., Tarvainen, T., and Vauhkonen, M.: Approximation errors and model reduction with an application in optical diffusion tomography, Inverse Probl., 22, 175–195, &lt;a href=&quot;http://dx.doi.org/10.1088/0266-5611/22/1/010&quot;&gt;https://doi.org/10.1088/0266-5611/22/1/010&lt;/a&gt;, 2006.</mixed-citation>
</ref>
<ref id="ref5">
<label>5</label><mixed-citation publication-type="other" xlink:type="simple">Bechtel, B. and Daneke, C.: Classification of local climate zones based on multiple earth observation data, IEEE J. Sel. Top. Appl., 5, 1191–1202, &lt;a href=&quot;http://dx.doi.org/10.1109/JSTARS.2012.2189873&quot;&gt;https://doi.org/10.1109/JSTARS.2012.2189873&lt;/a&gt;, 2012.</mixed-citation>
</ref>
<ref id="ref6">
<label>6</label><mixed-citation publication-type="other" xlink:type="simple">Bergman, T., Kerminen, V.-M., Korhonen, H., Lehtinen, K. J., Makkonen, R., Arola, A., Mielonen, T., Romakkaniemi, S., Kulmala, M., and Kokkola, H.: Evaluation of the sectional aerosol microphysics module SALSA implementation in ECHAM5-HAM aerosol-climate model, Geosci. Model Dev., 5, 845–868, &lt;a href=&quot;http://dx.doi.org/10.5194/gmd-5-845-2012&quot;&gt;https://doi.org/10.5194/gmd-5-845-2012&lt;/a&gt;, 2012.</mixed-citation>
</ref>
<ref id="ref7">
<label>7</label><mixed-citation publication-type="other" xlink:type="simple">Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, &lt;a href=&quot;http://dx.doi.org/10.1023/A:1010933404324&quot;&gt;https://doi.org/10.1023/A:1010933404324&lt;/a&gt;, 2001.</mixed-citation>
</ref>
<ref id="ref8">
<label>8</label><mixed-citation publication-type="other" xlink:type="simple">Chen, C., Liaw, A., and Breiman, L.: Using random forests to learn imbalanced data, Technical Report 666, Statistics Department of University of California, Berkeley, USA, 2004.</mixed-citation>
</ref>
<ref id="ref9">
<label>9</label><mixed-citation publication-type="other" xlink:type="simple">Clegg, S. L., Brimblecombe, P., and Wexler, A. S.: Thermodynamical model of the system H&lt;sup&gt;+&lt;/sup&gt;-NH&lt;sub&gt;4&lt;/sub&gt;&lt;sup&gt;+&lt;/sup&gt;-SO&lt;sub&gt;4&lt;/sub&gt;&lt;sup&gt;2-&lt;/sup&gt;-NO&lt;sub&gt;3&lt;/sub&gt;&lt;sup&gt;-&lt;/sup&gt;-H&lt;sub&gt;2&lt;/sub&gt;O at tropospheric temperatures, J. Phys. Chem. A, 102, 2137–2154, 1998.</mixed-citation>
</ref>
<ref id="ref10">
<label>10</label><mixed-citation publication-type="other" xlink:type="simple">Forster, P., Ramaswamy, V., Artaxo, P., Berntsen, T., Betts, R., Fahey, D., Haywood, J., Lean, J., Lowe, D., Myhre, G., Nganga, J., Prinn, R., Raga, G., Schulz, M., and Van Dorland, R.: Changes in atmospheric constituents and in radiative forcing, in: Climate Change 2007: The Physical Science Basis, Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Solomon, S., Qin, D., Manning, M., Chen, Z., Marquis, M., Averyt, K., Tignor, M., and Miller, H., Cambridge University Press, Cambridge, UK and New York, NY, USA, 2007.</mixed-citation>
</ref>
<ref id="ref11">
<label>11</label><mixed-citation publication-type="other" xlink:type="simple">Fountoukis, C. and Nenes, A.: Continued development of a cloud formation parameterization for global climate models, J. Geophys. Res., 110, D11212, &lt;a href=&quot;http://dx.doi.org/10.1029/2004JD005591&quot;&gt;https://doi.org/10.1029/2004JD005591&lt;/a&gt;, 2005.</mixed-citation>
</ref>
<ref id="ref12">
<label>12</label><mixed-citation publication-type="other" xlink:type="simple">Haykin, S. O.: Neural networks and learning machines, Prentice Hall, New York, 2009.</mixed-citation>
</ref>
<ref id="ref13">
<label>13</label><mixed-citation publication-type="other" xlink:type="simple">Jacobson, M. Z.: GATOR-GCMM: a global through urban scale air pollution and weather forecast model. 1. Model design and treatment of subgrid soil, vegetation, roads, rooftops, water, sea ice, and snow, J. Geophys. Res., 106, 5385–5402, &lt;a href=&quot;http://dx.doi.org/10.1029/2000JD900560&quot;&gt;https://doi.org/10.1029/2000JD900560&lt;/a&gt;, 2001.</mixed-citation>
</ref>
<ref id="ref14">
<label>14</label><mixed-citation publication-type="other" xlink:type="simple">Kaipio, J. and Somersalo, E.: Statistical and Computational Inverse Problems, Springer, New York, 2005.</mixed-citation>
</ref>
<ref id="ref15">
<label>15</label><mixed-citation publication-type="other" xlink:type="simple">Kokkola, H., Romakkaniemi, S., Kulmala, M., and Laaksonen, A.: A cloud microphysics model including trace gas condensation and sulfate chemistry, Boreal Environ. Res., 8, 413–424,2003.</mixed-citation>
</ref>
<ref id="ref16">
<label>16</label><mixed-citation publication-type="other" xlink:type="simple">Kokkola, H., Korhonen, H., Lehtinen, K. E. J., Makkonen, R., Asmi, A., Järvenoja, S., Anttila, T., Partanen, A.-I., Kulmala, M., Järvinen, H., Laaksonen, A., and Kerminen, V.-M.: SALSA – a Sectional Aerosol module for Large Scale Applications, Atmos. Chem. Phys., 8, 2469–2483, &lt;a href=&quot;http://dx.doi.org/10.5194/acp-8-2469-2008&quot;&gt;https://doi.org/10.5194/acp-8-2469-2008&lt;/a&gt;, 2008.</mixed-citation>
</ref>
<ref id="ref17">
<label>17</label><mixed-citation publication-type="other" xlink:type="simple">Kolehmainen, V., Schweiger, M., Nissilä, I., Tarvainen, T., Arridge, S., and Kaipio, J.: Approximation errors and model reduction in three-dimensional diffuse optical tomography, J. Opt. Soc. Am. A, 10, 2257–2267, &lt;a href=&quot;http://dx.doi.org/10.1364/JOSAA.26.002257&quot;&gt;https://doi.org/10.1364/JOSAA.26.002257&lt;/a&gt;, 2009.</mixed-citation>
</ref>
<ref id="ref18">
<label>18</label><mixed-citation publication-type="other" xlink:type="simple">Kolehmainen, V., Tarvainen, T., Arridge, S., and Kaipio, J.: Marginalization of uninteresting distributed parameters in inverse problems – application to diffuse optical tomography, International Journal for Uncertainty Quantification, 1, 1–17, &lt;a href=&quot;http://dx.doi.org/10.1615/Int.J.UncertaintyQuantification.v1.i1.10&quot;&gt;https://doi.org/10.1615/Int.J.UncertaintyQuantification.v1.i1.10&lt;/a&gt;, 2011.</mixed-citation>
</ref>
<ref id="ref19">
<label>19</label><mixed-citation publication-type="other" xlink:type="simple">Lehikoinen, A., Finsterle, S., Voutilainen, A., Heikkinen, L., Vauhkonen, M., and Kaipio, J.: Approximation errors and truncation of computational domains with application to geophysical tomography, Inverse Probl. Imag., 1, 371–389, &lt;a href=&quot;http://dx.doi.org/10.3934/ipi.2007.1.371&quot;&gt;https://doi.org/10.3934/ipi.2007.1.371&lt;/a&gt;, 2007.</mixed-citation>
</ref>
<ref id="ref20">
<label>20</label><mixed-citation publication-type="other" xlink:type="simple">Liu, X., Easter, R. C., Ghan, S. J., Zaveri, R., Rasch, P., Shi, X., Lamarque, J.-F., Gettelman, A., Morrison, H., Vitt, F., Conley, A., Park, S., Neale, R., Hannay, C., Ekman, A. M. L., Hess, P., Mahowald, N., Collins, W., Iacono, M. J., Bretherton, C. S., Flanner, M. G., and Mitchell, D.: Toward a minimal representation of aerosols in climate models: description and evaluation in the Community Atmosphere Model CAM5, Geosci. Model Dev., 5, 709–739, &lt;a href=&quot;http://dx.doi.org/10.5194/gmd-5-709-2012&quot;&gt;https://doi.org/10.5194/gmd-5-709-2012&lt;/a&gt;, 2012.</mixed-citation>
</ref>
<ref id="ref21">
<label>21</label><mixed-citation publication-type="other" xlink:type="simple">Munro, N. P., Cairns, D. A., Clarke, P., Rogers, M., Stanley, A. J., Barrett, J. H., Harnden, P., Thompson, D., Eardley, I., Banks, R. E., and Knowles, M. A.: Urinary biomarker profiling in transitional cell carcinoma, Int. J. Cancer, 119, 2642–2650, 2006.</mixed-citation>
</ref>
<ref id="ref22">
<label>22</label><mixed-citation publication-type="other" xlink:type="simple">Nenes, A. and Seinfeld, J.: Parameterization of cloud dropletformation in global climate models, J. Geophys. Res., 108, 4415, &lt;a href=&quot;http://dx.doi.org/10.1029/2002JD002911&quot;&gt;https://doi.org/10.1029/2002JD002911&lt;/a&gt;, 2003.</mixed-citation>
</ref>
<ref id="ref23">
<label>23</label><mixed-citation publication-type="other" xlink:type="simple">Nissinen, A., Heikkinen, L., Kolehmainen, V., and Kaipio, J.: Compensation of errors due to discretization, domain truncation and unknown contact impedances in electrical impedance tomography, Meas. Sci. Technol., 20, 105504, &lt;a href=&quot;http://dx.doi.org/10.1088/0957-0233/20/10/105504&quot;&gt;https://doi.org/10.1088/0957-0233/20/10/105504&lt;/a&gt;, 2009.</mixed-citation>
</ref>
<ref id="ref24">
<label>24</label><mixed-citation publication-type="other" xlink:type="simple">Nissinen, A., Kolehmainen, V., and Kaipio, J.: Compensation of modelling errors due to unknown domain boundary in electrical impedance tomography, IEEE T. Med. Imaging, 30, 231–242, 2011.</mixed-citation>
</ref>
<ref id="ref25">
<label>25</label><mixed-citation publication-type="other" xlink:type="simple">Pal, M.: Random Forest classifier for remote sensing classification, Int. J. Remote Sens., 26, 217–222, &lt;a href=&quot;http://dx.doi.org/10.1080/01431160412331269698&quot;&gt;https://doi.org/10.1080/01431160412331269698&lt;/a&gt;, 2005.</mixed-citation>
</ref>
<ref id="ref26">
<label>26</label><mixed-citation publication-type="other" xlink:type="simple">Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O, Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011.</mixed-citation>
</ref>
<ref id="ref27">
<label>27</label><mixed-citation publication-type="other" xlink:type="simple">Rodriguez, M. and Dabdub, D. J.: IMAGES-SCAPE2: A modeling study of size and chemically resolved aerosol thermodynamics in a global chemical transport model, J. Geophys. Res., 109, D02203, &lt;a href=&quot;http://dx.doi.org/10.1029/2003JD003639&quot;&gt;https://doi.org/10.1029/2003JD003639&lt;/a&gt;, 2004.</mixed-citation>
</ref>
<ref id="ref28">
<label>28</label><mixed-citation publication-type="other" xlink:type="simple">Rojas, R.: Neural Networks: A Systematic Introduction, Springer-Verlag, Berlin, 1996.</mixed-citation>
</ref>
<ref id="ref29">
<label>29</label><mixed-citation publication-type="other" xlink:type="simple">Romakkaniemi, S., Kokkola, H., and Laaksonen, A.: Parameterization of the nitric acid effect on CCN activation, Atmos. Chem. Phys., 5, 879–885, &lt;a href=&quot;http://dx.doi.org/10.5194/acp-5-879-2005&quot;&gt;https://doi.org/10.5194/acp-5-879-2005&lt;/a&gt;, 2005.</mixed-citation>
</ref>
<ref id="ref30">
<label>30</label><mixed-citation publication-type="other" xlink:type="simple">Romakkaniemi, S., Arola, A., Kokkola, H., Birmili, W., Tuch, T., Kerminen, V.-M., R\&quot;isänen, P., Smith, J. N., Korhonen, H., and Laaksonen, A.: Effect of aerosol size distribution changes on AOD, CCN and cloud droplet concentration: Case studies from Erfurt and Melpitz, Germany, J. Geophys. Res., in press, &lt;a href=&quot;http://dx.doi.org/10.1029/2011JD017091&quot;&gt;https://doi.org/10.1029/2011JD017091&lt;/a&gt;, 2012.</mixed-citation>
</ref>
<ref id="ref31">
<label>31</label><mixed-citation publication-type="other" xlink:type="simple">Sorjamaa, R., Svenningsson, B., Raatikainen, T., Henning, S., Bilde, M., and Laaksonen, A.: The role of surfactants in Köhler theory reconsidered, Atmos. Chem. Phys., 4, 2107–2117, &lt;a href=&quot;http://dx.doi.org/10.5194/acp-4-2107-2004&quot;&gt;https://doi.org/10.5194/acp-4-2107-2004&lt;/a&gt;, 2004.</mixed-citation>
</ref>
<ref id="ref32">
<label>32</label><mixed-citation publication-type="other" xlink:type="simple">Tesfamariam, S. and Liu, Z.: Earthquake induced damage classification for reinforced concrete buildings, Struct. Saf., 32, 154–164, &lt;a href=&quot;http://dx.doi.org/10.1016/j.strusafe.2009.10.002&quot;&gt;https://doi.org/10.1016/j.strusafe.2009.10.002&lt;/a&gt;, 2010.</mixed-citation>
</ref>
<ref id="ref33">
<label>33</label><mixed-citation publication-type="other" xlink:type="simple">Weisenstein, D. K., Penner, J. E., Herzog, M., and Liu, X.: Global 2-D intercomparison of sectional and modal aerosol modules, Atmos. Chem. Phys., 7, 2339–2355, &lt;a href=&quot;http://dx.doi.org/10.5194/acp-7-2339-2007&quot;&gt;https://doi.org/10.5194/acp-7-2339-2007&lt;/a&gt;, 2007.</mixed-citation>
</ref>
<ref id="ref34">
<label>34</label><mixed-citation publication-type="other" xlink:type="simple">Yao, D., Yang, J., and Zhan, X.: A novel method for disease prediction: hybrid of Random Forest and multivariate adaptive regression splines, J. Computers, 8, 170–177, &lt;a href=&quot;http://dx.doi.org/10.4304/jcp.8.1.170-177&quot;&gt;https://doi.org/10.4304/jcp.8.1.170-177&lt;/a&gt;, 2013.</mixed-citation>
</ref>
</ref-list>
</back>
</article>