Articles | Volume 14, issue 1
Methods for assessment of models
06 Jan 2021

Ground-based lidar processing and simulator framework for comparing models and observations (ALCF 1.0)

Peter Kuma, Adrian J. McDonald, Olaf Morgenstern, Richard Querel, Israel Silber, and Connor J. Flynn

Automatic lidars and ceilometers (ALCs) provide valuable information on cloud and aerosols but have not been systematically used in the evaluation of general circulation models (GCMs) and numerical weather prediction (NWP) models. Obstacles associated with the diversity of instruments, a lack of standardisation of data products and open processing tools mean that the value of large ALC networks worldwide is not being realised. We discuss a tool, called the Automatic Lidar and Ceilometer Framework (ALCF), that overcomes these problems and also includes a ground-based lidar simulator, which calculates the radiative transfer of laser radiation and allows one-to-one comparison with models. Our ground-based lidar simulator is based on the Cloud Feedback Model Intercomparison Project (CFMIP) Observation Simulator Package (COSP), which has been extensively used for spaceborne lidar intercomparisons. The ALCF implements all steps needed to transform and calibrate raw ALC data and create simulated attenuated volume backscattering coefficient profiles for one-to-one comparison and complete statistical analysis of clouds. The framework supports multiple common commercial ALCs (Vaisala CL31, CL51, Lufft CHM 15k and Droplet Measurement Technologies MiniMPL), reanalyses (JRA-55, ERA5 and MERRA-2) and models (the Unified Model and AMPS – the Antarctic Mesoscale Prediction System). To demonstrate its capabilities, we present case studies evaluating cloud in the supported reanalyses and models using CL31, CL51, CHM 15k and MiniMPL observations at three sites in New Zealand. We show that the reanalyses and models generally underestimate cloud fraction. If sufficiently high-temporal-resolution model output is available (better than 6-hourly), a direct comparison of individual clouds is also possible. 
We demonstrate that the ALCF can be used as a generic evaluation tool to examine cloud occurrence and cloud properties in reanalyses, NWP models, and GCMs, potentially utilising the large amounts of ALC data already available. This tool is likely to be particularly useful for the analysis and improvement of low-level cloud simulations which are not well monitored from space. This has previously been identified as a critical deficiency in contemporary models, limiting the accuracy of weather forecasts and future climate projections. While the current focus of the framework is on clouds, support for aerosol in the lidar simulator is planned in the future.

1 Introduction

Automatic lidars and ceilometers (ALCs) are active ground-based instruments which emit laser pulses in the ultraviolet, visible or infrared (IR) part of the electromagnetic spectrum and measure radiation backscattered from atmospheric constituents such as cloud and fog liquid droplets as well as ice crystals, haze, aerosol and atmospheric gases (Emeis2010). Vertical profiles of attenuated backscattered radiation can be produced by measuring received power as a function of time elapsed between emitting the pulse and receiving the backscattered radiation. Quantities such as cloud-base height (CBH) and a cloud mask (Pal et al.1992; Wang and Sassen2001; Martucci et al.2010; Costa-Surós et al.2013; Van Tricht et al.2014; Liu et al.2015a, b; Lewis et al.2016; Cromwell and Flynn2018; Silber et al.2018), the particle volume backscattering coefficient (Marenco et al.1997; Welton et al.2000, 2002; Wiegner and Geiß2012; Wiegner et al.2014; Jin et al.2015; Dionisi et al.2018), and boundary layer height (Eresmaa et al.2006; Münkel et al.2007; Emeis et al.2009; Tsaknakis et al.2011; Milroy et al.2012; Knepp et al.2017) can be derived from the attenuated volume backscattering coefficient profile. Lidars equipped with polarisation or multiple wavelengths can also provide the depolarisation ratio or colour ratio, respectively, which can be used to infer cloud phase or particle types. Doppler lidars can measure wind speed in the direction of the lidar orientation. ALCs are commonly deployed at airports, where they provide CBH, fog and aerosol observations needed for air traffic control. Large networks of up to hundreds of lidars and ceilometers have been deployed worldwide: Cloudnet (Illingworth et al.2007), E-PROFILE (Illingworth et al.2018), PollyNET (Baars et al.2016), ICENET (Cazorla et al.2017), MPLNET (Welton et al.2006) and ARM (Stokes and Schwartz1994; Campbell et al.2002). 
The purpose of these networks is to observe cloud, fog, aerosol, air quality, visibility and volcanic ash, provide input to numerical weather prediction (NWP) model evaluation (Hogan et al.2001; Illingworth et al.2007; Morcrette et al.2012; Warren et al.2018; Lamer et al.2018; Hansen et al.2018b) and assimilation (Illingworth et al.2015b, 2018), and for climate studies. These networks are usually composed of multiple types of ALCs, with Vaisala CL31, CL51, Lufft (formerly Jenoptik) CHM 15k and Droplet Measurement Technologies (formerly Sigma Space and Hexagon) MiniMPL being the most common. Complex lidar data processing has been set up on some of these networks. Notably, at the SIRTA site in France, a lidar ratio (LR) comparable with a lidar simulator (Chiriaco et al.2018) is calculated as part of the “ReOBS” processing method. Intercomparison and calibration campaigns such as CeiLinEx2015 (Mattis et al.2016) and INTERACT-I(-II) (Rosoldi et al.2018; Madonna et al.2018) have been performed. Lidar data processing involves a number of tasks such as re-sampling, calibration, noise removal and cloud detection. Some of these are implemented in the instrument firmware of ALCs. This, however, means that the lidar attenuated volume backscattering coefficient and detected cloud and cloud base are not comparable between different instruments. In most cases the algorithms are not publicly documented, making it impossible to compare the data with values from a model or a lidar simulator without a systematic bias.

Atmospheric model evaluation is an ongoing task and a critical part of the model improvement process (Eyring et al.2019; Hourdin et al.2017; Schmidt et al.2017). Traditionally, various types of observational and model datasets have been utilised – weather and climate station data, upper-air soundings, ground-based and satellite remote sensing datasets, and high-resolution model simulations, amongst others. Clouds are one of the most problematic phenomena in atmospheric models due to their transient nature, high spatial and temporal variability, and sensitivity to a complex combination of conditions such as relative humidity, aerosols (presence of cloud condensation nuclei and ice nuclei), and thermodynamic and dynamic conditions. At the same time, clouds have a very substantial effect on the atmospheric shortwave and longwave radiation balance, and any cloud misrepresentation has a strong effect on other components of the model, limiting the ability to accurately represent past and present climate and predict future climate (Zadra et al.2018). An improved understanding of clouds and cloud feedbacks is one of the focuses of the Coupled Model Intercomparison Project Phase 6 (CMIP6) (Eyring et al.2016), and comparison of model cloud with observations is one of the key points of the Cloud Feedback Model Intercomparison Project (CFMIP) (Webb et al.2017). Satellite observations make up the majority of the data used to evaluate model clouds. These include the following: passive visible and IR low-earth-orbit and geostationary radiometers measuring, among others, features such as cloud cover, cloud-top height (CTH) and cloud-top temperature; passive microwave instruments measuring total column water; and active radars and lidars measuring cloud vertical profiles. Ground-based remote sensing instruments include radars, lidars, ceilometers, radiometers and sky cameras. 
As pointed out by Williams and Bodas-Salcedo (2017), using a wide range of different observational datasets including satellite and ground-based observations for general circulation model (GCM) evaluation is important due to the limitations of each dataset.

Model cloud is commonly represented by the mixing ratios of liquid and ice and the cloud fraction (CF) on every model grid cell and vertical level. In addition, some models provide the cloud droplet effective radius used in radiative transfer calculations. Remote sensing observations do not match the representation of the atmospheric model fields directly because of their different resolutions, limited field of view (FOV) and attenuation by atmospheric constituents before reaching the instrument's receiver. Instrument simulators bridge this gap by converting the model fields to quantities which emulate those measured by the instrument, which can then be compared directly with observations. One such collection of instrument simulators is the CFMIP Observation Simulator Package (COSP) (Bodas-Salcedo et al., 2011; Swales et al., 2018), which has been used for more than a decade for the evaluation of models using satellite, and more recently ground-based, observations. The simulators in COSP include the following: active instruments (spaceborne and ground-based radars) such as the Cloud Profiling Radar (CPR) on CloudSat (Stephens et al., 2002) and the Ka-band ARM Zenith Radar (KAZR); lidars such as the Cloud–Aerosol Lidar with Orthogonal Polarization (CALIOP) on CALIPSO (Winker et al., 2009), the Cloud–Aerosol Transport System (CATS) on the ISS (McGill et al., 2015) and the Atmospheric Lidar (ATLID) on EarthCARE (Illingworth et al., 2015a); and spaceborne passive instruments such as ISCCP (Rossow and Schiffer, 1991), MODIS (Parkinson, 2003) and MISR (Diner et al., 1998). The more recent addition of ground-based radar (Zhang et al., 2018) and lidar (Chiriaco et al., 2018; Bastin et al., 2018) opens up new possibilities to use the large amount of remote sensing data obtained from ground-based active remote sensing instruments. In practice, ground-based observational remote sensing data are not straightforward to use without a substantial amount of additional processing. 
Some previous studies have also compared models and ground-based radar and lidar observations without the use of an instrument simulator (Bouniol et al.2010; Hansen et al.2018a), though for the reasons identified above this is not advisable.

In this study we introduce a software package called the Automatic Lidar and Ceilometer Framework (ALCF) for evaluating model cloud using ALC observations. It extends and integrates the COSP lidar simulator (Chiriaco et al.2006; Chepfer et al.2007, 2008) with pre- and post-processing steps and allows the simulator to be run offline on model output instead of having to be integrated inside the model. This makes it possible to compare ALC data at any location without having to run the model with a specific configuration. Multiple ALCs, reanalyses and model output formats are supported. The original COSP lidar simulator was extended with Rayleigh, Mie and ice crystal scattering at multiple lidar wavelengths. Observational ALC data from a number of common instruments can be processed by re-sampling to a common resolution, removing noise, detecting cloud and calculating statistics. The same steps can be performed on the simulated lidar data from the model (the output of running COSP on the model data), allowing for one-to-one comparison of model and observations. A particular focus of our work was on applying the same processing steps to the observed and simulated attenuated volume backscattering coefficient in order to avoid biases. The ALCF is made available under an open-source licence (MIT) at (last access: January 2021) and as a permanent archive of code and technical documentation on Zenodo at

A relatively small amount of other open source code is available for ALC data processing. A lidar simulator has been developed as part of the Goddard Satellite Data Simulator Unit (G-SDSU) (Matsui2019), a package based on the instrument simulator package SDSU (Masunaga et al.2010). The Community Intercomparison Suite (CIS) (Watson-Parris et al.2016) allows for subsetting, aggregation, co-location and plotting of mostly satellite data with a focus on model–observation intercomparison. The STRAT lidar data processing tools are a collection of tools for conversion of raw ALC data, visualisation and feature classification (Morille et al.2007).

Here, we provide an overview of the ALCF (Sect. 2) and describe the supported ALCs, reanalyses and models (Sect. 3), the lidar simulator (Sect. 4), and the observed and simulated lidar data processing steps (Sect. 5). Later, we present a set of case studies at three sites in New Zealand (NZ) (Sect. 6) to demonstrate the value of this new tool. Lastly, we present the results of the case studies in Sect. 7.

2 Overview of operation of the Automatic Lidar and Ceilometer Framework (ALCF 1.0)

The ALCF performs the necessary steps to simulate the ALC attenuated volume backscattering coefficient based on four-dimensional atmospheric fields from reanalyses, NWP models and GCMs, as well as to transform the observed raw ALC attenuated volume backscattering coefficient profiles to profiles comparable with the simulated profiles. It does so by extracting two-dimensional (time  ×  height) profiles from the model data, performing radiative transfer calculations based on a modified COSP lidar simulator (Sect. 4), absolute calibration and re-sampling of the observed attenuated volume backscattering coefficient to a common resolution, and performing comparable cloud detection on the simulated and observed attenuated volume backscattering coefficient. The framework supports multiple common ALCs (Sect. 3.1), reanalyses and models (Sect. 3.2). The schematic in Fig. 1 illustrates this process as well as the ALCF commands which perform the individual steps. The following commands are implemented: model, simulate, lidar, stats and plot. The commands are normally executed in a sequence, which is also implemented by a meta-command auto that is equivalent to executing a sequence of commands. The commands are described in detail in the technical documentation available online at (last access: 1 January 2021), on Zenodo at and in the Supplement. The physical basis is described here.

Figure 1. (a) Scheme showing the operation of the ALCF and (b) the processing commands.


The model command extracts two-dimensional profiles of cloud liquid and ice content (and other thermodynamic fields) from the supported NWP model, GCM and reanalysis data (model data in Fig. 1) at a geographical point, or along a ship track or a flight path. The resulting profiles are recorded as NetCDF files. Section 3.2 describes the supported reanalyses and models. The model data can either be in one of the supported model output formats, or a new module for reading arbitrary model output can be written provided that the required atmospheric fields are present in the model output. The required model fields are per-level specific cloud liquid water content, specific cloud ice water content, cloud fraction, geopotential height, temperature, surface-level pressure and orography. No physical calculations are performed by this command. The atmospheric profiles are extracted by a nearest-neighbour selection.
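The nearest-neighbour selection can be illustrated with a minimal Python sketch. It is not taken from the ALCF code: the function names, the use of the haversine distance and the toy 1.25° grid are assumptions made for illustration only, and the actual implementation may differ.

```python
import math

def great_circle_km(lon1, lat1, lon2, lat2, radius_km=6371.0):
    """Great-circle distance between two points given in degrees (haversine formula)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))

def nearest_grid_index(lons, lats, lon, lat):
    """Return indices (i, j) of the grid cell centre closest to (lon, lat).

    lons, lats: 1-D lists of cell-centre coordinates of a regular grid (hypothetical).
    """
    best = None
    for i, glon in enumerate(lons):
        for j, glat in enumerate(lats):
            d = great_circle_km(lon, lat, glon, glat)
            if best is None or d < best[0]:
                best = (d, i, j)
    return best[1], best[2]

# Toy 1.25-degree grid around New Zealand (illustrative values only).
lons = [170.0 + 1.25 * i for i in range(5)]  # 170.0 ... 175.0
lats = [-45.0 + 1.25 * j for j in range(5)]  # -45.0 ... -40.0
i, j = nearest_grid_index(lons, lats, 172.6, -43.5)
print(lons[i], lats[j])
```

For a ship track or flight path, the same lookup would be repeated for each (time, longitude, latitude) point of the track, again without any interpolation or physical calculation.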

The simulate command runs the lidar simulator described in Sect. 4 on the extracted model data (the output of the model command) and produces simulated attenuated volume backscattering coefficient profiles. This command runs the COSP-derived lidar simulator, which performs radiative transfer calculations of the laser radiation through the atmosphere. The resulting simulated attenuated volume backscattering coefficient profiles are the output of this command.

The lidar command applies various processing algorithms to either the simulated attenuated volume backscattering coefficient (the output of the simulate command) or the observed ALC attenuated volume backscattering coefficient (lidar data in Fig. 1) (Sect. 5). The data are re-sampled to increase the signal-to-noise ratio (SNR), noise is subtracted, LR is calculated, a cloud mask is calculated by applying a cloud detection algorithm and CBH is determined from the cloud mask. Absolute calibration (Sect. 5.2) can also be applied in this step by multiplying the observed attenuated volume backscattering coefficient by a calibration coefficient. This is important in order to obtain unbiased attenuated volume backscattering coefficient profiles comparable with the simulated profiles. Section 3.1 describes the supported instruments. The lidar data can be in one of the supported instrument formats. If the native instrument format is not NetCDF, it has to be converted from the native format with the auxiliary command convert or one of the conversion programmes: cl2nc (Vaisala CL31, CL51), mpl2nc or SigmaMPL (Sigma Space MiniMPL).
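The order of these processing steps can be sketched in a few lines of Python. This is a simplified illustration, not the ALCF implementation: the function names, the calibration coefficient of 1.2, the noise estimate taken from the two highest range gates and the detection threshold of 2×10⁻⁶ m⁻¹ sr⁻¹ are all assumptions chosen for the example.

```python
def resample_time(profiles, n):
    """Average every n consecutive profiles (lists of equal length) to raise the SNR."""
    out = []
    for k in range(0, len(profiles) - n + 1, n):
        chunk = profiles[k:k + n]
        out.append([sum(col) / n for col in zip(*chunk)])
    return out

def subtract_noise(profile, n_top=2):
    """Subtract the mean of the highest n_top range gates, assumed to be signal-free."""
    noise = sum(profile[-n_top:]) / n_top
    return [b - noise for b in profile]

def cloud_mask(profile, threshold):
    """Flag range gates whose backscatter exceeds a fixed threshold."""
    return [b > threshold for b in profile]

def cloud_base_height(mask, heights):
    """Height of the lowest cloudy range gate, or None if the profile is clear."""
    for cloudy, z in zip(mask, heights):
        if cloudy:
            return z
    return None

heights = [100, 200, 300, 400, 500, 600]  # m
raw = [  # uncalibrated attenuated backscatter (m^-1 sr^-1), two consecutive profiles
    [1e-6, 1e-6, 3e-5, 5e-5, 1e-6, 1e-6],
    [1e-6, 1e-6, 5e-5, 3e-5, 1e-6, 1e-6],
]
c = 1.2  # hypothetical calibration coefficient
calibrated = [[c * b for b in p] for p in raw]
averaged = resample_time(calibrated, 2)[0]
denoised = subtract_noise(averaged)
mask = cloud_mask(denoised, threshold=2e-6)  # threshold value is an assumption
cbh = cloud_base_height(mask, heights)
print(mask, cbh)
```

Applying the same chain to both the observed and the simulated profiles is what keeps the resulting cloud masks comparable.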

The stats command calculates summary statistics from the output of the lidar command. These include CF, cloud occurrence by height, attenuated volume backscattering coefficient histograms, and the averages of LR and the backscattering coefficient.
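Two of these statistics can be sketched directly from a cloud mask. The example below is illustrative only: the function names are hypothetical, and defining CF as the fraction of profiles with cloud detected at any height is an assumption of this sketch rather than a statement of how the stats command defines it.

```python
def cloud_occurrence_by_height(masks):
    """Fraction of profiles that are cloudy at each height level."""
    n = len(masks)
    return [sum(col) / n for col in zip(*masks)]

def cloud_fraction(masks):
    """Fraction of profiles with cloud at any level (assumed definition)."""
    return sum(1 for m in masks if any(m)) / len(masks)

# Four profiles (rows) on three height levels (columns), True = cloud detected.
masks = [
    [False, True, False],
    [False, True, True],
    [False, False, False],
    [False, False, False],
]
occurrence = cloud_occurrence_by_height(masks)
cf = cloud_fraction(masks)
print(occurrence, cf)
```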

The plot command plots attenuated volume backscattering coefficient profiles produced by the lidar command (Figs. 4, 5, 6) and the statistics produced by the stats command: cloud occurrence (Fig. 3), attenuated volume backscattering coefficient histograms (Fig. 7) and attenuated volume backscattering coefficient noise standard deviation histograms (Fig. 9).

3 Supported input data: instruments, reanalyses and models

3.1 Instruments

Table 1. ALCs and their technical parameters. Power is calculated as pulse energy × pulse repetition frequency (PRF).

1 Sampling rate. 2 Vertical (range) resolution. 3 Depolarisation. 4 Pulse energy. 5 Maximum range. 6 Range of full overlap. 7 Receiver field of view. 8 Hopkin et al. (2019). 9 Madonna et al. (2018).


The primary focus of the framework is to support common commercial ALCs. Ceilometers are considered the most basic type of lidar (Emeis2010; Kotthaus et al.2016) intended as commercial products designed for unattended operation. They are used routinely to measure CBH, but most instruments also provide the full vertical profiles of the attenuated volume backscattering coefficient. Therefore, they are suitable for model evaluation by comparing not only CBH, but also cloud occurrence as a function of height. Their compact size and low cost make it possible to deploy a large number of these instruments in different locations or use them in unusual settings such as mounted on ships (Klekociuk et al.2019; Kuma et al.2020). Common off-the-shelf ceilometers are the Lufft CHM 15k and the Vaisala CL31 and CL51. Some lidars offer higher power and therefore higher SNR, as well as capabilities not present in ceilometers such as dual polarisation, multiple wavelengths, Doppler shift measurement and Raman scattering. Below we describe ALCs supported by the framework and used in our case studies: Lufft CHM 15k, Vaisala CL31 and CL51 and Droplet Measurement Technologies MiniMPL. Table 1 lists selected parameters of the supported ALCs.

The Lufft CHM 15k (previously Jenoptik CHM 15k) is a ceilometer operating at a wavelength of 1064 nm (near IR). The maximum range of the instrument is 15.4 km, with a vertical sampling resolution of 5 m in the first 150 m and 15 m above, and a sampling rate of 2 s. The total number of vertical levels is 1024. The wavelength in the near-IR spectrum ensures low molecular backscattering. The instrument produces NetCDF files containing uncalibrated attenuated volume backscattering coefficient profiles and various derived variables, although the calibration coefficient is relatively consistent across different instruments of the same model (Hopkin et al., 2019, Fig. 13).

The Vaisala CL31 and CL51 are ceilometers operating at a wavelength of 910 nm (near IR). The maximum range of the CL31 and CL51 is 7.7 and 15.4 km, and the sampling rate is 2 and 6 s, respectively. The vertical resolution is 10 m. The total number of vertical levels is 770 and 1540, respectively. The wavelength is characterised by relatively low molecular backscattering (but higher than 1064 nm) and is affected by water vapour absorption (Wiegner and Gasteiger2015; Wiegner et al.2019), which can cause additional absorption of about 20 % in the mid-latitudes and 50 % in the tropics (see also Sect. 5.4). The instruments produce data files containing uncalibrated attenuated volume backscattering coefficients which can be converted to NetCDF (see cl2nc in the “Code and data availability” section). The firmware configuration option “noise_h2 off” results in a backscatter range correction being selectively applied under a certain critical range and above this range only if cloud is present (Kotthaus et al.2016, Sect. 3.2). This was the case with our case study dataset (Sect. 6). We apply a range correction to the uncorrected range gates during lidar data processing. The critical range in CL51 is not documented but was determined as 6000 m based on an observed discontinuity.

The Droplet Measurement Technologies Mini Micro Pulse Lidar (MiniMPL) (previously Sigma Space MiniMPL and Hexagon MiniMPL) (Spinhirne, 1993; Campbell et al., 2002; Flynn et al., 2007) is a dual-polarisation micro-pulse lidar (meaning that it uses a high pulse repetition frequency (PRF) and low pulse power) operating at a wavelength of 532 nm (green in the visible spectrum). The maximum range of the instrument is 30 km. The vertical resolution is 5–75 m and the sampling rate is 1 s. The shorter wavelength is affected by stronger molecular backscattering than 910 and 1064 nm. The instrument can be housed in an enclosure with a scanning head to provide configurable scanning by elevation angle and azimuth. The instrument produces data files containing raw attenuated volume backscattering coefficients which can be converted to NetCDF containing normalised relative backscatter (NRB) with the vendor-provided tool SigmaMPL (see also mpl2nc in the “Code and data availability” section).

3.2 Reanalyses and models

Below we briefly describe the reanalyses and models1 used in the case studies presented here (Sect. 6). We used publicly available output from three reanalyses and one NWP model. In addition, we performed nudged GCM simulations with high-temporal-resolution output with the Unified Model (UM). Table 2 lists some of the main properties of the reanalyses and models.

Table 2. Reanalyses and models used in the case studies and some of their main properties. The temporal and horizontal grid resolution and vertical levels listed indicate the resolution of the model output available. The horizontal grid resolution is determined at 45° S. The internal resolution of the model may be different (see Sect. 3.2 for details). The reanalyses and the UM use regular longitude–latitude grids, while the AMPS horizontal grid is regular in the South Pole stereographic projection.


The Antarctic Mesoscale Prediction System (AMPS) (Powers et al.2003) is a limited-area NWP model based on the polar fifth-generation Pennsylvania State University–National Center for Atmospheric Research Mesoscale Model (Polar MM5), now known as the Polar Weather Research and Forecasting (WRF) model (Hines and Bromwich2008). The model serves operational and scientific needs in Antarctica, but its largest grid also covers the South Island of NZ. AMPS forecasts are publicly available on the Earth System Grid (Williams et al.2009). The forecasts are produced on several domains. The largest domain D01 used in the presented analysis covers NZ and has horizontal grid spacing of approximately 21 km over NZ. The model uses 60 vertical levels. The model output is available in 3-hourly intervals initialised at 00:00 and 12:00 UTC. The initial and boundary conditions are based on the Global Forecasting System (GFS) global NWP model. AMPS assimilates local Antarctic observations from human-operated stations, automatic weather stations (AWS), upper-air stations and satellites.

ERA5 (ECMWF, 2019) is a reanalysis produced by the European Centre for Medium-Range Weather Forecasts (ECMWF) currently available for the time period 1979 to the present, with a plan to extend the time period to 1950. The reanalysis is based on the global NWP model Integrated Forecast System (IFS) version CY41R2. It uses a 4D-Var assimilation of station, satellite, radiosonde, radar, aircraft, ship-based and buoy data. The model has 137 vertical levels. Atmospheric fields are interpolated from a horizontal resolution equivalent to 31 km with 137 model levels to a regular longitude–latitude grid of 0.25° and 37 pressure levels, all of which is made available to end users. In this analysis we use the hourly data on pressure and surface levels.

The Japanese 55-year reanalysis (JRA-55) (Ebita et al., 2011; Kobayashi et al., 2015; Harada et al., 2016) is a global reanalysis produced by the Japan Meteorological Agency (JMA) and the Central Research Institute of Electric Power Industry (CRIEPI) based on the JMA Global Spectral Model (GSM). The reanalysis is available from 1958 onward. The reanalysis is based on the JMA operational assimilation system. JRA-55 uses a 4D-Var assimilation of surface, upper-air, satellite, ship-based and aircraft observations. The model uses 60 vertical levels and a horizontal grid with a resolution of approximately 60 km. In this analysis we use the 1.25° isobaric analysis and forecast fields interpolated to 37 pressure levels.

The Modern-Era Retrospective analysis for Research and Applications, version 2 (MERRA-2) (Gelaro et al., 2017) is a reanalysis produced by the NASA Global Modeling and Assimilation Office (GMAO). The reanalysis is based on the Goddard Earth Observing System (GEOS) atmospheric model. The model has approximately 0.5° × 0.65° horizontal resolution and 72 vertical levels. It performs 3D-Var assimilation of station, upper-air, satellite, ship-based and aircraft data in 6-hourly cycles. In this analysis, we use the MERRA-2 3-hourly instantaneous model-level assimilated meteorological fields (M2I3NVASM) version 5.12.4 product.

The UK Met Office Unified Model (UM) (Walters et al., 2019) is an atmospheric model for weather forecasting and climate projection developed by the UK Met Office and the Unified Model Partnership. The UM is the atmospheric component, called Global Atmosphere (GA), of the HadGEM3–GC3.1 GCM and the UKESM1 earth system model (ESM). In this analysis we performed custom nudged runs of the UM (Telford et al., 2008) in the GA7.1 configuration with a 20 min time step and output temporal resolution on a New Zealand eScience Infrastructure (NeSI)–National Institute of Water & Atmospheric Research (NIWA) supercomputer (Williams et al., 2016). The model was nudged to the ERA-Interim (Dee et al., 2011) atmospheric fields of horizontal wind speed and potential temperature as well as the HadISST sea surface temperature (SST) and sea ice dataset (Rayner et al., 2003). The model uses 85 vertical levels and a horizontal grid resolution of 1.875° × 1.25°.

4 Lidar simulator

The COSP lidar simulator, the Active Remote Sensing Simulator (ACTSIM), was introduced by Chiriaco et al. (2006) for the purpose of deriving simulated CALIOP measurements (Chepfer et al.2007, 2008). The simulation is implemented by applying the lidar equation on model levels. Scattering and absorption by cloud particles and air molecules are calculated using the Mie and Rayleigh theory, respectively. Scattering and absorption by aerosols are not implemented in the presented version, but support is planned in the future for models which provide the concentration of aerosols. Therefore, the current focus of the simulator is solely on cloud evaluation. CALIOP operates at a wavelength of 532 nm, and calculations in the original COSP simulator use this wavelength. We implemented a small set of changes to the lidar simulator to support a number of ALCs with different operating wavelengths and developed a parameterisation of backscattering from ice crystals based on temperature.

Table 3. Physical quantities.


The lidar equation (Emeis, 2010) is based on the radiative transfer equation (Goody and Yung, 1995; Liou, 2002; Petty, 2006; Zdunkowski et al., 2007), which relates the transmission of radiation to scattering, emission and absorption in media such as the atmosphere. The lidar equation assumes that laser radiation passes through the atmosphere where it is absorbed and scattered. A fraction of laser radiation is scattered back to the instrument and reaches the receiver. Scattering and absorption in the atmosphere are determined by its constituents: gases, liquid droplets, ice crystals and aerosol particles. The focus of the current version of the simulator is on clouds. For this purpose, the atmospheric model output needed is four-dimensional fields of the mass mixing ratios of liquid and ice as well as CF. The lidar equation can be applied to these output fields to simulate the backscattered radiation received by the instrument. Table 3 lists the physical quantities used in the following sections. Here, we use a radiative transfer notation similar to Petty (2006) and the notation of the original lidar simulator (Chiriaco et al., 2006).
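The idea of applying the lidar equation on model levels can be illustrated with a strongly simplified Python sketch. It is not the ACTSIM or ALCF algorithm: it considers cloud extinction only (no molecular or aerosol contribution, no subcolumns), evaluates the two-way transmission at layer midpoints, and the function name, the layer values and the multiple-scattering coefficient of 0.7 are assumptions made for the example.

```python
import math

def attenuated_backscatter(alpha, dz, lidar_ratio, eta=1.0):
    """Attenuated volume backscattering coefficient (m^-1 sr^-1) per layer.

    alpha: layer volume extinction coefficients (m^-1), ordered from the surface upward.
    dz: layer thickness (m); lidar_ratio: extinction-to-backscatter ratio (sr);
    eta: multiple-scattering coefficient (dimensionless).
    """
    tau = 0.0  # optical depth accumulated below the current layer
    out = []
    for a in alpha:
        beta = a / lidar_ratio        # unattenuated volume backscattering coefficient
        tau_mid = tau + 0.5 * a * dz  # optical depth from the surface to the layer midpoint
        out.append(beta * math.exp(-2.0 * eta * tau_mid))  # two-way attenuation
        tau += a * dz
    return out

# Clear air below and above a two-layer liquid cloud (illustrative values).
alpha = [0.0, 0.0, 0.01, 0.01, 0.0]
profile = attenuated_backscatter(alpha, dz=100.0, lidar_ratio=19.0, eta=0.7)
print(profile)
```

The second cloudy layer returns a weaker signal than the first even though both have the same extinction, which is the attenuation effect that makes a simulator necessary for a like-for-like comparison with a ground-based instrument.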

Below we provide a brief review of LR, Rayleigh and Mie scattering, calculate LR of cloud droplets at lidar wavelengths of the presented instruments, and introduce an empirical parameterisation of LR and the multiple-scattering coefficient of ice crystals based on previous studies.

4.1 Lidar ratio

The lidar ratio S is the extinction-to-backscattering ratio of atmospheric constituents at the lidar wavelength. It is an important quantity in lidar observations and the lidar simulator because it determines the amount of attenuation and backscattering. LR is not explicitly known from the observed attenuated volume backscattering coefficient. For liquid cloud droplets at near-IR wavelengths it is relatively constant at S ≈ 19 sr (Sect. 4.2), while for ice crystals (Sect. 4.3) and aerosol it is highly variable. When the lidar signal is fully attenuated, and under the assumption that cloud LR is constant and scattering from clouds is much stronger than molecular and aerosol scattering, LR can be determined from the observed attenuated volume backscattering coefficient by integrating it vertically (O'Connor et al., 2004):

(1) $S' = \eta S = \dfrac{1}{2 \int_0^\infty \beta \, \mathrm{d}z},$

where $S'$ is the effective (apparent) LR, a quantity which does not depend on the multiple-scattering coefficient $\eta$.
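As a numerical check of this relation, the following sketch integrates an idealised, fully attenuating profile. The setup is hypothetical: a homogeneous cloud starting at the surface with constant extinction α, for which the attenuated backscatter is (α/S) exp(−2ηαz); the values of S, η and α and the function name are chosen only for illustration.

```python
import math

def effective_lidar_ratio(beta_att, z):
    """Effective LR (sr) as 1 / (2 * vertical integral of attenuated backscatter).

    The integral is approximated with the trapezoidal rule.
    """
    integral = sum(
        0.5 * (beta_att[i] + beta_att[i + 1]) * (z[i + 1] - z[i])
        for i in range(len(z) - 1)
    )
    return 1.0 / (2.0 * integral)

S = 19.0      # lidar ratio of liquid droplets (sr)
eta = 0.7     # multiple-scattering coefficient (assumed value)
alpha = 0.02  # volume extinction coefficient (m^-1), assumed constant
z = [float(i) for i in range(2001)]  # height (m), 1 m steps
beta_att = [(alpha / S) * math.exp(-2.0 * eta * alpha * zi) for zi in z]

s_eff = effective_lidar_ratio(beta_att, z)
print(s_eff, eta * S)
```

The integrated value agrees with ηS to within the discretisation error, independently of the extinction chosen, as long as the signal is fully attenuated within the profile.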

4.2 Rayleigh and Mie scattering

The Rayleigh volume backscattering coefficient $\beta_\mathrm{mol}$ (m⁻¹ sr⁻¹) in ACTSIM is parameterised by the following equation (Eq. 8 in Chiriaco et al., 2006):

(2) $\beta_\mathrm{mol} = \dfrac{p}{k_\mathrm{B} T} \left(5.45 \times 10^{-32}\right) \left(\dfrac{\lambda}{550\ \mathrm{nm}}\right)^{-4.09} = \dfrac{p}{k_\mathrm{B} T} C_\mathrm{mol},$

where for lidar wavelength λ = 532 nm, $C_\mathrm{mol} = 6.2446 \times 10^{-32}$; $k_\mathrm{B}$ is the Boltzmann constant ($k_\mathrm{B} \approx 1.38 \times 10^{-23}$ J K⁻¹), p is the atmospheric pressure and T is the atmospheric temperature. We multiply this equation by exp(4.09(log(532) − log(λ))) (where the value of λ is in nanometres) to get molecular backscattering for wavelengths other than 532 nm, which allows us to support multiple commercially available instruments. The strength of molecular backscattering is usually lower than backscattering from clouds for the relevant wavelengths.
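The parameterisation and its wavelength scaling are simple enough to check numerically. The sketch below evaluates Eq. (2) for the three instrument wavelengths; the function names and the surface conditions (1013.25 hPa, 288.15 K) are chosen for illustration and are not taken from the simulator code.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant (J K^-1)

def c_mol(wavelength_nm):
    """Wavelength-dependent factor C_mol of Eq. (2)."""
    return 5.45e-32 * (wavelength_nm / 550.0) ** -4.09

def beta_mol(p, T, wavelength_nm):
    """Rayleigh volume backscattering coefficient (m^-1 sr^-1).

    p: pressure (Pa); T: temperature (K); wavelength_nm: lidar wavelength (nm).
    """
    return p / (K_B * T) * c_mol(wavelength_nm)

for wl in (532.0, 910.0, 1064.0):
    # Scaling factor applied to the 532 nm value, as described in the text.
    scale = math.exp(4.09 * (math.log(532.0) - math.log(wl)))
    print(wl, c_mol(wl), c_mol(532.0) * scale, beta_mol(101325.0, 288.15, wl))
```

Evaluating `c_mol(532.0)` reproduces the quoted value of about 6.2446×10⁻³², and scaling the 532 nm value by the exponential factor gives the same result as evaluating Eq. (2) directly at the other wavelengths. At 1064 nm the coefficient is 2⁻⁴·⁰⁹ (about 6 %) of its 532 nm value, consistent with the low molecular backscattering noted for the near-IR instruments.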

The lidar signal at visible or near-IR wavelengths is scattered by cloud droplets in the Mie scattering regime (Mie, 1908). In the simplest approximation, one can assume spherical dielectric particles. The scattering from these particles depends on the relative size of the wavelength and the (spherical) particle radius r, expressed by the dimensionless size parameter x:

(3) $x = \dfrac{2 \pi r}{\lambda}.$

While the wavelength is approximately constant during the operation of the lidar, the particle size comes from a distribution of sizes, typically approximated in NWP models and GCMs by a gamma or log-normal distribution with a given mean and standard deviation. Some models provide the mean as effective radius $r_\mathrm{eff}$. If the effective radius is not provided by the model, the lidar simulator assumes a value $r_\mathrm{eff} = 10$ µm by default, which is approximately consistent with global studies of the effective radius (Bréon and Colzy, 2000; Bréon and Doutriaux-Boucher, 2005; Hu et al., 2007; Zhang and Platnick, 2011; Rausch et al., 2017; Fu et al., 2019). This is different from the default effective radius of 30 µm in the original COSP lidar simulator.

In order to support multiple laser wavelengths, it is necessary to calculate backscattering efficiency due to scattering by a distribution of particle sizes. We use the computer code MIEV developed by Warren J. Wiscombe (Wiscombe, 1979, 1980) to calculate backscattering efficiency for a range of the size parameter $x$ and integrate for a distribution of particle sizes. The resulting pre-calculated LR (extinction-to-backscatter ratio) as a function of the effective radius is included in the lidar simulator for fast lookup during the simulation.

Cloud droplet size distribution parameters are an important assumption in lidar simulation due to the dependence of Mie scattering on the ratio of the wavelength and particle size (the size parameter $x$). NWP models and GCMs traditionally use the effective radius $r_\mathrm{eff}$ and effective standard deviation $\sigma_\mathrm{eff}$ (or an equivalent parameter such as effective variance $\nu_\mathrm{eff}$) to parameterise this distribution. Knowledge of the real distribution is likely highly uncertain due to a large variety of clouds occurring globally and the limited ability to predict microphysical cloud properties in models. In this section we introduce theoretical assumptions used in the lidar simulator based on established definitions of the effective radius and effective standard deviation as well as two common distributions. Edwards and Slingo (1996) discuss the effective radius in the context of model radiation schemes, and we will primarily follow the definitions detailed in Chang and Li (2001) and Petty and Huang (2011). The practical result of this section (and the corresponding offline code) is pre-calculated backscatter-to-extinction ratios as a function of the effective radius in the form of a lookup table included in the lidar simulator and used in the online calculations. The offline code is provided and can be re-used for calculation of the necessary lookup tables for different lidar wavelengths, should the user of the code want to support another instrument.

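The effective radius $r_\mathrm{eff}$ and effective standard deviation $\sigma_\mathrm{eff}$ are defined by

$$r_\mathrm{eff} = \frac{\int_0^\infty r^3 n(r)\,\mathrm{d}r}{\int_0^\infty r^2 n(r)\,\mathrm{d}r}, \quad \sigma_\mathrm{eff}^2 = \frac{\int_0^\infty (r - r_\mathrm{eff})^2 r^2 n(r)\,\mathrm{d}r}{\int_0^\infty r^2 n(r)\,\mathrm{d}r}, \tag{4}$$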

where $n(r)$ is the probability density function (PDF) of the distribution. Here, we follow Petty and Huang (2011), who define the effective variance $\nu_\mathrm{eff}$, which relates to $\sigma_\mathrm{eff}$ by $\nu_\mathrm{eff} = \sigma_\mathrm{eff}^2 / r_\mathrm{eff}^2$. Due to lack of knowledge about the real distribution of particle radii, it has to be modelled by a theoretical distribution, such as a log-normal or gamma distribution. The original ACTSIM assumes a log-normal distribution (Chiriaco et al., 2006) with the PDF:

$$n(r) \propto \frac{1}{r} \exp\left(-\frac{(\log r - \mu)^2}{2 \sigma^2}\right), \tag{5}$$

where $\mu$ and $\sigma$ are the mean and the standard deviation of the corresponding normal distribution, respectively. Chiriaco et al. (2006) use the value of $\sigma = \log(1.2) = 0.18$ “for ice clouds” (the value for liquid cloud does not appear to be documented). In our parameterisation we used a combination of $r_\mathrm{eff}$ and $\sigma_\mathrm{eff}$ to constrain the theoretical distribution, wherein the effective standard deviation $\sigma_\mathrm{eff}$ was assumed to be one-fourth of the effective radius $r_\mathrm{eff}$. This choice is approximately consistent with $\sigma = \log(1.2) = 0.18$ at $r_\mathrm{eff} = 20$ µm (see Table 4, described below). In future updates, the values could be based on in situ studies of size distribution or taken from the atmospheric model output if available.

Table 4. Sensitivity tests for the theoretical distribution assumption, effective radius $r_\mathrm{eff}$ and effective standard deviation $\sigma_\mathrm{eff}$ of the cloud droplet size distribution; $\mu$ and $\sigma$ are the mean and standard deviation of a normal distribution, corresponding to the log-normal distribution, numerically calculated from $r_\mathrm{eff}$ and $\sigma_\mathrm{eff}$, and $\mu^*$ and $\sigma^*$ are the actual mean and standard deviation of the distribution (numerically calculated).


From the expression for the $n$th moment of the log-normal distribution $E[X^n] = \exp\left(n\mu + \frac{n^2 \sigma^2}{2}\right)$ and Eq. (4) we calculate $r_\mathrm{eff}$ and $\sigma_\mathrm{eff}$ of the log-normal distribution:
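As a sketch of this calculation, assuming that $n(r)$ is the standard log-normal PDF of Eq. (5) so that the moment formula applies with $X = r$, substituting the moments into Eq. (4) gives the following expressions. They are a derivation under that assumption and may differ in form from the expressions used in the offline code.

$$
\begin{aligned}
r_\mathrm{eff} &= \frac{E[r^3]}{E[r^2]} = \frac{\exp\left(3\mu + \tfrac{9}{2}\sigma^2\right)}{\exp\left(2\mu + 2\sigma^2\right)} = \exp\left(\mu + \tfrac{5}{2}\sigma^2\right), \\
\sigma_\mathrm{eff}^2 &= \frac{E[r^4] - 2 r_\mathrm{eff} E[r^3] + r_\mathrm{eff}^2 E[r^2]}{E[r^2]} = \frac{E[r^4]}{E[r^2]} - r_\mathrm{eff}^2 = \exp\left(2\mu + 6\sigma^2\right) - \exp\left(2\mu + 5\sigma^2\right).
\end{aligned}
$$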


We find $\mu$ and $\sigma$ for given $r_\mathrm{eff}$ and $\sigma_\mathrm{eff}$ numerically by root-finding using the equations above. In practice, we find that the root-finding converges well for $r_\mathrm{eff}$ between 5 and 50 µm, which is the range most likely to be applicable in practice.
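A minimal sketch of such a root-finding step is given here. It assumes the log-normal moment relations $r_\mathrm{eff} = \exp(\mu + 5\sigma^2/2)$ and $\sigma_\mathrm{eff}^2 = \exp(2\mu + 6\sigma^2) - \exp(2\mu + 5\sigma^2)$, which follow from the moment formula and Eq. (4); the function names, the bisection method and the bracket are choices made for the example and not necessarily those of the offline code. Radii are in micrometres.

```python
import math

def lognormal_effective(mu, sigma):
    """r_eff and sigma_eff of a log-normal distribution from its moments."""
    r_eff = math.exp(mu + 2.5 * sigma**2)
    var_eff = math.exp(2 * mu + 6 * sigma**2) - math.exp(2 * mu + 5 * sigma**2)
    return r_eff, math.sqrt(max(var_eff, 0.0))

def solve_mu_sigma(r_eff, sigma_eff, tol=1e-12):
    """Find (mu, sigma) reproducing r_eff and sigma_eff by bisection on sigma."""
    def mu_for(sigma):
        # Eliminate mu so that r_eff is matched exactly for any sigma.
        return math.log(r_eff) - 2.5 * sigma**2

    def residual(sigma):
        return lognormal_effective(mu_for(sigma), sigma)[1] - sigma_eff

    lo, hi = 1e-6, 3.0  # bracket assumed wide enough for cloud droplets
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0:
            hi = mid
        else:
            lo = mid
    sigma = 0.5 * (lo + hi)
    return mu_for(sigma), sigma

# r_eff = 10 um with sigma_eff equal to one-fourth of r_eff.
mu, sigma = solve_mu_sigma(10.0, 2.5)
print(mu, sigma, lognormal_effective(mu, sigma))
```

Eliminating $\mu$ through the first relation reduces the problem to one dimension, where the residual is monotonic in $\sigma$ and bisection is therefore robust.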

The gamma distribution follows the PDF:

$$n(r) \propto r^{(1 - 3\nu_\mathrm{eff})/\nu_\mathrm{eff}} \exp\left(-\frac{r}{r_\mathrm{eff} \nu_\mathrm{eff}}\right) \tag{8}$$

(see e.g. Eq. 13 in Petty and Huang, 2011, or Eq. 1 in Bréon and Doutriaux-Boucher, 2005). In this case, the distribution explicitly depends on $r_\mathrm{eff}$ and $\sigma_\mathrm{eff}$ (through $\nu_\mathrm{eff}$) and as such does not require numerical root-finding.
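The property that the gamma distribution is parameterised directly by the effective quantities can be checked numerically. The sketch below integrates the definitions of Eq. (4) for the unnormalised gamma PDF with a simple midpoint rule; the function names, the integration limit and the number of steps are assumptions for the example. Radii are in micrometres.

```python
import math

def gamma_pdf_unnormalised(r, r_eff, nu_eff):
    """Unnormalised gamma size distribution of Eq. (8)."""
    return r ** ((1 - 3 * nu_eff) / nu_eff) * math.exp(-r / (r_eff * nu_eff))

def effective_moments(n, r_max, steps=20000):
    """Numerically evaluate r_eff and nu_eff of Eq. (4) with a midpoint rule."""
    dr = r_max / steps
    radii = [(i + 0.5) * dr for i in range(steps)]
    m2 = sum(r**2 * n(r) for r in radii) * dr
    m3 = sum(r**3 * n(r) for r in radii) * dr
    r_eff = m3 / m2
    var_eff = sum((r - r_eff) ** 2 * r**2 * n(r) for r in radii) * dr / m2
    return r_eff, var_eff / r_eff**2

r_eff_in, nu_eff_in = 10.0, 0.25**2  # sigma_eff = r_eff / 4
r_eff_out, nu_eff_out = effective_moments(
    lambda r: gamma_pdf_unnormalised(r, r_eff_in, nu_eff_in), r_max=100.0)
print(r_eff_out, nu_eff_out)  # should reproduce the input parameters
```

The normalisation constant of $n(r)$ is not needed because it cancels in both ratios.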

Figure 2. (a) Theoretical distributions of cloud droplet radius based on the log-normal and gamma distributions parameterised by multiple choices of the effective radius $r_\mathrm{eff}$ and effective standard deviation $\sigma_\mathrm{eff}$. (b) Lidar ratio (LR) as a function of effective radius calculated for different theoretical cloud droplet size distributions, laser wavelengths and effective standard deviation ratios. (c) Parameterisation of ice cloud optical properties as a function of temperature based on Garnier et al. (2015) and Heymsfield (2005). The plot shows LR ($S$), LR of CALIPSO calculated using the constant standard processing multiple-scattering coefficient $\eta = 0.6$ ($S_{\mathrm{CALIPSO},\eta=0.6}$), the effective LR of CALIPSO ($S_\mathrm{CALIPSO}$), the effective radius ($r_\mathrm{eff}$) and the multiple-scattering coefficient of CALIPSO ($\eta_\mathrm{CALIPSO}$) determined by Garnier et al. (2015). LRs are calculated for three wavelengths of 532 nm (solid line), 910 nm (dashed line) and 1064 nm (dotted line) by scaling with the colour ratio.


Figure 2a shows the log-normal and gamma distributions calculated for a number of $r_\mathrm{eff}$ and $\sigma_\mathrm{eff}$ values, and Table 4 summarises the properties of these distributions. The actual mean and standard deviation of the distributions do not necessarily correspond well to the effective radius and effective standard deviation.

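In ACTSIM, the volume extinction coefficient $\alpha_\mathrm{e}$ is calculated by integrating the extinction by individual particles over the particle size distribution:

$$\alpha_\mathrm{e} = \int_0^\infty Q_\mathrm{e} \pi r^2 n(r)\,\mathrm{d}r \approx Q_\mathrm{e} \pi \int_0^\infty r^2 n(r)\,\mathrm{d}r = Q_\mathrm{e} \frac{3 q \rho_\mathrm{air}}{4 \rho r_\mathrm{eff}}, \tag{9}$$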

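assuming approximately constant extinction efficiency $Q_\mathrm{e} \approx 2$ (which is approximately true for the interesting range of $r_\mathrm{eff}$ and laser wavelengths) and using the relationship between the cloud liquid mass mixing ratio $q$ and $\int_0^\infty r^2 n(r)\,\mathrm{d}r$:

$$q \rho_\mathrm{air} = \int_0^\infty \frac{4}{3} \pi r^3 \rho\, n(r)\,\mathrm{d}r = \frac{4}{3} \pi \rho \int_0^\infty r^3 n(r)\,\mathrm{d}r = \frac{4}{3} \pi \rho\, r_\mathrm{eff} \int_0^\infty r^2 n(r)\,\mathrm{d}r, \tag{10}$$

where $\rho$ and $\rho_\mathrm{air}$ are the densities of liquid water and air, respectively.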
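The final expression of Eq. (9) is straightforward to evaluate from model fields. The following minimal sketch uses a hypothetical function name and illustrative input values (not taken from any model output), with SI units throughout.

```python
def extinction_coefficient(q, rho_air, r_eff, q_e=2.0, rho_water=1000.0):
    """Volume extinction coefficient (m^-1) of liquid cloud following Eq. (9).

    q: cloud liquid mass mixing ratio (kg kg^-1)
    rho_air: air density (kg m^-3)
    r_eff: effective radius (m)
    q_e: extinction efficiency, assumed constant
    rho_water: density of liquid water (kg m^-3)
    """
    return q_e * 3.0 * q * rho_air / (4.0 * rho_water * r_eff)

# Illustrative values: 0.1 g/kg of cloud liquid, r_eff = 10 um.
alpha_e = extinction_coefficient(q=1e-4, rho_air=1.2, r_eff=10e-6)
print(alpha_e)
```

For a fixed liquid water content, halving the effective radius doubles the extinction, which is why the assumed default $r_\mathrm{eff}$ matters for the simulated attenuation.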

Likewise, the volume backscattering coefficient from particles $\beta_\mathrm{p}$ is calculated by integrating backscattering by individual particles over the particle size distribution:

$$\beta_\mathrm{p} = \int_0^\infty Q_\mathrm{s} \pi r^2 \frac{P_\pi(\pi)}{4\pi} n(r)\,\mathrm{d}r, \tag{11}$$

where $Q_\mathrm{s}$ is scattering efficiency and $P_\pi(\pi)$ is the scattering phase function at 180°. Since the normalisation of $n(r)$ is not known until the online phase of calculation, the backscatter-to-extinction ratio from particles $k_\mathrm{p} = \beta_\mathrm{p} / \alpha_\mathrm{e}$ can be calculated offline instead (the normalisation of $n(r)$ cancels because $n(r)$ appears in both the numerator and denominator):

$$k_\mathrm{p} = \beta_\mathrm{p} / \alpha_\mathrm{e} = \frac{\int_0^\infty Q_\mathrm{s} r^2 \frac{P_\pi(\pi)}{4\pi} n(r)\,\mathrm{d}r}{\int_0^\infty Q_\mathrm{e} r^2 n(r)\,\mathrm{d}r}. \tag{12}$$

We pre-calculate this integral numerically for a permissible interval of $r_\mathrm{eff}$ (5–50 µm) at 500 evenly spaced wavelengths and store the result as a lookup table for the online phase. The integral in the numerator is numerically hard to calculate due to strong dependency of $P_\pi(\pi)$ on $r$. Figure 2b shows LR as a function of $r_\mathrm{eff}$, calculated for log-normal and gamma particle size distributions with $\sigma_\mathrm{eff} = 0.25 r_\mathrm{eff}$ and $\sigma_\mathrm{eff} = 0.5 r_\mathrm{eff}$. This corresponds to the lookup table we use in the online phase of the lidar simulator. As can be seen in Fig. 2b, LR depends only weakly on the choice of the distribution type and the effective standard deviation ratio.

4.3 Backscattering from ice crystals

Simulation of backscattering from ice crystals is relatively complex compared to backscattering from liquid droplets due to the very high variability of ice crystal microphysical properties such as habit, size, orientation and surface roughness, all of which affect LR, extinction cross section, single-scattering albedo and the multiple-scattering coefficient. Common habits include hexagonal plates, hexagonal columns, hollow hexagonal columns, droxtals, bullet rosettes, hollow bullet rosettes and aggregates (Baran, 2009; van Diedenhoven, 2017). Size can be highly variable and bimodal with a dependence on temperature and relative humidity. Crystals are commonly randomly or horizontally oriented (the latter is often reported with hexagonal ice plates). The surface can vary between smooth and rough depending on supersaturation and crystal age. In general, Mie theory cannot be used to simulate backscattering from ice crystals because of their irregular shape (Yang et al., 2014). While large crystals allow the use of the geometric optics approximation to estimate the optical properties, smaller crystals and diffraction by large crystals necessitate the use of more advanced techniques such as the T-matrix method, finite-difference time domain (FDTD), discrete dipole approximation (DDA) and others, which are generally computationally expensive. Current global atmospheric models do not normally explicitly parameterise the microphysical properties of cloud ice and provide only very limited information such as ice mass concentration and in some cases the effective radius of ice crystals in the model output. Radiative transfer schemes of atmospheric models do not explicitly evaluate backscattering (the phase function at 180°) and therefore cannot provide this information to the simulator. Instead the phase function is parameterised by the asymmetry factor, which is likely insufficient to give an accurate estimate of backscattering.

Because the model ice crystal microphysical and optical properties are not known, they have to be parameterised. A first option is to parameterise the microphysical properties such as habit and size and theoretically calculate optical properties. A second option is to directly parameterise the optical properties. This appears to be a more practical choice because of the broad availability of global remote sensing measurements of optical properties from satellites and ground-based lidars compared to relatively scarce in situ measurements of ice crystals. Garnier et al. (2015) analysed CALIPSO lidar and co-located passive infrared data from the Imaging Infrared Radiometer (IIR) and determined a global relationship between temperature, LR and the multiple-scattering coefficient at the lidar wavelength of 532 nm. The multiple-scattering coefficient is taken as a constant of 0.6 in the standard CALIPSO data processing, but they determined that it is in fact variable between about 0.4 and 0.8. Here, we parameterise LR based on their findings. LR varies with the lidar wavelength, mostly due to the change in the diffraction peak and to a lesser extent due to the variation of the refractive index (Borovoi et al., 2014). We use the colour ratio to estimate LR at lidar wavelengths other than 532 nm. A colour ratio of 1064 nm relative to 532 nm is commonly estimated for dual-wavelength lidars such as CALIOP. Here, we use a value of 0.8, approximately consistent with the results of Bi et al. (2009) and Vaughan et al. (2010). The effective radius is defined for non-spherical particles as $r_\mathrm{eff} = \frac{3}{2} \frac{\mathrm{IWC}}{\sigma}$, where IWC is the ice water content, and $\sigma$ is the volume extinction coefficient of ice. Heymsfield (2005) summarised the ice crystal effective radius (related to $\mathrm{IWC}/\sigma$ by a factor of 1.64) parameterised as a function of temperature based on a number of field studies. We use this relationship for determination of the effective radius.
Figure 2c shows the true and effective LR based on Garnier et al. (2015) and the effective radius based on Heymsfield (2005), parameterised by the following equations:


where $T$ is atmospheric temperature in Kelvin (K). $S$ follows Garnier et al. (2015, Fig. 12b), $\eta$ follows Garnier et al. (2015, Fig. 9a) and $r_\mathrm{eff}$ follows Heymsfield (2005, Fig. 2), where the concave and convex shape (respectively) is approximated by using $1/T$ as an argument of the linear approximation, and we use a logarithmic scale of $r_\mathrm{eff}$ in the expression for $r_\mathrm{eff}$ to avoid negative values at low temperature. Figure 2c also shows LR when calculated with the assumption of $\eta = 0.6$ ($S_{\mathrm{CALIPSO},\eta=0.6}$) as in the standard processing of CALIPSO data. This corresponds to the empirically found relationship in Garnier et al. (2015, Fig. 12a) and Josset et al. (2012, Fig. 9) with a local maximum at 225 K. LR at wavelengths other than 532 nm is approximated by scaling with $0.8^{(\lambda - 532)/532}$, where $\lambda$ is lidar wavelength in nanometres (nm) and 0.8 is the approximate value of the 1064 nm / 532 nm colour ratio. The parameterisation of LR ($S$ in Fig. 2c) spans about the same range of values as reported by Hopkin (2018, Fig. 5.6) (20 to 60 sr) and Yorks et al. (2011) (10 to 60 sr). Based on CALIPSO observations, Hu (2007) determined that while the effective LR of global ice clouds at a lidar wavelength of 532 nm is mostly clustered around 17 sr, horizontally oriented plates produce a much lower effective LR below 10 sr caused by specular reflection. These results are close to our parameterisation of effective LR ($S_\mathrm{CALIPSO}$). In the current version of the lidar simulator we do not parameterise horizontally oriented plates, but in a future version they could be taken into account by parameterising their concentration based on temperature (Noel and Chepfer, 2010). For the ALCs we use the same constant value of the multiple-scattering coefficient $\eta = 0.7$ as for liquid cloud droplets (Sect. 4.5).
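The wavelength scaling factor based on the colour ratio can be evaluated as in the sketch below. The function name is hypothetical, the wavelength is taken in nanometres, and the code only computes the factor itself; how it is applied to the 532 nm LR follows the simulator and is not reproduced here.

```python
def colour_ratio_factor(wavelength_nm, colour_ratio=0.8):
    """Scaling factor relative to 532 nm; equals colour_ratio at 1064 nm."""
    return colour_ratio ** ((wavelength_nm - 532.0) / 532.0)

# Factors for the three wavelengths shown in Fig. 2c.
for wl in (532.0, 910.0, 1064.0):
    print(wl, colour_ratio_factor(wl))
```

The exponent interpolates between 0 at 532 nm and 1 at 1064 nm, so the factor is 1 at the reference wavelength and equal to the colour ratio at 1064 nm.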

4.4 Cloud overlap and cloud fraction

Model cloud is defined by the liquid and ice mass mixing ratio as well as the cloud fraction (CF) in each atmospheric layer. The lidar simulator simulates radiation passing vertically at a random location within the grid cell. Therefore, it is necessary to generate a random vertical cloud overlap based on the cloud fraction in each layer, as the overlap is not explicitly defined in the model output. Two common methods of generating overlap are the random and maximum–random overlap methods (Geleyn and Hollingsworth, 1979). In the random overlap method, each layer is either cloudy or clear with a probability given by CF, independent of other layers. The maximum–random overlap method assumes that adjacent layers with non-zero CF are maximally overlapped, whereas layers separated by zero CF layers are randomly overlapped. COSP implements cloud overlap generation in the Subgrid Cloud Overlap Profile Sampler (SCOPS) (Klein and Jakob, 1999; Webb et al., 2001; Chepfer et al., 2008). The ALC simulator uses SCOPS to generate 10 random subcolumns for each profile using the maximum–random overlap assumption as the default setting of a user-configurable option. The attenuated volume backscattering coefficient profile and cloud occurrence can be plotted for any subcolumn. Due to the random nature of the overlap, the attenuated volume backscattering coefficient profile may differ from the observed profile even if the model is correct in its cloud simulation. The random overlap generation should, however, result in unbiased cloud statistics.
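The two overlap assumptions can be illustrated with a simplified subcolumn generator. This sketch is not SCOPS; it is a compact stochastic generator written for the example, in which a single random number per subcolumn is reused between adjacent cloudy layers (maximum overlap) and redrawn after a clear layer (random overlap). All function names are hypothetical.

```python
import random

def random_overlap(cf, rng):
    """One subcolumn: each layer is cloudy with probability cf[k], independently."""
    return [rng.random() < c for c in cf]

def max_random_overlap(cf, rng):
    """One subcolumn under the maximum-random overlap assumption."""
    mask = []
    x = rng.random()
    for k, c in enumerate(cf):
        if k > 0 and not mask[-1]:
            # Neighbouring layer is clear here: redraw x within its clear part.
            x = rng.random() * (1.0 - cf[k - 1])
        # Otherwise x is reused, which maximises overlap with the neighbour.
        mask.append(x >= 1.0 - c)
    return mask

def generate_subcolumns(cf, n=10, overlap=max_random_overlap, seed=0):
    """Generate n boolean cloud masks for a profile of layer cloud fractions."""
    rng = random.Random(seed)
    return [overlap(cf, rng) for _ in range(n)]

cf_profile = [0.3, 0.5, 0.0, 1.0]  # illustrative layer cloud fractions
for column in generate_subcolumns(cf_profile):
    print(column)
```

In this generator the random number stays uniformly distributed in every layer, so the expected cloud occurrence in each layer equals its CF, consistent with the requirement of unbiased cloud statistics.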

4.5 Multiple scattering

Due to a finite FOV of the lidar receiver, a fraction of the laser radiation scattered forward will remain in the FOV. Therefore, the effective attenuation is smaller than calculated with the assumption that all but the backscattered radiation is removed from the FOV and cannot reach the receiver. The forward scattering can be repeated multiple times before a fraction of the radiation is backscattered, eventually reaching the receiver. To account for this multiple-scattering effect, the COSP lidar simulator uses a multiple-scattering correction coefficient $\eta$, by which the volume scattering coefficient is multiplied before calculating the layer optical thickness (Chiriaco et al., 2006; Chepfer et al., 2007, 2008). The theoretical value of $\eta$ is between 0 and 1 and depends on the receiver FOV and optical properties of the cloud. For CALIOP at $\lambda = 532$ nm a value of 0.7 is used in the COSP lidar simulator. Hogan (2006) implemented a fast approximate multiple-scattering code. This code has recently been used by Hopkin et al. (2019) in their ceilometer calibration method. They noted that $\eta$ is usually between 0.7 and 0.85 for wavelengths between 905 and 1064 nm. The ALC simulator presented here does not use an explicit calculation of $\eta$ but retains the value of $\eta = 0.7$ for cloud droplets. The code of Hogan (2006), “Multiscatter”, is publicly available (last access: 1 January 2021) and could be used in a later version of the framework to improve the accuracy of simulated attenuation and calibration.
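The effect of the multiple-scattering coefficient on the simulated signal can be sketched as follows. The function is a simplified illustration, not the COSP implementation: it assumes uniform layer thickness, orders layers from the surface upwards and accumulates attenuation from the layers below only.

```python
import math

def attenuated_backscatter(beta, alpha, dz, eta=0.7):
    """Attenuated backscatter per layer for a ground-based lidar looking up.

    beta, alpha: layer backscattering (m^-1 sr^-1) and extinction (m^-1)
    coefficients ordered from the surface upwards; dz: layer thickness (m);
    eta: multiple-scattering coefficient applied to the optical thickness.
    """
    out, tau = [], 0.0
    for b, a in zip(beta, alpha):
        out.append(b * math.exp(-2.0 * eta * tau))  # two-way transmission
        tau += a * dz  # optical thickness accumulated below the next layer
    return out

# Three identical cloudy layers, with and without the correction.
profile_eta = attenuated_backscatter([1e-3] * 3, [0.01] * 3, 100.0, eta=0.7)
profile_single = attenuated_backscatter([1e-3] * 3, [0.01] * 3, 100.0, eta=1.0)
print(profile_eta)
print(profile_single)
```

With $\eta < 1$ the signal penetrates deeper into the cloud than under the single-scattering assumption ($\eta = 1$).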

5 Lidar data processing

The scheme in Fig. 1 outlines the processing done in the framework. The individual processing steps are described below.

5.1 Noise and subsampling

ALC signal reception is affected by a number of sources of noise such as sunlight and electronic noise (Kotthaus et al., 2016). Range-independent noise can be removed by assuming that the attenuated volume backscattering coefficient at the highest range gate is dominated by noise. This is true if the highest range is not affected by clouds or aerosol and if contributions from molecular scattering are negligible. The supported instruments have a range of approximately 8 (CL31), 15 (CL51, CHM 15k) and 30 km (MiniMPL). By assuming that the distribution of noise at the highest level is approximately normal, the mean and standard deviation can be calculated from a sample over a period of time such as 5 min, which is short enough to assume the noise is constant over this period and long enough to achieve accurate estimates of the standard deviation. The mean and standard deviation can then be scaled by the square of the range to estimate the distribution of range-independent noise at each range bin. By subtracting the noise mean from the measured attenuated volume backscattering coefficient we get the expected attenuated volume backscattering coefficient. The result of the noise removal algorithm is the expected attenuated volume backscattering coefficient and its standard deviation at each range bin.
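The noise removal steps above can be sketched as follows. This is a minimal illustration with a hypothetical function name; it assumes range-corrected profiles within one averaging window and interprets the scaling as the ratio of the squared range to the squared highest range, which is an assumption of the example.

```python
import statistics

def remove_noise(profiles, ranges):
    """Subtract range-scaled noise from range-corrected backscatter profiles.

    profiles: list of profiles within one window (e.g. 5 min), each a list of
    backscatter values at the range gates given by ranges (m, increasing).
    Returns (denoised profiles, noise standard deviation per range gate).
    """
    top = [p[-1] for p in profiles]  # highest range gate, assumed noise only
    noise_mean = statistics.mean(top)
    noise_sd = statistics.stdev(top)
    # Noise in the range-corrected signal grows with the square of the range.
    scale = [(r / ranges[-1]) ** 2 for r in ranges]
    denoised = [[b - noise_mean * s for b, s in zip(p, scale)] for p in profiles]
    return denoised, [noise_sd * s for s in scale]

# Two synthetic profiles with three range gates (arbitrary units).
ranges_m = [1000.0, 2000.0, 4000.0]
raw = [[5.0, 1.0, 0.8], [5.0, 1.0, 1.2]]
denoised, sd = remove_noise(raw, ranges_m)
print(denoised)
print(sd)
```

The returned standard deviation per range gate is the quantity used later by a threshold-based cloud detection.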

5.2 Backscatter calibration

ALCs often report the attenuated volume backscattering coefficient in arbitrary units (a.u.) or as NRB (MiniMPL). If they report it in units of m⁻¹ sr⁻¹, these values are often not calibrated to represent the true absolute attenuated volume backscattering coefficient. Assuming that range-dependent corrections (overlap, dead time and after pulse) have been applied to the attenuated volume backscattering coefficient in a.u., the reported attenuated volume backscattering coefficient is proportional to the true attenuated volume backscattering coefficient (inclusive of noise backscattering). In order to have a comparable quantity to the lidar simulator and consistent input to the subsequent processing (e.g. cloud detection), calibration by multiplying by a calibration coefficient is required. Formally, the units of the calibration coefficient depend on the units of backscattering recorded by the instrument, which are m⁻¹ sr⁻¹ in CL31 and CL51, unitless in CHM 15k, and µs⁻¹ µJ⁻¹ km² in MiniMPL; i.e. the units of the calibration coefficient are m⁻¹ sr⁻¹ (instrument units)⁻¹. In the following discussion, we leave out the units. Several methods of calibration have been previously described: calibration based on LR in fully attenuating liquid stratocumulus clouds (O'Connor et al., 2004; Hopkin et al., 2019), calibration based on molecular backscattering (Wiegner et al., 2014) and calibration based on a multi-wavelength lidar reference (Heese et al., 2010; Jin et al., 2015). In addition, calibration can be assisted by sun-photometer or radiosonde measurements (Wiegner et al., 2014).

Table 5. Theoretical molecular volume backscattering coefficient calculated at pressure 1000 hPa and temperature 20 °C along with the calibration coefficient, relative to the instrument native units, determined for the instrument based on the molecular volume backscattering coefficient and stratocumulus lidar ratio calibration methods.


Relatively large variability in the calibration coefficient has been determined for instruments of the same model (Hopkin et al., 2019). However, past studies can be useful for determining an approximate value of the coefficient before applying one of the calibration methods. For the CL51, Jin et al. (2015) reported a value of 1.2 ± 0.1 based on a multi-wavelength lidar reference. Hopkin et al. (2019) reported mean values of 1.4–1.5 for a number of CL31 instruments (software version 202). For CHM 15k, Hopkin et al. (2019) reported mean values between 0.3 and 0.8 for a majority of the instruments examined. The ALCF provides per-instrument default values of the calibration coefficient (Table 5), but a unit-specific coefficient should be determined for an analysed instrument during the lidar data processing step.

Calibration based on LR in fully opaque liquid stratocumulus clouds has been successfully applied to large networks of ALCs. It utilises the fact that given suitable conditions the vertically integrated attenuated volume backscattering coefficient is proportional to LR of the cloud, which can be theoretically derived if the cloud droplet effective radius can be assumed. The theoretically derived value is about 18.8 sr for common ALC wavelengths and a relatively large range of effective radii (O'Connor et al., 2004). Another factor which needs to be known or assumed is the multiple-scattering coefficient, which tends to be about 0.7–0.85 in common ALCs (Sect. 4.5). Due to its relatively simple requirements, this method is possibly the easiest ALC calibration method. The ALCF implements this calibration method by letting the user identify time periods with fully opaque liquid stratocumulus cloud, for which the mean LR is calculated. The ratio of the observed LR and the theoretical LR is equivalent to the calibration coefficient. This implementation, while very easy to perform, has multiple limitations, some of which are highlighted by Hopkin et al. (2019).

  1. Aerosol can cause additional attenuation and scattering, which results in LR that is different from the theoretical value by an unknown factor. Therefore, a frequent re-calibration may be necessary.

  2. The multiple-scattering coefficient assumption may not be accurate for the given instrument.

  3. The 910 nm wavelength of CL31 and CL51 is affected by water vapour absorption, which causes additional attenuation that is currently not taken into account in the calculation of LR.

  4. Near-range attenuated volume backscattering coefficient retrieval is affected by receiver saturation and incomplete overlap. Therefore, using stratocumulus clouds above approximately 2 km for this calibration method is recommended. This range is instrument-dependent.

  5. The composition of stratocumulus clouds may be uncertain. At temperatures between 0 and −30 °C these clouds may contain both liquid and ice, which results in a different LR than expected.

These limitations could be addressed in the future by (1) using sun-photometer observations as an optional input to determine the aerosol optical depth (AOD), (2) calculating the multiple-scattering coefficient more accurately (such as with the Multiscatter package of Hogan, 2006), (3) calculating the water vapour absorption explicitly based on water vapour, temperature and pressure fields from a reanalysis or radiosonde profile data, (4) correcting the near-range backscatter based on the integrated attenuated volume backscattering coefficient distribution as a function of the height of the maximum backscatter (Hopkin et al., 2019, Sect. 5.1), or (5) combining the attenuated volume backscattering coefficient profile with the temperature field from a reanalysis to exclude cold clouds.
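The stratocumulus LR calibration described above can be sketched as follows. This is a simplified illustration and not the ALCF implementation: the function name is hypothetical, the vertical integral uses a rectangle rule on uniformly spaced range bins, and the values of the multiple-scattering coefficient and theoretical LR are assumed defaults. None of the listed limitations are addressed.

```python
import math

def calibration_coefficient(profiles, dz, eta=0.7, lr_theoretical=18.8):
    """Calibration coefficient from uncalibrated profiles in opaque stratocumulus.

    profiles: uncalibrated attenuated backscatter profiles (instrument units)
    from user-selected periods, each sampled every dz metres.
    """
    lrs = []
    for p in profiles:
        integral = sum(p) * dz  # vertical integral of the profile
        lrs.append(1.0 / (2.0 * eta * integral))
    lr_observed = sum(lrs) / len(lrs)  # mean observed LR over the periods
    return lr_observed / lr_theoretical

# Synthetic test: a true profile of an opaque cloud, reported by an
# instrument whose values are too low by a factor of 1.3.
true_coefficient, alpha = 1.3, 0.02
z_mid = [i + 0.5 for i in range(2000)]  # bin centres (m), dz = 1 m
true_profile = [alpha / 18.8 * math.exp(-2.0 * 0.7 * alpha * z) for z in z_mid]
reported = [b / true_coefficient for b in true_profile]
c = calibration_coefficient([reported, reported], dz=1.0)
print(c)  # close to 1.3; multiplying the reported values by c calibrates them
```

Multiplying the reported backscatter by the resulting coefficient recovers the true profile in this synthetic case.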

Molecular (Rayleigh) backscattering can be accurately calculated if the temperature and pressure of the atmospheric profile are known (Sect. 4.2). This can be employed for absolute calibration of ALCs. Given the low SNR of low-power ALCs, several hours of integration are required to identify the molecular backscattering (Wiegner et al., 2014). The molecular backscattering is attenuated by an unknown amount of aerosol with unknown LR, and the near-range backscattering is affected by a potentially inaccurate overlap correction. Therefore, this method alone produces calibration coefficients which depend on the atmospheric conditions. We found that all studied ALCs except for the CL31 are capable of observing the molecular backscattering (Sect. 7). Therefore, this method may be used in addition to the liquid stratocumulus LR method for cross-validation of the calibration.

5.3 Cloud detection

Cloud is the most strongly attenuating feature in ALC attenuated volume backscattering coefficient measurements. Due to this attenuation, the lidar signal is quickly attenuated in thick cloud and can fall below the noise level before reaching the top of the cloud. This means that the first cloud base can be detected reliably (unless the cloud is too thin or too high and obscured by noise), while the cloud top or multi-layer cloud cannot be observed reliably under all conditions. The opposite is true for spaceborne lidars, which can detect the cloud top reliably but cannot always detect the cloud base. Therefore, ALC observations can be regarded as complementary to spaceborne lidar observations. By applying a suitable algorithm, one can detect CBH and CTH as well as identify cloud layers. Instrument firmware often determines CBH and sometimes cloud layers as part of its internal processing, often using an undisclosed algorithm which is not comparable between different instruments and potentially not even different versions of the instrument firmware (Kotthaus et al., 2016). Mattis et al. (2016) compared a large number of ALCs and found differences of up to 70 m between the reported CBH, and others found relatively large differences as well (Liu et al., 2015b; Silber et al., 2018). As an alternative to instrument-reported CBH and cloud layers, it is possible to detect cloud based on the attenuated volume backscattering coefficient profile. A relatively large number of cloud detection algorithms have been proposed (Wang and Sassen, 2001; Morille et al., 2007; Martucci et al., 2010; Van Tricht et al., 2014; Silber et al., 2018; Cromwell and Flynn, 2019).
We use a simple algorithm based on an attenuated volume backscattering coefficient threshold applied to the denoised backscatter, assuming that the noise can be represented by a normal distribution at the highest range, which is unlikely to contain cloud or aerosol if the instrument is pointing vertically (this may not be true, however, for CL31, which has a maximum range of just 7.7 km). This assumption neglects the range-dependent molecular backscattering, which is relatively small at the ceilometer wavelengths examined (910 and 1064 nm). A cloud mask is determined to be positive where the attenuated volume backscattering coefficient is greater than a chosen threshold plus 5 standard deviations of noise at the given range. In addition, the observed attenuated volume backscattering coefficient can optionally be coupled with a simulated attenuated molecular volume backscattering coefficient and molecular backscattering removed from the observed backscattering prior to cloud detection. This improves the results in the boundary layer, especially with instruments which operate in the visible range and are therefore affected by large molecular backscattering (MiniMPL). A threshold of 2 × 10⁻⁶ m⁻¹ sr⁻¹ was found to be a good compromise between false detection and misses in our Southern Hemisphere data relatively unaffected by anthropogenic aerosol. Our observed and simulated results show that cloud backscatter is generally higher than 1 × 10⁻⁶ m⁻¹ sr⁻¹, and a threshold below 2 × 10⁻⁶ m⁻¹ sr⁻¹ results in excessive false detection due to aerosol, molecular backscattering and noise from sunlight. The threshold is an adjustable option of the ALCF. Users are encouraged to change this value if, for example, the data are affected by a large amount of aerosol. The default threshold is above the maximum molecular backscattering, which is approximately 1.54 × 10⁻⁶ m⁻¹ sr⁻¹ at the surface in the case of the MiniMPL (wavelength 532 nm).
Noise is not simulated by the lidar simulator, but the cloud detection algorithm allows for coupling of simulated and observed profiles, whereby the noise standard deviation is taken from the corresponding location in the observed profile. With 5 min averaging, when the standard deviation of noise is relatively low, we found that the coupling does not make substantial differences in the detected cloud (not shown). While the threshold-based algorithm is less sophisticated than other methods of cloud detection, the vertical resolution of the simulated attenuated volume backscattering coefficient is likely too low and the vertical derivatives of the simulated attenuated volume backscattering coefficient too crudely represented (Table 7) to apply any algorithm based on the vertical derivatives of the attenuated volume backscattering coefficient. Using the same cloud detection algorithm on the observed and simulated attenuated volume backscattering coefficient is essential for an unbiased one-to-one comparison of cloud.
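The threshold-based detection described in this section can be sketched in a few lines. The function names are hypothetical and the input values are synthetic; the default threshold and the number of standard deviations follow the values given in the text.

```python
def cloud_mask(beta, noise_sd, threshold=2e-6, n_sd=5.0):
    """Cloud mask for one denoised profile.

    beta: denoised attenuated backscatter (m^-1 sr^-1) per range bin;
    noise_sd: noise standard deviation per range bin (same units).
    """
    return [b > threshold + n_sd * s for b, s in zip(beta, noise_sd)]

def cloud_base_height(mask, heights):
    """Height of the lowest cloudy bin, or None if the profile is clear."""
    for cloudy, h in zip(mask, heights):
        if cloudy:
            return h
    return None

# Synthetic profile: weak signal, cloud in bins 2 and 3, noisy top bin.
beta = [1e-6, 3e-6, 50e-6, 4e-6]
noise_sd = [1e-8, 1e-7, 1e-7, 1e-6]
heights = [100.0, 200.0, 300.0, 400.0]
mask = cloud_mask(beta, noise_sd)
print(mask, cloud_base_height(mask, heights))
```

In the top bin the backscatter exceeds the bare threshold but not the threshold plus 5 standard deviations of noise, so it is not flagged as cloud. Applying the same function to observed and simulated profiles keeps the comparison consistent.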

5.4 Water vapour absorption

Previous studies have noted that ceilometers which utilise the wavelength of 910 nm, such as the Vaisala CL31 and CL51, are affected by additional absorption of laser radiation by water vapour (Wiegner and Gasteiger, 2015; Wiegner et al., 2019; Hopkin et al., 2019). The wavelength coincides with water vapour absorption bands between 900 and 930 nm, while the other common ceilometer wavelength of 1064 nm is not affected. Wiegner and Gasteiger (2015) reported that it can cause absorption of the order of 20 % in the extratropics and 50 % in the tropics. The lidar simulator does not currently account for this. However, as the water vapour concentration is available from the reanalyses and models, it should be possible to use a line-by-line model to calculate the water vapour volume absorption coefficient for each vertical layer during the integration process. Water vapour also affects calibration of the observed attenuated volume backscattering coefficient. In order to use the liquid stratocumulus LR calibration method, the attenuated volume backscattering coefficient has to be corrected for water vapour absorption to achieve high-accuracy calibration. Hopkin et al. (2019) used a simplified approach based on a parameterised curve and reported a difference from explicit radiative transfer calculations of 2 % in the United Kingdom atmosphere (Middle Wallop). In the future either approach should be used to include water vapour absorption in the simulator or remove the effect of water vapour absorption from the observed lidar attenuated volume backscattering coefficient to achieve an improved one-to-one comparison between the observations, reanalyses and models.

6 Description of case studies

The case studies analysed here were selected from available datasets to include all instruments supported by the framework. We compare four different instruments (CHM 15k, CL31, CL51, MiniMPL) deployed at three locations in NZ (Lauder, Christchurch, Cass) with three reanalyses (MERRA-2, ERA5, JRA-55), one NWP model (AMPS) and one GCM (UM). These case studies aim to demonstrate capability rather than to comprehensively evaluate cloud simulation in the models and reanalyses. Kuma et al. (2020) provide a detailed evaluation of the UM and MERRA-2 relative to shipborne ceilometer observations. Figure 3a shows the location of the sites and Table 6 summarises the case studies, which are also described in greater detail below. Two of the sites had co-located instruments: CL31 and MiniMPL in Lauder and CHM 15k and MiniMPL in Christchurch. The MiniMPLs in Lauder and Christchurch were two different units. The number of model levels within the range of each instrument and the vertical resolution range are listed in Table 7.

Figure 3. (a) Map showing the location of sites. Data at three sites in New Zealand were analysed: Cass, Lauder and Christchurch. (b, c, d) Cloud occurrence histograms as a function of height above the mean sea level observed at three sites and simulated by the lidar simulator based on atmospheric fields for five reanalyses and models. The total cloud fraction (CF) is also shown. The histogram is calculated from the cloud mask as determined by the cloud detection algorithm.

Table 6. Location of sites and instruments. The time periods are inclusive.


Table 7. Number of model levels and vertical resolution in the range of the instrument at the locations of the case studies. The first number is the number of levels, followed by the minimum and maximum distance between adjacent model levels in the lidar's range (m).


Cass is a field station of the University of Canterbury located at an altitude of 577 m in the Southern Alps of the South Island of NZ. The station is located far from any settlements and is likely less affected by anthropogenic aerosol relative to the other sites. We have analysed 13 d of observations with a CL51 at this station performed in September and October 2014.

Lauder is a field station of NIWA located inland in the central Otago region on the South Island of NZ. The station is situated in a rural area relatively far from large human settlements at an altitude of 370 m. We have analysed 13 d of co-located MiniMPL and CL31 observations made in January 2018. The MiniMPL was operated in an enclosure with a scanning head set to a fixed vertical scanning mode during this period (elevation angle 90°).

Observations at the Christchurch site were performed at the University of Canterbury campus on the Ernest Rutherford building rooftop at an altitude of 45 m. Christchurch is located on the east coast of the South Island of NZ. Its climate is affected by the ocean, its proximity to the hilly area of the Banks Peninsula, the Canterbury Plains and föhn-type winds (Canterbury northwester) resulting from its position on the lee side of the Southern Alps. The city is affected by significant wintertime air pollution from domestic wood burning and transport. The orography of the city and the adjacent Canterbury Plains is very flat, making it prone to inversions. The Ernest Rutherford building is a five-floor building situated in an urban area, surrounded by multiple buildings of similar height. We have analysed 23 d of co-located MiniMPL and CHM 15k observations performed in July and August 2019. The MiniMPL was operated in an enclosure with a scanning head set to a fixed vertical scanning mode (elevation angle 90°). The nudged run of the UM was only available up to the year 2018 and was therefore not analysed for this site.

7 Results

To demonstrate how the ALCF can be used, we compared a total of 49 d of ALC observations with the simulated lidar attenuated volume backscattering coefficient at three sites in NZ (Sect. 6). The observed attenuated volume backscattering coefficient was normalised to the calibrated absolute range-corrected attenuated volume backscattering coefficient. The noise mean as determined at the furthest range was removed from the attenuated volume backscattering coefficient. Cloud detection based on an absolute attenuated volume backscattering coefficient threshold of 2 × 10⁻⁶ m⁻¹ sr⁻¹, after removing molecular backscattering and 5 noise standard deviations, was applied to derive a cloud mask and CBH. In this section we compare the statistical cloud occurrence as a function of height above the mean sea level (a.s.l.) (Fig. 3b, c, d) and individual attenuated volume backscattering coefficient profiles (selected profiles are shown in Figs. 4, 5 and 6). In these plots 5 standard deviations of the attenuated volume backscattering coefficient noise (Sect. 5.3) were removed. In addition, molecular backscattering was removed by coupling the observed data (Figs. 4a, 5a, 6a) with the molecular attenuated volume backscattering coefficient calculated by the lidar simulator based on the MERRA-2 reanalysis data. The same applies to model data (Figs. 4b–f, 5b–f, 6b–e), but the molecular attenuated volume backscattering coefficient was calculated by the lidar simulator based on the respective model data.
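The threshold-based cloud detection described above can be sketched in a few lines. This is a minimal illustration rather than the ALCF implementation: the function names and array layout (time × height) are hypothetical, and the total CF is assumed here to be the fraction of profiles containing cloud at any level.

```python
import numpy as np


def cloud_mask(beta_att, beta_mol, noise_sd, threshold=2e-6, n_sd=5.0):
    """Threshold-based cloud mask (True where cloud is detected).

    beta_att: attenuated volume backscattering coefficient, shape (time, height),
              in m-1 sr-1, with the noise mean already removed.
    beta_mol: molecular attenuated volume backscattering coefficient,
              broadcastable to beta_att.
    noise_sd: noise standard deviation, broadcastable to beta_att.
    """
    residual = (
        np.asarray(beta_att, dtype=float)
        - np.asarray(beta_mol, dtype=float)
        - n_sd * np.asarray(noise_sd, dtype=float)
    )
    return residual > threshold


def cloud_occurrence(mask):
    """Return (cloud occurrence per height level, total cloud fraction).

    The total cloud fraction is taken as the fraction of profiles with cloud
    detected at any level (an assumption of this sketch).
    """
    mask = np.asarray(mask, dtype=bool)
    return mask.mean(axis=0), mask.any(axis=1).mean()
```

Applying the same two functions to the observed and the simulated profiles keeps the comparison consistent, in line with the requirement of using one detection algorithm for both.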

Figure 4. Examples of the observed and simulated attenuated volume backscattering coefficient during 24 h at Cass. The observed attenuated volume backscattering coefficient was normalised to absolute units and denoised. The first subcolumn generated by the Subgrid Cloud Overlap Profile Sampler (SCOPS) was used to make the plots. The red line is the station altitude. (a) The observed effective lidar ratio calculated by vertically integrating the attenuated volume backscattering coefficient is also shown, as are (b–f) the corresponding model cloud liquid water, cloud ice and cloud fraction.


Figure 5. The same as Fig. 4 but for Lauder.


Figure 6. The same as Fig. 4 but for Christchurch.


7.1 Cass

We analysed 13 d of CL51 observations from the Cass field station in late winter. Due to the location of the station at a relatively high altitude in the varied terrain of the Southern Alps, the models, with their relatively coarse horizontal grid resolution, do not represent the terrain and position accurately. The orography representation of the models meant that the virtual altitude of the station was 1115 m (AMPS), 1051 m (ERA5), 401 m (JRA-55), 914 m (MERRA-2) and 428 m (UM). The virtual position, which is the centre of the nearest model grid cell to the site location, ranged from relatively close in the Southern Alps (AMPS, ERA5, MERRA-2, UM) to relatively far on the west coast of NZ (JRA-55) depending on the horizontal resolution of the grid. The time period examined was characterised by diverse cloud occurrence with periods of low cloud and precipitation, mid-level cloud, fog, high cloud, and clear skies. Precipitation, currently not simulated by the lidar simulator, was present in about 18 % of the observed attenuated volume backscattering coefficient profiles, as determined by visual inspection. Figure 3b shows that predominantly low cloud and precipitation were observed, between the ground and 3 km a.s.l., in 25 % of profiles. Cloud between 3 and 12 km a.s.l. was observed about evenly, in 2 % of profiles. While the reanalyses and models were able to partially reproduce the peak of cloud occurrence near 1 km a.s.l., the peak they displayed is less vertically broad than observed, and in the UM the peak was much weaker than observed. The lack of precipitation simulation might have also contributed to this apparent difference between observed and simulated cloud. Above 3 km a.s.l., the reanalyses and models tended to overestimate cloud, with only ERA5 and JRA-55 simulating close to the observed cloud occurrence. The observed total CF was 61 %. AMPS overestimated this value by 5 percentage points (pp), and ERA5 and the UM reproduced almost the exact value (within 1 pp), while the other reanalyses (JRA-55 and MERRA-2) underestimated CF by about 15 pp.

7.2 Lauder

We also analysed 13 d of CL31 and MiniMPL observations from the Lauder station in summer. During the time period relatively diverse cloud was observed, with periods of low, middle and high cloud, clear sky, and a small fraction of profiles with precipitation (about 3 %). The altitude of the station of 370 m a.s.l. generally had a much higher equivalent in the reanalyses and models at 565 m (AMPS), 642 m (ERA5), 681 m (JRA-55) and 786 m (MERRA-2) due to the presence of hills in the surrounding region (the station is in a high valley), with the exception of the UM wherein the altitude was 385 m. The virtual station position in the reanalyses and models ranged from relatively close to the station in the same geographical region (AMPS, ERA5), to a nearby location in a more hilly region (JRA-55), a relatively distant location in the adjacent Dunstan Mountains (MERRA-2) and a relatively distant location in central Otago (UM). Figure 3c shows that the CL31 observed relatively even cloud occurrence between the ground and 3 km a.s.l. at 8 %, falling off to about 3 % between 4 and 8 km a.s.l. (the maximum lidar range of CL31 is 7.7 km). The MiniMPL observed a much weaker attenuated volume backscattering coefficient than CL31 below 3 km a.s.l., which was identified as an overlap calibration issue in the MiniMPL. The MiniMPL observed substantial amounts of cloud above 8 km not present in the CL31 observations due to its range limitation. Overall, the observed cloud occurrence had two peaks, between the ground and 3 km a.s.l. and at about 9 km a.s.l. The simulated cloud occurrence was generally underestimated between the ground and 5 km a.s.l., with the exception of the UM which reproduced the lower half of the peak accurately and ERA5 which reproduced the upper half of the peak accurately. Above 5 km a.s.l., the cloud occurrence was well reproduced in ERA5 and JRA-55 but strongly overestimated in AMPS, MERRA-2 and the UM. The reanalyses and models also tended to have two peaks at about 2 and 11 km a.s.l., but these were quite different from the observed peaks, with the lower peak underestimated by about 5 pp in the reanalyses and models and the higher peak overestimated by about 5–10 pp. The total CF was observed as 45 % and 60 % by CL31 and MiniMPL, respectively. CF observed by the MiniMPL was likely higher due to its higher maximum lidar range (CL31 missed substantial amounts of high cloud due to this limitation). The total CF was strongly underestimated by the reanalyses and models by up to 31 pp (CL31) and 28 pp (MiniMPL), with the exception of the UM which simulated the correct CF within 3 pp.

7.3 Christchurch

The Christchurch observations were made during a total of 23 d in middle to late winter. The cloud situations were characterised by the frequent occurrence of low cloud and fog, with relatively diverse mid-level and high-level cloud and periods of clear sky also present (not shown). Precipitation was present in about 9 % of profiles and fog in about 11 % of profiles. As the site location is relatively flat (Canterbury Plains), the models did not have any difficulty in reproducing the altitude of the site, which was 32 m (AMPS), 72 m (ERA5), 143 m (JRA-55) and 76 m (MERRA-2). The virtual location was within the boundaries of the city (AMPS), on the Canterbury Plains close to the city boundaries (ERA5, MERRA-2) and over Lake Ellesmere about 20 km from the city (JRA-55). Figure 3d shows that the co-located CHM 15k and MiniMPL observed a strong peak of cloud occurrence of 26 % (CHM 15k) at about 500 m a.s.l. This was likely due to the combined precipitation and fog as well as false detection of aerosol as cloud. The observed cloud occurrence had a local minimum of 2 % at about 5 km a.s.l. and a secondary peak of 5 % at 7 km a.s.l., and fell off to 0 % at 11 km a.s.l. The CHM 15k and MiniMPL observations showed inconsistencies of up to 4 pp. The reanalyses and models underestimated low cloud by 5–10 pp. With the exception of AMPS, they underestimated mid-level cloud by about 5 pp and represented high cloud relatively accurately. The total CF observed was 68 %, while the reanalyses and models strongly underestimated CF by up to 34 pp (JRA-55), with common underestimates of around 20 pp.

7.4 Backscattering on daily scales

Figures 4, 5 and 6 show images of the attenuated volume backscattering coefficient for three separate days taken from the three case studies. The selected days represent some of the best-matching profiles and demonstrate how well the reanalyses and models can simulate cloud under favourable conditions. As can be seen in the figures, ERA5 and the UM perform the best in terms of the temporal and height accuracy of the simulated cloud (Figs. 4c, 4f, 5c, 5f, 6c). This is likely due to the high output temporal resolution of the UM and ERA5 of 20 min and 1 h, respectively. The UM and ERA5 were able to represent the relatively fine structure of cloud and to a lesser extent the optical thickness (inferred from the strength of backscattering) of the cloud. Deficiencies, however, are readily identifiable. The low cloud in the UM (Fig. 4f) covers too large an area relative to observations (Fig. 4a) and the high cloud has a greater vertical extent in the UM. Likewise, the altocumulus cloud observed in Fig. 5a is shifted by several hours in the UM (Fig. 5f). The stratocumulus and nimbostratus cloud, visually identified based on the attenuated volume backscattering coefficient profiles, in ERA5 (Fig. 4c) is markedly lower than observed (Fig. 4a), as well as optically thicker than in reality. The mid-level cloud in ERA5 (Fig. 5c) was located about 2 km higher than observed (Fig. 5a). Precipitation observed in Fig. 6a towards the end of the analysed period was not present in the ERA5 simulated profile (Fig. 6c) due to lack of precipitation simulation in the current lidar simulator (even though rain- and snow-specific content is available from the reanalysis). AMPS and MERRA-2 had lower cloud representation accuracy. They managed to capture the overall structure of clouds (Figs. 4b, 4d, 5b, 5d, 6b, 6d), but substantial discrepancies were present, some of which were likely due to the relatively low temporal resolution of 3 h. AMPS, however, has a relatively high horizontal grid resolution of 21 km. This demonstrates that factors in the model other than resolution have a stronger influence on the quality of cloud simulation. JRA-55 ranked last in terms of cloud representation accuracy. JRA-55 has the lowest temporal resolution of the studied reanalyses and models of just 6 h, as well as the lowest horizontal grid resolution of 139 km. Therefore, it cannot be expected to capture any fine details of cloud. In the presented profiles (Figs. 4e, 5e, 6e) one can see that the cloud is only crudely represented. JRA-55 was able to represent the stratocumulus cloud in Fig. 4a, although its temporal extent and optical thickness were overestimated. The mid-level clouds in Figs. 5a and 6a were relatively well represented in terms of height and optical thickness given the low temporal resolution of the reanalysis. We stress that a direct attenuated volume backscattering coefficient profile intercomparison is highly dependent on the temporal resolution of the model output. The statistical intercomparison, however, should still give unbiased results if the cloud physics are accurately simulated by the atmospheric model.

Figures 4a, 5a and 6a also show the effective LR of observations calculated by vertically integrating the attenuated volume backscattering coefficient (Sect. 4.1). If the attenuated volume backscattering coefficient is properly calibrated, under fully attenuating cloud conditions the effective LR converges to the theoretical value of the LR of liquid cloud droplets (approximately 18.8 sr at near-IR wavelengths) multiplied by the multiple-scattering coefficient (approximately 0.7; Sect. 4.5).
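The convergence described above can be checked numerically with a small sketch. It assumes the common definition of the effective LR as the reciprocal of twice the vertically integrated attenuated volume backscattering coefficient; the function name and the idealised homogeneous cloud used in the usage example are hypothetical.

```python
import numpy as np


def effective_lidar_ratio(beta_att, z):
    """Effective lidar ratio (sr) from a single profile.

    Assumes LR_eff = 1 / (2 * integral of beta_att over height), with the
    integral evaluated by the trapezoidal rule.
    beta_att: attenuated volume backscattering coefficient (m-1 sr-1).
    z:        height of each range gate (m), increasing.
    """
    beta_att = np.asarray(beta_att, dtype=float)
    z = np.asarray(z, dtype=float)
    integral = np.sum(0.5 * (beta_att[1:] + beta_att[:-1]) * np.diff(z))
    return 1.0 / (2.0 * integral)


# Idealised fully attenuating liquid cloud starting at the instrument.
S = 18.8      # lidar ratio of liquid cloud droplets (sr)
eta = 0.7     # multiple-scattering coefficient
beta = 1e-3   # hypothetical volume backscattering coefficient (m-1 sr-1)
z = np.linspace(0.0, 2000.0, 20001)
beta_att = beta * np.exp(-2.0 * eta * S * beta * z)
lr_eff = effective_lidar_ratio(beta_att, z)  # close to eta * S = 13.16 sr
```

For this synthetic profile the integral equals 1/(2ηS) analytically, so the effective LR approaches 0.7 × 18.8 sr ≈ 13.2 sr.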

7.5 Molecular backscattering, aerosol backscattering and noise

Figure 7 shows attenuated volume backscattering coefficient histograms as a function of height for small values of the coefficient (up to 2 × 10⁻⁶ m⁻¹ sr⁻¹) observed and simulated at the sites of the case studies, calculated for the entire time period of each case study. The scale of values is below cloud backscattering and therefore shows backscattering which results from molecular and aerosol scattering and noise. Molecular backscattering depends on the atmospheric pressure and temperature as well as the lidar wavelength. It causes the main “streak” (a local maximum) visible in each of the histograms. The observed molecular attenuated volume backscattering coefficient at the surface approximately corresponds to the theoretically calculated value at each wavelength: 0.0906 × 10⁻⁶ m⁻¹ sr⁻¹ (λ = 1064 nm), 0.172 × 10⁻⁶ m⁻¹ sr⁻¹ (λ = 910 nm) and 1.54 × 10⁻⁶ m⁻¹ sr⁻¹ (λ = 532 nm) at 1000 hPa and 20 °C (Table 5). The molecular backscattering in the boundary layer is, however, superimposed on backscattering by aerosol and cloud. In the case of the MiniMPL observations at the Christchurch site (Fig. 7i), the molecular attenuated volume backscattering coefficient streak has multiple secondary streaks. These are caused by different levels of attenuation by cloud and aerosol during the period of the observations. These secondary streaks were also partially reproduced by the simulator (Fig. 7j). A smaller portion of the width of the streak is also caused by fluctuations of atmospheric temperature and pressure. Under suitable conditions, the molecular attenuated volume backscattering coefficient can be used for absolute calibration of an instrument. With the exception of CL31 (Fig. 7c), the molecular backscattering can be identified in the observed attenuated volume backscattering coefficient in each case. Therefore, it is possible to choose a calibration coefficient such that the observed and simulated molecular attenuated volume backscattering coefficients overlap. This can be considered a viable alternative to the liquid stratocumulus LR calibration method or as a means of cross-validating the instrument calibration. However, it should be noted that the accuracy of this method is affected by an unknown amount of aerosol attenuation. Cloudy profiles can be filtered when calculating the histogram, and therefore the effect of cloud attenuation can be minimised. In addition to the molecular attenuated volume backscattering coefficient streak, there is a zero-centred streak visible in the histograms. This is caused by noise when the signal is fully attenuated by cloud. Lastly, a zero-centred “cone” of noise is visible in the observed attenuated volume backscattering coefficient, increasing with the square of range. The size of this cone is particularly large in the case of the CL31 (Fig. 7c), which is most likely the result of its low receiver sensitivity and low power compared to the other instruments. The standard deviation of the cone at the furthest range is used to determine the noise standard deviation used by the cloud detection algorithm (Sect. 5.3).

Figure 7. Attenuated volume backscattering coefficient histograms as a function of height observed and simulated at three different sites for the case studies calculated from all profiles. The plots show the distribution of the attenuated volume backscattering coefficient for values which are on the scale of noise, molecular and aerosol backscattering ([−0.5, 0.5] for CHM 15k, [−1, 1] for CL31 and CL51 and [−2, 2] for MiniMPL, all in units of 10⁻⁶ m⁻¹ sr⁻¹). The simulated attenuated volume backscattering coefficient is based on the ERA5 atmospheric fields. Molecular backscattering (the main “streak”), noise when the signal is fully attenuated by cloud (the zero-centred “streak”) and the range-dependent noise (the zero-centred “cone”) are also visible in the plots. The molecular backscattering is marked by a red dashed line on the observed attenuated volume backscattering coefficient plots, the shape of which is taken from the simulated molecular attenuated volume backscattering coefficient for the corresponding instrument and site.


Figure 8. The same as Fig. 7 but calculated from clear-sky profiles only.


Figure 8 shows the same information as Fig. 7 but for clear-sky profiles only. Here, it can be seen that the zero-centred peak caused by the complete attenuation by cloud is no longer present. There is a clear overlap between the centre of the noise cone and the simulated molecular attenuated volume backscattering coefficient; i.e. the noise cone is centred at the observed molecular attenuated volume backscattering coefficient. This is visible with all instruments including CL31 (Fig. 8c), for which the overlap between the observed and simulated molecular attenuated volume backscattering coefficient is most clearly visible at about 1 km a.s.l. Below 1 km a.s.l., the effect of boundary layer aerosol distorts the molecular attenuated volume backscattering coefficient by an unknown quantity. The clear-sky histograms as shown in Fig. 8 may therefore be preferable to the all-sky histograms in Fig. 7 for calibration by fitting the molecular attenuated volume backscattering coefficient. The dead time, after-pulse and overlap MiniMPL calibration supplied by the vendor appears to be deficient and causes range-dependent bias in the attenuated volume backscattering coefficient profile.
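Calibration by fitting the molecular attenuated volume backscattering coefficient could be sketched as follows. This is an illustrative assumption rather than the procedure implemented in the ALCF: the function name, the ratio-of-means estimator and the upper height limit are hypothetical, while the 1 km lower limit follows the observation above that boundary layer aerosol distorts the signal below that height.

```python
import numpy as np


def molecular_calibration_factor(beta_obs, beta_mol_sim, z,
                                 z_min=1000.0, z_max=4000.0):
    """Factor c such that c * beta_obs matches beta_mol_sim on average.

    beta_obs:     observed clear-sky attenuated volume backscattering
                  coefficient, shape (time, height) or (height,).
    beta_mol_sim: simulated molecular attenuated volume backscattering
                  coefficient, shape (time, height) or (height,).
    z:            height of each range gate (m a.s.l.).
    z_min, z_max: height interval used for the fit (hypothetical defaults).
    """
    z = np.asarray(z, dtype=float)
    sel = (z >= z_min) & (z <= z_max)
    # Average over profiles first so that zero-centred noise cancels out.
    obs = np.nanmean(np.atleast_2d(np.asarray(beta_obs, dtype=float))[:, sel], axis=0)
    sim = np.nanmean(np.atleast_2d(np.asarray(beta_mol_sim, dtype=float))[:, sel], axis=0)
    return np.sum(sim) / np.sum(obs)
```

A factor close to 1 for already calibrated data would support the existing calibration; a factor far from 1 would point to a calibration or overlap problem, subject to the unknown aerosol attenuation noted above.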

Figure 9. Attenuated volume backscattering coefficient noise standard deviation histogram calculated for each instrument for sites in the case studies from clear-sky profiles over the whole time period. The noise distribution is calculated at the furthest range. The range-scaled noise distribution is shown at a range of 8 km. “Night” and “day” distributions are calculated separately from nighttime and daytime profiles only.


We now examine the noise in each instrument using the ALCF. Figure 9 shows the distribution of the standard deviation of backscatter noise determined at the highest observable range of each instrument and range-scaled to 8 km. It can be seen that the CL31 is affected by the greatest amount of noise, peaking at about 2 × 10⁻⁶ m⁻¹ sr⁻¹. This is at the threshold of cloud detection of 2 × 10⁻⁶ m⁻¹ sr⁻¹. Therefore, thin cloud may be obscured by noise at higher ranges with this instrument. The MiniMPL, operating in the visible spectral range, shows a strongly bimodal distribution of the attenuated volume backscattering coefficient noise depending on sunlight. During daytime, it peaks at about 0.7 × 10⁻⁶ m⁻¹ sr⁻¹, which is the second highest of the analysed instruments. During nighttime, it peaks at about 0.02 × 10⁻⁶ m⁻¹ sr⁻¹, which is the lowest of the analysed instruments. The CHM 15k and CL51 peak between the nighttime and daytime MiniMPL at about 0.05 × 10⁻⁶ m⁻¹ sr⁻¹. The CL31, CL51 and CHM 15k show a slight reduction of noise during nighttime, presumably because of a small amount of incoming solar radiation at near-IR wavelengths. The difference between the nighttime and daytime attenuated volume backscattering coefficient noise in the MiniMPL has been previously analysed by Silber et al. (2018) (Fig. S3), and these results confirm their findings.
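The range scaling used for this comparison can be illustrated with a short sketch. It assumes, as stated in Sect. 7.5, that the noise of the range-corrected signal grows with the square of range; the function name and the number of far-range gates used for the estimate (`n_far`) are hypothetical.

```python
import numpy as np


def noise_sd_at_range(beta_att, r, r_ref=8000.0, n_far=10):
    """Per-profile noise standard deviation scaled to a reference range.

    beta_att: attenuated volume backscattering coefficient, shape (time, range).
    r:        range of each gate (m), increasing.
    r_ref:    reference range (m) to which the noise is scaled.
    n_far:    number of furthest range gates used to estimate the noise
              (hypothetical choice for this sketch).
    """
    beta_att = np.asarray(beta_att, dtype=float)
    r = np.asarray(r, dtype=float)
    sd_far = np.std(beta_att[:, -n_far:], axis=1)
    r_far = np.mean(r[-n_far:])
    # Noise of the range-corrected signal is assumed to scale with range squared.
    return sd_far * (r_ref / r_far) ** 2
```

Scaling to a common range is what makes instruments with different maximum ranges, such as the CL31 and the MiniMPL, comparable in a single histogram.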

8 Discussion and conclusions

We presented the Automatic Lidar and Ceilometer Framework, which combines lidar processing and lidar simulation for the purpose of model evaluation. The lidar simulator is based on the COSP spaceborne lidar simulator, adapted to account for the different geometry and lidar wavelength. We calculated new lookup tables for Mie scattering for a number of ALC wavelengths, developed an ice crystal backscattering parameterisation based on temperature, and implemented noise removal and cloud detection algorithms. The framework supports the most common ALCs and reanalyses. We demonstrated the use of the framework on ALC observations at three different sites in New Zealand and applied the lidar simulator to three reanalyses and two models. We found that while some reanalyses and models such as the UM and ERA5 show relatively good correspondence with observed cloud, others performed relatively poorly in our time-limited local comparison. The reanalyses and models generally underestimated the total CF, by up to 34 pp, with common underestimation by about 20 pp. In some cases, the observed and simulated attenuated volume backscattering coefficient profiles matched relatively closely in terms of time and altitude, and a better match was observed with reanalyses and models with high output temporal resolution such as the UM and ERA5, while reanalyses with low temporal resolution did not allow for reliable direct (non-statistical) comparison of cloud. However, it is clear that factors other than the horizontal and vertical resolution influence the cloud simulation accuracy, especially the cloud, boundary layer and convection schemes employed by the atmospheric model. The reanalysis and model output temporal resolution, horizontal grid resolution and vertical resolution are not always the same as the internal resolution of the underlying atmospheric model. Both have an impact on the comparison between the simulated and observed attenuated volume backscattering coefficient and cloud. While the output resolution should not have an impact on the long-term statistics, it can be a limiting factor for direct attenuated volume backscattering coefficient profile comparison. We demonstrated that the ALCF could be used to identify substantial differences in the cloud attenuated volume backscattering coefficient which were present in all reanalyses and models. We showed that all the studied instruments except for the CL31 are capable of detecting molecular backscattering and that this can be used for calibration or cross-validation of other calibration methods. We found that the nighttime MiniMPL was subject to the lowest amount of noise of all the instruments examined, followed by the CL51, CHM 15k, daytime MiniMPL and CL31. Noise in the MiniMPL, and to a lesser extent in the other ALCs, was shown to have a bimodal distribution due to daytime–nighttime differences. The ALCF can therefore be useful for testing the quality of collected data.

Currently the framework has several limitations which should be addressed in the future. The water vapour absorption at 910 nm likely affects the instrument calibration of the CL31 and CL51 ceilometers and limits the accuracy of the one-to-one comparison, even though due to the relatively high backscattering caused by cloud, the calculated cloud masks are unlikely to be strongly affected. The lidar simulator currently does not simulate backscattering from precipitation. Observed precipitation is generally detected as “cloud” by the cloud detection algorithm, while the simulated profile contains no backscattering at the location of precipitation (backscattering and attenuation by raindrops and snow should be implemented in the lidar simulator in the future). If desired, the attenuated volume backscattering coefficient profiles affected by precipitation can be excluded before the comparison, or their fraction can be determined by visually inspecting the observed attenuated volume backscattering coefficient to assess their possible effect on the statistical results. Aerosol is also not currently implemented in the simulator. Previous studies (Chan et al., 2018) characterised optical parameters of different groups of aerosol, which could be used in a future version of the simulator with models which provide the concentration of aerosol in their output. In our case studies the aerosol volume backscattering coefficient was less than 2 × 10⁻⁶ m⁻¹ sr⁻¹ and confined below 4 km, which could result in worst-case two-way attenuation of about 50 % assuming LR of 50 sr. This should not preclude cloud detection due to the large magnitude of typical cloud backscattering. The ALCs also suffer from various measurement deficiencies. Notably, incomplete overlap, dead time and after-pulse corrections tend to give sub-optimal results at the near range. It is possible to use semi-automated methods to correct for these deficiencies, such as by calculating the integrated attenuated volume backscattering coefficient distribution via the height of the maximum backscattering and correcting for the range-dependent bias (Hopkin et al., 2019, Sect. 5.1). This method could be implemented in the framework to enable range-dependent calibration of the observed attenuated volume backscattering coefficient.
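The worst-case aerosol attenuation quoted above can be reproduced with a back-of-the-envelope calculation. The sketch assumes a uniform aerosol layer with the maximum backscattering coefficient throughout its depth; the function name is hypothetical.

```python
import math


def two_way_attenuation(beta, lidar_ratio, depth):
    """Fraction of signal lost on the two-way path through a uniform layer.

    beta:        volume backscattering coefficient (m-1 sr-1).
    lidar_ratio: extinction-to-backscatter ratio (sr).
    depth:       layer depth (m).
    """
    extinction = lidar_ratio * beta        # volume extinction coefficient (m-1)
    optical_depth = extinction * depth
    return 1.0 - math.exp(-2.0 * optical_depth)


# Worst case from the case studies: 2e-6 m-1 sr-1 over 4 km with LR = 50 sr
# gives an optical depth of 0.4 and a two-way attenuation of about 0.55.
worst_case = two_way_attenuation(2e-6, 50.0, 4000.0)
```

Because the assumed layer is uniform at the upper bound of the observed aerosol backscattering, the actual attenuation in the case studies would be lower.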

The presented framework streamlines lidar data processing and tasks related to lidar simulation and model comparison. The framework was recently used by Kuma et al. (2020) for Southern Ocean model cloud evaluation in the GA7.1 model and MERRA-2 reanalysis. Considering the existing extensive ALC networks worldwide there is a wealth of global data. We therefore think that ALCs should have a greater role in model evaluation. Satellite observations have long been established in this respect due to their availability, spatial and temporal coverage, and well-developed derived products and tools. ALCs, with their diverse formats and decentralised nature, have so far lacked derived products and tools which would make them more accessible for model evaluation. We hope that this software will enable more model evaluation studies based on ALC observations. Development of lidar data processing is currently hampered by closed development of code. We note that code has very rarely been made available with past ALC studies. Continued improvement of publicly available code for lidar data processing is needed to achieve faster development of ground-based remote sensing and make it more attractive for GCM, NWP model and reanalysis evaluation.

Code and data availability

The ALCF is open-source and available at (last access: 1 January 2021) as well as in a permanent archive of code and technical documentation on Zenodo at (Kuma et al., 2021). The technical documentation is also in the Supplement. A tool for converting Vaisala CL31 and CL51 data files to NetCDF, cl2nc, is open-source and available at (Kuma, 2020a). A tool for converting MiniMPL raw binary data files to NetCDF, mpl2nc, is open-source and available at (Kuma, 2020b). The observational data used in the case studies are available upon request. The reanalyses data used in the case studies are publicly available online from the respective projects. The Unified Model data used in the case studies are available upon request. The Unified Model is proprietary to the UK Met Office and is made available under a licence. For more information, readers are advised to contact the UK Met Office.


The supplement related to this article is available online at:

Author contributions

PK wrote the code of the framework, performed the data analysis of the case studies and wrote the text of the paper. AJM and OM provided continuous scientific input on the code development, analysis and text of the paper. RQ, IS and CJF provided calibration of the MiniMPL data and substantial discussion of the theoretical concepts. All authors reviewed the paper.

Competing interests

The authors declare that they have no conflict of interest.


Acknowledgements

We would like to thank the editor, Volker Grewe, and two anonymous referees. We would like to acknowledge the following: the New Zealand eScience Infrastructure (NeSI), which provided supercomputing resources to run the Unified Model; Vidya Varma, Jonny Williams, Guang Zeng and Wolfgang Hayek for their contribution to setting up a nudged run of the Unified Model; Graeme Plank and Graeme MacDonald, who participated in the installation of the Vaisala CL51 at the Cass field station; the COSP project for the code which we used as the basis for the lidar simulator; the AMPS, JRA-55, ERA5 and MERRA-2 models and reanalyses, which provided public access to their data; the open-source libraries NumPy (Van Der Walt et al., 2011), SciPy (Virtanen et al., 2019), matplotlib (Hunter, 2007), netCDF4 (Rew and Davis, 1990) and Astropy (Price-Whelan et al., 2018) as well as the Python programming language (Rossum, 1995), which we used in the implementation of our code; the R programming language (R Core Team, 2017); the Natural Earth dataset (last access: 1 January 2021); the Shuttle Radar Topography Mission (SRTM) version 3 global 1 arc second digital elevation model (Werner, 2001; NASA JPL, 2013), which we used to produce a map of sites; GitHub, which provided free hosting of our code; and the Linux-based (Torvalds, 1997) operating systems Devuan GNU+Linux and Debian GNU/Linux on which we produced this analysis.

Financial support

This research has been supported by the New Zealand Deep South National Science Challenge Clouds and Aerosols project as well as the NeSI collaborator institutions and Ministry of Business, Innovation & Employment Research Infrastructure programme, New Zealand.

Review statement

This paper was edited by Volker Grewe and reviewed by two anonymous referees.

References

Baars, H., Kanitz, T., Engelmann, R., Althausen, D., Heese, B., Komppula, M., Preißler, J., Tesche, M., Ansmann, A., Wandinger, U., Lim, J.-H., Ahn, J. Y., Stachlewska, I. S., Amiridis, V., Marinou, E., Seifert, P., Hofer, J., Skupin, A., Schneider, F., Bohlmann, S., Foth, A., Bley, S., Pfüller, A., Giannakaki, E., Lihavainen, H., Viisanen, Y., Hooda, R. K., Pereira, S. N., Bortoli, D., Wagner, F., Mattis, I., Janicka, L., Markowicz, K. M., Achtert, P., Artaxo, P., Pauliquevis, T., Souza, R. A. F., Sharma, V. P., van Zyl, P. G., Beukes, J. P., Sun, J., Rohwer, E. G., Deng, R., Mamouri, R.-E., and Zamorano, F.: An overview of the first decade of PollyNET: an emerging network of automated Raman-polarization lidars for continuous aerosol profiling, Atmos. Chem. Phys., 16, 5111–5137, 2016.

Baran, A. J.: A review of the light scattering properties of cirrus, J. Quant. Spectrosc. Ra., 110, 1239–1260, 2009.

Bastin, S., Chiriaco, M., and Drobinski, P.: Control of radiation and evaporation on temperature variability in a WRF regional climate simulation: comparison with colocated long term ground based observations near Paris, Clim. Dynam., 51, 985–1003, 2018.

Bi, L., Yang, P., Kattawar, G. W., Baum, B. A., Hu, Y. X., Winker, D. M., Brock, R. S., and Lu, J. Q.: Simulation of the color ratio associated with the backscattering of radiation by ice particles at the wavelengths of 0.532 and 1.064 µm, J. Geophys. Res., 114, D00H08, 2009.

Bodas-Salcedo, A., Webb, M., Bony, S., Chepfer, H., Dufresne, J.-L., Klein, S., Zhang, Y., Marchand, R., Haynes, J., Pincus, R., and John, V. O.: COSP: Satellite simulation software for model assessment, B. Am. Meteorol. Soc., 92, 1023–1043, 2011.

Borovoi, A., Konoshonkin, A., and Kustova, N.: Backscatter ratios for arbitrary oriented hexagonal ice crystals of cirrus clouds, Opt. Lett., 39, 5788–5791, 2014.

Bouniol, D., Protat, A., Delanoë, J., Pelon, J., Piriou, J.-M., Bouyssel, F., Tompkins, A. M., Wilson, D. R., Morille, Y., Haeffelin, M., O’Connor, E. J., Hogan, R. J., Illingworth, A. J., Donovan, D. P., and Baltink, H.: Using continuous ground-based radar and lidar measurements for evaluating the representation of clouds in four operational models, J. Appl. Meteorol. Climatol., 49, 1971–1991, 2010.

Bréon, F.-M. and Doutriaux-Boucher, M.: A comparison of cloud droplet radii measured from space, IEEE T. Geosci. Remote, 43, 1796–1805, 2005.

Bréon, F.-M. and Colzy, S.: Global distribution of cloud droplet effective radius from POLDER polarization measurements, Geophys. Res. Lett., 27, 4065–4068, 2000.

Campbell, J. R., Hlavka, D. L., Welton, E. J., Flynn, C. J., Turner, D. D., Spinhirne, J. D., Scott III, V. S., and Hwang, I.: Full-time, eye-safe cloud and aerosol lidar observation at atmospheric radiation measurement program sites: Instruments and data processing, J. Atmos. Ocean. Tech., 19, 431–442, 2002.

Cazorla, A., Casquero-Vera, J. A., Román, R., Guerrero-Rascado, J. L., Toledano, C., Cachorro, V. E., Orza, J. A. G., Cancillo, M. L., Serrano, A., Titos, G., Pandolfi, M., Alastuey, A., Hanrieder, N., and Alados-Arboledas, L.: Near-real-time processing of a ceilometer network assisted with sun-photometer data: monitoring a dust outbreak over the Iberian Peninsula, Atmos. Chem. Phys., 17, 11861–11876, 2017.

Chan, K. L., Wiegner, M., Flentje, H., Mattis, I., Wagner, F., Gasteiger, J., and Geiß, A.: Evaluation of ECMWF-IFS (version 41R1) operational model forecasts of aerosol transport by using ceilometer network measurements, Geosci. Model Dev., 11, 3807–3831, 2018.

Chang, F. and Li, Z.: The effect of droplet size distribution on the determination of cloud droplet effective radius, in: 11th ARM Science Team Meeting, Atlanta, Ga, 19–23, 2001.

Chepfer, H., Chiriaco, M., Vautard, R., and Spinhirne, J.: Evaluation of MM5 optically thin clouds over Europe in fall using ICESat lidar spaceborne observations, Mon. Weather Rev., 135, 2737–2753, 2007.

Chepfer, H., Bony, S., Winker, D., Chiriaco, M., Dufresne, J.-L., and Sèze, G.: Use of CALIPSO lidar observations to evaluate the cloudiness simulated by a climate model, Geophys. Res. Lett., 35, L15704, 2008.

Chiriaco, M., Vautard, R., Chepfer, H., Haeffelin, M., Dudhia, J., Wanherdrick, Y., Morille, Y., and Protat, A.: The ability of MM5 to simulate ice clouds: Systematic comparison between simulated and measured fluxes and lidar/radar profiles at the SIRTA atmospheric observatory, Mon. Weather Rev., 134, 897–918, 2006.

Chiriaco, M., Dupont, J.-C., Bastin, S., Badosa, J., Lopez, J., Haeffelin, M., Chepfer, H., and Guzman, R.: ReOBS: a new approach to synthesize long-term multi-variable dataset and application to the SIRTA supersite, Earth Syst. Sci. Data, 10, 919–940, 2018.

Costa-Surós, M., Calbó, J., González, J., and Martin-Vide, J.: Behavior of cloud base height from ceilometer measurements, Atmos. Res., 127, 64–76, 2013.

Cromwell, E. and Flynn, D.: Lidar Cloud Detection With Fully Convolutional Networks, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 619–627, 2019.

Dee, D. P., Uppala, S. M., Simmons, A., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M., Balsamo, G., Bauer, P., et al.: The ERA-Interim reanalysis: Configuration and performance of the data assimilation system, Q. J. Roy. Meteorol. Soc., 137, 553–597, 2011.

Diner, D. J., Beckert, J. C., Reilly, T. H., Bruegge, C. J., Conel, J. E., Kahn, R. A., Martonchik, J. V., Ackerman, T. P., Davies, R., Gerstl, S. A. W., Gordon, H. R., Muller, J.-P., Myneni, R. B., Sellers, P. J., Pinty, B., and Verstraete, M. M.: Multi-angle Imaging SpectroRadiometer (MISR) instrument description and experiment overview, IEEE T. Geosci. Remote, 36, 1072–1087, 1998.

Dionisi, D., Barnaba, F., Diémoz, H., Di Liberto, L., and Gobbi, G. P.: A multiwavelength numerical model in support of quantitative retrievals of aerosol properties from automated lidar ceilometers and test applications for AOT and PM10 estimation, Atmos. Meas. Tech., 11, 6013–6042, 2018.

Ebita, A., Kobayashi, S., Ota, Y., Moriya, M., Kumabe, R., Onogi, K., Harada, Y., Yasui, S., Miyaoka, K., Takahashi, K., Kamahori, H., Kobayashi, C., Endo, H., Soma, M., Oikawa, Y., and Ishimizu, T.: The Japanese 55-year reanalysis “JRA-55”: an interim report, Sola, 7, 149–152, 2011.

ECMWF: Copernicus Climate Change Service (C3S) (2017): ERA5: Fifth generation of ECMWF atmospheric reanalyses of the global climate, Copernicus Climate Change Service Climate Data Store (CDS) (last access: 1 January 2021), 2019.

Edwards, J. and Slingo, A.: Studies with a flexible new radiation code. I: Choosing a configuration for a large-scale model, Q. J. Roy. Meteorol. Soc., 122, 689–719, 1996.

Emeis, S.: Surface-based remote sensing of the atmospheric boundary layer, vol. 40, Springer Science & Business Media, 2010.

Emeis, S., Schäfer, K., and Münkel, C.: Observation of the structure of the urban boundary layer with different ceilometers and validation by RASS data, Meteorol. Z., 18, 149–154, 2009.

Eresmaa, N., Karppinen, A., Joffre, S. M., Räsänen, J., and Talvitie, H.: Mixing height determination by ceilometer, Atmos. Chem. Phys., 6, 1485–1493, 2006.

Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, 2016.

Eyring, V., Cox, P. M., Flato, G. M., Gleckler, P. J., Abramowitz, G., Caldwell, P., Collins, W. D., Gier, B. K., Hall, A. D., Hoffman, F. M., Hurtt, G. C., Jahn, A., Jones, C. D., Klein, S. A., Krasting, J. P., Kwiatkowski, L., Lorenz, R., Maloney, E., Meehl, G. A., Pendergrass, A. G., Pincus, R., Ruane, A. C., Russell, J. L., Sanderson, B. M., Santer, B. D., Sherwood, S. C., Simpson, I. R., Stouffer, R. J., and Williamson, M. S.: Taking climate model evaluation to the next level, Nat. Clim. Change, 9, 102–110, 2019.

Flynn, C. J., Mendoza, A., Zheng, Y., and Mathur, S.: Novel polarization-sensitive micropulse lidar measurement technique, Opt. Exp., 15, 2785–2790, 2007.

Fu, D., Di Girolamo, L., Liang, L., and Zhao, G.: Regional Biases in MODIS Marine Liquid Water Cloud Drop Effective Radius Deduced Through Fusion With MISR, J. Geophys. Res.-Atmos., 124, 13182–13196, 2019.

Garnier, A., Pelon, J., Vaughan, M. A., Winker, D. M., Trepte, C. R., and Dubuisson, P.: Lidar multiple scattering factors inferred from CALIPSO lidar and IIR retrievals of semi-transparent cirrus cloud optical depths over oceans, Atmos. Meas. Tech., 8, 2759–2774, 2015.

Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The modern-era retrospective analysis for research and applications, version 2 (MERRA-2), J. Climate, 30, 5419–5454, 2017.

Geleyn, J. and Hollingsworth, A.: An economical analytical method for the computation of the interaction between scattering and line absorption of radiation, Contributions to Atmospheric Physics, 52, 1–16, 1979.

Goody, R. M. and Yung, Y. L.: Atmospheric radiation: theoretical basis, Oxford University Press, New York, NY, USA, 2 edn., 1995.

Hansen, A., Ament, F., Grützun, V., and Lammert, A.: Model evaluation by a cloud classification based on multi-sensor observations, Geosci. Model Dev. Discuss., 2018a.

Hansen, A., Ament, F., Grützun, V., and Lammert, A.: Model evaluation by a cloud classification based on multi-sensor observations, Geosci. Model Dev. Discuss., 2018b.

Harada, Y., Kamahori, H., Kobayashi, C., Endo, H., Kobayashi, S., Ota, Y., Onoda, H., Onogi, K., Miyaoka, K., and Takahashi, K.: The JRA-55 Reanalysis: Representation of atmospheric circulation and climate variability, J. Meteorol. Soc. Japan. Ser. II, 94, 269–302, 2016.

Heese, B., Flentje, H., Althausen, D., Ansmann, A., and Frey, S.: Ceilometer lidar comparison: backscatter coefficient retrieval and signal-to-noise ratio determination, Atmos. Meas. Tech., 3, 1763–1770, 2010.

Heymsfield, A. J.: Extinction-ice water content-effective radius algorithms for CALIPSO, Geophys. Res. Lett., 32, L10807, 2005.

Hines, K. M. and Bromwich, D. H.: Development and testing of Polar Weather Research and Forecasting (WRF) model. Part I: Greenland ice sheet meteorology, Mon. Weather Rev., 136, 1971–1989, 2008.

Hogan, R. J.: Fast approximate calculation of multiply scattered lidar returns, Appl. Opt., 45, 5984–5992, 2006.

Hogan, R. J., Jakob, C., and Illingworth, A. J.: Comparison of ECMWF Winter-Season Cloud Fraction with Radar-Derived Values, J. Appl. Meteorol., 40, 513–525, 2001.

Hopkin, E.: Use of a calibrated ceilometer network to improve high resolution weather forecasts, Ph.D. thesis, University of Reading, UK, 2018.

Hopkin, E., Illingworth, A. J., Charlton-Perez, C., Westbrook, C. D., and Ballard, S.: A robust automated technique for operational calibration of ceilometers using the integrated backscatter from totally attenuating liquid clouds, Atmos. Meas. Tech., 12, 4131–4147, 2019.

Hourdin, F., Mauritsen, T., Gettelman, A., Golaz, J.-C., Balaji, V., Duan, Q., Folini, D., Ji, D., Klocke, D., Qian, Y., Rauser, F., Rio, C., Tomassini, L., Watanabe, M., and Williamson, D.: The art and science of climate model tuning, B. Am. Meteorol. Soc., 98, 589–602, 2017.

Hu, Y.: Depolarization ratio–effective lidar ratio relation: Theoretical basis for space lidar cloud phase discrimination, Geophys. Res. Lett., 34, L11812, 2007.

Hu, Y., Vaughan, M., McClain, C., Behrenfeld, M., Maring, H., Anderson, D., Sun-Mack, S., Flittner, D., Huang, J., Wielicki, B., Minnis, P., Weimer, C., Trepte, C., and Kuehn, R.: Global statistics of liquid water content and effective number concentration of water clouds over ocean derived from combined CALIPSO and MODIS measurements, Atmos. Chem. Phys., 7, 3353–3359, 2007.

Hunter, J. D.: Matplotlib: A 2D graphics environment, Comput. Sci. Eng., 9, 90, 2007.

Illingworth, A., Hogan, R., O'Connor, E., Bouniol, D., Brooks, M., Delanoë, J., Donovan, D., Eastment, J., Gaussiat, N., Goddard, J. W. F., Haeffelin, M., Baltink, H. K., Krasnov, O. A., Pelon, J., Piriou, J.-M., Protat, A., Russchenberg, H. W. J., Seifert, A., Tompkins, A. M., van Zadelhoff, G.-J., Vinit, F., Willén, U., Wilson, D. R., and Wrench, C. L.: Cloudnet: Continuous evaluation of cloud profiles in seven operational models using ground-based observations, B. Am. Meteorol. Soc., 88, 883–898, 2007.

Illingworth, A., Cimini, D., Haefele, A., Haeffelin, M., Hervo, M., Kotthaus, S., Löhnert, U., Martinet, P., Mattis, I., O’Connor, E. J., and Potthast, R.: How can Existing Ground-Based Profiling Instruments Improve European Weather Forecasts?, B. Am. Meteorol. Soc., 100, 605–619, 2018.

Illingworth, A. J., Barker, H., Beljaars, A., Ceccaldi, M., Chepfer, H., Clerbaux, N., Cole, J., Delanoë, J., Domenech, C., Donovan, D. P., Fukuda, S., Hirakata, M., Hogan, R. J., Huenerbein, A., Kollias, P., Kubota, T., Nakajima, T., Nakajima, T. Y., Nishizawa, T., Ohno, Y., Okamoto, H., Oki, R., Sato, K., Satoh, M., Shephard, M. W., Velázquez-Blázquez, A., Wandinger, U., Wehr, T., and van Zadelhoff, G.-J.: The EarthCARE satellite: The next step forward in global measurements of clouds, aerosols, precipitation, and radiation, B. Am. Meteorol. Soc., 96, 1311–1332, 2015a.

Illingworth, A. J., Cimini, D., Gaffard, C., Haeffelin, M., Lehmann, V., Löhnert, U., O’Connor, E. J., and Ruffieux, D.: Exploiting existing ground-based remote sensing networks to improve high-resolution weather forecasts, B. Am. Meteorol. Soc., 96, 2107–2125, 2015b.

Jin, Y., Kai, K., Kawai, K., Nagai, T., Sakai, T., Yamazaki, A., Uchiyama, A., Batdorj, D., Sugimoto, N., and Nishizawa, T.: Ceilometer calibration for retrieval of aerosol optical properties, J. Quant. Spectrosc. Ra., 153, 49–56, 2015.

Josset, D., Pelon, J., Garnier, A., Hu, Y., Vaughan, M., Zhai, P.-W., Kuehn, R., and Lucker, P.: Cirrus optical depth and lidar ratio retrieval from combined CALIPSO-CloudSat observations using ocean surface echo, J. Geophys. Res.-Atmos., 117, D05207, 2012.

Klein, S. A. and Jakob, C.: Validation and sensitivities of frontal clouds simulated by the ECMWF model, Mon. Weather Rev., 127, 2514–2531, 1999.

Klekociuk, A. R., French, W. J. R., Alexander, S. P., Kuma, P., and McDonald, A. J.: The state of the atmosphere in the 2016 southern Kerguelen Axis campaign region, Deep Sea Res. Pt. II, 174, 0967-0645, 2019.

Knepp, T. N., Szykman, J. J., Long, R., Duvall, R. M., Krug, J., Beaver, M., Cavender, K., Kronmiller, K., Wheeler, M., Delgado, R., Hoff, R., Berkoff, T., Olson, E., Clark, R., Wolfe, D., Van Gilst, D., and Neil, D.: Assessment of mixed-layer height estimation from single-wavelength ceilometer profiles, Atmos. Meas. Tech., 10, 3963–3983, 2017.

Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi, K., Kamahori, H., Kobayashi, C., Endo, H., Miyaoka, K., and Takahashi, K.: The JRA-55 reanalysis: General specifications and basic characteristics, J. Meteorol. Soc. Japan. Ser. II, 93, 5–48, 2015.

Kotthaus, S., O'Connor, E., Münkel, C., Charlton-Perez, C., Haeffelin, M., Gabey, A. M., and Grimmond, C. S. B.: Recommendations for processing atmospheric attenuated backscatter profiles from Vaisala CL31 ceilometers, Atmos. Meas. Tech., 9, 3769–3791, 2016.

Kuma, P.: cl2nc 3.3.0, Zenodo, 2020a.

Kuma, P.: mpl2nc 1.3.5, Zenodo, 2020b.

Kuma, P., McDonald, A. J., Morgenstern, O., Alexander, S. P., Cassano, J. J., Garrett, S., Halla, J., Hartery, S., Harvey, M. J., Parsons, S., Plank, G., Varma, V., and Williams, J.: Evaluation of Southern Ocean cloud in the HadGEM3 general circulation model and MERRA-2 reanalysis using ship-based observations, Atmos. Chem. Phys., 20, 6607–6630, 2020.

Kuma, P., McDonald, A. J., Morgenstern, O., Querel, R., Silber, I., and Flynn, C. J.: Automatic Lidar and Ceilometer Framework (ALCF) (Version 1.0.0), Zenodo, 2021.

Lamer, K., Fridlind, A. M., Ackerman, A. S., Kollias, P., Clothiaux, E. E., and Kelley, M.: (GO)2-SIM: a GCM-oriented ground-observation forward-simulator framework for objective evaluation of cloud and precipitation phase, Geosci. Model Dev., 11, 4195–4214, 2018.

Lewis, J. R., Campbell, J. R., Welton, E. J., Stewart, S. A., and Haftings, P. C.: Overview of MPLNET version 3 cloud detection, J. Atmos. Ocean. Tech., 33, 2113–2134, 2016.

Liou, K.-N.: An introduction to atmospheric radiation, vol. 84, Elsevier, 2 edn., 2002.

Liu, J., Li, Z., Zheng, Y., and Cribb, M.: Cloud-base distribution and cirrus properties based on micropulse lidar measurements at a site in southeastern China, Adv. Atmos. Sci., 32, 991–1004, 2015a.

Liu, L., Sun, X.-J., Liu, X.-C., Gao, T.-C., and Zhao, S.-J.: Comparison of cloud base height derived from a ground-based infrared cloud measurement and two ceilometers, Adv. Meteorol., 2015, 1687-9309, 2015b.

Madonna, F., Rosoldi, M., Lolli, S., Amato, F., Vande Hey, J., Dhillon, R., Zheng, Y., Brettle, M., and Pappalardo, G.: Intercomparison of aerosol measurements performed with multi-wavelength Raman lidars, automatic lidars and ceilometers in the framework of INTERACT-II campaign, Atmos. Meas. Tech., 11, 2459–2475, 2018.

Marenco, F., Santacesaria, V., Bais, A. F., Balis, D., di Sarra, A., Papayannis, A., and Zerefos, C.: Optical properties of tropospheric aerosols determined by lidar and spectrophotometric measurement (Photochemical Activity and Solar Ultraviolet Radiation campaign), Appl. Opt., 36, 6875–6886, 1997.

Martucci, G., Milroy, C., and O’Dowd, C. D.: Detection of cloud-base height using Jenoptik CHM15K and Vaisala CL31 ceilometers, J. Atmos. Ocean. Tech., 27, 305–318, 2010.

Masunaga, H., Matsui, T., Tao, W.-k., Hou, A. Y., Kummerow, C. D., Nakajima, T., Bauer, P., Olson, W. S., Sekiguchi, M., and Nakajima, T. Y.: Satellite data simulator unit: A multisensor, multispectral satellite simulator package, B. Am. Meteorol. Soc., 91, 1625–1632, 2010.

Matsui, T.: Goddard Satellite Data Simulator Unit (G-SDSU) (last access: January 2021), 2019.

Mattis, I., Begbie, R., Boyouk, N., Bravo-Aranda, J. A., Brettle, M., Cermak, J., Drouin, M.-A., Geiß, A., Görsdorf, U., Haefele, A., Haeffelin, M., Hervo, M., Komínková, K., Leinweber, R., Müller, G., Münkel, C., Pattantyús-Ábrahám, M., Pönitz, K., Wagner, F., and Wiegner, M.: The ceilometer inter-comparison campaign CeiLinEx2015, in: EGU General Assembly Conference Abstracts, EPSC2016–9687, 2016.

McGill, M. J., Yorks, J. E., Scott, V. S., Kupchock, A. W., and Selmer, P. A.: The Cloud-Aerosol Transport System (CATS): A technology demonstration on the International Space Station, in: Lidar Remote Sensing for Environmental Monitoring XV, vol. 9612, p. 96120A, International Society for Optics and Photonics, 2015.

Mie, G.: Beiträge zur Optik trüber Medien, speziell kolloidaler Metallösungen, Annalen der Physik, 330, 377–445, 1908.

Milroy, C., Martucci, G., Lolli, S., Loaec, S., Sauvage, L., Xueref-Remy, I., Lavrič, J. V., Ciais, P., Feist, D. G., Biavati, G., and O'Dowd, C. D.: An assessment of pseudo-operational ground-based light detection and ranging sensors to determine the boundary-layer structure in the coastal atmosphere, Adv. Meteorol., 2012, 929080, 2012.

Morcrette, C. J., O'Connor, E. J., and Petch, J. C.: Evaluation of two cloud parametrization schemes using ARM and Cloud-Net observations, Q. J. Roy. Meteorol. Soc., 138, 964–979, 2012.

Morille, Y., Haeffelin, M., Drobinski, P., and Pelon, J.: STRAT: An automated algorithm to retrieve the vertical structure of the atmosphere from single-channel lidar data, J. Atmos. Ocean. Tech., 24, 761–775, 2007.

Münkel, C., Eresmaa, N., Räsänen, J., and Karppinen, A.: Retrieval of mixing height and dust concentration with lidar ceilometer, Bound.-Lay. Meteorol., 124, 117–128, 2007.

NASA JPL: NASA Shuttle Radar Topography Mission Global 3 arc second [Data set], NASA EOSDIS Land Processes DAAC, 2013.

Noel, V. and Chepfer, H.: A global view of horizontally oriented crystals in ice clouds from Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observation (CALIPSO), J. Geophys. Res., 115, D00H23, 2010.

O'Connor, E. J., Illingworth, A. J., and Hogan, R. J.: A technique for autocalibration of cloud lidar, J. Atmos. Ocean. Tech., 21, 777–786, 2004.

Pal, S. R., Steinbrecht, W., and Carswell, A. I.: Automated method for lidar determination of cloud-base height and vertical extent, Appl. Opt., 31, 1488–1494, 1992.

Parkinson, C. L.: Aqua: An Earth-observing satellite mission to examine water and other climate variables, IEEE T. Geosci. Remote, 41, 173–183, 2003.

Petty, G. W.: A First Course in Atmospheric Radiation, Sundog Publishing, 2 edn., 2006.

Petty, G. W. and Huang, W.: The modified gamma size distribution applied to inhomogeneous and nonspherical particles: Key relationships and conversions, J. Atmos. Sci., 68, 1460–1473, 2011.

Powers, J. G., Monaghan, A. J., Cayette, A. M., Bromwich, D. H., Kuo, Y.-H., and Manning, K. W.: Real-Time Mesoscale Modeling Over Antarctica: The Antarctic Mesoscale Prediction System, B. Am. Meteorol. Soc., 84, 1533–1546, 2003.

Price-Whelan, A. M., Sipőcz, B. M., Günther, H. M., et al.: The Astropy Project: Building an Open-science Project and Status of the v2.0 Core Package, Astronomical J., 156, 123, 2018.

R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria (last access: 1 January 2021), 2017.

Rausch, J., Meyer, K., Bennartz, R., and Platnick, S.: Differences in liquid cloud droplet effective radius and number concentration estimates between MODIS collections 5.1 and 6 over global oceans, Atmos. Meas. Tech., 10, 2105–2116, 2017.

Rayner, N. A., Parker, D. E., Horton, E. B., Folland, C. K., Alexander, L. V., Rowell, D. P., Kent, E. C., and Kaplan, A.: Global analyses of sea surface temperature, sea ice, and night marine air temperature since the late nineteenth century, J. Geophys. Res.-Atmos., 108, 4407, 2003.

Rew, R. and Davis, G.: NetCDF: an interface for scientific data access, IEEE Comput. Graph. Appl., 10, 76–82, 1990.

Rosoldi, M., Madonna, F., Pappalardo, G., Hey, J. V., and Zheng, Y.: The lesson learnt during interact-I and INTERACT-II actris measurement campaigns, in: EPJ Web of Conferences, vol. 176, p. 11002, EDP Sciences, 2018.

Rossow, W. B. and Schiffer, R. A.: ISCCP cloud data products, B. Am. Meteorol. Soc., 72, 2–20, 1991.

Rossum, G.: Python reference manual, Centre for Mathematics and Computer Science, Amsterdam, Netherlands, 1995.

Schmidt, G. A., Bader, D., Donner, L. J., Elsaesser, G. S., Golaz, J.-C., Hannay, C., Molod, A., Neale, R. B., and Saha, S.: Practice and philosophy of climate model tuning across six US modeling centers, Geosci. Model Dev., 10, 3207–3223, 2017.

Silber, I., Verlinde, J., Eloranta, E. W., Flynn, C. J., and Flynn, D. M.: Polar liquid cloud base detection algorithms for high spectral resolution or micropulse lidar data, J. Geophys. Res.-Atmos., 123, 4310–4322, 2018.

Spinhirne, J. D.: Micro pulse lidar, IEEE T. Geosci. Remote, 31, 48–55, 1993.

Stephens, G. L., Vane, D. G., Boain, R. J., Mace, G. G., Sassen, K., Wang, Z., Illingworth, A. J., O'Connor, E. J., Rossow, W. B., Durden, S. L., Miller, S. D., Austin, R. T., Benedetti, A., Mitrescu, C., and the CloudSat Science Team: The CloudSat mission and the A-Train: A new dimension of space-based observations of clouds and precipitation, B. Am. Meteorol. Soc., 83, 1771–1790, 2002.

Stokes, G. M. and Schwartz, S. E.: The Atmospheric Radiation Measurement (ARM) Program: Programmatic background and design of the cloud and radiation test bed, B. Am. Meteorol. Soc., 75, 1201–1222, 1994.

Swales, D. J., Pincus, R., and Bodas-Salcedo, A.: The Cloud Feedback Model Intercomparison Project Observational Simulator Package: Version 2, Geosci. Model Dev., 11, 77–81, 2018.

Telford, P. J., Braesicke, P., Morgenstern, O., and Pyle, J. A.: Technical Note: Description and assessment of a nudged version of the new dynamics Unified Model, Atmos. Chem. Phys., 8, 1701–1712, 2008.

Torvalds, L.: Linux: a portable operating system, Master's thesis, University of Helsinki, 1997.

Tsaknakis, G., Papayannis, A., Kokkalis, P., Amiridis, V., Kambezidis, H. D., Mamouri, R. E., Georgoussis, G., and Avdikos, G.: Inter-comparison of lidar and ceilometer retrievals for aerosol and Planetary Boundary Layer profiling over Athens, Greece, Atmos. Meas. Tech., 4, 1261–1273, 2011.

Van Der Walt, S., Colbert, S. C., and Varoquaux, G.: The NumPy array: a structure for efficient numerical computation, Comput. Sci. Eng., 13, 22–30, 2011.

van Diedenhoven, B.: Remote Sensing of Crystal Shapes in Ice Clouds, in: Springer Series in Light Scattering, pp. 197–250, Springer International Publishing, 2017.

Van Tricht, K., Gorodetskaya, I. V., Lhermitte, S., Turner, D. D., Schween, J. H., and Van Lipzig, N. P. M.: An improved algorithm for polar cloud-base detection by ceilometer over the ice sheets, Atmos. Meas. Tech., 7, 1153–1167, 2014.

Vaughan, M. A., Liu, Z., McGill, M. J., Hu, Y., and Obland, M. D.: On the spectral dependence of backscatter from cirrus clouds: Assessing CALIOP's 1064 nm calibration assumptions using cloud physics lidar measurements, J. Geophys. Res.-Atmos., 115, D14206, 2010.

Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Jarrod Millman, K., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C., Polat, İ., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., van Mulbregt, P., and Contributors: SciPy 1.0–Fundamental Algorithms for Scientific Computing in Python, arXiv [preprint], arXiv:1907.10121, 11 December 2019.

Walters, D., Baran, A. J., Boutle, I., Brooks, M., Earnshaw, P., Edwards, J., Furtado, K., Hill, P., Lock, A., Manners, J., Morcrette, C., Mulcahy, J., Sanchez, C., Smith, C., Stratton, R., Tennant, W., Tomassini, L., Van Weverberg, K., Vosper, S., Willett, M., Browse, J., Bushell, A., Carslaw, K., Dalvi, M., Essery, R., Gedney, N., Hardiman, S., Johnson, B., Johnson, C., Jones, A., Jones, C., Mann, G., Milton, S., Rumbold, H., Sellar, A., Ujiie, M., Whitall, M., Williams, K., and Zerroukat, M.: The Met Office Unified Model Global Atmosphere 7.0/7.1 and JULES Global Land 7.0 configurations, Geosci. Model Dev., 12, 1909–1963, 2019.

Wang, Z. and Sassen, K.: Cloud type and macrophysical property retrieval using multiple remote sensors, J. Appl. Meteorol., 40, 1665–1682, 2001.

Warren, E., Charlton-Perez, C., Kotthaus, S., Lean, H., Ballard, S., Hopkin, E., and Grimmond, S.: Evaluation of forward-modelled attenuated backscatter using an urban ceilometer network in London under clear-sky conditions, Atmos. Environ., 191, 532–547, 2018.

Watson-Parris, D., Schutgens, N., Cook, N., Kipling, Z., Kershaw, P., Gryspeerdt, E., Lawrence, B., and Stier, P.: Community Intercomparison Suite (CIS) v1.4.0: a tool for intercomparing models and observations, Geosci. Model Dev., 9, 3093–3110, 2016.

Webb, M., Senior, C., Bony, S., and Morcrette, J.-J.: Combining ERBE and ISCCP data to assess clouds in the Hadley Centre, ECMWF and LMD atmospheric climate models, Clim. Dynam., 17, 905–922, 2001.

Webb, M. J., Andrews, T., Bodas-Salcedo, A., Bony, S., Bretherton, C. S., Chadwick, R., Chepfer, H., Douville, H., Good, P., Kay, J. E., Klein, S. A., Marchand, R., Medeiros, B., Siebesma, A. P., Skinner, C. B., Stevens, B., Tselioudis, G., Tsushima, Y., and Watanabe, M.: The Cloud Feedback Model Intercomparison Project (CFMIP) contribution to CMIP6, Geosci. Model Dev., 10, 359–384, 2017.

Welton, E. J., Voss, K. J., Gordon, H. R., Maring, H., Smirnov, A., Holben, B., Schmid, B., Livingston, J. M., Russell, P. B., Durkee, P. A., Formenti, P., and Andreae, M. O.: Ground-based lidar measurements of aerosols during ACE-2: Instrument description, results, and comparisons with other ground-based and airborne measurements, Tellus B, 52, 636–651, 2000.

Welton, E. J., Voss, K. J., Quinn, P. K., Flatau, P. J., Markowicz, K., Campbell, J. R., Spinhirne, J. D., Gordon, H. R., and Johnson, J. E.: Measurements of aerosol vertical profiles and optical properties during INDOEX 1999 using micropulse lidars, J. Geophys. Res.-Atmos., 107, INX2–18, 2002.

Welton, E. J., Campbell, J. R., Berkoff, T. A., Valencia, S., Spinhirne, J. D., Holben, B., Tsay, S.-C., and Schmid, B.: The NASA Micro-Pulse Lidar Network (MPLNET): an overview and recent results, Opt. Pur. Apl., 39, 67–74, 2006.

Werner, M.: Shuttle radar topography mission (SRTM) mission overview, Frequenz, 55, 75–79, 2001.

Wiegner, M. and Gasteiger, J.: Correction of water vapor absorption for aerosol remote sensing with ceilometers, Atmos. Meas. Tech., 8, 3971–3984, 2015.

Wiegner, M. and Geiß, A.: Aerosol profiling with the Jenoptik ceilometer CHM15kx, Atmos. Meas. Tech., 5, 1953–1964, 2012.

Wiegner, M., Madonna, F., Binietoglou, I., Forkel, R., Gasteiger, J., Geiß, A., Pappalardo, G., Schäfer, K., and Thomas, W.: What is the benefit of ceilometers for aerosol remote sensing? An answer from EARLINET, Atmos. Meas. Tech., 7, 1979–1997, 2014.

Wiegner, M., Mattis, I., Pattantyús-Ábrahám, M., Bravo-Aranda, J. A., Poltera, Y., Haefele, A., Hervo, M., Görsdorf, U., Leinweber, R., Gasteiger, J., Haeffelin, M., Wagner, F., Cermak, J., Komínková, K., Brettle, M., Münkel, C., and Pönitz, K.: Aerosol backscatter profiles from ceilometers: validation of water vapor correction in the framework of CeiLinEx2015, Atmos. Meas. Tech., 12, 471–490, 2019.

Williams, D. N., Ananthakrishnan, R., Bernholdt, D., Bharathi, S., Brown, D., Chen, M., Chervenak, A., Cinquini, L., Drach, R., Foster, I., et al.: The Earth System Grid: Enabling access to multimodel climate simulation data, B. Am. Meteorol. Soc., 90, 195–206,, 2009. a

Williams, J., Morgenstern, O., Varma, V., Behrens, E., Hayek, W., Oliver, H., Dean, S., Mullan, B., and Frame, D.: Development of the New Zealand Earth System Model: NZESM, Weather and Climate, 36, 25–44,, 2016. a

Williams, K. D. and Bodas-Salcedo, A.: A multi-diagnostic approach to cloud evaluation, Geosci. Model Dev., 10, 2547–2566,, 2017. a

Winker, D. M., Vaughan, M. A., Omar, A., Hu, Y., Powell, K. A., Liu, Z., Hunt, W. H., and Young, S. A.: Overview of the CALIPSO mission and CALIOP data processing algorithms, J. Atmos. Ocean. Tech., 26, 2310–2323,, 2009. a

Wiscombe, W. J.: Mie scattering calculations: Advances in technique and fast, vector-speed computer codes, Tech. rep., National Center for Atmospheric Research, Boulder, Colorado, 1979.

Wiscombe, W. J.: Improved Mie scattering algorithms, Appl. Opt., 19, 1505–1509, 1980.

Yang, P., Liou, K.-N., Bi, L., Liu, C., Yi, B., and Baum, B. A.: On the radiative properties of ice clouds: Light scattering, remote sensing, and radiation parameterization, Adv. Atmos. Sci., 32, 32–63, 2014.

Yorks, J. E., Hlavka, D. L., Hart, W. D., and McGill, M. J.: Statistics of Cloud Optical Properties from Airborne Lidar Measurements, J. Atmos. Ocean. Tech., 28, 869–883, 2011.

Zadra, A., Williams, K., Frassoni, A., Rixen, M., Adames, Á. F., Berner, J., Bouyssel, F., Casati, B., Christensen, H., Ek, M. B., Flato, G., Huang, Y., Judt, F., Lin, H., Maloney, E., Merryfield, W., Van Niekerk, A., Rackow, T., Saito, K., Wedi, N., and Yadav, P.: Systematic Errors in Weather and Climate Models: Nature, Origins, and Ways Forward, B. Am. Meteorol. Soc., 99, ES67–ES70, 2018.

Zdunkowski, W., Trautmann, T., and Bott, A.: Radiation in the atmosphere: a course in theoretical meteorology, Cambridge University Press, New York, NY, USA, 482 pp., ISBN 0-511-27560-9, 2007.

Zhang, Y., Xie, S., Klein, S. A., Marchand, R., Kollias, P., Clothiaux, E. E., Lin, W., Johnson, K., Swales, D., Bodas-Salcedo, A., Tang, S., Haynes, J. M., Collis, S., Jensen, M., Bharadwaj, N., Hardin, J., and Isom, B.: The ARM Cloud Radar Simulator for Global Climate Models: Bridging Field Data and Climate Models, B. Am. Meteorol. Soc., 99, 21–26, 2018.

Zhang, Z. and Platnick, S.: An assessment of differences between cloud effective particle radius retrievals for marine water clouds from three MODIS spectral bands, J. Geophys. Res.-Atmos., 116, D20215, 2011.


We use the term “reanalysis” when referring to ERA5, JRA-55 and MERRA-2, even though reanalyses are themselves based on atmospheric models. We use the term “model” when referring to AMPS and the UM, which are atmospheric models.


The actual lidar wavelength is not constant and is characterised by a central wavelength and width. The central wavelength may fluctuate with temperature (Wiegner and Gasteiger, 2015).