The Coupled Model
Intercomparison Project phase 6 (CMIP6) HighResMIP is a new experimental design for global
climate model simulations that aims to assess the impact of model horizontal
resolution on climate simulation fidelity. We describe a hierarchy of global
coupled model resolutions based on the Hadley Centre Global Environment Model 3 – Global Coupled vn 3.1 (HadGEM3-GC3.1) model that ranges from
an atmosphere–ocean resolution of 130 km–1
We assess the change in model biases as a function of both atmosphere and ocean resolution, together with the effectiveness and robustness of this new experimental design. We find reductions in the biases in top-of-atmosphere radiation components and cloud forcing. There are significant reductions in some common surface climate model biases as resolution is increased, particularly in the Atlantic for sea surface temperature and precipitation, primarily driven by increased ocean resolution. There is also a reduction in drift from the initial conditions both at the surface and in the deeper ocean at higher resolution. Using an eddy-present and eddy-rich ocean resolution enhances the strength of the North Atlantic ocean circulation (boundary currents, overturning circulation and heat transport), while an eddy-present ocean resolution has a considerably reduced Antarctic Circumpolar Current strength. All models have a reasonable representation of El Niño–Southern Oscillation. In general, the biases present after 30 years of simulations do not change character markedly over longer timescales, justifying the experimental design.
There is now considerable evidence that enhancing model horizontal resolution can help to reduce systematic and long-standing climate model biases (Kinter et al., 2013; Small et al., 2014; Griffies et al., 2015; Hewitt et al., 2016; C. D. Roberts et al., 2018; M. Roberts et al., 2016, 2018) and hence potentially improve the robustness and trust in future projections. Some of the evidence for this comes from previous Coupled Model Intercomparison Project (CMIP) exercises (Meehl et al., 2000, 2007; Taylor et al., 2012). However, it can be difficult to assess the impact of horizontal resolution changes alone, as even when the same model is submitted to CMIP with multiple resolutions (relatively rare for coupled models), there are generally additional model differences. These may include retuning via parameter changes and difficulty in assessing model evolution due to extra complexity (e.g. interactive aerosol schemes, Earth system components).
CMIP6 HighResMIP (Haarsma et al., 2016) is a new experimental design that specifically focuses on assessing the impact of increased horizontal resolution on mean state biases, using model configurations designed for this purpose. The protocol encourages minimal model changes as resolution is increased, the use of a common, simplified aerosol optical property scheme (MACv2-SP; Stevens et al., 2017), as well as common initial conditions and other standard CMIP6 forcings (Eyring et al., 2016). However, due to the increased costs of such enhanced resolution models, some compromises need to be made. One such compromise is the length of coupled model simulation – it is not currently affordable to execute long pre-industrial (PI) spinup and PI control simulations (typically many hundreds of years) as used in the CMIP6 DECK (Diagnostic, Evaluation and Characterization of Klima) simulations (Eyring et al., 2016).
Hence, given a new protocol, together with model resolutions which range wider than those used in previous CMIP exercises, we need to assess the efficacy of this experimental design, finding its strengths and weaknesses, in addition to using it to assess the impact of model horizontal resolution. Similar assessments with other global climate models are ongoing and can be found in C. D. Roberts et al. (2018), Cherchi et al. (2019), Voldoire et al. (2019), Gutjahr et al. (2019), Sidorenko et al. (2019), Haarsma et al. (2019) and Caldwell et al. (2019).
Here, we describe the Hadley Centre Global Environment Model 3 – Global Coupled vn 3.1 (HadGEM3-GC3.1) model (Williams et al., 2017) as
configured for HighResMIP, with a resolution hierarchy spanning from a
standard CMIP-type resolution (130 km–1 How does horizontal resolution impact on the simulated coupled climate,
specifically on model biases, mean state and variability? How well does the CMIP6 HighResMIP protocol work in isolating these
impacts? Are there areas in which it will be difficult to assess the models due to
the protocol (e.g. drift larger than signal, the 100-year insufficient
simulation length, not enough ensemble members for robust differences)? Do the longer control-1950 simulations shown here differ significantly
from the initial 100 years, and do they reveal any further insights?
The focus of this work is on the spinup period and the control simulations
(constant forcing), together with the longer-term behaviour in several of
the models. In Sect. 2, we describe the model configuration at different
resolutions, together with aspects of implementation of the HighResMIP
experimental protocol. Results are shown in Sect. 3 on the impact of
resolution on aspects of the individual model components as well as the
coupled model evolution of mean state and variability, and in Sect. 4, we
will summarise our experiences and discuss future work.
The configuration of the global coupled model HadGEM3-GC3.1 for submission to the CMIP6 DECK (Eyring et al., 2016) is described in Williams et al. (2017), Menary et al. (2018) and Kuhlbrodt et al. (2018). It incorporates a global atmosphere–land configuration called GA/GL7.1 (Walters et al., 2019), with a new modal aerosol scheme (GLOMAP mode; Mulcahy et al., 2018). The atmospheric model uses a regular latitude–longitude grid and has 85 levels extending to 85 km. The global ocean component is called GO6 (Storkey et al., 2018), which uses the Nucleus for European Modelling of the Ocean (NEMO) ocean model (Madec et al., 2017) at vn3.6, having a tripolar grid, with 75 ocean levels (and top level thickness of 1 m). The sea ice model configuration is GSI8.1 (Ridley et al., 2018), which uses the CICE5.1 model (Hunke et al., 2015). Coupling between atmosphere and ocean models is performed by the OASIS-MCT coupler (Valcke et al., 2015) with conservation for the heat and freshwater terms and with surface fluxes calculated on the atmosphere grid. The coupling period is set to 1 h for all models. A sample of the vertical resolution in atmosphere and ocean is shown in Table 1.
A selection of model levels in the atmosphere and ocean and their corresponding height/depth.
The HighResMIP protocol recommends the use of the MACv2-SP scheme (Stevens et al., 2017) for simplified and standardised aerosol forcing. This specifies the change of anthropogenic aerosol optical properties over time and hence enables easier comparison between different models, while retaining the model's own aerosol mean (non-varying) background natural climatology and hence requiring little or no additional tuning. It is used here in place of the prognostic GLOMAP-mode scheme, with model implementation described in Vidale et al. (2019a). Simulations representative of 1950 (spinup-1950, control-1950) use mean values over 1949–1958 of the time-varying dataset.
The different model resolutions used in this work, together with parameterisation and parameter choices, are summarised in Table 2. We will henceforth use the CMIP6 naming conventions and CMIP6 nominal resolutions when describing the models – this is calculated as a weighted mean of the diagonal distance across grid cells, binned into resolution categories. Henceforth, we use the terms “model resolution” and “model grid spacing” interchangeably, while Klaver et al. (2019) describe the effective model resolution based on kinetic energy spectra.
Parameter value changes between different model ocean (from top) and atmosphere (from bottom) resolutions, together with resolution naming conventions (in bold).
The HadGEM3-GC3.1 model has very few parameter values explicitly changed as model resolution is varied (Table 2). In the atmosphere, the only explicit parameter change is to the “USSP launch factor”, which is used to produce a reasonable period for the quasi-biennial oscillation (QBO) at different resolutions, as described in Walters et al. (2019). For the ocean model, there are more changes, principally because we move from a regime at 100 km resolution (LL model) where the ocean mesoscale (ocean eddies, boundary currents) are strongly tied to parameterisations and the requirements of numerical stability, to eddying regimes at 25 and 8 km where these properties are increasingly explicitly resolved. The parameter choices for the LL model are described in Kuhlbrodt et al. (2018), with key differences from 100 to 25 km resolution being the deactivation of the ocean eddy flux parameterisation (Gent and McWilliams, 1990) and the reduction of explicit dissipation parameters. The snow on sea ice albedo was adjusted to be lower in the LL model (by 2 %–3 %; see Table 2) due to excessive sea ice thickness particularly in the Arctic (Kuhlbrodt et al., 2018).
In addition to explicit parameter differences, some model parameters and schemes are self-tuning; that is, their controlling parameters vary automatically based on model resolution. These include the stochastic physics stochastic perturbation of tendencies (SPT) and stochastic kinetic energy backscatter (SKEB2) schemes, as described in Walters et al. (2019) and Sanchez et al. (2016).
Detailed descriptions of the atmospheric model differences, such as the impact of using MACv2-SP scheme, can be found in Vidale et al. (2019a), and further assessments of ocean model differences up to 8 km resolution are described in Storkey et al. (2018) and Mathiot et al. (2019).
The simulations described here follow the CMIP6 HighResMIP protocol (Haarsma et al., 2016) in terms of the forcing datasets used. The spinup-1950 and control-1950 experiments (Fig. 1) use mean values of the CMIP6 transient forcing datasets for aerosol (as above), solar (Matthes et al., 2017), ozone concentration (Hegglin et al., 2016) and greenhouse gas (GHG) forcings (Meinshausen and Vogel, 2016). Aerosol uses a monthly mean from the 1950–1959 period; ozone and solar use a monthly mean over 22 years centred about 1950 (in order to mean over the 11-year solar cycle); GHG uses the 1950 value of GHG global concentrations. The hist-1950 simulations (not analysed here) use the time-varying versions of these forcings.
The HighResMIP experimental design, including the names of the component experiments and an indication of their relationship.
Details of the computational performance of the models at different resolutions can be found in Vidale et al. (2019b). Table 3 shows an overview of the total run lengths, initial conditions for each simulation, model costs and the volume of data output, primarily of the raw priority 1 diagnostics for the HighResMIP data request (Juckes et al., 2019). Since we expect the higher-resolution models to represent weather processes and events in an improved way, there is a requirement for more high-frequency output (1-, 3-, 6-hourly on multiple atmospheric levels, as well as daily ocean surface output) in order for us to assess these processes.
For each model resolution for the control-1950 simulation: the nominal CMIP6 resolution, the initial condition, total simulated years and costs of various model resolutions on a Cray XC40 with 36 cores per node, together with raw model output volumes.
Most of the model data used in the following analysis are available from the CMIP6 Earth System Grid Federation and can be located using the information in M. Roberts (2018, 2017a, b, c, d) for LL, MM, HM, MH and HH resolutions, respectively. Other model resolutions are not part of the official HadGEM3-GC3.1 CMIP6 HighResMIP submission, with the data available on request.
The analysis in this work uses data from the spinup-1950 and control-1950 HighResMIP experiments and hence is a representation of 1950 climatological conditions. It should be noted, when comparing these data to observations, that globally complete observational data are typically only available post-1970s, and hence one expects that some component of any “bias” will be attributable to this difference in the simulation and observational periods.
Given the expense of the higher-resolution models, the typical spinup procedure as used in CMIP6 DECK simulations (many hundreds of years of pre-industrial (PI) spinup before piControl and historic simulations are initialised; Eyring et al., 2016) is simply not feasible here. Hence, the HighResMIP protocol recommends 30–50 years using the spinup-1950 protocol from specified ocean and atmosphere initial conditions; here, we use 30 years. This “spun up” state is then used to initialise the control-1950 and hist-1950 simulations as illustrated in Fig. 1.
The common initial conditions for the ocean temperature and salinity are derived from the January 1950–1954 mean of the EN4 ocean analysis (Good et al., 2013). This is bilinearly interpolated to the model ocean grid. For the higher ocean resolutions (25 and 8 km), it was found that several days of simulation with a very short ocean time step (typically one quarter of the standard value) were needed in order to remove small-scale instabilities introduced by the interpolation, particularly in the high Arctic and where the Mediterranean and Black seas meet. The model was then restarted at 1 January 1950 with these derived ocean and sea ice initial conditions.
The atmosphere initial condition is derived from ERA-20C (Poli et al., 2016) in January 1950. Initial conditions for sea ice are taken from previous ocean–sea ice simulations (at the same resolution) valid around 1979, since methods to initialise different sea ice models with common variables are less well developed and with the assumption that sea ice has a timescale of only several decades to quasi-equilibrate. The soil moisture has a relatively long memory, and its initial condition was taken from a previous HighResMIP atmosphere-only simulation using the same atmosphere resolution.
The spinup-1950 experiment with models using 25 and 8 km ocean resolutions were only performed with one atmosphere resolution – MM and MH, respectively – both using the 100 km atmosphere.
The final states of all the model components at the end of spinup-1950 simulation are used to initialise both the control-1950 and hist-1950 experiments. As noted above, the MM spinup-1950 is used as initial condition of both MM and HM, and the MH spinup-1950 was used for both MH and HH. This method seemed to work well for the ocean component, but for the soil moisture in the land component it was found that the more inhomogeneous soil properties at 50 km (HM, HH) resolution tended to retain less water. This led to a pulse of freshwater from land to ocean at the start of the control-1950 and hist-1950 simulations. To prevent this, the soil moisture from a previous 50 km simulation was inserted into initial conditions for the HM and HH simulations.
The control-1950 experiment is required to be at least 100 years in length but can be much longer – it is the HighResMIP equivalent to the CMIP6 piControl. It uses the same forcings as spinup-1950. In this work, there are 1000 years of LL, 600 years of MM and 150 years of MH available for analysis.
These simulations have the same initial conditions as their respective control-1950 models. The forcings are now the time-varying CMIP6 forcings. Since we recognise that the models will continue to drift over time, due to the short spinup, we can use the difference between the hist-1950 and control-1950 simulations as an estimate of the impact of changing forcings on the climate state, assuming a common drift between the two simulation types. Analysis of these simulations is outside the scope of this work.
The radiative balance of the different resolution models is shown in Fig. 2,
in terms of the top-of-atmosphere (TOA) radiation, its outgoing shortwave
and longwave (OSR, OLR) components, and the global mean surface temperature
(ST), together with observational estimates where possible. The TOA starts
at between
Time series of globally averaged quantities for three ocean
resolution spinup-1950 simulations (
Annual mean zonal mean model radiation biases (years 50–100) of
top-of-atmosphere radiation components against CERES-EBAF observations
for 2000–2018 (Kato et al., 2013).
For comparison, as shown in Vidale et al. (2019a), the atmosphere-only
simulations have a TOA which starts at around
The adjustments of the MM and MH models are similar, and they stay closer to
the observational estimates. Here, the OSR increases by 0.5–1 W m
Annual mean model bias over years 50–100 in cloud radiative forcing
(W m
The oscillations in the MM TOA after 100 years are relatively large and relate to Antarctic sea ice variability and particularly to a large polynya that opens and closes over time in the Weddell Sea. This is a relatively common feature among climate model simulations (Griffies et al., 2015; Dufour et al., 2017; Cabré et al., 2017).
The models have not been tuned (beyond the shared, common scientific
configuration developed for the HadGEM3-GC3.1 CMIP6 DECK model) for a
specific long-term TOA radiation, so it is either chance or some inherent
property of this coupled system that enables all models to achieve a
relatively small net TOA balance over such a short time. For comparison, the
equivalent HadGEM3-GC3.1 DECK pre-industrial control (piControl) simulations
have mean TOA of 0.2 and 0.31 W m
The zonal mean TOA, OSR and OLR biases compared to the Clouds and the Earth's Radiant Energy System Energy Balanced and Filled product surface fluxes edition 4.0 (CERES-EBAF; Kato et al., 2013) are shown in Fig. 3. This confirms that the largest differences occur when the resolution is changed from LL to any other resolution. At low and midlatitudes, the TOA bias is relatively small, but this is due to a compensation between OSR and OLR components, with OLR biased negative at nearly all latitudes and OSR biased positive. All models have positive OSR biases at midlatitudes apart from near Antarctica where there is a negative bias.
The spatial patterns of cloud radiative forcing (CRF) biases against CERES-EBAF (Kato et al., 2013) are shown in Fig. 4 for both shortwave and longwave components (SW, LW). The large-scale patterns of bias are consistent across model resolutions, but there are significant regional differences. As we increase the resolution, for the Atlantic basin, there are reductions in the SW CRF bias in both the tropics and over the Gulf Stream–North Atlantic Current, in the stratocumulus/upwelling regions (off the west coasts of South America, South Africa and to a lesser extent North America) and in the western Pacific. In contrast, there is an increase in bias in the eastern tropical Indian Ocean and over South America. For the LW CRF, there is less change: the bias in the tropical Atlantic changes from a negative bias to the south of the Equator to a positive bias to the north, which links to the precipitation biases shown later. A negative bias also increases with increased resolution in the eastern Indian Ocean (perhaps somewhat compensating for the SW CRF bias).
The time series of sea ice area in the Arctic and Antarctic for March and September (approximately the extremes of the respective seasonal cycles) are shown in Fig. 5, together with some observational estimates (HadISST1.2, Rayner et al., 2003; HadISST.2.2.0.0, Titchner and Rayner 2014), noting that the observations are representative of years 1990–2009, while the model is simulating the year 1950. As noted previously, the LL model has a large increase in Arctic sea ice area over the initial decades before approaching the climatology of the other models. All models have somewhat more Arctic sea ice in both summer and winter than observations. In the Antarctic winter, the MM model displays considerable decadal variability (particularly later in the simulation, as noted in the TOA variability previously) which is not so evident at other resolutions and is due to a large polynya opening and closing in the Weddell Sea. Summer Antarctic sea ice values are improved over previous versions of the model, particularly at MM resolution for which large-scale warming of the Southern Ocean has been a persistent model bias. This has been achieved primarily from reduced atmospheric flux biases (Bodas-Salcedo et al., 2016; Hyder et al., 2018; Williams et al., 2017), though a warm bias does still remain (see Fig. 7).
Time series of sea ice areas in the
Seasonal cycles for
Top row: model SST bias in control-1950 years 50–100 vs. EN4 (1950–1954 mean), with the mean and rms bias shown; second row: the total SST differences between MM-LL and HH-MM resolutions; third row: the impact of changing the atmosphere resolution at each reference ocean resolution; bottom row: the impact of changing the ocean resolution at each reference atmosphere resolution. Units are K. Points that contain annual mean sea ice are masked.
The sea ice area and volume seasonal cycles are shown in Fig. 6 at the end of both the spinup-1950 and control-1950 simulations. There are few observational means to assess sea ice volume, and so here we use the Pan-Arctic Ice Ocean Modelling and Assimilation System sea ice reanalysis (PIOMAS; Zhang and Rothrock, 2003; Schweiger et al., 2011) model as a reference in the Arctic (1990–2009) and satellite estimates from ICESat (Ice, Cloud, and land Elevation Satellite) for the Antarctic during 2003–2008 (Kurtz and Markus, 2012). PIOMAS has been shown to compare well with ICESat thickness for the periods where ICESat is present (Schweiger et al., 2011), and this gives us confidence to use the data throughout the year and over the longer evaluation period (1990–2009). The seasonal cycle amplitude and phase of sea ice area are well captured in the models except for LL in the Arctic, which has too much sea ice. All the models have more sea ice volume than is indicated by the PIOMAS model and ICESat in the Arctic and Antarctic, respectively. In the Arctic, the volume increases over time in the MM and MH simulations while reducing somewhat in LL, while in the Antarctic the MM volume starts lower than the other models but adjusts to a similar mean state.
The SST biases at the end of the control-1950 period (averaged over years 50–100) are shown in Fig. 7 for each model resolution. These are shown both as model bias compared to initial condition (top row) and as inter-resolution differences for the total change (due to atmosphere plus ocean resolution), change due to atmosphere resolution alone, and change due to ocean resolution alone (rows two, three and four, respectively). A common feature across the models is a generally cold midlatitude bias, which may partly reflect the experimental design of using EN4 1950–1954 initial conditions, the short spinup-1950 period, the constant 1950s forcing derived from CMIP6 and a consequently negative TOA in the first few decades (Fig. 2a) but is also a feature found in many climate models and experiments (see Flato et al., 2013, Fig. 9.13; Kuhlbrodt et al., 2018).
The change in SST over 100 years of the control-1950 simulation illustrated by the difference between years 90–100 and years 1–10.
The biases typical for a 100 km ocean model (Danabasoglu et al., 2014) are evident in Fig. 7a for LL and also described in Kuhlbrodt et al. (2018). They are strongest over the boundary currents in the North Atlantic and northwest Pacific, with cold biases of more than 5 K, and over the tropics with a cold bias of 1–2 K. Warm biases over the stratocumulus decks to the west of southern Africa, South America and California are also evident, as well as warm biases in the regions where boundary currents separate from the coastline (particularly the Gulf Stream and Kuroshio). Comparing this to the 25 km model (MM, Fig. 7b, d), there are large reductions in both boundary current and tropical biases, and some reductions in the warm stratocumulus biases and the warm bias at the Kuroshio separation. These come at the expense of an enhanced warm bias in the Southern Ocean, which is also common in models at this resolution (Bodas-Salcedo et al., 2014; Flato et al., 2013, Fig. 9.2b), and is due to errors in the atmospheric fluxes (Bodas-Salcedo et al., 2014; Hyder et al., 2018) and due to the intermediate regime in the Southern Ocean in which the ocean models have no eddy parameterisation but also a poor explicit representation of eddies (Hallberg, 2013; Moreton et al., 2019). Although still sizable, the bias has been considerably reduced compared to previous versions of the model (Williams et al., 2017).
Comparing the 8 and 25 km ocean models (HH and MM; Fig. 7e), there is further warming in the Atlantic in the eddy-rich model with mixed results compared to the bias and some further cooling in the stratocumulus regions, and over the Gulf Stream separation the bias switches from warm to cold. The Southern Ocean warm bias is slightly reduced compared to MM but remains larger than the LL bias.
There is no clean way to attribute the biases to either atmosphere or ocean resolution due to complex coupled interactions, but rows 3–4 of Fig. 7 show the impact of atmosphere and ocean resolution change only, respectively, for a given resolution of the other component. The largest changes are found between the L and M resolution components, with smaller changes at higher resolutions (consistent with C. D. Roberts et al., 2018). For the L and M resolution ocean, a higher-resolution atmosphere tends to produce a cooler ocean SST, particularly in the ocean upwelling regions (as also seen in Gent et al., 2010 and Small et al., 2014, 2015), the Southern Ocean and the North Atlantic. A higher ocean resolution, meanwhile, tends to produce a warmer SST particularly in boundary currents and other high gradient regions, with changes over 6 K (Griffies et al., 2015; Small et al., 2019). Differences between HH and MH, HM are relatively smaller, though the improved separation of the Gulf Stream from the North American coast and consequent reduction in warm bias is evident in the dipole of SST change with the eddy-rich ocean (Fig. 7k) as also seen in Small et al. (2019).
To contrast the biases above to model surface drift, and hence the effectiveness of the short spinup in the experimental design, Fig. 8 shows the SST differences between the start and end of the control-1950 simulation. The different model resolutions evolve in distinct ways. LL and ML (Fig. 8a, b) both warm in the northern Pacific and Atlantic basins, the latter consistent with the reduced Arctic sea ice as seen earlier and an increasing AMOC and northward heat transport (see later). MM and HM (Fig. 8c, d) both cool at high latitudes in the Southern Hemisphere and warm slightly in the northern North Atlantic; the HM model has some additional cooling in the southeast Pacific, likely associated with reduced coastal warm bias. MH and HH (Fig. 8e, f) both have a slight warming in the South Atlantic near the Antarctic Circumpolar Current. However, it is clear that the magnitude of this drift is considerably smaller than the mean bias shown in Fig. 7, such that the bias plot at the end of spinup-1950 (not shown) is little different from Fig. 7. The only notable region with similar magnitude of drift and bias is the LL North Atlantic warming and part of the Southern Ocean cooling in MM and HM.
The annual mean 2 m temperature biases over land are shown in Fig. 9 using model means over years 50–100, compared to the Climate Research Unit time series 4.01 dataset for period of 1940–1960 (CRU TS; Harris et al., 2014). The warmer SSTs and reduced sea ice extent in the Arctic Ocean between LL and the higher-resolution models give significantly warmer surface temperatures over Scandinavia, northern Russia and Alaska of 2–3 K. Tropical cold biases are also reduced as resolution is increased, leading to the HH model having a considerably smaller global root mean square (rms) bias.
The results above indicate that, for the surface climatology, the HighResMIP simulations are adequately long to illustrate robust differences due to model resolution. However, the deeper ocean requires much longer to come into any pseudo-equilibrium, and these drifts are described next.
Annual mean bias in 2 m temperature over land (
The evolution of subsurface ocean changes from the initial state (henceforth
referred to as ocean drift) at different model resolutions is shown in
Figs. 10 and 11 for temperature and salinity, respectively, for both the
global ocean (left column) and the Atlantic basin (40
The mean ocean potential temperature anomaly (compared to year 1) on model levels against time for the (left) global and (right) Atlantic Ocean basins, from the spinup-1950 period to the first 210 years (where available) of control-1950. Note the different scales for the different regions.
As Fig. 10 but for ocean salinity, again with different scales on the left and right.
All models have a global cooling of 0.5–1 K over the top 200 m (Fig. 10, left column) which gradually reduces over time and a warming centred at 800 m, with an enhanced magnitude in the L ocean models. The LM and MM models have a deep warming not seen in other models, which is likely associated with the Southern Ocean bias (not shown). The global salinity drift (Fig. 11, left column) shows all models have a freshening over the top 300 m with magnitude increasing over time. They also have an increase in salinity between 1000 and 2000 m, which is larger with the L ocean.
The temperature evolution in the Atlantic (Fig. 10, right column) is similar to the global drift, with near-surface cooling and a warming at mid-depths, both of which are stronger in the L ocean. The warming at mid-depths is also associated with an increase in salinity (Fig. 11, right column), which is also seen in Griffies et al. (2015) and Kuhlbrodt et al. (2018), and is likely to be associated with the AMOC circulation, production of North Atlantic Deep Water (NADW) and biases in representing deep overflows in the North Atlantic (Danabasoglu et al., 2014). The change from L to a higher-resolution ocean causes the surface Atlantic salinity bias to switch from a freshening to a positive increase, with the eddy-rich MH resolution notable for having greatly reduced salinity drifts in the Atlantic – these differences are consistent with differences in (tropical) precipitation as shown later.
Figure 10c–d and e–f illustrate the impact of atmosphere and ocean resolution on these drifts. Using a higher-resolution atmosphere in ML (Fig. 10d) suggests a reduction in the Atlantic mid-depth warming (perhaps an improvement in the NADW), while LM (Fig. 10e, f) shows a slight reduction in the magnitude of warming at 1000 m and the warming of the bottom waters globally associated with the Southern Ocean bias.
Given the experimental design, it is difficult to disentangle drifts due to imbalances in surface forcing from processes that may be poorly represented. As discussed in Griffies et al. (2015), von Storch et al. (2016) and Small et al. (2019), the evolution of the subsurface ocean depends on fluxes of heat (and salt) from either parameterised or explicitly represented processes (e.g. vertical diffusion, advective fluxes from mean, mesoscale and submesoscale circulation). The changes in the subsurface ocean over time with resolution shown here are consistent with these previous studies, which would indicate an important role for upward heat transport from vertical mesoscale fluxes. The models shown here do not contain a submesoscale eddy parameterisation, which may be important to represent unresolved processes (Fox-Kemper et al., 2011), but they do have enhanced ocean vertical resolution in the near surface (Table 1) compared to the models referenced above, which may also be important.
The information above has shown that 100 years is insufficient to saturate the deep ocean drifts, as would be expected. However, some differences with resolution do seem to be robust and possibly linked to process improvement, particularly in the Southern Ocean where the eddy-rich MH simulation greatly reduces the deep warm temperature drift and in the North Atlantic for both temperature and salinity.
The spatial patterns of the temperature and salinity biases at 950 m depth are shown in Fig. 12 for LL, MM and HH, as a mean over years 50–100, compared to EN4 1950–1954. As indicated in the previous figures, the LL model has warming and increased salinity over much of the Atlantic with an enhancement in the western tropical Atlantic. There biases are mostly reduced in MM and HH simulations, where the main bias switches to the eastern North Atlantic with potentially some role for the Mediterranean outflow. In the region of the Gulf Stream separation from the North American coast, the HH model has a cold and slightly fresh bias opposite to the lower resolutions. Over the rest of the global oceans, the LL model has warming and increase in salinity at mid–high latitudes in the Pacific Ocean which is somewhat reduced in HH, while all the resolutions have cooling and freshening in the northwest Indian Ocean and warming in the southwest Indian Ocean.
Biases at 950 m depth at different model resolutions for
The eddy-rich ocean simulation, MH, has both the minimum in surface
adjustment (of radiation and temperature) as well as the smallest
deep ocean drifts from the EN4 initial conditions. It is unclear whether
this is due only to improved representation of key processes (for example, in
the Southern Ocean and Atlantic), or whether by chance it is better able to
adjust from these initial conditions. It should also be noted that interior
biases can be exacerbated by spurious mixing, especially in the
Changes in the annual mean precipitation bias against Global Precipitation Climatology Project (GPCP) version 2.3 1979–2014 (Adler et al., 2018) with resolution are shown in Fig. 13, averaged over model years 50–100 of control-1950, and are generally consistent with those found in the multi-model analysis of Vannière et al. (2018). The mean biases are shown in the top row, together with the total differences (atmosphere plus ocean resolution changes, row two), and then the impact of atmosphere and ocean resolution changes individually (rows three and four, respectively).
Top row: annual mean model precipitation bias of control-1950
model years 50–100 vs. GPCP_v2.3 1979-2014; second row: the
total precipitation differences between MM-LL and HH-MM resolutions; third
row: the impact of changing the atmosphere resolution at each reference
ocean resolution; bottom row: the impact of changing the ocean resolution
at each reference atmosphere resolution. Units are mm d
Some aspects of the large-scale biases are common across all resolutions, consistent with those shown in Williams et al. (2017). There is excessive precipitation in the western tropical Pacific, southeast Asia, the western Indian Ocean and the South Pacific Convergence Zone (SPCZ), which also includes a double Intertropical Convergence Zone (ITCZ) error in the east Pacific (Lin, 2007).
In the tropical Atlantic, there is a dipole error of 2–3 mm d
Atmosphere and ocean resolutions both play an important role in these changes (Figs. 13f, g, h and 12i, j, k, respectively), with a higher-resolution atmosphere tending to be dry to the south of the Equator and wet to the north with a similar magnitude at each ocean resolution, while the ocean resolution change is much bigger between L and M apart from over the Gulf Stream.
The change in tropical Atlantic precipitation is consistent with an ITCZ located further north at higher resolution. This would be consistent with increased AMOC at higher resolution causing SST gradient changes (Jackson et al., 2015; also Fig. 7d, e) and may also be a consequence of a change in the global energy budget leading to shifts in the Hadley cell/ITCZ position (Bischoff and Schneider, 2016). The precipitation and consequent evaporation changes in the tropical Atlantic are also consistent with the salinity drifts shown in Fig. 11.
The Atlantic northward heat transport (NHT) and Atlantic meridional
overturning circulation (AMOC) time series are shown in Fig. 14 at
26.5
The MH and HH models have NHT (Fig. 14a) which is most consistent with the observations, suggesting an important role for an eddy-rich ocean improving boundary current representation (Treguier et al., 2012; Roberts et al., 2016). However, for AMOC transport, the observations are perhaps more consistent with the lower-resolution models. This apparent inconsistency has been investigated previously by Msadek et al. (2013) and C. D. Roberts et al. (2018). Using a breakdown of heat transport components from the RAPID-MOCHA array (Smeed et al., 2017; Johns et al., 2011), they showed that the amount of NHT was typically underestimated due to both a too-weak AMOC and too little heat transport per Sverdrup of AMOC strength. The same breakdown for the model resolutions is shown in Fig. 15 as well as the AMOC profile with depth. All the models have a weaker overturning component to the heat transport than indicated by RAPID-MOCHA, though the H ocean does shift to higher values than the lower resolutions. Other models and reanalyses show a similar relationship, with strong correlations between AMOC and NHT, producing a regression that would imply, for the observed AMOC strength, a weaker NHT than observed (Danabasoglu et al., 2014; Roberts et al., 2013; Msadek et al., 2013).
AMOC-related metrics calculated using RapidMoc code (C. D. Roberts,
2017) consistent with the RAPID-MOCHA observations, using model years 1–100.
The depth profile of AMOC (Fig. 15b) indicates a strengthening at most depths with increased ocean resolution, with all models having a maximum at around 1000 m consistent with the observations. The shape of the AMOC profile at depth in MH/HH agrees far better with the observations (a difference of 5 Sv at 3000 m between LL and HH), which is likely to be important for global deep water masses and circulation. However, as discussed above, the peak transport at 1000 m becomes considerably stronger than observations in HH. The seasonal cycle of AMOC with annual mean removed (Fig. 15c) indicates the higher-resolution models can match the observed magnitude and phase. The AMOC minimum in March corresponds to the period of maximum variance, with reduced variability in summer.
The spatial structure of the AMOC in both depth and potential density
(referenced to 2000 m) in shown in Fig. 16. The strength of the AMOC in depth
space and the depth of the return flow both increase with resolution over
all latitudes apart from the northern North Atlantic (50–60
Atlantic meridional overturning streamfunction in (left) depth
coordinates and (right) potential density (
Ocean northward heat transport for
The NHT dependence on latitude is shown in Fig. 17 for the global, North
Atlantic and Indo-Pacific basins, where the individual components (total,
resolved advective components, diffusion and parameterised eddy-induced
velocity) are also indicated. In the North Atlantic (Fig. 17b), the eddying
advection component for LL is only visible near the Equator, while the
eddy-induced transport associated with the Gent–McWilliams parameterisation
reaches
For the global ocean (Fig. 17a), the higher resolutions have noticeably
enhanced poleward transport at higher latitudes. In the Southern Ocean
around 40
The time evolution of the Antarctic Circumpolar Current (ACC) transport,
calculated as the volume transport through Drake Passage, is shown in Fig. 18. The mean net eastward transport of 155, 90 and 125 Sv, respectively, for
LL, MM and MH models compares to the recent observational range of
Time series of the Antarctic Circumpolar Current (ACC) transport calculated between Antarctica and South America across Drake Passage for spinup-1950 and control-1950 simulations.
It has been shown (e.g. Kuhlbrodt et al., 2012) that the value of a constant coefficient for eddy parameterisation can influence the ACC transport via the meridional density gradient, and it is possible that the varying coefficient used in LL (Table 2) may play a similar role. Use of such a scheme in the higher-resolution models may well increase the ACC transport for similar reasons, at the expense of removing explicitly resolved mesoscale processes.
As the dominant mode of interannual tropical variability, El Niño–Southern Oscillation (ENSO) is a key aspect of climate variability with worldwide impacts (Timmermann et al., 2018). Over time, there has been some improvement in modelling ENSO in global climate models (e.g. Bellenger et al., 2014), with HadGEM3-GC3.1 performance described in Williams et al. (2017), Kuhlbrodt et al. (2018) and Menary et al. (2018). Figure 19 shows the power spectrum of NINO3.4 monthly surface temperature anomalies, calculated using a periodogram method with 50 years of data in each sample and a 25-year overlap between samples, with the average power spectrum and range (shading) shown. Only the LL and MM models are shown since, as indicated in Wittenberg (2009) and Stevenson et al. (2010), 100 years is insufficient for a robust spectrum. The mean spectra from these models agree well with those from HadISST1.1 observations for 1877–2018 (Rayner et al., 2003), with the other resolutions having little obvious difference (not shown). The standard deviations of the mean NINO3.4 DJF SST from the models are all slightly higher than the observed value of 0.93.
Power spectrum of NINO3.4 surface monthly temperature variability for LL, MM resolution models and HadISST1.1 observations. The coloured line shows the mean power spectrum for all years, the dashed line shows the first 50 years, while the shading shows the range over all 50-year subsampled periods. The number in brackets is the standard deviation of the DJF mean SST in the NINO3.4 region.
The composite December–January–February (DJF) mean surface temperature
patterns relating to El Niño and La Niña events are shown in Fig. 20, with events defined when the DJF NINO3.4 index exceeds
Composite DJF mean surface temperature (colours) for (top) El Niño and (bottom) La Niña from models and observations based on El Niño and La Niña events. The number of events sampled is shown in the title and the proportion of El Niño events classified as cold tongue (CT) and warm pool (WP). Events are defined as years when the NINO3.4 DJF SST anomaly exceeds 0.7 K. The observations are a combination of HadISST1.1 over the ocean and CRU 2 m temperatures over land, with values masked over HadISST1.1 sea ice regions. Mean sea level pressure anomalies are shown as contours with interval 0.5 hPa, with observations from HadSLP2.
The equivalent composite rainfall patterns are shown in Fig. 21 for the models and GPCP2.3 observations for 1979–2014 (Adler et al., 2018) for El Niño and La Niña events. The extension of the SST pattern into the western Pacific in LL is also evident here as excessive precipitation at the Equator in the west Pacific, with some improvement at higher resolutions. All model resolutions mirror the observed teleconnections quite faithfully, though the dry anomaly with El Niño events over South Africa is not robustly captured.
As Fig. 18 but for DJF composites of mean precipitation (top) El Niño and (bottom) La Niña from models and GPCP2.3 precipitation, with mean sea level pressure anomalies as contours. The number of events sampled is shown in the title, along with the proportion of El Niño events classified as CT and WP. Events are defined as years when the NINO3.4 DJF SST anomaly exceeds 0.7 K.
The near-
There seems to be only minor differences in the ENSO performance in the models at different resolutions, mainly in slight differences to the SST composite. While 100 years is not long enough to assess the power spectrum, as noted previously by Wittenberg (2009) and Stevenson et al. (2010), and ENSO composites based on relatively few events can be uncertain due to internal variability (Deser et al., 2017), the composite patterns of surface temperature and precipitation show relative robustness.
The Madden–Julian Oscillation (MJO) dominates the tropical intraseasonal
variability (Madden and Julian, 1971) and is characterised by eastward
propagation of deep convective structures moving along the Equator with an
average phase speed of around 5 m s
As part of the CMIP6 HighResMIP project, a wide range of coupled model simulations with atmosphere resolutions between 250 and 50 km, and ocean resolutions from 100 to 8 km has been performed with the HadGEM3-GC3.1 model. We have shown that increased model resolution in the atmosphere and ocean can have considerable impact on climate model biases of the mean state and variability, both at the surface in terms of temperature and precipitation (Figs. 4, 7, 9, 13), as well as in the deeper ocean (Figs. 10–12).
We have demonstrated that the new CMIP6 HighResMIP experimental design, with only a multi-decadal spinup and 100-year simulation length, is sufficiently long to robustly establish some of these responses in model bias (Fig. 8). This has enabled the use an eddy-rich 8 km ocean model within the same suite of experiments to make a more comprehensive chain of resolutions and hence further test the robustness of our results. These experiments may also enable better understanding of the model adjustment process (so-called spinup from initial conditions), which tends not to be a focus of the standard CMIP simulations with a long pre-industrial spinup. This makes it harder to understand why the deep ocean adjustment process timescales may be different with different resolutions and what role these biases might play in model sensitivity to changes in forcing.
We find that increased ocean resolution is key to reducing many of the most common SST biases (Fig. 7), while a combination of ocean and atmosphere resolution significantly improves the large tropical Atlantic precipitation biases seen in typical CMIP-resolution models (Fig. 13), the latter having the potential to cause considerable uncertainty in projections of future rainfall changes.
We have also found some potential links between the biases and the evolution of the mean state. Based on previous work (Jackson et al., 2015), it seems likely that the strengthened AMOC and northward heat transport in the tropical Atlantic is linked to the improved SST biases and reduced precipitation (and ITCZ) biases, which in turn may be associated with some of the deeper ocean biases. These may also link to the different spinup behaviours seen in the different models. The initial drop in AMOC and northward heat transport in the LL model (Fig. 14) cause a cooling in the North Atlantic and Arctic, with a consequent increase in sea ice. Over time, the stronger temperature (and salinity) contrast between Equator and pole drives an increase in AMOC which gradually warms the Arctic. This increase in AMOC could be enhanced by the increased tropical Atlantic salinity bias in LL, which would increase the density of water reaching the northern North Atlantic and enable a stronger AMOC circulation to develop. Using the same initial conditions and short spinup in all experiments may enable better understanding of such adjustment processes than is generally possible in standard CMIP simulations.
While representing a substantial improvement over the length of simulation period typically used in global high-resolution experiments in the past, there are aspects of these simulations for which 100 years is not sufficient. The LL model in particular seems to take considerably longer than 30 years to quasi-equilibrate aspects of the large-scale circulation (such as AMOC), perhaps indicating that some processes are inadequately represented at this resolution. The deep ocean equilibration timescales are clearly much longer than 100 years, although there is evidence that the different resolution models trending towards different final states and the magnitude of the drifts is resolution dependent. This 100-year period is still not enough for a robust estimate of the ENSO power spectrum and variability, although the ENSO composites of surface temperature and precipitation teleconnections are apparently robust. The longer control-1950 simulations (at the lower resolutions) have been vital to test the reliability of these assertions, and hence there is probably a role for such longer simulations within the HighResMIP experimental design.
Such timescales also emphasise the importance of considering both the control-1950 and hist-1950 simulations when evaluating the models involved in HighResMIP, to properly assess model drift compared to response to forcing. An open question is whether this experimental design will enable us to identify a climate change signal in the historic and future simulations (not analysed here) by comparing these to the control simulations with constant forcing, given that the control simulations are continuing to drift. Results may be dependent on the process of interest and require new analysis methods or reference to simulations where both traditional long spinup simulations and HighResMIP experiments have been attempted.
It is unclear how robust all of the results shown here will be across a multi-model dataset. Ongoing work within Horizon 2020 PRIMAVERA project (Grist et al., 2018; C. D. Roberts et al., 2018; Vannière et al., 2018) suggests that at least for changes from 100 to 25 km ocean resolution there are robust reductions in SST and precipitation biases, while work by Griffies et al. (2015) and Small et al. (2014, 2019) does indicate some consistency in further changes to eddy-rich ocean resolutions. Further work using this multi-model ensemble within HighResMIP, in addition to comparing these simulations to their CMIP6 DECK equivalents, is ongoing and may reveal further insights into the impact of resolution and model complexity (Earth system processes such as interactive aerosols and biogeochemistry) and perhaps indicate the best trade-offs for gaining the largest model improvements for the smallest computational cost.
Most of the model data used in the following analysis are available from the CMIP6 Earth System Grid Federation and can be located using the information in M. Roberts (2018, 2017a, b, c, d) for resolutions LL, MM, HM, MH and HH, respectively. Other model resolutions are not part of the official HadGEM3-GC3.1 CMIP6 HighResMIP submission, and these data are available upon request via the CEDA-JASMIN platform.
The source code for the models used in this study, UM, JULES, NEMO and CICE,
is available to use. To apply for a license for the UM, go to
In order to implement the scientific configuration of GC3.1 and to allow the components to work together, a number of branches (code changes) are applied to the above codes. Please contact the authors for more information on these branches and how to obtain them.
Model simulations were performed by MJR, AC and RS. JS post-processed the model output into CMOR format for publication to ESGF. MJR prepared the manuscript with contributions from all co-authors.
The authors declare that they have no conflict of interest.
Malcolm J. Roberts, Jon Seddon, Pier Luigi Vidale, Reinhard Schiemann, Benoît Vannière, Alex Baker and Christopher D. Roberts acknowledge PRIMAVERA funding received from the European Commission under grant agreement 641727 of the Horizon 2020 research programme. Malcolm J. Roberts, Helene T. Hewitt, Ed W. Blockley, Pierre Mathiot and Laura C. Jackson were supported by the Met Office Hadley Centre Climate Programme funded by BEIS and Defra (GA01101). Till Kuhlbrodt was funded by the National Environmental Research Council (NERC) national capability grant for the UK Earth System Modelling project (grant no. NE/N017951/1) and by the EU Horizon 2020 Research Programme CRESCENDO project (grant agreement 641816). Pier Luigi Vidale, Reinhard Schiemann, Alex Baker and Benoît Vannière acknowledge funding from the National Environmental Research Council (NERC). Andrew Coward acknowledges funding from the ACSIS project which is supported by the Natural Environment Research Council (grant no. NE/N018044/1).
Data from the RAPID-MOCHA program are funded by the US National Science
Foundation and UK Natural Environment Research Council and are freely
available at
We acknowledge extensive use of the supercomputers at the Met Office and the ARCHER UK National Supercomputing Service to complete the simulations and the CEDA-JASMIN platform for enabling data storage and model analysis.
We thank Paul Earnshaw, Dan Copsey, Matthew Mizielinski, Tim Graham and many other colleagues at the Met Office for help in various aspects of model simulation, data processing and analysis tools.
We would also like to thank the reviewers, Stephen Griffies, Luke Van Roekel and Justin Small, and the editor, for their time and help in improving the manuscript.
This research has been supported by the Horizon 2020 programme: PRIMAVERA (grant no. 641727) and CRESCENDO (grant no. 641816), the Joint UK BEIS/Defra Met Office Hadley Centre Climate Programme (grant no. GA01101), the National Environmental Research Council (NERC), UK Earth System Modelling (grant no. NE/N017951/1), and the Natural Environment Research Council (NERC) and ACSIS (grant no. NE/N018044/1).
This paper was edited by Richard Neale and reviewed by Stephen M. Griffies, Justin Small, and Luke Van Roekel.