WOMBAT v1.0: a fully Bayesian global fluxinversion framework
 ^{1}School of Mathematics and Applied Statistics, University of Wollongong, Wollongong, Australia
 ^{2}School of Earth and Life Sciences, University of Wollongong, Wollongong, Australia
 ^{3}Climate Science Centre, CSIRO Oceans and Atmosphere, Aspendale, Australia
 ^{4}School of Chemistry, University of Bristol, Bristol, UK
 ^{}These authors contributed equally to this work.
 ^{1}School of Mathematics and Applied Statistics, University of Wollongong, Wollongong, Australia
 ^{2}School of Earth and Life Sciences, University of Wollongong, Wollongong, Australia
 ^{3}Climate Science Centre, CSIRO Oceans and Atmosphere, Aspendale, Australia
 ^{4}School of Chemistry, University of Bristol, Bristol, UK
 ^{}These authors contributed equally to this work.
Correspondence: Andrew ZammitMangion (azm@uow.edu.au)
Hide author detailsCorrespondence: Andrew ZammitMangion (azm@uow.edu.au)
WOMBAT (the WOllongong Methodology for Bayesian Assimilation of Tracegases) is a fully Bayesian hierarchical statistical framework for flux inversion of trace gases from flask, in situ, and remotely sensed data. WOMBAT extends the conventional Bayesian synthesis framework through the consideration of a correlated error term, the capacity for online bias correction, and the provision of uncertainty quantification on all unknowns that appear in the Bayesian statistical model. We show, in an observing system simulation experiment (OSSE), that these extensions are crucial when the data are indeed biased and have errors that are spatiotemporally correlated. Using the GEOSChem atmospheric transport model, we show that WOMBAT is able to obtain posterior means and variances on nonfossilfuel CO_{2} fluxes from Orbiting Carbon Observatory2 (OCO2) data that are comparable to those from the Model Intercomparison Project (MIP) reported in Crowell et al. (2019). We also find that WOMBAT's predictions of outofsample retrievals obtained from the Total Column Carbon Observing Network (TCCON) are, for the most part, more accurate than those made by the MIP participants.

Notice on corrigendum
The requested paper has a corresponding corrigendum published. Please read the corrigendum first before downloading the article.

Article
(4966 KB)
 Corrigendum

Supplement
(299 KB)

The requested paper has a corresponding corrigendum published. Please read the corrigendum first before downloading the article.
 Article
(4966 KB)  Corrigendum

Supplement
(299 KB)  BibTeX
 EndNote
Atmospheric carbon dioxide (CO_{2}) is a leading driver of global warming (e.g. Peters et al., 2013). If left unchecked, the rise in global temperatures will have a substantial negative impact on society and the environment (e.g. Edenhofer et al., 2014). As part of the worldwide effort to limit these impacts, the 2015 Paris Agreement under the United Nations Framework Convention on Climate Change, COP 21, called for a global stocktake of the sources and sinks of CO_{2} and other greenhouse gases, with the first evaluation planned for 2023 (UNFCCC, 2015). The rate at which CO_{2} is emitted (from sources) or absorbed (at sinks) per unit space and time is known as the CO_{2} flux, which itself varies spatiotemporally in a substantial manner. Despite the fact that it is human emissions that are driving the rise in atmospheric CO_{2} concentrations, the most uncertain aspects of quantifying CO_{2} fluxes at Earth's surface centre around natural processes. For example, while we know that the land and oceans absorb more than half of the CO_{2} emitted by human activities (e.g. Dlugokencky and Tans, 2020), the geographical and temporal patterns of these sinks remain unclear (e.g. Crowell et al., 2019).
Monitoring the progression of CO_{2} in our atmosphere is thus of utmost importance, and billions of dollars have been spent over the last few decades on research, development, and deployment of instruments for measuring CO_{2} mole fraction (defined here as the proportion of CO_{2} molecules in a given parcel of dry air) across the globe (e.g. Burrows et al., 1995; Kuze et al., 2009; Wunch et al., 2011a; Masarie et al., 2014; Eldering et al., 2017, 2019). However, CO_{2} mole fraction is only indirectly related to the key quantity of interest, namely the geographic distribution of the CO_{2} fluxes over time, which cannot be observed directly on regional scales. Identifying these sources and sinks spatially and temporally is an illposed inverse problem, often called a tracegas fluxinversion problem, whose solution requires the use of both an atmospheric transport model and a spatiotemporal model for the fluxes (e.g. Enting, 2002). In this paper, we present version 1.0 of a global fluxinversion system for the solution of this problem, which we call the WOllongong Methodology for Bayesian Assimilation of Tracegases (WOMBAT).
A global tracegas fluxinversion system is designed to infer fluxes from observational data, which are generally available either as pointreferenced (flask or in situ) measurements or columnaveraged remote sensing retrievals. The underlying model in an inversion system is usually a statespace model, where the fluxes, or a reduced representation thereof, are the latent states that need to be inferred from data via the use of an atmospheric chemical transport model (CTM). Computationally, flux estimation is done within a standard optimisation framework (e.g. Chevallier et al., 2005; Baker et al., 2006), either via full Bayesian synthesis (e.g. Enting, 2002, chap. 3; Mukherjee et al., 2011; Schuh et al., 2019) or via ensemble Kalman filtering (e.g. Peters et al., 2005; Feng et al., 2009). Inversion systems rely, to various extents, on realistic bottomup estimates of fluxes for the elicitation of an informative prior distribution, an accurate CTM that provides the link between the fluxes and the observed mole fraction, highquality unbiased measurements, and reliable uncertainty measures on each model component.
The complexity of all modelled processes, from fluxes right up to satellite retrieval errors, inevitably leads to model misspecification (e.g. Engelen et al., 2002). The main causes of misspecification are (i) fluxprocess dimensionreduction error (e.g. Kaminski et al., 2001), which is a consequence of using a spatiotemporal model for the flux field that is low dimensional and inflexible; (ii) an inaccurate prior flux mean, variance, and covariance (e.g. Philip et al., 2019); (iii) transport model errors (e.g. Houweling et al., 2010; Basu et al., 2018; Schuh et al., 2019) arising from the underlying assumed physics, meteorology, and discretisation schemes used (e.g. Lauvaux et al., 2019; McNorton et al., 2020); (iv) retrieval biases (e.g. O'Dell et al., 2018) and incorrect associated measurement error statistics (e.g. Worden et al., 2017); and (v) measurement error spatiotemporal correlations that are not fully accounted for (e.g. Chevallier, 2007; Ciais et al., 2010). Two other causes of model misspecification worth noting are an incorrectly specified initial global molefraction field and flux components assumed known in the inversion (i.e. assumed degenerate at their prior mean), such as anthropogenic emissions (e.g. Feng et al., 2019). The latter can be seen as a special case of (i) above, while the effect of the former can generally be minimised by using a realistic initial condition (e.g. Basu et al., 2013) and incorporating a burnin (or spinup) period in which early flux estimates are discarded.
In Sect. 2, we present the underlying statistical framework of WOMBAT, which addresses the implications of model misspecification in four ways: first, by using prior distributions to encode uncertainty over prior beliefs on the fluxes (sometimes referred to as hyperprior distributions; see, e.g. Ganesan et al., 2014; ZammitMangion et al., 2016), second, by adding a spatiotemporally correlated component of variability to the molefraction data model to address some of this (typically unmodelled) correlated model–data discrepancy (Brynjarsdóttir and O'Hagan, 2014), third, by explicitly modelling biases in the molefraction data model (generalising the approach of Basu et al., 2013), and fourth, by propagating uncertainty on all unknowns within a fully Bayesian statistical framework wherein inference is made using Markov chain Monte Carlo (MCMC). We note that while the benefits of MCMC are becoming increasingly apparent in regional tracegas inversions (e.g. Mukherjee et al., 2011; Ganesan et al., 2014; Miller et al., 2014; ZammitMangion et al., 2016), its use is still the exception rather than the rule in global tracegas inversions. The use of a spatiotemporally correlated component of variability leads to computational challenges, which are addressed in Sect. 3. We also note that our MCMC framework allows the uncertainties over the fluxes to be affected by uncertainties over all unknown parameters in the model, and it is thus different from the Monte Carlo approach to estimating flux uncertainty used by, for example, Chevallier et al. (2007) and Liu et al. (2014).
The fully Bayesian nature of the model, coupled with the introduction of a correlated process in modelling the molefraction field, leads to computational challenges. Section 3 details how we deal with these by defining a specific type of stochastic process on the irregularly located spatiotemporal errors, one that leads to a sparse precision matrix (e.g. Vecchia, 1988; Datta et al., 2016). Details on how this facilitates the MCMC scheme we implement are deferred to Appendix A. Section 4 discusses the experimental setup used for running, validating, and implementing WOMBAT on the satellite data analysed in this article. In Sect. 5, we first conduct an observing system simulation experiment (OSSE) in which the true fluxes are assumed known. Results from the OSSE demonstrate WOMBAT's validity and also illustrate the importance of modelling biases and correlated error terms when these are indeed present in the data. We then use WOMBAT to perform flux inversion using the Orbiting Carbon Observatory2 (OCO2) Version 7 retrospective (V7r) dataset as used in the model intercomparison project (MIP) of Crowell et al. (2019). Our model fitting reveals that about 80 % of the total error variance associated with the OCO2 data used for the MIP can be explained with a correlated model–data discrepancy term. WOMBAT accounts for this, which results in posterior distributions over the fluxes that, for the most part, corroborate the results from the ensemble inversions, both on a regional and on a global scale. In Sect. 5, we also show the utility of WOMBAT in carrying out online bias correction. Section 6 summarises the features of WOMBAT and discusses avenues for future work.
In this section we outline the spatiotemporal Bayesian hierarchical statistical model (e.g. Sect. 1.3, Wikle et al., 2019) that WOMBAT uses for global flux inversion. The model consists of four submodels: (1) a flux process model, (2) a molefraction process model, (3) a molefraction data model, and (4) a parameter model.
2.1 The flux process model
Let ${Y}_{\mathrm{1}}^{\mathrm{0}}(\mathit{s},t)$ denote the prior mean of the tracegas surface flux at spatial location s∈𝕊^{2} and time t∈𝒯, where 𝕊^{2} is the surface of Earth and $\mathcal{T}\equiv [{t}_{\mathrm{0}},{t}_{f}]$ is some time interval of interest. The field ${Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ could, for example, be treated as a linear regression (e.g. Michalak et al., 2004) or could be constructed using bottomup estimates of biospheric and/or anthropogenic fluxes (e.g. Basu et al., 2013).
In the same vein as conventional Bayesiansynthesis frameworks (e.g. Enting, 2002), we model the true flux, ${Y}_{\mathrm{1}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, as ${Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ plus a spatiotemporal field constructed through a sum of r spatiotemporal basis functions. These basis functions could be space–time step functions, as typically found in variational inversion systems (e.g. Chevallier et al., 2005), discretised flux “patterns” (e.g. Fan et al., 1998), or a generalpurpose basis such as a Fourier basis (e.g. Crowell et al., 2019, Appendix A4).
We denote the set of prespecified flux basis functions as $\mathit{\left\{}{\mathit{\varphi}}_{j}\right(\cdot ,\cdot ):j=\mathrm{1},\mathrm{\dots},r\mathit{\}}$. Our flux process model is a spatiotemporal stochastic process given by the following equation:
where the scaling factors $\mathit{\{}{\mathit{\alpha}}_{j}:j=\mathrm{1},\mathrm{\dots},r\mathit{\}}$ are unknown, are assigned a multivariate probability distribution, and need to be inferred in the inversion framework. Since we assume that $\mathrm{E}\left({Y}_{\mathrm{1}}\right(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \left)\right)={Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot ),$ we let E(α_{j})=0 for $j=\mathrm{1},\mathrm{\dots},r.$ For a given set of basis functions, the prior belief on the covariance structure of ${Y}_{\mathrm{1}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ is fully determined by that on $\mathit{\alpha}\equiv ({\mathit{\alpha}}_{\mathrm{1}},\mathrm{\dots},{\mathit{\alpha}}_{r}{)}^{\prime}$. When the flux is modelled on a space–time grid and space–time step functions are used as basis functions, a prior distribution on α that correlates the flux a priori in space and time is natural (Michalak et al., 2004; Chevallier et al., 2007; Basu et al., 2018). On the other hand, when using large spatial flux “patterns” that have temporally limited scope, it is generally reasonable to assume that any two of the α_{j} corresponding to basis functions in different spatial regions are uncorrelated but that those associated with the same spatial region are temporally correlated.
Irrespective of the choice of basis functions, in WOMBAT one expresses prior judgement on α through the model $\mathit{\alpha}\sim \mathrm{Gau}(\mathbf{\text{0}},{\mathbf{\Sigma}}_{\mathit{\alpha}})$, where Gau(μ,Σ) is a Gaussian probability density function of a random vector with mean μ and covariance matrix Σ. The covariance matrix Σ_{α} is parameterised through a parameter vector θ_{α}, which typically contains variances and spatiotemporal length scales in the covariances. Expert elicitation can be used to construct prior distributions on these parameters too; we describe possible prior distributions when discussing the parameter model in the Bayesian hierarchical model in Sect. 2.4.
The flux model of Eq. (1) may be improved by introducing a dimensionreduction error, also known as aggregation error, on the righthand side. This error accounts for the fact that the structured basis functions typically span a small (function) space and that they therefore cannot reproduce fluxes perfectly. However, since we are unable to deconvolve dimension reduction error from other sources of error (e.g. transport model error) in our molefraction data, we model the spatiotemporal variability it introduces collectively with other sources of error (Sect. 2.2).
2.2 The molefraction process model
We denote the true molefraction process at horizontal location s, vertical height h, and time t as ${Y}_{\mathrm{2}}(\mathit{s},h,t)$. We only model the mole fraction within our time interval of interest 𝒯, starting at time t_{0}, and therefore we express the true molefraction field within 𝒯 as a function of the initial molefraction process at t_{0} and the exogenous fluxprocess inputs in 𝒯. Specifically, ${\mathcal{T}}_{t}\equiv [{t}_{\mathrm{0}},t]$ is defined for ${t}_{\mathrm{0}}\le t\le {t}_{f}$ as the set of all time points in 𝒯 up to and including t. The field ${Y}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ is related to the flux process through the following relationship:
for $\mathit{s}\in {\mathbb{S}}^{\mathrm{2}},\phantom{\rule{0.25em}{0ex}}h\in {\mathbb{R}}^{+},\phantom{\rule{0.25em}{0ex}}t\in \mathcal{T}$, where ${Y}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{\mathrm{0}})$ is the molefraction field at time t_{0}, ${Y}_{\mathrm{1}}(\cdot \phantom{\rule{0.125em}{0ex}},{\mathcal{T}}_{t})$ is the flux field evolving over the whole time period 𝒯_{t}, and ℋ is an operator that solves the underlying chemical transport equations (that are approximately linear for longlived species such as CO_{2}; see, e.g. Enting, 2002, chap. 2). In practice, ℋ is not known perfectly, but we usually have at hand a reasonable approximation to it, $\widehat{\mathcal{H}}$, which is often referred to as the chemical transport model (CTM) or simply as the transport model. Similarly, we will usually have a reasonable approximation to ${Y}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{\mathrm{0}})$, which we call ${\widehat{Y}}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot ,,{t}_{\mathrm{0}})$. The use of ${\widehat{Y}}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{\mathrm{0}})$ instead of ${Y}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{\mathrm{0}})$, and of $\widehat{\mathcal{H}}$ instead of ℋ, leads to a residual term ${v}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ that will inevitably be spatiotemporally correlated (Enting, 2002, chap. 9). In particular,
for $\mathit{s}\in {\mathbb{S}}^{\mathrm{2}},\phantom{\rule{0.25em}{0ex}}h\in {\mathbb{R}}^{+},\phantom{\rule{0.25em}{0ex}}t\in \mathcal{T}$, where ${v}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ is the residual molefraction process arising from the use of an approximate initial molefraction field, imperfect meteorology inside the transport model, imperfect transport model parameters and physics, and potentially subgridscale variation in the molefraction field when $\widehat{\mathcal{H}}$ is a numerical model evaluated at a coarse resolution. It is difficult to place prior beliefs on the structure of ${v}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, which we model as statistical error, but it is known that using the approximation $\widehat{\mathcal{H}}$ introduces errors that could span hundreds of kilometres and several days (Lauvaux et al., 2019; McNorton et al., 2020). Transport model implementations tend to differ considerably in their vertical and interhemispheric mixing behaviour, and fluxinversion estimates are known to be particularly sensitive to transport model choice (Gurney et al., 2002; Schuh et al., 2019). Note that ${v}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ is also likely to depend on ${Y}_{\mathrm{1}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ and that we ignore this dependence for model simplicity in what follows.
The assumed linear behaviour of the underlying dynamics for CO_{2} is important. First, it allows us to model the effect of the approximate initial molefraction field, ${\widehat{Y}}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot ,{t}_{\mathrm{0}})$, separately from that of the fluxes (e.g. Enting, 2002, chap. 10) so that Eq. (3) is of the following form:
for s∈𝕊^{2}, $h\in {\mathbb{R}}^{+}$, and t∈𝒯. Second, it allows us to express the molefraction field as a linear combination of the individual responses from the basis functions used to construct ${Y}_{\mathrm{1}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, as we now show. Substituting Eq. (1) into Eq. (4), we have the following equation:
for s∈𝕊^{2}, $h\in {\mathbb{R}}^{+}$, and t∈𝒯, where ${Y}_{\mathrm{2}}^{\mathrm{0}}(\mathit{s},h,t)\equiv \widehat{\mathcal{H}}\left({\widehat{Y}}_{\mathrm{2}}\right(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{\mathrm{0}}),\mathrm{0};\mathit{s},h,t)+\widehat{\mathcal{H}}(\mathrm{0},{Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},{\mathcal{T}}_{t});\mathit{s},h,t)$; ${\widehat{\mathit{\psi}}}_{j}(\mathit{s},h,t)\equiv \widehat{\mathcal{H}}(\mathrm{0},{\mathit{\varphi}}_{j}(\cdot ,{\mathcal{T}}_{t});\mathit{s},h,t),$ for $j=\mathrm{1},\mathrm{\dots},r$, are basis functions in molefraction space, often termed “response functions” (e.g. Saeki et al., 2013); and ${v}_{\mathrm{2}}(\mathit{s},h,t)$ is the residual term given in Eq. (3). We assume that $\mathrm{E}\left({v}_{\mathrm{2}}\right(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot \left)\right)=\mathrm{0}$, and thus ${Y}_{\mathrm{2}}^{\mathrm{0}}(\mathit{s},h,t)$ can be seen as the prior expectation of the molefraction field at $(\mathit{s},h,t)$ under $\widehat{\mathcal{H}}$ and ${\widehat{Y}}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot ,{t}_{\mathrm{0}})$. That is, it is the molefraction field generated by running our CTM with the input fluxes set to the prior expected flux and with the molefraction field at t_{0} set to ${\widehat{Y}}_{\mathrm{2}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot ,{t}_{\mathrm{0}})$.
2.3 The molefraction data model
Fluxes cannot be observed directly at the spatial and temporal scales of interest. Flux inversion therefore proceeds by “constraining” the flux field using columnaveraged retrievals or pointreferenced measurements of mole fraction. We use Z_{2,i} to denote the ith molefraction measurement or retrieval, where $i\in \mathit{\{}\mathrm{1},\mathrm{\dots},m\mathit{\}}$ indexes the datum used in the inversion and m is the number of data used in the inversion.
Pointreferenced (PR) measurements of mole fraction are generally made at or near Earth's surface using instruments on towers or in aircraft. The molefraction data model for these measurements is therefore given by
where Z_{2,i} is the observed mole fraction at $({\mathit{s}}_{i},{h}_{i},{t}_{i})$, and ϵ_{i} is meanzero Gaussian measurement error, with a model for its variance parameter presented below in Sect. 2.4. Measurement errors associated with pointreferenced instruments are generally small and (usually) not correlated in space and time. Despite this, such data are not immune to the effects of spatiotemporal correlations induced by the CTM in the process model, and they may even be more susceptible than columnaveraged retrievals due to the combined effect of their usual proximity to the surface and the discretisations employed when simulating approximate transport (Rayner and O'Brien, 2001; Basu et al., 2018).
Columnaveraged (CA) retrievals, such as XCO_{2} (where “X” refers to the columnaveraged nature of the retrievals) from the OCO2 satellite or the Total Column Carbon Observing Network (TCCON) sites, are noisier than PR measurements, although TCCON is less noisy. In particular, since the raw spectral information collected for the retrieval is affected by environmental factors such as aerosols (O'Dell et al., 2012), the errors can contain biases and exhibit spatiotemporal correlations. These biases can also be instrumentmode dependent (e.g. land glint, LG, vs. land nadir, LN, retrievals for OCO2; see Sect. 4.4.1). The vertical averaging operation also involves an averaging kernel and an a priori bias correction, which are both specific to the retrieval and which arise from the algorithm used for the retrieval. In general, this relationship can be expressed as follows:
where (s_{i},t_{i}) is the space–time location of the retrieval, ${\widehat{\mathcal{A}}}_{i}$ is the assumed (but necessarily approximate) observation operator of the ith retrieval that columnaverages the mole fraction field via an averaging kernel, b_{i} is bias, ${v}_{{Z}_{\mathrm{2},i}}$ is meanzero spatiotemporally correlated random error, and ϵ_{i} is meanzero uncorrelated random error. The bias and error terms arise from the use of an approximate observation operator. Surfacebased or remotely sensed CA retrievals are sometimes provided as “biascorrected retrievals”. In this case, the data model for these retrievals is identical to Eq. (7), but with the bias component omitted.
Substituting Eq. (5) into Eqs. (6) and (7) we see that, in general, we have
where, for a PR measurement, ${\widehat{Z}}_{\mathrm{2},i}^{\mathrm{0}}\equiv {Y}_{\mathrm{2}}^{\mathrm{0}}({\mathit{s}}_{i},{h}_{i},{t}_{i})$, ${\widehat{\mathit{\psi}}}_{j,i}\equiv {\widehat{\mathit{\psi}}}_{j}({\mathit{s}}_{i},{h}_{i},{t}_{i})$, for $j=\mathrm{1},\mathrm{\dots},r$, ${b}_{i}=\mathrm{0};$ and ${\mathit{\xi}}_{i}\equiv {v}_{\mathrm{2}}({\mathit{s}}_{i},{h}_{i},{t}_{i});$ while for a CA retrieval, ${\widehat{Z}}_{\mathrm{2},i}^{\mathrm{0}}\equiv {\widehat{\mathcal{A}}}_{i}\left({Y}_{\mathrm{2}}^{\mathrm{0}}\right({\mathit{s}}_{i},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{i}\left)\right)$, ${\widehat{\mathit{\psi}}}_{j,i}\equiv {\widehat{\mathcal{A}}}_{i}\left({\widehat{\mathit{\psi}}}_{j}\right({\mathit{s}}_{i},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{i}\left)\right)$, for $j=\mathrm{1},\mathrm{\dots},r$; and ${\mathit{\xi}}_{i}\equiv {\widehat{\mathcal{A}}}_{i}\left({v}_{\mathrm{2}}\right({\mathit{s}}_{i},\cdot \phantom{\rule{0.125em}{0ex}},{t}_{i}\left)\right)+{v}_{{Z}_{\mathrm{2},i}}$. Biascorrected retrievals are given by Eq. (8) but with the bias component b_{i} omitted. Note that for identifiability reasons we have modelled all the possibly correlated error terms using one component, {ξ_{i}}.
Flux inversions can make use of both PR measurements and CA retrievals simultaneously, and hence it is convenient to provide a data model that encapsulates both types of measurements. It is also often the case that measurements from the same instrument type can be divided into groups that can be expected to have similar characteristics, such as groupspecific bias and error properties. A given group could contain, for example, PR data from the same in situ instrument or CA retrievals from a particular remote sensing instrument under a specific retrieval mode (e.g. land nadir). Hence, we consider the following general data model, where different groups have different terms, but the overall structure is the same:
where n_{g} is the number of groups; Z_{2,g} contains the data in group g; ${\widehat{\mathit{Z}}}_{\mathrm{2},g}^{\mathrm{0}}$ are the prior expected mole fractions in group g under the approximate transport model, the approximate molefraction field at t_{0}, and (if the groups consist of CA retrievals) under the approximate observation operators; ${\widehat{\mathbf{\Psi}}}_{g}$ are the response functions in group g evaluated at either the PR locations (in the case of PR measurements) or averaged over a column via an approximate observation operator (in the case of CA retrievals); b_{g}≡A_{g}β_{g} are groupspecific biases, with A_{g} a groupspecific design matrix and β_{g} the corresponding weights; ξ_{g} is the group g's vector of correlated errors; and ϵ_{g} is the group's vector of uncorrelated errors. When the data in group g are considered to be unbiased (or are already biascorrected data), the term A_{g}β_{g}=0. The variables constituting β_{g} and ϵ_{g}, for $g=\mathrm{1},\mathrm{\dots},{n}_{g},$ are mutually independent within and across groups.
The correlation between elements of ξ_{g} associated with measurements that are proximal in space and time is stronger than between those that are farther apart. However, while spatiotemporal correlation in model–data discrepancies is widely acknowledged (Chevallier, 2007; Ciais et al., 2010; Mukherjee et al., 2011; Miller et al., 2020), the general consensus is that using just the variance of ξ_{g} to add to the variance of the uncorrelated component is sufficient (e.g. Michalak et al., 2005; Basu et al., 2013; Deng et al., 2016). However, as we show in our OSSE in Sect. 5.1, even when a measurement error variance inflation factor is estimated, predictions of the flux worsen under the assumption of uncorrelated errors if the errors are truly correlated. The main reason not to routinely model spatiotemporal correlations in global flux inversion appears to be computational; we discuss a way to rectify this bottleneck in Sect. 3.
2.4 The parameter model
The parameter model (i.e. prior distributions on parameters) is dependent on the specification of the flux process model, the molefraction process model, and the molefraction data model. Here, we describe the parameter model we use in the OSSE and in the MIP comparison in Sect. 5.
2.4.1 Parameters of the flux process model
In the experiments given below in Sect. 5, our flux basis functions are from bottomup inventories that are divided into r_{s} spatial regions and r_{t} time spans. This construction yields $r={r}_{s}\times {r}_{t}$ basis functions, and it naturally suggests a temporal partitioning of α into $({\mathit{\alpha}}_{\mathrm{1}}^{\prime},\mathrm{\dots},{\mathit{\alpha}}_{{r}_{t}}^{\prime}{)}^{\prime}$, where each ${\mathit{\alpha}}_{k}\in {\mathbb{R}}^{{r}_{s}},$ for $k=\mathrm{1},\mathrm{\dots},{r}_{t}$. This in turn suggests that a suitable model for α is the vectorautoregressive process, similar to that used by Peters et al. (2005) and Dahlén et al. (2020). Specifically, ${\mathit{\alpha}}_{k+\mathrm{1}}=\mathbf{M}\left(\mathit{\kappa}\right){\mathit{\alpha}}_{k}+{\mathit{w}}_{k}$, for $k=\mathrm{1},\mathrm{\dots},{r}_{t}$, where, in our examples, we constrain the matrix M(κ) to be diagonal with nonzero elements equal to $\mathit{\kappa}\equiv ({\mathit{\kappa}}_{\mathrm{1}},\mathrm{\dots},{\mathit{\kappa}}_{{r}_{s}}{)}^{\prime}$, and we let ${\mathit{w}}_{k}\sim \mathrm{Gau}(\mathbf{\text{0}},{\mathbf{\Sigma}}_{w})$, where the precision matrix ${\mathbf{Q}}_{w}\equiv {\mathbf{\Sigma}}_{w}^{\mathrm{1}}$ is diagonal with positive elements ${\mathit{\tau}}_{w}\equiv ({\mathit{\tau}}_{w,j}:j=\mathrm{1},\mathrm{\dots},{r}_{s}{)}^{\prime}$. The fluxprocess parameters are therefore ${\mathit{\theta}}_{\mathit{\alpha}}\equiv ({\mathit{\kappa}}^{\prime},{\mathit{\tau}}_{w}^{\prime}{)}^{\prime}$, which in turn govern the covariance matrix of α, notated as Σ_{α}≡var(α). There is an ordering of α for which ${\mathbf{\Sigma}}_{\mathit{\alpha}}^{\mathrm{1}}$ is a block diagonal matrix with each block being a tridiagonal matrix (see Appendix A). Either sequential estimation (e.g. Kalman filtering and smoothing) or batch Bayesian updating can be used to make inference on α. In our case we use the latter, and we take advantage of efficient algorithms that are available for sparse linear algebraic computations.
We expect that {α_{k}} are positively correlated in time. Therefore, for the prior distributions for {κ_{j}}, we use the beta distribution, which has support on the interval [0,1]: independently, ${\mathit{\kappa}}_{j}\sim \mathrm{Beta}({a}_{\mathit{\kappa},j},{b}_{\mathit{\kappa},j}),$ for $j=\mathrm{1},\mathrm{\dots},{r}_{s},$ where {a_{κ,j}} and {b_{κ,j}} are fixed and assumed known. For prior distributions on the precision parameters {τ_{w,j}}, we use gamma distributions with shape parameters {ν_{w,j}} and rate parameters {ω_{w,j}}, which are fixed and assumed known: independently, ${\mathit{\tau}}_{w,j}\sim \mathrm{Ga}({\mathit{\nu}}_{w,j},{\mathit{\omega}}_{w,j}),$ for $j=\mathrm{1},\mathrm{\dots},{r}_{s}.$
Michalak et al. (2005) suggested that variance parameters could be estimated directly in a maximumlikelihood framework. The use of prior distributions on κ and τ_{w} adds an extra level of flexibility and allows the modeller to express the “uncertainty on the uncertainties” in an inversion framework (e.g. Ganesan et al., 2014). A related advantage is that the prior distributions can be used to provide information on the variance parameters even when the molefraction observations are not informative of the parameters. One could even configure these prior distributions to be extremely informative and effectively fix the prior model for the flux. The choice of prior distribution can be made on a regionbyregion basis, as is the case in our experiments (Sect. 4.5), where land regions are given largely uninformative priors, and ocean regions are given informative ones.
2.4.2 CA molefraction retrieval bias parameters
In Sect. 5, we consider OCO2 retrievals and a different set of bias parameters for each instrument mode. In this context, β_{g} is associated with a particular instrument mode and a single element of β_{g} captures the bias arising from, for example, aerosol presence. Experiments (see Sect. 5.3) reveal that these bias parameters are quite easily constrained in an inversion framework. When constructing the model for each β_{g}, we first standardise each row in A_{g} so that the covariates have unit marginal empirical variance. Then we model {β_{g}} as follows: independently, ${\mathit{\beta}}_{g}\sim \mathrm{Gau}(\mathbf{\text{0}},{\mathit{\sigma}}_{\mathit{\beta}}^{\mathrm{2}}\mathbf{I}),$ for $g=\mathrm{1},\mathrm{\dots},{n}_{g},$ with ${\mathit{\sigma}}_{\mathit{\beta}}^{\mathrm{2}}=\mathrm{100}$ and where I is the identity matrix. This choice for ${\mathit{\sigma}}_{\mathit{\beta}}^{\mathrm{2}}$ renders the prior distribution uninformative for the dataset sizes we consider in our experiments.
2.4.3 Model–data discrepancy and measurement error parameters
The retrievals used to perform inversions often come with prescribed variances, ${\mathit{v}}_{g}^{\mathrm{ps}}$, that account for both retrieval error, correlated or otherwise, and CTM error. For example, the MIP protocol of Crowell et al. (2019) prespecified these variances. We therefore let the total marginal variance of ξ_{g}+ϵ_{g} be equal to ${\mathit{\gamma}}_{g}\cdot {\mathit{v}}_{g}^{\mathrm{ps}},$ where the inflation factor parameter γ_{g} accounts for the possibility of misspecified variances (Worden et al., 2017). To deconvolve ξ_{g} and ϵ_{g}, we first construct the correlation matrix ${\mathbf{R}}_{{\mathit{\xi}}_{g}}\equiv \mathrm{corr}\left({\mathit{\xi}}_{g}\right)$ using a spatiotemporal correlation function ${R}_{{\mathit{\xi}}_{g}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}};{\mathbf{\ell}}_{{\mathit{\xi}}_{g}})$, where ${\mathbf{\ell}}_{{\mathit{\xi}}_{g}}$ are length scales that need to be inferred from data. We then enforce the total marginal variance constraint by defining the covariance matrices of ξ_{g} and ϵ_{g} to be ${\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}\equiv \mathrm{var}\left({\mathit{\xi}}_{g}\right)=\mathrm{diag}({\mathit{\rho}}_{g}{\mathit{\gamma}}_{g}\cdot {\mathit{v}}_{g}^{\mathrm{ps}}{)}^{\mathrm{1}/\mathrm{2}}{\mathbf{R}}_{{\mathit{\xi}}_{g}}\mathrm{diag}({\mathit{\rho}}_{g}{\mathit{\gamma}}_{g}\cdot {\mathit{v}}_{g}^{\mathrm{ps}}{)}^{\mathrm{1}/\mathrm{2}}$ and ${\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{g}}\equiv \mathrm{var}\left({\mathit{\u03f5}}_{g}\right)=\mathrm{diag}\left(\right(\mathrm{1}{\mathit{\rho}}_{g}){\mathit{\gamma}}_{g}\cdot {\mathit{v}}_{g}^{\mathrm{ps}})$. The parameter ρ_{g} $\in [\mathrm{0},\mathrm{1}]$ and represents the relative contribution of the correlated error variance (comprising both CTM error and, if present, correlated measurement error) to the total inflated prescribed variance.
We model the inflation factors {γ_{g}} using inversegamma distributions: independently, ${\mathit{\gamma}}_{g}\sim \mathrm{IG}({\mathit{\nu}}_{{\mathit{\gamma}}_{g}},{\mathit{\omega}}_{{\mathit{\gamma}}_{g}}),\phantom{\rule{1em}{0ex}}g=\mathrm{1},\mathrm{\dots},{n}_{g},$ where the shape and rate parameters, $\mathit{\left\{}{\mathit{\nu}}_{{\mathit{\gamma}}_{g}}\mathit{\right\}}$ and $\mathit{\left\{}{\mathit{\omega}}_{{\mathit{\gamma}}_{g}}\mathit{\right\}}$, are fixed and assumed known. We model the relative contribution factors {ρ_{g}} using standard uniform distributions: independently, ${\mathit{\rho}}_{g}\sim \mathrm{Unif}(\mathrm{0},\mathrm{1}),$ for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$. We use gamma prior distributions to model the length scales {ℓ_{g}} in the correlation function: independently, ${\mathrm{\ell}}_{g,j}\sim \mathrm{Ga}({\mathit{\nu}}_{{\mathrm{\ell}}_{g},j},{\mathit{\omega}}_{{\mathrm{\ell}}_{g},j}),$ for $j=\mathrm{1},\mathrm{\dots},{n}_{{\mathrm{\ell}}_{g}};\phantom{\rule{0.25em}{0ex}}g=\mathrm{1},\mathrm{\dots},{n}_{g}.$ We collect together the unknown parameters that determine the variances and covariances of the correlated component of the error into ${\mathit{\theta}}_{{\mathit{\xi}}_{g}}\equiv ({\mathit{\rho}}_{g},{\mathbf{\ell}}_{g}^{\prime}{)}^{\prime}$, for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$.
2.5 Summary of the Bayesian hierarchical model and computation
The Bayesian hierarchical model, which we use in Sect. 5, can be written succinctly as follows:

flux autoregressive parameters:
${\mathit{\kappa}}_{j}\sim \text{Beta}({a}_{\mathit{\kappa},j},{b}_{\mathit{\kappa},j})$, $j=\mathrm{1},\mathrm{\dots},{r}_{s}$; 
flux innovation precisions:
${\mathit{\tau}}_{w,j}\sim \text{Ga}({\mathit{\nu}}_{w,j},{\mathit{\omega}}_{w,j})$, $j=\mathrm{1},\mathrm{\dots},{r}_{s}$; 
flux scaling factors:
$\mathit{\alpha}\mid \mathit{\kappa},{\mathit{\tau}}_{w}\sim \text{Gau}(\mathbf{\text{0}},{\mathbf{\Sigma}}_{\mathit{\alpha}})$; 
measurement bias coefficients:
${\mathit{\beta}}_{g}\sim \text{Gau}(\mathbf{\text{0}},{\mathit{\sigma}}_{\mathit{\beta}}^{\mathrm{2}}\mathbf{I})$, $g=\mathrm{1},\mathrm{\dots},{n}_{g}$; 
error variance inflation factors:
${\mathit{\gamma}}_{g}\sim \text{IG}({\mathit{\nu}}_{{\mathit{\gamma}}_{g}},{\mathit{\omega}}_{{\mathit{\gamma}}_{g}})$, $g=\mathrm{1},\mathrm{\dots},{n}_{g}$; 
error length scales:
${\mathrm{\ell}}_{g,j}\sim \text{Ga}({\mathit{\nu}}_{{\mathrm{\ell}}_{g},j},{\mathit{\omega}}_{{\mathrm{\ell}}_{g},j})$, $j=\mathrm{1},\mathrm{\dots},{n}_{{\mathrm{\ell}}_{g}};\phantom{\rule{0.25em}{0ex}}g=\mathrm{1},\mathrm{\dots},{n}_{g}$; 
error proportion:
${\mathit{\rho}}_{g}\sim \mathrm{Unif}(\mathrm{0},\mathrm{1})$, $g=\mathrm{1},\mathrm{\dots},{n}_{g}$; 
likelihood:
${\mathit{Z}}_{\mathrm{2}}\mid \mathit{\beta},\mathit{\alpha},{\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}\sim \text{Gau}({\widehat{\mathit{Z}}}_{\mathrm{2}}^{\mathrm{0}}+\widehat{\mathbf{\Psi}}\mathit{\alpha}+\mathbf{A}\mathit{\beta},{\mathbf{\Sigma}}_{\mathit{\xi}}+{\mathbf{\Sigma}}_{\mathit{\u03f5}})$,
where ${\mathit{Z}}_{\mathrm{2}}\equiv ({\mathit{Z}}_{\mathrm{2},\mathrm{1}}^{\prime},\mathrm{\dots},{\mathit{Z}}_{\mathrm{2},{n}_{g}}^{\prime}{)}^{\prime}$, ${\widehat{\mathit{Z}}}_{\mathrm{2}}^{\mathrm{0}}\equiv ({\widehat{\mathit{Z}}}_{\mathrm{2},\mathrm{1}}^{{\mathrm{0}}^{\prime}},\mathrm{\dots},{\widehat{\mathit{Z}}}_{\mathrm{2},{n}_{g}}^{{\mathrm{0}}^{\prime}}{)}^{\prime}$, $\widehat{\mathbf{\Psi}}\equiv ({\widehat{\mathbf{\Psi}}}_{\mathrm{1}}^{\prime},\mathrm{\dots},{\widehat{\mathbf{\Psi}}}_{{n}_{g}}^{\prime}{)}^{\prime}$, $\mathbf{A}\equiv \mathrm{bdiag}({\mathbf{A}}_{\mathrm{1}},\mathrm{\dots},{\mathbf{A}}_{{n}_{g}})$, $\mathit{\beta}\equiv ({\mathit{\beta}}_{\mathrm{1}}^{\prime},\mathrm{\dots},{\mathit{\beta}}_{{n}_{g}}^{\prime}{)}^{\prime}$, ${\mathbf{\Sigma}}_{\mathit{\xi}}\equiv \mathrm{bdiag}({\mathbf{\Sigma}}_{{\mathit{\xi}}_{\mathrm{1}}},\mathrm{\dots},{\mathbf{\Sigma}}_{{\mathit{\xi}}_{{n}_{g}}})$, ${\mathbf{\Sigma}}_{\mathit{\u03f5}}\equiv \mathrm{bdiag}({\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{\mathrm{1}}},\mathrm{\dots},{\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{{n}_{g}}})$, and ${\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}\equiv \left(\right({\mathit{\theta}}_{{\mathit{\xi}}_{g}}^{\prime},{\mathit{\gamma}}_{g}):g=\mathrm{1},\mathrm{\dots},{n}_{g}{)}^{\prime}$. Here, bdiag(⋅) constructs a blockdiagonal matrix from its arguments. A graphical model summarising the relationships between the variables is given in Fig. 1.
The joint posterior distribution over all quantities can be estimated using a Gibbs sampler, which successively “updates” parameters using their full conditional distributions. When the conditional distributions cannot be sampled from directly (in particular, for the parameters κ, ℓ_{g}, and ρ_{g}, for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$), we employ a slice sampler (Neal, 2003) to obtain samples. Details are given in Appendix A.
The posterior distribution over all the unknown parameters in our model is given by
The log of the first two terms on the righthand side of Eq. (10) are expressions that are commonly seen in optimisationbased fluxinversion frameworks. In particular, we have
where ${\mathit{Z}}_{\mathrm{2},p}(\mathit{\beta},\mathit{\alpha})\equiv {\widehat{\mathit{Z}}}_{\mathrm{2}}^{\mathrm{0}}+\widehat{\mathbf{\Psi}}\mathit{\alpha}+\mathbf{A}\mathit{\beta}$; in our case we set α_{p}=0, and “const.” denotes a constant. The primary differences between our framework and the usual optimisationbased fluxinversion frameworks are the presence of the log determinants, which penalise covariance matrices that have large determinants (large correlations and/or large variances), and the presence of offdiagonal elements in the matrix Σ_{ξ}+Σ_{ϵ}. Note that the log determinants can only be omitted when all covariance matrices are considered known a priori.
The computational complexity of the Gibbs sampler is dominated by that of the loglikelihood function, $\mathrm{log}p({\mathit{Z}}_{\mathrm{2}}\mid \mathit{\beta},\mathit{\alpha},{\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}),$ which is the sum of the groupwise loglikelihood functions, $\mathrm{log}p({\mathit{Z}}_{\mathrm{2},g}\mid \mathit{\beta},\mathit{\alpha},{\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}})$, for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$. For computationally efficient inference, we must ensure that each groupwise loglikelihood function is simple to evaluate. From Eq. (9), the groupwise loglikelihood is given by
for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$, where ${\mathbf{\Sigma}}_{g}\equiv {\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}+{\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{g}}$ and m_{g} is the number of observations and/or retrievals in group g. From a computational perspective there are two components in Eq. (11) that can present difficulties. The first component is the matrix ${\widehat{\mathbf{\Psi}}}_{g}$; this matrix is dense, and its number of elements scales linearly with both the number of data points and the number of basis functions, r. Fortunately, this matrix only needs to be evaluated once using the CTM, and it can typically be generated efficiently on a large parallel computing infrastructure. The second component is the matrix Σ_{g}, which is of size m_{g}×m_{g} and generally dense. Recall from Sect. 2.4 that this covariance matrix is constructed from ${\mathit{\gamma}}_{g},{\mathbf{\ell}}_{{\mathit{\xi}}_{g}},$ and ρ_{g}, which are sampled within the Gibbs sampler. Therefore, this covariance matrix needs to be reconstructed at each sampler iteration. This is infeasible for the m_{g}≈100 000 retrievals used in this study.
We deal with the denseness of Σ_{g} by using an approximation first proposed by Vecchia (1988). Here, one first orders the elements of ξ_{g}. Following this, one approximates the joint distribution of ξ_{g} as,
where 𝒩_{g,i} is the “past” neighbour set of the ith datum in group g, which contains a (very small) subset of the integers between, and including, 1 and (i−1). It can be shown that this formulation leads to a valid distribution for ξ_{g} that approximates the true joint distribution. The approximate distribution is Gaussian with mean 0 and a sparse precision matrix, ${\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}^{\mathrm{1}}$, with the degree of sparsity closely connected to the sizes of the sets $\mathit{\{}{\mathcal{N}}_{g,i}:i=\mathrm{1},\mathrm{\dots},{m}_{g}\mathit{\}}$.
In the version of WOMBAT presented here, we consider a special case of Eq. (12), where the observations are ordered in time and where the correlation function ${R}_{{\mathit{\xi}}_{g}}(\cdot ,\cdot \phantom{\rule{0.125em}{0ex}};{\mathbf{\ell}}_{{\mathit{\xi}}_{g}})$ is simply an exponential function of temporal separation. In this case, one only needs to consider one (temporal) lengthscale parameter per group, ${\mathrm{\ell}}_{g,\mathrm{1}},$ for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$. The motivations for this simplification are twofold. First, the remote sensing instrument we consider in Sect. 5 flies on a satellite that is in a sunsynchronous orbit, and thus correlation in time is a proxy for alongtrack correlations. This model for characterising correlation in the errors was suggested and used by Chevallier (2007). Second, the use of an exponential correlation function allows the approximation in Eq. (12) to become an equality, where ${\mathcal{N}}_{g,i}=i\mathrm{1}$. This is a manifestation of the socalled “screening effect”, where the exponential correlation function induces a firstorder conditionalindependence structure. Now, ${\mathbf{\Sigma}}_{g}^{\mathrm{1}}=({\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}+{\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{g}}{)}^{\mathrm{1}}$ and $\left{\mathbf{\Sigma}}_{g}\right={\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}+{\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{g}}$ , where ${\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}^{\mathrm{1}}$ is very sparse and ${\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{g}}^{\mathrm{1}}$ is diagonal. Efficient computations of Eq. (11) therefore follow by expressing ${\mathbf{\Sigma}}_{g}^{\mathrm{1}}$ and $\left{\mathbf{\Sigma}}_{g}\right$ in terms of these sparse matrices. Specifically, ${\mathbf{\Sigma}}_{g}^{\mathrm{1}}$ and $\left{\mathbf{\Sigma}}_{g}\right$ are evaluated through the use of the Sherman–Morrison–Woodbury matrix identity and a matrixdeterminant lemma (e.g. Henderson and Searle, 1981, and Appendix A).
This section gives the setup needed for Sect. 5, where we compare the inversions from WOMBAT v1.0 to those from the OCO2 MIP (Crowell et al., 2019). In the MIP, inversions followed a predefined protocol; we therefore configured WOMBAT to follow the same protocol. The MIP prescribed the data to be used, including both preprocessed pointreferenced data and remotely sensed data from OCO2 between 6 September 2014 and 1 April 2017. Participants were tasked to provide flux estimates for the years 2015 and 2016. The protocol also specified a fossilfuel flux field that had to be assumed fixed and known, in order to facilitate the interpretation of the differences in flux estimates obtained by the different participants. All other modelling choices (e.g. transport model, prior fluxes) were left to individual participants.
4.1 Prior expected flux
Our prior expectation of the spatiotemporal flux process, ${Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, is constructed from inventories of different types of fluxes through the following decomposition:
where ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{ff}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ corresponds to fossilfuel emissions, ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{bio}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ to terrestrial biospheric fluxes, ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{bf}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ to biofuel emissions, ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{fire}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ to fire emissions, and ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{ocn}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ to ocean–air exchange fluxes. We now describe these components in more detail.

Fossilfuel emissions. For ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{ff}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ we use the Opensource Data Inventory for Anthropogenic CO_{2} monthly fossilfuel emissions (ODIAC2016; Oda and Maksyutov, 2011; Oda et al., 2018) with Temporal Improvements for Modeling Emissions by Scaling (TIMES) weekly scaling factors (Nassar et al., 2013). This term also includes emissions from international aviation and shipping. The fossilfuel fluxes are prescribed by the MIP protocol.

Biospheric flux. This flux is a result of the interaction between the atmosphere and trees, shrubs, grasses, soils, dead wood, leaf litter, and other biota. It is defined by the quantity $\text{GPP}{R}_{\mathrm{A}}{R}_{\mathrm{H}}$, where GPP stands for gross primary production (the uptake of carbon by plants due to photosynthesis), R_{A} is autotrophic respiration (the release of carbon through respiration by plants), and R_{H} is heterotrophic respiration (the release of carbon through the metabolic action of bacteria, fungi, and animals). For ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{bio}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ we use one of the two priors used in CarbonTracker 2019 (Jacobson et al., 2020), specifically that based on the Carnegie–Ames–Stanford Approach (CASA) biogeochemical model (Potter et al., 1993; Giglio et al., 2013).

Biofuel emissions. These emissions result from the burning of wood, charcoal, and agricultural waste for energy, as well as the burning of agricultural fields. For ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{bf}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ we use the estimates of Yevich and Logan (2003) that in turn were based on data from 1985.

Fire emissions. These emissions correspond to those from vegetative fires (wildfires), which may either start naturally or be started by humans. For ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{fire}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ we use the Quick Fire Emissions Dataset, version 2 (QFED2; Darmenov and da Silva, 2015).

Ocean–air exchange. These fluxes are a result of ocean–air differences in partial pressure of CO_{2}. For ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{ocn}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ we use the estimates of Takahashi et al. (2002), with annual scalings reflecting increasing uptake of CO_{2} as described by Takahashi et al. (2009).
A summary of these components is provided in Table 1.
Oda and Maksyutov (2011)Nassar et al. (2013)Oda et al. (2018)Potter et al. (1993)Giglio et al. (2013)Jacobson et al. (2020)Yevich and Logan (2003)Darmenov and da Silva (2015)Takahashi et al. (2002, 2009)4.2 Basis functions
We divided the globe into the r_{s}=22 disjoint TransCom3 regions (Gurney et al., 2002) and time into the r_{t}=31 months between (and including) September 2014 and March 2017, and then we constructed one flux basis function for each region–month pair. This yielded $r={r}_{s}\times {r}_{t}=\mathrm{682}$ basis functions, each with nonzero support in a space–time volume spanning 1 month in time, and one TransCom3 region in space. Half of the TransCom3 regions are land regions and half are ocean regions, and thus half of our basis functions correspond to land areas and half to ocean areas. We show the TransCom3 regions in Fig. S1 and their codes and labels in Table S1, both of which can be found in the Supplement. Some areas of the globe, depicted in white in Fig. S1, are assumed to have zero flux; all of our basis functions are zero in these regions.
For ${\mathit{\varphi}}_{j}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, a basis function corresponding to a land area, we have
where ${D}_{j}^{\mathit{\varphi}}$ is the spacetime volume over which the jth basis function is defined to be nonzero. For ${\mathit{\varphi}}_{j}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ a basis function corresponding to an ocean area, we have
Both Eqs. (14) and (15) exclude ${Y}_{\mathrm{1}}^{\mathrm{0},\mathrm{ff}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$. This is done to meet the MIP requirement that fossilfuel fluxes are treated as fixed and known (which is common practice in flux inversion; see, e.g. Basu et al., 2013). The influence of fossil fuels on the molefraction field is therefore present only as an invariant component of the prior expectation of the molefraction field. Note that we have used the same inventories to construct ${Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot ,\cdot )$ and the basis functions ${\mathit{\varphi}}_{j}(\cdot ,\cdot ),j=\mathrm{1},\mathrm{\dots},r$; this was done for convenience, and different inventories could be used if needed.
Since the spatiotemporal patterns of the fluxes within a region–month space–time volume are fixed, our construction is quite restrictive. However, the space–time patterns are dictated by those in the inventories used to construct the basis functions. Although there is a general lack of agreement between inventories targeting the same processes (e.g. Huntzinger et al., 2017), these spatiotemporal patterns would not be unreasonable and certainly more reasonable than those from generic basis functions commonly used in spatial statistics (e.g. Wikle et al., 2019, chap. 4). This underlying assumption is often made in fluxinversion systems; for example, Jacobson et al. (2020) scale 3hourly fluxes using weekly scale factors over 156 regions, while Basu et al. (2013) use monthly scale factors for 3hourly fluxes over ${\mathrm{6}}^{\circ}\times {\mathrm{4}}^{\circ}$ grid cells. Constraining the spatiotemporal pattern is inferentially advantageous because it helps address the illposed nature of flux inversion. It is also computationally advantageous because it reduces the number of unknowns for which inference is needed. The disadvantage is that the reliance on a priori structures increases the risk of dimension reduction error because, while our basis functions allow the posterior fluxes to vary at subTransCom3region scales, variations that do not follow the prescribed pattern are necessarily ignored. Therefore, if one wishes to make inference at scales that are finer than those resolved by the scaling factors, one should introduce additional basis functions for those regions and time spans. Moreover, for processes where there is disagreement (such as biogeochemical processes), one may consider running separate inversions with basis functions constructed from different inventories and carry out a sensitivity analysis. We note that there is a considerable body of work tackling basisfunction choice in the context of inversion; see, for example, Turner and Jacob (2015).
As described in Sect. 2.2, for $j=\mathrm{1},\mathrm{\dots},\mathrm{682}$, each flux basis function ${\mathit{\varphi}}_{j}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ has a corresponding molefraction basis function ${\widehat{\mathit{\psi}}}_{j}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, which may be constructed by running the transport model, $\widehat{\mathcal{H}}$, under the flux ${Y}_{\mathrm{1}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )+{\mathit{\varphi}}_{j}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$. Then the molefraction basis function ${\widehat{\mathit{\psi}}}_{j}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ is recovered through linearity by simply subtracting ${Y}_{\mathrm{2}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ from the output molefraction field. For illustration, Fig. 2 shows examples of the flux basis functions (monthly averaged) for January 2016 and the regions TransCom3 02 and TransCom3 06, and it also shows snapshots of the corresponding molefraction basis functions (daily and atmosphericcolumn averaged) obtained using the transport model described next in Sect. 4.3.
4.3 Transport model and initial condition
For our approximate transport model, $\widehat{\mathcal{H}}$, we used the GEOSChem global 3D chemical transport model, version 12.3.2 (Bey et al., 2001; Yantosca, 2019), driven by the GEOSFP meteorological fields from NASA's Goddard Earth Observing System (Rienecker et al., 2008). We use the offline GEOSChem CO_{2} simulation (Nassar et al., 2010), with the native horizontal resolution of ${\mathrm{0.25}}^{\circ}\times {\mathrm{0.3125}}^{\circ}$ and 72 vertical levels aggregated to ${\mathrm{2}}^{\circ}\times {\mathrm{2.5}}^{\circ}$ and 47 vertical levels for computational efficiency. We use a transport time step of 10 min and a flux time step of 20 min. All fluxes described in Sects. 4.1 and 4.2 were implemented in GEOSChem using the HEMCO emissions component (Keller et al., 2014). GEOSChem can be configured to allow for a 3D chemical source of CO_{2} due to oxidation of other trace gases, but this was disabled for compatibility with the OCO2 MIP.
The approximate initial condition, $\widehat{{Y}_{\mathrm{2}}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot ,{t}_{\mathrm{0}})$, specifies the molefraction field at the beginning of the study period on 1 September 2014. For our initial molefraction field, we used a modified version of that generated by Bukosa et al. (2019). This molefraction field was constructed using a spinup period, starting on 1 January 2005 and ending on 1 September 2014, with transport driven by inventory fluxes and meteorology (that in some cases differ from those we use here; see Bukosa et al., 2019, for details). At the end of the spinup period, the whole molefraction field on 1 September 2014 was scaled such that the value in the surface grid cell containing the South Pole was equal to the monthly averaged molefraction measurement from surface flask measurements at the South Pole (Thoning et al., 2020) in September 2014. The South Pole was chosen as our calibration point due its physical isolation from strong sources and sinks.
4.4 Data
This study uses a subset of the data sources in the MIP (Crowell et al., 2019). These include retrievals of columnaveraged CO_{2} by NASA's OCO2 satellite (Eldering et al., 2017) and retrievals of columnaveraged CO_{2} from TCCON (Wunch et al., 2011a). As in the MIP, we use OCO2 data to estimate CO_{2} fluxes and TCCON data to validate the estimates.
4.4.1 OCO2
The OCO2 satellite was launched in 2014 with the goal of retrieving atmospheric CO_{2} mole fractions. The onboard instrument measures radiances in three nearinfrared spectral bands, which in turn are used to retrieve the CO_{2} mole fraction on 20 vertical levels via a retrieval algorithm based on Bayesian optimal estimation (Rodgers, 2000; O'Dell et al., 2012). The retrieved levels are columnaveraged and then biascorrected through comparison with TCCON retrievals (Wunch et al., 2011a). The OCO2 team releases regular revisions of the retrieval dataset.
OCO2 radiance measurements are taken in three distinct pointing modes: nadir mode, where the satellite aims at the point directly beneath it; glint mode, where the satellite points at the reflection of the sun on the surface; and target mode, where the satellite aims at a specific target, typically a ground station that also measures CO_{2} mole fractions. Target observations are generally excluded from flux inversions and used only for instrument calibration. Over the ocean, nadir measurements have insufficient signaltonoise ratio to provide useful retrievals, while over land both nadir and glint retrievals are made. There are therefore three retrieval modes to consider, land glint (LG), land nadir (LN), and ocean glint (OG). The error properties of retrievals over land differ significantly from those over ocean; in particular, the OG retrievals in the V7r dataset (the dataset used in the MIP) are not considered reliable and were therefore excluded from the MIP (Crowell et al., 2019). We follow this protocol and only do inversions using LG and LN data.
The MIP protocol dictated the use of a postprocessed version of the V7r retrievals; this postprocessing was done as follows. First, an additional biascorrection term related to highalbedo measurements was applied to the XCO_{2} retrievals. The biascorrected retrievals were then grouped and averaged into 1 s bins, and following this they were further grouped and averaged into 10 s bins. The 10 s spans correspond to ground swathes of approximately 67 km in length. The standard deviation for each 10 s retrieval was computed as a function of the individual retrieval standard deviations, with an additional model–data mismatch term added to account for the expected differences arising from transport model errors. For the MIP, the 10 s averages were assumed to be independent but, following Sect. 2.4, we treat them as dependent. For more details on how the 10 s averages were computed, including how the standard errors were derived, see Crowell et al. (2019).
The OCO2 retrieval algorithm produces estimates of XCO_{2}. In Eq. (7), we encapsulate the retrieval algorithm in the observation operator, ${\widehat{\mathcal{A}}}_{i}$. Appendix B gives more details on this observation operator in the case of OCO2 retrievals.
4.4.2 TCCON
TCCON is a network of groundbased sites measuring solar radiances in the nearinfrared spectral band (Wunch et al., 2011a). Similar to the way OCO2 retrievals are obtained, these measurements are converted to retrievals of columnaveraged CO_{2} (and other gases) using a retrieval algorithm. TCCON retrievals have been adjusted to agree with World Meteorological Organization (WMO) tracegas measurement scales, and validated using aircraft data (Wunch et al., 2010). As both TCCON and remote sensing instruments retrieve columnaveraged mole fractions, the TCCON data are an important validation resource (Wunch et al., 2011b). The MIP used TCCON columnaveraged CO_{2} retrievals from the GGG2014 release as validation data, including all retrievals available as of 6 July 2017. In the MIP, outliers and retrievals corresponding to retrievals with high solar zenith angle were removed. The remaining TCCON retrievals were then averaged over 30 min intervals; further details are given by Crowell et al. (2019). We note that the filtering procedure used in the MIP occasionally led to long periods of time for which data were considered missing. For consistency with the MIP, in this study we used the same retrievals and postprocessing methods; the stations used are listed in Table 2.
Feist et al. (2014)Deutscher et al. (2015)Notholt et al. (2014)Wennberg et al. (2015)Griffith et al. (2014a)Iraci et al. (2016)Strong et al. (2016)Blumenstock et al. (2014)Hase et al. (2015)Wennberg et al. (2016)Sherlock et al. (2014)Dubey et al. (2014)Warneke et al. (2014)Wennberg et al. (2014)De Mazière et al. (2014)Kawakami et al. (2014)Kivi et al. (2014)Morino et al. (2016)Griffith et al. (2014b)Like OCO2, TCCON retrievals also have an associated observation operator ${\widehat{\mathcal{A}}}_{i}$. This has a similar form to the operator for OCO2, which is described in Appendix B. A detailed description of the TCCON observation operator is given by Wunch et al. (2011b).
4.5 Prior distributions over the parameters
The prior distributions for the parameters governing the scaling factors, α, are specified separately for the land and ocean TransCom3 regions. The land regions, which are observed directly by OCO2 when in LG or LN mode, are assigned a noninformative prior, while the indirectly observed ocean regions (which also have relatively small fluxes over a given area) are assigned a relatively informative prior. Informative priors for ocean fluxes were deemed necessary following OSSEs performed by us that revealed that it is often not possible to reliably constrain ocean fluxes from OCO2 land data.
Specifically, for j corresponding to a land region, we assigned a prior to κ_{j} by letting ${a}_{\mathit{\kappa},j}={b}_{\mathit{\kappa},j}=\mathrm{1}$ (resulting in a uniform prior over [0,1]), and for τ_{w,j} we let ${\mathit{\nu}}_{w,j}=\mathrm{0.354}$ and ${\mathit{\omega}}_{w,j}=(\mathrm{1}{\mathit{\kappa}}_{j}^{\mathrm{2}})/\mathrm{0.0153}$. This prior on τ_{w,j} implies that $\mathrm{1}/{\mathit{\tau}}_{w,j}$, the marginal variance of the elements of the scalings in α that correspond to land regions, has 5 % and 95 % percentiles of 0.01 and 10, respectively, which is reasonably uninformative. For j corresponding to an ocean region, we apply an independent and identically distributed Gaussian prior with mean zero and standard deviation of 0.5 to α_{j}. This is achieved by fixing κ_{j}=0 and ${\mathit{\tau}}_{w,j}=\mathrm{4}$.
As described in Sect. 4.4.1, the OCO2 MIP 10 s averages come with prescribed uncertainties that include both measurement error and transport model error. In our framework, the parameters governing these error processes are γ_{g}, ρ_{g}, and ℓ_{g,1}. For the prior distribution of γ_{g}, we let ${\mathit{\nu}}_{{\mathit{\gamma}}_{g}}=\mathrm{1.627}$ and ${\mathit{\omega}}_{{\mathit{\gamma}}_{g}}=\mathrm{2.171}$, which lead to 5 % and 95 % prior percentiles of 0.5 and 10, respectively, while we used a uniform prior distribution on [0,1] for ρ_{g}. For ℓ_{g,1}, we let ${\mathit{\nu}}_{{\mathrm{\ell}}_{g},\mathrm{1}}=\mathrm{1}$ and ${\mathit{\omega}}_{{\mathrm{\ell}}_{g},\mathrm{1}}=\mathrm{1}\phantom{\rule{0.33em}{0ex}}{\text{min}}^{\mathrm{1}}$. When doing bias correction online, we used the prior on β described in Sect. 2.4.
4.6 Computation
Computations were performed in two stages. In the first stage, the 682 molefraction basis functions were precomputed in the manner described in Sect. 4.2. This is the most computationally demanding step, as each basis function requires the CTM to be run for several days of clock time, on average. Fortunately, since every basis function can be computed independently from all others, computing them is an embarrassingly parallel problem. Furthermore, since the basis functions are shared between the different inversions in this section, they only need to be computed once. Computation of the basis functions took 7 d in total using the Gadi supercomputer at the Australian National Computational Infrastructure.
The inversions were performed in the second stage. As mentioned in Sect. 2.5, the posterior distribution for each inversion was estimated using an MCMC sampling scheme, with details given in Appendix A. The sampling schemes in all cases were run for 11 000 iterations, and the first 1000 iterations were discarded as burnin. Convergence of the MCMC chain was confirmed by inspection of all the trace plots. In our studies, we found that the vast majority of posterior distributions were different from the prior distributions, and this is not surprising. Although complex, the model used in WOMBAT is lowdimensional, and in this setup the total number of unknowns is 732 (r=682 of which are flux scaling factors), which is orders of magnitude less than the number of LG and LN data available for the inversion (114 808 and 129 203, respectively). Specifically, these data prove to be highly informative of the unknown parameters in our model. Generally, one need not be overly concerned if a parameter is poorly constrained by the data. In such cases, a Bayesian framework such as WOMBAT returns a posterior distribution over the poorly constrained parameter that tends toward the prior distribution, which in turn encapsulates the a priori belief on the plausible range of values the parameter can take.
The MCMC scheme was implemented in the R programming language (R Core Team, 2020), with intensive linear algebraic computations offloaded for performance to a graphics processing unit (GPU) using Tensorflow (Abadi et al., 2016). The total running time of the sampler depended on which model assumptions were used; specifically, whether uncorrelated (15 min) or correlated errors (2 h) were modelled. The bottleneck leading to a drastic increase in computing time when modelling correlated errors is due to the term ${\widehat{\mathbf{\Psi}}}_{g}^{\prime}({\mathbf{\Sigma}}_{{\mathit{\xi}}_{g}}+{\mathbf{\Sigma}}_{{\mathit{\u03f5}}_{g}}{)}^{\mathrm{1}}{\widehat{\mathbf{\Psi}}}_{g}$ in Eq. (A7), which needs to be reevaluated at each MCMC iteration. This operation scales as O(r^{3}+nr^{2}); on hardware current to the year 2021, r needs to be less than 10 000 for computations to remain tractable. On the other hand, when the errors are assumed to be uncorrelated or the lengthscale parameters are assumed known, many matrix computations can be done only once (and not at each MCMC iteration); in this case the bottleneck becomes memory, and on current stateoftheart servers one may accommodate an order of magnitude more basis functions. All inversions were performed on a machine with an eightcore Intel i99900K CPU running at 3.60 GHz and an NVIDIA RTX 2080 GPU.
In this section we evaluate WOMBAT, first in an OSSE, where the true fluxes are assumed known and data are simulated from these true fluxes, and then on actual satellite data via the MIP protocol of Crowell et al. (2019). Using an OSSE, described in Sect. 5.1, serves two purposes: first, to show that WOMBAT can indeed recover the true fluxes in a controlled environment where the “working model” is the “true model”, and second to illustrate the importance of modelling measurement error biases and correlated errors when these are present in the true model from which the data are simulated. Following this, in Sect. 5.2 we show that WOMBAT gives similar flux estimates to those obtained by different MIP participants and that it performs well relative to the MIP participants in reproducing outofsample TCCON validation data. In Sect. 5.3 we show that WOMBAT is able to estimate bias coefficients online, if needed.
5.1 Observing system simulation experiment (OSSE)
In this section we illustrate the use of WOMBAT in an OSSE, where we randomly draw flux scaling factors α^{s} from a Gaussian distribution with mean 0 and covariance matrix 0.09I, and assume that these are the true flux scaling factors. The (simulated) true flux ${Y}_{\mathrm{1}}^{s}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ is given by
where ${\mathit{\alpha}}_{j}^{s}$ is the jth element of α^{s}. The (simulated) true molefraction field, ${Y}_{\mathrm{2}}^{s}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$, is then given by
Finally, we simulate measurements from the molefraction data model in Eq. (9) at the same times and locations as the OCO2 10 s average retrievals for the LN and LG modes by passing ${Y}_{\mathrm{2}}^{s}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$ through the corresponding OCO2 observation operators (see Sect. 4.4.1).
When simulating data via Eq. (9), we assume that both b_{g} and ξ_{g} are present. For the bias term b_{g}, we assume that the OCO2 retrieval biases are a linear combination of covariates that are associated with bias in the retrieval process:

“dp” is the prior–retrieval surface pressure differential;

“logDWS” is the logarithm of the total retrieved optical depth associated with the aerosol types dust, water cloud, and sea salt;

“co2_grad_del” is the difference between the retrieved CO_{2} mole fractions at the surface and retrieval vertical level 13 (corresponding to the height with air pressure equal to 63.2 % of the surface pressure, which is around 520 to 650 hPa for most retrievals).
The “official” V7r biascorrection parameters (regression coefficients) for the original Level 2 (L2) data release were obtained through offline comparison of the raw L2 product with TCCON retrievals, and they are the same for both LG and LN observations. They are equal to 0.3, 0.028, and 0.6 for the three variables, respectively. We construct our (simulated) true biases based on these coefficients.
As discussed in Sect. 2.4, we assume that the prescribed variance of each retrieval needs to be inflated, and the inflated variance is the sum of the variance from both the correlated (ξ_{g}) and uncorrelated (ϵ_{g}) error components. In our OSSE, we assume that the inflation factor of the prescribed variances, ${\mathit{v}}_{g}^{\mathrm{ps}}$, is γ_{g}=1.25, and that the proportion of this variance allocated to the correlated error process is ρ_{g}=0.8. We induce the correlations using the exponential covariance function described in Sect. 3 with the single length scale of the correlated component set to ${\mathrm{\ell}}_{g,\mathrm{1}}=\mathrm{1}$ min for all $g=\mathrm{1},\mathrm{\dots},{n}_{g}$. We specify this correlation structure to be the same for both LG and LN data.
We ran five different setups in WOMBAT. In four of these setups, bias is assumed or not assumed to be present, errors are assumed or not assumed to be correlated, and all model hyperparameters are estimated. The fifth setup attempts to mimic a conventional flux inversion system based on scaling factors; here data is assumed to be unbiased, the errors uncorrelated, and the hyperparameters fixed to their true values. The known true flux, generated as described above, is the same between the cases, and we evaluate the ability of WOMBAT to recover the truth under each of the setups. Table 3 gives the rootmeansquared error (RMSE) and continuous ranked probability score (CRPS, Gneiting and Raftery, 2007) when estimating monthly and regionally aggregated fluxes using these five setups. The regions on which these evaluations are based are the same TransCom3 regions that were used to construct the flux basis functions (see Sect. 4.2), and the quantities in the table are averages across all combinations of the 31 months and 22 regions. The true datagenerating process involves both bias and correlated error. Therefore, as one would expect, Table 3 shows that the WOMBAT setup that takes into account both of these features performs the best in terms of both RMSE and CRPS, while the setups that assume that neither feature is present perform the worst. For the two partially misspecified setups, the biascorrected and uncorrelated setup outperforms the notbiascorrected and correlated setup for LG data, while the opposite is true for LN data. Notably, despite the presence of systematic biases in the simulated data, the WOMBAT setup that assumes no bias, but which takes into account correlated errors, performs overwhelmingly better than the fully misspecified model that assumes no bias and uncorrelated errors, where the hyperparameters are assumed to be unknown. The performance of the setup with hyperparameters fixed to their true values is between those of the fully and the partially misspecified setups, illustrating the importance of modelling and estimating biases and correlations when these are indeed present.
Figure 3 shows the (simulated) true (black), prior mean (blue), and posterior distributions (red, purple, orange, light green, and dark green) for the total flux in the tropics (latitude 23.5^{∘} S to 23.5^{∘} N) in 2015 and 2016 from both LN and LG and split by the southern and the northern components (latitudes 23.5^{∘} S to 0^{∘} and 0 to 23.5^{∘} N, respectively). The five posterior distributions depicted in each panel correspond to the five different WOMBAT setups. The interior of the ellipses represent the 95 % credible regions. The dotted grey lines along the diagonal correspond to combinations of the southern and northern fluxes that yield the true total flux in the tropics; if the dotted line is not within the ellipse for an inversion, the total flux is misestimated. All fluxes shown in Fig. 3 are exclusive of fossil fuels which, recall, are held fixed in the inversions. Figure S2 shows the results on a global scale (land vs. ocean), while Fig. S3 shows the results on a regional scale: TransCom3 region 04 (South American Temperate) vs. TransCom3 region 06 (Southern Africa) .
Collectively, the performances of the different models, as shown in Figs. 3, S2, and S3, align with the conclusions based on the RMSE and CRPS statistics. In all the cases shown, the 95 % credible regions for the WOMBAT configuration with both bias correction and correlated error (red) contain the true value, while those for the configuration with neither feature (light green and dark green) do so rarely. The orange credible regions for the biascorrected and uncorrelated variant are always smaller than the red credible regions, indicating that the biascorrected and uncorrelated variant is overconfident. In contrast, the purple credible regions for the notbiascorrected and correlated variant are always larger, which may suggest that the correlated errors are partially compensating for the lack of bias correction in this variant.
In summary, this OSSE shows that WOMBAT can recover the true flux when the assumed model is the true model, but more importantly the OSSE also demonstrates the importance of modelling both bias and correlated errors in these flux inversions. If the bias parameters are omitted, fluxes can be estimated incorrectly, although this may be partially mitigated by modelling correlated errors. If uncorrelated errors are assumed, estimation performance suffers, and flux estimates will likely be reported with too small an uncertainty, even if the prescribed variances are allowed to be inflated when making inference. In a realdata setting, any factors thought to introduce systematic biases should be taken into account, but this OSSE also suggests that the use of correlated errors may provide some insurance against any remaining unmitigated spatiotemporal biases.
5.2 OCO2 satellite data
In this section we present results from WOMBAT applied to OCO2 satellite data under the MIP protocol (Crowell et al., 2019). The protocol mandates the use of OCO2 retrievals with the TCCONbased offline bias correction. While WOMBAT is capable of online bias correction (see Sect. 5.3), in this section we follow the MIP protocol and set the bias parameters in WOMBAT equal to zero.
5.2.1 Flux comparison on a region–time basis with the MIP
In the OCO2 MIP, nine participants submitted fluxes based on inversions satisfying the MIP protocol. Each participant reported to the MIP four sets of fluxes: their prior mean fluxes and their fluxes from three inversions based on point referenced data, OCO2 LG data, and OCO2 LN data, respectively. Crowell et al. (2019) considered the different participants' fluxes as an ensemble, reporting the ensemble mean, median, and standard deviation across a variety of temporal and spatial scales. Under the same protocol and using OCO2 LG and OCO2 LN data, we compare WOMBAT's posterior distribution over the fluxes to the corresponding results from the MIP ensemble.
Through its MCMC Bayesian computations, WOMBAT's inversions generate samples from the posterior distribution of all unknown quantities in the model, including the parameters discussed in Sect. 2.4. Section 5.2.3 discusses the inferred parameters in detail for inversions corresponding to different satellite modes. Of note are the posterior distributions of the parameters {ρ_{g}}, which are centred on values around 0.84. This is a strong indication that the majority of the process that combines model–data discrepancy and measurement error should indeed be attributed to the correlated component, ξ_{g}, given in Eq. (9). Samples from the MCMC scheme enable estimation of functionals of the posterior distribution, including posterior means and quantiles, of the flux process ${Y}_{\mathrm{1}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$. While some individual MIP participants are able to produce probabilistic uncertainty estimates for fluxes, these were not reported as part of the MIP; instead, the empirical distribution from the ensemble of fluxes was used by the MIP for uncertainty quantification. Since it is difficult to make a quantitative comparison between WOMBAT's posteriorbased uncertainties and the ensemble uncertainties in the MIP, we opt here for a visual comparison of the posterior means and standard deviations over the fluxes compared to the ensemble minimum, mean, and maximum. This comparison is done at both annual and monthly temporal scales, and across spatial scales encompassing the whole globe, global land, global ocean, and zonal bands.
Global totals. Figure 4 presents annual and monthly nonfossilfuel fluxes for the globe, land regions, and ocean regions for inversions using the LG retrievals. Fluxes are shown for the MIP, split into prior and LG inversions, and for WOMBAT they are split into prior and posterior using LG data. Thick lines show the ensemble means for the MIP, and the (prior or posterior) means for WOMBAT. Shaded areas and thin lines for the MIP denote the values between the ensemble minimum and maximum, while for WOMBAT they denote values in the 95 % fully Bayesian credible intervals. The corresponding figure for LN inversions is given in Fig. S4.
The global annual nonfossilfuel fluxes estimated by posterior means from WOMBAT are very similar for both LG and LN modes, with an overall posterior mean net sink in 2015 of 3.40 and 3.14 PgC yr^{−1} for LG and LN, respectively, and in 2016 of 4.14 and 4.27 PgC yr^{−1} for LG and LN, respectively. For 2015, these sinks are very similar to the MIP ensemble means (3.57 and 3.21 PgC yr^{−1} for LG and LN, respectively). However, for 2016, WOMBAT returns a larger posteriormean sink than the MIP ensemble means (3.82 and 3.78 PgC yr^{−1} for LG and LN, respectively), and its 95 % credible intervals do not contain the ensemble means within them. At a monthly scale, WOMBAT reproduces a key feature of the MIP fluxes, wherein the seasonal cycle in the fluxes, driven by the Northern Hemisphere growing season, begins and ends earlier than it does in the prior for both 2015 and 2016. In agreement with the MIP, WOMBAT results indicate that the largest sink in the cycle is larger than that in the prior. However, the sink estimated by WOMBAT is around 0.4 PgC per month smaller than the MIP ensemble mean for both LG and LN and for both 2015 and 2016.
Global land and ocean. For global land fluxes, shown in the second row of Figs. 4 and S4, WOMBAT's results agree with those from the MIP for both LG and LN in that a source larger than that in the prior flux is estimated for October 2015. However, while the source persists into November in the MIP ensemble mean, the WOMBAT posterior mean does not have the same persistence. For global ocean fluxes, shown in the third row of Figs. 4 and S4, the MIP LGestimated and LNestimated fluxes differ little from the prior fluxes, and we observe the same for global oceans in the WOMBAT estimates for both LG and LN modes and in most months. The exceptions are September and October in both years, where WOMBAT estimates a shallower sink, and even zero flux with the LN data. These features are not obvious in the MIP ensemble means, but they do appear reasonable within the MIP ensemble spread.
Zonal bands. Figures S5 and S6 show for LG and LN inversions, respectively, fluxes for zonal bands covering the northern extratropics (23.5 to 90^{∘} N), northern tropics (0 to 23.5^{∘} N), southern tropics (23.5^{∘} S to 0^{∘}), and southern extratropics (90 to 23.5^{∘} S). For the MIP ensemble, Crowell et al. (2019) noted that inversions using LG data led to a smaller net annual sink (averaged between 2015 and 2016) in the northern extratropics than those using LN data. WOMBAT also finds this feature, with a 95 % credible interval of the LGminusLN difference spanning 0.14–0.6 PgC yr^{−1}. This is substantially smaller than the difference between the MIP ensemble means for these modes, which is 0.7 PgC yr^{−1}. Fluxes in the southern extratropics, shown in the fourth row of Figs. S5 and S6, are dominated by ocean fluxes for which, as noted above, LG and LN data provide little information.
One of the most prominent features in the MIP inversion results is a seasonal cycle in the tropics that is larger than that in both the prior mean and the in situ inversions (Crowell et al., 2019). From the second and third rows of Figs. S5 and S6, which depict inferred tropicalzone fluxes, it can be seen that WOMBAT does not reproduce this feature for both LG and LN inversions. In the northern tropics, the WOMBAT posterior means are similar to the prior means, and the credible intervals in the annual fluxes reflect high confidence. However, results from WOMBAT do corroborate those of the MIP ensemble, in that nonfossilfuel fluxes in the northern tropics were a net source of CO_{2} in 2016.
5.2.2 TCCON comparison
To evaluate the estimated fluxes in the OCO2 MIP, each participant was asked to use the 30 min average TCCON retrievals of columnaveraged CO_{2} (see Sect. 4.4.2) as validation data, and compare them to the columnaveraged CO_{2} predicted values obtained by applying the process model to the estimated fluxes with the same CTM used for the inversion. Recall that when performing the inversions only OCO2 data were used and that the TCCON data were treated as unobserved and set aside for validation. For WOMBAT, we repeated this validation exercise by examining the prior and posterior distributions of Z_{2,g}, where each g corresponds to a different TCCON site. For each group g, we set b_{g}=0, since we assume that the TCCON retrievals provided are free of bias. On the other hand, we assumed that ξ_{g}+ϵ_{g} has variance that is group specific and that these errors are fully correlated. While this assumption is conservative, it is also reasonable, since the CTM does induce errors that are highly correlated in time at a common spatial location, as it averages all variables on a rather coarse grid when simulating atmospheric transport. We estimate the variance of these correlated errors in a group g as the average of the reported variances of each TCCON retrieval within the gth group.
In Fig. S7, we compare the time series of the TCCON retrievals with the predictions from WOMBAT under the priormean flux, the posterior distribution of flux from LG data, and the posterior distribution of flux from LN data. Several things are of note from this figure: first, the posteriormean estimates are a better match to the TCCON retrievals than the priormean estimates, which is evidence that OCO2 data do allow for improved flux estimates to be obtained. Second, discrepancies between TCCON and predicted retrievals persist for a long time, lending credence to our assumption that errors are highly temporally correlated. Third, the 95 % prediction intervals are appropriate, and largely contain the TCCON retrievals.
In Fig. 5, we reproduce an augmented version of Fig. 8 of Crowell et al. (2019), which depicts the mean and standard deviation of the differences between the TCCON retrievals and the predicted retrieval by TCCON site, MIP participant, and observation mode (LG and LN) alongside the results from WOMBAT. The improvement of the WOMBAT posterior prediction over the prior prediction is evident in the mean of the differences, and the posterior error means and standard deviations of WOMBAT are in line with those of the MIP participants. WOMBAT's predictive distributions from LGinferred fluxes can be seen to be better than those of the MIP participants, even by straightforward visual inspection. In Table 4 we compute meansquared error, by participant and observation mode, averaged over the 19 TCCON stations used in the MIP. WOMBAT outperforms all other participants when using this metric with LG data, and it is the fourth best when using this metric with LN data. While these results are not conclusive on the validity of the WOMBAT fluxes globally, they are encouraging, especially in light of the fact that our flux process has a relatively lowdimensional representation.
5.2.3 The inferred parameters
One of the key features of WOMBAT is the use of a parameter prior distribution in the hierarchical Bayesian model, which applies to both the parameters governing the flux scaling factors and to the parameters governing the model–data discrepancy and measurement error processes. Figure 6 shows the estimated posterior means and 95 % credible intervals for the autoregressive parameters κ (Fig. 6a) and the innovation precisions τ_{w} (Fig. 6b) for the 11 land regions and for inversions using LG and LN data. The inferred parameters are relatively consistent across the LG and LN modes, with the exception of TransCom3 region 02 (North American Temperate). Most regions have a posterior mean for κ_{j} that is approximately between 0.25 and 0.75, reflective of moderate autocorrelation in the scaling factors. The exception is TransCom3 region 03, for which the scaling factors are estimated to be highly autocorrelated a priori. The innovation precisions, τ_{w}, have posterior means that lie approximately between 1 and 10 for most regions.
The parameters governing the model–data discrepancy and measurement error processes are ρ_{g}, ℓ_{g,1}, and γ_{g}, for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$. Table 5 gives the posterior means, 2.5 % quantiles, and 97.5 % quantiles for these parameters. Recall that the LG and LN parameters are derived from different inversions; that is, they are not two groups in the same inversion. Nonetheless, the inferred parameters are similar between the inversions, which is reassuring. The values for γ_{g} are indicative of a 21 % variance inflation needed for both instrument modes. The length scales, ℓ_{g}, are 1.2 min for the LG data and 1.1 min for the LN data, which corresponds to around 700 to 800 km on the ground. Finally, the estimated values of ρ_{g} are around 0.835, indicating that the majority of the combined model–data discrepancy and measurement error process should indeed be attributed to the correlated component, ξ_{g}, given in Eq. (9).
5.3 Online bias correction
The OSSEbased sensitivity study in Sect. 5.1 demonstrated that WOMBAT is able to perform online bias correction with simulated data, where biases are estimated while doing flux inversion. This is different to the typical offline approach to bias correction, where retrieval biases are determined in a separate study (e.g. Wunch et al., 2011b). To comply with the MIP protocol, the online biascorrection functionality of WOMBAT was disabled in the study of Sect. 5.2, and the TCCONbased offline biascorrected OCO2 retrievals from the MIP were used. In order to investigate the prospect of online bias correction with real data, we repeat the inversions with online bias correction enabled, using OCO2 10 s average retrievals both with and without the TCCONbased offline corrections.
In Fig. 7, we show the posterior densities of the WOMBATestimated biascorrection coefficients when using the retrievals without the offline correction. The posterior densities shown there are for inversions based on LG and LN retrievals, while the TCCONbased offline biascorrection coefficients are given by the blue vertical lines. The WOMBATestimated coefficients have the same sign and similar magnitudes to the offline corrections, suggesting that WOMBAT is picking up similar bias patterns while doing flux inversion. However, with the exception of the “dp” coefficient under the LN inversion, the offline values are outside the plausible ranges estimated by WOMBAT. For “co2_grad_del”, WOMBAT favours a smaller correction for both LG and LN, while for “logDWS” a larger correction is favoured. For “dp”, WOMBAT favours a smaller correction under the LG inversion.
We repeated the online biascorrection procedure using retrievals retaining the TCCONbased offline bias correction. In this setting, if the retrievals are unbiased, bias coefficients equal to zero should be inferred. The posterior densities of the estimated coefficients under this configuration are shown in the second row of Fig. 7. As expected, the magnitudes of the onlineestimated coefficients are close to zero, although the credible intervals do not always include zero. Naïvely, one might expect that the coefficients would be approximately equal to the difference between the TCCONbased offline coefficients and the coefficients estimated by WOMBAT when using uncorrected data. For “dp” and “co2_grad_del”, the estimated coefficients indeed have the expected sign and the expected orders of magnitude. The inferred “logDWS” coefficients are surprising, however, with an opposite sign to what was expected for the LG inversion and smaller magnitudes for both the LG and LN inversions. This unexpected result is a reflection of the complex interplay and nonlinear relationships between the parameters and processes in a regularised fluxinversion model.
Overall, the onlineestimated bias correction is practically, if not statistically, similar to the TCCONbased offline correction. One possible explanation for the difference between the WOMBAT and the TCCONbased estimates is that different data are used because WOMBAT does not use TCCON data for the correction. Moreover, it is likely that the true bias coefficients are spatiotemporally varying; if this is indeed the case, the estimated biases would be affected by the spatiotemporal locations of the retrievals used to estimate them. Another reason may be that, for simplicity, we have used only a few of the most important variables that are used for offline bias correction; a consideration of all the variables may lead to slightly different results. Despite this, our results show that online bias correction is possible and that further research into it as an attractive byproduct of flux inversion is warranted.
Figure 8 gives annual and monthly fluxes estimated from the inversions using the online bias correction applied to the uncorrected retrievals. For comparison, the equivalent fluxes from Sect. 5.2.1 for the offlinecorrected data and with the online bias correction disabled are also reported. The annual and monthly fluxes are similar between the offlinecorrected and onlinecorrected inversions, with substantial overlap in the marginal posterior distributions for all time periods. This result gives further evidence that online bias correction is a viable alternative to offline correction (here based on TCCON data) when doing flux inversion.
The WOllongong Methodology for Bayesian Assimilation of Tracegases (WOMBAT) extends the standard synthesis fluxinversion framework, which does not put prior distributions on all unknowns, to a framework based on a fully Bayesian hierarchical statistical model. It incorporates physically motivated flux basis functions and follows the standard Bayesian synthesis framework by using a CTM to compute the corresponding molefraction basis functions offline. The scaling factors for the basis functions are inferred from molefraction satellite data. WOMBAT incorporates a correlatederror term, estimates measurement biases and measurement error scaling factors online, estimates variances and length scales of flux scaling factors, and uses an MCMC scheme that allows uncertainty quantification through posterior distributions on all unknowns in the model. We have illustrated the importance of modelling correlation and bias within a fluxinversion system, and we have shown that WOMBAT produces global and regional flux estimates from OCO2 data that are comparable to those from the MIP participants in Crowell et al. (2019). In particular, WOMBAT outperformed the other flux models in reproducing TCCON data when using the OCO2 LG retrievals to obtain the fluxes, and it was competitive when using fluxes obtained from the OCO2 LN retrievals. When the fossilfuel fluxes are included, we estimate a global carbon source of 6.11±0.09 PgC yr^{−1} using the LG data and 6.17±0.07 PgC yr^{−1} using the LN data. These estimates corroborate those of the MIP within uncertainty.
This paper presents the general, underlying Bayesian hierarchical framework for WOMBAT v1.0, which will serve as a baseline for our flux inversions based on current and future versions of OCO2 data. There are several potential extensions being considered; here we discuss three of the most pertinent ones. First, WOMBAT, like most other fluxinversion systems, currently operates using a single CTM. This is problematic from a statistical modelling point of view, as it does not allow one to attribute the correlated error either to the measurement error process or to the molefraction process. If more than one CTM is used, in principle one could statistically attribute at least part of the error to transport. This will not necessarily solve the problem though, since CTMs tend to share common features that induce similar correlations (e.g. due to unresolved subgrid variation). A possible way forward is to take results from offline OSSEs to estimate and fix the parameters characterising the CTM error, and then to attribute any residual observed correlation to the retrieval process.
Second, WOMBAT extends a traditional state–space approach to flux inversion, which competes with adjointbased approaches that allow for a much higher flux dimensionality. In the current version of WOMBAT, one should use the largest number of basis functions possible, given available hardware requirements, and find a compromise between the temporal and spatial resolution of the flux basis functions, such that the chosen resolution is as close as possible, or finer, than that at which the flux estimates need to be produced (in our case, this was the TransCom3bymonth level). At this chosen resolution we expect WOMBAT to perform well and give predictions that are valid within uncertainty: when one has broad spatial and temporal data coverage of the response functions, as in the case of OCO2 and a TransCom3bymonth flux resolution, Bayesian synthesis can be expected to be reasonably robust to dimensionreduction error. Further, the WOMBAT posterior distributions over the fluxes are nonGaussian and can accommodate skewness and long tails; this added flexibility mitigates the risk of underfitting. Moreover, one may introduce additional scaling factors and corresponding basis functions in small “regions of interest”, where the finescale fluxes are an inferential target, and this is something we are doing in a followup iteration of WOMBAT. However, it would be desirable to have global inversions yield valid inferences at fine spatiotemporal scales globally. Moving forward, WOMBAT will therefore seek to introduce higher dimensionality by using flux basis functions that are at a finer scale than the TransCom3bymonth spatiotemporal basis functions that we have used here or a finescale variation term in the flux process model that can be used to absorb variation in the flux that cannot be explained by the basis functions.
Finally, WOMBAT currently only considers alongtrack correlations when modelling the correlationerror term, ξ_{g}. However, the general framework we have proposed, based on a sparseprecision matrix approximation, can be extended to model full space–time correlations. Dedicated investigations will be required to assess the feasibility of the implementation and the impact it will have on flux estimates.
As mentioned in Sect. 2.5, WOMBAT makes inference on the fluxes and the other parameters in the model using a Gibbs sampler, wherein samples of subsets of the parameters are iteratively drawn from their full conditional distributions (e.g. Tierney, 1994). Recall that the target distribution is $p(\mathit{\alpha},\mathit{\beta},\mathit{\kappa},{\mathit{\tau}}_{w},{\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}\mid {\mathit{Z}}_{\mathrm{2}})$, as shown in Eq. (10).
The Gibbs sampler in WOMBAT is as follows. Given the ith sample, {α^{[i]}, β^{[i]}, κ^{[i]}, ${\mathit{\tau}}_{w}^{\left[i\right]}$, ${\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}^{\left[i\right]}\mathit{\}}$, the (i+1)th sample is generated sequentially in the following manner.

Sample α^{[i+1]} and β^{[i+1]} jointly from
$p(\mathit{\alpha},\mathit{\beta}\mid {\mathit{\kappa}}^{\left[i\right]},{\mathit{\tau}}_{w}^{\left[i\right]},{\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}^{\left[i\right]},{\mathit{Z}}_{\mathrm{2}})$. 
Sample κ^{[i+1]} from $p(\mathit{\kappa}\mid {\mathit{\tau}}_{w}^{\left[i\right]},{\mathit{\alpha}}^{[i+\mathrm{1}]})$.

Sample ${\mathit{\tau}}_{w}^{[i+\mathrm{1}]}$ from $p({\mathit{\tau}}_{w}\mid {\mathit{\kappa}}^{[i+\mathrm{1}]},{\mathit{\alpha}}^{[i+\mathrm{1}]})$.

Sample ${\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}^{[i+\mathrm{1}]}$ from $p({\mathit{\theta}}_{\mathit{\xi},\mathit{\u03f5}}\mid {\mathit{\alpha}}^{[i+\mathrm{1}]},{\mathit{\beta}}^{[i+\mathrm{1}]},{\mathit{Z}}_{\mathrm{2}})$.
In Appendices A1–A4, we give the details for Steps (1)–(4). In deriving the conditional distributions, we often make use of the Sherman–Morrison–Woodbury matrix identity and a matrixdeterminant lemma (e.g. Henderson and Searle, 1981). The former identity states that,for conformable matrices A, U, C, and V,
whenever the required inverses exist, while the latter lemma states that
A1 Sampling α and β
Let $\mathbf{B}=(\widehat{\mathbf{\Psi}},\mathbf{A})$, $\mathit{x}=({\mathit{\alpha}}^{\prime},{\mathit{\beta}}^{\prime}{)}^{\prime}$, and ${\mathbf{\Sigma}}_{x}=\mathrm{bdiag}({\mathbf{\Sigma}}_{\mathit{\alpha}},{\mathit{\sigma}}_{\mathit{\beta}}^{\mathrm{2}}\mathbf{I})$. Then
The log of this quantity is quadratic in x, and therefore the full conditional distribution of x is a multivariate Gaussian distribution:
where
and
As described in Sect. 3, Σ_{ϵ} is diagonal and under Markov assumptions Σ_{ξ} has a sparse inverse. These properties allow us to efficiently compute the required mean and covariance matrix through use of the Sherman–Morrison–Woodbury matrix identity. Once these are computed, sampling from Eq. (A1) is straightforward.
A2 Sampling κ
The full conditional distribution of κ is
where, recalling Sect. 2.4, p(κ) is a product of beta density functions. Since each κ_{j} and τ_{w,j} is associated with a spatial region, we also partition α by spatial region. That is, we define $\mathit{\alpha}\equiv \left(\right({\mathit{\alpha}}^{\mathrm{1}}{)}^{\prime},\mathrm{\dots},({\mathit{\alpha}}^{{r}_{s}}{)}^{\prime}{)}^{\prime}$, where ${\mathit{\alpha}}^{j}\in {\mathbb{R}}^{{r}_{t}}$, $j=\mathrm{1},\mathrm{\dots},{r}_{s}$. For $j=\mathrm{1},\mathrm{\dots},{r}_{s}$, the r_{t}dimensional vector α^{j} can therefore be associated with ${\mathit{\kappa}}_{j},{\mathit{\tau}}_{w,j}$, and a r_{t}×r_{t} subblock of the matrix ${\mathbf{\Sigma}}_{\mathit{\alpha}}^{\mathrm{1}}$, which we denote as ${\mathbf{\Sigma}}_{\mathit{\alpha},j}^{\mathrm{1}}$. Under the autoregressive model in Sect. 2.4, ${\mathbf{\Sigma}}_{\mathit{\alpha},j}^{\mathrm{1}}={\mathit{\tau}}_{w,j}{\mathbf{Q}}_{\mathit{\alpha},j}$, where
Since the flux scaling factors in each spatial region are treated as independent a priori, Eq. (A2) may be written as
where
To generate samples from Eq. (A2), we therefore successively sample κ_{j}, $j=\mathrm{1},\mathrm{\dots},{r}_{s}$ from its full conditional distribution (Eq. A3). Equation (A3) does not correspond to any standard distribution, and thus we use slice sampling (Neal, 2003) to sample from it.
A3 Sampling τ_{w}
Similar to κ, the conditional distribution of τ_{w} factorises across spatial regions, and it is therefore given by
where
The density function in Eq. (A4) is a gamma density function,
where ${\mathit{\nu}}_{w,j}^{c}={\mathit{\nu}}_{w,j}+\frac{\mathrm{1}}{\mathrm{2}}{r}_{t}$ and ${\mathit{\omega}}_{w,j}^{c}={\mathit{\omega}}_{w,j}+\frac{\mathrm{1}}{\mathrm{2}}({\mathit{\alpha}}^{j}{)}^{\prime}{\mathbf{Q}}_{\mathit{\alpha},j}{\mathit{\alpha}}^{j}$. Therefore, we sample from the full conditional distribution of τ_{w} by successively sampling τ_{w,j}, for $j=\mathrm{1},\mathrm{\dots},{r}_{s},$ directly from Eq. (A5).
A4 Sampling θ_{ξ,ϵ}
Since we assume that the parameters governing the correlated and uncorrelated error terms are datagroup specific, the full conditional distribution of θ_{ξ,ϵ} is
where $p({\mathit{\theta}}_{{\mathit{\xi}}_{g}},{\mathit{\gamma}}_{g}\mid \mathit{\alpha},\mathit{\beta},{\mathit{Z}}_{\mathrm{2},g})\propto p({\mathit{Z}}_{\mathrm{2},g}\mid \mathit{\alpha},\mathit{\beta},{\mathit{\theta}}_{{\mathit{\xi}}_{g}},{\mathit{\gamma}}_{g})p({\mathit{\theta}}_{{\mathit{\xi}}_{g}},{\mathit{\gamma}}_{g})$; $p({\mathit{\theta}}_{{\mathit{\xi}}_{g}},{\mathit{\gamma}}_{g})$ is the joint prior over ${\mathit{\theta}}_{{\mathit{\xi}}_{g}}$ and γ_{g}, and
The matrix operations in Eq. (A7) may be computed efficiently using the matrix identities given at the start of this appendix. Since Eq. (A7) does not correspond to any standard distribution, we use slice sampling to sample from it for $g=\mathrm{1},\mathrm{\dots},{n}_{g}$.
The retrieval algorithm used for OCO2 takes spectral data as input and returns CO_{2} mole fractions on 20 vertical levels as output via an optimisation procedure. The CO_{2} mole fractions at these 20 vertical levels are subsequently used to compute a columnaveraged CO_{2} that we associate with a single retrieval. For the ith retrieval, we denote the vector of retrieved mole fractions as ${\mathit{Z}}_{\mathrm{2},i}^{r}$. Following the argument given by Rodgers and Connor (2003) and Connor et al. (2008), the retrieved mole fractions are given by
where ${\mathit{Y}}_{\mathrm{2},i}^{\mathrm{0},r}=({Y}_{\mathrm{2},i,\mathrm{1}}^{\mathrm{0},r},\mathrm{\dots},{Y}_{\mathrm{2},i,\mathrm{20}}^{\mathrm{0},r}{)}^{\prime}$ is the vector of priormean mole fractions used by the retrieval algorithm, specific to the ith retrieval (these are unrelated to the prior mean of the molefraction field used in our inversion, ${Y}_{\mathrm{2}}^{\mathrm{0}}(\cdot \phantom{\rule{0.125em}{0ex}},\cdot \phantom{\rule{0.125em}{0ex}},\cdot )$); A_{i} is the “averaging kernel matrix”; ${\mathit{Y}}_{\mathrm{2},i}\equiv \left({Y}_{\mathrm{2}}\right({\mathit{s}}_{i},{h}_{i,k},{t}_{i}){)}_{k=\mathrm{1}}^{\mathrm{20}}$ is the true mole fraction at the 20 vertical levels for the retrieval at geopotential heights h_{i,k}, $k=\mathrm{1},\mathrm{\dots},\mathrm{20}$; and ${\mathit{\epsilon}}_{i}^{r}$ is a vector of measurement errors (which may also include systematic biases and errors induced by nonlinearity in the inversion process). The columnaveraged retrieval is then
where ${\mathit{c}}_{i}\equiv ({c}_{i,\mathrm{1}},\mathrm{\dots},{c}_{i,\mathrm{20}}{)}^{\prime}$ are quadrature weights used to estimate the column average, and ${Y}_{\mathrm{2},i}^{\mathrm{0},rc}\equiv {\mathit{c}}_{i}^{\prime}{\mathit{Y}}_{\mathrm{2},i}^{\mathrm{0},r}$ is the retrieval prior mean columnaveraged CO_{2}. Define ${\mathit{a}}_{i}\equiv ({a}_{i,\mathrm{1}},\mathrm{\dots},{a}_{i,\mathrm{20}}{)}^{\prime}$, where
and $({\mathit{c}}_{i}^{\prime}{\mathbf{A}}_{i}{)}_{k}$ denotes the kth element of ${\mathit{c}}_{i}^{\prime}{\mathbf{A}}_{i}$. The vector a_{i} is the “averaging kernel vector” of the ith retrieval, which is available in the official releases of the OCO2 data. It follows that the observation operator in Eq. (7) is defined as follows:
Note that we do not explicitly account for the error term ${\mathit{c}}_{i}^{\prime}{\mathit{\epsilon}}_{i}^{r}$ in our definition for ${\widehat{\mathcal{A}}}_{i}$. This is because it will be absorbed by the error terms we use in the data model (Eq. 8).
The averagingkernel vector elements {a_{i,k}} are ideally close in value to 1. They reflect the fact that the retrieval is not equally sensitive to the mole fractions at all the vertical levels. At levels where there is less sensitivity (i.e. values <1), the retrieval priormean mole fraction will be assigned greater weight when producing the columnaverage CO_{2} estimate (Rodgers, 2000).
The code associated with this article is available at https://doi.org/10.5281/zenodo.4886771 (Bertolacci et al., 2021a). This code repository includes scripts for reproducing the entire analysis in this paper, which can be applied to a variety of inverse modelling problems. Rerunning the entire analysis is computationally expensive due to the need to simulate atmospheric transport under various perturbations. To help with this, we provide these outputs as a separate download at https://doi.org/10.5281/zenodo.4887043 (Bertolacci et al., 2021b). These allow the inversions to be done and for results to be generated without the need to run the atmospheric transport model. Please see README.md in the code repository for more details.
The supplement related to this article is available online at: https://doi.org/10.5194/gmd15452022supplement.
AZM, MB, and NC designed and wrote the study. JF, AS, MR, and YC contributed to the design and implementation of the GEOSChem model simulations. MB and AZM designed and implemented the MCMC algorithm and conducted the analyses.
The contact author has declared that neither they nor their coauthors have any competing interests.
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This project was largely supported by the Australian Research Council (ARC) Discovery Project (DP) DP190100180. Andrew ZammitMangion's research was also supported by an ARC Discovery Early Career Research Award (DECRA) DE180100203. Noel Cressie's research was also supported by ARC DP150104576. Noel Cressie's and Andrew ZammitMangion's research was also supported by NASA ROSES grant 17OCO2170012. Jenny Fisher was supported by ARC DP160101598. Ann Stavert and Matthew Rigby were supported by the UK Natural Environment Research Council (grant no. NE/K002236/1). The TCCON data were obtained from the TCCON Data Archive hosted by CaltechDATA at https://tccondata.org/ (last access: 16 November 2021). The authors would also like to thank David Baker for generating the OCO2 10 s average retrievals and Beata Bukosa, Nicholas Deutscher, Anita Ganesan, Peter Rayner, Andrew Schuh, and members of the OCO2 Flux Team for invaluable feedback on this research. This work was supported by resources provided by the Pawsey Supercomputing Centre with funding from the Australian Government and the Government of Western Australia. The work was also supported by the NCI National Facility systems at the Australian National University through the National Computational Merit Allocation Scheme supported by the Australian Government.
This research has been supported by the Australian Research Council (grant nos. DP190100180, DE180100203, DP160101598, and DP150104576), the National Aeronautics and Space Administration (grant no. 17OCO2170012), the Natural Environment Research Council (grant no. NE/K002236/1), and the Australian Government and the Government of Western Australia (National Computational Merit Allocation Scheme project code m19 and resources provided by the Pawsey Supercomputing Centre).
This paper was edited by Sergey Gromov and reviewed by Scot Miller and one anonymous referee.
Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: LargeScale Machine Learning on Heterogeneous Distributed Systems, arXiv [preprint], 1603.04467, 16 March 2016. a
Baker, D. F., Doney, S. C., and Schimel, D. S.: Variational data assimilation for atmospheric CO_{2}, Tellus B, 58, 359–365, https://doi.org/10.1111/j.16000889.2006.00218.x, 2006. a
Basu, S., Guerlet, S., Butz, A., Houweling, S., Hasekamp, O., Aben, I., Krummel, P., Steele, P., Langenfelds, R., Torn, M., Biraud, S., Stephens, B., Andrews, A., and Worthy, D.: Global CO_{2} fluxes estimated from GOSAT retrievals of total column CO_{2}, Atmos. Chem. Phys., 13, 8695–8717, https://doi.org/10.5194/acp1386952013, 2013. a, b, c, d, e, f
Basu, S., Baker, D. F., Chevallier, F., Patra, P. K., Liu, J., and Miller, J. B.: The impact of transport model differences on CO_{2} surface flux estimates from OCO2 retrievals of column average CO_{2}, Atmos. Chem. Phys., 18, 7189–7215, https://doi.org/10.5194/acp1871892018, 2018. a, b, c
Bertolacci, M., ZammitMangion, A., Cao, Y., Fisher, J., and Stavert, A.: WOMBAT: A fully Bayesian global fluxinversion framework, version 1, Zenodo [code], https://doi.org/10.5281/zenodo.4886771, 2021a. a
Bertolacci, M., Fisher, J., Cao, Y., Stavert, A., and Rigby, M.: WOMBAT: A fully Bayesian global fluxinversion framework, version 1 (intermediate files), Zenodo [data set], https://doi.org/10.5281/zenodo.4887043, 2021b. a
Bey, I., Jacob, D. J., Yantosca, R. M., Logan, J. A., Field, B. D., Fiore, A. M., Li, Q., Liu, H. Y., Mickley, L. J., and Schultz, M. G.: Global Modeling of Tropospheric Chemistry with Assimilated Meteorology: Model Description and Evaluation, J. Geophys. Res., 106, 23073–23095, https://doi.org/10.1029/2001JD000807, 2001. a
Blumenstock, T., Hase, F., Schneider, M., García, O. E., and Sepúlveda, E.: TCCON data from Izaña (ES), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.izana01.R0/1149295, 2014. a
Brynjarsdóttir, J. and O'Hagan, A.: Learning about physical parameters: the importance of model discrepancy, Inverse Problems, 30, 114007, https://doi.org/10.1088/02665611/30/11/114007, 2014. a
Bukosa, B., Deutscher, N. M., Fisher, J. A., Kubistin, D., PatonWalsh, C., and Griffith, D. W. T.: Simultaneous shipborne measurements of CO_{2}, CH_{4} and CO and their application to improving greenhousegas flux estimates in Australia, Atmos. Chem. Phys., 19, 7055–7072, https://doi.org/10.5194/acp1970552019, 2019. a, b
Burrows, J. P., Hölzle, E., Goede, A. P. H., Visser, H., and Fricke, W.: SCIAMACHY–Scanning Imaging Absorption Spectrometer for Atmospheric Chartography, Acta Astronaut., 35, 445–451, https://doi.org/10.1016/00945765(94)00278T, 1995. a
Chevallier, F.: Impact of Correlated Observation Errors on Inverted CO_{2} Surface Fluxes from OCO Measurements, Geophys. Res. Lett., 34, L24804, https://doi.org/10.1029/2007GL030463, 2007. a, b, c
Chevallier, F., Fisher, M., Peylin, P., Serrar, S., Bousquet, P., Bréon, F.M., Chédin, A., and Ciais, P.: Inferring CO_{2} sources and sinks from satellite observations: Method and application to TOVS data, J. Geophys. Res., 110, D24309, https://doi.org/10.1029/2005JD006390, 2005. a, b
Chevallier, F., Bréon, F.M., and Rayner, P. J.: Contribution of the Orbiting Carbon Observatory to the estimation of CO_{2} sources and sinks: Theoretical study in a variational data assimilation framework, J. Geophys. Res., 112, D09307, https://doi.org/10.1029/2006JD007375, 2007. a, b
Ciais, P., Rayner, P., Chevallier, F., Bousquet, P., Logan, M., Peylin, P., and Ramonet, M.: Atmospheric inversions for estimating CO_{2} fluxes: methods and perspectives, Climatic Change, 103, 69–92, https://doi.org/10.1007/s1058401099093, 2010. a, b
Connor, B. J., Boesch, H., Toon, G., Sen, B., Miller, C., and Crisp, D.: Orbiting Carbon Observatory: Inverse method and prospective error analysis, J. Geophys. Res., 113, D05305, https://doi.org/10.1029/2006JD008336, 2008. a
Crowell, S., Baker, D., Schuh, A., Basu, S., Jacobson, A. R., Chevallier, F., Liu, J., Deng, F., Feng, L., McKain, K., Chatterjee, A., Miller, J. B., Stephens, B. B., Eldering, A., Crisp, D., Schimel, D., Nassar, R., O'Dell, C. W., Oda, T., Sweeney, C., Palmer, P. I., and Jones, D. B. A.: The 2015–2016 carbon cycle as seen from OCO2 and the global in situ network, Atmos. Chem. Phys., 19, 9797–9831, https://doi.org/10.5194/acp1997972019, 2019. a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s
Dahlén, U., Lindström, J., and Scholze, M.: Spatiotemporal Reconstructions of Global CO_{2}Fluxes Using Gaussian Markov Random Fields, Environmetrics, 31, e2610, https://doi.org/10.1002/env.2610, 2020. a
Darmenov, A. and da Silva, A.: The Quick Fire Emissions Dataset (QFED): Documentation of versions 2.1, 2.2 and 2.4, NASA Technical Report Series on Global Modeling and Data Assimilation, Tech. Rep. NASA/TM2015104606, available at: http://gmao.gsfc.nasa.gov/pubs/docs/Darmenov796.pdf (last access: 16 November 2021), 2015. a, b
Datta, A., Banerjee, S., Finley, A. O., and Gelfand, A. E.: Hierarchical nearestneighbor Gaussian process models for large geostatistical datasets, J. Am. Stat. Assoc., 111, 800–812, https://doi.org/10.1080/01621459.2015.1044091, 2016. a
De Mazière, M., Sha, M. K., Desmet, F., Hermans, C., Scolas, F., Kumps, N., Metzger, J.M., Duflot, V., and Cammas, J.P.: TCCON data from Réunion Island (RE), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.reunion01.R0/1149288, 2014. a
Deng, F., Jones, D. B., O'Dell, C. W., Nassar, R., and Parazoo, N. C.: Combining GOSAT XCO_{2} observations over land and ocean to improve regional CO_{2} flux estimates, J. Geophys. Res.Atmos., 121, 1896–1913, https://doi.org/10.1002/2015JD024157, 2016. a
Deutscher, N. M., Notholt, J., Messerschmidt, J., Weinzierl, C., Warneke, T., Petri, C., and Grupe, P.: TCCON data from Bialystok (PL), Release GGG2014.R1, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.bialystok01.R1/1183984, 2015. a
Dlugokencky, E. and Tans, P.: Trends in atmospheric carbon dioxide, National Oceanic & Atmospheric Administration, Earth System Research Laboratory (NOAA/ESRL), available at: http://www.esrl.noaa.gov/gmd/ccgg/trends, last access: 11 December 2020. a
Dubey, M. K., Henderson, B. G., Green, D., Butterfield, Z. T., KeppelAleks, G., Allen, N. T., Blavier, J.F., Roehl, C. M., Wunch, D., and Lindenmaier, R.: TCCON data from Manaus (BR), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.manaus01.R0/1149274, 2014. a
Edenhofer, O., R., PichsMadruga, Y., Sokona, E., Farahani, S., Kadner, K., Seyboth, A., Adler, I., Baum, S., Brunner, P., Eickemeier, B., Kriemann, J., Savolainen, S., Schlömer, C., von Stechow, T., Zwickel, and Minx, J.: IPCC, 2014: Summary for Policymakers, in: Climate Change 2014: Mitigation of Climate Change. Contribution of Working Group III to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, Cambridge University Press, Cambridge, UK, 2014. a
Eldering, A., O'Dell, C. W., Wennberg, P. O., Crisp, D., Gunson, M. R., Viatte, C., Avis, C., Braverman, A., Castano, R., Chang, A., Chapsky, L., Cheng, C., Connor, B., Dang, L., Doran, G., Fisher, B., Frankenberg, C., Fu, D., Granat, R., Hobbs, J., Lee, R. A. M., Mandrake, L., McDuffie, J., Miller, C. E., Myers, V., Natraj, V., O'Brien, D., Osterman, G. B., Oyafuso, F., Payne, V. H., Pollock, H. R., Polonsky, I., Roehl, C. M., Rosenberg, R., Schwandner, F., Smyth, M., Tang, V., Taylor, T. E., To, C., Wunch, D., and Yoshimizu, J.: The Orbiting Carbon Observatory2: first 18 months of science data products, Atmos. Meas. Tech., 10, 549–563, https://doi.org/10.5194/amt105492017, 2017. a, b
Eldering, A., Taylor, T. E., O'Dell, C. W., and Pavlick, R.: The OCO3 mission: measurement objectives and expected performance based on 1 year of simulated data, Atmos. Meas. Tech., 12, 2341–2370, https://doi.org/10.5194/amt1223412019, 2019. a
Engelen, R. J., Denning, A. S., and Gurney, K. R.: On error estimation in atmospheric CO_{2} inversions, J. Geophys. Res., 107, 4635, https://doi.org/10.1029/2002JD002195, 2002. a
Enting, I. G.: Inverse Problems in Atmospheric Constituent Transport, Cambridge University Press, Cambridge, UK, https://doi.org/10.1017/CBO9780511535741, 2002. a, b, c, d, e, f
Fan, S., Gloor, M., Mahlman, J., Pacala, S., Sarmiento, J., Takahashi, T., and Tans, P.: A large terrestrial carbon sink in North America implied by atmospheric and oceanic carbon dioxide data and models, Science, 282, 442–446, 1998. a
Feist, D. G., Arnold, S. G., John, N., and Geibel, M. C.: TCCON data from Ascension Island (SH), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.ascension01.r0/1149285, 2014. a
Feng, L., Palmer, P. I., Bösch, H., and Dance, S.: Estimating surface CO_{2} fluxes from spaceborne CO_{2} dry air mole fraction observations using an ensemble Kalman Filter, Atmos. Chem. Phys., 9, 2619–2633, https://doi.org/10.5194/acp926192009, 2009. a
Feng, S., Lauvaux, T., Keller, K., Davis, K. J., Rayner, P., Oda, T., and Gurney, K. R.: A road map for improving the treatment of uncertainties in highresolution regional carbon flux inverse estimates, Geophys. Res. Lett., 46, 13461–13469, https://doi.org/10.1029/2019GL082987, 2019. a
Ganesan, A. L., Rigby, M., ZammitMangion, A., Manning, A. J., Prinn, R. G., Fraser, P. J., Harth, C. M., Kim, K.R., Krummel, P. B., Li, S., Mühle, J., O'Doherty, S. J., Park, S., Salameh, P. K., Steele, L. P., and Weiss, R. F.: Characterization of uncertainties in atmospheric trace gas inversions using hierarchical Bayesian methods, Atmos. Chem. Phys., 14, 3855–3864, https://doi.org/10.5194/acp1438552014, 2014. a, b, c
Giglio, L., Randerson, J. T., and van der Werf, G. R.: Analysis of daily, monthly, and annual burned area using the fourthgeneration global fire emissions database (GFED4), J. Geophys. Res.Biogeo., 118, 317–328, https://doi.org/10.1002/jgrg.20042, 2013. a, b
Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules, Prediction, and Estimation, J. Am. Stat. Assoc., 102, 359–378, https://doi.org/10.1198/016214506000001437, 2007. a
Griffith, D. W., Deutscher, N. M., Velazco, V. A., Wennberg, P. O., Yavin, Y., KeppelAleks, G., Washenfelder, R. A., Toon, G. C., Blavier, J.F., PatonWalsh, C., Jones, N. B., Kettlewell, G. C., Connor, B. J., Macatangay, R. C., Roehl, C., Ryczek, M., Glowacki, J., Culgan, T., and Bryant, G. W.: TCCON data from Darwin (AU), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.darwin01.R0/1149290, 2014a. a
Griffith, D. W., Velazco, V. A., Deutscher, N. M., PatonWalsh, C., Jones, N. B., Wilson, S. R., Macatangay, R. C., Kettlewell, G. C., Buchholz, R. R., and Riggenbach, M. O.: TCCON data from Wollongong (AU), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.wollongong01.R0/1149291, 2014b. a
Gurney, K. R., Law, R. M., Denning, A. S., Rayner, P. J., Baker, D., Bousquet, P., Bruhwiler, L., Chen, Y.H., Ciais, P., Fan, S., Fung, I. Y., Gloor, M., Heimann, M., Higuchi, K., John, J., Maki, T., Maksyutov, S., Masarie, K., Peylin, P., Prather, M., Pak, B. C., Randerson, J., Sarmiento, J., Taguchi, S., Takahashi, T., and Yuen, C.W.: Towards robust regional estimates of CO_{2} sources and sinks using atmospheric transport models, Nature, 415, 626–630, https://doi.org/10.1038/415626a, 2002. a, b
Hase, F., Blumenstock, T., Dohe, S., Groß, J., and Kiel, M.: TCCON data from Karlsruhe (DE), Release GGG2014.R1, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.karlsruhe01.R1/1182416, 2015. a
Henderson, H. V. and Searle, S. R.: On deriving the inverse of a sum of matrices, Siam Rev., 23, 53–60, 1981. a, b
Houweling, S., Aben, I., Breon, F.M., Chevallier, F., Deutscher, N., Engelen, R., Gerbig, C., Griffith, D., Hungershoefer, K., Macatangay, R., Marshall, J., Notholt, J., Peters, W., and Serrar, S.: The importance of transport model uncertainties for the estimation of CO_{2} sources and sinks using satellite measurements, Atmos. Chem. Phys., 10, 9981–9992, https://doi.org/10.5194/acp1099812010, 2010. a
Huntzinger, D., Michalak, A., Schwalm, C., Ciais, P., King, A., Fang, Y., Schaefer, K., Wei, Y., Cook, R., Fisher, J., Hayes, D., Huang, M., Ito, A., Jain, A., Lei, H., Lu, C., Maignan, F., Mao, J., Parazoo, N., Peng, S., Poulter, B., Ricciuto, D., Shi, X., Tian, H., Wang, W., Zeng, N., and Zhao, F.: Uncertainty in the response of terrestrial carbon sink to environmental drivers undermines carbonclimate feedback predictions, Scientific Reports, 7, 1–8, 2017. a
Iraci, L. T., Podolske, J. R., Hillyard, P. W., Roehl, C., Wennberg, P. O., Blavier, J.F., Landeros, J., Allen, N., Wunch, D., Zavaleta, J., Quigley, E., Osterman, G. B., Albertson, R., Dunwoody, K., and Boyden, H.: TCCON data from Edwards (US), Release GGG2014.R1, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.edwards01.r1/1255068, 2016. a
Jacobson, A. R., Schuldt, K. N., Miller, J. B., Oda, T., Tans, P., Arlyn Andrews, Mund, J., Ott, L., Collatz, G. J., Aalto, T., Afshar, S., Aikin, K., Aoki, S., Apadula, F., Baier, B., Bergamaschi, P., Beyersdorf, A., Biraud, S. C., Bollenbacher, A., Bowling, D., Brailsford, G., Abshire, J. B., Chen, G., Chen, H., Chmura, L., Climadat, S., Colomb, A., Conil, S., Cox, A., Cristofanelli, P., Cuevas, E., Curcoll, R., Sloop, C. D., Davis, K., Wekker, S. D., Delmotte, M., DiGangi, J. P., Dlugokencky, E., Ehleringer, J., Elkins, J. W., Emmenegger, L., Fischer, M. L., Forster, G., Frumau, A., Galkowski, M., Gatti, L. V., Gloor, E., Griffis, T., Hammer, S., Haszpra, L., Hatakka, J., Heliasz, M., Hensen, A., Hermanssen, O., Hintsa, E., Holst, J., Jaffe, D., Karion, A., Kawa, S. R., Keeling, R., Keronen, P., Kolari, P., Kominkova, K., Kort, E., Krummel, P., Kubistin, D., Labuschagne, C., Langenfelds, R., Laurent, O., Laurila, T., Lauvaux, T., Law, B., Lee, J., Lehner, I., Leuenberger, M., Levin, I., Levula, J., Lin, J., Lindauer, M., Loh, Z., Lopez, M., Myhre, C. L., Machida, T., Mammarella, I., Manca, G., Manning, A., Manning, A., Marek, M. V., Marklund, P., Martin, M. Y., Matsueda, H., McKain, K., Meijer, H., Meinhardt, F., Miles, N., Miller, C. E., Mölder, M., Montzka, S., Moore, F., Morgui, J.A., Morimoto, S., Munger, B., Necki, J., Newman, S., Nichol, S., Niwa, Y., O'Doherty, S., OttossonLöfvenius, M., Paplawsky, B., Peischl, J., Peltola, O., Pichon, J.M., Piper, S., PlassDölmer, C., Ramonet, M., ReyesSanchez, E., Richardson, S., Riris, H., Ryerson, T., Saito, K., Sargent, M., Sasakawa, M., Sawa, Y., Say, D., Scheeren, B., Schmidt, M., Schmidt, A., Schumacher, M., Shepson, P., Shook, M., Stanley, K., Steinbacher, M., Stephens, B., Sweeney, C., Thoning, K., Torn, M., Turnbull, J., Tørseth, K., Bulk, P. V. D., van der LaanLuijkx, I. T., Dinther, D. V., Vermeulen, A., Viner, B., Vitkova, G., Walker, S., Weyrauch, D., Wofsy, S., Worthy, D., Young, D., and Zimnoch, M.: CarbonTracker CT2019, Model, NOAA Earth System Research Laboratory, Global Monitoring Division, https://doi.org/10.25925/39m36069, available at: https://www.esrl.noaa.gov/gmd/ccgg/carbontracker/CT2019_doc.php (last access: 16 November 2021), 2020. a, b, c
Kaminski, T., Rayner, P. J., Heimann, M., and Enting, I. G.: On aggregation errors in atmospheric transport inversions, J. Geophys. Res., 106, 4703–4715, https://doi.org/10.1029/2000JD900581, 2001. a
Kawakami, S., Ohyama, H., Arai, K., Okumura, H., Taura, C., Fukamachi, T., and Sakashita, M.: TCCON data from Saga (JP), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.saga01.R0/1149283, 2014. a
Keller, C. A., Long, M. S., Yantosca, R. M., Da Silva, A. M., Pawson, S., and Jacob, D. J.: HEMCO v1.0: a versatile, ESMFcompliant component for calculating emissions in atmospheric models, Geosci. Model Dev., 7, 1409–1417, https://doi.org/10.5194/gmd714092014, 2014. a
Kivi, R., Heikkinen, P., and Kyrö, E.: TCCON data from Sodankylä (FI), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.sodankyla01.R0/1149280, 2014. a
Kuze, A., Suto, H., Nakajima, M., and Hamazaki, T.: Thermal and near Infrared Sensor for Carbon Observation FourierTransform Spectrometer on the Greenhouse Gases Observing Satellite for Greenhouse Gases Monitoring, Appl. Optics, 48, 6716–6733, https://doi.org/10.1364/AO.48.006716, 2009. a
Lauvaux, T., DíazIsaac, L. I., Bocquet, M., and Bousserez, N.: Diagnosing spatial error structures in CO_{2} mole fractions and XCO_{2} column mole fractions from atmospheric transport, Atmos. Chem. Phys., 19, 12007–12024, https://doi.org/10.5194/acp19120072019, 2019. a, b
Liu, J., Bowman, K. W., Lee, M., Henze, D. K., Bousserez, N., Brix, H., James Collatz, G., Menemenlis, D., Ott, L., Pawson, S., Jones, D., and Nassar, R.: Carbon monitoring system flux estimation and attribution: impact of ACOSGOSAT X${}_{{\text{CO}}_{\text{2}}}$ sampling on the inference of terrestrial biospheric sources and sinks, Tellus B, 66, 22486, https://doi.org/10.3402/tellusb.v66.22486, 2014. a
Masarie, K. A., Peters, W., Jacobson, A. R., and Tans, P. P.: ObsPack: a framework for the preparation, delivery, and attribution of atmospheric greenhouse gas measurements, Earth Syst. Sci. Data, 6, 375–384, https://doi.org/10.5194/essd63752014, 2014. a
McNorton, J. R., Bousserez, N., AgustíPanareda, A., Balsamo, G., Choulga, M., Dawson, A., Engelen, R., Kipling, Z., and Lang, S.: Representing model uncertainty for global atmospheric CO_{2} flux inversions using ECMWFIFS46R1, Geosci. Model Dev., 13, 2297–2313, https://doi.org/10.5194/gmd1322972020, 2020. a, b
Michalak, A. M., Bruhwiler, L., and Tans, P. P.: A geostatistical approach to surface flux estimation of atmospheric trace gases, J. Geophys. Res., 109, D14109, https://doi.org/10.1029/2003JD004422, 2004. a, b
Michalak, A. M., Hirsch, A., Bruhwiler, L., Gurney, K. R., Peters, W., and Tans, P. P.: Maximum likelihood estimation of covariance parameters for Bayesian atmospheric trace gas surface flux inversions, J. Geophys. Res., 110, D24107, https://doi.org/10.1029/2005JD005970, 2005. a, b
Miller, S. M., Michalak, A. M., and Levi, P. J.: Atmospheric inverse modeling with known physical bounds: an example from trace gas emissions, Geosci. Model Dev., 7, 303–315, https://doi.org/10.5194/gmd73032014, 2014. a
Miller, S. M., Saibaba, A. K., Trudeau, M. E., Mountain, M. E., and Andrews, A. E.: Geostatistical inverse modeling with very large datasets: an example from the Orbiting Carbon Observatory 2 (OCO2) satellite, Geosci. Model Dev., 13, 1771–1785, https://doi.org/10.5194/gmd1317712020, 2020. a
Morino, I., Matsuzaki, T., and Horikawa, M.: TCCON data from Tsukuba (JP), 125HR, Release GGG2014.R1, Version GGG2014.R1, CaltechDATA [Data set], https://doi.org/10.14291/tccon.ggg2014.tsukuba02.R1/1241486, 2016. a
Mukherjee, C., Kasibhatla, P. S., and West, M.: Bayesian statistical modeling of spatially correlated error structure in atmospheric tracer inverse analysis, Atmos. Chem. Phys., 11, 5365–5382, https://doi.org/10.5194/acp1153652011, 2011. a, b, c
Nassar, R., Jones, D. B. A., Suntharalingam, P., Chen, J. M., Andres, R. J., Wecht, K. J., Yantosca, R. M., Kulawik, S. S., Bowman, K. W., Worden, J. R., Machida, T., and Matsueda, H.: Modeling global atmospheric CO_{2} with improved emission inventories and CO_{2} production from the oxidation of other carbon species, Geosci. Model Dev., 3, 689716, https://doi.org/10.5194/gmd36892010, 2010. a
Nassar, R., NapierLinton, L., Gurney, K. R., Andres, R. J., Oda, T., Vogel, F. R., and Deng, F.: Improving the temporal and spatial distribution of CO_{2} emissions from global fossil fuel emission data sets, J. Geophys. Res.Atmos., 118, 917–933, https://doi.org/10.1029/2012JD018196, 2013. a, b
Neal, R. M.: Slice sampling, Ann. Stat., 31, 705–741, https://doi.org/10.1214/aos/1056562461, 2003. a, b
Notholt, J., Petri, C., Warneke, T., Deutscher, N. M., Palm, M., Buschmann, M., Weinzierl, C., Macatangay, R. C., and Grupe, P.: TCCON data from Bremen (DE), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.bremen01.R0/1149275, 2014. a
Oda, T. and Maksyutov, S.: A very highresolution (1 km × 1 km) global fossil fuel CO_{2} emission inventory derived using a point source database and satellite observations of nighttime lights, Atmos. Chem. Phys., 11, 543–556, https://doi.org/10.5194/acp115432011, 2011. a, b
Oda, T., Maksyutov, S., and Andres, R. J.: The Opensource Data Inventory for Anthropogenic CO2, version 2016 (ODIAC2016): a global monthly fossil fuel CO_{2} gridded emissions data product for tracer transport simulations and surface flux inversions, Earth Syst. Sci. Data, 10, 87–107, https://doi.org/10.5194/essd10872018, 2018. a, b
O'Dell, C. W., Connor, B., Bösch, H., O'Brien, D., Frankenberg, C., Castano, R., Christi, M., Eldering, D., Fisher, B., Gunson, M., McDuffie, J., Miller, C. E., Natraj, V., Oyafuso, F., Polonsky, I., Smyth, M., Taylor, T., Toon, G. C., Wennberg, P. O., and Wunch, D.: The ACOS CO_{2} retrieval algorithm – Part 1: Description and validation against synthetic observations, Atmos. Meas. Tech., 5, 99–121, https://doi.org/10.5194/amt5992012, 2012. a, b
O'Dell, C. W., Eldering, A., Wennberg, P. O., Crisp, D., Gunson, M. R., Fisher, B., Frankenberg, C., Kiel, M., Lindqvist, H., Mandrake, L., Merrelli, A., Natraj, V., Nelson, R. R., Osterman, G. B., Payne, V. H., Taylor, T. E., Wunch, D., Drouin, B. J., Oyafuso, F., Chang, A., McDuffie, J., Smyth, M., Baker, D. F., Basu, S., Chevallier, F., Crowell, S. M. R., Feng, L., Palmer, P. I., Dubey, M., García, O. E., Griffith, D. W. T., Hase, F., Iraci, L. T., Kivi, R., Morino, I., Notholt, J., Ohyama, H., Petri, C., Roehl, C. M., Sha, M. K., Strong, K., Sussmann, R., Te, Y., Uchino, O., and Velazco, V. A.: Improved retrievals of carbon dioxide from Orbiting Carbon Observatory2 with the version 8 ACOS algorithm, Atmos. Meas. Tech., 11, 6539–6576, https://doi.org/10.5194/amt1165392018, 2018. a
Peters, G. P., Andrew, R. M., Boden, T., Canadell, J. G., Ciais, P., Le Quéré, C., Marland, G., Raupach, M. R., and Wilson, C.: The challenge to keep global warming below 2 ^{∘}C, Nat. Clim. Change, 3, 4–6, https://doi.org/10.1038/nclimate1783, 2013. a
Peters, W., Miller, J., Whitaker, J., Denning, A., Hirsch, A., Krol, M., Zupanski, D., Bruhwiler, L., and Tans, P.: An ensemble data assimilation system to estimate CO_{2} surface fluxes from atmospheric trace gas observations, J. Geophys. Res., 110, D24304, https://doi.org/10.1029/2005JD006157, 2005. a, b
Philip, S., Johnson, M. S., Potter, C., Genovesse, V., Baker, D. F., Haynes, K. D., Henze, D. K., Liu, J., and Poulter, B.: Prior biosphere model impact on global terrestrial CO_{2} fluxes estimated from OCO2 retrievals, Atmos. Chem. Phys., 19, 13267–13287, https://doi.org/10.5194/acp19132672019, 2019. a
Potter, C. S., Randerson, J. T., Field, C. B., Matson, P. A., Vitousek, P. M., Mooney, H. A., and Klooster, S. A.: Terrestrial ecosystem production: a process model based on global satellite and surface data, Global Biogeochem. Cy., 7, 811–841, https://doi.org/10.1029/93GB02725, 1993. a, b
R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, available at: https://www.Rproject.org/ (last access: 16 November 2021), 2020. a
Rayner, P. and O'Brien, D.: The utility of remotely sensed CO_{2} concentration data in surface source inversions, Geophys. Res. Lett., 28, 175–178, https://doi.org/10.1029/2000GL011912, 2001. a
Rienecker, M. M., Suarez, M. J., Todling, R., Bacmeister, J., Takacs, L., Liu, H.C., Gu, W., Sienkiewicz, M., Koster, R. D., Gelaro, R., Stajner, I., and Nielsen, J.: The GEOS5 Data Assimilation SystemDocumentation of Versions 5.0.1, 5.1.0, and 5.2.0, NASA Tech. Rep. TM2008104606, 2008. a
Rodgers, C. D.: Inverse Methods for Atmospheric Sounding, World Scientific, Singapore, Singapore, https://doi.org/10.1142/3171, 2000. a, b
Rodgers, C. D. and Connor, B. J.: Intercomparison of Remote Sounding Instruments, J. Geophys. Res., 108, 4116, https://doi.org/10.1029/2002JD002299, 2003. a
Saeki, T., Maksyutov, S., Sasakawa, M., Machida, T., Arshinov, M., Tans, P., Conway, T. J., Saito, M., Valsala, V., Oda, T., Andres, R. J., and Belikov, D.: Carbon flux estimation for Siberia by inverse modeling constrained by aircraft and tower CO_{2} measurements, J. Geophys. Res.Atmos., 118, 1100–1122, https://doi.org/10.1002/jgrd.50127, 2013. a
Schuh, A. E., Jacobson, A. R., Basu, S., Weir, B., Baker, D., Bowman, K., Chevallier, F., Crowell, S., Davis, K. J., Deng, F., Denning, S., Feng, L., Jones, D., Liu, J., and Palmer, P. I.: Quantifying the Impact of Atmospheric Transport Uncertainty on CO_{2} Surface Flux Estimates, Global Biogeochem. Cy., 33, 484–500, https://doi.org/10.1029/2018GB006086, 2019. a, b, c
Sherlock, V., Connor, B., Robinson, J., Shiona, H., Smale, D., and Pollard, D. F.: TCCON data from Lauder (NZ), 125HR, Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.lauder02.R0/1149298, 2014. a
Strong, K., Roche, S., Franklin, J. E., Mendonca, J., Lutsch, E., Weaver, D., Fogal, P. F., Drummond, J. R., Batchelor, R., and Lindenmaier, R.: TCCON data from Eureka (CA), Release GGG2014.R1, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.eureka01.r1/1325515, 2016. a
Takahashi, T., Sutherland, S. C., Sweeney, C., Poisson, A., Metzl, N., Tilbrook, B., Bates, N., Wanninkhof, R., Feely, R. A., Sabine, C., Olafsson, J., and Nojiri, Y.: Global sea–air CO_{2} flux based on climatological surface ocean pCO_{2}, and seasonal biological and temperature effects, DeepSea Res. Pt. II, 49, 1601–1622, https://doi.org/10.1016/S09670645(02)000036, 2002. a, b
Takahashi, T., Sutherland, S. C., Wanninkhof, R., Sweeney, C., Feely, R. A., Chipman, D. W., Hales, B., Friederich, G., Chavez, F., Sabine, C., Watson, A., Bakker, D. C. E., Schuster, U., Metzl, N., YoshikawaInoue, H., Ishii, M., Midorikawa, T., Nojiri, Y., Körtzinger, A., Steinhoff, T., Hoppema, M., Olafsson, J., Arnarson, T. S., Tilbrook, B., Johannessen, T., Olsen, A., Bellerby, R., Wong, C. S., Delille, B., Bates, N. R., and de Baar, H. J. W.: Climatological mean and decadal change in surface ocean pCO_{2}, and net sea–air CO_{2} flux over the global oceans, DeepSea Res. Pt. II, 56, 554–577, https://doi.org/10.1016/j.dsr2.2008.12.009, 2009. a, b
Thoning, K. W., Crotwell, A. M., and Mund, J. W.: Atmospheric Carbon Dioxide Dry Air Mole Fractions from continuous measurements at Mauna Loa, Hawaii, Barrow, Alaska, American Samoa and South Pole. 1973–2019, Version 202008, National Oceanic and Atmospheric Administration (NOAA), Global Monitoring Laboratory (GML) [data set], Boulder, Colorado, https://doi.org/10.15138/yaf1bk21, 2020. a
Tierney, L.: Markov Chains for Exploring Posterior Distributions, Ann. Statist., 22, 1701–1728, https://doi.org/10.1214/aos/1176325750, 1994. a
Turner, A. J. and Jacob, D. J.: Balancing aggregation and smoothing errors in inverse models, Atmos. Chem. Phys., 15, 7039–7048, https://doi.org/10.5194/acp1570392015, 2015. a
UNFCCC: Adoption of the Paris Agreement, Report No. FCCC/CP/2015/L.9/Rev.1, available at: http://unfccc.int/resource/docs/2015/cop21/eng/l09r01.pdf (last access: 16 November 2021), 2015. a
Vecchia, A. V.: Estimation and model identification for continuous spatial processes, J. Roy. Stat. Soc. BMet., 50, 297–312, https://doi.org/10.1111/j.25176161.1988.tb01729.x, 1988. a, b
Warneke, T., Messerschmidt, J., Notholt, J., Weinzierl, C., Deutscher, N. M., Petri, C., and Grupe, P.: TCCON data from Orléans (FR), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.orleans01.R0/1149276, 2014. a
Wennberg, P. O., Roehl, C. M., Wunch, D., Toon, G. C., Blavier, J.F., Washenfelder, R., KeppelAleks, G., Allen, N. T., and Ayers, J.: TCCON data from Park Falls (US), Release GGG2014.R0, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.parkfalls01.R0/1149161, 2014. a
Wennberg, P. O., Wunch, D., Roehl, C. M., Blavier, J.F., Toon, G. C., and Allen, N. T.: TCCON data from Caltech (US), Release GGG2014.R1, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.pasadena01.r1/1182415, 2015. a
Wennberg, P. O., Wunch, D., Roehl, C. M., Blavier, J.F., Toon, G. C., and Allen, N. T.: TCCON data from Lamont (US), Release GGG2014.R1, CaltechDATA [data set], https://doi.org/10.14291/tccon.ggg2014.lamont01.r1/1255070, 2016. a
Wikle, C. K., ZammitMangion, A., and Cressie, N.: SpatioTemporal Statistics with R, Chapman & Hall/CRC Press, Boca Raton, FL, 2019. a, b
Worden, J. R., Doran, G., Kulawik, S., Eldering, A., Crisp, D., Frankenberg, C., O'Dell, C., and Bowman, K.: Evaluation and attribution of OCO2 XCO_{2} uncertainties, Atmos. Meas. Tech., 10, 2759–2771, https://doi.org/10.5194/amt1027592017, 2017. a, b
Wunch, D., Toon, G. C., Wennberg, P. O., Wofsy, S. C., Stephens, B. B., Fischer, M. L., Uchino, O., Abshire, J. B., Bernath, P., Biraud, S. C., Blavier, J.F. L., Boone, C., Bowman, K. P., Browell, E. V., Campos, T., Connor, B. J., Daube, B. C., Deutscher, N. M., Diao, M., Elkins, J. W., Gerbig, C., Gottlieb, E., Griffith, D. W. T., Hurst, D. F., Jiménez, R., KeppelAleks, G., Kort, E. A., Macatangay, R., Machida, T., Matsueda, H., Moore, F., Morino, I., Park, S., Robinson, J., Roehl, C. M., Sawa, Y., Sherlock, V., Sweeney, C., Tanaka, T., and Zondlo, M. A.: Calibration of the Total Carbon Column Observing Network using aircraft profile data, Atmos. Meas. Tech., 3, 1351–1362, https://doi.org/10.5194/amt313512010, 2010. a
Wunch, D., Toon, G. C., Blavier, J.F. L., Washenfelder, R. A., Notholt, J., Connor, B. J., Griffith, D. W., Sherlock, V., and Wennberg, P. O.: The Total Carbon Column Observing Network, Philos. T. Roy. Soc. A, 369, 2087–2112, https://doi.org/10.1098/rsta.2010.0240, 2011a. a, b, c, d
Wunch, D., Wennberg, P. O., Toon, G. C., Connor, B. J., Fisher, B., Osterman, G. B., Frankenberg, C., Mandrake, L., O'Dell, C., Ahonen, P., Biraud, S. C., Castano, R., Cressie, N., Crisp, D., Deutscher, N. M., Eldering, A., Fisher, M. L., Griffith, D. W. T., Gunson, M., Heikkinen, P., KeppelAleks, G., Kyrö, E., Lindenmaier, R., Macatangay, R., Mendonca, J., Messerschmidt, J., Miller, C. E., Morino, I., Notholt, J., Oyafuso, F. A., Rettinger, M., Robinson, J., Roehl, C. M., Salawitch, R. J., Sherlock, V., Strong, K., Sussmann, R., Tanaka, T., Thompson, D. R., Uchino, O., Warneke, T., and Wofsy, S. C.: A method for evaluating bias in global measurements of CO_{2} total columns from space, Atmos. Chem. Phys., 11, 12317–12337, https://doi.org/10.5194/acp11123172011, 2011b. a, b, c
Yantosca, B.: geoschem/geoschem: GEOSChem 12.3.2, 12.3.2, Zenodo [code], https://doi.org/10.5281/zenodo.2658178, 2019. a
Yevich, R. and Logan, J. A.: An assessment of biofuel use and burning of agricultural waste in the developing world, Global Biogeochem. Cy., 17, 1095, https://doi.org/10.1029/2002GB001952, 2003. a, b
ZammitMangion, A., Cressie, N., and Ganesan, A. L.: NonGaussian bivariate modelling with application to atmospheric tracegas inversion, Spatial Statistics, 18, 194–220, https://doi.org/10.1016/j.spasta.2016.06.005, 2016. a, b
 Abstract
 Introduction
 Bayesian hierarchical model for global flux inversion of trace gases
 The model–data discrepancy term
 Setup of fluxinversion framework
 Results
 Conclusion
 Appendix A: Markov chain Monte Carlo algorithm
 Appendix B: Observation operator for OCO2 retrievals
 Code and data availability
 Author contributions
 Competing interests
 Disclaimer
 Acknowledgements
 Financial support
 Review statement
 References
 Supplement
The requested paper has a corresponding corrigendum published. Please read the corrigendum first before downloading the article.
 Article
(4966 KB)  Fulltext XML
 Corrigendum

Supplement
(299 KB)  BibTeX
 EndNote
 Abstract
 Introduction
 Bayesian hierarchical model for global flux inversion of trace gases
 The model–data discrepancy term
 Setup of fluxinversion framework
 Results
 Conclusion
 Appendix A: Markov chain Monte Carlo algorithm
 Appendix B: Observation operator for OCO2 retrievals
 Code and data availability
 Author contributions
 Competing interests
 Disclaimer
 Acknowledgements
 Financial support
 Review statement
 References
 Supplement