Articles | Volume 18, issue 23
https://doi.org/10.5194/gmd-18-9279-2025
https://doi.org/10.5194/gmd-18-9279-2025
Development and technical paper
 | 
01 Dec 2025
Development and technical paper |  | 01 Dec 2025

Predicting and correcting the influence of boundary conditions in regional inverse analyses

Hannah Nesser, Kevin W. Bowman, Matthew D. Thill, Daniel J. Varon, Cynthia A. Randles, Ashutosh Tewari, Felipe J. Cardoso-Saldaña, Emily Reidy, Joannes D. Maasakkers, and Daniel J. Jacob
Abstract

Regional inverse analyses of atmospheric trace gas observations quantify gridded two-dimensional surface fluxes by fitting the observations to simulated concentrations from a transport model, usually by Bayesian optimization regularized by a gridded prior flux estimate. Regional inversions rely on the specification of background concentrations given by the boundary conditions (BCs) at the edges of the inversion domain, but biases in the BCs propagate to biases in the optimized fluxes. We develop a theoretical framework to explain how errors in the BCs influence the optimized fluxes as a function of the prior and observing system error statistics and of model transport. We derive a preview metric to estimate the BC-induced errors before conducting an inversion to support domain specification and a diagnostic metric to accurately quantify these errors after solving the inversion. We compare two methods to correct BC biases as part of an inversion, either directly by optimizing BC concentrations (boundary method) or indirectly by expanding the domain and correcting grid cell fluxes outside the region of interest (buffer method). We demonstrate that the boundary method is generally more accurate, physically grounded, and computationally tractable.

Share
1 Introduction

Regional inversions of observed atmospheric concentrations of long-lived trace gases quantify surface fluxes by fitting simulated concentrations from a transport model to the observations assuming boundary conditions (BCs) at the edge of the inversion domain. Such analyses can improve knowledge of fluxes and their trends on local to continental scales at high spatiotemporal resolution while avoiding the need to accurately quantify fluxes and concentrations globally (Sargent et al., 2021; Nesser et al., 2024; Byrne et al., 2024). However, BCs are often uncertain and biases in the BCs propagate to the inferred fluxes. Here we examine the problem of how BC biases affect regional inversions of greenhouse gas fluxes and develop a framework to predict and correct the influence of these biases.

BCs can be provided by coarse-resolution global simulations (Göckede et al., 2010; Wecht et al., 2014), by statistical or meteorological analysis of trace gas observations (Lauvaux et al., 2016; Balashov et al., 2020), or by combining simulated and observed concentrations (Sargent et al., 2018; Estrada et al., 2025). The information used to constrain BCs is often spatiotemporally sparse and may be biased. Even small BC biases can cause large biases in the inferred fluxes (Göckede et al., 2010; Lauvaux et al., 2012; Karion et al., 2021). BC biases are particularly critical for regional inverse analyses of long-lived gases such as carbon dioxide and methane where concentrations and variability can depend significantly on inflow.

Regional inversions generally infer gridded surface fluxes (the state vector) by minimizing a Bayesian cost function that accounts for the error statistics of the observing system (including the observations and transport model) and of the prior flux estimate used to regularize the solution. BC biases may be corrected as part of the inversion by optimizing BC concentrations as part of the state vector (Lauvaux et al., 2012; Wecht et al., 2014), by allowing buffer grid cells at the edge of the domain to absorb BC biases (Shen et al., 2021; Varon et al., 2022), or by combining these approaches (Estrada et al., 2025).

We present here a theoretical framework to predict, diagnose, and correct the influence of BC biases on the posterior gridded surface fluxes generated by a regional inversion (Sect. 2). We consider an inert trace gas with no sources or sinks within the inversion domain other than the surface fluxes. We demonstrate this framework with increasingly complex one- and two-dimensional simulation experiments that invert pseudo-observations of a long-lived trace gas generated using known fluxes and BCs (Sects. 3 and 4, respectively). Throughout, we compare different methods to correct BC biases within inversions and present metrics that estimate the effect of BC biases.

2 Analytical solution for the effect of boundary condition errors

We describe the analytical inverse solution, which we use to derive an exact solution (diagnostic) for the sensitivity of the posterior fluxes to a BC bias (Sect. 2.1). We apply this diagnostic to a simple one-dimensional transport model to determine how these errors depend on constant inversion parameters (Sect. 2.2). We then generalize the results using a two-box model applicable to two-dimensional inversions with variable inversion parameters. We use this two-box model to derive a preview metric that predicts the sensitivity of posterior fluxes to BC biases for a given inversion configuration, allowing the user to improve inversion parameters as needed before solving the inversion (Sect. 2.3).

2.1 Diagnostic equation

Given an n-dimensional state vector of gridded fluxes x and an m-dimensional vector of observations y, both with normally distributed errors, the optimal flux estimate x^ is obtained by minimizing a Bayesian cost function

(1) J ( x ) = ( x - x A ) T S A - 1 ( x - x A ) + ( y - F ( x ) ) T S O - 1 ( y - F ( x ) ) ,

where xA and SA are the prior flux estimate and error covariance matrix, respectively, SO is the observing system error covariance matrix representing uncertainties in the observations and the transport model, and F is the transport model (Brasseur and Jacob, 2017). We assume as is standard that the transport model initial conditions are given by a sufficiently long spin-up simulation driven by the BCs so that the initial conditions are consistent with the BCs. If the transport model is linear, then F(x)=Kx+c where K=F/x is the Jacobian matrix. The vector c represents the model background defined by the transport of the BCs to the same spatiotemporal locations as the observations. If the BC concentrations are optimized as part of the inversion, information about the background is instead contained in the state vector and in the columns of the Jacobian matrix so that c=0.

We can write the analytical solution for the cost function minimum that yields the optimal (posterior) flux estimate x^ as

(2) x ^ = x A + G ( y - ( K x A + c ) )

where

(3) G = x ^ y = K T S O - 1 K + S A - 1 - 1 K T S O - 1

is the gain matrix that represents the sensitivity of the posterior fluxes to the observations. We can separate the model-observation difference into the contributions from the errors in the prior fluxes relative to the true fluxes (xT) and the errors in the observing system (ε=y-(KxT+c)) so that

(4) x ^ = x A + A ( x T - x A ) + G ε

where A=x^/x=GK is the averaging kernel matrix, a measure of inversion information content that gives the sensitivity of the posterior fluxes to the true fluxes. The trace of the averaging kernel matrix gives the degrees of freedom for signal (DOFS), the number of independent pieces of information quantified by the inversion (Rodgers, 2000).

The posterior error induced by a BC error (εC) is derived by comparing the posterior fluxes produced with the true BC (cT) to an inversion with the BC error (c=cT+εC):

(5) Δ x ^ = - G ε C .

We define Eq. (5) as the diagnostic that estimates the effect of BC bias on the posterior fluxes given an explicitly constructed Jacobian matrix and assumed BC error statistics, which can be estimated from observed variability of background concentrations. BC-induced posterior errors are controlled by the gain matrix, which is a function of the observing system errors, prior flux errors, and transport as represented by the Jacobian matrix.

https://gmd.copernicus.org/articles/18/9279/2025/gmd-18-9279-2025-f01

Figure 1One-dimensional and two-box models of a passive trace gas to quantify the influence of BCs on inverse analyses. The one-dimensional model (a) simulates the concentrations of an inert trace gas over n grid cells of length L given a prescribed BC c, fluxes x=[x1,x2,,xn]T, and advection with wind speed U. The two-box model (c) generalizes an inverse analysis optimizing fluxes over a two-dimensional grid (b) where all grid cells have a representative grid cell length L given the wind direction. Each grid cell in the domain is reduced to a two-box model composed of a cluster of the j upwind grid cells (index j0) and the grid cell itself (index j). In all models, the inversion is solved with the averaged observation yi over each grid cell i={1,,n} with corresponding error standard deviation σO,i. The inversion is regularized by the prior flux xA,i in each grid cell with corresponding prior error standard deviation σA,i.

Download

2.2 One-dimensional model

To better understand how observing system errors, prior flux errors, and transport determine the influence of BC biases on posterior flux estimates (Eq. 5), we consider a one-dimensional horizontal transport model with constant wind speed U. Figure 1a depicts this model for n grid cells of length L. The Jacobian matrix for this transport model is derived through steady-state mass balance (Sect. S1 in the Supplement) to be lower diagonal with constant values given by

(6) τ = α τ

where α converts between the flux units and the observation units and τ is the grid cell residence time calculated as τ=L/U. The trace gas sources in the model are the BC c and fluxes x=[x1,x2,,xn]T. We assume that the m observations are uniformly distributed in space and time with constant uncorrelated error variance σO2. We solve the inversion (Eq. 2) using the averaged observations in each grid cell so that the observing system error covariance matrix SO is diagonal with constant error variances σO2/mg where mg=m/n is the number of observations in each grid cell. We assume constant prior fluxes xA and diagonal prior error covariance matrix SA with constant error variances σA2. We define the dimensionless, domain-average information ratio R of the prior error variances in concentration units to the observing system error variances

(7) R = m g τ σ A σ O 2 .

The information ratio increases with decreasing observing system error standard deviation, increasing prior error standard deviation, and increasing residence time so that the observations are more sensitive to the fluxes relative to the BC. Large values of the information ratio (R≫1) represent the case where the prior errors are larger than the observing system errors so that the posterior fluxes are strongly constrained by the observations (observation-rich). Small values (R≪1) correspond to inversions that are limited by the number or uncertainty of the observations (observation-limited), including the common case of spatially heterogeneous observations. Consider an illustrative inversion of a methane-like trace gas with τ=1.4 h (corresponding to U=5ms-1 and L=25 km), σA=12.5ppbh-1 (corresponding to 50 % uncertainty on relatively large prior emissions of 25 ppb h−1), and σO=10 ppb. In this case, an average of mg=200 observations per grid cell are needed to achieve R=1. The information ratio for an inversion can be increased by increasing the inversion duration to include more observations or by coarsening the resolution of the gridded fluxes optimized by the inversion.

The effect of a constant BC bias εC on the posterior fluxes is calculated as a function of the information ratio with the diagnostic (Eq. 5). We derive (Sect. S2 in the Supplement) the BC-induced error in the limiting cases of inversions that are observation-limited (R≪1) or observation-rich (R≫1):

(8) Δ x ^ j = - τ - 1 ε C ( n - j + 1 ) R , R 1 - τ - 1 ε C R - j + 1 , R 1 , j = { 1 , , n } .

Figure 2 shows the BC-induced error in these limiting cases (panel a) and for intermediate values of the information ratio calculated using Eq. (5) (panel b) for an illustrative inversion with n=10, εC=10ppb, and τ=1.4h corresponding to U=5ms-1 and L=25 km. In all cases, BC-induced errors decrease across the domain with the rate of decay bounded by the limiting cases in Eq. (8). In the observation-limited case, the BC-induced errors decrease linearly with the distance from the upwind boundary. The total BC-induced error is relatively small because of the strong constraint from the prior estimate. As the information ratio increases from R≪1, BC-induced errors increase because of the decreased prior constraint. The errors increase the most in the most upwind grid cells. As the information ratio approaches one, the total error converges to the total flux needed to explain the error in the BC (τ−1εC). Further increases in the observational constraint decrease the length scale over which the BC bias influences the posterior solution but not the total BC-induced error. In the observation-rich case, the BC-induced errors decay geometrically as the distance from the upwind boundary increases so that the BC-induced errors are limited to the most upwind grid cell.

https://gmd.copernicus.org/articles/18/9279/2025/gmd-18-9279-2025-f02

Figure 2BC-induced error on posterior surface fluxes as a function of the information ratio R (Eq. 7) for a one-dimensional atmospheric trace gas inversion over ten grid cells with constant parameters. The inversions assume τ=1.4 h corresponding to U=5ms-1 and L=25. The solid lines (b) represent the error induced by a 10 ppb BC bias relative to the prior emissions of 25 ppb h−1 in each grid cell (thin lines) and the total error integrated across the full domain (thick black line). The dashed line corresponds to an information ratio of 1, while the dotted lines show representative values for which the limiting cases described in Eq. (8) hold. The distribution of the BC-induced errors across the domain for these representative values is also shown (a).

Download

We also consider the effect of inverse methods to decrease BC biases within the constant parameter, one-dimensional inverse model. We compare the effect of optimizing the BC concentrations (boundary method) or fluxes outside of the domain of interest (buffer method). The boundary method optimizes one or more terms corresponding to BC concentrations along with the domain fluxes (Lauvaux et al., 2012; Wecht et al., 2014; Hancock et al., 2025). The buffer method expands the inversion domain to optimize fluxes in outlying grid cells, which are allowed to vary unphysically to absorb BC biases and are excluded from the final analysis (Shen et al., 2021; Varon et al., 2022). Both approaches increase the state vector dimension, thereby altering the Jacobian matrix and prior error covariance matrix. The buffer method also applies large prior error standard deviations to the buffer grid cells, often by aggregating the buffer grid cells into large clusters.

We derive the conditions under which the boundary and buffer methods are equivalent for an inversion of a single grid cell. For the boundary method, we use a BC prior error standard deviation equal to the constant BC bias εC and update the Jacobian matrix to include the sensitivity of the grid cell average observation to the BC. For the buffer method, we add a buffer grid cell and scale its prior error standard deviation by a factor of p. We assume observations are equally distributed over both grid cells. We solve both inversions. The posterior fluxes are equivalent when

(9) p = ε C τ σ A R + 2 .

Assuming steady state and knowledge of the domain wind speed, the buffer scale factor p can be chosen so that buffer and boundary methods perform equivalently. The scale factor goes to infinity for very small values of the information ratio (observation limited) and to zero for very large values (observation rich). The dependence of the scale factor on residence time implies that the buffer method is more vulnerable to varying wind speeds than the boundary method.

2.3 Preview for boundary condition errors

Equation (5) can estimate the effect of BC-induced errors on the posterior solution once model transport is characterized by the construction of the Jacobian matrix. We estimate BC-induced errors to inform the choice of inversion domain before building the Jacobian matrix by applying Eq. (5) to a simple steady-state two-box model that generalizes a gridded flux inversion. Figure 1b and c depict the model. The domain has BC c and constant wind speed U. All grid cells have representative grid cell length L given the wind direction. We consider a grid cell (with index j) that is j grid cells from the domain boundary and define the upwind grid cell (with index j0) as the aggregate of the upwind j grid cells. We conduct an inversion of the average observations over these two grid cells y=[yj0yj]T regularized by prior fluxes xA=[xA,j0xA,j]T. We define the prior error covariance matrix as

(10) S A = σ A , j 0 2 0 0 σ A , j 2

and use an equivalent structure for the observing system error covariance matrix. We assume the observing system error standard deviations account for the reduction in error resulting from averaging the observations. The Jacobian matrix is derived by steady-state mass balance (Sect. S1) to be a function of τ (Eq. 6) given by

(11) K = τ j 0 j 1 .

We solve for the effect of a constant BC bias εC=[εCεC]T on the posterior flux in the grid cell of interest Δx^j by applying the diagnostic (Eq. 5) to compute Δx^=[Δx^j-1Δx^j]T. This defines the preview

(12) Δ x ^ j = - ( τ - 1 ε C ) R j 1 + R j + β R j 0 + R j R j 0

where Rj0 and Rj refer to the information ratio (Eq. 7) in the upwind and selected grid cell, respectively, with τj0=jτ, and

(13) β = m g , j m g , j 0 + 1 .

Here, mg,j0 and mg,j refer to the number of observations in the upwind and selected grid cell, respectively, so that β represents the relative observation density across the grid cells. For grid cells abutting the boundary, all upwind values are set to 0 so that the BC-induced error approaches τ−1εC when the grid cell information ratio is large and 0 when the grid cell information ratio is small. As distance from the boundary increases, the BC-induced errors on average decrease due to the accumulation of upwind fluxes and observations, which increases the upwind information ratio Rj0. Variability in the prior fluxes and observation density as quantified by the information ratio Rj is super-imposed on this decay.

By analogy to our one-dimensional model, we consider the observation-limited and observation-rich limiting cases assuming constant prior fluxes, prior flux standard deviations, observation density, and observing system errors. In this case, Rj0=j4Rj due to the dependence on the number of grid cells j of the upstream observation count (mg,j0=jmg), grid cell residence time (τj0=jτ), and prior flux uncertainty (assuming the uncertainties are uncorrelated, σA,j0=σA/j). The resulting BC-induced errors

(14) Δ x ^ j = - ( τ - 1 ε C ) R , R 1 - ( τ - 1 ε C ) j - 4 R - 1 , R 1

match the general form of the one-dimensional approximation (Eq. 8), but with the geometric decay in the observation-controlled limiting case approximated as quartic decay.

3 One-dimensional numerical solution

We demonstrate the preview, diagnostic, and correction methods using illustrative, numerical inversions of a one-dimensional model for the horizontal transport of an inert trace gas (Sect. 2.2; Nesser, 2025). Table S1 in the Supplement summarizes the model parameters and Table 1 the inversion parameters, which we select to simulate realistic inversions. We use these demonstration inversions to illustrate the effect of constant BC biases in inversions with constant and varying wind speeds (Sect. 3.1). We then consider periodic BC biases (Sect. 3.2). We finally generalize the results by varying the parameter choice to understand the variables controlling BC-induced errors (Sect. 3.3).

3.1 Constant boundary condition perturbations

We quantify the sensitivity of posterior fluxes to BC biases by perturbing the true BC in our one-dimensional model, solving the inversion, and comparing the posterior fluxes to those generated by an inversion solved with the true BC (Nesser, 2025). The inversions apply no other sources of bias (including transport errors) to isolate the effect of BC biases. The domain-average information ratio R=0.2 for these demonstration inversions reflects the low DOFS achieved (5 for inversions with constant wind speeds and 7 for varying wind speeds) despite the uniformity and large number of observations (m=1000), consistent with real inversions (Varon et al., 2023).

Table 1Inversion parameters for simulation experiments.

a The error covariance matrices are assumed diagonal. b We optimize an incremental correction to the BC. c We optimize one buffer element that is a cluster of three native-resolution grid cells. d The buffer scale factor is the factor p applied to the prior error standard deviation in the buffer grid cell. e We calculate the buffer scale factor with Eq. (9). f We increase the Express Extension of the Gridded EPA inventory for 2020 (Maasakkers et al., 2023) by a factor of three so that the prior error statistics are consistent with the true emissions. g We use the IMI default boundary conditions, which are given by monthly mean TROPOMI observations smoothed over ≈1000 km. h We take as the inversion buffer zone the five concentric rings of 0.25°×0.3125° grid cells around the outer edge of the domain. We solve inversions using 10 buffer cluster elements and 440 native resolution buffer grid cells. i The IMI assumes a constant error standard deviation of 15 ppb for all observations aggregated into errors for the averaged observations accounting for error correlations following Chen et al. (2023). We use the errors generated by the IMI directly.

Download Print Version | Download XLSX

Figure 3a shows the relative difference in the posterior fluxes, normalized by the prior fluxes, induced by a constant 10 ppb BC perturbation for an inversion with constant wind speeds. As expected from our theoretical analysis, the resulting error on average decreases as the distance from the upwind boundary increases. Exceptions to the decreasing trend result from normalizing by the prior fluxes. The preview (Eq. 12) accurately predicts the error as expected from the specification of the wind speed. The diagnostic (Eq. 5; not shown) perfectly quantifies the no correction method error as expected given the specification of the BC bias.

https://gmd.copernicus.org/articles/18/9279/2025/gmd-18-9279-2025-f03

Figure 3Exact and predicted influence of a constant BC bias on the posterior fluxes generated by a one-dimensional inverse model using different BC correction methods. The lines represent the posterior error induced by a 10 ppb BC perturbation as a function of the number of grid cells from the upwind boundary using constant (a) and varying (b) wind speeds. The error is given as the difference between inversions solved with the true and biased BCs, normalized by the prior fluxes (Δx^/xA). The error is shown for inversions solved with no BC correction method, the boundary method, and the buffer method. The buffer grid cell is not shown. The panels also show the preview. Only the first 13 of 20 grid cells are shown; all errors approach zero after this point. The varying wind speed is a see-sawing wind with mean equal to the constant wind speed.

Download

Figure 3a also shows the effect of the buffer and boundary methods, which are described in Table 1. The buffer method optimizes a buffer cluster of three native resolution grid cells. Increasing the buffer grid cell size decreases the magnitude of BC-induced errors but increases the computational cost. The prior uncertainties of the buffer element are increased by a factor of 10p where p is the scale factor for which the boundary and buffer methods are theoretically equivalent (Eq. 9). Increasing p improves performance, but the error reduction quickly asymptotes so that a factor of 10 is sufficient to maximize error reduction in all tested cases. The boundary method optimizes an average correction to the BC.

Both the boundary and buffer methods reduce the error induced by the constant BC perturbation. The buffer method achieves larger uncertainty reductions than the boundary method as expected due to the choice of scale factor on the prior uncertainty. The uncorrected errors for the boundary method result from the decreased information available to correct the upwind fluxes, while the residual errors for the buffer method result from uncorrected BC biases. In all cases, the residual error magnitude, as measured by the root mean square error (RMSE) of the posterior fluxes compared to the inversion solved with the true BCs, is constant as a function of BC perturbation. This implies that the correction methods may increase posterior errors for grid cells close to the boundary in inversions with small BC biases. Excluding these grid cells from final analysis will minimize the effect of BC biases on the inversion.

The consistent performance of the metrics and of the correction methods results in part from the use of constant wind speeds, which is aligned with the assumptions used to derive the preview and the equivalence between the correction approaches. Figure 3b shows the effect of varying wind speeds. The BC-induced bias consists of decaying upwind biases and structured downwind biases. The preview (Eq. 12) accurately captures the average error decay over the domain, though it is unable to predict the structure of the downwind biases. The diagnostic (Eq. 5; not shown) perfectly predicts both the upwind and downwind biases due to its representation of transport and perfect knowledge of the BC bias. Both correction methods decrease the upwind biases, but only the boundary method decreases the downwind biases. Moreover, the RMSE of the posterior fluxes compared to the inversion solved with the true BCs is constant as a function of perturbation magnitude for the boundary method while the RMSE increases with perturbation magnitude for the buffer method. The buffer method is less able to decrease the variability of the BC-induced biases due to its dependence on wind speed (Eq. 9).

3.2 Varying boundary condition perturbations

Varying BC biases are unresolvable within inversions without prior knowledge of their structure. We consider variable BC biases to identify the types of biases that are most important to avoid, to understand the impact of correction methods, and to demonstrate the performance of the preview and diagnostic when the structure of the BC bias is not known. Because varying biases can be represented as the sum of periodic functions, we represent them with periodic BCs with varying y intercept, amplitude, and period number (Sect. S3 in the Supplement). The correction methods are implemented as in the constant perturbation case (Table 1 and Sect. 3.1) but with three BC elements optimized instead of one in the boundary method. This marginally increases the computational cost of the inversion but improves performance.

As in the inversion with a constant BC error and varying wind speeds, the periodic BC-induced bias consists of decaying upwind biases and structured downwind biases (Fig. S1 in the Supplement). The upwind biases are driven by the mean bias of the periodic perturbation over the inversion period. The spatial structure of the downwind biases is driven by the frequency of the BC error and the magnitude by the amplitude. This suggests that the correction methods will improve the upwind errors but not the downwind errors and that decreasing the amplitude of BC errors may reduce biases that are otherwise unresolvable.

Figure 4 shows the relative difference in the posterior fluxes, normalized by the prior fluxes, induced by a demonstration periodic perturbation with mean bias of 4 ppb over the inversion period (panel a and Sect. S3) for inversions with constant and varying wind speeds (panels b and c, respectively). The upwind biases are controlled by the mean bias with the error on average decreasing with the distance from the upwind boundary. The preview and diagnostic (Eqs. 13 and 5, respectively), which are calculated using the mean bias, capture these biases. The diagnostic also captures the downwind biases attributable to transport. Similarly, both correction methods decrease the upwind bias. The boundary method achieves modest error reductions because the BC-induced error is of similar magnitude to the error generated by the loss of information content due to the BC optimization. Downwind biases are only decreased by the boundary method when three or more BC elements are optimized.

https://gmd.copernicus.org/articles/18/9279/2025/gmd-18-9279-2025-f04

Figure 4Exact and predicted influence of a periodic BC bias on the posterior fluxes generated by a one-dimensional inverse model using different BC correction methods. (a) shows the true and periodic BC. (b, c) show the posterior error induced by the periodic BC as a function of the number of grid cells from the upwind boundary error is given as the difference between inversions solved with the true and biased BCs, normalized by the prior fluxes (Δx^/xA) for constant and varying wind speeds, respectively. The error is shown for inversions solved with no BC correction method, the buffer method, and the boundary method. The buffer and boundary methods each optimize one additional element. The buffer grid cell is not shown. The panels also show the preview and diagnostic, which are calculated using the mean BC bias over the inversion period. The varying wind speed is a see-sawing wind with mean equal to the constant wind speed.

Download

3.3 Varying inversion parameters

BC-induced posterior biases are a function of the relative constraint from the observations compared to the prior fluxes as measured by the information ratio (R, Eq. 7) and the grid cell residence time (τ, Eq. 6). To determine how changes in these quantities influence BC-induced biases, we define the influence length scale for our one-dimensional model as the number of grid cells required for the BC-induced error normalized by the prior fluxes to decrease below 0.25 (Δx^/xA<0.25). The influence length scale captures the strong dependence of the BC-induced errors on the number of grid cells from the domain boundary and reflects the additional computational cost associated with optimizing these grid cells. We vary the mean prior flux (from 20–50 ppb h−1) and associated prior error standard deviations, observation density (from 50–200 observations per grid cell), and mean wind speed (from 3–10 m s−1) and calculate the domain average information ratio and residence time for each inversion. Uncertainty is quantified as the standard deviation of the influence length scale across 100 inversions solved with random prior fluxes. All inversions use varying wind speeds and a 10 ppb BC perturbation, though the results are robust for constant wind speeds.

Figure 5a shows the influence length scale as a function of the information ratio for an inversion with no correction method. As predicted from the one-dimensional model solved with constant parameters (Eq. 8 and Fig. 2), the influence length scale decreases with the information ratio. The largest rate of change occurs in the observation-limited regime as the observational constraint shifts the BC-induced errors into the upwind grid cells. In the observation-rich case, the influence length scale is relatively constant as a function of the information ratio. Figure 5b shows the inverse dependence of the influence length scale on grid cell residence time as predicted by Eq. (8). As residence time increases, the influence length scale decreases because the relative contribution of the fluxes compared to the BC increases in the observed concentrations, improving the observational constraint.

https://gmd.copernicus.org/articles/18/9279/2025/gmd-18-9279-2025-f05

Figure 5Sensitivity of the BC-induced posterior error to changes in parameters in a one-dimensional inverse model. The vertical axis shows the influence length scale defined as the number of grid cells before which the prior-normalized posterior errors resulting from a 10 ppb BC bias decrease below 0.25 (Δx^/xA<0.25). The lines show the influence length scale for the preview and for inversions solved with no BC correction method, the boundary method, and the buffer method. Shading gives the one standard deviation range for 100 inversions solved with different random prior fluxes. (a) shows the influence length scale as a function of the information ratio R (Eq. 7) that describes the ratio of observing system error variances to the prior error variances in concentration units. The information ratio R=1 is marked by a vertical dashed line. (b) shows the influence length scale as a function of grid cell residence time (τ).

Download

Finally, we consider the performance of the metrics and correction methods. The preview (Eq. 12) on average captures the dependence of the BC-induced error (no correction method) on the information ratio and grid cell residence time. In almost all cases, it marginally overestimates the influence length scale so that it represents an upper bound on error. As expected from the prescription of the BC perturbation, the diagnostic (not shown) perfectly predicts the influence length scale associated with no correction method. Both correction methods display functionally no dependence on the information ratio and grid cell residence time. The boundary and buffer methods comparably reduce the influence length scale in all cases.

4 Two-dimensional numerical solution

We extend the framework and understanding derived from the analytical solution and one-dimensional model to a two-dimensional demonstration inversion of TROPOMI-like methane column pseudo-observations over the Permian basin using the GEOS-Chem transport model as implemented by the Integrated Methane Inversion (IMI; Varon et al., 2022; Estrada et al., 2025; Nesser, 2025). The Permian Basin is the largest oil producing region in North America and the subject of many inverse analyses (e.g., Zhang et al., 2020; Barkley et al., 2023; Varon et al., 2023; Vanselow et al., 2024). Table S1 summarizes the model parameters, Table 1 gives the inversion parameters, and Fig. 6 shows the inversion domain, true emissions (panel a), prior emissions (panel b), and observation density as defined by real TROPOMI observations for May 2020 (panel c) for the n=285 grid cells within the inversion domain. The true BC is given by the IMI's TROPOMI-based smoothed BCs (Estrada et al., 2025). Assuming a constant wind speed of 5 m s−1 and using average values of the prior and observing system error standard deviations, the domain average information ratio is R=0.03, reflecting the heterogeneous observational constraint.

https://gmd.copernicus.org/articles/18/9279/2025/gmd-18-9279-2025-f06

Figure 6Exact and predicted influence of a constant BC bias on a two-dimensional simulation inversion over the Permian basin in Texas using different BC correction methods. True emissions from the Environmental Defense Fund high-resolution inventory (a; Zhang et al., 2020) and true BCs from smoothed TROPOMI observations are used to generate pseudo-observations over the Permian basin (dashed line) with observational density (c) given by the TROPOMI methane observations for May 2020. The inversion is regularized by prior emissions given by the Express Extension of the Gridded EPA Inventory for 2020 multiplied by a factor of three so that the prior error statistics include the true emissions (b; Maasakkers et al., 2023). The error induced by a spatially variable BC bias with mean 10 ppb is calculated as the difference between inversions solved with the true and biased BCs normalized by the prior fluxes. The error is shown for inversions solved with no correction method (e), the boundary method (f), and the buffer method using ten buffer clusters (g) and five rows of native resolution grid cells (h). The dark black outlines show the buffer clusters or grid cells where the prior uncertainties are artificially inflated and for which posterior fluxes are calculated but not shown. The preview (d) is also shown.

We apply a spatially variable BC bias of 7.5, 10, 10 and 12.5 ppb to the northern, southern, eastern, and western boundaries, respectively, using the Jacobian matrix for each edge of the domain as generated by the IMI. Figure 6e shows the resulting error in the posterior emissions normalized by the prior emissions. Unlike the BC-induced errors shown in the one-dimensional inversions, which used random prior fluxes, the distribution of the relative errors is strongly a function of the prior emissions. This likely results from the skewed distribution of the emissions across the domain, which limits the inversion's ability to correct fluxes near the domain edge.

The preview (Eq. 12) calculated with a BC uncertainty of 10 ppb is shown in Fig. 6d. We estimate the Jacobian matrix elements following Eq. (6) as implemented by Nesser et al. (2024) with a constant wind speed of 5 m s−1, constant surface pressure of 1000 hPa, and a grid cell length scale given by the square root of each grid cell's area. We define the upwind length scale as the minimum distance from each grid cell's center to the border, dividing the domain into four regions based on proximity to each boundary. Upwind emissions and observation counts are calculated as the cumulative sum of the median value of each row or column between each grid cell and its closest boundary. Observing system errors are calculated by decreasing the 15 ppb single-observation uncertainty by the square root of the observation count. We apply a 5 ppb observing system error minimum to reflect transport and BC uncertainty.

The preview models the decay in BC-induced error as distance from the domain boundary increases accounting for the prior emission distribution and observation density (Eq. 12 and Sect. 2.3). As a result, the preview most accurately captures the effect of the BC bias nearest to the domain edge with a Pearson correlation coefficient of r=0.76 between the BC-induced biases and the preview for grid cells within three rows of the boundary. The performance of the preview degrades near the center of the domain, though there is still good agreement between the BC-induced biases and the preview for the full domain (r=0.67). The preview is larger than the BC-induced bias in most (63 %) of the grid cells so that the preview represents an error upper-bound. The diagnostic (not shown) perfectly captures the errors across the full domain as expected from the specification of the BC uncertainty.

Figure 6f, g, and h also show the correction methods. For the buffer method, we test two sets of buffer grid cells within the five concentric rings of grid cells around the outer edge of the domain. The first (clusters) aggregates the individual grid cells into 10 large buffer clusters using K-means clustering. The second (no clusters) uses 440 individual grid cells as buffers. In all cases, we scale the prior error standard deviation for the buffer elements by a factor of p=1000, which we find is sufficient to minimize BC-induced errors in all tested inversions. This very large, unphysical factor is chosen to compensate for the very small emissions around the edge of the Permian basin. For the boundary method, we optimize a mean bias correction along the northern, southern, eastern, and western boundaries.

The correction results mirror those found in the one-dimensional varying wind speed example. All correction methods decrease the BC-induced errors. Despite relying on fewer than half of the observations used by the buffer method, the boundary method virtually eliminates BC-induced errors while avoiding unphysical flux corrections and significantly decreasing computational cost by reducing the domain size. The buffer method with no clusters outperforms the buffer method with clusters, suggesting that multiple rows of buffer grid cells are preferable to large clusters. Larger numbers of buffer grid cells may also better absorb biases with higher resolution spatial variability.

5 Conclusions

We developed and demonstrated a framework to predict and correct the effect of BC biases on the optimal (posterior) gridded surface fluxes generated by regional inversions of atmospheric trace gas observations using a transport model. We proposed two metrics to predict BC-induced errors both before (preview) and after (diagnostic) the inversion is solved. The preview can inform the choice of inversion domain while the diagnostic improves posterior error quantification. We also considered two methods to correct BC biases as part of an inversion by optimizing the BC directly (boundary method) or by unphysically correcting grid cell fluxes outside the domain of interest (buffer method). Both methods can obtain identical error reductions in inversions with constant wind speeds, but the boundary method is more effective for inversions with variable wind speeds, provides a physical constraint, and reduces computational cost. Beyond the application to regional inversions of long-lived gases presented here, the framework is more generally applicable to the analysis of bias in the observations or transport model and the treatment of initial conditions in global inversions.

We demonstrated our theoretical framework using a simple one-dimensional model for the horizontal transport of an inert trace gas, which represents the worst-case scenario for the propagation of BC biases to the posterior flux estimate. BC-induced errors on average decay with increasing distance from the domain edge while smaller downwind systematic biases result from variability in the transport model wind speeds or BC biases. The length scale over which BC-induced errors have a significant effect on the posterior fluxes is minimized when the observations provide a strong constraint across the domain, which limits the bias to the most upwind grid cells. The preview identifies the grid cells with the largest biases, supporting domain specification before the inversion is conducted, while the diagnostic perfectly predicts the BC-induced error when the BC perturbation is specified. The boundary and buffer method both significantly reduce the influence length scale, but only the boundary method can decrease the magnitude of downwind errors. The results are robust for constant and variable BC biases.

We extended the framework to a two-dimensional demonstration inversion over the Permian Basin in Texas using model transport from the GEOS-Chem transport model. The BC-induced biases do not decay with distance from the boundary but instead correlate with the prior flux estimate due to the concentration of emissions in the center of the domain. Despite the difference in the distribution of BC-induced biases compared to the one-dimensional model, we find similar performance of the metrics and correction methods. The preview accurately predicts the BC-induced errors, with the best agreement occurring in the grid cells closest to the domain boundary. The diagnostic accurately describes the errors across the full domain. The boundary method functionally eliminates the BC-induced errors at much lower computational cost despite relying on fewer than half of the observations used in the buffer method. The buffer method decreases but does not eliminate BC-induced errors, with performance improving as the number of buffer clusters increases.

Code and data availability

The code and data for both the one-dimensional and two-dimensional simulation experiments are available at https://doi.org/10.5281/zenodo.17417750 (Nesser, 2025). The IMI v2.0 used to generate the inputs for the two-dimensional example is available at https://github.com/geoschem/integrated_methane_inversion/releases/tag/imi-2.0.1 (last access: 6 October 2025) (IMI, 2025).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gmd-18-9279-2025-supplement.

Author contributions

HN and DJJ designed the study. HN, DJJ, and KWB contributed to the theoretical development. HN conducted the theoretical analysis and demonstration applications. MDT contributed to the proof of Eq. (8). KWB, DJV, CR, AT, FJCS, ER, and JDM discussed the results. HN, DJJ, and KWB wrote the paper with input from all authors.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004). This work was funded in part by ExxonMobil Technology and Engineering Company. We thank Kimberly Mueller for her feedback.

Financial support

This research has been supported by the NASA Postdoctoral Program at the Jet Propulsion Laboratory, California Institute of Technology, administered by Oak Ridge Associated Universities under contract with NASA.

Review statement

This paper was edited by Marko Scholze and reviewed by Anna Karion and one anonymous referee.

References

Balashov, N. V., Davis, K. J., Miles, N. L., Lauvaux, T., Richardson, S. J., Barkley, Z. R., and Bonin, T. A.: Background heterogeneity and other uncertainties in estimating urban methane flux: results from the Indianapolis Flux Experiment (INFLUX), Atmos. Chem. Phys., 20, 4545–4559, https://doi.org/10.5194/acp-20-4545-2020, 2020. 

Barkley, Z., Davis, K., Miles, N., Richardson, S., Deng, A., Hmiel, B., Lyon, D., and Lauvaux, T.: Quantification of oil and gas methane emissions in the Delaware and Marcellus basins using a network of continuous tower-based measurements, Atmos. Chem. Phys., 23, 6127–6144, https://doi.org/10.5194/acp-23-6127-2023, 2023. 

Brasseur, G. and Jacob, D. J.: Modeling of atmospheric chemistry, Cambridge University Press, Cambridge New York, ISBN 978-1107146969, 2017. 

Byrne, B., Liu, J., Bowman, K. W., Yin, Y., Yun, J., Ferreira, G. D., Ogle, S. M., Baskaran, L., He, L., Li, X., Xiao, J., and Davis, K. J.: Regional Inversion Shows Promise in Capturing Extreme-Event-Driven CO2 Flux Anomalies but Is Limited by Atmospheric CO2 Observational Coverage, J. Geophys. Res.-Atmos., 129, e2023JD040006, https://doi.org/10.1029/2023JD040006, 2024. 

Chen, Z., Jacob, D. J., Gautam, R., Omara, M., Stavins, R. N., Stowe, R. C., Nesser, H., Sulprizio, M. P., Lorente, A., Varon, D. J., Lu, X., Shen, L., Qu, Z., Pendergrass, D. C., and Hancock, S.: Satellite quantification of methane emissions and oil–gas methane intensities from individual countries in the Middle East and North Africa: implications for climate action, Atmos. Chem. Phys., 23, 5945–5967, https://doi.org/10.5194/acp-23-5945-2023, 2023. 

Estrada, L. A., Varon, D. J., Sulprizio, M., Nesser, H., Chen, Z., Balasus, N., Hancock, S. E., He, M., East, J. D., Mooring, T. A., Oort Alonso, A., Maasakkers, J. D., Aben, I., Baray, S., Bowman, K. W., Worden, J. R., Cardoso-Saldaña, F. J., Reidy, E., and Jacob, D. J.: Integrated Methane Inversion (IMI) 2.0: an improved research and stakeholder tool for monitoring total methane emissions with high resolution worldwide using TROPOMI satellite observations, Geosci. Model Dev., 18, 3311–3330, https://doi.org/10.5194/gmd-18-3311-2025, 2025. 

Göckede, M., Turner, D. P., Michalak, A. M., Vickers, D., and Law, B. E.: Sensitivity of a subregional scale atmospheric inverse CO2 modeling framework to boundary conditions, J. Geophys. Res.-Atmos., 115, https://doi.org/10.1029/2010JD014443, 2010. 

Hancock, S. E., Jacob, D. J., Chen, Z., Nesser, H., Davitt, A., Varon, D. J., Sulprizio, M. P., Balasus, N., Estrada, L. A., Cazorla, M., Dawidowski, L., Diez, S., East, J. D., Penn, E., Randles, C. A., Worden, J., Aben, I., Parker, R. J., and Maasakkers, J. D.: Satellite quantification of methane emissions from South American countries: a high-resolution inversion of TROPOMI and GOSAT observations, Atmos. Chem. Phys., 25, 797–817, https://doi.org/10.5194/acp-25-797-2025, 2025. 

IMI: Integrated Methane Inversion, https://imi.seas.harvard.edu/ (last access: 6 October 2025), 2025. 

Karion, A., Lopez-Coto, I., Gourdji, S. M., Mueller, K., Ghosh, S., Callahan, W., Stock, M., DiGangi, E., Prinzivalli, S., and Whetstone, J.: Background conditions for an urban greenhouse gas network in the Washington, DC, and Baltimore metropolitan region, Atmos. Chem. Phys., 21, 6257–6273, https://doi.org/10.5194/acp-21-6257-2021, 2021. 

Lauvaux, T., Schuh, A. E., Uliasz, M., Richardson, S., Miles, N., Andrews, A. E., Sweeney, C., Diaz, L. I., Martins, D., Shepson, P. B., and Davis, K. J.: Constraining the CO2 budget of the corn belt: exploring uncertainties from the assumptions in a mesoscale inverse system, Atmos. Chem. Phys., 12, 337–354, https://doi.org/10.5194/acp-12-337-2012, 2012. 

Lauvaux, T., Miles, N. L., Deng, A., Richardson, S. J., Cambaliza, M. O., Davis, K. J., Gaudet, B., Gurney, K. R., Huang, J., O'Keefe, D., Song, Y., Karion, A., Oda, T., Patarasuk, R., Razlivanov, I., Sarmiento, D., Shepson, P., Sweeney, C., Turnbull, J., and Wu, K.: High-resolution atmospheric inversion of urban CO2 emissions during the dormant season of the Indianapolis Flux Experiment (INFLUX), J. Geophys. Res.-Atmos., 121, 5213–5236, https://doi.org/10.1002/2015JD024473, 2016. 

Maasakkers, J. D., McDuffie, E. E., Sulprizio, M. P., Chen, C., Schultz, M., Brunelle, L., Thrush, R., Steller, J., Sherry, C., Jacob, D. J., Jeong, S., Irving, B., and Weitz, M.: A Gridded Inventory of Annual 2012–2018 U. S. Anthropogenic Methane Emissions, Environ. Sci. Technol., 57, 16276–16288, https://doi.org/10.1021/acs.est.3c05138, 2023. 

Nesser, H.: Boundary condition sensitivity simulation experiments (v2.0), Zenodo [code], https://doi.org/10.5281/zenodo.17417750, 2025. 

Nesser, H., Jacob, D. J., Maasakkers, J. D., Lorente, A., Chen, Z., Lu, X., Shen, L., Qu, Z., Sulprizio, M. P., Winter, M., Ma, S., Bloom, A. A., Worden, J. R., Stavins, R. N., and Randles, C. A.: High-resolution US methane emissions inferred from an inversion of 2019 TROPOMI satellite data: contributions from individual states, urban areas, and landfills, Atmos. Chem. Phys., 24, 5069–5091, https://doi.org/10.5194/acp-24-5069-2024, 2024. 

Rodgers, C. D.: Inverse methods for atmospheric sounding: theory and practice, World Scientific, Singapore, ISBN 978-9810227401, https://doi.org/10.1142/3171, 2000. 

Sargent, M., Barrera, Y., Nehrkorn, T., Hutyra, L. R., Gately, C. K., Jones, T., McKain, K., Sweeney, C., Hegarty, J., Hardiman, B., Wang, J. A., and Wofsy, S. C.: Anthropogenic and biogenic CO2 fluxes in the Boston urban region, P. Natl. Acad. Sci. USA, 115, 7491–7496, https://doi.org/10.1073/pnas.1803715115, 2018. 

Sargent, M. R., Floerchinger, C., McKain, K., Budney, J., Gottlieb, E. W., Hutyra, L. R., Rudek, J., and Wofsy, S. C.: Majority of US urban natural gas emissions unaccounted for in inventories, P. Natl. Acad. Sci. USA, 118, e2105804118, https://doi.org/10.1073/pnas.2105804118, 2021. 

Shen, L., Zavala-Araiza, D., Gautam, R., Omara, M., Scarpelli, T., Sheng, J., Sulprizio, M. P., Zhuang, J., Zhang, Y., Qu, Z., Lu, X., Hamburg, S. P., and Jacob, D. J.: Unravelling a large methane emission discrepancy in Mexico using satellite observations, Remote Sens. Environ., 260, 112461, https://doi.org/10.1016/j.rse.2021.112461, 2021. 

Vanselow, S., Schneising, O., Buchwitz, M., Reuter, M., Bovensmann, H., Boesch, H., and Burrows, J. P.: Automated detection of regions with persistently enhanced methane concentrations using Sentinel-5 Precursor satellite data, Atmos. Chem. Phys., 24, 10441–10473, https://doi.org/10.5194/acp-24-10441-2024, 2024. 

Varon, D. J., Jacob, D. J., Sulprizio, M., Estrada, L. A., Downs, W. B., Shen, L., Hancock, S. E., Nesser, H., Qu, Z., Penn, E., Chen, Z., Lu, X., Lorente, A., Tewari, A., and Randles, C. A.: Integrated Methane Inversion (IMI 1.0): a user-friendly, cloud-based facility for inferring high-resolution methane emissions from TROPOMI satellite observations, Geosci. Model Dev., 15, 5787–5805, https://doi.org/10.5194/gmd-15-5787-2022, 2022. 

Varon, D. J., Jacob, D. J., Hmiel, B., Gautam, R., Lyon, D. R., Omara, M., Sulprizio, M., Shen, L., Pendergrass, D., Nesser, H., Qu, Z., Barkley, Z. R., Miles, N. L., Richardson, S. J., Davis, K. J., Pandey, S., Lu, X., Lorente, A., Borsdorff, T., Maasakkers, J. D., and Aben, I.: Continuous weekly monitoring of methane emissions from the Permian Basin by inversion of TROPOMI satellite observations, Atmos. Chem. Phys., 23, 7503–7520, https://doi.org/10.5194/acp-23-7503-2023, 2023. 

Wecht, K. J., Jacob, D. J., Frankenberg, C., Jiang, Z., and Blake, D. R.: Mapping of North American methane emissions with high spatial resolution by inversion of SCIAMACHY satellite data, J. Geophys. Res.-Atmos., 119, 7741–7756, https://doi.org/10.1002/2014JD021551, 2014. 

Zhang, Y., Gautam, R., Pandey, S., Omara, M., Maasakkers, J. D., Sadavarte, P., Lyon, D., Nesser, H., Sulprizio, M. P., Varon, D. J., Zhang, R., Houweling, S., Zavala-Araiza, D., Alvarez, R. A., Lorente, A., Hamburg, S. P., Aben, I., and Jacob, D. J.: Quantifying methane emissions from the largest oil-producing basin in the United States from space, Sci. Adv., 6, eaaz5120, https://doi.org/10.1126/sciadv.aaz5120, 2020. 

Download
Short summary
Regional analyses of atmospheric trace gases can improve knowledge of fluxes at high resolution but rely on specified boundary conditions (BCs) at the domain edges. Biases in the often-uncertain BCs propagate to the inferred fluxes. We develop a framework to explain how errors in the BCs influence the optimized fluxes, derive two metrics to estimate this influence, and compare two methods to correct for the biases. We demonstrate correcting BCs directly is more effective at reducing bias.
Share