Articles | Volume 11, issue 12
https://doi.org/10.5194/gmd-11-5203-2018
https://doi.org/10.5194/gmd-11-5203-2018
Methods for assessment of models
 | 
21 Dec 2018
Methods for assessment of models |  | 21 Dec 2018

A generic pixel-to-point comparison for simulated large-scale ecosystem properties and ground-based observations: an example from the Amazon region

Anja Rammig, Jens Heinke, Florian Hofhansl, Hans Verbeeck, Timothy R. Baker, Bradley Christoffersen, Philippe Ciais, Hannes De Deurwaerder, Katrin Fleischer, David Galbraith, Matthieu Guimberteau, Andreas Huth, Michelle Johnson, Bart Krujit, Fanny Langerwisch, Patrick Meir, Phillip Papastefanou, Gilvan Sampaio, Kirsten Thonicke, Celso von Randow, Christian Zang, and Edna Rödig
Abstract

Comparing model output and observed data is an important step for assessing model performance and quality of simulation results. However, such comparisons are often hampered by differences in spatial scales between local point observations and large-scale simulations of grid cells or pixels. In this study, we propose a generic approach for a pixel-to-point comparison and provide statistical measures accounting for the uncertainty resulting from landscape variability and measurement errors in ecosystem variables. The basic concept of our approach is to determine the statistical properties of small-scale (within-pixel) variability and observational errors, and to use this information to correct for their effect when large-scale area averages (pixel) are compared to small-scale point estimates. We demonstrate our approach by comparing simulated values of aboveground biomass, woody productivity (woody net primary productivity, NPP) and residence time of woody biomass from four dynamic global vegetation models (DGVMs) with measured inventory data from permanent plots in the Amazon rainforest, a region with the typical problem of low data availability, potential scale mismatch and thus high model uncertainty. We find that the DGVMs under- and overestimate aboveground biomass by 25 % and up to 60 %, respectively. Our comparison metrics provide a quantitative measure for model–data agreement and show moderate to good agreement with the region-wide spatial biomass pattern detected by plot observations. However, all four DGVMs overestimate woody productivity and underestimate residence time of woody biomass even when accounting for the large uncertainty range of the observational data. This is because DGVMs do not represent the relation between productivity and residence time of woody biomass correctly. Thus, the DGVMs may simulate the correct large-scale patterns of biomass but for the wrong reasons. We conclude that more information about the underlying processes driving biomass distribution are necessary to improve DGVMs. Our approach provides robust statistical measures for any pixel-to-point comparison, which is applicable for evaluation of models and remote-sensing products.

1 Introduction

The rate of environmental change in tropical South America and in particular in the Amazon region has been unprecedented in the last decades (e.g. Lewis et al., 2011; Davidson et al., 2012). Estimates of the amount of carbon stored in tropical rainforest biomass differ strongly (Avitabile et al., 2016; Baccini et al., 2012; Saatchi et al., 2011; Mitchard et al., 2014). In addition, estimated carbon release to the atmosphere from land-use change is uncertain (e.g. Houghton et al., 2012; Baccini et al., 2017; Song et al., 2015; Harris et al., 2012). Nonetheless, a successful implementation of protection incentives, e.g. for reducing emissions from deforestation and degradation (REDD+), requires both accurate estimates of existing regional carbon stocks as well as improved projections of future scenarios (e.g. Langner et al., 2014).

Dynamic global vegetation models (DGVMs) are important tools to estimate impacts of climate and land-use change on the carbon cycle (e.g. Cramer et al., 2004; Sitch et al., 2008). To correctly capture carbon dynamics in tropical forests, DGVMs need to improve the simulation approach of drought-related mortality and other types of tree mortality that control stand density (Pillet et al., 2018), they need to include more detailed gap dynamics influencing stand dynamics (Espírito-Santo et al., 2014; Rödig et al., 2017), and they need to incorporate how nutrient availability limits woody productivity (Quesada et al., 2012; Johnson et al., 2016). At the same time, model evaluation based on available data is necessary for which the primary source of data is ground-based observations of aboveground biomass (AGB) obtained from forest census data (Lopez-Gonzalez et al., 2011, 2014; Brienen et al., 2014, 2015; Mitchard et al., 2014; Johnson et al., 2016). When conducting model–data comparisons at the plot scale, the spatial resolution of both come into focus. The size of forest plots is typically of the order of 1 ha or less, whereas average DGVM grid-cell resolution is determined by the available gridded climate dataset, which is usually about several thousand square kilometres (> 100 000 ha). Plot observations are affected by observational errors, uneven spatial distribution (Saatchi et al., 2015) and spatial variability due to natural gap dynamics (Chambers et al., 2013), and are thus likely to exhibit substantial deviation from average large-scale properties. The problem of comparing point data with model results obtained at grid-cell (pixel) size occurs in many applications of remote sensing and ecological modelling.

So far, we are lacking a reliable and objective method to compare simulation results from DGVMs at grid-cell scale (pixel) and plot (point) observations. Several studies, which evaluated patterns of interpolated maps from plot data and model simulations, concluded that the observed and simulated spatial patterns do not match (e.g. Johnson et al., 2016). Here, we complement these findings by providing quantitative statistical measures for such comparisons and present an approach for performing pixel-to-point comparisons while accounting for different statistical properties of point and pixel values and their uncertainties. The basic concept of the approach is to determine statistical properties from small-scale variability and observational errors in ecosystem variables in order to account for these effects when comparing large-scale area averages (pixels) and small-scale plot estimates (points).

We apply our approach by comparing point estimates of ecosystem properties obtained from forest inventories (Mitchard et al., 2014; Brienen et al., 2015) to corresponding simulated pixel values from four state-of-the-art DGVMs. Similar to Johnson et al. (2016), we focus on three ecosystem properties that are well defined and represented in both DGVMs and forest inventories: (i) aboveground biomass (AGB, in Mg C ha−1); (ii) aboveground woody productivity (WP, in Mg C ha−1 yr−1); and (iii) residence time of woody biomass (τ in years).

We evaluate the accuracy of the spatial pattern of these three ecosystem properties and provide statistical measures for the quality of model simulations in comparison to observations, thereby accounting for small-scale landscape variability and associated measurement errors. We demonstrate the strength of our approach by highlighting its applicability for model evaluation and model benchmarking.

In particular, we address three research questions.

  1. How well do models represent variations in AGB across the Amazon region? We expand the pure (visual) qualitative comparison by deriving three statistical metrics for a quantitative comparison.

  2. How large are the differences between observed and simulated spatial biomass patterns (based on the presented metrics), in particular when considering different allometric equations? We discuss the effects of inadequate data and associated model uncertainty.

  3. What can we learn from the spatial heterogeneity and underlying drivers of spatial biomass patterns? We analyse simulated and observed patterns of WP and τ.

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f01

Figure 1Schematic overview of the three steps in the pixel-to-point comparison. Note that we refer to “landscape” as the region of interest for which point and pixel data are available. (a) The mean and global variability at the point scale are calculated from all plots across the landscape; (b) the within-pixel variability is calculated from all plots within the distance of the pixel size (i.e. the red arrow corresponds with pixel size); (c) the mean and global variability is calculated from the pixel values and the three comparison metrics are derived. Note: a block bootstrapping with 10 000 repetitions is performed to derive confidence intervals of the comparison metrics. The detailed set of equations to calculate maximum similarity, PA and SP can be found in the Supplement.

Download

2 Methods

Landscape variability depends on the extent and heterogeneity of the study area (Turner et al., 2001). Point measurements within a pixel of larger spatial scale, for example, may reveal small-scale spatial variabilities within the pixel. We derive a “within-pixel variability” that so far has not been accounted for in earlier approaches. We present three steps to calculate three metrics that provide a measure on the best achievable correlation between point and pixel values (see Fig. 1).

2.1 A generic method for point-to-pixel comparisons

2.1.1 Calculate the “global variability” across the region of interest

Assume that we have a dataset X with a number N of point observations xi at location i (e.g. plot observations from inventory data). In the first step, we calculate the mean x and variance σx2 across all plots in a region (e.g. across the Amazon region). The variance σx2 denotes the global variability (i.e. the variability across the whole Amazon region) at the point scale (Fig. 1a).

2.1.2 Calculate within-pixel variability

In the second step, we identify within-pixel variability from point measurements. With coarser pixel resolution, the spatial variability (here: global variability) is reduced. In order to compare pixel values against point values, the global variability at the point scale needs to be reduced by the within-pixel variability (variability component εi; Fig. 1b).

The variability component εi is assumed to be normally distributed with zero mean and variance σε2:

(1) ε i N 0 , σ ε 2 .

Based on the variability component εi, we estimate the within-pixel variance σε2 from the point observations by analysing their covariance, which is equivalent to the nugget effect (i.e. the sum of variance caused by small-scale variability and observation error) in a semivariogram (see methods in the Supplement). Due to the limited amount of inventory data, we assume here that σε2 is stationary across the region of interest (for details on that assumption see discussion and results in the Supplement).

The global variance at the point scale σx2 now differs from the corrected global variance at the pixel scale, σx,corr2, as variances add quadratically, assuming that the (small scale) variability component εi has errors uncorrelated to the global distribution of x.

(2) σ x , corr 2 = σ x 2 - σ ε 2 .

2.1.3 Metrics for the comparison of two datasets with different spatial resolutions

In a third step, we compare the point data xi with simulated data yi at the pixel scale. Similar to the above procedure, we calculate the mean y and variance σy2 for the simulated pixels that contain point observations (hereby we assign each point observation the pixel value in which the point is located). We then compare the simulation results by applying three metrics:

  1. Mean bias (MB): the ratio of means y/x across the whole region as a measure of the mean bias in the patterns which is not affected by small-scale variability;

  2. Pattern amplitude (PA): the ratio of standard deviations σy/σx,corr using the corrected global variability (i.e. removed within-pixel variability) and serves as a measure of differences in pattern amplitude or in the variability in the simulated and observed data;

  3. Similarity of pattern (SP): we use rcorr as a measure of the similarity of the “shape” of spatial patterns, i.e. the spatial correlation of simulated and observed data (see the Supplement). Accordingly, we can calculate the maximum achievable correlation coefficient rmax, which is derived from correlating the observational dataset at the point scale to the same observational dataset at the pixel scale (see Fig. 1a, b and the Supplement).

The limited number of point observations and their non-random spatial distribution in the Amazon region affects the accuracy of the comparison. We therefore estimate confidence intervals for the comparison metrics MB, PA and SP, respectively, by applying a bootstrapping technique (10 000 repetitions). Because the estimation of σε2 is based on the analysis of the spatial correlation structure of the data, a block-bootstrapping is performed (Politis and Romano, 1994). For each permutation, the domain of observations is randomly divided into 100 tiles (random orientation and offset, ca. pixel size) from which a random recombination is drawn with replacement. This technique assures that the spatial correlation structure of the data remains intact.

2.2 Application of the pixel-to-point comparison to simulated and observed data from the Amazon region

2.2.1 Observed data at the point scale: description of site-level data

The observed data at the point scale are forest-census-based plot measurements across the Amazon region, in which all plots that were subject to anthropogenic disturbances, including selective logging, were excluded (Brienen et al., 2015). The average plot size is ∼1.2 ha (Brienen et al., 2015) so that the plots incorporate most size classes of natural gaps, particularly as the plot data were averaged across sites occurring within the same pixel. Across the plot network, the biases introduced into the estimate of carbon balance by 1 ha plots not sampling the very largest and rarest natural gaps are in fact very small (Espírito-Santo et al., 2014). We use datasets of AGB (Lopez-Gonzalez et al., 2011, 2014; Mitchard et al., 2014), WP and woody loss (WL; Brienen et al., 2014). WP and WL are derived “ from the sum of biomass growth of surviving trees and trees that recruited (that is, reached a diameter  100 mm), and mortality [= woody loss] from the biomass of trees that died between censuses” (Brienen et al., 2015). We convert AGB, WP and WL from dry biomass to carbon mass (see methods in the Supplement). For the calculation of AGB, we use different allometric equations that account for regional differences in wood density or tree height (Table S1). We exemplify our comparison metrics based on AGB calculated from the three-parameter moist tropical forest allometry from Chave et al. (2005), where tree height is estimated from the diameter at breast height (DBH) individually for each stem based on the region-specific Weibull models from Feldpausch et al. (2012). Wood density is estimated for each stem using the mean value for the species in the Global Wood Density Database (Chave et al., 2009; Zanne et al., 2009), or the mean for the genus using congeneric taxa from Mexico, Central America and tropical South America if no data were available for that species (Mitchard et al., 2014). We here evaluate the principal AGB dataset (KDHρ) from Mitchard et al. (2014) in more detail (the other allometric equations are presented in the Supplement).

2.2.2 Simulated data at the pixel scale: description of DGVM simulations

We use outputs from four state-of-the-art DGVMs, namely the Lund-Potsdam-Jena model with managed Land (LPJmL, Bondeau et al., 2007; Gerten et al., 2004; Sitch et al., 2003), the Joint UK Land Environment Simulator (JULES), v. 2.1. (Best et al., 2011; Clark et al., 2011), the INtegrated model of LAND surface processes (INLAND) model (a development of the IBIS model; Kucharik et al., 2000) and the Organising Carbon and Hydrology In Dynamic Ecosystems (ORCHIDEE) model (Krinner et al., 2005). A short description of each of the applied models is provided in the Supplement. The models were applied to the Amazon region covering the area of 88 to 34 W and 13 N to 25 S at a spatial resolution of 1×1 lat/long. The resolution of the DGVMs is defined by the resolution of the climate input data for which we used bias-corrected NCEP meteorological data (Sheffield et al., 2006). Model runs were performed based on the standardized Moore Foundation Andes-Amazon Initiative (AAI) modelling protocol (Zhang et al., 2015). The same set of models and output variables was analysed in Johnson et al. (2016).

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f02

Figure 2Estimates of aboveground biomass (AGB) from forest plots in 1×1 pixels. (a) Mean AGB per pixel derived from inventory data based on one allometric equation (KDHρ, see the Supplement for explanation and other allometric equations). (b) Number of plots per pixel and (c–f) simulated AGB from four DGVMs.

Download

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f03

Figure 3Distribution of aboveground biomass (AGB in Mg C ha−1) from the four DGVMs and from the observational plots (see also Table S2 and S3). The figure shows (a) the mean value (white dot) and distribution from bootstrapping of absolute AGB values from the four simulations and observed data (grey violin shapes). (b) Mean bias as the ratio of mean simulated and mean observed AGB (y/x).

Download

2.2.3 Comparing inventory and simulation results

In our application, dataset X corresponds to the inventory measurements at the point scale (Fig. 1a). For this dataset, we have to derive the within-pixel variability (Fig. 1b). Dataset Y corresponds to the simulated pixel values (Fig. 1c). Hence, the pixel scale is defined by the resolution of the model simulation (1×1, approximately 12 200 km2, Fig. 1c). We calculate the three metrics (Sect. 2.1.3) from the observed and simulated ecosystem variables AGB, WP and τ.

3 Results

3.1 Comparison of aboveground biomass (AGB)

The visual comparison indicates that the spatial pattern of AGB from the plots (Figs. 2a and S1) differs from the spatial patterns of AGB simulated by either DGVM (Fig. 2c–f). In addition, the DGVM patterns are vastly different among each other.

Mean x and global variability σx in AGB for all plot observations across the Amazon region (Fig. 3a) range from 134 to 153 and from 36 to 50 Mg C ha−1, respectively (Table S2), depending on the allometric equation applied. Within-pixel variability σε, as calculated from Eq. (S1), ranges between 28 and 36 Mg C ha−1. The corrected variability in observed AGB at the pixel scale (σx,corr) is thus substantially lower than the global variability and ranges between 22 and 39 Mg C ha−1 (Fig. 4a, Table S2). Based on these estimates we calculate the maximum achievable coefficients rmax for a comparison between pixel averages and point estimates of 0.61 to 0.78 for different allometric equations (Fig. 5).

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f04

Figure 4(a) Standard deviations of AGB (in Mg C ha−1) for the four models and observational data (for the other allometric equations see also Table S2, S3). For the observational data, the global variability at the point scale (“observed”) and the corrected variability at the pixel scale (“corrected”) is given; (b) ratio of standard deviations without correcting for within-pixel variability (c) corrected metrics of pattern amplitude (σy/σx,corr).

Download

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f05

Figure 5The similarity of the observed vs. simulated spatial pattern of AGB at the pixel scale (as indicated by r given in bars). The similarity is calculated for different versions of observed AGB derived from six allometric equations (indicated by the colours, see Table S1). The dashed lines show the maximum achievable correlation coefficients rmax from the observational data and for the different allometric equations.

Download

The models simulate a continuous cover of biomass across the Amazon region at a spatial resolution of 1×1 pixel size. For our comparison, we only use the simulated pixel values of AGB at each plot location. Thus, the estimated statistical properties are not representative for the entire Amazon region but only for a relatively small subset of pixels (i.e. 98 pixels as in Fig. 2a, b). For simulated AGB from the four DGVMs, we estimate a mean y of 114 Mg C ha−1 for INLAND, 151 Mg C ha−1 for JULES, 217 Mg C ha−1 for ORCHIDEE and 170 Mg C ha−1 for LPJmL (Fig. 3a, Table S3). Depending on the allometric equation applied to calculate observed biomass (Table S1), INLAND underestimates mean AGB by 15 %–25 %. LPJmL and ORCHIDEE overestimate AGB by 11 %–26 % and 42 %–62 %, respectively. JULES deviates only by 1 % from AGB derived from the two-parameter allometric equations, but overestimates AGB derived from all three-parameter allometric equations by 12 % (Table S3).

Mean global variability in simulated AGB, σy, ranges between 13 Mg C ha−1 (JULES) and 62 Mg C ha−1 (ORCHIDEE; Fig. 4a and Table S3). Without correcting for small-scale variability σε in the point-to-pixel comparison, we would conclude that the pattern amplitude simulated by ORCHIDEE and LPJmL agree quite well with observed patterns (Fig. 4b). However, when accounting for the lower corrected variability (σx,corr), because the error of observation-based estimates at the pixel level is smaller, it becomes easier to falsify models with uncertain data. We find that LPJmL and ORCHIDEE both overestimate the observed spatial amplitude by 43 % and 62 %, respectively (Fig. 4c and see Table S3 for other allometric equations). For INLAND and JULES, on the other hand, we find a corresponding underestimation of pattern amplitude by 14 % and 65 %, respectively. We also note that confidence intervals for σy/σx,corr are large in particular for ORCHIDEE and LPJmL (Fig. 4c).

Correlation coefficients indicating the similarity of simulated and observed patterns of AGB range from 0.25 to 0.53 (corrected) across all models (Table S3). The highest similarity of pattern (i.e. best correlation values rcorr) is found for ORCHIDEE, lowest similarity of pattern for LPJmL. Across the three models INLAND, JULES and LPJmL, generally, higher similarity of pattern is found for the allometric equations that include regional height models and mean or species-specific wood density (KDHρ, KDH; Fig. 5).

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f06

Figure 6Estimates of aboveground woody productivity (WP) from forest plots in 1×1 pixels. (a) Mean WP from inventory plots; (b) number of plots per pixel; and (c–f) simulated WP from four DGVMs.

Download

Table 1Results of the point-to-pixel comparison for (a) woody productivity (WP) and (b) residence time of woody biomass (τ). The 5 % and 95 % confidence intervals are given in brackets.

a indicate when models overestimate observed values, b indicate underestimation.

Download Print Version | Download XLSX

3.2 Comparison of woody productivity (WP)

Mean x and variability at the pixel scale σx,corr of observed WP are 2.57 and 0.38 Mg C ha−1 yr−1, respectively. There seems to be a weak spatial pattern in the plot estimates at pixel level (Fig. 6a), which is not reflected by the models (Fig. 6c–f). The DGVMs display a distinct pattern of WP across the region that strongly differs among the four models (Fig. 6c–f).

Mean WP simulated by the DGVMs (y) is between 4 and 5 Mg C ha−1 yr−1 for LPJmL and JULES, and between 8 and 9 Mg C ha−1 yr−1 for INLAND and ORCHIDEE (Table S4). All DGVMs strongly overestimate mean WP (Table 1a). In addition, most models overestimate the pattern amplitude, and the simulated variability ranges between 0.72 and 1.6 Mg C ha−1 yr−1 (Table S4). Pattern similarity of observed and simulated data is low ranging from 0.03 to 0.50 (Table 1a), even with a relatively low maximum achievable correlation of 0.65 (Table 1a).

https://www.geosci-model-dev.net/11/5203/2018/gmd-11-5203-2018-f07

Figure 7Estimates of woody biomass residence time (τ) from forest plots in 1×1 pixels. (b) Mean residence time from inventory plots; (b) number of plots per pixel; and (c–f) simulated τ from four DGVMs.

Download

3.3 Comparison of residence time of woody biomass (τ)

Mean x and variability at the pixel scale σx,corr of observed τ are 74 and 28 years, respectively. Again the visual comparison shows that the simulations do not match the observations (Fig. 7a vs. Fig. 7c–f). The simulated mean y of τ ranges between 15 (INLAND) and 35 (LPJmL) years with a variability of 3 (INLAND) to 8 (LPJmL) years. This is displayed in our comparison metrics: mean bias results in very low values (i.e. strong underestimation of 53 % to 80 %; Table 1b) and pattern amplitude is strongly underestimated by 65 % to 87 % (Table 1b). The similarity of pattern is very low for all models (Table 1b).

4 Discussion

We present a novel approach for a pixel-to-point comparison. We account for the reduced observed variability when going from point to pixel scale by evaluating three indicators, i.e. the mean bias, the pattern amplitude and the similarity of spatial pattern (Sect. 1.3). We use an example from the Amazon region by comparing model output from four DGVMs and forest inventory data. In the following, we discuss our findings of substantial discrepancies between simulated and observed patterns of AGB, WP and τ across the Amazon region.

4.1 How well do model simulations represent observed biomass patterns across the Amazon?

Interpolated biomass maps from plot observations (e.g. Johnson et al., 2016; Malhi et al., 2006) should be treated with caution since plot observations may not be representative at the landscape scale (Chave et al., 2004). As a result, a direct and meaningful comparison of observed and simulated maps is currently not feasible but reliable biomass estimates are necessary for implementation of protection incentives and future projections of vegetation biomass. Our results demonstrate that most models are in good agreement and deviate from mean observational biomass by less than 20 % (i.e. low mean bias, cf. Fig. 3) and their variability at the landscape scale deviates by about 40 % (i.e. pattern amplitude, cf. Fig. 4). Such relatively good agreement was also found in simulation runs from Delbart et al. (2010) and Johnson et al. (2016). Our results even yield relatively high similarity in observed and simulated spatial patterns of AGB at the pixel scale (except LPJmL; cf. Fig. 5), given the fact that the maximum achievable correlation in the data itself is only 0.6–0.8 (Fig. 5). This indicates that there is considerable uncertainty in the data, which needs to be considered in point-to-pixel comparisons and which we elaborate on in the following paragraph.

4.2 How large are the differences between observed and simulated patterns of biomass (based on the presented metrics), in particular when considering different allometric equations?

As discussed by several authors (e.g. Baker et al., 2004; Chave et al., 2006, 2014; Réjou-Méchain et al., 2017) the methodology used to convert plot measurements to actual biomass may lead to differential biomass estimates depending on the respective assumptions of the allometric equations employed (i.e. using species-level or community mean wood density, and region-specific or basin-wide height models; see Table S1 in the Supplement). As a result, we find a more or less pronounced pattern of biomass variability across the Amazon region based on the respective assumption used (cf. Figs. 2 and S1). While mean global variability in biomass is highest for the allometric equations including species-level wood density (cf. Fig. S1, Table S2), highest within-pixel variability is found for biomass values estimated from two-parameter allometric equations (Table S2) excluding tree height (cf. Table S1). This result is also reflected by the lower maximum achievable correlation coefficient (rmax), describing how observational data at the point scale correlates with observational data at the pixel scale, which is particularly low for the two-parameter allometric equations (Fig. 5). Although three out of four DGVMs achieve a relatively good agreement between simulated and observed patterns at the pixel scale, we find substantial uncertainty in the observational data due to spatial heterogeneity of local vegetation characteristics such as the structural and functional tree species composition affecting biomass estimates across the Amazon (see also Rödig et al., 2017). The uncertainty resulting from conversion of raw inventory measurements into biomass from different allometric equations is generally neglected in model–data comparisons. However, it strongly affects our pixel-to-point comparison metrics, thereby remaining an important bottleneck for good model–data biomass comparisons. We suggest to include AGB estimates with associated uncertainties, e.g. using Bayesian inference procedures (see Réjou-Méchain et al., 2017) or to directly compare modelled allometries and related parameters in DGVMs with observational data.

4.3 What can we learn from including spatial heterogeneity and underlying drivers of biomass?

While our approach shows that some models could provide robust estimates for standing biomass stocks across the Amazon region (cf. Fig. 2), it highlights that DGVMs currently do not represent productivity and related turnover correctly (i.e. the relation between productivity and residence time of woody biomass). As a result, the models might simulate the correct patterns for the wrong reasons as far as it can be derived from observational data. The four DGVMs applied in this study generally capture the observed pattern of AGB but strongly overestimate observed WP and underestimate τ, and, from a pixel perspective, do not show strong variability across the Amazon region, thereby not capturing observed gradients (cf. Fig. 6 and Table 1). WP and τ are driving AGB and are calculated by different schemes in the four DGVMs, e.g. regarding carbon allocation and drivers of mortality. Ground observations suggest that forest structure, forest dynamics and species composition vary across the Amazon region, such that variations in geology and soil fertility or mechanical properties coincide with region-wide variations in AGB, growth and stem mortality rates (Johnson et al., 2016; Quesada et al., 2012). Accordingly, recent studies highlight that variation in stem mortality rates determines spatial variation in AGB and conclude that mortality should be modelled on the basis of individual stems, since stem-size distributions and stand density are important for predicting variation in AGB (Johnson et al., 2016; Rödig et al., 2017; Pillet et al., 2018). Projected increasing disturbances with different sizes and frequency may be an important additional driver for further variations under future scenarios (Espírito-Santo et al., 2014; Rödig et al., 2017). Nonetheless, the mechanisms leading to stem mortality need to be implemented in models based on experimental data that are only recently becoming available (Meir et al., 2015; Rowland et al., 2015). Overall, the DGVMs are able to reproduce the observed spatial pattern of AGB across the Amazon region, whereas for WP the model performance is less good and reproduction of the spatial pattern in mortality is generally very poor (Figs. 2, 6, 7). This suggests that models need to account for processes such as WP and mortality more mechanistically by including factors associated with resource limitation and disturbance regimes (see also Johnson et al., 2016). Recent efforts aiming at improving simulated Amazon forest biomass and productivity by including spatial variation in biophysical parameters (such as τ and Vcmax) have found that using single values for key parameters limits simulation accuracy (Castanho et al., 2013). Thus, we conclude that a more mechanistic representation of the processes driving the spatial variability in carbon stocks and fluxes, forest structure, and tree demographic dynamics is necessary to improve simulation accuracy (Rödig et al., 2018).

5 Future applications of the methodological approach and outlook

In general, we assume that the basic concept of our method is applicable to any comparison between two datasets that are characterized by differences in spatial scale. If the process that causes small-scale variability can be approximated as white noise, corrected statistics can be computed. Notwithstanding future developments of next-generation DGVMs, the most relevant step of the presented approach is to account for the within-pixel variability σε from the point data to allow for a comparison of observational and simulated data. Due to relatively sparse plot data availability, we assume here that σε is stationary across the Amazon region. To evaluate this assumption further, we have calculated a regional within-pixel variability σε (Fig. S2) and find that it is in the range of the Amazon-wide σε of 28 to 36 Mg C ha−1 (depending on the allometric equation used, see Table S2). Field studies show that forest dynamics vary locally, mostly due to variations in natural disturbance regimes, mortality and edaphic properties (e.g. Baker et al., 2004; Chambers et al., 2013; Chave et al., 2006; Malhi et al., 2006; John et al., 2007), which in turn strongly influences our calculated within-pixel variability and thus the metrics of the pixel-to-point comparison. Recent regional studies, which combine observational plot data and remote-sensing products from applications such as LIDAR (regions of Peru: Marvin et al., 2014; French Guyana: Fayad et al., 2016; Congo: Xu et al., 2017), have already proven to detect spatial variability at high spatial resolution, which could be used to calculate a pixel-wise within-pixel variability based on our approach. Upcoming remote-sensing missions as the Global Ecosystem Dynamics Investigation lidar (GEDI), the ESA BIOMASS mission, the NASA-ISRO Synthetic Aperture Radar (NISAR) mission, or the proposed Tandem-L mission (Moreira et al., 2015) will have the potential to provide non-stationary values of within-pixel variability for all regions of the Amazon. Thus, it is desirable to include regionally or locally specific estimates of σε2 in our analyses, which could be derived, for example, from the abovementioned remote-sensing data or from individual tree-based high-resolution simulations (e.g. Rödig et al., 2017). In any case, we conclude that upcoming model–data comparison studies should at least account for stationary within-pixel variability when comparing simulated spatial data to data from discrete observational networks.

Code and data availability

All models are described in more detail in the Supplement. The model code for LPJmL is available at https://github.com/PIK-LPJmL/LPJmL (last access: 17 December 2018) and archived under https://doi.org/10.5880/pik.2018.002. The model code for ORCHIDEE is available at https://doi.org/10.14768/06337394-73A9-407C-9997-0E380DAC5597. The model code for JULES is available from the JULES FCM repository: https://code.metoffice.gov.uk/trac/jules (registration required, last access: 17 December 2018). The model code for INLAND is available at http://www.ccst.inpe.br/wp-content/uploads/inland/inland2.0.tar.gz (last access: 17 December 2018). The permanent archive of the observational data from Mitchard et al. (2014) can be accessed at https://doi.org/10.5521/FORESTPLOTS.NET/2014_1, see also Lopez-Gonzales et al. (2014). The inventory data from Brienen et al. (2015) are available at https://doi.org/10.5521/ForestPlots.net/2014_4, see also Brienen et al. (2014)..

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/gmd-11-5203-2018-supplement.

Author contributions

AR and JH conceived the ideas and designed methodology, and contributed equally to the paper; AR, JH, ER, PP and CZ analysed the data; FL, KT, MG, CR, BC, GS performed simulation runs; all authors contributed to writing the paper.

Competing interests

The authors declare that they have no conflict of interest.

Acknowledgements

We acknowledge funding from the European Union's Seventh Framework Programme AMAZALERT project (282664), the Helmholtz Alliance “Remote Sensing and Earth System Dynamics”, and the Belmont Forum/BMBF funded project CLIMAX. We acknowledge the efforts of the TEAM, RAINFOR and ATDM projects making the observational datasets available.

This work was supported by the German Research
Foundation (DFG) and the Technical University of Munich (TUM) in the framework of the Open Access Publishing Program.

Edited by: Carlos Sierra
Reviewed by: two anonymous referees

References

Avitabile, V., Herold, M., Heuvelink, G. B. M., Lewis, S. L., Phillips, O. L., Asner, G. P., Armston, J., Ashton, P. S., Banin, L., Bayol, N., Berry, N. J., Boeckx, P., de Jong, B. H. J., DeVries, B., Girardin, C. A. J., Kearsley, E., Lindsell, J. A., Lopez-Gonzalez, G., Lucas, R., Malhi, Y., Morel, A., Mitchard, E. T. A., Nagy, L., Qie, L., Quinones, M. J., Ryan, C. M., Ferry, S. J. W., Sunderland, T., Laurin, G. V., Gatti, R. C., Valentini, R., Verbeeck, H., Wijaya, A., and Willcock, S.: An integrated pan-tropical biomass map using multiple reference datasets, Glob. Change Biol., 22, 1406–1420, https://doi.org/10.1111/gcb.13139, 2016. 

Baccini, A., Goetz, S. J., Walker, W. S., Laporte, N. T., Sun, M., Sulla-Menashe, D., Hackler, J., Beck, P. S. A., Dubayah, R., Friedl, M. A., Samanta, S., and Houghton, R. A.: Estimated carbon dioxide emissions from tropical deforestation improved by carbon-density maps, Nat. Clim. Change, 2, 182–185, https://doi.org/10.1038/nclimate1354, 2012. 

Baccini, A., Walker, W., Carvalho, L., Farina, M., Sulla-Menashe, D., and Houghton, R. A.: Tropical forests are a net carbon source based on aboveground measurements of gain and loss, Science, 358, 230–234, https://doi.org/10.1126/science.aam5962, 2017. 

Baker, T. R., Phillips, O. L., Malhi, Y., Almeida, S., Arroyo, L., Di Fiore, A., Erwin, T., Higuchi, N., Killeen, T. J., Laurance, S. G., Laurance, W. F., Lewis, S. L., Monteagudo, A., Neill, D. A., Vargas, P. N., Pitman, N. C. A., Silva, J. N. M., and Martinez, R. V.: Increasing biomass in Amazonian forest plots, Phil. Trans. R. Soc. B, 359, 353–365, 2004. 

Best, M. J., Pryor, M., Clark, D. B., Rooney, G. G., Essery, R. L. H., Ménard, C. B., Edwards, J. M., Hendry, M. A., Porson, A., Gedney, N., Mercado, L. M., Sitch, S., Blyth, E., Boucher, O., Cox, P. M., Grimmond, C. S. B., and Harding, R. J.: The Joint UK Land Environment Simulator (JULES), model description – Part 1: Energy and water fluxes, Geosci. Model Dev., 4, 677–699, https://doi.org/10.5194/gmd-4-677-2011, 2011. 

Bondeau, A., Smith, P. C., Zaehle, S., Schaphoff, S., Lucht, W., Cramer, W., Gerten, D., Lotze-Campen, H., Müller, C., Reichstein, M., and Smith, B.: Modelling the role of agriculture for the 20th century global terrestrial carbon balance, Glob. Change Biol., 13, 679–706, 2007. 

Brienen, R. J. W., Phillips, O. L., Feldpausch, T. R., Gloor, E., Baker, T. R., Lloyd, J., Lopez-Gonzalez, G., Monteagudo-Mendoza, A., Malhi, Y., Lewis, S. L., Vasquez Martinez, R., Alexiades, M., Alvarez Davila, E., Alvarez-Loayza, P., Andrade, A., Aragao, L. E. O. C., Araujo-Murakami, A., Arets, E. J. M. M., Arroyo, L., Aymard C, G. A., Banki, O. S., Baraloto, C., Barroso, J., Bonal, D., Boot, R. G. A., Camargo, J. L. C., Castilho, C. V., Chama, V., Chao, K. J., Chave, J., Comiskey, J. A., Cornejo Valverde, F., da Costa, L., de Oliveira, E. A., Di Fiore, A., Erwin, T. L., Fauset, S., Forsthofer, M., Galbraith, D. R., Grahame, E. S., Groot, N., Herault, B., Higuchi, N., Honorio Coronado, E. N., Keeling, H., Killeen, T. J., Laurance, W. F., Laurance, S., Licona, J., Magnussen, W. E., Marimon, B. S., Marimon-Junior, B. H., Mendoza, C., Neill, D. A., Nogueira, E. M., Nunez, P., Pallqui Camacho, N. C., Parada, A., Pardo-Molina, G., Peacock, J., Pena-Claros, M., Pickavance, G. C., Pitman, N. C. A., Poorter, L., Prieto, A., Quesada, C. A., Ramirez, F., Ramirez-Angulo, H., Restrepo, Z., Roopsind, A., Rudas, A., Salomao, R. P., Schwarz, M., Silva, N., Silva-Espejo, J. E., Silveira, M., Stropp, J., Talbot, J., ter Steege, H., Teran-Aguilar, J., Terborgh, J., Thomas-Caesar, R., Toledo, M., Torello-Raventos, M., Umetsu, R. K., van der Heijden, G. M. F., van der Hout, P., Guimaraes Vieira, I. C., Vieira, S. A., Vilanova, E., Vos, V. A., and Zagt, R. J.: Plot Data from: “Long-term decline of the Amazon carbon sink”, ForestPlots.NET, https://doi.org/10.5521/ForestPlots.net/2014_4, 2014. 

Brienen, R. J. W., Phillips, O. L., Feldpausch, T. R., Gloor, E., Baker, T. R., Lloyd, J., Lopez-Gonzalez, G., Monteagudo-Mendoza, A., Malhi, Y., Lewis, S. L., Vasquez Martinez, R., Alexiades, M., Alvarez Davila, E., Alvarez-Loayza, P., Andrade, A., Aragao, L. E. O. C., Araujo-Murakami, A., Arets, E. J. M. M., Arroyo, L., Aymard C, G. A., Banki, O. S., Baraloto, C., Barroso, J., Bonal, D., Boot, R. G. A., Camargo, J. L. C., Castilho, C. V., Chama, V., Chao, K. J., Chave, J., Comiskey, J. A., Cornejo Valverde, F., da Costa, L., de Oliveira, E. A., Di Fiore, A., Erwin, T. L., Fauset, S., Forsthofer, M., Galbraith, D. R., Grahame, E. S., Groot, N., Herault, B., Higuchi, N., Honorio Coronado, E. N., Keeling, H., Killeen, T. J., Laurance, W. F., Laurance, S., Licona, J., Magnussen, W. E., Marimon, B. S., Marimon-Junior, B. H., Mendoza, C., Neill, D. A., Nogueira, E. M., Nunez, P., Pallqui Camacho, N. C., Parada, A., Pardo-Molina, G., Peacock, J., Pena-Claros, M., Pickavance, G. C., Pitman, N. C. A., Poorter, L., Prieto, A., Quesada, C. A., Ramirez, F., Ramirez-Angulo, H., Restrepo, Z., Roopsind, A., Rudas, A., Salomao, R. P., Schwarz, M., Silva, N., Silva-Espejo, J. E., Silveira, M., Stropp, J., Talbot, J., ter Steege, H., Teran-Aguilar, J., Terborgh, J., Thomas-Caesar, R., Toledo, M., Torello-Raventos, M., Umetsu, R. K., van der Heijden, G. M. F., van der Hout, P., Guimaraes Vieira, I. C., Vieira, S. A., Vilanova, E., Vos, V. A., and Zagt, R. J.: Long-term decline of the Amazon carbon sink, Nature, 519, 344–348, https://doi.org/10.1038/nature14283, 2015. 

Castanho, A. D. A., Coe, M. T., Costa, M. H., Malhi, Y., Galbraith, D., and Quesada, C. A.: Improving simulated Amazon forest biomass and productivity by including spatial variation in biophysical parameters, Biogeosciences, 10, 2255–2272, https://doi.org/10.5194/bg-10-2255-2013, 2013. 

Chambers, J. Q., Negron-Juarez, R. I., Magnabosco Marrac, D. M., Di Vittorioa, A., Tewse, J., Roberts, D., Ribeiro, G. H. P. M., Trumbore, S. E., and Higuchi, N.: The steady-state mosaic of disturbance and succession across an old-growth Central Amazon forest landscape, P. Natl. Acad. Sci. USA, 110, 3949–3954, 2013. 

Chave, J., Condit, R., Aguilar, S., Hernandez, A., Lao, S., and Perez, R.: Error propagation and scaling for tropical forest biomass estimates, Philos. T. Roy. Soc. B, 359, 409–420, https://doi.org/10.1098/rstb.2003.1425, 2004. 

Chave, J., Andalo, C., Brown, S., Cairns, M. A., Chambers, J. Q., Eamus, D., Fölster, H., Fromard, F., Higuchi, N., Kira, T., Lescure, J. P., Nelson, B. W., Ogawa, H., Puig, H., Riéra, B., and Yamakura, T.: Tree allometry and improved estimation of carbon stocks and balance in tropical forests, Oecologia, 145, 87–99, 2005. 

Chave, J., Muller-Landau, H. C., Baker, T. R., Easdale, T. A., Steege, H. T., and Webb, C. O.: Regional And Phylogenetic Variation Of Wood Density Across 2456 Neotropical Tree Species, Ecol. Appl., 16, 2356–2367, https://doi.org/10.1890/1051-0761(2006)016[2356:RAPVOW]2.0.CO;2, 2006. 

Chave, J., Coomes, D., Jansen, S., Lewis, S. L., Swenson, N. G., and Zanne, A. E.: Towards a worldwide wood economics spectrum, Ecol. Lett., 12, 351–366, 2009. 

Chave, J., Réjou-Méchain, M., Búrquez, A., Chidumayo, E., Colgan, M. S., Delitti, W. B. C., Duque, A., Eid, T., Fearnside, P. M., Goodman, R. C., Henry, M., Martínez-Yrízar, A., Mugasha, W. A., Muller-Landau, H. C., Mencuccini, M., Nelson, B. W., Ngomanda, A., Nogueira, E. M., Ortiz-Malavassi, E., Pélissier, R., Ploton, P., Ryan, C. M., Saldarriaga, J. G., and Vieilledent, G.: Improved allometric models to estimate the aboveground biomass of tropical trees, Glob. Change Biol., 20, 3177–3190, https://doi.org/10.1111/gcb.12629, 2014. 

Clark, D. B., Mercado, L. M., Sitch, S., Jones, C. D., Gedney, N., Best, M. J., Pryor, M., Rooney, G. G., Essery, R. L. H., Blyth, E., Boucher, O., Harding, R. J., Huntingford, C., and Cox, P. M.: The Joint UK Land Environment Simulator (JULES), model description – Part 2: Carbon fluxes and vegetation dynamics, Geosci. Model Dev., 4, 701–722, https://doi.org/10.5194/gmd-4-701-2011, 2011. 

Cramer, W., Bondeau, A., Schaphoff, S., Lucht, W., Smith, B., and Sitch, S.: Tropical forests and the global carbon cycle: Impacts of atmospheric carbon dioxide, climate change and rate of deforestation, Philos. T. R. Soc. Lond., 359, 331–343, 2004. 

Davidson, E. A., de Araujo, A. C., Artaxo, P., Balch, J. K., Brown, F., Bustamante, M. M. C., Coe, M. T., DeFries, R. S., Keller, M., Longo, M., Munger, J. W., Schroeder, W., Soares-Filho, B. S., Souza Jr, C. M., and Wofsy, S. C.: The Amazon basin in transition, Nature, 481, 321–328, 2012. 

Delbart, N., Ciais, P., Chave, J., Viovy, N., Malhi, Y., and Le Toan, T.: Mortality as a key driver of the spatial distribution of aboveground biomass in Amazonian forest: results from a dynamic vegetation model, Biogeosciences, 7, 3027–3039, https://doi.org/10.5194/bg-7-3027-2010, 2010. 

Espírito-Santo, F. D. B., Gloor, M., Keller, M., Malhi, Y., Saatchi, S., Nelson, B., Junior, R. C. O., Pereira, C., Lloyd, J., Frolking, S., Palace, M., Shimabukuro, Y. E., Duarte, V., Mendoza, A. M., López-González, G., Baker, T. R., Feldpausch, T. R., Brienen, R. J. W., Asner, G. P., Boyd, D. S., and Phillips, O. L.: Size and frequency of natural forest disturbances and the Amazon forest carbon balance, Nat. Commun., 5, 3434, https://doi.org/10.1038/ncomms4434, 2014. 

Fayad, I., Baghdadi, N., Guitet, S., Bailly, J.-S., Hérault, B., Gond, V., El Hajj, M., and Tong Minh, D. H.: Aboveground biomass mapping in French Guiana by combining remote sensing, forest inventories and environmental data, Int. J. Appl. Earth Obs., 52, 502–514, https://doi.org/10.1016/j.jag.2016.07.015, 2016. 

Feldpausch, T. R., Lloyd, J., Lewis, S. L., Brienen, R. J. W., Gloor, M., Monteagudo Mendoza, A., Lopez-Gonzalez, G., Banin, L., Abu Salim, K., Affum-Baffoe, K., Alexiades, M., Almeida, S., Amaral, I., Andrade, A., Aragão, L. E. O. C., Araujo Murakami, A., Arets, E. J. M. M., Arroyo, L., Aymard C., G. A., Baker, T. R., Bánki, O. S., Berry, N. J., Cardozo, N., Chave, J., Comiskey, J. A., Alvarez, E., de Oliveira, A., Di Fiore, A., Djagbletey, G., Domingues, T. F., Erwin, T. L., Fearnside, P. M., França, M. B., Freitas, M. A., Higuchi, N., E. Honorio C., Iida, Y., Jiménez, E., Kassim, A. R., Killeen, T. J., Laurance, W. F., Lovett, J. C., Malhi, Y., Marimon, B. S., Marimon-Junior, B. H., Lenza, E., Marshall, A. R., Mendoza, C., Metcalfe, D. J., Mitchard, E. T. A., Neill, D. A., Nelson, B. W., Nilus, R., Nogueira, E. M., Parada, A., Peh, K. S.-H., Pena Cruz, A., Peñuela, M. C., Pitman, N. C. A., Prieto, A., Quesada, C. A., Ramírez, F., Ramírez-Angulo, H., Reitsma, J. M., Rudas, A., Saiz, G., Salomão, R. P., Schwarz, M., Silva, N., Silva-Espejo, J. E., Silveira, M., Sonké, B., Stropp, J., Taedoumg, H. E., Tan, S., ter Steege, H., Terborgh, J., Torello-Raventos, M., van der Heijden, G. M. F., Vásquez, R., Vilanova, E., Vos, V. A., White, L., Willcock, S., Woell, H., and Phillips, O. L.: Tree height integrated into pantropical forest biomass estimates, Biogeosciences, 9, 3381–3403, https://doi.org/10.5194/bg-9-3381-2012, 2012. 

Gerten, D., Schaphoff, S., Haberlandt, U., Lucht, W., and Sitch, S.: Terrestrial vegetation and water balance – hydrological evaluation of a dynamic global vegetation model, J. Hydrol., 286, 249–270, 2004. 

Harris, N. L., Brown, S., Hagen, S. C., Saatchi, S. S., Petrova, S., Salas, W., Hansen, M. C., Potapov, P. V., and Lotsch, A.: Baseline Map of Carbon Emissions from Deforestation in Tropical Regions, Science, 336, 1573–1576, https://doi.org/10.1126/science.1217962, 2012. 

Houghton, R. A., House, J. I., Pongratz, J., van der Werf, G. R., DeFries, R. S., Hansen, M. C., Le Quéré, C., and Ramankutty, N.: Carbon emissions from land use and land-cover change, Biogeosciences, 9, 5125–5142, https://doi.org/10.5194/bg-9-5125-2012, 2012. 

John, R., Dalling, J. W., Harms, K. E., Yavitt, J. B., Stallard, R. F., Mirabello, M., Hubbell, S. P., Valencia, R., Navarrete, H., Vallejo, M., and Foster, R. B.: Soil nutrients influence spatial distributions of tropical tree species, P. Natl. Acad. Sci., 104, 864–869, 2007. 

Johnson, M. O., Galbraith, D., Gloor, M., et al.: Variation in stem mortality rates determines patterns of above-ground biomass in Amazonian forests: implications for dynamic global vegetation models, Glob. Change Biol., 22, 3996–4013, https://doi.org/10.1111/gcb.13315, 2016. 

Krinner, G., Viovy, N., de Noblet-Ducoudre, N., Ogee, J., Polcher, J., Friedlingstein, P., Ciais, P., Sitch, S., and Prentice, I. C.: A dynamic global vegetation model for studies of the coupled atmosphere biosphere system, Global Biogeochem. Cy., 19, GB1015, https://doi.org/10.1029/2003GB002199, 2005. 

Kucharik, C. J., Foley, J. A., Delire, C., Fisher, V. A., Coe, M. T., Lenters, J. D., Young-Molling, C., and Ramankutty, N.: Testing the performance of a dynamic global ecosystem model: water balance, carbon balance, and vegetation structure, Global Biogeochem. Cy., 14, 795–825, 2000. 

Langner, A., Achard, F., and Grassi, G.: Can recent pan-tropical biomass maps be used to derive alternative Tier 1 values for reporting REDD+ activities under UNFCCC?, Environ. Res. Lett., 9, 124008, https://doi.org/10.1088/1748-9326/9/12/124008, 2014. 

Lewis, S. L., Brando, P. M., Phillips, O. L., van der Heijden, G. M. F., and Nepstad, D.: The 2010 Amazon Drought, Science, 331, 554–554, https://doi.org/10.1126/science.1200807, 2011. 

Lopez-Gonzalez, G., Lewis, S., Burkitt, M., and Phillips, O.: ForestPlots.net: a web application and research tool to manage and analyse tropical forest plot data, J. Veg. Sci., 22, 610–613, 2011. 

Lopez-Gonzalez, G., Mitchard, E., Feldpausch, T., Brienen, R., Monteagudo, A., Baker, T., Lewis, S., Lloyd, J., Quesada, C., Gloor, E., ter Steege, H., Meir, P., Alvarez, E., Araujo-Murakami, A., Aragao, L., Arroyo, L., Aymard, G., Banki, O., Bonal, D., Brown, S., Brown, F., Ceron, C., Chama Moscoso, V., Chave, J., Comiskey, J., Cornejo, F., Corrales Medina, M., Da Costa, L., Costa, F., Di Fiore, A., Domingues, T., Erwin, T., Fredericksen, T., Higuchi, N., Honorio Coronado, E., Killeen, T., Laurance, W., Levis, C., Magnusson, W., Marimon, B., Marimon-Junior, B., Mendoza Polo, I., Mishra, P., Nascimento, M., Neill, D., Nunez Vargas, M., Palacios, W., Parada-Gutierrez, A., Pardo Molina, G., Pena-Claros, M., Pitman, N., Peres, C., Poorter, L., Prieto, A., Ramirez-Angulo, H., Restrepo Correa, Z., Roopsind, A., Roucoux, K., Rudas, A., Salomao, R., Schietti, J., Silveira, M., De Souza, P., Steiniger, M., Stropp, J., Terborgh, J., Thomas, R., Toledo, M., Torres-Lezama, A., Van Andel, T., van der Heijden, G., Vieira, I., Vieira, S., Vilanova-Torre, E., Vos, V., Wang, O., Zartman, C., de Oliveira, E., Morandi, P., Malhi, Y., and Phillips, O.: Amazon forest biomass measured in inventory plots. Plot Data from “Markedly divergent estimates of Amazon forest carbon density from ground plots and satellites”, ForestPlots.net, https://doi.org/10.5521/FORESTPLOTS.NET/2014_1, 2014. 

Malhi, Y., Wood, D., Baker, T. R., Wright, J., Phillips, O. L., Cochrane, T., Meir, P., Chave, J., Almeida, S., Arroyo, L., Higuchi, N., Killeen, T. J., Laurance, S. G., Laurance, W. F., Lewis, S. L., Monteagudo, A., Neill, D. A., Nunez Vargas, P., Pitman, N. C. A., Quesada, C. A., Salamao, R., Silva, J. N. M., Torres-Lezama, A., Terborgh, J., Vasquez Martinez, R., and Vinceti, B.: The regional variation of aboveground live biomass in old-growth Amazonian forests, Glob. Change Biol., 12, 1107–1138, 2006. 

Marvin, D. C., Asner, G. P., Knapp, D. E., Anderson, C. B., Martin, R. E., Sinca, F., and Tupayachi, R.: Amazonian landscapes and the bias in field studies of forest structure and biomass, P. Natl. Acad. Sci. USA, 111, E5224–E5232, https://doi.org/10.1073/pnas.1412999111, 2014. 

Meir, P., Mencuccini, M., and Dewar, R. C.: Drought-related tree mortality: addressing the gaps in understanding and prediction, New Phytol., 207, 28–33, https://doi.org/10.1111/nph.13382, 2015. 

Mitchard, E. T. A., Feldpausch, T. R., Brienen, R. J. W., Lopez-Gonzalez, G., Monteagudo, A., Baker, T. R., Lewis, S. L., Lloyd, J., Quesada, C. A., Gloor, M., ter Steege, H., Meir, P., Alvarez, E., Araujo-Murakami, A., Aragao, L. E. O. C., Arroyo, L., Aymard, G., Banki, O., Bonal, D., Brown, S., Brown, F. I., Ceron, C. E., Chama Moscoso, V., Chave, J., Comiskey, J. A., Cornejo, F., Corrales Medina, M., Da Costa, L., Costa, F. R. C., Di Fiore, A., Domingues, T. F., Erwin, T. L., Frederickson, T., Higuchi, N., Honorio Coronado, E. N., Killeen, T. J., Laurance, W. F., Levis, C., Magnusson, W. E., Marimon, B. S., Marimon Junior, B. H., Mendoza Polo, I., Mishra, P., Nascimento, M. T., Neill, D., Nunez Vargas, M. P., Palacios, W. A., Parada, A., Pardo Molina, G., Pena-Claros, M., Pitman, N., Peres, C. A., Poorter, L., Prieto, A., Ramirez-Angulo, H., Restrepo Correa, Z., Roopsind, A., Roucoux, K. H., Rudas, A., Salomao, R. P., Schietti, J., Silveira, M., de Souza, P. F., Steininger, M. K., Stropp, J., Terborgh, J., Thomas, R., Toledo, M., Torres-Lezama, A., van Andel, T. R., van der Heijden, G. M. F., Vieira, I. C. G., Vieira, S., Vilanova-Torre, E., Vos, V. A., Wang, O., Zartman, C. E., Malhi, Y., and Phillips, O. L.: Markedly divergent estimates of Amazon forest carbon density from ground plots and satellites, Glob. Ecol. Biogeogr., 23, 935–946, https://doi.org/10.1111/geb.12168, 2014. 

Moreira, A., Krieger, G., Hajnsek, I., Papathanassiou, K., Younis, M., Lopez-Dekker, P., Huber, S., Villano, M., Pardini, M., Eineder, M., De Zan, F., and Parizzi, A.: Tandem-L: A Highly Innovative Bistatic SAR Mission for Global Observation of Dynamic Processes on the Earth's Surface, IEEE Geosci. Remote Sens., 3, 8–23, https://doi.org/10.1109/MGRS.2015.2437353, 2015. 

Pillet, M., Joetzjer, E., Belmin, C., Chave, J., Ciais, P., Dourdain, A., Evans, M., Hérault, B., Luyssaert, S., Poulter, B., and Zhou, S.: Disentangling competitive vs. climatic drivers of tropical forest mortality, J. Ecol., 106, 1165–1179, https://doi.org/10.1111/1365-2745.12876, 2018. 

Politis, D. N. and Romano, J. P.: The stationary bootstrap, J. Am. Stat. Assoc., 89, 1303–1313, 1994. 

Quesada, C. A., Phillips, O. L., Schwarz, M., Czimczik, C. I., Baker, T. R., Patiño, S., Fyllas, N. M., Hodnett, M. G., Herrera, R., Almeida, S., Alvarez Dávila, E., Arneth, A., Arroyo, L., Chao, K. J., Dezzeo, N., Erwin, T., di Fiore, A., Higuchi, N., Honorio Coronado, E., Jimenez, E. M., Killeen, T., Lezama, A. T., Lloyd, G., López-González, G., Luizão, F. J., Malhi, Y., Monteagudo, A., Neill, D. A., Núñez Vargas, P., Paiva, R., Peacock, J., Peñuela, M. C., Peña Cruz, A., Pitman, N., Priante Filho, N., Prieto, A., Ramírez, H., Rudas, A., Salomão, R., Santos, A. J. B., Schmerler, J., Silva, N., Silveira, M., Vásquez, R., Vieira, I., Terborgh, J., and Lloyd, J.: Basin-wide variations in Amazon forest structure and function are mediated by both soils and climate, Biogeosciences, 9, 2203–2246, https://doi.org/10.5194/bg-9-2203-2012, 2012. 

Réjou-Méchain, M., Tanguy, A., Piponiot, C., Chave, J., and Hérault, B.: biomass: an r package for estimating above-ground biomass and its uncertainty in tropical forests, Methods Ecol. Evol., 8, 1163–1167, https://doi.org/10.1111/2041-210X.12753, 2017. 

Rödig, E., Cuntz, M., Heinke, J., Rammig, A., and Huth, A.: Spatial heterogeneity of biomass and forest structure of the Amazon rain forest: Linking remote sensing, forest modelling and field inventory, Glob. Ecol. Biogeogr., 26, 1292–1302, 2017. 

Rödig, E., Cuntz, M., Rammig, A., Fischer, R., Taubert, F., and Huth, A.: The importance of forest structure for carbon fluxes of the Amazon rainforest, Environ. Res. Lett., 13, 054013, https://doi.org/10.1088/1748-9326/aabc61, 2018. 

Rowland, L., da Costa, A. C. L., Galbraith, D. R., Oliveira, R. S., Binks, O. J., Oliveira, A. A. R., Pullen, A. M., Doughty, C. E., Metcalfe, D. B., Vasconcelos, S. S., Ferreira, L. V., Malhi, Y., Grace, J., Mencuccini, M., and Meir, P.: Death from drought in tropical forests is triggered by hydraulics not carbon starvation, Nature, 528, 119–122, https://doi.org/10.1038/nature15539, 2015. 

Saatchi, S. S., Harris, N. L., Brown, S., Lefsky, M., Mitchard, E. T. A., Salas, W., Zutta, B. R., Buermann, W., Lewis, S. L., Hagen, S., Petrova, S., White, L., Silman, M., and Morel, A.: Benchmark map of forest carbon stocks in tropical regions across three continents, P. Natl. Acad. Sci. USA, 108, 9899–9904, https://doi.org/10.1073/pnas.1019576108, 2011. 

Saatchi, S., Mascaro, J., Xu, L., Keller, M., Yang, Y., Duffy, P., Espirito-Santo, F., Baccini, A., Chambers, J. Q., and Schimel, D. S.: Seeing the forest beyond the trees, Glob. Ecol. Biogeogr., 24, 606–610, https://doi.org/10.1111/geb.12256, 2015. 

Sheffield, J., Goteti, G., and Wood, E. F.: Development of a 50-Year High-Resolution Global Dataset of Meteorological Forcings for Land Surface Modeling, J. Climate, 19, 3088–3111, https://doi.org/10.1175/jcli3790.1, 2006. 

Sitch, S., Smith, B., Prentice, I. C., Arneth, A., Bondeau, A., Cramer, W., Kaplans, J. O., Levis, S., Lucht, W., Sykes, M. T., Thonicke, K., and Venevsky, S.: Evaluation of ecosystem dynamics, plant geography and terrestrial carbon cycling in the LPJ dynamic global vegetation model, Glob. Change Biol., 9, 161–185, 2003. 

Sitch, S., Huntingford, C., Gedney, N., Levy, P. E., Lomass, M., Piao, S., Betts, R. A., Ciais, P., Cox, P. M., Friedlingstein, P., Jones, C. D., Prentice, I. C., and Woodward, F. I.: Evaluation of the terrestrial carbon cycle, future plant geography and climate-carbon cycle feedbacks usin five Dynamic Global Vegetation Models (DGVMs), Glob. Change Biol., 14, 1–25, 2008. 

Song, X.-P., Huang, C., Saatchi, S., Hansen, M., and Townshend, J.: Annual Carbon Emissions from Deforestation in the Amazon Basin between 2000 and 2010, PLoS ONE, 10, e0126754, https://doi.org/10.1371/journal.pone.0126754, 2015. 

Turner, M. G., Gardner, R. H., and O'Neill, R. V. (Eds.): Landscape Ecology in theory and praxis, Springer-Verlag, New York, 401 pp., 2001. 

Xu, L., Saatchi, S. S., Shapiro, A., Meyer, V., Ferraz, A., Yang, Y., Bastin, J.-F., Banks, N., Boeckx, P., Verbeeck, H., Lewis, S. L., Muanza, E. T., Bongwele, E., Kayembe, F., Mbenza, D., Kalau, L., Mukendi, F., Ilunga, F., and Ebuta, D.: Spatial Distribution of Carbon Stored in Forests of the Democratic Republic of Congo, Sci. Rep., 7, 15030, https://doi.org/10.1038/s41598-017-15050-z, 2017. 

Zanne, A. E., Zanne, A. E., Lopez-Gonzalez, G., Coomes, D. A., Ilic, J., Jansen, S., Lewis, S. L., Miller, R. B., Swenson, N. G., Wiemann, M. C., and Chave, J.: Global wood density database, https://doi.org/10.5061/dryad.234, 2009. 

Zhang, K., de Almeida Castanho, A. D., Galbraith, D. R., Moghim, S., Levine, N. M., Bras, R. L., Coe, M. T., Costa, M. H., Malhi, Y., Longo, M., Knox, R. G., McKnight, S., Wang, J., and Moorcroft, P. R.: The fate of Amazonian ecosystems over the coming century arising from changes in climate, atmospheric CO2, and land use, Glob. Change Biol., 21, 2569–2587, https://doi.org/10.1111/gcb.12903, 2015. 

Download
Short summary
We propose a generic approach for a pixel-to-point comparison applicable for evaluation of models and remote-sensing products. We provide statistical measures accounting for the uncertainty in ecosystem variables. We demonstrate our approach by comparing simulated values of aboveground biomass, woody productivity and residence time of woody biomass from four dynamic global vegetation models (DGVMs) with measured inventory data from permanent plots in the Amazon rainforest.