Assessment of gap-filling techniques applied to satellite phytoplankton composition products for the Atlantic Ocean

Mehdipour, Ehsan; Xi, Hongyan; Barth, Alexander; Alvera-Azcárate, Aida; Wilhelm, Adalbert; Bracher, Astrid

doi:10.5194/gmd-19-1619-2026

Articles | Volume 19, issue 4

https://doi.org/10.5194/gmd-19-1619-2026

Articles | Volume 19, issue 4

Model evaluation paper

26 Feb 2026

Model evaluation paper |

| 26 Feb 2026

Assessment of gap-filling techniques applied to satellite phytoplankton composition products for the Atlantic Ocean

Ehsan Mehdipour, Hongyan Xi, Alexander Barth, Aida Alvera-Azcárate, Adalbert Wilhelm, and Astrid Bracher

Abstract

Phytoplankton are vital to marine biogeochemical cycles and form the base of the marine food web. Comprehensive datasets offering a spatiotemporal perspective on phytoplankton composition are essential for assessing the impacts of climate change on marine ecosystems. Phytoplankton functional types (PFTs) classify phytoplankton based on their biogeochemical functions, enabling assessments of nutrient cycling, primary productivity, and ecosystem structure. However, satellite-derived ocean colour products like PFTs chlorophyll a (Chl a) concentrations are challenged by limited temporal and spatial coverage due to the exclusion of data collected under non-optimal observing conditions, such as strong sun glint, clouds, thick aerosols, straylight, and large viewing angles or due to the specific sensor configuration and sensor malfunction. This highlights the importance of gap-filling techniques for producing consistent datasets, which are currently missing for operational data sets. This study evaluates two robust gap-filling methods for satellite observations: Data Interpolating Empirical Orthogonal Functions (DINEOF) and Data Interpolating Convolutional Auto Encoder (DINCAE). These methods were applied to Sentinel 3A/B OLCI-derived Chl a concentration products in several regions of the Atlantic Ocean over three years of data, including total Chl a (TChl a) and Chl a concentration of five major PFTs, namely diatoms, dinoflagellates, haptophytes, green algae, and prokaryotic phytoplankton. The reconstructed datasets were assessed using test dataset evaluation and validated with in situ measurements collected during the transatlantic RV Polarstern expedition PS113 in 2018. The test dataset evaluation indicates that DINCAE performs slightly better than DINEOF in representing transient-scale variability, particularly within highly dynamic regions. DINCAE achieves an average root-mean-square-logarithmic-error (RMSLE) in cross-validation that is 66 % lower for TChl a and 16 % lower for PFTs compared to DINEOF. However, external validation using in situ measurements indicates better performance for DINEOF than DINCAE, with improved regression metrics for PFTs, including a 12.5 % better slope, 13.6 % better intercept, and 68 % higher coefficient of determination (R²). The gap-filled datasets exhibit slightly reduced but still robust accuracy compared to the original satellite data while preserving statistical trends, improving spatial structure restoration, and increasing matchup data for validation. It is concluded that DINCAE and DINEOF each have unique strengths for gap-filling ocean colour products. DINCAE performs well in complex water bodies, effectively reproducing patterns from the original satellite product. In contrast, DINEOF shows higher overall reliability, supported by independent validation, and is better suited for larger areas due to its lower computational demands.

Download & links

Article (PDF, 8125 KB)

Supplement (1668 KB)

Download & links

How to cite.

Received: 10 Jan 2025 – Discussion started: 12 Mar 2025 – Revised: 03 Nov 2025 – Accepted: 16 Jan 2026 – Published: 26 Feb 2026

1 Introduction

Phytoplankton are fundamental to marine biogeochemical cycles and ecosystems, contributing approximately 50 % of global primary production and providing over 90 % of the nutritional requirements for higher trophic levels within marine ecosystems (Field et al., 1998). Understanding the spatiotemporal distribution and composition of phytoplankton is crucial for assessing the impacts of climate change on ocean biogeochemistry, the marine food web, and the feedback mechanisms influencing oceanic and atmospheric processes (Fennel et al., 2019). Ocean colour remote sensing has significantly advanced our understanding of marine processes by providing continuous global data on surface chlorophyll a (Chl a) concentrations, a key indicator of phytoplankton biomass (Sathyendranath et al., 2019), which is widely used to monitor growth and blooms in aquatic ecosystems (Blondeau-Patissier et al., 2014; Huot et al., 2007). Despite its widespread use, Total Chl a (TChl a) provides a limited perspective since it does not capture the diversity and variability of the planktonic community structure. Phytoplankton functional types (PFTs) are typically defined as groups of organisms linked by shared biogeochemical processes, such as silicification, calcification, and nitrogen fixation, though they may not be phylogenetically related (Falkowski et al., 2003; IOCCG, 2014; Litchman et al., 2006). Since many phytoplankton groups that are identifiable through remote sensing also function as PFTs (Bracher et al., 2017), these satellite-detected proxies are often referred to as PFTs for simplicity (e.g., Losa et al., 2017).

Satellite-derived PFT Chl a data are among the most effective datasets for investigating long-term variability in phytoplankton communities (e.g., Xi et al. 2025). Daily satellite PFT Chl a concentration datasets are currently unavailable due to a variety of reasons. First, significant gaps in satellite data exist due to factors such as cloud cover, observation geometry hindering successful retrievals, and sensor-specific limited spatiotemporal coverage. Second, closing these gaps necessitates advanced computational techniques, which are often resource-intensive and difficult to implement effectively, especially for large-scale datasets. Finally, the scarcity and uneven distribution of in situ measurements, which are critical for ensuring the accuracy and reliability of gap-filled data, limit the validation of reconstructed datasets. These challenges make it difficult to generate consistent, high-quality gap-filled PFT Chl a concentration datasets.

Several techniques have been developed to address missing data for ocean colour satellite products. Spatial interpolation methods such as Inverse Distance Weighting (IDW), Kriging (e.g., Kostopoulou, 2021), spline interpolation, and nearest neighbour interpolation are used to estimate missing values based on spatial proximity (Li and Heap, 2008, 2014). Temporal interpolation or univariate interpolation methods like linear interpolation, polynomial interpolation, and spline interpolation aim to fill missing values within a time series of observations (Kandasamy et al., 2013; Lepot et al., 2017). Spatiotemporal methods such as Optimal Interpolation (OI) (Hosoda and Sakaida, 2016; Reynolds and Smith, 1994) and data assimilation techniques like the Ensemble Kalman filter (Evensen, 2009; Nerger and Hiller, 2013) integrate spatial and temporal information.

Empirical and statistical methods offer alternative strategies for gap-filling depending on the data characteristics and intended application. Alvera-Azcárate et al. (2005) introduced the Data Interpolating Empirical Orthogonal Functions (DINEOF) method, a self-consistent technique based on EOF. This method is designed to fill in missing data within geophysical datasets using pre-existing spatiotemporal patterns, making it particularly effective for handling situations such as cloud-covered regions in satellite imagery or data interruptions caused by satellite malfunctions (Beckers and Rixen, 2003). This method was further developed to include a multivariate approach using extended EOFs (Alvera-Azcárate et al., 2007) and later improved with Laplacian filtering to reduce spurious variability (Alvera-Azcárate et al., 2009). Stock et al. (2020) compared DINEOF with other gap-filling methods and found it among the best-performing techniques. However, linear approaches can still be limited in capturing complex, non-linear oceanic variability.

Machine learning gap-filling methods such as artificial neural networks (ANN) (Hong et al., 2023; Krasnopolsky et al., 2015), random forests (Park et al., 2019; Stock et al., 2020), and self-organising maps (SOM) (Abdel Latif et al., 2008; Chapman and Charantonis, 2017; Jouini et al., 2013) are designed to learn non-linear patterns within datasets to predict missing values. These techniques present a promising approach for preserving transient-scale structures during data reconstruction due to their ability to handle non-linear relationships and complex interactions. Among these, the Data Interpolating Convolutional Auto-Encoder (DINCAE) method represents a deep learning approach (Barth et al., 2020, 2022). This algorithm uses a neural network with a convolutional auto-encoder structure to reconstruct missing data from satellite observations using available cloud-free pixels while also providing an error estimate for the reconstruction (Barth et al., 2020, 2022).

This study aims to address the abovementioned limitations on generating daily PFTs Chl a concentration dataset by evaluating the performance of gap-filling methods in reconstructing TChl a and the Chl a of the five major PFTs, namely diatoms, dinoflagellates, haptophytes, green algae, and prokaryotes, in the Atlantic Ocean. DINEOF and DINCAE were chosen for this study as gap-filling methods because they are particularly suited to oceanographic datasets, where maintaining spatial and temporal continuity is crucial, and their advanced gap-filling capabilities ensure higher quality and more reliable data reconstruction compared to alternative approaches. These methods surpass traditional interpolation techniques, such as kriging or simple regression, which often struggle to preserve the dynamic consistency of ocean processes. DINEOF reconstructs missing values by extracting dominant modes of variability, making it particularly effective for large-scale oceanographic variables with spatial coherence. This method is actively used for gap-filling ocean colour products within the Copernicus Marine Service, including monthly global TChl a and daily regional products (e.g., Mediterranean and Black Sea) at ∼ 4 km resolution (Volpe et al., 2018). Additionally, it is applied in NOAA CoastWatch multi-sensor global products, such as ∼ 9 km TChl a, SPM, and diffuse attenuation coefficient Kd(490) (Liu and Wang, 2018). Conversely, DINCAE employs a deep learning approach to capture complex non-linear relationships, offering greater flexibility and accuracy, particularly for highly variable data. By incorporating both anomaly estimation and error estimation in its cost function, DINCAE provides reliable performance in both reconstruction and error quantification (Barth et al., 2020, 2022).

In the following sections, we present the materials and methods used in this study. Section 2 covers the data sources, preprocessing steps, and methodologies for DINEOF and DINCAE, including model optimisation and validation metrics. Section 3 presents and analyses the evaluation results, concluding with a discussion of the strengths and limitations of the reconstruction methods.

2 Materials and methods

2.1 Datasets

This study focuses on a corridor of the RV Polarstern PS113 transatlantic expedition (Alfred-Wegener-Institut Helmholtz-Zentrum für Polar- und Meeresforschung, 2017; Strass, 2018), which traversed from the Patagonian shelf to the English Channel between 10 May and 9 June 2018 (Bracher et al., 2020a). This study used datasets from two distinct sources: the ship-borne dataset, which served as the basis for validating and comparing the gap-filling results, and three years of satellite-derived datasets, upon which the gap-filling methods were applied.

2.1.1 In situ dataset

The gap-filled satellite datasets were comprehensively evaluated using in situ measurements. These in situ measurements are based on phytoplankton pigment concentrations measured by high-pressure liquid chromatography (HPLC) and published in Bracher et al. (2020b). 230 surface water samples were gathered during the expedition for subsequent laboratory analysis of the phytoplankton pigment composition. The TChl a was calculated as the sum of several Chl a pigments (monovinyl chlorophyll a, divinyl chlorophyll a, chlorophyll a allomers, chlorophyll a epimers, and chlorophyllide-a). The PFT concentrations were derived using the diagnostic pigment analysis (DPA) method based on Vidussi et al. (2001) to derive phytoplankton size classes and further refined by Hirata et al. (2011) to derive PFTs with updated pigment-specific weighting coefficients following Xi et al. (2023a). This method enables the determination of five PFT concentrations: diatoms, dinoflagellates, haptophytes, green algae, and prokaryotes (Bracher et al., 2020a). The distribution of HPLC TChl a and PFTs is depicted in Fig. S1 in the Supplement.

2.1.2 Satellite dataset

The satellite PFT products were acquired from the Copernicus Marine Services website (https://marine.copernicus.eu, last access: 22 January 2024) (E.U. Copernicus Marine Service Information, Marine Data Store), covering a temporal range of three years from 25 April 2016 to 25 April 2019, and spanning spatially from 64° W to 3° E and 50° S to 52° N. The data are derived using the algorithm developed by Xi et al. (2021, 2020) within the Copernicus Marine Service framework. This algorithm extracts global PFT concentrations from merged ocean colour (OC) products or Sentinel-3 (S3) A/B Ocean and Land Colour Instrument (OLCI) data, using an expanded pigment database to determine PFT-specific coefficients. The algorithm employs EOF decomposition to reduce the dimensionality of remote sensing reflectance spectral signals, followed by a multi-linear regression method to establish the PFTs. The regression coefficients are calibrated using matchups with PFTs derived from in situ HPLC pigment data. Subsequently, the model is applied to the global remote sensing reflectance dataset to generate a global PFT dataset. Currently, this product is used for long-term monitoring of global phytoplankton groups (Xi et al., 2023b, 2025). The product and dataset IDs from Copernicus Marine Service are OCEANCOLOUR_GLO_BGC_L3_MY_009_103 and cmems_obs-oc_glo_bgc-plankton_my_ l3-multi-4km_P1D, respectively. Our study specifically focuses on five PFTs [product names in the dataset]: diatoms [DIATO], dinoflagellates [DINO], green algae [GREEN], haptophytes [HAPTO], and prokaryotes [PROKAR], alongside the Total chlorophyll a [CHL]. To maintain consistency, these product names for TChl a and PFTs are used as representative abbreviation symbols in figures.

The TChl a dataset was produced from merged OC products by integrating data from multi-satellite missions, including SeaWiFS, MERIS, MODIS-A, MODIS-T, VIIRS-SNPP & JPSS-1, and OLCI-S3A & S3B, resulting in a reduced rate of data gaps (average of 52 %) (Fig. 1a, d and f). Conversely, the Copernicus Marine Service's PFT products are derived from a more limited number of sources, OLCI-S3A & S3B, and exhibit a much higher rate of missing data (average of 82 %) compared to the TChl a product (Fig. 1b, e, and f). Notably, the missing data rate for PFTs decreased toward the end of the study period as Sentinel-3B became operational on 25 April 2018, thereby narrowing the gap between satellite tracks (Fig. 1c and f). Spatial analysis of missing data rates (Fig. 1d and e) reveals that the highest rates of missing data for both TChl a and PFTs occur predominantly in tropical and high-latitude regions, where they reach up to 80 %–90 % for TChl a and 90 %–100 % for PFTs. This high rate of missing data is largely attributable to persistent cloud cover in these regions.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f01

Figure 1Sample satellite datasets: (a) TChl a on 1 January 2018, (b) diatoms on 1 January 2018 (including the PS113 expedition track), and (c) diatoms on 1 January 2019 (showing the ten extracted regions of interest, ROIs). Diatom is only used here as a representative example of all PFT products. (d–e) spatial variation of average missing rate for TChl a and PFTs. (f) Temporal variation of average missing data rates for TChl a and PFTs across the Atlantic Ocean. The light grey line represents the missing data rate for TChl a, while the light red line indicates the missing rate for PFTs. Darker lines show the 30 d moving average, and the dotted dark lines denote the overall dataset's average missing rate.

To assess the potential advantages of integrating an auxiliary dataset with no data gaps to enhance the gap-filling process in dataset reconstruction, the OSTIA foundation sea surface temperature (SST) dataset was used. This dataset, developed by GHRSST, the Met Office, and the Copernicus Marine Service (Donlon et al., 2012; Good et al., 2020), is catalogued in the Copernicus Marine Service under product ID SST_GLO_SST_L4_REP_OBSERVATIONS_ 010_011 and dataset ID METOFFICE-GLO-SST-L4-REP-OBS-SST. It is a Level 4 daily product provided at a spatial resolution of 0.05° that was interpolated to the PFT product locations.

2.2 Satellite data preprocessing

Satellite datasets require careful preprocessing before they can be used in gap-filling models. This process involves restructuring and reformatting the data to meet specific model requirements and assumptions, such as the normality of input datasets. These steps are essential for enhancing model performance and ensuring the reliability of the gap-filling results. The preprocessing of satellite data prior to input into the gap-filling models involved the following steps: (1) normalisation of the log-normally distributed Chl a datasets, (2) extraction of regions of interest within the corridor surrounding the PS113 expedition, and (3) partitioning of the dataset into training, validation and test datasets using an artificial cloud mask. Figure 2 illustrates the data processing workflow, providing an overview of the sequential steps and methodologies employed in the analysis.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f02

Figure 2Diagram of the data processing workflow, divided into three sections: Preprocessing, which includes extracting regions of interest (ROIs) and partitioning the dataset into training, validation, and test subsets; Processing, which encompasses model configuration development and gap-filling across all areas; and Postprocessing, which includes validation of the satellite-derived products using the holdout test dataset and in situ measurements.

Download

2.2.1 Normalisation

Phytoplankton Chl a typically follows a log-normal distribution, influenced by the multiplicative effects of environmental factors like light, nutrients, and temperature on their growth (Campbell, 1995). Consequently, employing log-transformed Chl a concentration in modelling proves more effective, as it normalises the skewed distribution, diminishes the impact of outliers, and more accurately reflects the underlying ecological processes. Log-transformation also ensures that gap-filling results in positive Chl a concentrations.

2.2.2 Region of interest (ROI) extraction

Due to the substantial computational demand and inherent scalability limitations of both algorithms, direct reconstruction of the dataset for the whole Atlantic Ocean was deemed unfeasible. Efforts to process the entire region restricted the DINEOF algorithm to approximately four months of data, resulting in fewer EOF extractions and a lower-quality reconstruction. Furthermore, the DINCAE algorithm experienced GPU memory crashes when processing the entire dataset. To address these challenges, similar to Jung et al. (2022), the satellite dataset was divided into smaller, more computationally manageable areas (tiles) focused along a corridor of the expedition track, aiming to generate overlapping regions (1° buffer) and produce a continuous dataset that facilitates meaningful comparisons between the in situ measurements and satellite products, both before and after gap-filling. The challenge was to determine an optimal number of areas that would reduce the computational load while ensuring that each area was sufficiently large to accurately capture the dynamics of the region. The areas were designed to be comparable in size or larger than those used in previous gap-filling studies (Barth et al., 2020, 2022; Han et al., 2020; Jung et al., 2022). Using a k-means clustering algorithm, we classified the location of the research vessel during the PS113 expedition into ten areas, with each area covering at least ten degrees of longitude (Fig. 1c). This strategy ensured some data availability even when gaps between satellite tracks overlapped the area, thereby reducing the number of days without any data in PFT products. Figure S2 illustrates the boundaries of the regions of interest (ROI) overlaid on continental shelves (Flanders Marine Institute, 2023, 2024) and in situ measurements clustered by Bracher et al. (2020a) into the Longhurst biogeochemical provinces (Longhurst, 2010). Table 1 summarises the physical and biogeochemical characteristics of the areas along with the major overlapping Longhurst provinces (Longhurst, 2010) and hierarchical clusters defined by Bracher et al. (2020a).

The original satellite datasets initially spanned 1095 days across all regions. To improve the robustness of the reconstruction, days with less than 2 % data availability were excluded. Area No. 7 experienced the highest proportion of missing data, largely due to its proximity to the West African upwelling region, a zone frequently obscured by cloud cover. As a result, only 885 d of data were retained for area No. 7, while a minimum of 1048 d were preserved for the other regions. Following both gap-filling methods, finally a weighted blending method, commonly referred to as alpha blending or feathering, was employed to merge the areas (Lu et al., 2014; Uyttendaele et al., 2001). This technique ensures a smooth transition in the overlapping areas (Fig. 1c) with the weight coefficient determined by the distance from the stitching borders (Lu et al., 2014).

Table 1Summary of areas' physical and biogeochemical characteristics and major overlapping Longhurst provinces. SWAS for Southwest Atlantic Shelves, BRAZ for Brazilian Current Coast, SATL for South Atlantic Tropical Gyre, WTRA for Western Tropical Atlantic, NATR for North Atlantic Tropical Gyre, CNRY for Canary Current Coast, NASE North Atlantic Subtropical Gyre East, NASE-N for Northern NASE, NADR for North Atlantic Drift and NECS for Northeast Atlantic Shelves.

Download Print Version | Download XLSX

2.2.3 Data partitioning

The data from each area were divided into three distinct datasets: training, validation (development), and test, as illustrated in Fig. 2. The training dataset was used to analyse the satellite data and estimate the internal parameters of each algorithm; specifically, estimating EOF patterns in the DINEOF algorithm and determining the weights in the DINCAE algorithm. The validation dataset served to fine-tune the algorithms for optimal performance and to choose the model architecture, such as identifying the optimal number of EOFs in DINEOF and adjusting the hyperparameters in the DINCAE gap-filling method. The structure with the lowest error on the validation dataset was selected. Following this, the algorithms were retrained using a combination of the training and validation datasets with the optimal configuration to acquire the gap-filled products. The test dataset provided independent data for evaluating both algorithms and comparing their performances. The amount of masked data was similar across groups in each phase. However, differences in data availability resulted in varying percentages of masked data (Fig. 2). Typically, 1 % of the initial dataset was reserved for cross-validation, as noted by Alvera-Azcárate et al. (2007, 2009). Recent studies have employed higher percentages, such as 3 % in Wang et al. (2019), 2 %–3 % in Alvera-Azcárate et al. (2021), 5 % in Liu and Wang (2022), and 3 % in Alvera-Azcárate et al. (2025), aligning with the percentages used in our study. The validation and test datasets were generated using a data partitioning technique similar to that described by Alvera-Azcárate et al. (2009), Barth et al. (2020) and Beckers et al. (2006). In this approach, cloud masks were extracted from cloudy days and subsequently applied to other days, effectively obscuring portions of the dataset for evaluation purposes.

2.3 Gap-filling methods

2.3.1 DINEOF gap-filling

We provide a brief overview of the DINEOF algorithms along with the key modifications applied in this study. More details can be found in Alvera-Azcárate et al. (2009, 2007, 2005) and Beckers and Rixen (2003). The DINEOF approach handles missing data through an iterative process. Initially, the method centres the data by subtracting the mean and replaces the missing values with zero. The EOF is then computed using this updated matrix, and the primary EOF is used to predict the values at the locations where data were initially missing. This iteration continues until the anomalies at the missing values converge to a specified level from one iteration to the next. Once convergence is achieved, the number of calculated EOFs gradually increases, up to a maximum of k_max EOFs, or stops if the cross-validation error rises continuously.

In our study, the input matrix was constructed by combining multiple datasets of TChl a and five PFTs, resulting in the extended matrix X_e shown in Eq. (1). $x_{t}^{g}$ is a column vector containing all the spatial points of TChl a or PFTs Chl a concentration dataset at time step t across T temporal steps. The inclusion of the SST dataset was additionally tested to assess its impact on enhancing pattern recognition and improving reconstruction accuracy.

2.3.2 DINCAE gap-filling

DINCAE's architecture, similar to convolutional autoencoders and U-Net networks (Ronneberger et al., 2015), employs a sequence of encoder-decoder layers to extract significant features from irregular and sparse data through dimensionality reduction, similar to EOF methods (Jung et al., 2022). In this structure, input data is compressed through a bottleneck by the encoder using convolutional and average-pooling layers to reduce resolution, then decompressed by the decoder with convolutional and interpolation layers. Similar to a U-Net, DINCAE optionally incorporates skip connections, which allow some information to bypass the bottleneck and retain more of the original gradient features during reconstruction (Barth et al., 2020, 2022; Jung et al., 2022; Ronneberger et al., 2015). DINCAE employs a non-linear approach to OI using convolutional operations, modelling the neural network output as a Gaussian probability distribution with a predicted mean ${\hat{y}}_{i j}$ and expected error variance ${\hat{σ}}_{i j}^{2}$ for each grid point i, j. The weights and biases of the neural network are optimised to maximise the likelihood of the observed values y_ij. The corresponding cost function is formulated as described in Eq. (2).

\begin{matrix} (2) & J ({\hat{y}}_{i j}, {\hat{σ}}_{i j}^{2}) = \frac{1}{2 N} \sum_{i j} [{(\frac{y_{i j} - {\hat{y}}_{i j}}{{\hat{σ}}_{i j}})}^{2} + \log ({\hat{σ}}_{i j}^{2})] \end{matrix}

where N is the number of non-masked data points in y_ij. The first term of the cost function is related to the mean square error, adjusted by the estimated error standard deviation. The second term penalises overestimation of the error standard deviation. In DINCAE 2.0, a convolutional auto-encoder with refinement, the intermediate results ${\hat{y}}_{i j}$ and ${\hat{σ}}_{i j}^{2}$ are combined with the inputs and processed through another auto-encoder with a similar architecture and independent weights from the initial layer (Barth et al., 2022). The ultimate cost function incorporating refinement, denoted as J_r, is expressed as:

\begin{matrix} (3) & J_{r} = α J ({\hat{y}}_{i j}, {\hat{σ}}_{i j}^{2}) + α^{'} J ({\hat{y}}^{'}_{i j}, {\hat{σ}}^{'}_{i j}^{2}) \end{matrix}

Here, ${\hat{y}}^{'}_{i j}$ and ${\hat{σ}}^{'}_{i j}^{2}$ represent the reconstruction and the expected error variance generated by the second auto-encoder. The weights α and α′ control the relative significance attributed to the intermediate and final outputs within the cost function, and they would be fine-tuned during hyperparameter optimisation. Incorporating a refinement step effectively doubles the neural network's depth and nearly doubles the number of parameters, substantially increasing its complexity. This added depth enhances the network's ability to capture more intricate data patterns and relations but also results in higher computational costs. For more detailed information on the method, refer to Barth et al. (2022, 2020).

The main input variables for the DINCAE model include anomalies in input data and the inverse of error variance spanning consecutive days centred around the reconstruction day. Additionally, spatiotemporal coordinates were incorporated as auxiliary input variables. In our study, the number of input variables expanded due to the adoption of a multivariate approach and the inclusion of all PFT datasets. As outlined in Table 2, for a 3 d timeframe, there are 36 primary variables, with an additional 4 auxiliary variables, resulting in a total of 40 layers. The model's output consists of the reconstructed PFTs and the corresponding expected error variance of the reconstruction for each variable.

Table 2Example summary of input and output variables of DINCAE in reconstructing PFTs for the 3 d time window. Chl a anomalies are the Chl a concentrations of TChl a and PFTs.

Download Print Version | Download XLSX

2.3.3 Model development and hyperparameter optimisation

Hyperparameter optimisation is crucial for enhancing model performance. In contrast to model parameters, hyperparameters are defined prior to training and play a significant role in shaping the model's accuracy and behaviour. Their optimisation is typically conducted during the validation (development) step to ensure optimal model performance. Although DINEOF is inherently a self-consistent algorithm, its temporal Laplacian filtering process requires the optimisation of two key parameters to achieve optimal results. The parameter α governs the filter's intensity, while p denotes the number of iterations during which the filter is applied to the temporal covariance matrix. The effective extent of the Laplacian filter is calculated as $L = 2 π \sqrt{α p}$ . Further information related to the filtering method can be obtained from Alvera-Azcárate et al. (2009). In contrast, DINCAE involves numerous hyperparameters related to the input dataset, deep learning architecture, generalisation, and cost function optimisation. Random search offered an efficient method for hyperparameter optimisation by randomly selecting hyperparameter combinations from a given distribution. We also included a test condition to evaluate the algorithm's performance with and without SST in the dataset. Tables S1 and S2 outline the hyperparameters for DINEOF and DINCAE, respectively, including descriptions, selection ranges, distributions, and selected values.

Training individual models for each area demands significant computational resources. To optimise efficiency, hyperparameters were fine-tuned on a representative area and then generalised to other areas. Area No. 9 was selected for this purpose due to its dynamic features, including freshwater influx from coastal rivers and moderate phytoplankton levels, which enable the algorithm to better capture fluctuations compared to oligotrophic regions. Additionally, area No. 9 has a moderate rate of missing data, unlike areas with severe gaps, such as area No. 7, enhancing the algorithm's robustness. As the largest area, it also ensures that computational demands for subsequent analyses remain manageable. DINEOF computations were executed on a system equipped with AMD Rome Epyc 7702 processors. Each DINEOF training run (a full cycle of model training) during the development phase averaged 17 h, with durations ranging from 12 to 24 h. DINCAE computations were carried out on an Nvidia A100 GPU. On average, each DINCAE training run during the development phase took approximately 15 h, with durations ranging from 6 to 28 h. The computation time for DINCAE is particularly sensitive to the number of time windows and epochs required for model training.

2.4 Validation and evaluation metrics

The model's performance was assessed through holdout validation (termed cross-validation in literature) on the validation dataset during the development and final evaluation of the test dataset, as described in Sect. 2.2 for data partitioning. Additionally, two complementary approaches were employed to evaluate the performance of the gap-filling techniques. First, the gradient field and degree of smoothing were assessed to qualitatively and quantitatively examine the impact of reconstruction on the smoothing of the original satellite dataset. Second, mathematical and statistical metrics derived from the validation regression analysis of matchups between in situ measurements and satellite-derived products were analysed, using in situ measurements described in Sect. 2.1.1.

2.4.1 Performance evaluation

The optimal model during development is identified by the lowest root-mean-square-logarithmic-error (RMSLE) across TChl a and all PFTs (Eq. 4), with no bias towards any specific PFT, ensuring equal contribution in the total RMSLE calculation using the same number of holdout validation points. Model performance is further evaluated by RMSLE, comparing the reconstructed data to the test dataset. While RMSLE provides a useful comparison of different models on a logarithmic scale, it is not easily interpretable. Therefore, relative error metrics, particularly the mean-absolute-percentage-error (MAPE) defined in Eq. (5) are used for a clearer assessment of final model performance. MAPE evaluates the reconstructed Chl a on a linear scale, offering a more intuitive percentage-based interpretation, particularly useful for skewed data distributions. Unlike squared error metrics, MAPE uses absolute percentage values, which reduces the impact of outliers. However, when original values are near zero, percentage errors can become disproportionately large and distort the mean. To avoid this, a small percentage (approximately 0.01 %) of extreme values (i.e., PE > 10 000 %) were excluded from MAPE calculations to obtain a more robust assessment.

\begin{matrix} (4) & RMSLE = \sqrt{\frac{1}{M} \sum_{i = 1}^{M} {(\log_{10} (C_{rec, i}) - \log_{10} (C_{CV, i}))}^{2}} \\ (5) & MAPE = \frac{1}{M} \sum_{i = 1}^{M} |\frac{C_{rec, i} - C_{CV, i}}{C_{CV, i}}| \times 100 % \end{matrix}

where M and C_CV,i refers to the number and values of the holdout validation points (i.e. validation or test dataset) respectively and C_rec,i is the reconstructed value.

2.4.2 Spatial smoothing

The Sobel edge operator is extensively used for edge detection in image fields (Sobel and Feldman, 1968). This operator relies on a 3 × 3 convolution mask, consisting of a horizontal and vertical kernel, uniformly applied across the dataset to effectively extract gradient changes (Vincent and Folorunso, 2009). The magnitude of the gradient field is widely used in oceanography to define water mass boundaries and analyse dynamic changes in oceanic structures, help in understanding the spatial variability and the intensity of currents, fronts, and eddies (Belkin and O'Reilly, 2009; Wang et al., 2021). Although this method is typically used to extract gradients on SST, this study uses the Sobel operator to identify gradients in the Chl a dataset, aiming to qualitatively assess the performance of a gap-filling algorithm in reconstructing these gradients. The magnitude of the gradient field (G) is expressed as follows:

\begin{matrix} (6) & \begin{aligned} G_{x} = [\begin{array}{ccc} - 1 & 0 & + 1 \\ - 2 & 0 & + 2 \\ - 1 & 0 & + 1 \end{array}] * S \cdot {(2 d_{x})}^{- 1} \\ G_{y} = [\begin{array}{ccc} - 1 & - 2 & - 1 \\ 0 & 0 & 0 \\ + 1 & + 2 & + 1 \end{array}] * S \cdot {(2 d_{y})}^{- 1} \\ G = \sqrt{G_{x}^{2} + G_{y}^{2}} \end{aligned} \end{matrix}

In this formula, the symbol * signifies the convolution operator, S represents the source image undergoing processing, and d_x and d_y denote the resolution in the horizontal and vertical dimensions, respectively. These distances are used to normalise the gradient values, accounting for the physical spacing between pixels in each direction. Normalisation may be omitted if there is no need to adjust gradient magnitudes for pixel spacing, particularly when the focus is on visually assessing gradient changes. For this analysis, a pixel size of 4 km was applied, consistent with the nominal resolution of the TChl a and PFT products. In our study, the gradient is expressed in units of mg m⁻³ m⁻¹, which simplifies to mg m⁻⁴.

The degree of smoothing reflects how closely reconstructed data aligns with the original input, with less smoothing indicating better preservation of the original values. This is quantified using the RMSLE, similar to Eq. (4) but applied to the differences between all data from the training and validation datasets and the reconstructed dataset. Here, a lower RMSLE indicates better preservation of the original data's details. However, the degree of smoothing is not an independent validation metric; it only evaluates the ability of reconstruction techniques to transfer input data to the output (Barth et al., 2022). This analysis was performed on a logarithmic scale, consistent with the input and output of the model for performance evaluation.

2.4.3 Independent validation using in situ measurement

For validating satellite products with in situ data, the protocol from Bailey and Werdell (2006) and the guidelines from the EUMETSAT Sentinel-3 OLCI Ocean Colour product Matchup Protocols (EUMETSAT, 2022) are followed. The key steps include:

Pixels are matched based on the in situ data points within the 3 × 3 pixel box and captured on the same day.
A minimum of 50 % +1 of the “valid pixels” within a 3 × 3 pixel box (i.e., at least 5 pixels) is required to retain the matchup.
Pixels with deviations exceeding ±1.5 times the standard deviation are removed as outliers.
Matchups are discarded if the coefficient of variation of the remaining pixels exceeds 0.2.

The comparison between the satellite product and in situ measurements is conducted by evaluating the coefficient of determination (R²), slope, and intercept of the regressions, based on logarithmically scaled satellite (log ₁₀(C_st)) versus logarithmically scaled in situ measurement (log ₁₀(C_in)). Additionally, the median-percentage-deviation (MedPD), root-mean-square-deviation (RMSD), and normalised RMSD (NRMSD) (or relative RMSD) are calculated using linear data. The model performance statistics are presented as follows:

\begin{matrix} (7) & \log_{10} (C_{st, i}) = Intercept + Slope \times \log_{10} (C_{in, i}) + ϵ \\ (8) & R^{2} = \frac{\sum_{i = 1}^{M} {(\log_{10} (C_{st, i}) - \log_{10} ({\overline{C}}_{in}))}^{2}}{\sum_{i = 1}^{M} {(\log_{10} (C_{in, i}) - \log_{10} ({\overline{C}}_{in}))}^{2}} \\ (9) & MedPD = Med (\frac{|C_{st, i} - C_{in, i}|}{C_{in, i}} \times 100 %) \\ (10) & RMSD = \sqrt{\frac{1}{M} \sum_{i = 1}^{M} {(C_{st, i} - C_{in, i})}^{2}} \\ (11) & NRMSD = \frac{RMSD}{{\overline{C}}_{in}} \end{matrix}

Model II regression is used when both the response and explanatory variables are prone to measurement errors, allowing for more accurate comparisons by accounting for uncertainties in both variables (Legendre, 1998; Legendre and Legendre, 2012). This study applied model II regression (major axis) to account for the uncertainty in DPA-derived PFTs when validating satellite products. This method was not used for TChl a, as it was derived using a more direct approach with lower associated uncertainty. The uncertainty associated with pigment-based Chl a measurements is approximately 7 % for TChl a and higher for other groups (Claustre et al., 2004; IOCCG, 2019).

3 Results and Discussion

3.1 Hyperparameter tuning and final model configuration

During model development, the validation dataset is used to tune hyperparameters and determine the optimal model configuration. The tuning process focuses on optimising the model performance and ensuring robust generalisation. Following the methodology outlined in Sect. 2.3.3, we conducted hyperparameter tuning to assess the performance of both gap-filling methods in area No. 9. A random search strategy was employed for hyperparameter exploration, with 20 permutations for DINEOF and over 100 permutations for DINCAE. The number of hyperparameters in DINCAE significantly exceeds those in DINEOF, requiring more permutations for optimal tuning (Table S1). DINEOF's model configuration results on the validation dataset show that the total RMSLE remains highly consistent across all experiments, averaging 0.122 ± 0.001 log₁₀ (mg m⁻³), suggesting that further optimisation would yield minimal improvements. In contrast, DINCAE exhibits greater variability in performance, with some trials failing to converge and getting stuck in local minima, resulting in unrealistic RMSLE values even exceeding 10³ (around 30 % of experiments). Even when DINCAE successfully converges to an optimal minimum (RMSLE < 0.3) in 25 % of experiments, its performance remains more variable with an average of 0.176 ± 0.044 log₁₀ (mg m⁻³). Despite this variability, both models achieve similar minimum total RMSLE on the validation dataset (0.12 log₁₀ (mg m⁻³)), indicating comparable fits to the validation dataset. Notably, the optimally configured model for both gap-filling methods does not require SST as an auxiliary dataset, instead relying solely on Chl a datasets for gap-filling, consistent with the findings of Han et al. (2020) for TChl a gap-filling. The final optimal hyperparameter settings for the two models are presented in Tables S1 and S2.

Once the optimal model configurations for both gap-filling methods are determined, the combined training and validation datasets are used to reconstruct the data across all areas. Figure 3b presents the number of EOFs extracted for each area using the DINEOF method. Areas close to continental shelves and coastal regions, such as areas No. 1, 2, 9, and 10, generally require more EOFs to capture their variability. In contrast, open ocean and oligotrophic areas like area 6 need fewer EOFs to describe their variability. However, areas with a high rate of missing data, such as area No. 7, face challenges in pattern extraction, leading to fewer EOFs extraction. The reconstructed data for each gap-filling model across different areas are combined using a weighted blending method to ensure a smooth transition between regions. An example layout of the fully reconstructed data for DINEOF and DINCAE outputs is shown in Fig. 3. As illustrated in the original satellite data (Fig. 3a), a significant portion of the data is missing, and a comparison with the reconstructed outputs from both models highlights their substantial ability to reconstruct data from limited observations. The visual comparison of the two gap-filling outputs for diatoms on 26 May 2018 reveals noticeable differences in the reconstructed patterns, particularly in areas No. 1, 7, and 10, where distinct bloom patterns are observed. These differences highlight the need for further investigation through evaluation techniques.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f03

Figure 3Example of input and final output of gap-filling methods for diatoms on 26 May 2018. (a) Original satellite product fed to the gap-filling models, with squares indicating different areas (ROI) used for gap-filling. The red line represents the locations of the research vessel throughout the PS113 expedition. (b) Merged output of the DINEOF gap-filling method. The values within the blue boxes indicate the number of EOFs extracted during the DINEOF gap-filling process. (c) Merged output of the DINCAE gap-filling method.

3.2 Performance evaluation

During the data partitioning process, a subset of the available data was masked to serve as a test dataset for evaluating gap-filling techniques and conducting performance comparisons (explained in Sect. 2.2.3). The spatial variation in the average absolute logarithmic difference between the reconstructed data and the test dataset is illustrated in Fig. 4. Errors are observed to exceed 0.3 log₁₀ (mg m⁻³) in certain regions, particularly along the West African coast (areas No. 7 and 8), the Argentine Sea (areas No. 1 and 2), and the English Channel (area No. 10) with high phytoplankton abundance. Significantly lower error values are recorded in the oligotrophic regions of the South Atlantic gyres with lower phytoplankton abundance. The error distributions are similar between TChl a and all PFTs. However, the errors for prokaryotes appear more consistent across regions, likely due to their relatively stable Chl a concentration of prokaryotes. The errors for TChl a are lower, as greater data availability and higher concentration values facilitate more accurate gap-filling. In contrast, diatoms and green algae show high errors, even in open ocean and oligotrophic regions. The error difference analysis (the last row of Fig. 4) reveals that the DINCAE model performs slightly better than the DINEOF model in reconstructing TChl a and PFTs, exhibiting lower absolute differences, particularly in hotspots of high errors (e.g., coastal areas, continental shelves, and equatorial regions) where DINEOF demonstrates notable discrepancies.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f04

Figure 4Spatial variation in the average absolute logarithmic differences between the gap-filled (g.f.) and the test dataset, along with a comparative analysis of the two gap-filling models.

The performance difference between the two gap-filling methods can exceed ±0.2 log₁₀ (mg m⁻³) across all groups. However, the average error difference is relatively small, as the majority of differences in the open ocean are really small. The average error differences are 0.03 for TChl a, 0.01 for diatoms, 0.01 for dinoflagellates, 0.02 for haptophytes, 0.02 for green algae, and 0.01 log₁₀ (mg m⁻³) for prokaryotes.

The RMSLE and MAPE between the DINEOF and DINCAE reconstructed data and the test dataset across all areas and phytoplankton groups are presented in Fig. 5. The top panel of Fig. 5 shows that DINCAE's RMSLE for TChl a ranges from 0.03 to 0.12 log₁₀ (mg m⁻³), from the oligotrophic region to the high productivity zones near the Patagonian shelf and the English Channel, while DINEOF's RMSLE is slightly higher, ranging from 0.05 to 0.16 log₁₀ (mg m⁻³). The PFTs show marginally higher errors, with RMSLEs between 0.03 and 0.25 log₁₀ (mg m⁻³) for DINCAE reconstruction, and between 0.04 and 0.30 log₁₀ (mg m⁻³) for DINEOF reconstruction. Similar to TChl a, the highest RMSLE for PFTs in both models is observed on the Patagonian shelf and near the English Channel, while the lowest RMSLE occurs in the oligotrophic region, where phytoplankton abundance is low. On average, the RMSLE of DINEOF is observed to be approximately 66 % higher for TChl a and 16 % higher for PFTs compared to DINCAE (11 % for diatoms, 20 % for dinoflagellates, 22 % for haptophytes, 16 % for green algae, and 12 % for prokaryotes). The findings suggest that DINCAE generally exhibits a lower RMSLE than DINEOF across most regions and phytoplankton groups.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f05

Figure 5Statistical outcome and comparative analysis for the two gap-filling (g.f.) models on the test dataset.

Download

To enable a unit-free comparison across PFT groups and regions, statistical analysis is performed using MAPE (Fig. 5, bottom panel). MAPE values are generally lower for TChl a than those for PFTs. The averaged MAPE obtained by DINEOF and DINCAE reconstructions in coastal regions and continental shelves (area No. 1, 2, 7, 8, 9, 10) are approximately 32 % and 26 %, respectively, while in the open ocean (area No. 3, 4, 5, 6), the errors are averaged to 13 % and 10 %, respectively. A significant reduction in errors is achieved with DINCAE reconstruction compared to DINEOF, with notable improvements being observed for PFTs, particularly diatoms, dinoflagellates, and haptophytes in areas No. 1 and 7. MAPE is found to be notably higher in regions with increased dynamics due to coastal activity (areas No. 1, 2, 8, 9, and 10) or in areas with a high rate of missing data (area No. 7), while lower MAPE values (approximately 10 %) are observed in areas No. 3, 4, 5, and 6, characterized by lower dynamics and phytoplankton abundance in oligotrophic zones. The highest MAPE is recorded in area No. 1 for haptophytes, with 67 % and 48 %, for diatoms with 57 % and 43 %, and for green algae with 53 % and 43 %, for DINEOF and DINCAE reconstructions, respectively. Dinoflagellates and prokaryotes exhibit comparatively smaller errors. Even in unit-free comparisons, prokaryotes, despite their low concentration, maintain a relatively consistent MAPE ranging from 10 % to 22 % across all areas.

Our results show notable similarities and differences compared to previous studies. Sirjacobs et al. (2011) gap-filled four years of daily TChl a (from MERIS), TSM, and SST data for the Southern North Sea and English Channel. They reported RMSLE values for TChl a ranging from 0.09 to 0.29 log₁₀ (mg m⁻³). Hilborn and Costa (2018) gap-filled three years of MODIS-Aqua TChl a dataset for the highly productive coastal region of the Salish Sea, reporting an RMSLE ranging from 0.17 to 0.22 log₁₀ (mg m⁻³) for daily data and 0.27 to 0.32 log₁₀ (mg m⁻³) for weekly composite data. Wang et al. (2019) reported an RMSLE of 0.13 log₁₀ (mg m⁻³) during the development of a long-term cloud-free Chl a dataset derived from SeaWiFS and MODIS satellite observations over the Bohai and Yellow Seas. Han et al. (2020) reconstructed TChl a in the South China Sea and West Philippine Sea using the daily merged (GlobColour) product with DINEOF and DINCAE gap-filling techniques. Their results reported a minimum cross-validation error of 0.11 log₁₀ (mg m⁻³) (converted from ln(mg m⁻³)) for DINCAE and 0.12 log₁₀ (mg m⁻³) for DINEOF in the South China Sea, and 0.12 log₁₀ (mg m⁻³) for DINCAE and 0.13 log₁₀ (mg m⁻³) for DINEOF in the West Philippine Sea. In comparison, the RMSLE in our TChl a reconstruction is notably lower, with values ranging from 0.08 to 0.16 log₁₀ (mg m⁻³) for DINEOF and 0.03 to 0.12 log₁₀ (mg m⁻³) for DINCAE reconstructions. The higher errors in their study likely reflect the high Chl a concentrations of TChl a and the more dynamic nature of their study region. In our study, the RMSLE values for PFTs are comparable to those reported for TChl a in literature, with the highest values observed for diatoms, ranging from 0.07 to 0.28 log₁₀ (mg m⁻³) for DINEOF and 0.06 to 0.25 log₁₀ (mg m⁻³) for DINCAE reconstructions. Ji et al. (2021) used DINCAE for the gap-filling of SST and TChl a in the East China Sea using MODIS-Aqua, MODIS-Terra, and VIIRS-SNPP products. Their cross-validation results reported a mean relative error (MRE) of 0.40, corresponding to a minimum MAPE of 40 %. In comparison, the MAPE for TChl a and prokaryotes in our study is consistently below 40 % across all regions, and for the remaining PFTs, it is generally below or comparable to this value in productive regions. These results indicate that the validation errors for PFT reconstructions for both methods fall within an acceptable range relative to those reported in the literature.

In summary, both methodologies demonstrate substantial capability in reconstructing TChl a and PFTs with minimal error in open ocean settings, while maintaining acceptable relative error in coastal and high-dynamic regions. The highest MAPEs are 67 % for DINEOF and 48 % for DINCAE, lower than the uncertainty levels of the original PFT products (average uncertainty for the study period and the entire Atlantic Ocean as defined in Sect. 2.1.2: TChl a 33 %, diatoms 140 %, dinoflagellates 112 %, haptophytes 122 %, green algae 88 %, and prokaryotes 102 %). The DINCAE method consistently exhibits better performance, significantly reducing errors in comparison to DINEOF. These results highlight its robustness across all regions and phytoplankton groups, even in the face of challenges posed by the high temporal and spatial dynamics involved in reconstructing data in coastal areas (e.g., areas No. 1, 7, 9, and 10), demonstrating its ability to distinguish and reproduce patterns within the dataset based on a limited amount of available data. The discrepancy between the two models in dynamic regions may arise from their fundamental methodological differences. DINEOF reconstructs missing data by extracting dominant spatiotemporal modes from the entire temporal domain, with an additional emphasis on the local spatiotemporal structure through a Laplacian filter. This EOF-based reconstruction can underrepresent transient or localised features. In contrast, DINCAE employs a U-Net-style architecture that interpolates missing values based on nearby spatiotemporal information, allowing it to more effectively capture localised or transient variability in the data by preserving fine-scale details through skip connections. However, it is important to note that this improved accuracy comes at the cost of more intensive computational demands, requiring GPU resources and a higher number of tuning permutations.

In addition, it is challenging to isolate the effect of data availability on model performance across different spatial regions (e.g., high-latitude, equatorial, and mid-latitude areas), since the physical and biological dynamics in these regions are inherently distinct. The investigation of the relationship between data availability and gap-filling model performance across different groups based on mean absolute error (MAE) (results not shown) showed, for both models, as expected, that a higher data availability generally corresponds to lower and more tightly clustered MAE values, indicating improved reconstruction accuracy. When comparing the two models, DINCAE consistently produces reconstructions with lower MAE than DINEOF, suggesting that DINCAE more effectively captures the underlying spatiotemporal variability, especially in regions with limited data availability.

3.3 Spatial smoothing

3.3.1 Gradient field

As detailed in the Sect. 2.4.2, the Sobel Edge detection algorithm is used to compute the gradient field in the TChl a and PFTs concentration. As an example, results for one test dataset date in area No. 10 are presented in Fig. 6. In the Celtic Sea (Fig. 6, box 1), a pronounced gradient field is evident in the original satellite data for TChl a and diatoms, which is removed when clouds are added during test dataset generation. Both algorithms successfully reconstructed the high gradient in TChl a, maintaining a similar magnitude and pattern. DINCAE produced a gradient pattern closer to the large-scale features of the original satellite data, while DINEOF better captured smaller-scale patterns. For diatoms, DINCAE produced a gradient pattern more consistent with the original satellite product compared to DINEOF. In the Bay of Biscay, along the western coast of France (Fig. 6, box 2), a significant gradient change is observed in the TChl a dataset, while the diatom dataset was initially missing. After reconstruction, both DINEOF and DINCAE transferred the gradient pattern from TChl a to diatoms, demonstrating their ability to capture relationships between datasets. However, DINCAE produced a diatom gradient pattern more closely aligned with TChl a than DINEOF. Overall, the DINEOF gradient appeared noisier than the original satellite and DINCAE gap-filled products. Additionally, satellite track patterns are noticeable in DINEOF diatom reconstructions but are lower in DINCAE outputs.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f06

Figure 6Gradient Field (mg m⁻⁴) of TChl a and diatoms for original Satellite data, with added cloud, DINEOF reconstructed, and DINCAE reconstructed datasets computed for the 23 June 2018 on area No. 10. The blue colour shows the missing values.

3.3.2 Degree of smoothing of originally present data

Figure 7 compares the degree of smoothing as RMSLE between the original satellite product against the DINEOF and DINCAE reconstructed datasets for TChl a and PFTs. The lowest RMSLE are observed for TChl a, prokaryotes, and dinoflagellates, with around 0.10 and 0.05 log₁₀ (mg m⁻³) for DINEOF and DINCAE, respectively. The highest RMSLEs are associated with diatoms and green algae in both datasets, with around 0.14 and 0.07 for DINEOF and DINCAE reconstruction, respectively. The results show that DINEOF has approximately twice the RMSLE of DINCAE for all groups. This indicates that the DINCAE algorithm more effectively transfers the available data from the original satellite dataset to the reconstructed dataset with lower deviation and less smoothing.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f07

Figure 7Comparison of the degree of smoothing between the two gap-filling (g.f.) methods for TChl a and PFTs. The ratio indicates the degree of smoothing achieved by DINEOF relative to DINCAE reconstruction.

Download

3.4 Independent validation

Figures 8 and 9 evaluate the matchups between the original satellite, gap-filled DINEOF or gap-filled DINCAE products against in situ measurements. The evaluation of DINEOF and DINCAE reconstructed data in comparison with in situ measurements was conducted by categorising the dataset into two groups: transferred matchups, representing matchups present in the original satellite product, and filled matchups, referring to the pixels that were missing in the original satellite product and filled through the reconstruction method. In Fig. 8, the regression analysis for both methods across these categories is illustrated, and Fig. 9 provides a summary of the statistical parameters for performance comparison, distinguishing between transferred and filled indices. The transferred matchup points used for this validation are consistent with those used for validating the original satellite product, enabling a direct comparison to assess whether the accuracy of the original satellite data is preserved in the reconstruction process. The first columns in Fig. 8 and the original satellite statistical description in Fig. 9 demonstrate that the TChl a product from the original satellite dataset exhibits superior validation results compared to PFTs. This is attributed to the lower uncertainty associated with the TChl a product, which is derived from maturer conventional ocean colour algorithms that have reduced the product uncertainty as low as ∼ 30 % on average, whereas PFT concentrations, which are derived more indirectly from pigment concentrations via DPA as ground truth data, posing greater challenges for accurate interpretation and separation from the total biomass by the backscattered signals measured by the satellite sensors. TChl a obtains more matchups, due to higher satellite data availability due to the usage of multiple sensor data, with most points aligning closely along the 1:1 line. In addition, the regression analysis shows that TChl a has more favourable slope (0.78) and intercept (−0.38) values compared to the PFTs, which have slopes ranging from 0.41 to 0.66 and intercepts from −0.28 to −1.11. The coefficient of determination (R²) is approximately 0.84 for TChl a and diatoms, around 0.69 for haptophytes and green algae, and about 0.5 for dinoflagellates and prokaryotes. The MedPD of 35% and deviations below the 1:1 line at high Chl a concentrations indicate an underestimation of TChl a by the original satellite product. In contrast, for PFT products, this trend is primarily associated with overestimations at low Chl a concentrations. The highest MedPD values are observed for diatoms and dinoflagellates, at 121 % and 98 %, respectively. The remaining PFTs exhibit MedPD values comparable to TChl a, approximately 35 %. The highest RMSDs are observed for TChl a and diatoms, around 0.23 mg m⁻³, while other groups have lower RMSDs of approximately 0.03 mg m⁻³. However, because RMSDs vary with concentration ranges, they were normalised to NRMSD for better comparison. The NRMSD reveals the lowest error for prokaryotes, at 0.51, and the highest for diatoms, with an NRMSD of 4.66.

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f08

Figure 8Regression analysis comparing in situ measurements with the original satellite product, DINEOF, or DINCAE reconstructed data. The matchups for the reconstructed data are also categorised into two additional groups: transferred matchups (representing data points present in the original satellite product) and filled matchups (representing data points missing in the original satellite product and subsequently filled by the reconstruction models). n refers to the number of matchups between the satellite products and the in situ measurement.

Download

https://gmd.copernicus.org/articles/19/1619/2026/gmd-19-1619-2026-f09

Figure 9Statistical outcome of the regression analysis comparing in situ measurements with the original satellite product, DINEOF, or DINCAE reconstructed data based on filled, transferred and all matchups.

Download

The transferred matchup validation results indicate minimal variation in statistical outcomes when compared to the original satellite validation, with minor deviations observed in R² and MedPD values before and after reconstruction. Notably, the R² values for TChl a, diatoms, dinoflagellates and green algae in the DINEOF reconstruction, along with the MedPD for diatoms in both methods, surpass those of the original satellite data. Overall, the differences in other statistical metrics are negligible, suggesting that both DINEOF and DINCAE effectively preserve the original patterns of the input dataset in the reconstructed outputs. Upon initial observation of Fig. 8, it is evident that the filled matchups exhibit greater dispersion compared to the transferred matchups. As expected, the number of matchups generated through gap-filling in satellite products substantially exceeds those found in the original satellite data. The total number of matchups for the fully reconstructed dataset is approximately 2.35 times greater for TChl a and 5 to 6 times greater for PFTs than the number of original satellite matchups. Figure 9 demonstrates that, for both methods, the slopes and intercepts of the filled matchups are slightly lower than those of the transferred matchups for TChl a and prokaryotes and nearly identical for diatoms. Furthermore, the filled matchups exhibit improved performance for dinoflagellates, haptophytes, and green algae. A lower R² is observed in all filled matchups of the DINCAE and DINEOF reconstructions compared to the transferred matchups, indicating significant dispersion from the regression line. MedPD and RMSD of filled matchups indicate reduced accuracy in most cases compared to the transferred matchups, except for diatoms and dinoflagellates for both methods. The NRMSD reveals greater error levels for TChl a, haptophytes, green algae, and prokaryotes, while diatoms and dinoflagellates exhibit lower errors. Notably, the RMSD for TChl a in filled matchups is approximately 2.75 times that of transferred matchups, whereas the NRMSD is only about 1.12 times higher, a pattern consistent across all PFTs. The higher overall mean Chl a concentration for the filled matchups could contribute to the increased RMSD values as compared to the transferred. These findings suggest that, in most instances, the accuracy of the gap-filled datasets is slightly worse than the original satellite data. Nevertheless, the filled matchups generally follow the same trends and trajectories, except for prokaryotes, albeit with increased variability relative to the transferred matchups. The ability of satellite gap-filling methods to preserve accuracy in existing matchups while significantly expanding the number of matchups with acceptable accuracy highlights the efficiency of these techniques.

The transferred and filled matchups are combined to facilitate a comparison of the outcomes of the reconstruction methods and to evaluate their respective accuracy. Figure 9 presents the statistical results of these regression analyses. The slope and intercept of the regressions are generally better for the DINEOF gap-filled product compared to the DINCAE gap-filled product and even the original satellite product. Exceptions are observed for green algae, where the slope is slightly lower, and for prokaryotes, where it is substantially lower for both gap-filling methods compared to the original dataset. This reduction in accuracy arises from the gap-filling process and reflects the distinct abundance patterns of prokaryotes compared to other groups. Future studies should consider separating the gap-filling process for prokaryotes to improve the performance across all groups. For TChl a, the R² values for both gap-filled products are similar to those of the original satellite product, with DINEOF performing slightly better. For PFTs, however, the R² values are significantly better for DINEOF compared to DINCAE, with DINEOF achieving results closer to the original satellite product. However, the R² values are significantly better (enhanced by 27 %–117 % for different PFTs) for DINEOF compared to DINCAE, with DINEOF achieving results closer to the original satellite product. MedPD for the gap-filled products is approximately 34 % lower for diatoms and dinoflagellates, 60 % lower for green algae, and 22 % lower for prokaryotes compared to the original satellite dataset. RMSD values for the gap-filled products are also close to each other; however, they are twice as high as the original satellite product for TChl a and diatoms, and five times higher for green algae. Normalising to NRMSD reveals that these differences are primarily due to the average Chl a concentrations in the filled products. The NRMSD of gap-filled TChl a is only about 17 % higher than the original product, while it is 17 % lower for diatoms and 30% lower for green algae. Overall, for TChl a, both gap-filling methods demonstrate robustness comparable to the original satellite dataset validation, albeit with higher RMSD, while maintaining similar NRMSD. For PFTs, DINEOF generally performs better than DINCAE, particularly in slope, intercept, and R² metrics. These results highlight the superior performance of DINEOF in external validation.

Xi et al. (2021) validated the global merged OC satellite product for TChl a and PFTs, reporting metrics in the order (MedPD, RMSD [mg m⁻³], R²) as follows: TChl a (32 %, 1.08, 0.82), diatoms (56 %, 0.92, 0.77), dinoflagellates (54 %, 0.89, 0.62), haptophytes (43 %, 0.16, 0.71), green algae (52 %, 0.10, 0.53), and prokaryotes (42 %, 0.09, 0.46). Comparisons with the original satellite validation in this study show that the MedPD ranges are similar, except for diatoms and dinoflagellates, which are approximately double those reported by Xi et al. (2021). Notably, their MedPD values align closely with the gap-filled MedPD results in this study. However, their RMSD values are at least twice as high for TChl a and all PFTs, exceeding both the original satellite and gap-filled product results. This is primarily because their focus was on a global scale, encompassing greater variability compared to our study area. Regarding R², Xi et al. (2021)'s values are comparable to ours for TChl a and haptophytes, lower for diatoms, green algae, and prokaryotes, but higher for dinoflagellates. Xi et al. (2023b) used data from 16 expeditions across the Atlantic Ocean to validate long-term trends in four PFT Chl a monthly products: diatoms, haptophytes, prokaryotes, and dinoflagellates. However, due to missing satellite data and inconsistencies during matchup extraction, less than 10 % (192 out of 1975) of the in situ measurements could be used as matchups for validating the satellite-derived PFT products, similar to the number of matchups in our study. A comparison of the matchup statistical results from Xi et al. (2023b), involving the monthly satellite products and in situ measurements, with the statistical results from our gap-filled products reveals slightly different performance: Their comparison showed stronger performance in terms of slope (diatoms 0.71, haptophytes 0.95, prokaryotes 0.71, dinoflagellates 1.07) and intercept (diatoms −0.27, haptophytes −0.01, prokaryotes 0.12, dinoflagellates 0.04). Results for R² (diatoms 0.76, haptophytes 0.41, prokaryotes 0.36, dinoflagellates 0.66), MedPD (diatoms 60 %, haptophytes 59 %, prokaryotes 185 %, dinoflagellates 59 %), and RMSD (diatoms 0.30, haptophytes 0.18, prokaryotes 0.06, dinoflagellates 0.07) were mixed, with some cases favouring their study and others DINEOF matchups, appearing overall comparable. DINCAE performed worse than their matchups in terms of slope, intercept, and R². For MedPD and RMSD, the results were mixed, similar to the trends observed with DINEOF.

Previous research employing the DINEOF and DINCAE gap-filling techniques has predominantly focused on SST, TChl a, and SPM, with no prior studies addressing PFTs. Consequently, our validation results can only be compared with previous findings for TChl a, although TChl a is not the primary focus of our investigation. Alvera-Azcárate et al. (2021) reconstructed 23 years of Chl a and SPM data in the Greater North Sea using the DINEOF method. Their validation results, based on regression analysis of matchups between the original satellite and DINEOF reconstructed products with in situ measurements, demonstrated a decline in R² from 0.75 to 0.58 after reconstruction, with the MAE increasing from 2.47 to 2.83 mg m⁻³. Our study indicates a smaller decrease in R² from 0.84 to 0.82 and an increase in MAE from 0.10 to 0.19 mg m⁻³using the DINEOF method for TChl a matchups. We observe reduced scatter compared to the previous study, despite a higher reconstruction error in our results. However, the magnitude of the MAE differs significantly between the two studies, largely due to an overall higher phytoplankton abundance in the North Sea compared to the Atlantic Ocean. In another study, Barth et al. (2021) reconstructed 20 years of TChl a and SPM data in the southern North Sea using DINCAE. Their validation using the original satellite and DINCAE reconstructed data with in situ measurement indicated a decline in the validation slope from 0.82 to 0.64, an increase in the intercept from 0.07 to 0.33, a reduction in R² from 0.62 to 0.41, an increase in log₁₀-RMSD from 0.29 to 0.33, and an increase in the number of matchups from 25 to 27. Our application of DINCAE to TChl a results in a similar slope of 0.78, an intercept shift from −0.38 to −0.35, a reduction in R² from 0.84 to 0.80, an increase in log₁₀-RMSD from 0.24 to 0.25, and an increase in matchups from 94 to 221. Volpe et al. (2018) developed an operational gap-filling technique based on DINEOF for ocean colour products in the Mediterranean Sea, validated using 1643 in situ measurements collected between 1997 and 2015. Their results reported RMSE values of 0.27 for the Level 3 product (original satellite dataset) and 0.29 for the Level 4 product (gap-filled), with absolute percentage difference (APD) values of 56 % for Level 3 and 53 % for Level 4 during the operational phase. In comparison, the external validation results for TChl a in our study show an RMSE of 0.25 for Level 3, 0.52 for DINEOF Level 4, and DINCAE Level 4 products, with a MedPD of −35 % for Level 3, −33 % for DINEOF Level 4, and −36 % for DINCAE Level 4. The findings of Volpe et al. (2018) indicate an increase of approximately 6.6 % in RMSE and a 3 % decrease in APD for gap-filled products. In our study, however, RMSE for gap-filled TChl a products increased by 108 % for both methods, while MedPD (noting that APD and MedPD may not be directly comparable) changed by −1 % for DINEOF and 1 % for DINCAE gap-filling. These results suggest that in our study, the median validation error (MedPD) is nearly identical between the original satellite and gap-filled TChl a products, though higher RMSD indicates more extreme errors at both high and low concentrations. For PFTs, the most significant improvements in MedPD are observed for diatoms (−39 % for DINEOF and −41 % for DINCAE gap-filled products) and dinoflagellates (−38 % for DINEOF and −33 % for DINCAE). The remaining PFTs and TChl a exhibit marginal reductions or values similar to the original satellite MedPD, comparable to the APD values for TChl a reported in Volpe et al. (2018). RMSD for dinoflagellates and prokaryotes shows slight changes, consistent with the TChl a level in Volpe et al. (2018), whereas other PFTs and TChl a display more substantial increases in RMSD. A comparison of the external validation results from the literature with those from our study indicates that the errors are generally lower and within an acceptable range, demonstrating the robustness of both the gap-filled datasets and the external validation.

In summary, DINEOF performs better than DINCAE in validation against in situ measurements. In regression analyses (slope, intercept, and R²), DINEOF demonstrates superior performance relative to DINCAE, occasionally even surpassing the original satellite products. This is particularly evident in the case of diatoms, dinoflagellates, haptophytes, and green algae for slope and intercept, and dinoflagellates for R².

3.5 Novelty and limitations

A comprehensive understanding of phytoplankton composition and distribution is crucial for explaining biogeochemical processes and assessing the impact of climate change on marine ecosystems and biodiversity. However, the methods available for retrieving phytoplankton dynamics and distributions are currently constrained. In situ measurements are limited in spatiotemporal coverage and fail to represent the full extent of phytoplankton dynamics. Most current biogeochemical models are limited in the diversity of phytoplankton groups, often focusing on diatoms (micro-phytoplankton) and prokaryotes (pico-phytoplankton) (e.g., RECOM, Schourup-Kristensen et al., 2014, and PISCES, Aumont et al., 2015) and assimilation of satellite-derived TChl a and PFT products shown to be effective in improving the models' performance in predicting these phytoplankton groups (e.g., Pradhan et al., 2019, 2020). Additionally, these models require evaluation by observations with high temporal and spatial coverage and provided uncertainty. Satellite observations of TChl a and PFT Chl a have covered the global ocean for more than 20 years. However, these data are significantly limited in coverage due to non-optimal observing conditions and sensors' availability in operation, leading to data gaps exceeding 90 % in some critical regions. So far, operational gap-filled PFT Chl a products are unavailable, which may have been due to the especially high PFT product data gap rates and the substantial computational demands of gap-filling techniques.

In this study, we applied for the first time gap-filling methods to PFT Chl a products. We used two well-established gap-filling techniques, DINEOF and DINCAE, that have proven effective in reconstructing ocean satellite data products, specifically SST and TChl a. We subsequently evaluated the performance of the two methods using multiple techniques. This novel application showed new perspectives on the gap-filling of multivariate datasets and their validation, particularly for phytoplankton community structure. We showed the transfer of patterns from broader parameter categories, like TChl a, to more specific subcategories, such as PFTs. Both gap-filling methods demonstrated robustness, even in areas with high data missing rates. Notably, gap-filling of PFT products resulted in approximately 80 % more data with minimal impact on the original satellite data, enhancing the understanding of biogeochemical dynamics at the spatiotemporal scale (e.g., time-series analysis) and increasing the number of matchups with in situ measurements by a factor of 5 to 6, thereby supporting further model development and validation. The reconstructions demonstrated significant efficiency in capturing transient-scale oceanic features, with DINEOF using more EOFs to retain various patterns, while DINCAE employing advanced machine learning techniques, such as skip-connections, to effectively preserve these features. For these developments, we used datasets from various Longhurst biogeochemical provinces in the Atlantic Ocean, ensuring the methods' applicability across diverse oceanic conditions, ranging from oligotrophic to eutrophic regions and from open ocean to continental shelves and coastal areas. The successful reconstructions of data across different oceanic regimes underscore the potential for these methods to be extended to other oceanographic variables and regions, paving the way for improved environmental monitoring and predictive modelling efforts on a global scale.

While both models demonstrated remarkable proficiency in reconstructing TChl a and PFTs, certain limitations remain that offer opportunities for further improvement in future work. Both approaches face inherent limitations in scaling a single scene analysis spatially and are specifically designed for regional applications involving multivariate long-term analysis. The associated computational demands further constrain their feasibility for broader, global-scale implementation. Consequently, multiple areas had to be reconstructed along the corridor of the expedition to obtain sufficient matchups for consistent validation purposes. The computational efficiency of both models can be enhanced by introducing an internal data segmentation step before pattern extraction, which would allow parallelised computations across multiple clusters or nodes. For DINEOF, this approach could reduce the cost of iterative EOF decomposition by processing spatial segments independently and later merging the reconstructed fields. For DINCAE, parallelisation can be achieved by distributing training and inference over spatial subsets or by adopting model architectures optimised for distributed GPU computation. Additionally, implementing chunked data handling and memory-efficient input/output could further optimise large-scale processing for both methods.

Furthermore, the length of the time series can significantly influence the performance of both models. A short time series may result in EOF patterns that do not fully capture the underlying variability in the dataset when using the DINEOF method, and it may also lead to suboptimal tuning of parameters and hyperparameters in DINCAE due to limited training samples. A long time series primarily affects the DINEOF method, as EOF extraction over extended periods tends to emphasise more persistent and large-scale spatial patterns while reducing the representation of transient variability. This effect is less pronounced in DINCAE, since the data are temporally segmented into minibatches before being processed by the network, which decreases, but does not eliminate, its dependence on the total record length. When the time series is extended, the corresponding increase in training epochs allows the model to learn from a broader range of examples and improves its generalisation ability, but it may also lead to a reduced sensitivity to short-lived or rare features. Although not examined in the present study, the length of the time series could be considered an experimental hyperparameter to be optimised during model development.

In Addition, the in situ measurements are limited to the expedition duration, which may affect the robustness of the external validation process for assessing the gap-filling model over several years of data in the Atlantic Ocean. Incorporating datasets from other expeditions conducted at different times in the same region, as implemented by Xi et al. (2023b), enhances the robustness of validation by extending the temporal and spatial coverage of in situ measurements, thereby reducing potential validation biases. Furthermore, both models incorporate hyperparameters that require multiple random search permutations to achieve a robust architecture, like most machine learning algorithms. This requirement is particularly pronounced for DINCAE, which features a convolutional structure, compared to DINEOF, which is inherently parameter-free. Even with optimised structures selected for the development area, the spatial transferability of the hyperparameter combinations needs to be tested.

Moreover, neither model includes per-pixel uncertainty of the input dataset, which is crucial for more accurate reconstruction. DINEOF does not directly provide an uncertainty analysis for the model; instead, this measure is available through postprocessing steps. Beckers et al. (2006) developed uncertainty metrics for DINEOF based on an analogy with optimal interpolation at a cost comparable to the interpolation itself. In contrast, DINCAE provides a reconstruction uncertainty. However, this uncertainty does not account for the inherent uncertainty in the satellite product. While some adjustments are provided to adapt to the input uncertainty (Barth et al., 2022), these techniques remain quite limited. Further advancements are necessary in this area, enabling the current method's direct applicability for more extensive uncertainty analysis. Consequently, there is a need for continued development to fully integrate per-pixel uncertainty from the original satellite product into reconstruction models. Recent developments have focused on incorporating per-pixel uncertainty from the original satellite dataset into DINCAE by scaling the input using the inverse of the per-pixel error variance. While this method shows potential, further evaluation is required to assess its effectiveness and implications.

4 Conclusion and outlook

Missing data in satellite-derived biogeochemical observations can lead to underrepresentation of important spatiotemporal dynamics, especially in regions characterised by high variability or ecological sensitivity, such as coastal zones and upwelling areas. Missing data in these regions can obscure critical information about transient biological events and environmental responses. The application of robust gap-filling techniques enables the reconstruction of these dynamic patterns, providing more complete datasets that can be exploited for improved modelling, targeted field campaigns, and continuous environmental monitoring. For example, enhanced reconstructions of Chl a concentration can support fisheries management by linking biological productivity with fish distribution, assist in the early detection and prediction of harmful algal blooms, and improve estimates of net primary production. These downstream applications ultimately contribute to the development of more sustainable marine and climate management strategies.

In this study, two gap-filling methods were applied for the first time to TChl a and five major PFTs (diatoms, dinoflagellates, haptophytes, green algae, and prokaryotes) over three years (2016–2019) along a corridor of a transect in the Atlantic Ocean, surveyed during the PS113 RV Polarstern expedition in 2018 with extensive in situ validation data (Bracher et al., 2020a). The first method, DINEOF, uses dominant empirical orthogonal functions, while the second, DINCAE, employs a convolutional autoencoder for reconstruction. A random search approach was used for hyperparameter optimisation.

DINEOF achieves roughly double the RMSLE in the degree of smoothing compared to DINCAE. The performance evaluation on the test dataset further highlights DINCAE's advantage, with DINEOF yielding RMSLE values that are 66 % higher for TChl a, 11 % higher for diatoms, 20 % higher for dinoflagellates, 16 % higher for green algae, and 12 % higher for prokaryotes than DINCAE. Additionally, the MAPE results are consistent with the RMSLE findings. These errors vary significantly by location and group, with higher errors near continental shelves or areas with high missing data rates (31.7 % for DINEOF and 26 % for DINCAE on total average error) and notably lower errors in the open ocean and oligotrophic regions (12.9 % for DINEOF and 9.7 % for DINCAE on total average error). Overall, DINCAE shows better gap-filling performance with test dataset validations, particularly in regions with complex water dynamics. External validation using in situ measurements reveals reduced accuracy in gap-filled data compared to the original dataset, with both models showing similar trends with increased dispersion. External validation indicates that DINEOF performs better than DINCAE, achieving better regression performance, with approximately 12.5 % improved slope, 13.6 % improved intercept, and 68 % higher R² for PFTs. In some cases, DINEOF even surpasses the original satellite dataset validation results. The MedPD, RMSD, and NRMSD values are comparable between the two gap-filling methods but show variable performance relative to the original satellite data validation. Overall, external validation indicates that DINEOF is more suitable for large-scale gap-filling of ocean colour products, particularly for TChl a and Chl a of PFTs. Although the current PFT and gap-filled PFT products show notable differences compared to in situ measurements, they remain valuable for examining large-scale phytoplankton dynamics, and further refinement of retrieval algorithms is expected to enhance their accuracy in the future.

Test dataset performance evaluation and external validation results indicate that both methods demonstrate sufficient capability to reconstruct gaps within the ocean colour datasets. DINEOF is recommended for larger areas, open oceans and less complex waters, where it performs better than DINCAE in independent validation, offering the benefits of simpler tuning, lower computational costs, and more interpretable phenomena representation through EOF patterns. Although DINCAE offers higher accuracy across all areas in test dataset validation, its more complex architecture, demanding tuning procedure, and requirement for GPU resources make it better suited for complex waters and coastal regions where precise reconstruction of original transient-scale patterns is critical. Future research should focus on a detailed uncertainty estimate for the reconstructed products, a critical improvement for their use in data fusion or assimilation. There are many other applications for the gap-free TChl a and PFT data envisaged for marine ecosystem studies at regional and global scales. For example, detailed process studies rely mostly on a set of different but coincident in situ measurements of physical and biogeochemical parameters obtained during specific expeditions transecting regions of interest. The complete reconstruction of the regional phytoplankton community phenology using the gap-free satellite TChl a and PFT Chl a before, during, and after research expeditions tremendously enhances the ability to link the different in-situ point measurements to each other. The gap-filled data set can enhance near-real-time research expedition planning to find hotspots of certain phytoplankton blooms in case of persistent cloud cover, limiting the satellite observations. Further, these gap-free data can ease the evaluation of global biogeochemical models representing PFTs (e.g., Bopp et al., 2013; Dutkiewicz et al., 2015; Gürses et al., 2023). Although the current methods are best suited for regional monitoring, scaling them to global applications would require additional optimisation. Nevertheless, constructing a globally gap-filled Chl a dataset, even at a reduced spatiotemporal resolution, could provide valuable input for long-term climate assessments, global biogeochemical modelling, and validation of Earth system models.

Code and data availability

The preprocessing, processing, and postprocessing codes for generating gap-filled satellite-derived TChl a and PFTs Chl a datasets are available on Zenodo (https://doi.org/10.5281/zenodo.14905369, Mehdipour, 2025a) and GitHub (https://github.com/EhsanMehdipour/PFT_gapfilling, last access: 21 February 2025). The merged gap-filled datasets generated using both gap-filling methods for the duration of independent in-situ measurements from the RV Polarstern PS113 expedition (10 May to 9 June 2018) are available on Zenodo (https://doi.org/10.5281/zenodo.14905558, Mehdipour, 2025b). The complete gap-filled datasets for the individual regions are also available separately: those generated using the DINEOF gap-filling method are available on Zenodo (https://doi.org/10.5281/zenodo.15095368, Mehdipour, 2025c), and those generated using the DINCAE gap-filling method are available on Zenodo (https://doi.org/10.5281/zenodo.15102826, Mehdipour, 2025d). The Source code for the DINEOF gap-filling model is available at https://github.com/aida-alvera/DINEOF (last access: 15 March 2023). The source code for the DINCAE gap-filling model is available at https://github.com/gher-uliege/DINCAE.jl (last access: 24 May 2024) or https://doi.org/10.5281/zenodo.5575066 (Barth, 2025). The DPA-derived TChl a and PFTs Chl a concentrations, obtained from the pigment database, were published at https://doi.org/10.1594/PANGAEA.911061 (Bracher et al., 2020b) and https://doi.org/10.1594/PANGAEA.954738 (Xi et al., 2023a). The original satellite-derived TChl a and PFTs Chl a concentrations are available from the Copernicus Marine Service website, as detailed in Sect. 2.1.2. The TChl a and PFTs Chl a dataset can be accessed at https://doi.org/10.48670/moi-00280 (E.U. Copernicus Marine Service Information, 2023), and the SST dataset is available at https://doi.org/10.48670/moi-00168 (E.U. Copernicus Marine Service Information, 2022).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/gmd-19-1619-2026-supplement.

Author contributions

The authors' contributions, outlined according to the Contributor Roles Taxonomy (CRediT) system, are as follows: EM: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft preparation, Writing – review & editing; HX: Data curation, Formal analysis, Investigation, Writing – review & editing; ABa: Methodology, Software, Writing – review & editing; AAA: Methodology, Software, Writing – review & editing; AW: Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing; ABr: Conceptualization, Data curation, Funding acquisition, Project administration, Supervision, Writing – review & editing.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We thank ESA, EUMETSAT, and NASA for the ocean colour satellite data and the Copernicus Marine Service for the level 3 merged TChl a and PFT products. We further acknowledge the captain, the crew, and other scientists on board PS113 for their valuable support on board. We sincerely thank the reviewers for their thorough evaluation of the manuscript and their valuable and thoughtful suggestions. We acknowledge support by the Open Access publication fund of Alfred-Wegener-Institut Helmholtz-Zentrum für Polar- und Meeresforschung. All text in this study was written by the co-authors, with AI assistance, such as ChatGPT, used solely to enhance the manuscript's readability and language.

Financial support

EM's contribution was part of the 4D-Phyto project, funded by AWI-INSPIRES and the Helmholtz School for Marine Data Science (MarDATA) (grant no. HIDSS-0005). HX's contribution was supported via the Copernicus Marine Service Evolution project GLOPHYTS (grant no. 21036L05B-COP-INNOSCI-9000) and ML-PhyTAO (grant no. 23138L03D-COP-INNO-SCI-9000) implemented by Mercator Ocean International. AAA's contribution was supported via the Copernicus Marine Service Evolution project, MultiRes. Funding for the RV Polarstern expedition PS113 data collection was supplied by the Helmholtz Infrastructure Initiative FRAM, and ship time was provided under grant no. AWI_PS113_00.

The article processing charges for this open-access publication were covered by the Alfred-Wegener-Institut Helmholtz-Zentrum für Polar- und Meeresforschung.

Review statement

This paper was edited by Paul Halloran and reviewed by two anonymous referees.

References

Abdel Latif, B., Lecerf, R., Mercier, G., and Hubert-Moy, L.: Preprocessing of Low-Resolution Time Series Contaminated by Clouds and Shadows, IEEE Trans. Geosci. Remote Sensing, 46, 2083–2096, https://doi.org/10.1109/TGRS.2008.916473, 2008.

Alfred-Wegener-Institut Helmholtz-Zentrum für Polar- und Meeresforschung: Polar Research and Supply Vessel POLARSTERN Operated by the Alfred-Wegener-Institute, Journal of Large-Scale Research Facilities, 3, A119–A119, https://doi.org/10.17815/jlsrf-3-163, 2017.

Alvera-Azcárate, A., Barth, A., Rixen, M., and Beckers, J. M.: Reconstruction of incomplete oceanographic data sets using empirical orthogonal functions: application to the Adriatic Sea surface temperature, Ocean Modelling, 9, 325–346, https://doi.org/10.1016/j.ocemod.2004.08.001, 2005.

Alvera-Azcárate, A., Barth, A., Beckers, J.-M., and Weisberg, R. H.: Multivariate reconstruction of missing data in sea surface temperature, chlorophyll, and wind satellite fields, Journal of Geophysical Research Oceans, 112, https://doi.org/10.1029/2006JC003660, 2007.

Alvera-Azcárate, A., Barth, A., Sirjacobs, D., and Beckers, J.-M.: Enhancing temporal correlations in EOF expansions for the reconstruction of missing data using DINEOF, Ocean Sci., 5, 475–485, https://doi.org/10.5194/os-5-475-2009, 2009.

Alvera-Azcárate, A., Van der Zande, D., Barth, A., Troupin, C., Martin, S., and Beckers, J.-M.: Analysis of 23 Years of Daily Cloud-Free Chlorophyll and Suspended Particulate Matter in the Greater North Sea, Frontiers in Marine Science, 8, https://doi.org/10.3389/fmars.2021.707632, 2021.

Alvera-Azcárate, A., Van der Zande, D., Barth, A., Dille, A., Massant, J., and Beckers, J.-M.: Generation of super-resolution gap-free ocean colour satellite products using data-interpolating empirical orthogonal functions (DINEOF), Ocean Sci., 21, 787–805, https://doi.org/10.5194/os-21-787-2025, 2025.

Aumont, O., Ethé, C., Tagliabue, A., Bopp, L., and Gehlen, M.: PISCES-v2: an ocean biogeochemical model for carbon and ecosystem studies, Geosci. Model Dev., 8, 2465–2513, https://doi.org/10.5194/gmd-8-2465-2015, 2015.

Bailey, S. W. and Werdell, P. J.: A multi-sensor approach for the on-orbit validation of ocean color satellite data products, Remote Sensing of Environment, 102, 12–23, https://doi.org/10.1016/j.rse.2006.01.015, 2006.

Barth, A.: gher-uliege/DINCAE.jl: v2.0.2, Zenodo [code], https://doi.org/10.5281/zenodo.5575066, 2025.

Barth, A., Alvera-Azcárate, A., Licer, M., and Beckers, J.-M.: DINCAE 1.0: a convolutional neural network with error estimates to reconstruct sea surface temperature satellite observations, Geosci. Model Dev., 13, 1609–1622, https://doi.org/10.5194/gmd-13-1609-2020, 2020.

Barth, A., Alvera-Azcárate, A., Troupin, C., Beckers, J.-M., and Van der Zande, D.: Reconstruction of Missing Data in Satellite Images of the Southern North Sea Using a Convolutional Neural Network (Dincae), in: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, 7493–7496, https://doi.org/10.1109/IGARSS47720.2021.9554045, 2021.

Barth, A., Alvera-Azcárate, A., Troupin, C., and Beckers, J.-M.: DINCAE 2.0: multivariate convolutional neural network with error estimates to reconstruct sea surface temperature satellite and altimetry observations, Geosci. Model Dev., 15, 2183–2196, https://doi.org/10.5194/gmd-15-2183-2022, 2022.

Beckers, J. M. and Rixen, M.: EOF Calculations and Data Filling from Incomplete Oceanographic Datasets, Journal of Atmospheric and Oceanic Technology, 20, 1839–1856, https://doi.org/10.1175/1520-0426(2003)020<1839:ECADFF>2.0.CO;2, 2003.

Beckers, J.-M., Barth, A., and Alvera-Azcárate, A.: DINEOF reconstruction of clouded images including error maps – application to the Sea-Surface Temperature around Corsican Island, Ocean Sci., 2, 183–199, https://doi.org/10.5194/os-2-183-2006, 2006.

Belkin, I. M. and O'Reilly, J. E.: An algorithm for oceanic front detection in chlorophyll and SST satellite imagery, Journal of Marine Systems, 78, 319–326, https://doi.org/10.1016/j.jmarsys.2008.11.018, 2009.

Blondeau-Patissier, D., Gower, J. F. R., Dekker, A. G., Phinn, S. R., and Brando, V. E.: A review of ocean color remote sensing methods and statistical techniques for the detection, mapping and analysis of phytoplankton blooms in coastal and open oceans, Progress in Oceanography, 123, 123–144, https://doi.org/10.1016/j.pocean.2013.12.008, 2014.

Bopp, L., Resplandy, L., Orr, J. C., Doney, S. C., Dunne, J. P., Gehlen, M., Halloran, P., Heinze, C., Ilyina, T., Séférian, R., Tjiputra, J., and Vichi, M.: Multiple stressors of ocean ecosystems in the 21st century: projections with CMIP5 models, Biogeosciences, 10, 6225–6245, https://doi.org/10.5194/bg-10-6225-2013, 2013.

Bracher, A., Bouman, H. A., Brewin, R. J. W., Bricaud, A., Brotas, V., Ciotti, A. M., Clementson, L., Devred, E., Di Cicco, A., Dutkiewicz, S., Hardman-Mountford, N. J., Hickman, A. E., Hieronymi, M., Hirata, T., Losa, S. N., Mouw, C. B., Organelli, E., Raitsos, D. E., Uitz, J., Vogt, M., and Wolanin, A.: Obtaining phytoplankton diversity from ocean color: A scientific roadmap for future development, Frontiers in Marine Science, 4, https://doi.org/10.3389/fmars.2017.00055, 2017.

Bracher, A., Xi, H., Dinter, T., Mangin, A., Strass, V., von Appen, W. J., and Wiegmann, S.: High Resolution Water Column Phytoplankton Composition Across the Atlantic Ocean From Ship-Towed Vertical Undulating Radiometry, Frontiers in Marine Science, 7, https://doi.org/10.3389/fmars.2020.00235, 2020a.

Bracher, A., Wiegmann, S., Xi, H., and Dinter, T.: Phytoplankton pigment concentration and phytoplankton groups measured on water samples obtained during POLARSTERN cruise PS113 in the Atlantic Ocean, PANGAEA [data set], https://doi.org/10.1594/PANGAEA.911061, 2020b.

Campbell, J. W.: The lognormal distribution as a model for bio-optical variability in the sea, Journal of Geophysical Research: Oceans, 100, 13237–13254, https://doi.org/10.1029/95JC00458, 1995.

Chapman, C. and Charantonis, A. A.: Reconstruction of Subsurface Velocities From Satellite Observations Using Iterative Self-Organizing Maps, IEEE Geosci. Remote Sensing Lett., 14, 617–620, https://doi.org/10.1109/LGRS.2017.2665603, 2017.

Claustre, H., Hooker, S. B., Van Heukelem, L., Berthon, J.-F., Barlow, R., Ras, J., Sessions, H., Targa, C., Thomas, C. S., van der Linde, D., and Marty, J.-C.: An intercomparison of HPLC phytoplankton pigment methods using in situ samples: application to remote sensing and database activities, Marine Chemistry, 85, 41–61, https://doi.org/10.1016/j.marchem.2003.09.002, 2004.

Donlon, C. J., Martin, M., Stark, J., Roberts-Jones, J., Fiedler, E., and Wimmer, W.: The Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) system, Remote Sensing of Environment, 116, 140–158, https://doi.org/10.1016/j.rse.2010.10.017, 2012.

Dutkiewicz, S., Hickman, A. E., Jahn, O., Gregg, W. W., Mouw, C. B., and Follows, M. J.: Capturing optically important constituents and properties in a marine biogeochemical and ecosystem model, Biogeosciences, 12, 4447–4481, https://doi.org/10.5194/bg-12-4447-2015, 2015.

E.U. Copernicus Marine Service Information: Global Ocean OSTIA Sea Surface Temperature and Sea Ice Reprocessed, Copernicus Marine Service [data set], https://doi.org/10.48670/moi-00168, 2022.

E.U. Copernicus Marine Service Information: Global Ocean Colour (Copernicus-GlobColour), Bio-Geo-Chemical, L3 (daily) from Satellite Observations (1997–ongoing), Copernicus Marine Service [data set], https://doi.org/10.48670/moi-00280, 2023.

EUMETSAT: Recommendations for Sentinel-3 OLCI Ocean Colour product validations in comparison with in situ measurements Matchup Protocols, EUM/SEN3/DOC/19/1092968, https://user.eumetsat.int/s3/eup-strapi-media/Recommendations_for_Sentinel_3_OLCI_Ocean_Colour_product_validations_in_comparison_with_in_situ_measurements_Matchup_Protocols_V8_B_e6c62ce677.pdf (last access: 6 September 2024), 2022.

Evensen, G.: Data Assimilation: The Ensemble Kalman Filter, Springer Berlin Heidelberg, Berlin, Heidelberg, https://doi.org/10.1007/978-3-642-03711-5, 2009.

Falkowski, P. G., Laws, E. A., Barber, R. T., and Murray, J. W.: Phytoplankton and Their Role in Primary, New, and Export Production, in: Ocean Biogeochemistry. Global Change — The IGBP Series (closed), edited by: Fasham, M. J. R., Springer, Berlin Heidelberg, 99–121, https://doi.org/10.1007/978-3-642-55844-3_5, 2003.

Fennel, K., Gehlen, M., Brasseur, P., Brown, C. W., Ciavatta, S., Cossarini, G., Crise, A., Edwards, C. A., Ford, D., Friedrichs, M. A. M., Gregoire, M., Jones, E., Kim, H. C., Lamouroux, J., Murtugudde, R., and Perruche, C.: Advancing marine biogeochemical and ecosystem reanalyses and forecasts as tools for monitoring and managing ecosystem health, Frontiers in Marine Science, 6, 89, https://doi.org/10.3389/fmars.2019.00089, 2019.

Field, C. B., Behrenfeld, M. J., Randerson, J. T., and Falkowski, P.: Primary production of the biosphere: Integrating terrestrial and oceanic components, Science, 281, 237–240, 1998.

Flanders Marine Institute: Maritime Boundaries Geodatabase: Maritime Boundaries and Exclusive Economic Zones (200NM), version 12, https://doi.org/10.14284/632, 2023.

Flanders Marine Institute: Maritime Boundaries Geodatabase: Extended Continental Shelves, version 2, https://doi.org/10.14284/697, 2024.

Good, S., Fiedler, E., Mao, C., Martin, M. J., Maycock, A., Reid, R., Roberts-Jones, J., Searle, T., Waters, J., While, J., and Worsfold, M.: The Current Configuration of the OSTIA System for Operational Production of Foundation Sea Surface Temperature and Ice Concentration Analyses, Remote Sensing, 12, 720, https://doi.org/10.3390/rs12040720, 2020.

Gürses, Ö., Oziel, L., Karakuş, O., Sidorenko, D., Völker, C., Ye, Y., Zeising, M., Butzin, M., and Hauck, J.: Ocean biogeochemistry in the coupled ocean–sea ice–biogeochemistry model FESOM2.1–REcoM3, Geosci. Model Dev., 16, 4883–4936, https://doi.org/10.5194/gmd-16-4883-2023, 2023.

Han, Z., He, Y., Liu, G., and Perrie, W.: Application of DINCAE to Reconstruct the Gaps in Chlorophyll-a Satellite Observations in the South China Sea and West Philippine Sea, Remote Sensing, 12, 480, https://doi.org/10.3390/rs12030480, 2020.

Hilborn, A. and Costa, M.: Applications of DINEOF to Satellite-Derived Chlorophyll-a from a Productive Coastal Region, Remote Sensing, 10, 1449, https://doi.org/10.3390/rs10091449, 2018.

Hirata, T., Hardman-Mountford, N. J., Brewin, R. J. W., Aiken, J., Barlow, R., Suzuki, K., Isada, T., Howell, E., Hashioka, T., Noguchi-Aita, M., and Yamanaka, Y.: Synoptic relationships between surface Chlorophyll-a and diagnostic pigments specific to phytoplankton functional types, Biogeosciences, 8, 311–327, https://doi.org/10.5194/bg-8-311-2011, 2011.

Hong, Z., Long, D., Li, X., Wang, Y., Zhang, J., Hamouda, M. A., and Mohamed, M. M.: A global daily gap-filled chlorophyll-a dataset in open oceans during 2001–2021 from multisource information using convolutional neural networks, Earth Syst. Sci. Data, 15, 5281–5300, https://doi.org/10.5194/essd-15-5281-2023, 2023.

Hosoda, K. and Sakaida, F.: Global Daily High-Resolution Satellite-Based Foundation Sea Surface Temperature Dataset: Development and Validation against Two Definitions of Foundation SST, Remote Sensing, 8, 962, https://doi.org/10.3390/rs8110962, 2016.

Huot, Y., Babin, M., Bruyant, F., Grob, C., Twardowski, M. S., and Claustre, H.: Relationship between photosynthetic parameters and different proxies of phytoplankton biomass in the subtropical ocean, Biogeosciences, 4, 853–868, https://doi.org/10.5194/bg-4-853-2007, 2007.

IOCCG: Phytoplankton Functional Types from Space, International Ocean Colour Coordinating Group (IOCCG) Dartmouth, NS, Canada, https://doi.org/10.25607/OBP-106, 2014.

IOCCG: Uncertainties in ocean colour remote sensing, International Ocean Colour Coordinating Group, Dartmouth, Nova Scotia, https://doi.org/10.25607/OBP-696, 2019.

Ji, C., Zhang, Y., Cheng, Q., and Tsou, J. Y.: Investigating ocean surface responses to typhoons using reconstructed satellite data, International Journal of Applied Earth Observation and Geoinformation, 103, 102474, https://doi.org/10.1016/j.jag.2021.102474, 2021.

Jouini, M., Lévy, M., Crépon, M., and Thiria, S.: Reconstruction of satellite chlorophyll images under heavy cloud coverage using a neural classification method, Remote Sensing of Environment, 131, 232–246, https://doi.org/10.1016/j.rse.2012.11.025, 2013.

Jung, S., Yoo, C., and Im, J.: High-Resolution Seamless Daily Sea Surface Temperature Based on Satellite Data Fusion and Machine Learning over Kuroshio Extension, Remote Sensing, 14, 575, https://doi.org/10.3390/rs14030575, 2022.

Kandasamy, S., Baret, F., Verger, A., Neveux, P., and Weiss, M.: A comparison of methods for smoothing and gap filling time series of remote sensing observations – application to MODIS LAI products, Biogeosciences, 10, 4055–4071, https://doi.org/10.5194/bg-10-4055-2013, 2013.

Kostopoulou, E.: Applicability of ordinary Kriging modeling techniques for filling satellite data gaps in support of coastal management, Model. Earth Syst. Environ., 7, 1145–1158, https://doi.org/10.1007/s40808-020-00940-5, 2021.

Krasnopolsky, V., Nadiga, S., Mehra, A., Bayler, E., and Behringer, D.: Neural Networks Technique for Filling Gaps in Satellite Measurements: Application to Ocean Color Observations, Computational Intelligence and Neuroscience, 2016, e6156513, https://doi.org/10.1155/2016/6156513, 2015.

Legendre, P.: Model II regression user's guide, R edition, R Vignette, 14 pp., https://cran.r-project.org/web/packages/lmodel2/vignettes/mod2user.pdf (last access: 6 June 2024), 1998.

Legendre, P. and Legendre, L.: Numerical ecology, Elsevier, ISBN 978-0-444-53868-0, 2012.

Lepot, M., Aubin, J.-B., and Clemens, F. H. L. R.: Interpolation in Time Series: An Introductive Overview of Existing Methods, Their Performance Criteria and Uncertainty Assessment, Water, 9, 796, https://doi.org/10.3390/w9100796, 2017.

Li, J. and Heap, A. D.: A review of spatial interpolation methods for environmental scientists, Geoscience Australia, Record 2008/23, 137 pp., ISBN 978-1-921498-30-5, https://www.ga.gov.au/bigobj/GA12526.pdf (last access: 11 August 2024), 2008.

Li, J. and Heap, A. D.: Spatial interpolation methods applied in the environmental sciences: A review, Environmental Modelling & Software, 53, 173–189, https://doi.org/10.1016/j.envsoft.2013.12.008, 2014.

Litchman, E., Klausmeier, C. A., Miller, J. R., Schofield, O. M., and Falkowski, P. G.: Multi-nutrient, multi-group model of present and future oceanic phytoplankton communities, Biogeosciences, 3, 585–606, https://doi.org/10.5194/bg-3-585-2006, 2006.

Liu, X. and Wang, M.: Gap Filling of Missing Data for VIIRS Global Ocean Color Products Using the DINEOF Method, IEEE Transactions on Geoscience and Remote Sensing, 56, 4464–4476, https://doi.org/10.1109/TGRS.2018.2820423, 2018.

Liu, X. and Wang, M.: Global daily gap-free ocean color products from multi-satellite measurements, International Journal of Applied Earth Observation and Geoinformation, 108, 102714, https://doi.org/10.1016/j.jag.2022.102714, 2022.

Longhurst, A. R.: Ecological geography of the sea, Elsevier, ISBN 978-0-12-455521-1, 2010.

Losa, S. N., Soppa, M. A., Dinter, T., Wolanin, A., Brewin, R. J. W., Bricaud, A., Oelker, J., Peeken, I., Gentili, B., Rozanov, V., and Bracher, A.: Synergistic exploitation of hyper- and multi-spectral precursor sentinel measurements to determine phytoplankton functional types (SynSenPFT), Frontiers in Marine Science, 4, 203, https://doi.org/10.3389/fmars.2017.00203, 2017.

Lu, T., Li, S., and Fu, W.: Fusion Based Seamless Mosaic for Remote Sensing Images, Sens. Imaging, 15, 101, https://doi.org/10.1007/s11220-014-0101-0, 2014.

Mehdipour, E.: EhsanMehdipour/PFT_gapfilling: Gap-Filling Phytoplankton Functional Types in the Atlantic Ocean Using DINCAE and DINEOF Methods, Zenodo [code], https://doi.org/10.5281/zenodo.14905369, 2025a.

Mehdipour, E.: Gap-filled phytoplankton functional types (PFT) dataset for the Atlantic Ocean along corridor of the RV Polarstern PS113 expedition using DINEOF and DINCAE gap-filling methods, Zenodo [data set], https://doi.org/10.5281/zenodo.14905558, 2025b.

Mehdipour, E.: Gap-Filled Phytoplankton Functional Types (PFT) Dataset Using the DINEOF Method for Selected Regions Along an Atlantic Ocean Transect (2016-04-25 to 2019-04-25), Zenodo [data set], https://doi.org/10.5281/zenodo.15095368, 2025c.

Mehdipour, E.: Gap-Filled Phytoplankton Functional Types (PFT) Dataset Using the DINCAE Method for Selected Regions Along an Atlantic Ocean Transect (2016-04-25 to 2019-04-25), Zenodo [data set], https://doi.org/10.5281/zenodo.15102826, 2025d.

Nerger, L. and Hiller, W.: Software for ensemble-based data assimilation systems – Implementation strategies and scalability, Computers & Geosciences, 55, 110–118, https://doi.org/10.1016/J.CAGEO.2012.03.026, 2013.

Park, J., Kim, J.-H., Kim, H., Kim, B.-K., Bae, D., Jo, Y.-H., Jo, N., and Lee, S. H.: Reconstruction of Ocean Color Data Using Machine Learning Techniques in Polar Regions: Focusing on Off Cape Hallett, Ross Sea, Remote Sensing, 11, 1366, https://doi.org/10.3390/rs11111366, 2019.

Pradhan, H. K., Völker, C., Losa, S. N., Bracher, A., and Nerger, L.: Assimilation of Global Total Chlorophyll OC-CCI Data and Its Impact on Individual Phytoplankton Fields, Journal of Geophysical Research: Oceans, 124, 470–490, https://doi.org/10.1029/2018JC014329, 2019.

Pradhan, H. K., Völker, C., Losa, S. N., Bracher, A., and Nerger, L.: Global Assimilation of Ocean-Color Data of Phytoplankton Functional Types: Impact of Different Data Sets, Journal of Geophysical Research: Oceans, 125, e2019JC015586, https://doi.org/10.1029/2019JC015586, 2020.

Reynolds, R. W. and Smith, T. M.: Improved Global Sea Surface Temperature Analyses Using Optimum Interpolation, Journal of Climate, 7, 929–948, https://doi.org/10.1175/1520-0442(1994)007<0929:IGSSTA>2.0.CO;2, 1994.

Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for Biomedical Image Segmentation, in: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Cham, 234–241, https://doi.org/10.1007/978-3-319-24574-4_28, 2015.

Sathyendranath, S., Brewin, R. J. W., Brockmann, C., Brotas, V., Calton, B., Chuprin, A., Cipollini, P., Couto, A. B., Dingle, J., Doerffer, R., Donlon, C., Dowell, M., Farman, A., Grant, M., Groom, S., Horseman, A., Jackson, T., Krasemann, H., Lavender, S., Martinez-Vicente, V., Mazeran, C., Mélin, F., Moore, T. S., Müller, D., Regner, P., Roy, S., Steele, C. J., Steinmetz, F., Swinton, J., Taberner, M., Thompson, A., Valente, A., Zühlke, M., Brando, V. E., Feng, H., Feldman, G., Franz, B. A., Frouin, R., Gould, R. W., Hooker, S. B., Kahru, M., Kratzer, S., Mitchell, B. G., Muller-Karger, F. E., Sosik, H. M., Voss, K. J., Werdell, J., and Platt, T.: An Ocean-Colour Time Series for Use in Climate Studies: The Experience of the Ocean-Colour Climate Change Initiative (OC-CCI), Sensors, 19, 4285, https://doi.org/10.3390/s19194285, 2019.

Schourup-Kristensen, V., Sidorenko, D., Wolf-Gladrow, D. A., and Völker, C.: A skill assessment of the biogeochemical model REcoM2 coupled to the Finite Element Sea Ice–Ocean Model (FESOM 1.3), Geosci. Model Dev., 7, 2769–2802, https://doi.org/10.5194/gmd-7-2769-2014, 2014.

Sirjacobs, D., Alvera-Azcárate, A., Barth, A., Lacroix, G., Park, Y., Nechad, B., Ruddick, K., and Beckers, J.-M.: Cloud filling of ocean colour and sea surface temperature remote sensing products over the Southern North Sea by the Data Interpolating Empirical Orthogonal Functions methodology, Journal of Sea Research, 65, 114–130, https://doi.org/10.1016/j.seares.2010.08.002, 2011.

Sobel, I. and Feldman, G.: A 3 × 3 isotropic gradient operator for image processing, A Talk at the Stanford Artificial Intelligence Project, https://www.researchgate.net/publication/285159837_A_33_isotropic_gradient_operator_for_image_processing (last access: 11 June 2024), 1968.

Stock, A., Subramaniam, A., Van Dijken, G. L., Wedding, L. M., Arrigo, K. R., Mills, M. M., Cameron, M. A., and Micheli, F.: Comparison of Cloud-Filling Algorithms for Marine Satellite Data, Remote Sensing, 12, 3313, https://doi.org/10.3390/rs12203313, 2020.

Strass, V. H.: The Expedition PS113 of the Research Vessel POLARSTERN to the Atlantic Ocean in 2018, Bremerhaven, Germany, 66 pp., https://doi.org/10.2312/BzPM_0724_2018, 2018.

Uyttendaele, M., Eden, A., and Skeliski, R.: Eliminating ghosting and exposure artifacts in image mosaics, in: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, https://doi.org/10.1109/CVPR.2001.991005, 2001.

Vidussi, F., Claustre, H., Manca, B. B., Luchetta, A., and Marty, J.-C.: Phytoplankton pigment distribution in relation to upper thermocline circulation in the eastern Mediterranean Sea during winter, Journal of Geophysical Research: Oceans, 106, 19939–19956, https://doi.org/10.1029/1999JC000308, 2001.

Vincent, O. R. and Folorunso, O.: A descriptive algorithm for sobel image edge detection, in: Proceedings of Informing Science & IT education Conference (InSITE), 97–107, https://doi.org/10.28945/3351, 2009.

Volpe, G., Buongiorno Nardelli, B., Colella, S., Pisano, A., and Santoleri, R.: Operational Interpolated Ocean Colour Product in the Mediterranean Sea, New Frontiers in Operational Oceanography, 227–244, https://doi.org/10.17125/gov2018.ch09, 2018.

von Appen, W.-J., Strass, V. H., Bracher, A., Xi, H., Hörstmann, C., Iversen, M. H., and Waite, A. M.: High-resolution physical–biogeochemical structure of a filament and an eddy of upwelled water off northwest Africa, Ocean Sci., 16, 253–270, https://doi.org/10.5194/os-16-253-2020, 2020.

Wang, Y., Gao, Z., and Liu, D.: Multivariate DINEOF Reconstruction for Creating Long-Term Cloud-Free Chlorophyll-a Data Records From SeaWiFS and MODIS: A Case Study in Bohai and Yellow Seas, China, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 12, 1383–1395, https://doi.org/10.1109/JSTARS.2019.2908182, 2019.

Wang, Y., Tang, R., Yu, Y., and Ji, F.: Variability in the Sea Surface Temperature Gradient and Its Impacts on Chlorophyll-a Concentration in the Kuroshio Extension, Remote Sensing, 13, 888, https://doi.org/10.3390/rs13050888, 2021.

Xi, H., Losa, S. N., Mangin, A., Soppa, M. A., Garnesson, P., Demaria, J., Liu, Y., d'Andon, O. H. F., and Bracher, A.: Global retrieval of phytoplankton functional types based on empirical orthogonal functions using CMEMS GlobColour merged products and further extension to OLCI data, Remote Sensing of Environment, 240, https://doi.org/10.1016/j.rse.2020.111704, 2020.

Xi, H., Losa, S. N., Mangin, A., Garnesson, P., Bretagnon, M., Demaria, J., Soppa, M. A., Hembise Fanton d'Andon, O., and Bracher, A.: Global Chlorophyll a Concentrations of Phytoplankton Functional Types With Detailed Uncertainty Assessment Using Multisensor Ocean Color and Sea Surface Temperature Satellite Products, Journal of Geophysical Research: Oceans, 126, https://doi.org/10.1029/2020JC017127, 2021.

Xi, H., Peeken, I., Gomes, M., Brotas, V., Tilstone, G. H., Brewin, R. J. W., Dall'Olmo, G., Tracana, A., Alvarado, L. M. A., Murawski, S., Wiegmann, S., and Bracher, A.: Phytoplankton pigment concentrations and phytoplankton groups measured on water samples collected from various expeditions in the Atlantic Ocean from 71° S to 84° N, PANGAEA [data set] https://doi.org/10.1594/PANGAEA.954738, 2023a.

Xi, H., Bretagnon, M., Losa, S. N., Brotas, V., Gomes, M., Peeken, I., Alvarado, L. M. A., Mangin, A., and Bracher, A.: Satellite monitoring of surface phytoplankton functional types in the Atlantic Ocean over 20 years (2002–2021), in: 7th edition of the Copernicus Ocean State Report (OSR7), edited by: von Schuckmann, K., Moreira, L., Le Traon, P.-Y., Grégoire, M., Marcos, M., Staneva, J., Brasseur, P., Garric, G., Lionello, P., Karstensen, J., and Neukermans, G., Copernicus Publications, State Planet, 1-osr7, 5, https://doi.org/10.5194/sp-1-osr7-5-2023, 2023b.

Xi, H., Bretagnon, M., Mehdipour, E., Demaria, J., Mangin, A., and Bracher, A.: Consistent long-term observations of surface phytoplankton functional types from space, in: 9th edition of the Copernicus Ocean State Report (OSR9), edited by: Karina von Schuckmann (Mercator Ocean International, France), Lorena Moreira (Nologin, Spain), Álvaro de Pascual Collar (Nologin, Spain), Marilaure Grégoire (University of Liège, Belgium), Pierre Brasseur (CNRS, France), Gilles Garric (Mercator Ocean International, France), Johannes Karstensen (GEOMAR Helmholtz Centre for Ocean Research Kiel, Germany), Piero Lionello (University of Salento, Italy), Marta Marcos (University of the Balearic Islands, Spain), Pierre-Marie Poulain (Istituto Nazionale di Oceanografia e di Geofisica Sperimentale (OGS), Italy), and Joanna Staneva (Helmholtz-Zentrum Hereon, Germany), Copernicus Publications, State Planet, 6-osr9, 7, https://doi.org/10.5194/sp-6-osr9-7-2025, 2025.

Articles

Short summary

Phytoplankton are vital for marine ecosystems and nutrient cycling, detectable by optical satellites. Data gaps caused by clouds and other non-optimal conditions limit comprehensive analyses like trend monitoring. This study evaluated DINCAE and DINEOF gap-filling methods for reconstructing chlorophyll a datasets, including total chlorophyll a and five major phytoplankton groups. Both methods showed robust reconstruction capabilities, aiding pattern detection and long-term ocean colour analysis.