the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
SICNetseason V1.0: a transformer-based deep learning model for seasonal Arctic sea ice prediction by integrating sea ice thickness data
Abstract. The Arctic sea ice is suffering dramatic retreating in summer and fall, which has far-reaching consequences on the global climate and commercial activities. Accurate seasonal sea ice predictions are significant in inferring climate change and planning commercial activities. However, seasonally predicting the summer sea ice encounters a significant obstacle known as the spring predictability barrier (SPB): predictions made later than May demonstrate good skill in predicting summer sea ice, while predictions made on or earlier than May exhibit considerably lower skill. This study develops a transformer-based deep-learning model, SICNetseason (V1.0), to predict the Arctic sea ice concentration on a seasonal scale. Including spring sea ice thickness (SIT) data in the model significantly improves the prediction skill at the SPB point. A 20-year (2000–2019) testing demonstrates that the detrended anomaly correlation coefficient (ACC) of Sep. sea ice extent (sea ice concentration > 15 %) predicted by our model at May/Apr. is improved by 7.7 %/10.61 % over the ACC predicted by the state-of-the-art dynamic model from the European Centre for Medium-Range Weather Forecasts (ECMWF). Compared with the anomaly persistence benchmark, the mentioned improvement is 41.02 %/36.33 %. Our deep learning model significantly reduces prediction errors of Sep.'s sea ice concentration on seasonal scales compared to ECMWF and Persistence. The spring SIT data is key in optimizing the SPB, contributing to a more than 20 % ACC enhancement in Sep.'s SIE at four to five months lead predictions. Our model achieves good generalization in predicting the Sep. SIE of 2020–2023.
- Preprint
(4732 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 15 Feb 2025)
-
CEC1: 'Comment on gmd-2024-200', Juan Antonio Añel, 27 Dec 2024
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention some lack of compliance with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlTo assure the replicability of your work it is necessary to have all the input and output data that you use and produce, respectively. You have provided a repository hosted by Zenodo for part of the input data and the code. However, in the repository it reads "Because the well-trained model is too large (10GB), it cannot be supported by the github". I understand that this is a sentence that comes from a previously existing GitHub site where you hosted these assets. However, it is clear that does not correspond to the current Zenodo repository. Moreover, Zenodo does not have problems with hosting 10 GB of data, so you must include in it all the input data that you have used. Also, in your "Code and Data Availability" section you link several webpages that provide access to the data that you use for your work. These are not valid and should be replaced by the repositories containing the exact input data. The sites that you list are only main "portals" to data, not the repositories for the exact input data that you use, and what you have to provide is the exact input data used to produce your work. Moreover, such sites are not valid repositories for long-term archival or scientific publication.
Additionally, you do not provide the output data produced by your model, and you must do it.
Therefore, you must reply to this comment as soon as possible with the relevant information (link and DOI) for the new repositories, as we request that you make it already available before submission and, of course, before the Discussions stage. Moreover, you must include in a potentially revised version of your manuscript the modified 'Code and Data Availability' section, with the DOI of the code (and another DOI for the dataset if necessary).
Note that if you do not fix these problems as requested, we will have to reject your manuscript for publication in our journal. In the meantime, I advice the Topical Editor to stall the peer-review process until these issues are solved, as this manuscript should not have been accepted in Discussion given the lack of compliance with the policy of the journal.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/gmd-2024-200-CEC1 -
AC1: 'Reply on CEC1', Yibin Ren, 27 Dec 2024
reply
Thanks for the comment.
We have uploaded the exact input data, output data, and the well-trained model weights to the new Zenodo repository with a DOI: https://doi.org/10.5281/zenodo.14561423. We have added detail introductions for the upload files to make sure the model can be run:
- The codes of the model are in the model_utils folders.
- The jupyter file "SICNet_season_training.ipynb" contains the flow of loading data, defining a model, and training the model.
- The jupyter file "SICNet_season_evaluating.ipynb" includes evaluating the trained model and drawing the ACC/BACC result.
- The training and testing data are in the Data/Data.zip file and can be loaded for training and evaluation of the model. The files "SIC_1979_2020_Jan_May_96.npy", "SIT_1979_2020_Jan_May_96.npy", "SIC_climate_mean_96.npy", and "mask_96.npy" are input data for model training. The "Output data" file contains the predicted results output by SICNetseason model, "Predicted_SIC_output.npy". The "Ground_SIC_output.npy" in this file is the ground truth from NSIDC.
- The Model_SICNet.zip contains the saved well-trained models. Please unpackage the zip file and copy the .h5 files to the corresponding file path "/Result/Model_SICNet". Then, the "SICNet_season_evaluating.ipynb" can be run to genernate the predicted values by the well-trained models.
We also revised the “Code and data availability” session as follow:
The satellite-observed sea ice concentration is obtained from the following site https://nsidc.org/data/NSIDC-0081/versions/1 (Cavalieri et al., 1996). The reanalysis of sea ice thickness data is open access (http://psc.apl.uw.edu/research/projects/arctic-sea-ice-volume-anomaly/data/model_grid) (Zhang and Rothrock, 2003). The seasonal predictions of SEAS5 are obtained from the ECMWF (https://cds.climate.copernicus.eu/cdsapp#!/dataset/seasonal-monthly-single-levels?tab=form) (Johnson et al., 2019). The code, the exact input/output data, and the saved well-trained weights of the developed model SICNetseason are available at https://doi.org/10.5281/zenodo.14561423.
We upload the revised manuscript with the new “Code and data availability” section as supplement.
-
CEC2: 'Reply on AC1', Juan Antonio Añel, 28 Dec 2024
reply
Dear authors,
Thanks for replying so quickly to my previous comment. However, I do not understand why you continue including in the Code and Data Availability section links to the sites that do not comply with the policy. It is my understanding that all the input data is in the Zenodo repository, and therefore, linking other sites is unnecessary and only generates doubts among readers about where to find the data to replicate your work. Therefore, the section should read simply "The code, the exact input/output data, and the saved well-trained weights of the developed model SICNetseason are available at https://doi.org/10.5281/zenodo.14561423."
If my interpretation is wrong, and it is necessary to use data that is not included in the Zenodo repository, and that is hosted in the other sites, then your manuscript is not in compliance with the policy of the journal, because such sites are not accepted as suitable repositories for scientific publication.
Please, clarify the situation regarding the input data, and delete the mentions to the sites that do not comply with the policy.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/gmd-2024-200-CEC2 -
AC2: 'Reply on CEC2', Yibin Ren, 28 Dec 2024
reply
Thanks for the comment. Sorry for my misunderstanding of the last comment. We have deleted other sites and revised the "Code and data availability" section as:
The code, the exact input/output data, and the saved well-trained weights of the developed model SICNetseason are available at https://doi.org/10.5281/zenodo.14561423.
The new revised manuscript is attached with the supplement.
-
CEC3: 'Reply on AC2', Juan Antonio Añel, 28 Dec 2024
reply
Dear authors,
Many thanks for the quick reply again. We can consider now the current version of your manuscript in compliance with our code and data policy, and therefore the Discussions stage and the peer-review process can continue.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/gmd-2024-200-CEC3
-
CEC3: 'Reply on AC2', Juan Antonio Añel, 28 Dec 2024
reply
-
AC2: 'Reply on CEC2', Yibin Ren, 28 Dec 2024
reply
-
AC1: 'Reply on CEC1', Yibin Ren, 27 Dec 2024
reply
-
RC1: 'Comment on gmd-2024-200', Anonymous Referee #1, 09 Jan 2025
reply
For comments - see pdf attached
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
204 | 43 | 10 | 257 | 1 | 2 |
- HTML: 204
- PDF: 43
- XML: 10
- Total: 257
- BibTeX: 1
- EndNote: 2
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1