Downscaling Multi-Model Climate Projection Ensembles with Deep Learning (DeepESD): Contribution to CORDEX EUR-44
- 1Meteorology Group, Instituto de Física de Cantabria (IFCA,CSIC-UC), Santander, Spain
- 2Meteorology Group, Dpto. de Matemática Aplicada y Ciencias de la Computación, Universidad de Cantabria, Santander, Spain
- 1Meteorology Group, Instituto de Física de Cantabria (IFCA,CSIC-UC), Santander, Spain
- 2Meteorology Group, Dpto. de Matemática Aplicada y Ciencias de la Computación, Universidad de Cantabria, Santander, Spain
Abstract. Deep Learning (DL) has recently emerged as an innovative tool to downscale climate variables from large-scale atmospheric fields under the perfect prognosis (PP) approach. Different Convolutional Neural Networks (CNN) have been applied under present-day conditions with promising results, but little is known about their suitability for extrapolating future climate change conditions. Here, we analyze this problem from a multi-model perspective, developing and evaluating an ensemble of CNN-based downscaled projections (DeepESD) for temperature and precipitation over the European EUR-44i (0.5º) domain, based on eight GCMs from the Coupled Model Intercomparison Project Phase 5 (CMIP5). To our knowledge, this is the first time that CNNs have been used to produce multi-model ensembles of downscaled projections, allowing to quantify inter-model uncertainty in climate change signals. The results are compared with those corresponding to an EUR-44 ensemble of regional climate models (RCMs) showing that DeepESD reduces distributional biases in the historical period. Moreover, the resulting climate change signals are broadly comparable to those obtained with the RCMs, with similar spatial structures. As for the uncertainty of the climate change signal (measured on the basis of inter-model spread), DeepESD yields a smaller uncertainty for precipitation, but a similar uncertainty for temperature.
To facilitate further studies of this downscaling approach we follow FAIR principles and make publicly available the code (a Jupyter notebook) and the DeepESD dataset. In particular, DeepESD is published at the Earth System Grid Federation (ESGF), as the first continental-wide PP dataset contributing to CORDEX (EUR-44).
Jorge Baño-Medina et al.
Status: final response (author comments only)
-
RC1: 'Comments on gmd-2022-57', Anonymous Referee #1, 16 Apr 2022
General comments:
In this paper, the author built a statistical downscaling model based on (Convolutional Neural Network) CNN. And used the reanalysis data as a reference for the training phase. Then, the author employed the trained model to downscale the ensemble of CMIP5 models over Europe. The results were compared with the CORDEX RCMs for the historical (1975-2005) and the projection (2006-2100) period. The results from the deepSD algorithm are contributing to CORDEX initiative, which is a breakthrough since the statistical downscaling based on artificial intelligence was not trusted for climate studies in the past few years. In general terms, the manuscript is well structured, the methodology is well described, the figures and tables are well organized and the results are adequately discussed. This paper falls within the scope of this journal. In this sense, the article can be approved after minor revisions.
Recommendation: Accept after minor revisions.
Specific comments:
Abstract:
The sentence “To our knowledge, this is the first time that CNNs have been used to produce multi-model ensembles” is not that accurate since there are previous studies that employed CNN to downscale the model ensemble (e.g., Babaousmail et al. (2021)).
Introduction:
- Introduction (third paragraph): “These methods are not computationally demanding…” This sentence needs a citation.
- The author should justify why he selected the RCP8.5 scenario out of the other scenarios? Also, was there any method employed for the selection of the 8 GCMs?
- The author didn’t justify why E-OBS v20 was selected as an observation in this study.
Data and methods:
- Since the author is comparing the ensemble resulting from 8 GCMs with the RCM ensemble projection, shouldn’t be the number of GCMs equal to the number of RCMs?
- Usually, when we train a neural net model, a validation phase is required after the training and it should be selected from the historical 25 years period, in this paper the author didn’t mention it.
- Concerning the CNN algorithm, we noticed that the CNN used to downscale precipitation has one more layer than the one for temperature (one output layer). Can the author explain the reason?
-
RC2: 'Comment on gmd-2022-57', Anonymous Referee #2, 21 Apr 2022
The comment was uploaded in the form of a supplement: https://gmd.copernicus.org/preprints/gmd-2022-57/gmd-2022-57-RC2-supplement.pdf
- RC3: 'Comment on gmd-2022-57', Anonymous Referee #3, 23 Apr 2022
-
RC4: 'Comment on gmd-2022-57', Anonymous Referee #4, 23 Apr 2022
general comments
The paper is an important contribution to the methodology of the downscaling of the climate change simulations. The results show that neural networks can bring meaningful regionalized climate change fields that can be a good complement to those obtained from regional climate models. The method works well for precipitation and temperature, but question of its usefulness for other climate fields, specially 3d fields, remains open.
The paper is clearly written and well structured. However, there are a few thinks that can be improved in the paper. In particular, I thing that the answer to this comments could improve the paper.
- with which criteria were chosen the predictand fields?
- how expensive in computer resources is the method?
- Why eobs was used? It is too smooth, what can be seen in the results, specially in places with high topography.
- It seems that the use of more output layers for the precipitation than in temperature makes the biases in the downscaling of precipitation as small as for temperature, but reduces the standard deviation in the downscaling (Figure 3) I think that this fact is related to the methodology and should be commented by the authors.
- Also, the fact that the simulation of R01 in DeepESD is closer to the RCMs that to the GCMs shows the importance of a good simulation of orographic precipitation, while SDII and Mean temperature in DeepESD and GCM are closer, probably reflecting the tuning of the GCMs (which usually is not made in RCMs) and the training with observations in DeepESD. The exception for temperature in ED looks strange for me and would be nice if you explain this behavior.
- Results in figures 3,4 and could be also contributed by the use of stochastic (deterministic) approaches for the precipitation (temperature)
specific comments
A more detailed description of the methodology for not specialists (most of readers, I guess) should be interesting. Can be added as an appendix
How does the interpolation method influences the results?
Why did not used a regional, high resolution reanalysis as predictands?
In the Iberian Peninsula and the Scandinavian peninsula the climate change signal in DeepESD is similar to that of the global models, while the opposite is true in central Europe. Could you elaborate on this?
-
CEC1: 'Comment on gmd-2022-57', Juan Antonio Añel, 25 Apr 2022
Dear authors,
After checking your manuscript, it has come to our attention that it does not comply with our Code and Data Policy.
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlWe can not accept the UDGat the Universidad de Cantabria as the repository for the data. According to our policy, it does not comply with the requirements (funding secured for decades, independence of a single institution, etc.) Moreover, after login into the portal to access the ECMWF_ERA-Interim-ESD data, it is necessary to request approval. This is against our policy; it would be the same that simply stating "data available upon request", and it compromises the replicability of the work. You should move the data necessary to a public repository that complies with our policy (e.g. Zenodo, which you already use). Without being fully familiar with your work, I understand that a complete dataset of ERA fields can have a substantial size (several hundreds of GB) that prevents it from being feasible. Please save a small dataset of predictors as a sample that improves the replicability in such a case. Also, be aware that in this way, the Jupyter notebooks should link the new repositories and not the servers at the Universidade de Cantabria.
Also, you refer to GitHub several times in the Code and Data availability section. GitHub is not a suitable repository for scientific publication. GitHub itself instructs authors to use other alternatives for long-term archival and academic publishing, such as Zenodo (GitHub provides a direct way to copy your project to a Zenodo repository). Therefore, please, publish frozen versions of the code used in this work (this includes C4R) in one of the appropriate repositories, and remove from the text and notebooks the mentions to GitHub, as they can be confusing to the reader.
A note, the Terms of Use to access the User Data Gateway at the Universidad de Cantabria are not available. The link shows an ERROR 500 message.
Juan A. Añel
Geosci. Model Dev. Executive Editor
-
RC5: 'Comment on gmd-2022-57', Anonymous Referee #5, 06 May 2022
Review of GMD-2022-57
Downscaling Multi-Model Climate Projection Ensembles with Deep Learning (DeepESD): Contribution to CORDEX EUR-44
By Baño-Medina et al.
In this study, the authors develop a downscaling method for climate variables based on the prefect prognosis approach using convolution neural networks (CNN) and evaluate how it extrapolates to unseen climatic states as projected by multi-model Earth system models. They focus on temperature and precipitation as well as on the European domain. The authors compare the output of their method to regional high-resolved climate model output and show that the CNN-approach reduces biases for the historical period and extrapolates to future climate change conditions in a plausible way.
The downscaling approach is very interesting and useful, especially the evaluation of its extrapolating skill to a different system state. Overall, the manuscript is well written. I have a few general comments and a short list of specific comments. Thus, I recommend minor revisions before publication.
- Could you make flowchart of your workflow and include as a figure in the methods section? It is a bit hard to follow your exact procedure.
- Isn‘t the comparison to observations between unconstrained mechanistic models (i.e. GCMs) and CNNs trained on observations "unfair"? If you did some nudging procedure with GCMs you would also end up with model output better fitting observations. For the CNN training, did you split the observational data into train (validation) and test set (only train on 20 years and show performance for 10 years)? Again, a flowchart would help to understand what you did. If you show the performance of DeepESD for the test set and compare that to GCM output, it’d be “more fair”, but still, just by design we would expect that the CNN reproduces observations better than GCMs.
- You show that the CNN learns the "necessary" dynamics based on predictors of the historical period and extrapolates reasonable well using predictors from GCM output for projections. That is a very interesting point. I wonder if this simple bias correction for GCMs really does the trick, as the models considerably diverge over the climatic time-scales and very model specific regional biases emerge. Can you comment on whether other bias-correcting measures were tested? Overall, there is certainly a long list of potential further evaluation and testing steps that could be undertaken, but maybe it is enough for this model description paper.
Specific Comments:
L5: What is DeepESD standing for? Please introduce acronym before first usage.
LL33-34: The “perfect prognosis” approach is based on the assumption that GCMs don’t have systematic biases with respect to the observations that were used for training, right? Maybe you should include a short sentence here that addresses this aspect.
L55: I recommend to use another more static hosting platform for your code, e.g. Zenedo (https://zenodo.org/).
L60: Why did you use ERA-Interim reanalysis? It is outdated for quite some time now.
L62: I don’t understand your use of dashes (—) in the manuscript. Please check whether the make sense throughout the manuscript.
LL62-65: What about adding high-resolution orography description as static predictor?
L85: Why did you analyze both and can you provide the reason why you settled with the deterministic one?
L88: Please stick to the tenses (in this paragraph you mix present and past tense), i.e. do not switch between present and past tense when describing your results or methods. I recommend that you always use present tense when talking about your study, i.e. when describing your methods, your results etc., and use past tense when referring to already published studies.
L137: “contribute to increasing”
Figure 1: Add unit at lower right colorbar. Also, it’d be useful if you could include letter characters as pointers to subplots, e.g. a,b,c,d. This comment applies for all Figures.
Figure 4: Please be more specific about the numbers in the plots. Please provide more detailed information in the caption.
Figure 5: The mid-column misses a time axis. DeepESD is not “yellow” but “green”, no?
Jorge Baño-Medina et al.
Data sets
DeepESD jorge Baño, Rodrigo Manzanas, Ezequiel Cimadevilla, Jesús Fernández, José González, Antonio Cofiño, José Manuel Gutiérrez https://data.meteo.unican.es/thredds/catalog/esgcet/catalog.html
Model code and software
2022_Bano_GMD.ipynb Jorge Baño, Rodrigo Manzanas, Ezequiel Cimadevilla, Jesús Fernández, José González, Antonio Cofiño, José Manuel Gutiérrez https://github.com/SantanderMetGroup/DeepDownscaling/blob/master/2022_Bano_GMD.ipynb
Jorge Baño-Medina et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
450 | 150 | 18 | 618 | 4 | 5 |
- HTML: 450
- PDF: 150
- XML: 18
- Total: 618
- BibTeX: 4
- EndNote: 5
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1