A generalized spatial autoregressive neural network (GSARNN) method for three-dimensional spatial interpolation
- 1School of Earth Sciences, Zhejiang University, Hangzhou 310027, China
- 2Zhejiang Provincial Key Laboratory of Geographic Information Science, Hangzhou 310028, China
- 3Ocean Academy, Zhejiang University, Zhoushan 316022, China
- These authors contributed equally to this work.
- 1School of Earth Sciences, Zhejiang University, Hangzhou 310027, China
- 2Zhejiang Provincial Key Laboratory of Geographic Information Science, Hangzhou 310028, China
- 3Ocean Academy, Zhejiang University, Zhoushan 316022, China
- These authors contributed equally to this work.
Abstract. Spatial interpolation, which is one of the most important spatial analysis methods, predicts unsampled spatial data from the values of sampled points. Generally, the core of spatial interpolation is fitting spatial weights via spatial correlation. Traditional methods express spatial distances in a conventional Euclidean way and conduct relatively simple spatial weight calculation processes, limiting their ability to fit complex spatial nonlinear characteristics in multidimensional space. To tackle these problems, we developed a generalized spatial distance neural network (GSDNN) unit to generally and adaptively express spatial distances in complex feature space. By combining the spatial autoregressive neural network (SARNN) with the GSDNN unit, we constructed a generalized spatial autoregressive neural network (GSARNN) to perform spatial interpolation in three-dimensional space. The GSARNN model was examined and compared using two three-dimensional cases: a simulated case and a real Argo case. The experiment results demonstrated that exploiting the feature extraction ability of neural networks, the GSARNN achieved superior interpolation performance and was more adaptable than inverse distance weighted, ordinary Kriging, and SARNN methods.
Junda Zhan et al.
Status: final response (author comments only)
-
RC1: 'Comment on gmd-2022-198', Anonymous Referee #1, 03 Oct 2022
The paper "A generalized spatial autoregressive neural network (GSARNN) method for three-dimensional spatial interpolation" proposed a hybrid machine learning framework for interpolating three-dimensional spatial variables by combining SARNN and a newly developed generalized spatial distance neural network (GSDNN). This mixed neural network can capture the complicated spatial correlation between sample sites and produce a more representative estimate of spatial distance. The authors demonstrate the improved performance of the proposed method over the common interpolation methods of inverse distance weighted and regular Kriging, as well as the SARNN model, using simulated and measured datasets. This work is well-written overall, the approach is clearly explained, and the results are properly examined. However, I believe there are a few topics that require elaboration in order to increase the overall quality of this article. I'm excited to see a new version of this paper.
Major improvements:
1. GSDNN is critical in enhancing the GSARNN model's fitting accuracy. This highlights the significance of obtaining the correct spatial correlation between sample points by taking into account both the dataset's local and global qualities. I'm simply wondering if using GSDNN with traditional approaches will result in better interpolation results.
2. Section 2.1.2 should go over the kriging approach in greater detail. Please explain how the weight coefficient I is calculated.
3. The phrase "weight matrix" appears for the first time on Page 5, Line 6. Please provide some context.
4. You say in section 2.3.3 that you utilize variable learning rate for network training and explain how it changes during the training process. You should also consider the benefits of this customized learning rate.
5. The difference between all of these interpolation solutions is difficult to notice in Figure 11. I recommend graphing the difference between the interpolated and real values (interpolation error) and modifying the color scheme accordingly.
6. It is unclear what the x-axis signifies in Figures 8 and 12. Please include some text and figure descriptions.Minor suggestions:
1. In section 3.1.1, from Line 19 to Line 22, I suggest not overly emphasizing the benefits of this experiment using the simulated data. This point has been mentioned in earlier paragraphs.
2. In Formula 10, since k_ij is 1 for the situation i≠j, then should the off-diagonal elements simply be written as ρ_ij?
3. Multiline formulas, such as Formula 6, Formula 19, Formula 30, should be left aligned. The “int” in Formula 26 and Formula 27 seems to be redundant.
4. Page 2, Abstract, Line 3: “which is one of the most important” to “a fundamental”.
5. Page 2, Abstract, Line 10: “compared” to “compared with traditional methods”.
6. Page 2, Line 17: “continuous data” to “continuous field”.
7. Page 2, Line 32: “They” to “These methods”.
8. Page 3, Line 7: “Zeng et al.” to “In particular, Zeng et al.”.
9. Page 3, Line 23: “by combining the GSDNN unit with the SARNN to integrate generalized distances into the spatial interpolation method, we developed” to “by combining the GSDNN unit with the SARNN, we integrated generalized distances into the spatial interpolation method and developed”.
10. Page 4, Line 14: “deposit reserves” to “mineral deposit predication”.
11. Page 6, Line 6: “speed” to “rate”.
12. Page 9, Line 15: “the feature extraction and fitting ability of the GSARNN model are fully and persuasively tested” to “we can fully test the feature extraction and fitting ability of the GSARNN model”.
13. Page 9, Line 17: “the most authentic” to “the authentic”.
14. Page 10, Line 4: “sudden change” to “sudden variation”.
15. Page 10, Line 17: “adds” to “imposes”.
16. Page 10, Line 18: “ε” to “The term ε”.
17. Page 11, Line 25: “has” to “achieves”.
18. Page 16, Line 21: “in low depth area” to “at shallow depths”.
19. Page 17, Line 5: “indicating that there may be individual outliers” to “indicating the presence of potential outliers”.-
AC1: 'Reply on RC1', Zhen Hong Du, 31 Oct 2022
Dear Referee:
Thanks very much for your great support and constructive suggestions with regard to our manuscript. These comments are all valuable and very helpful for revising and improving our paper, as well as the important guiding significance to our researches. We have made our best efforts to improve our paper very carefully following your comments and suggestions. Our point by point response to the comments are given below. We hope the revised manuscript will be acceptable to your requirements. If still there are concerns, we will be happy to take care once we hear from you.
Major improvements:
Comment 1: GSDNN is critical in enhancing the GSARNN model's fitting accuracy. This highlights the significance of obtaining the correct spatial correlation between sample points by taking into account both the dataset's local and global qualities. I'm simply wondering if using GSDNN with traditional approaches will result in better interpolation results.
[Response]: Thanks for your comment. The GSDNN unit plays an important role in GSARNN model, taking variation characteristics of geographical elements in different directions into account. However, since there is no recognized true value of generalized spatial distance for training process, applying GSDNN unit to traditional methods will make the calculation process impossible to carry on. Therefore, the GSDNN unit can only be embedded in the NN method and participates in its overall training and calculation process. In other words, the generalized spatial distance is determined by the spatial characteristics of the elements to be interpolated, owning a specific connotation based on specific context of spatial elements. Some additional explanations have been added in the last paragraph of Section 2.3.1.
“Note that since there is no recognized true value of generalized spatial distance for training process, the GSDNN unit can only be embedded in the neural network-based method and participates in its overall training and calculation process. In other words, the generalized spatial distance is determined by the spatial characteristics of the elements to be interpolated, owning a specific connotation based on specific context of spatial elements.”
Comment 2: Section 2.1.2 should go over the kriging approach in greater detail. Please explain how the weight coefficient λ is calculated.
[Response]: Thanks for your helpful advice. We have properly supplemented the interpolation calculation process of the Kriging method, including how the weight coefficient λ is calculated. However, since the focus of this paper is on the GSARNN model, the derivation processes of some formulas in Kriging method have been omitted to make the length of Section 2.1.2 not too long.
See supplement for more revision details.
Comment 3: The phrase "weight matrix" appears for the first time on Page 5, Line 6. Please provide some context.
[Response]: Thanks for your instructive advice. The logic of space weight description in the original manuscript is not coherent enough, which will make some readers confused. The overall presentation of this part (Section 2.2) has been properly reorganized to make it easier to understand.
See supplement for more revision details.
Comment 4: You say in section 2.3.3 that you utilize variable learning rate for network training and explain how it changes during the training process. You should also consider the benefits of this customized learning rate.
[Response]: Thanks for your very helpful advice. The learning rate starts from αstart and increases gradually at the rate of k1 until αmax. A relatively small initial learning rate can prevent excessive fluctuation and convergence obstacle and the following increment of leaning rate can avoid the convergence rate at the early stage of the training process being too low. The maximum learning rate is maintained for n epochs. At this stage, the model can stably learn the spatial characteristics of the elements. The learning rate then gradually decreases exponentially at the rate of k2, ensuring that the model can sufficiently converge near the optimal position. The description of the customized learning rate benefits has been added in Section 2.3.3, Paragraph 3, which makes this strategy more reasonable.
See supplement for more revision details.
Comment 5: The difference between all of these interpolation solutions is difficult to notice in Figure 11. I recommend graphing the difference between the interpolated and real values (interpolation error) and modifying the color scheme accordingly.
[Response]: Thanks for your very instructive advice. According to your recommendation, we graph the cross-validation interpolation results as well as the interpolation errors of the four methods as Figure 10 (Figure 11 in the original manuscript). The difference of the interpolation performances between the four methods can be more easily distinguished now. The caption and paragraph corresponding to this figure have also been appropriately revised.
See supplement for more revision details.
Comment 6: It is unclear what the x-axis signifies in Figures 8 and 12. Please include some text and figure descriptions.
[Response]: Thanks for your helpful advice. The x-axis in Figures 8 and Figure 12 represents the identifier number of each sampled point after they are sorted in ascending order of real value. Without an explanation, readers may feel confused. Some descriptions have been added in Figure 8 and Figure 12.
See supplement for more revision details.
Minor suggestions:
Comment 1: In section 3.1.1, from Line 19 to Line 22, I suggest not overly emphasizing the benefits of this experiment using the simulated data. This point has been mentioned in earlier paragraphs.
[Response]: Thanks for your helpful advice. Redundant statements have been removed. In order to keep Section 3.1.1 consistent with Section 3.2.1, a sentence is added at the end of Section 3.1.1.
“This case compares the interpolation abilities of the GSARNN model and the other three models in three-dimensional space using the simulated dataset above.”
Comment 2: In Formula 10, since kij is 1 for the situation i≠j, then should the off-diagonal elements simply be written as ρij?
[Response]: Thanks for your helpful advice. The issue has been corrected.
See supplement for more revision details.
Comment 3: Multiline formulas, such as Formula 6, Formula 19, Formula 30, should be left aligned. The “int” in Formula 26 and Formula 27 seems to be redundant.
[Response]: Thanks for your helpful advice. Formula 6, Formula 19, Formula 30 have been corrected to be left aligned. The “int” in Formula 26 and Formula 27 is to take the integer part of the operation result, which is the coordinate of the point in the cube.
See supplement for more revision details.
Comment 4: Page 2, Abstract, Line 3: “which is one of the most important” to “a fundamental”.
[Response]: Thanks. The issue has been corrected.
Comment 5: Page 2, Abstract, Line 10: “compared” to “compared with traditional methods”.
[Response]: Thanks. The issue has been corrected.
Comment 6: Page 2, Line 17: “continuous data” to “continuous field”.
[Response]: Thanks. The issue has been corrected.
Comment 7: Page 2, Line 32: “They” to “These methods”.
[Response]: Thanks. The issue has been corrected.
Comment 8: Page 3, Line 7: “Zeng et al.” to “In particular, Zeng et al.”.
[Response]: Thanks. The issue has been corrected.
Comment 9: Page 3, Line 23: “by combining the GSDNN unit with the SARNN to integrate generalized distances into the spatial interpolation method, we developed” to “by combining the GSDNN unit with the SARNN, we integrated generalized distances into the spatial interpolation method and developed”.
[Response]: Thanks. The issue has been corrected.
Comment 10: Page 4, Line 14: “deposit reserves” to “mineral deposit predication”.
[Response]: Thanks. The issue has been corrected.
Comment 11: Page 6, Line 6: “speed” to “rate”.
[Response]: Thanks. The issue has been corrected.
Comment 12: Page 9, Line 15: “the feature extraction and fitting ability of the GSARNN model are fully and persuasively tested” to “we can fully test the feature extraction and fitting ability of the GSARNN model”.
[Response]: Thanks. The issue has been corrected.
Comment 13: Page 9, Line 17: “the most authentic” to “the authentic”.
[Response]: Thanks. The issue has been corrected.
Comment 14: Page 10, Line 4: “sudden change” to “sudden variation”.
[Response]: Thanks. The issue has been corrected.
Comment 15: Page 10, Line 17: “adds” to “imposes”.
[Response]: Thanks. The issue has been corrected.
Comment 16: Page 10, Line 18: “ε” to “The term ε”.
[Response]: Thanks. The issue has been corrected.
Comment 17: Page 11, Line 25: “has” to “achieves”.
[Response]: Thanks. The issue has been corrected. The similar issue in Section 3.2.3 has also been corrected.
Comment 18: Page 16, Line 21: “in low depth area” to “at shallow depths”.
[Response]: Thanks. The issue has been corrected.
Comment 19: Page 17, Line 5: “indicating that there may be individual outliers” to “indicating the presence of potential outliers”.
[Response]: Thanks. The issue has been corrected.
-
AC1: 'Reply on RC1', Zhen Hong Du, 31 Oct 2022
-
RC2: 'Comment on gmd-2022-198', Anonymous Referee #2, 10 Oct 2022
This manuscript presents a novel method for three-dimensional spatial interpolation. The technique employs neural networks for optimizing spatial weights calculation in a spatial auto-regression model. Based on that, the concept of generalized spatial distance is proposed to further enhance the spatial feature extraction ability of the model. The model is tested with simulated datasets and temperature observations of Argo. The presentation of the method, the evaluation procedure and the analysis are straightforward and clear.
I think there are some minor improvements that can be made to the manuscript.
- The authors only list the parameters of GSARNN model in detail in the manuscript. The configurations of traditional methods, such as the p value of IDW and the variation function adopted in Kriging, should also be mentioned in the comparison experiments.
- The results of case 2 turn out that the neural network-based models generate smoother spatial patterns than traditional methods. I wonder if that is worth discussing.
- I think the point of how long it takes to run the model deserves more discussion in the manuscript. The authors briefly mention this as a limitation in the conclusion section, but some basic statistics on how long it takes would be a helpful addition.
- Please express the information of GSDNN unit in Figure 2. Maybe Figure 1 is redundant and it can be merged into Figure 2.
- In Figure 13, it would be better to make clear that the values in the left column represent depths below the sea surface.
- In Table 1 and Table 4, you’d better change ‘Hyperparameters’ to ‘Hyper-parameters’.
- When a matrix or a vector is represented by a word or a character, it should be written in bold, such as in Formula 10.
-
AC2: 'Reply on RC2', Zhen Hong Du, 31 Oct 2022
Dear Referee:
Thanks very much for your great support and constructive suggestions with regard to our manuscript. These comments are all valuable and very helpful for revising and improving our paper, as well as the important guiding significance to our researches. We have made our best efforts to improve our paper very carefully following your comments and suggestions. Our point by point response to the comments are given below. We hope the revised manuscript will be acceptable to your requirements. If still there are concerns, we will be happy to take care once we hear from you.
Comment 1: The authors only list the parameters of GSARNN model in detail in the manuscript. The configurations of traditional methods, such as the p value of IDW and the variation function adopted in Kriging, should also be mentioned in the comparison experiments.
[Response]: Thanks for your very helpful advice. The power parameter of IDW method is 4 in two cases. In Kriging method, we adopt the gaussian model to fit the functional relationship between the semi-variogram and the spatial distance, which turns out to be the optimal variation function model among linear, gaussian, spherical and exponential models. Some explanations have been added in Section 3.1.2, Paragraph 2.
“Besides, the power parameter of IDW method is 4, and in Kriging method, we adopt the gaussian model to fit the functional relationship between the semi-variogram and the spatial distance, which turns out to be the optimal variation function model among linear, gaussian, spherical and exponential models.”
Comment 2: The results of case 2 turn out that the neural network-based models generate smoother spatial patterns than traditional methods. I wonder if that is worth discussing.
[Response]: Thanks for your very helpful advice. As you mentioned, the interpolation results of neural network-based models exhibit smoother spatial patterns with less noise than those of traditional methods. This indicates that neural network-based models can greatly reduce the influence of local extreme points on the points to be interpolated and acquire quite reasonable distributions of the geospatial elements through the non-linear fitting ability of neural networks. Some discussion has been added in Section 4 Discussion, Paragraph 2.
“In contrast, neural network-based models generate smoother interpolation surface than traditional methods. This indicates that neural network-based models can greatly reduce the influence of local extreme points on points to be interpolated and acquire quite reasonable spatial patterns of geospatial elements exploiting the non-linear fitting ability of neural networks.”
Comment 3: I think the point of how long it takes to run the model deserves more discussion in the manuscript. The authors briefly mention this as a limitation in the conclusion section, but some basic statistics on how long it takes would be a helpful addition.
[Response]: Thanks for your very insightful advice. As you mentioned, the model complexity of GSARNN is considerably higher than traditional methods. Nonetheless, compared with multifarious models in the fields of neural networks and deep learning, the structure of GSARNN with a few hidden layers is relatively lightweight, so its training and calculation efficiency can be quite high. The GSARNN model usually converges to the optimal state within 15-20 minutes in our cases since it can take advantage of mighty parallel computing capabilities of GPU units and distributed computing structures to accelerate the training process. Although the efficiency of Kriging method is better than GSARNN model, under the same condition, it still takes about 10 minutes to fit the functional relationship between the semi-variogram and the distance using “pykrige”. However, as the number of sampled points increases, the number of input neurons and output neurons of the GSARNN will also increase, resulting in the expansion of network parameters and the extension of training time inevitably. How to maintain a stable and acceptable training time given different sample data volumes is an important problem to be tackled in further researches.
Some discussions have been added in the end of Section 3.1.3 (an additional paragraph) and Section 5 Conclusion, Paragraph 4.
“In addition, compared with multifarious models in the fields of deep learning, the structure of GSARNN is relatively lightweight, so its training and calculation efficiency can be quite high. Taking advantage of mighty parallel computing capabilities of GPU units and distributed computing structures to accelerate the training process, the GSARNN model usually converges to the optimal state within 15-20 minutes in our cases. Although the efficiency of Kriging method is better than GSARNN model, under the same condition, it still takes about 10 minutes to fit the functional relationship between the semi-variogram and the distance.”
“In addition, as the number of sampled points increases, the number of input neurons and output neurons of the GSARNN will also increase, resulting in the expansion of network parameters and the extension of training time inevitably. Therefore, how to maintain a stable and acceptable training time given different sample data volumes is an important problem to be tackled in further researches.”
Comment 4: Please express the information of GSDNN unit in Figure 2. Maybe Figure 1 is redundant and it can be merged into Figure 2.
[Response]: Thanks for your very instructive advice. The information of GSDNN unit is shown with GSARNN model structure in the same figure (Figure 1 in the revised manuscript) now. Due to the deletion of the original Figure 1, the numbers of subsequent figures and their related text have also been revised accordingly.
See supplement for more revision details.
Comment 5: In Figure 13, it would be better to make clear that the values in the left column represent depths below the sea surface.
[Response]: Thanks for your helpful advice. A description (“section depth”) of the values in the left column in Figure 13 (Figure 12 in the revised manuscript) has been added to avoid readers’ confusion.
See supplement for more revision details.
Comment 6: In Table 1 and Table 4, you’d better change ‘Hyperparameters’ to ‘Hyper-parameters’.
[Response]: Thanks for your advice. All the “hyperparameters” in the manuscript have been revised to “hyper-parameters”.
Comment 7: When a matrix or a vector is represented by a word or a character, it should be written in bold, such as in Formula 10.
[Response]: Thanks for your very helpful advice. All the matrices and vectors in formulas and paragraphs have been revised to bold. In addition, matrices are uniformly represented by upper case letters, and vectors are represented by lower case letters.
-
CEC1: 'Comment on gmd-2022-198', Juan Antonio Añel, 25 Oct 2022
Dear authors,
Checking your manuscript, have come to our attention a couple of issues with your code and technical details given in the text.
First, your code in FigShare does not include a license.
If you do not include a license, despite what you state in the README file, the code is not "open-source"; it continues to be your property. Therefore, you should add a free software/open-source (FLOSS) license to the repository and mention it in the code. We recommend the GPLv3. You only need to include the file 'https://www.gnu.org/licenses/gpl-3.0.txt' as LICENSE.txt with your code. Also, you can choose other options, for example, the GPLv2, Apache License, MIT License, etc.Also, for your work, you use TensorFlow. This is not clear in the manuscript, and it is an important detail. Therefore, please, add more detail in the Methods section of the text about the specific software used and their versions.Regards,Juan A. AñelGeosci. Model Dev. Exec. Editor-
AC3: 'Reply on CEC1', Zhen Hong Du, 31 Oct 2022
Dear Editor:
We would like to express our great gratitude for you to give us the opportunity to revise our manuscript “A generalized spatial autoregressive neural network (GSARNN) method for three-dimensional spatial interpolation”. We have made a revision of the manuscript according to the comments and suggestions from you. These comments are all valuable and very helpful for improving our paper. We sincerely hope that our revisions would meet your requirements. Please don’t hesitate to contact us if you have any problems about the response.
Comment 1: First, your code in FigShare does not include a license. If you do not include a license, despite what you state in the README file, the code is not "open-source"; it continues to be your property. Therefore, you should add a free software/open-source (FLOSS) license to the repository and mention it in the code. We recommend the GPLv3. You only need to include the file 'https://www.gnu.org/licenses/gpl-3.0.txt' as LICENSE.txt with your code. Also, you can choose other options, for example, the GPLv2, Apache License, MIT License, etc.
[Response]: Thanks for your very helpful comment. According to your suggestion, we have adopted the GPLv3 license to our code and data in FigShare. A text file named “LICENSE.txt” obtained from 'https://www.gnu.org/licenses/gpl-3.0.txt' has been upload to the original FigShare link in Section code and data availability.
Comment 2: Also, for your work, you use TensorFlow. This is not clear in the manuscript, and it is an important detail. Therefore, please, add more detail in the Methods section of the text about the specific software used and their versions.
[Response]: Thanks for your very helpful comment. We are sorry for neglecting this important detail. The GSARNN and SARNN models are implemented using TensorFlow-GPU 1.13.0 and Python 3.5.4. This information has been added in the section of experiment implementation (Section 3.1.2, Paragraph 2), which we consider it may be more suitable than adding it in Methods section.
“The GSARNN and SARNN models are implemented using TensorFlow-GPU 1.13.0 and Python 3.5.4.”
-
AC3: 'Reply on CEC1', Zhen Hong Du, 31 Oct 2022
-
RC3: 'Comment on gmd-2022-198', Anonymous Referee #3, 28 Oct 2022
This paper developes a generalized spatial autoregressive neural network (GSARNN) by combining spatial autoregressive neural network (SARNN) with the generalized spatial distance neural network (GSDNN) unit to address the problem of spatial distance estimation in complex no-linear 3D space. This idea can overcome the limitation of simple spatial distance such as Euclidean distance. The success of this algorithm can be benifit various senecios such as geologic engineering, climate study, transportations, economics etc.
The paper and algorithm can be improved if the follow issues need to be addressed:
- In the simulated dataset at section 3.1.1. For component V1, do negative values affect the result? The distance resolution is set as 0.5, why? Why not 1 or 2..?
- In Section 3.1.2 authors experiment on “1,555 data points in the training set and 173 data points in the validation set”. Does the number of data points affect the performance of the algorithm? How?
- For section 3.1.3, what is the settings for OK method?
- What are limitations and challenges of your algorithm?
-
AC4: 'Reply on RC3', Zhen Hong Du, 31 Oct 2022
Dear Referee:
Thanks very much for your great support and constructive suggestions with regard to our manuscript. These comments are all valuable and very helpful for revising and improving our paper, as well as the important guiding significance to our researches. We have made our best efforts to improve our paper very carefully following your comments and suggestions. Our point by point response to the comments are given below. We hope the revised manuscript will be acceptable to your requirements. If still there are concerns, we will be happy to take care once we hear from you.
Comment 1: In the simulated dataset at section 3.1.1. For component V1, do negative values affect the result? The distance resolution is set as 0.5, why? Why not 1 or 2..?
[Response]: Thanks for your comment.
In our simulated case, V1 will not be negative according to our setting. Even if V1 is negative, it will not affect the performance of the GSARNN model. This is because the training process is driven by the loss (MSE in our experiments), and the loss represents the deviation of the interpolated value from the real value, which is equivalent for positive V1 and negative V1.
Setting distance resolution as 0.5, 1 or 2 is equivalent. Different distance resolutions are only different in global magnification, but the relative positions of sample points and the spatial distribution characteristics of element values are the same.
Comment 2: In Section 3.1.2 authors experiment on “1,555 data points in the training set and 173 data points in the validation set”. Does the number of data points affect the performance of the algorithm? How?
[Response]: Thanks for your very helpful comment. We use the 10-fold cross-validation method for model training, which is a common strategy. The 10-fold cross-validation randomly divides the dataset into 10 equal portions, among which nine portions serve as the training set, and the remaining portion is used as the validation set in turn. This ensures that each sampled point is used as training data and validation data, avoids the impact of data division on results, and fully verifies the performance of interpolation methods. A big fold number will lead to a long training time; a small fold number will cause the training set to be too small to fully learn the data characteristics.
Comment 3: For section 3.1.3, what is the settings for OK method?
[Response]: Thanks for your very helpful comment. In OK method, we adopt the gaussian model to fit the functional relationship between the semi-variogram and the spatial distance, which turns out to be the optimal variation function model among linear, gaussian, spherical and exponential models. Some explanations have been added in Section 3.1.2, Paragraph 2.
“Besides, the power parameter of IDW method is 4, and in Kriging method, we adopt the gaussian model to fit the functional relationship between the semi-variogram and the spatial distance, which turns out to be the optimal variation function model among linear, gaussian, spherical and exponential models.”
Comment 4: What are limitations and challenges of your algorithm?
[Response]: Thanks for your provident comment. In GSARNN model, as the number of sampled points increases, the number of input neurons and output neurons of the GSARNN will also increase, resulting in the expansion of network parameters and the extension of training time inevitably. This limits the application of the model in scenarios with large amounts of data. How to maintain a stable and acceptable training time given different sample data volumes is a challenge to be tackle in further researches. Some discussions have been added in Section 5 Conclusion.
“In addition, as the number of sampled points increases, the number of input neurons and output neurons of the GSARNN will also increase, resulting in the expansion of network parameters and the extension of training time inevitably. Therefore, how to maintain a stable and acceptable training time given different sample data volumes is an important problem to be tackled in further researches.”
Junda Zhan et al.
Junda Zhan et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
516 | 114 | 22 | 652 | 3 | 4 |
- HTML: 516
- PDF: 114
- XML: 22
- Total: 652
- BibTeX: 3
- EndNote: 4
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1