the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Improvement of near-surface wind speed modeling through refined aerodynamic roughness length in high-roughness surface regions: implementation and validation in the Weather Research and Forecasting (WRF) model version 4.0
Jiamin Wang
Jiarui Liu
Xiaogang Ma
Wenjun Tang
Ling Yuan
Zuhuan Ren
Aerodynamic roughness length (z0) is a key parameter determining near-surface wind profiles, significantly influencing wind-related studies and applications. In high-roughness surface areas, surface roughness has been substantially altered by land use changes such as urbanization. However, many numerical models still assign long-standing and fixed z0 based on traditional land cover types, neither accounting for shifts in land cover nor updating class-specific z0, leaving z0 values in high-roughness surface regions outdated and unreliable. To address this issue, this study proposed a cost-effective method to estimate z0 values at weather stations by adjusting z0 values to minimize the wind speed differences between ERA5 reanalysis data and weather station observation data. Using this approach, z0 values were derived for 1837 stations in the high-roughness surface areas across China. Based on these estimates, a high-resolution monthly gridded z0 dataset was then developed for high-roughness surface areas in China using Random Forest Regression algorithm. Simulations with Weather Research and Forecasting (WRF) model show that implementation of the new z0 dataset significantly improves the accuracy of 10 m wind speed over high-roughness surface areas, reducing mean wind speed errors by 79.8 % and 78.0 % compared to the default z0 in WRF and a latest gridded z0 dataset from Peng et al. (2022), respectively. Independent validations of 100 m wind speed against anemometer tower data further confirm the dataset's reliability. Therefore, this approach is valuable for wind-dependent studies and applications, such as urban planning, air quality management, and wind energy utilization, by enabling more accurate simulations of wind speed in high-roughness surface areas.
- Article
(7776 KB) - Full-text XML
-
Supplement
(3213 KB) - BibTeX
- EndNote
With the rapid advancement of urbanization and industrialization, human activities and energy use are increasingly concentrated along the settlement-landscape continuum (Liu et al., 2014), particularly in high-roughness areas such as built-up zones and inhabited vegetated landscapes. High-roughness surface regions not only significantly influence climate change but also are highly sensitive to meteorological and climatic conditions (Kammen and Sunter, 2016). Among various meteorological parameters, wind speed exerts great impacts on both environmental and human systems. One prominent example is that wind speed is a crucial consideration for assessing the atmospheric pollutant dispersion capability (Manju et al., 2002; Han et al., 2017). Specifically, mean flows and atmospheric turbulence are two key factors for pollutant removal (Wong and Liu, 2013; Di Nicola et al., 2022). Also, wind speed regulates pollen dispersion and distribution that are associated with public health (Roy et al., 2023). The utilization of wind energy in high-roughness surface areas also depends on wind speed distribution (Ishugah et al., 2014; Stathopoulos et al., 2018; Tasneem et al., 2020). Proper utilization, through measures such as suburban wind farms or building-integrated turbines, can minimize the need for transmission infrastructure. Beyond energy considerations, wind speed characteristics play a critical role in design and planning of human settlements, influencing both contemporary building practices (Hadavi and Pasdarshahri, 2020) and the preservation of historical-cultural heritage (Li et al., 2021). Therefore, accurately characterizing wind speed is essential for guiding systematic regulation and promoting sustainable development in high-roughness surface areas.
Aerodynamic roughness length (z0) is a crucial parameter that determines near-surface wind speed profiles (Stull, 1988). As a key input for atmospheric models, z0 significantly influences wind speed-related applications, however, its representation in existing numerical models often oversimplifies real-world conditions. Specifically, most models, such as the widely used European Centre for Medium-Range Weather Forecasts Reanalysis v5 (ERA5), determine z0 with long-standing and fixed values based on traditional land cover types. Such treatment fails to reflect the impact of transitions between surface types and changes in roughness elements within the same type, particularly the complexity of urban structures, thereby posing significant challenges for accurate wind speed simulation and prediction over high-roughness surface areas (Wang et al., 2024). Numerous studies have demonstrated that the changes of z0, caused by land use changes, particularly urbanization and industrialization, as well as deforestation and afforestation, significantly impacted wind speed. For instance, the increase in z0 has explained 70 % of the wind speed reduction in Europe (Wever, 2012) and caused a 1.1 m s−1 decrease in eastern China (Wu et al., 2017). Furthermore, Zhang et al. (2019) identified z0 changes as a primary driver of long-term wind speed trends in China, Europe, and North America. In line with these findings, Luu et al. (2023) showed that the rise in z0, caused by shifts from short vegetation to high vegetation and urbanization, partly contributes to the decline in mean and maximum surface wind speed over Western Europe. A similar mechanism operated in Canada. At Sudbury Airport (Ontario), 10 m wind speeds declined by ∼34 % during 1975–1995 mainly due to reforestation-induced increases in surface roughness (Tanentzap et al., 2007). These findings highlight the need to refine z0 in models by incorporating the effects of high-roughness surface areas across urban-town settings and tall-vegetation landscapes. In addition to wind speed, z0 also plays a significant role in environmental processes. The difference in z0 between urban and suburban areas is one of drivers causing larger intensity of daytime urban heat islands in humid regions (Zhao et al., 2014; Li et al., 2019). Winckler et al. (2019) showed that roughness changes are a primary control on deforestation's biogeophysical effects, notably surface temperature responses. Therefore, accurate z0 data in high-roughness surface areas can not only enhance the performance of atmospheric numerical models, but also provide scientific support for formulating sustainable urban environmental management strategies.
The estimation of z0 in high-roughness surface areas traditionally relies on three primary approaches: the micrometeorological method, the morphometric method, and a combination of these two methods. The micrometeorological method, based on the Monin-Obukhov similarity theory (Monin and Obukhov, 1954), typically calculates z0 using observations from flux or anemometer towers (Grimmond et al., 1998; Liu et al., 2018). Although theoretically robust, this method is limited by high costs of instruments and infrastructure (Grimmond and Oke, 1999), as well as the need for homogeneous surface conditions (Wieringa, 1993; Bottema and Mestayer, 1998). The morphometric method usually formulates mathematical models based on geometric characteristics and distribution density of high-roughness surface areas (Raupach, 1992, 1994; Bottema and Mestayer, 1998; Macdonald et al., 1998; Kanda et al.,2013; Shen et al., 2022, 2024). However, these models often suffer from simplified assumptions and require high-resolution surface feature data, which are costly to acquire (Grimmond and Oke, 1999; Zhang et al., 2017). The combination method, which establishes a relationship between the z0 ground truth obtained from micrometeorological method and high-resolution surface feature data for regional-scale applications, has shown promise in specific regions, such as Tokyo and Nagoya (Kanda et al., 2013), Beijing (Zhang et al., 2017), and Osaka subregions (Duan and Takemi, 2021). Nevertheless, the limitations of the former two methods hinder its broader applications. Therefore, there is a considerable lack of reliable z0 data in high-roughness surface regions.
To address the aforementioned challenges, this study proposed a low-cost method for estimating z0 by integrating 10 m wind speed at China Meteorological Administration (CMA) stations with 10 m wind speed and z0 from ERA5 reanalysis data. This approach takes advantage of the synergy between CMA's high-density station distribution and ERA5 reanalysis' temporal continuity to substantially enhance the sample size of z0 estimates. Based on these estimates, we have developed a high-resolution monthly z0 dataset for high-roughness surface areas in China using Random Forest Regression (RFR) algorithm. The applicability of the new z0 dataset has been assessed through its implementation in the Weather Research and Forecasting (WRF) model for wind speed simulation. This study contributes to the advancement of mesoscale wind speed simulation over high-roughness surface environments, which can promote wind field-dependent studies, such as urban planning, wind energy utilization, and air quality management.
2.1 Data
In this study, we mainly utilized monthly gridded z0 dataset from ERA5 (Hersbach et al., 2020, 2023a), referred to as z0_ERA5, along with hourly 10 m wind speed data from both ERA5 (Hersbach et al., 2023b) and surface weather station observations provided by the(CMA) during 2015–2019, to derive z0 estimates at each CMA station.
To extend the site-scale z0 estimates into a gridded dataset at the regional scale, we applied the RFR algorithm, incorporating six key features: variance of the slope (), terrain standard deviation within 0.01° window (TSD), percent tree cover (PTC), leaf area index (LAI), normalized difference vegetation index (NDVI), and urban-rural classification (URC). was derived as an integral over orographic spectrum, capturing multi-scale orographic complexity with wave length from meter to 10 km (Beljaars et al., 2004). We obtained from the dataset accompanying the turbulent orographic form drag scheme in WRF (Zhou et al., 2018), which was processed from the global 30′′ GMTED2010 digital elevation model (Danielson and Gesch, 2011). TSD was calculated using elevation data from Shuttle Radar Topography Mission with a spatial resolution of 3 arcsec (Jarvis et al., 2008). The PTC data were obtained from the MOD44B Version 6.1 Vegetation Continuous Fields product (DiMiceli et al., 2022), which provides yearly data at a 250 m pixel resolution. The monthly 1 km NDVI data were acquired from MOD13A3 product (Didan, 2021). The LAI data with an 8 d temporal interval and 500 m spatial resolution were sourced from Yuan et al. (2011) and Lin et al. (2023). URC data were extracted from a 1 km global human settlements map, which categorizes the rural-urban continuum into 19 distinct types (Li et al., 2022, 2023). To generate a monthly z0 dataset at a spatial resolution of 0.01°×0.01°, all input datasets were linearly interpolated or resampled to the target resolution. LAI data were averaged monthly by assigning each 8-day interval to the closest month.
Additionally, to compare with the existed z0 datasets, a latest z0 dataset developed by Peng et al. (2022) (denoted as z0_Peng) was used by integrating it into the WRF model for wind speed simulation. This dataset was generated by applying machine learning techniques to integrate FLUXNET ground-based observations and MODIS remote sensing data. Moreover, 100 m wind speed data from 589 anemometer towers in China were utilized for two critical purposes. First, the comparison between tower observations and ERA5 100 m wind speed data (Hersbach et al., 2023b) was used to validate the feasibility of the assumption in the z0 estimation method. Second, tower data were used as independent validations to evaluate the impact of refined z0 on wind speed simulations. These anemometer towers cover varying periods between 2004 and 2022 with a temporal resolution of 10 min.
2.2 Method for deriving z0 at CMA stations
First, the theoretical basis for deriving z0 at CMA stations is presented. In the framework of Monin-Obukhov similarity theory (Monin and Obukhov, 1954), the neutral logarithmic wind profile can be expressed with Eq. (1).
where uz is the wind speed (m s−1) at height z, the measuring height above ground (m); u* is the friction velocity (m s−1); k is the von Karman constant and equals to 0.4, and d is the zero-plane displacement height (m), calculated as using a widely accepted empirical formula (Watts et al., 2000).
Based on Eq. (1), the 100 m neutral wind speed for ERA5 and CMA stations can be expressed in Eqs. (2) and (3), respectively.
and then z0 values at CMA stations can be estimated by the following three steps:
First, we assumed: (1) the near-surface wind speed difference between ERA5 and CMA is primarily attributed to z0, and the influence of z0 diminishes with height. Consequently, the 100 m wind speed from ERA5 reanalysis is considered comparable to that from observations; (2) the impact of atmospheric stability on wind speed is identical for both ERA5 and CMA stations, allowing us to neglect stability correction terms under non-neutral conditions when deriving z0 for each hourly interval. The validity of these assumptions will be supported by the subsequent validation of wind speed simulations based on the derived z0 values (Sect. 3.3).
Second, we calculated the hourly z0_CMA values based on Eqs. (2) and (3). Given that u10_ERA5, u10_CMA, and z0_ERA5 values are known, an optimal z0_CMA value at each hour was derived through minimizing the difference between u100_ERA5 and u100_CMA calculated using Eqs. (2) and (3). To align with Assumption (1), we only retained z0_CMA values corresponding to times when the percentage difference between the calculated u100_ERA5 and u100_CMA was less than 10 %. ERA5 provides native 100 m winds, but here we use log-law – reconstructed 100 m winds from u10_ERA5 and z0_ERA5 instead. The reason is that the z0_CMA is derived under the assumption that stability-correction term is neglected. This means that the 100 m wind speeds in Eqs. (2) and (3) are both calculated without considering stability effects. However, the native ERA5 100 m wind field inherently embeds model-diagnosed stability influences. Therefore, directly pairing native ERA5 100 m winds with our CMA log-law construction would amplify the error in the derived ln (z0). In addition, the reconstruction offers two practical advantages. First, it requires fewer variables and a more transparent linkage, relying only on 10 m wind speeds and z0 from reanalysis, together with 10 m wind speeds from observations; Second, our results indicate that the z0 estimates are not particularly sensitive to the choice of reference height (see Sect. 4), so there is no need to use native reanalysis winds at heights other than 10 m.
Third, these retained z0_CMA values were grouped by months, and the monthly median values were selected as the final roughness length (z0_optimal). To avoid unreasonable estimates, the values of z0_optimal satisfying the condition that the absolute difference between ln (z0_optimal) and the corresponding ln (z0_ERA5) does not exceed 2 were considered valid.
Finally, we obtained monthly z0 estimates at 1837 stations out of the 2161 CMA stations.
2.3 Method for estimating gridded z0 at regional scale
Machine learning serves as an effective tool for extending the z0_optimal estimates at CMA stations to the regional scale. In this study, we employed the RFR algorithm (Eq. 4) (Breiman, 2001), a widely used method for similar applications (Duan and Takemi, 2021; Hu et al., 2020; Peng et al., 2022, 2023). All samples were divided into training and test subsets at a ratio of 8:2 for each bin of ln (z0_optimal), with the bins defined at intervals of 0.2. Sensitivity tests were conducted to determine the optimal number of decision trees in the RFR algorithm (Fig. 3b), resulting in the selection of 300 trees. The maximum depth of the trees was set to 18, and the minimum sample split was set to 5. Five-fold cross-validation shows the stable performance (Fig. 3d). Furthermore, the training and test results exhibit minimal sensitivity to the randomization seed used for dataset splitting (Fig. 3a). The resulting gridded aerodynamic roughness length data are referred to as z0_RFR.
2.4 Model configuration
To demonstrate the applicability of gridded z0_RFR data, the WRF (Version 4.0) Model (Skamarock et al., 2019) was used in this study to simulate wind speed with z0_RFR. For comparison, two additional simulations were performed: one utilized the WRF model's default roughness length (z0_Default) based on land cover types, and the other used z0_Peng.
First, we set z0_RFR and z0_Peng in WRF model, respectively. Given that z0_RFR is concentrated in high-roughness surface areas, the missing values over other regions are filled with z0_Default. Notably, the setting of z0_Peng in WRF is different from that of z0_RFR. In the WRF model, z0 values over bare fraction and vegetated fraction are determined separately. Specifically, in the Noah-MP land surface model, z0 is set to a constant over bare areas, while it is assigned by a look-up table according to vegetation type over vegetated areas. Peng et al. (2022) only provided the z0 over vegetation areas, which is the gridded mean effective roughness length including vegetated fraction and bare fraction. Thus, before conducting the simulation of wind speed in the WRF model with the gridded z0_Peng, we adjusted the roughness length over vegetated fraction in each grid from z0_Peng. The specific adjustment of z0_Peng in the WRF model is comprehensively described in the supplementary material Sect. 1. Apart from the difference in the sources of z0, other model configurations for z0_RFR, z0_Default, and z0_Peng are identical. The specific model configurations are as follows.
The simulation domains were configured with a “lat–lon” map projection, centered at coordinates 31.5° N, 109.0° E. As illustrated in Fig. 4b, nested domains were employed, with horizontal resolutions of 0.09° for Domain 1 (d01) and 0.03° for Domain 2 (d02). Specifically, d01 consisted of 225 grid points in the west–east direction and 191 in the south–north direction, while d02 consisted of 469 grid points in the west–east direction and 367 in the south–north direction. The vertical level had 70 layers and was stretched with dzstretch_s=1.1 and dzstretch_u=1.04. The model top was set to 50 hPa. The simulation periods spanned from 31 March to 30 April 2019. The integral time interval was set to 30 s. The re-initialization simulation was performed. Specifically, each simulation started at 12:00 LT (local time, LT = UTC+8) and ran for 36 h until 00:00 LT the next day. The first 12 h were considered the spin-up time and the remaining hours were used for analysis. Additionally, the initial and boundary conditions in the simulations were taken from hourly ERA5 reanalysis data, which provide pressure-level variables (geopotential height, air temperature, air humidity, and wind field) (Hersbach et al., 2023c) and surface variables (surface air temperature, humidity, pressure, 10 m wind field, sea level pressure, land surface temperature, soil temperature, and soil water content) (Hersbach et al., 2023b).
For physical parameterization schemes, the modified Thompson microphysics scheme (Thompson et al., 2008), Dudhia scheme for shortwave radiation (Dudhia, 1989), Rapid Radiative Transfer Model (RRTM) scheme for longwave radiation (Mlawer et al., 1997), Noah-MP land surface model (Niu et al., 2011), Yonsei University scheme for planetary boundary layer (Hong et al., 2006), and Grell–Freitas for cumulus parameterization (Grell and Freitas, 2014) were adopted. The cumulus parameterization scheme was activated only in the d01 domain and switched off in the d02 domain. A turbulent orographic form drag scheme with description of the dynamic drag caused by sub-grid orography was also applied (Beljaars et al., 2004; Zhou et al., 2018).
2.5 Calculation of statistical metrics
To evaluate the performance of the simulated wind speed with z0_RFR, z0_Default, and z0_Peng, three statistical metrics, including correlation coefficient (R), mean absolute bias (MAB), and root mean square error (RMSE), were used in temporal and spatial aspects. For the spatial performance assessment, the average 10 m wind speed simulation during 1–30 April 2019 at each station was used to calculate R, MAB, and RMSE with the CMA observations.
Regarding the temporal evaluation, the index (representing R, MAB, and RMSE) was calculated as the mean of the corresponding metric for hourly 10 m wind speed during 1–30 April 2019 across all CMA stations (Eq. 5).
where indexi denotes the respective metric value at the ith station, and M represents the total number of stations.
Additionally, to incorporate the direction of the bias into the wind-speed evaluation, we used the mean bias percentage (MBP) to quantify the signed bias of ERA5 reanalysis and simulated wind speeds against observations from CMA stations and anemometer towers (Eq. 6).
where represents mean wind speed from ERA5 or model simulations, and represents observed mean wind speed from CMA stations and anemometer towers.
To more intuitively compare the performance of wind speed simulations using z0_Default, z0_Peng, and z0_RFR, we also calculated the percentage reduction in wind speed error (PRE) achieved by z0_RFR relative to z0_Default and z0_Peng (Eq. 7).
where represents or , and denotes the mean 10 or 100 m wind speed from simulations based on z0_Default, z0_Peng, and z0_RFR, as well as from observations (CMA stations or anemometer towers).
3.1 The distribution characteristics of the z0 estimates at CMA stations
Figure 1a presents the spatial distribution of annual mean z0_optimal values derived from 1837 CMA stations, representing a subset of all accessible 2161 stations (Fig. S1a in the Supplement). These 1837 stations are primarily located in the eastern, southern, and central regions of China, with most stations having z0 values ranging between 0.2 and 1.5 m. In contrast, the excluded 324 stations are mostly distributed in the western regions of China. The exclusions of these stations can be attributed to the poor performance of ERA5 100 m wind speed data, which may result from altitude differences between the observation sites and the model terrain, thereby rendering our initial assumption, i.e. ERA5 100 m wind speed data are reliable for z0 estimation, invalid in these areas. To test this, we evaluated the performance of ERA5 100 m wind speed by comparing it with 589 anemometer tower data, since CMA stations only provide 10 m wind speed observations. Overall, ERA5 shows a smaller MBP in the eastern regions compared to the western regions (Fig. 2a). Therefore, the spatial distribution of the 1837 stations with valid z0 values is reasonable.
Figure 1(a) Spatial distribution of annual mean z0_optimal across 1837 CMA stations. (b) Difference between annual mean ln (z0_ERA5) and ln (z0_optimal) (i.e., ln (z0_ERA5) minus ln (z0_optimal)). (c) MBP of 10 m wind speed between ERA5 and CMA stations. (d) Boxplots illustrating the statistical distribution of the MBP for 10 m wind speed shown in (c) across different intervals of ln (z0) difference shown in (b).
Figure 2(a) MBP of 100 m wind speed between ERA5 and 589 anemometer towers. (b) Spatial distribution of urban-rural classification (URC) at 1837 CMA stations. The legend on the right indicates the URC codes, with the corresponding URC types labeled in panel (c). (c) Number of CMA stations for each URC. The numerical labels on the x axis represent the URC codes, with the specific URC types annotated on the bars. (d) Spatial distribution of high-roughness surface areas, which are composed of the 11 types covered by CMA stations in panel (b).
Additionally, as a consistency check, we examined how the difference in ln (z0) covaries with the 10 m wind-speed bias between ERA5 reanalysis and station observations. Compared to the annual mean ln (z0_optimal) derived from 1837 stations, the ln (z0_ERA5) values are systematically lower at most locations, resulting in positive MBP values of 10 m wind speed between ERA5 reanalysis data and station observations (Fig. 1b and c). The discrepancies between ln (z0_ERA5) and ln (z0_optimal) are likely due to rapid urbanization around the majority of CMA stations, characterized by extensive construction of buildings, which enhances surface roughness and consequently reduces near-surface wind speeds (Li et al., 2018; Zhang and Wang, 2021). However, the impact of urbanization is likely not considered in the ERA5 reanalysis. Figure 2b and c depicts the distribution of CMA stations classified by urban-rural categories. All stations are situated in high-roughness surface areas, with the majority located in urban and town regions, highlighting the need to incorporate urbanization effects into wind speed simulations to improve model accuracy. In contrast, at a few locations, where the ln (z0_ERA5) values are higher, the corresponding MBP values of 10 m wind speed are negative (Fig. 1b and c). The influence of ln (z0) difference on wind speed bias becomes more pronounced as the magnitude of ln (z0) deviation increases (Fig. 1d). Because ln (z0_optimal) is defined as a monthly median of hourly ln (z0), this cross-time statistic does not trivially inherit the instantaneous relationship implied by Eqs. (1)–(3). The monotonic, theory-consistent pattern observed in the binned ln (z0) difference versus wind-speed MBP therefore serves as a post-aggregation consistency check, rather than as proof. Accordingly, the robust consistency in the relationship between z0 and wind speed preliminarily supports that z0_optimal is reasonable, and suggests that improving z0 values over high-roughness surface areas in numerical models could significantly enhance wind speed simulation accuracy. The validity of z0_optimal will be assessed via independent validation by comparing simulated wind speeds with observations (Sect. 3.3).
3.2 Development of a gridded z0 dataset in high-roughness surface areas across China
To demonstrate the reliability and practicality of the estimated z0_optimal, we constructed a gridded z0 dataset based on these estimations in order to apply it in numerical simulations. Given that the estimated z0 values from 1837 stations are located within high-roughness surface areas consisting of 11 distinct types (Fig. 2b and c), this study developed a monthly gridded z0 dataset specifically for these categories of areas with a spatial resolution of 0.01°×0.01° using the RFR algorithm, referred to as z0_RFR. As a representative example, the z0_RFR dataset was generated for the year 2019, and its spatial coverage is shown in Fig. 2d. Although 2019 was chosen for demonstration, the RFR model itself is year-independent and can be applied to other years, provided that the required input features are available. Six feature variables closely related to z0 were used as inputs, encompassing topographic characteristics ( and TSD), vegetation conditions (PTC, LAI, and NDVI), and urban-rural distribution (URC).
Figure 3c shows that the RFR algorithm exhibits satisfactory performance on both training and test subsets. Feature importance analysis reveals that topographic features and PTC exert the most significant influence on ln (z0_RFR) (Fig. 3e). z0 is primarily controlled by the characteristic height of surface roughness elements, particularly their relief. Consequently, topographic features rank among the most influential factors. For vegetation-related features, PTC not only reflects the horizontal distribution of vegetation density but also serves as a proxy for the presence of tall roughness elements. By contrast, LAI mainly represents vegetation density, making it relatively less critical. Although LAI is strongly correlated with NDVI (R=0.72), its low importance is not driven by this collinearity. The URC ranks only fourth in feature importance. This ranking should not be interpreted as implying that land use or urbanization is insignificant. Rather, in our framework, URC is used mainly to delineate the study domain and to ensure that the RFR algorithm is applied only to high-roughness surface areas. The aerodynamic effects of high-roughness elements, such as tall vegetation, buildings, and other infrastructure, are already embedded in the wind observations from CMA stations. As a result, the influence of these roughness elements is directly reflected in the z0 values themselves, rather than being captured by the URC. Essentially, URC is not defined in terms of the morphological height and density of roughness elements; instead, derived from global land-cover and population data (Li et al., 2023), and therefore z0 is weakly sensitive to URC. For example, in categories of Urban center and Urban landscape, there remains non-negligible tree cover, mean tree fractions of approximately 10 % and 11 %, respectively (Fig. S1b). This lowers URC's ranking in feature-importance analyses. To better capture the influence of roughness elements, more detailed surface parameters, such as building height and building density, would be helpful. Once such data are widely accessible, they should be incorporated to further improve the accuracy of z0 estimates.
Figure 3Sensitivity analysis and performance evaluation of the Random Forest Regression (RFR) algorithm. (a) Sensitivity of RFR results to the randomization seed for training and test subsets splitting. R and RMSE represent correlation coefficient and root mean square error, respectively. (b) Determination of the optimal number of decision trees. R2 represents determination coefficient. (c) Performance of the RFR algorithm on the training and test subsets. The R and RMSE values are displayed. (d) Performance evaluation using five-fold cross-validation. (e) Importance scores of different feature variables.
The spatial distribution of ln (z0_RFR) shows limited monthly variability (Fig. S2). The most pronounced monthly variations occur predominantly in the northern regions, likely because these areas exhibit strong seasonal changes in vegetation structure and biomass. The annual mean spatial distribution of z0_RFR, with values in high-roughness surface areas generally falling within the range of 0.3 to 0.9 m, exhibits distinct patterns compared to z0_Default and z0_Peng (Fig. 4a). In comparison with z0_Default and z0_Peng, z0_RFR shows a more homogeneous spatial distribution pattern across China. Specifically, in northern China, z0_RFR values are consistently higher than those of both z0_Default and z0_Peng, with z0_Default generally higher than z0_Peng. Conversely, in southern China, z0_Peng values are significantly higher than both z0_Default and z0_RFR. However, in southeastern and southwestern China, z0_Default values exceed those of z0_RFR, while in the remaining southern areas, z0_RFR maintains higher values compared to z0_Default.
Figure 4(a) Spatial distributions of annual mean z0_RFR, z0_Default, and z0_Peng. The dashed rectangular box indicates the simulation domain (d02) in panel (b). (b) Nested simulation domains (d01: outer domain; d02: inner domain) with terrain standard deviation within 0.01° window (TSD) represented by color shading. (c) Spatial distributions of z0 used in simulations over d02 in April.
3.3 Application of the produced z0 dataset in wind speed simulation
To evaluate the performance of z0_RFR, we implemented it in the WRF model for wind speed simulations, as z0 directly affects near-surface wind speed. A ∼3 km simulation for April 2019 was conducted using the WRF model with z0_RFR over the regions outlined in Fig. 4a, which correspond to the d02 domain in Fig. 4b and represent the primary areas of z0_RFR concentration. April was selected because it is the month with the highest average wind speed in the target domain (Fig. S3), thus better reflecting the impact of z0 on wind speed. For comparison, two additional simulations were performed: one utilizing the WRF model's default roughness length (z0_Default) based on land cover types, and the other employing a recent z0 dataset (z0_Peng). In the northeastern, northern, and western regions of the d02 domain, both z0_Default and z0_Peng are generally lower than z0_RFR estimates, with z0_Peng having even lower values than z0_Default (Fig. 4c). However, this pattern reverses in the southeastern areas and along the surrounding area of the Sichuan Basin, where both z0_Default and z0_Peng surpass z0_RFR estimates, and notably, with z0_Peng having significantly higher values than z0_Default in these regions. These discrepancies in z0 would inevitably directly affect the accuracy of wind speed simulation. To evaluate the influence, we conducted a comprehensive assessment on both 10 and 100 m wind speed simulations, which represent typical heights for meteorological observations and wind energy applications, respectively.
3.3.1 Evaluation of the simulated 10 m wind speed
We first compared the simulated 10 m wind speed with observations from 753 CMA stations in study areas (d02 domain), showing that z0_RFR significantly enhances the accuracy of simulations. The improvement due to z0_RFR is evident in the smaller MBP values of the simulated wind speed (Figs. 5a and S4) and the closer alignment of average wind speed with observational data (Fig. 6a).
Figure 5(a) z0_Default, z0_Peng, and z0_RFR against observations from CMA stations. (b) Distribution of MBP in 10 m wind speed as a function of TSD. (c) Box plot of MBP in 10 m wind speed across different TSD bins.
Figure 6(a) Comparisons of mean 10 m wind speed in April between the simulations using z0_Default, z0_Peng, and z0_RFR versus observations from CMA stations. All points (grey circles and purple crosses) represent the 753 CMA stations within the d02 domain available for comparison, while the purple crosses represent the 148 stations utilized for independent validation, which were not used in training the z0_RFR model. The corresponding wind speed means, R, and RMSE of all stations are also indicated. (b) Distribution of the 148 independent CMA stations (black stars). Colored shaded areas represent TSD. (c) Comparison of daily mean 10 m wind speed between simulations and observations from 753 CMA stations. (d) Statistical metrics comparing simulated and observed 10 m wind speeds, including temporal and spatial R, MAB, and RMSE.
Specifically, the frequency histogram of MBP values reveals that the simulation results using z0_RFR mostly fall within an absolute MBP range of less than 30 %, with a substantial proportion concentrated below 10 %. In contrast, simulations employing z0_Default display a majority of MBP values exceeding 30 %, while simulations using z0_Peng are even poorer, with a larger number of stations falling within higher MBP ranges (Fig. 5a). The improvement in 10 m wind speed induced by z0_RFR is primarily evident in relatively flat regions. As TSD increases, the improvement gradually diminishes (Fig. 5b). z0_RFR outperforms both z0_Default and z0_Peng when TSD does not exceed 50 m, while it shows superior performance to z0_Default and comparable results to z0_Peng when TSD is greater than 50 m (Fig. 5c). Spatially, significant improvements are observed in the relatively flat eastern and northern study areas, whereas limited enhancements are found in regions with higher TSD surrounding the Sichuan Basin (Fig. S4). The limited improvement in relatively complex terrain arises because, in addition to z0, wind speed over these regions is influenced by multi-scale factors, including microscale terrain features (Ge et al., 2025), turbulent orographic form drags (Beljaars et al., 2004; Jiménez and Dudhia, 2012; Zhou et al., 2018), surface heating-induced mountain-valley circulations (Kim et al., 2021), mountain waves (Draxl et al., 2021) and other processes. Inaccurate parameterizations of these factors in numerical models can all lead to errors in wind speed simulations.
Table 1Mean 10 m wind speed at 753 CMA stations and mean 100 m wind speed at 50 anemometer towers from simulations and observations. Simulations were performed using z0_Default, z0_Peng, and z0_RFR. Also shown is the percentage reduction in wind speed error (PRE) achieved by z0_RFR relative to z0_Default and z0_Peng.
For the mean 10 m wind speed, simulations using z0_RFR (2.26 m s−1) show better agreement with the CMA observations (2.08 m s−1), whereas simulations with z0_Default and z0_Peng show greater overestimations, producing mean wind speeds of 2.97 and 2.90 m s−1, respectively (Fig. 6a and Table 1). In other words, z0_RFR decreases mean bias of 10 m wind speed by 79.8 % and 78.0 % compared to z0_Default and z0_Peng, respectively. Independent validations across 148 stations (Fig. 6b), from the test subset in the generation of z0_RFR, further confirm the superiority of z0_RFR (Fig. 6a). In addition, the improvements in 10 m wind speed were observed throughout the entire simulation period (Fig. 6c). Note that our experimental design, employing a re-initialization strategy, means that 30 independent simulation experiments were conducted in April. Thus, although the simulations were only conducted for a month, the consistent improvement across all days shows that the enhancement achieved by z0_RFR is robust. Moreover, the statistical metrics also show that the simulated 10 m wind speed using z0_RFR outperforms those using z0_Default and z0_Peng in temporal and spatial MAB and RMSE (Fig. 6d).
3.3.2 Evaluation of the simulated 100 m wind speeds
In addition to 10 m wind speed, the simulated 100 m wind speed was also improved through the use of z0_RFR (Fig. 7a and Table 1). Compared to observations from 50 anemometer towers (Fig. 7b), with an average 100 m wind speed of 6.26 m s−1, simulations based on z0_Default and z0_Peng overestimate the wind speed, with averages of 7.09 and 7.29 m s−1, respectively. However, the mean 100 m wind speed simulated using z0_RFR is 6.50 m s−1, closer to the observations (Table 1). This improvement using z0_RFR reduces wind speed mean bias by 71.1 % and 76.7 % compared to z0_Default and z0_Peng, respectively. Consistent with the performance of z0_RFR at 10 m wind speed, the improvement in 100 m wind speed is more pronounced in relatively flat regions (Fig. 7c). The outliers in Fig. 7a, where wind speed biases remain significant despite using z0_RFR, are located in areas with higher TSD. Furthermore, similar to its performance at 10 m height, z0_RFR demonstrates superior performance in simulated 100 m wind speed across both temporal and spatial metrics, with the exception of the temporal correlation coefficient (Fig. 7d). The relatively lower temporal R is reasonable, as the improvement in wind speed induced by z0 primarily stems from enhancements in the vertical profile.
Figure 7(a) Comparisons of mean 100 m wind speed in April between the simulations using z0_Default, z0_Peng, and z0_RFR versus observations from anemometer towers. The corresponding wind speed means, R, and RMSE of all towers are also indicated. (b) The locations of 50 anemometer towers (black triangles) utilized for 100 m wind speed evaluation. Colored shaded areas represent TSD. (c) Distribution of MBP in 100 m wind speed as a function of TSD. (d) Statistical metrics comparing simulated and observed 100 m wind speeds, including temporal and spatial R, MAB, and RMSE.
In summary, the 30 independent simulation cases conducted for April demonstrate that the z0 values derived from the combination of CMA observations and ERA5 data are highly reliable. The resulting gridded z0 dataset significantly reduces uncertainties in mesoscale near-surface wind speed simulations, particularly over relatively flat high-roughness surface areas. To further validate the robustness of the z0 estimation method and the resulting dataset, we conducted additional simulations for October 2019, a month characterized by generally weaker wind conditions (Fig. S3), using the same model configuration as in April. The results (Figs. S5–S7) also show consistent improvements when using z0_RFR. Station-wise correlations increase and errors decrease to a similar extent in both months, and the daily time series likewise show closer tracking of peaks and lulls. Taken together, these results further reinforce the reliability and applicability of the proposed z0 estimation under varying meteorological conditions. They also indicate that although phenology-driven changes in canopy structure and seasonal circulation modulate wind speeds, the performance advantage of the proposed z0 is not diminished.
Here we discuss the sensitivity and generality of the site z0 estimation approach with respect to the input simulation or reanalysis data, addressing concerns about potential methodological dependence on ERA5. Our study utilized ERA5 reanalysis data and CMA observations for initial z0 estimation. Compared to traditional meteorological or morphological methods, our approach can provide z0 values at large spatial coverage and low cost, and these values lead to clear improvements in WRF-simulated wind speeds at both 10 and 100 m above ground level. To assess whether the performance gain stems from improved z0 representation rather than from alignment with ERA5 reanalysis data, we carried out two additional sets of evaluations.
First, we applied the same approach to estimate z0 from WRF-simulated 10 m wind speed and the model's default z0 values (0.03°×0.03°), instead of ERA5. The z0 values estimated using this alternative dataset were found to be highly similar to those derived from ERA5 (Fig. 8), indicating that the method is not inherently reliant on ERA5 as a data source. The primary advantage of using ERA5 lies in its extensive spatiotemporal coverage, which offers greater convenience and consistency with observational data. Meanwhile, although 10 and 100 m winds over lands are not assimilated directly in ERA5, its 4D-Var system ingests a wide range of surface and upper-air observations that constrain boundary-layer structure and indirectly improve near-surface winds; this strengthens the credibility of using ERA5 as the reference field (Hersbach et al., 2020). However, the methodology itself is general and transferable to other datasets.
Figure 8(a) Comparison of z0 values from default WRF model (z0_WRF) and ERA5 (z0_ERA5). (b) Comparison of z0 estimates using different datasets. z0 derived from WRF represents the estimated values based on WRF simulations (10 m wind speed and default z0) and CMA station observations (10 m wind speed) during April 2019, while z0 derived from ERA5 denotes the estimates obtained in this study using ERA5 reanalysis data in April. (c) Distribution of station counts across intervals of the difference in derived ln (z0) (ln (z0) derived from ERA5 minus ln (z0) derived from WRF).
Moreover, the agreement between ERA5- and WRF-derived z0 values suggests that the spatial extent represented by the estimated site-level z0 values is not determined by the resolution of the reanalysis or simulation dataset used, but rather by the measurement height of wind observations at the stations. In this study, 10 m wind speeds from CMA stations were used. As a rule of thumb, the horizontal representativeness of wind measurements is approximately 10–100 times the measurement height. Therefore, z0 values estimated from 10 m wind observations are reasonably representative at ∼100 m–1 km scales, making the generation of 0.01° gridded z0 datasets for use in mesoscale simulations both appropriate and justified, with no evident resolution dependence observed. We compared simulation results at different resolutions. Leveraging the nested modeling setup used in this study, the d01 domain with a 0.09° resolution was treated as the coarse-resolution simulation, while d02 at 0.03° served as the fine-resolution simulation. The results show that, even at the coarser resolution, our gridded z0 dataset provides a clear advantage and substantially improves near-surface wind speed simulations (Figs. S8 and S9). However, for simulations at ∼1 km resolution and finer, such as urban-scale wind modelling, our z0 dataset cannot fully capture urban heterogeneity, because it did not incorporate key morphological parameters (e.g., building height and density) to distinguish between different urban forms. Therefore, an urban canopy model (UCM) would be a more appropriate choice. UCMs were conceived to operate at ∼0.5–1 km grid spacing to bridge mesoscale forecasting (∼105 m) with microscale transport/dispersion (∼100 m) models (Tewari et al., 2006; Chen et al., 2011), and they have been widely applied and validated in subsequent urban studies (Lian et al., 2018; Salamanca et al., 2018; Wang and Hu, 2021). Therefore, our z0 data are suitable and effective for mesoscale simulations at kilometer-level resolutions.
Second, we further validated the robustness of the refined z0 dataset (z0_RFR) by conducting additional WRF simulations driven by the reanalysis from National Centers for Environmental Prediction (NCEP) instead of ERA5. These results (Fig. S10 and Table S1 in the Supplement) still showed significant improvement in wind speed simulation performance when using z0_RFR, consistent with those driven by ERA5. This cross-reanalysis consistency demonstrates that the benefits are attributable to the improved surface representation through z0_RFR refinement, not simply tuning to match ERA5-driven wind fields.
Taken together, these findings confirm that the z0 estimation method proposed in this study is robust, flexible, and not dependent on alignment with a specific reanalysis dataset. It provides a practical framework for z0 estimation that can be widely applied across different reanalysis/simulation datasets and observational data with consistent benefits. However, this method is limited in regions with sparse or no surface weather stations. Notably, these regions, such as western and northern China, are rich in wind resources and are key targets for wind energy development. Therefore, producing high-quality gridded z0 datasets in these regions warrants further study by exploring alternative data sources, such as anemometer tower wind profiles, to supplement z0 truth values (Wang et al., 2024).
The two assumptions used in the z0 estimation are also discussed. Although these assumptions cannot be fully verified with the available data, they are pragmatically motivated and indirectly supported by the improved performance of wind-speed simulations using the resulting z0 estimates. Assumption 1 posits that the near-surface wind-speed discrepancy between ERA5 reanalysis and CMA observations is dominated by z0 and that the influence of z0 weakens with height, making ERA5 winds at higher levels within the surface layer comparable to observations. This is partly supported by the spatial pattern of estimated z0 (denser over eastern China, where 100 m wind-speed biases between ERA5 reanalysis and anemometer tower observations are smaller; Figs. 1c and 2) and by a sensitivity test on the reference height (Fig. S11a and c). When re-estimating annual-mean z0_CMA at 150 and 200 m, most stations show an absolute difference from the 100 m-based estimate below 0.2, indicating broad consistency across heights. A minority of stations exhibit slightly larger deviation, which may be influenced by local terrain complexity (Fig. S11b and d). Assumption 2 treats the effects of atmospheric stability on wind speed as effectively similar in ERA5 and at CMA sites, allowing us to omit explicit stability corrections in estimating z0_CMA. This simplification enhances methodological consistency and computational efficiency, and it is indirectly supported by the validation of simulated winds. Moreover, prior work has shown that neutral log-law method can perform comparably to stability-corrected scheme for vertical interpolation in US wind-resource assessments (Duplyakin et al., 2021), suggesting that such an approximate treatment seems feasible and a widely adopted simplification. Overall, although neither assumption can be fully verified with the presently available data, their practical applicability is evidenced by improved WRF wind-speed simulations. Future work, ideally leveraging multi-height wind profile observations and coincident stability metrics could further test these assumptions, yield more precise z0 estimates.
The representation of z0 in numerical models, typically determined by land cover types, may lead to significant uncertainties in wind speed simulations and predictions. Traditional methods for obtaining z0 ground truth are mainly constrained by high costs. In this study, we proposed a low-cost z0 estimation method, allowing the acquisition of z0 values at routine weather stations.
Specifically, this approach leverages 10 m wind speed and z0 values from ERA5 reanalysis data, along with observed 10 m wind speeds at CMA stations, to derive optimal z0 at stations by minimizing the difference in 100 m wind speeds between reanalysis and observations. Here, the 100 m wind speed is expressed with 10 m wind speed and z0 using similarity theory. Based on this approach, we derived z0 values at 1837 CMA stations out of a total of 2161 stations. These stations are located in high-roughness surface regions, indicating the estimated z0 values inherently include the effects of built-up and tall vegetation.
To validate the reliability and practicality of the estimation method, we utilized a Random Forest Regression algorithm, incorporating feature variables closely related to z0, to develop a monthly gridded z0 dataset for high-roughness surface areas in China with a spatial resolution of 0.01°×0.01°. The resulting ln (z0) values mainly range from −1 to 0. Simulations with WRF model show that, compared to the default z0 in WRF and a recent gridded z0 dataset developed by Peng et al. (2022), the z0 dataset constructed in this study has significantly improved the accuracy of near-surface wind speed simulations in high-roughness surface areas, particularly in relatively flat regions. Evaluations against weather station data and anemometer tower data show simulations with the new z0 dataset mitigate mean bias of 10 m wind speed by 79.8 % and 78.0 %, and mean bias of 100 m wind speed by 71.1 % and 76.7 %, respectively, compared to the default z0 in WRF and the z0 dataset from Peng et al. (2022).
In summary, this study developed a simple yet effective approach for correcting model z0, addressing the limitations of relying on empirical values assigned based on land cover types. The method shows particular effectiveness in z0 correction for high-roughness surface areas and offers valuable support for wind field-dependent studies and applications.
Code required to conduct the analyses herein is available at https://doi.org/10.5281/zenodo.15108200 (Wang, 2025).
The datasets used in this study fall into two categories based on their accessibility:
-
Publicly Available Datasets (accessible via DOI/URL).
-
The hourly wind speed data at 10 and 100 m heights are obtained from the ERA5 reanalysis dataset (Hersbach et al., 2020), accessible at https://doi.org/10.24381/cds.adbb2d47 (Hersbach et al., 2023b).
-
For the gridded datasets of z0 used in this study, z0_ERA5 (Hersbach et al., 2020) is available at https://doi.org/10.24381/cds.f17050d7 (Hersbach et al., 2023a), while z0_Peng (Peng et al., 2022) can be acquired by contacting the corresponding authors.
-
The initial and boundary conditions for the simulations are from the ERA5 dataset (Hersbach et al., 2020), which can be downloaded from https://doi.org/10.24381/cds.adbb2d47 (Hersbach et al., 2023b) and https://doi.org/10.24381/cds.bd0915c6 (Hersbach et al., 2023c).
-
The digital elevation data, with a spatial resolution of 3 arcsec, are sourced from the Shuttle Radar Topography Mission (SRTM) and can be downloaded from https://csidotinfo.wordpress.com/data/srtm-90m-digital-elevation-database-v4-1/ (last access: 8 November 2023) (Jarvis et al., 2008).
-
The urban-rural classification data (Li et al., 2023a) are available at https://doi.org/10.6084/m9.figshare.21716357.v6 (Li et al., 2022).
-
The variance of the slope () data can be obtained by contacting Zhou et al. (2018).
-
The Leaf Area Index (LAI) data (Lin et al., 2023; Yuan et al., 2011) are accessible at http://globalchange.bnu.edu.cn/research/laiv061 (Beijing Normal University Global Change Data Archive, 2022).
-
The percent tree cover data (DiMiceli et al., 2022) can be obtained from https://doi.org/10.5067/MODIS/MOD44B.061 and https://search.earthdata.nasa.gov/search/granules?p=C2565805839-LPCLOUD&pg[0][v]=f&pg[0][gsk]=-start_date&q=MOD44B&tl=1733462795.688!3!!&lat=-0.140625 (NASA EOSDIS, 2024a).
-
The NDVI data (Didan, 2021) are available from https://doi.org/10.5067/MODIS/MOD13A3.061 and https://search.earthdata.nasa.gov/search/granules?p=C2327962326-LPCLOUD&pg[0][v]=f&pg[0][gsk]=-startdate&q=MOD13A3&tl=1732851935.718!3!!&lat=-0.140625 (NASA EOSDIS, 2024b).
-
The NCEP forcing data (National Centers for Environmental Prediction/National Weather Service/NOAA/US Department of Commerce, 2025) are available from https://rda.ucar.edu/datasets/d083002/dataaccess/ (last access: 28 May 2025).
-
-
Restricted Datasets. We would like to clarify that the meteorological station data from the China Meteorological Administration (CMA) and the anemometer tower data used in this study are not publicly accessible but can be accessed through the following way. Specifically:
-
The data from anemometer towers are provided by China State Shipbuilding Corporation Haizhuang Windpower Co., Ltd., however, they are not accessible publicly because of their commercial interests. These data can be obtained by cooperation with the company.
-
The hourly 10 m wind speed data at meteorological stations are from the China Meteorological Administration (CMA). In accordance with the data policy of China, these data record are not directly accessible for public download via a website. Nevertheless, individuals interested in obtaining detailed information about data acquisition can reach out to the China Meteorological Data Service Center at their official website (http://data.cma.cn/en/?r=data/detail&dataCode=A.0012.0001, China meteorological data service centre, 2023).
-
The supplement related to this article is available online at https://doi.org/10.5194/gmd-18-10077-2025-supplement.
All authors contributed to the study. JW and KY conceived the study and conducted the design; JW, KY, and JL carried out data analyses; JW, XZ and XM performed the configuration of WRF model; WT processed data from CMA stations; LY provided the data from anemometer towers; ZR conducted data collection and cleaning of anemometer towers; JW and KY wrote the manuscript; all authors discussed, reviewed and edited the manuscript.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
This work was supported by the National Natural Science Foundation of China (grant nos. 42475138 and 42361144875) and Huadian Xizang Energy Co., Ltd. (grant no. 12IJD202400023).
This paper was edited by Guoqing Ge and reviewed by five anonymous referees.
Beljaars, A., Brown, A. R., and Wood, N: A new parametrization of turbulent orographic form drag, Q. J. Roy. Meteorol. Soc., 130, 1327–1347, https://doi.org/10.1256/qj.03.73, 2004.
Beijing Normal University Global Change Data Archive: Leaf Area Index (LAI), Beijing Normal University Global Change Data Archive [data set], http://globalchange.bnu.edu.cn/research/laiv061 (last access: 24 March 2022), 2022.
Bottema, M. and Mestayer, P. G.: Urban roughness mapping – Validation techniques and some first results, J. Wind Eng. Indust. Aerodynam., 74–76, 163–173, https://doi.org/10.1016/S0167-6105(98)00014-2, 1998.
Breiman, L.: Random forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001.
Chen, F., Kusaka, H., Bornstein, R., Ching, J., Grimmond, C. S. B., Grossman-Clarke, S., Loridan, T., Manning, K. W., Martilli, A., Miao, S., Sailor, D., Salamanca, F. P., Taha, H., Tewari, M., Wang, X., Wyszogrodzki, A. A., and Zhang, C.: The integrated WRF/urban modelling system: development, evaluation, and applications to urban environmental problems, Int. J. Climatol., 31, 273–288, https://doi.org/10.1002/joc.2158, 2011.
China meteorological data service centre: Daily Timed Data from automated weather stations in China, China meteorological data service centre [data set], http://data.cma.cn/en/?r=data/detail&dataCode=A.0012.0001 (last access: 6 May 2023), 2023.
Danielson, J. J. and Gesch, D. B.: Global multi-resolution terrain elevation data 2010 (GMTED2010), no. 2011-1073, US Geological Survey, https://doi.org/10.3133/ofr20111073, 2011.
Didan, K.: MODIS/Terra Vegetation Indices Monthly L3 Global 1 km SIN Grid V061, NASA EOSDIS Land Processes Distributed Active Archive Center [data set], https://doi.org/10.5067/MODIS/MOD13A3.061, 2021.
Di Nicola, F., Brattich, E., and Di Sabatino, S.: A new approach for roughness representation within urban dispersion models, Atmos. Environ., 283, 119181, https://doi.org/10.1016/j.atmosenv.2022.119181, 2022.
DiMiceli, C., Sohlberg, R., and Townshend, J.: MODIS/Terra Vegetation Continuous Fields Yearly L3 Global 250 m SIN Grid V061, NASA EOSDIS Land Processes Distributed Active Archive Center [data set], https://doi.org/10.5067/MODIS/MOD44B.061, 2022.
Draxl, C., Worsnop, R. P., Xia, G., Pichugina, Y., Chand, D., Lundquist, J. K., Sharp, J., Wedam, G., Wilczak, J. M., and Berg, L. K.: Mountain waves can impact wind power generation, Wind Energ. Sci., 6, 45–60, https://doi.org/10.5194/wes-6-45-2021, 2021.
Duan, G. and Takemi, T.: Predicting urban surface roughness aerodynamic parameters using random forest, J. Appl. Meteorol. Clim., 60, 999–1018, https://doi.org/10.1175/JAMC-D-20-0266.1, 2021.
Dudhia, J.: Numerical study of convection observed during the winter monsoon experiment using a mesoscale two-dimensional model, J. Atmos. Sci., 46, 3077–3107, https://doi.org/10.1175/1520-0469(1989)046<3077:NSOCOD>2.0.CO;2, 1989.
Duplyakin, D., Zisman, S., Phillips, C., and Tinnesand, H.: Bias characterization, vertical interpolation, and horizontal interpolation for distributed wind siting using mesoscale wind resource estimates, NREL/TP-2C00-78412, NREL – National Renewable Energy Laboratory, Golden, CO, USA, https://doi.org/10.2172/1760659, 2021.
Ge, C., Yan, J., Song, W., Zhang, H., Wang, H., Li, Y., and Liu, Y.: Middle-term wind power forecasting method based on long-span NWP and microscale terrain fusion correction, Renew. Energy, 240, 122123, https://doi.org/10.1016/j.renene.2024.122123, 2025.
Grell, G. A. and Freitas, S. R.: A scale and aerosol aware stochastic convective parameterization for weather and air quality modeling, Atmos. Chem. Phys., 14, 5233–5250, https://doi.org/10.5194/acp-14-5233-2014, 2014.
Grimmond, C. S. B. and Oke, T. R.: Aerodynamic properties of urban areas derived from analysis of surface form, J. Appl. Meteorol. Clim., 38, 1262–1292, https://doi.org/10.1175/1520-0450(1999)038<1262:APOUAD>2.0.CO;2, 1999.
Grimmond, C. S. B., King, T. S., Roth, M., and Oke, T. R.: Aerodynamic roughness of urban areas derived from wind observations, Bound.-Lay. Meteorol., 89, 1–24, https://doi.org/10.1023/A:1001525622213, 1998.
Hadavi, M. and Pasdarshahri, H.: Quantifying impacts of wind speed and urban neighborhood layout on the infiltration rate of residential buildings, Sustain. Cities Soc., 53, 101887, https://doi.org/10.1016/j.scs.2019.101887, 2020.
Han, Z., Zhou, B., Xu, Y., Wu, J., and Shi, Y.: Projected changes in haze pollution potential in China: an ensemble of regional climate model simulations, Atmos. Chem. Phys., 17, 10109–10123, https://doi.org/10.5194/acp-17-10109-2017, 2017.
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteorol. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020.
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 monthly averaged data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.f17050d7, 2023a.
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 hourly data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.adbb2d47, 2023b.
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 hourly data on pressure levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.bd0915c6, 2023c.
Hong, S. Y., Noh, Y., and Dudhia, J.: A new vertical diffusion package with an explicit treatment of entrainment processes, Mon. Weather Rev., 134, 2318–2341, https://doi.org/10.1175/MWR3199.1, 2006.
Hu, X., Shi, L., Lin, L., and Magliulo, V.: Improving surface roughness lengths estimation using machine learning algorithms, Agr. Forest Meteorol., 287, 107956, https://doi.org/10.1016/j.agrformet.2020.107956, 2020.
Ishugah, T. F., Li, Y., Wang, R. Z., and Kiplagat, J. K.: Advances in wind energy resource exploitation in urban environment: A review, Renew. Sustain. Energy Rev., 37, 613–626, https://doi.org/10.1016/j.rser.2014.05.053, 2014.
Jarvis, A., Reuter, H. I., Nelson, A., and Guevara, E.: Hole-filled SRTM for the globe Version 4, CGIAR-CSI SRTM 90 m Database [data set], http://srtm.csi.cgiar.org (last access: 8 November 2023), 2008.
Jiménez, P. A. and Dudhia, J.: Improving the representation of resolved and unresolved topographic effects on surface wind in the WRF model, J. Appl. Meteorol. Clim., 51, 300–316, https://doi.org/10.1175/JAMC-D-11-084.1, 2012.
Kammen, D. M. and Sunter, D. A.: City-integrated renewable energy for urban sustainability, Science, 352, 922–928, https://doi.org/10.1126/science.aad9302, 2016.
Kanda, M., Inagaki, A., Miyamoto, T., Gryschka, M., and Raasch, S.: A new aerodynamic parametrization for real urban surfaces, Bound.-Lay. Meteorol., 148, 357–377, https://doi.org/10.1007/s10546-013-9818-x, 2013.
Kim, G., Lee, J., Lee, M. I., and Kim, D.: Impacts of urbanization on atmospheric circulation and aerosol transport in a coastal environment simulated by the WRF-Chem coupled with urban canopy model, Atmos. Environ., 249, 118253, https://doi.org/10.1016/j.atmosenv.2021.118253, 2021.
Li, D., Liao, W., Rigden, A. J., Liu, X., Wang, D., Malyshev, S., and Shevliakova, E.: Urban heat island: Aerodynamics or imperviousness?, Sci. Adv., 5, eaau4299, https://doi.org/10.1126/sciadv.aau4299, 2019.
Li, X., Yu, L., and Chen, X.: New insights into urbanization based on global mapping and analysis of human settlements in the rural-urban continuum, figshare [data set], https://doi.org/10.6084/m9.figshare.21716357.v6, 2022.
Li, X., Yu, L., and Chen, X.: New insights into urbanization based on global mapping and analysis of human settlements in the rural-urban continuum, Land, 12, 1607, https://doi.org/10.3390/land12081607, 2023.
Li, Y., Sun, P. P., Li, A., and Deng, Y.: Wind effect analysis of a high-rise ancient wooden tower with a particular architectural profile via wind tunnel test, Int. J. Archit. Herit., 17, 518–537, https://doi.org/10.1080/15583058.2021.1938748, 2021.
Li, Z., Song, L., Ma, H., Xiao, J., Wang, K., and Chen, L.: Observed surface wind speed declining induced by urbanization in East China, Clim. Dynam., 50, 735–749, https://doi.org/10.1007/s00382-017-3637-6, 2018.
Lian, J., Wu, L., Bréon, F.-M., Broquet, G., Vautard, R., Zaccheo, T. S., Dobler, J., and Ciais, P.: Evaluation of the WRF-UCM mesoscale model and ECMWF global operational forecasts over the Paris region in the prospect of tracer atmospheric transport modeling, Elem. Sci. Anth., 6, 64, https://doi.org/10.1525/elementa.319, 2018.
Lin, W., Yuan, H., Dong, W., Zhang, S., Liu, S., Wei, N., Lu, X., Wei, Z., Hu, Y., and Dai, Y.: Reprocessed MODIS version 6.1 leaf area index dataset and its evaluation for land surface and climate modeling, Remote Sens., 15, 1780, https://doi.org/10.3390/rs15071780, 2023.
Liu, J., Gao, Z., Wang, L., Li, Y., and Gao, C. Y.: The impact of urbanization on wind speed and surface aerodynamic characteristics in Beijing during 1991–2011, Meteorol. Atmos. Phys., 130, 311–324, https://doi.org/10.1007/s00703-017-0519-8, 2018.
Liu, Z., He, C., Zhou, Y., and Wu, J.: How much of the world's land has been urbanized, really? A hierarchical framework for avoiding confusion, Landsc. Ecol., 29, 763–771, https://doi.org/10.1007/s10980-014-0034-y, 2014.
Luu, L. N., van Meijgaard, E., Philip, S. Y., Kew, S. F., de Baar, J. H. S., and Stepek, A.: Impact of surface roughness changes on surface wind speed over western Europe: A study with the regional climate model RACMO, J. Geophys. Res.-Atmos., 128, e2022JD038426, https://doi.org/10.1029/2022JD038426, 2023.
Macdonald, R. W., Griffiths, R. F., and Hall, D. J.: An improved method for the estimation of surface roughness of obstacle arrays, Atmos. Environ., 32, 1857–1864, https://doi.org/10.1016/S1352-2310(97)00403-2, 1998.
Manju, N., Balakrishnan, R., and Mani, N.: Assimilative capacity and pollutant dispersion studies for the industrial zone of Manali, Atmos. Environ., 36, 3461–3471, https://doi.org/10.1016/S1352-2310(02)00306-0, 2002.
Mlawer, E. J., Taubman, S. J., Brown, P. D., Iacono, M. J., and Clough, S. A.: Radiative transfer for inhomogeneous atmospheres: RRTM, a validated correlated-k model for the longwave, J. Geophys. Res., 102, 16663–16682, https://doi.org/10.1029/97JD00237, 1997.
Monin, A. S. and Obukhov, A. M.: Osnovnye zakonomernosti turbulentnogo peremesivanija v prizemnon sloe atmosfery (Basic laws of turbulent mixing in the atmosphere near the ground), Dokl. Akad. Nauk SSSR, 151, 1963–1987, 1954.
NASA EOSDIS: MODIS/Terra Vegetation Continuous Fields Yearly L3 Global 250 m SIN Grid V061, NASA EOSDIS [data set], https://search.earthdata.nasa.gov/search/granules?p=C2565805839-LPCLOUD&pg[0][v]=f&pg[0][gsk]=-start_date&q=MOD44B&tl=1733462795.688!3!!&lat=-0.140625 (last access: 3 October 2024), 2024a.
NASA EOSDIS: MODIS/Terra Vegetation Indices Monthly L3 Global 1 km SIN Grid V061, NASA EOSDIS [data set], https://search.earthdata.nasa.gov/search/granules?p=C2327962326-LPCLOUD&pg[0][v]=f&pg[0][gsk]=-start_date&q=MOD13A3&tl=1732851935.718!3!!&lat=-0.140625 (last access: 22 September 2024), 2024b.
National Centers for Environmental Prediction/National Weather Service/NOAA/US Department of Commerce: NCEP FNL Operational Model Global Tropospheric Analyses, continuing from July 1999, updated daily, Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory [data set], https://doi.org/10.5065/D6M043C6, 2025.
Niu, G. Y., Yang, Z. L., and Mitchell, K. E.: The community Noah land surface model with multiparameterization options (Noah-MP): 1. Model description and evaluation with local-scale measurements, J. Geophys. Res.-Atmos., 116, D12109, https://doi.org/10.1029/2010JD015139, 2011.
Peng, Z., Tang, R., Jiang, Y., Liu, M., and Li, Z. L.: Global estimates of 500 m daily aerodynamic roughness length from MODIS data, ISPRS J. Photogram. Remote Sens., 183, 336–351, https://doi.org/10.1016/j.isprsjprs.2021.11.015, 2022.
Peng, Z., Tang, R., Liu, M., Jiang, Y., and Li, Z. L.: Coupled estimation of global 500 m daily aerodynamic roughness length, zero-plane displacement height and canopy height, Agr. Forest Meteorol., 342, 109754, https://doi.org/10.1016/j.agrformet.2023.109754, 2023.
Raupach, M. R.: Drag and drag partition on rough surfaces, Bound.-Lay. Meteorol., 60, 375–395, https://doi.org/10.1007/BF00155203, 1992.
Raupach, M. R.: Simplified expressions for vegetation roughness length and zero-plane displacement as functions of canopy height and area index, Bound.-Lay. Meteorol., 71, 211–216, https://doi.org/10.1007/BF00709229, 1994.
Roy, P., Chen, L. W. A., Chen, Y. T., Ahmad, S., Khan, E., and Buttner, M.: Pollen dispersion and deposition in real-world urban settings: A computational fluid dynamic study, Aerosol Sci. Eng., 7, 543–555, https://doi.org/10.1007/s41810-023-00198-1, 2023.
Salamanca, F., Zhang, Y., Barlage, M., Chen, F., Mahalov, A., and Miao, S.: Evaluation of the WRF-urban modeling system coupled to Noah and Noah-MP land surface models over a semiarid urban environment, J. Geophys. Res.-Atmos., 123, 2387–2408, https://doi.org/10.1002/2018JD028377, 2018.
Shen, C., Shen, A., Cui, Y., Chen, X., Liu, Y., Fan, Q., Chan, P., Tian, C., Wang, C., Lan, J., Gao, M., Li, X., and Wu, J.: Spatializing the roughness length of heterogeneous urban underlying surfaces to improve the WRF simulation – part 1: A review of morphological methods and model evaluation, Atmos. Environ., 270, 118874, https://doi.org/10.1016/j.atmosenv.2021.118874, 2022.
Shen, G., Zheng, S., Jiang, Y., Zhou, W., and Zhu, D.: An improved method for calculating urban ground roughness considering the length and angle of upwind sector, Build. Environ., 266, 112144, https://doi.org/10.1016/j.buildenv.2024.112144, 2024.
Skamarock, W. C., Klemp, J. B., Dudhia, J., Gill, D. O., Liu, Z., Berner, J., Wang, W., Powers, J. G., Duda, M. G., Barker, D. M., and Huang, X.-Y.: A description of the advanced research WRF model version 4 Rep (Vol. 145), National Center for Atmos Res National Center for Atmospheric Research, https://doi.org/10.5065/1DFH-6P97, 2019.
Stathopoulos, T., Alrawashdeh, H., Al-Quraan, A., Blocken, B., Dilimulati, A., Paraschivoiu, M., and Pilay, P.: Urban wind energy: Some views on potential and challenges, J. Wind Eng. Indust. Aerodynam., 179, 146–157, https://doi.org/10.1016/j.jweia.2018.05.018, 2018.
Stull, R. B.: An introduction to boundary layer meteorology, Springer Science & Business Media, ISBN 978-90-277-2769-5, 1988.
Tanentzap, A. J., Taylor, P. A., Yan, N. D., and Salmon, J. R.: On Sudbury-area wind speeds – a tale of forest regeneration, J. Appl Meteorol. Clim., 46, 1645–1654, https://doi.org/10.1175/JAM2552.1, 2007.
Tasneem, Z., Noman, A. A., Das, S. K., Saha, D. K., Islam, M. R., Ali, M. F., Badal, M. F. R., Ahamed, M. H., Moyeen, S. I., and Alam, F.: An analytical review on the evaluation of wind resource and wind turbine for urban application: Prospect and challenges, Dev. Built Environ., 4, 100033, https://doi.org/10.1016/j.dibe.2020.100033, 2020.
Tewari, M., Chen, F., and Kusaka, H.: Implementation and evaluation of a single-layer urban canopy model in WRF/Noah, in: Proceedings of the WRF Users' Workshop, NCAR, Boulder, CO, USA, https://www2.mmm.ucar.edu/wrf/users/workshops/WS2006/abstracts/Session05/5_6_Tewari.pdf (last access: 20 October 2025), 2006.
Thompson, G., Field, P. R., Rasmussen, R. M., and Hall, W. D.: Explicit forecasts of winter precipitation using an improved bulk microphysics scheme. Part II: implementation of a new snow parameterization, Mon. Weather Rev., 136, 5095–5115, https://doi.org/10.1175/2008MWR2387.1, 2008.
Wang, J.: Codes for manuscript “Improvement of near-surface wind speed modeling through refined aerodynamic roughness length in high-roughness surface regions: implementation and validation in the Weather Research and Forecasting (WRF) model version 4.0”, Zenodo [code], https://doi.org/10.5281/zenodo.15108200, 2025.
Wang, J. and Hu, X.-M.: Evaluating the performance of WRF urban schemes and PBL schemes over Dallas-Fort Worth during a dry summer and a wet summer, J. Appl. Meteorol. Clim., 60, 779–798, https://doi.org/10.1175/JAMC-D-19-0195.1, 2021.
Wang, J., Yang, K., Yuan, L., Liu, J., Peng, Z., Ren, Z., and Zhou, X.: Deducing aerodynamic roughness length from abundant anemometer tower data to inform wind resource modeling, Geophys. Res. Lett., 51, e2024GL111056, https://doi.org/10.1029/2024GL111056, 2024.
Watts, C. J., Chehbouni, A., Rodriguez, J. C., Kerr, Y. H., Hartogensis, O., and de Bruin, H. A. R.: Comparison of sensible heat flux estimates using AVHRR with scintillometer measurements over semi-arid grassland in northwest Mexico, Agr. Forest Meteorol., 105, 81–89, https://doi.org/10.1016/S0168-1923(00)00188-X, 2000.
Wever, N.: Quantifying trends in surface roughness and the effect on surface wind speed observations, J. Geophys. Res.-Atmos., 117, D11101, https://doi.org/10.1029/2011JD017118, 2012.
Wieringa, J.: Representative roughness parameters for homogeneous terrain, Bound.-Layer Meteorol., 63, 323-363, 1993.
Winckler, J., Reick, C. H., Bright, R. M., and Pongratz, J.: Importance of surface roughness for the local biogeophysical effects of deforestation, J. Geophys. Res.-Atmos., 124, 8605–8618, https://doi.org/10.1029/2018JD030127, 2019.
Wong, C. C. and Liu, C. H.: Pollutant plume dispersion in the atmospheric boundary layer over idealized urban roughness, Bound.-Lay. Meteorol., 147, 281–300, https://doi.org/10.1007/s10546-012-9785-7, 2013.
Wu, J., Zha, J., Zhao, D., and Yang, Q.: Effects of surface friction and turbulent mixing on long-term changes in the near-surface wind speed over the eastern China plain from 1981 to 2010, Clim. Dynam., 51, 1–15, https://doi.org/10.1007/s00382-017-4012-3, 2017.
Yuan, H., Dai, Y., Xiao, Z., Ji, D., and Shangguan, W.: Reprocessing the MODIS leaf area index products for land surface and climate modelling, Remote Sens. Environ., 115, 1171–1187, https://doi.org/10.1016/j.rse.2011.01.001, 2011.
Zhang, F., Sha, M., Wang, G., Li, Z., and Shao, Y.: Urban aerodynamic roughness length mapping using multitemporal SAR data, Adv. Meteorol., 2017, 8958926, https://doi.org/10.1155/2017/8958926, 2017.
Zhang, Z. and Wang, K.: Quantifying and adjusting the impact of urbanization on the observed surface wind speed over China from 1985 to 2017, Fundam. Res., 1, 785–791, https://doi.org/10.1016/j.fmre.2021.09.006, 2021.
Zhang, Z., Wang, K., Chen, D., Li, J., and Robert, D.: Increase in surface friction dominates the observed surface wind speed decline during 1973–2014 in the northern hemisphere lands, J. Climate, 32, 7421–7435, https://doi.org/10.1175/JCLI-D-18-0691.1, 2019.
Zhao, L., Lee, X., Smith, R. B., and Oleson, K.: Strong contributions of local background climate to urban heat islands, Nature, 511, 216–219, https://doi.org/10.1038/nature13462, 2014.
Zhou, X., Yang, K., and Wang, Y.: Implementation of a turbulent orographic form drag scheme in WRF and its application to the Tibetan Plateau, Clim. Dynam., 50, 2443–2455, https://doi.org/10.1007/s00382-017-3677-y, 2018.