Introduction

Heatwaves are extreme climate events that pose a direct threat to human health1,2. Within just a few weeks, heatwaves can cause thousands of deaths3,4, accounting for 93% of the total fatalities ( ~ 148,109) from hydrometeorological extreme events in Europe during 1970–20195. In China, the annual death toll from heatwaves has risen from 3679 in the 1980s to 15,500 in the 2010s6. In addition to heatwaves occurring during the daytime, the number of days with extreme temperatures persisting day and night has significantly increased in recent decades7,8,9. These persistent day-night hot extremes, referred to as compound heatwaves (CoHot), pose greater risks and burdens on cardiovascular and respiratory health, local environments, and the communities compared to heatwaves occurring solely during the day or night8,10.

Anthropogenic greenhouse gas emissions causing climate warming is considered the primary factor making CoHot events hotter, longer-lasting, and more frequent8,11,12, primarily by increasing average temperatures rather than temperature variability8. At the local scales, urbanization can alter temperature trends9,13, causing different changes in CoHot events between urban and rural areas. Additionally, urbanization may advance the onset date of CoHot events14. For instance, the 2022 heatwave in India and Pakistan15 and the 2023 heatwave in Mediterranean region16 occurred during atypical high-temperature seasons (e.g., spring and autumn), catching the public off-guard due to inadequate heat preparedness, thereby exacerbating their impacts on human health and socioeconomic systems17. Moreover, shifts in the onset date of CoHot events exacerbates the grand challenges to early warning systems, as the predictability of heatwave onset remains limited18. However, compared to the extensive research on the frequency, duration, and intensity of CoHot events, current knowledge about the influence of urbanization on their onset dates remains insufficient, hindering the development of sustainable urban planning and mitigation strategies.

Unlike anthropogenic warming, the influence of urbanization on CoHot events is highly complex. For example, the contribution of urbanization to the increasing trend of CoHots in urban regions ranges from negative values to over 50%11,13,19,20. This highly heterogeneous effect is not only driven by varying urban fraction (usually represented by fraction of artificial impervious surface), but is also influenced by differences in the local environment background19,21,22,23,24. Recent studies have begun to highlight the importance of three-dimensional (3-D) urban morphology (e.g., building height, and urban volume)25,26, but most of them focus on the urban heat island effect. The relative significance of 3-D urban morphology and urban fraction in modulating the influence on CoHots onset, remains a subject of ongoing debate11,27.

To this end, here we identify over 700 urban–rural station pairs from more than 5000 meteorological observation stations worldwide. With respect to these paired stations, we present a global pattern of urbanization’s influences on the onset date of CoHots (\(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\)). Employing interpretable machine learning techniques, we attribute the spatial variations in \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) to differences in environment background, 3-D urban morphology and urban fraction. The results show that the onset of CoHots in urban areas occurs, on average 0.23 days earlier than in surrounding rural areas. Sunny, warm and humid climates, along with larger urban volumes, favors earlier onsets of CoHot in cities, while tall buildings can significantly advance CoHot onset in warm and humid climates with limited solar radiation. However, no significant relationship between \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) and urban fraction is observed unless urban building volume is held constant.

Results

Impact of urbanization on the onset date of CoHot events

CoHot events are defined as periods of at least three consecutive days with extreme high temperatures during both day and night9,13. Urban and rural observation stations are identified based on the proportion of impervious surface area, and 722 urban–rural station pairs within 50 km of each other are selected from over 5000 stations globally (Supplementary Fig. 1 and “Method”). For each station pair, large-scale atmospheric processes (e.g., circulation and climate change) are assumed to be identical9, with differences in onset dates (\(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\)) attributed to the effects of urbanization28. Figure 1 depicts the urban morphology characteristics and urban heat island intensity of the selected 722 urban stations. Overall, urban stations in China are located in areas with high building heights (e.g., >12 m) and high building volumes (e.g., >2 × 106 m3 km2). Urban stations in the United State are primarily associated with lower building heights (e.g., 3–7 m) and building volumes ( < 12 × 105 m3 km2) than China, but spatial heterogeneity still exists. Urban stations in Europe (EU) capture high building height and low building volume, potentially related to a low building number density (Li et al. 2020). The average urban fraction (UrbF) during 1985–2017 does not show strong heterogeneity among China, the United States and Europe, with most values ranging from 0.3 to 0.5. Higher urban fraction, building volume, and building height are associated with lower vegetation cover (FVC), leaf area index (LAI), and albedo (BSKA and WSKA), which are likely due to the lower albedo of impervious surfaces compared to natural vegetation (Supplementary Fig. 2). Urbanization generally increases near surface air temperature, a phenomenon known of the urban heat island effect, that is most pronounced at night than during the day (Fig. 1d). Meanwhile, urban cooling effect can be detected over parts of northern China and western U.S., which is also found in previous studies13,29.

Fig. 1: Morphological characteristics and urban heat island intensity of 722 urban stations.
figure 1

Spatial distributions of (a) mean building height, (b) mean building volume, (c) mean urban fraction and (d) urban heat island intensity. Urban height and volume refer to the average height of buildings and the total building volume per square kilometer, while urban fraction is the fraction of artificial impervious surface. Urban heat island intensity is the mean difference of daily maximum and minimum air temperature between the urban station and its paired rural station.

Figure 2a depicts the distribution of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) calculated through pairwise comparison of simultaneous CoHot events occurring within a 5-day window at urban-rural station pairs (“Method”). A positive \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) value indicates earlier onset of CoHot events at urban stations relative to their paired rural stations. The spatial pattern of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) is much more heterogeneous than the distribution of urban morphology. Urbanization advances the onset of CoHot events by 0.4–4 days significantly in southern Europe, the eastern, western, and central parts of the United States, as well as along the southeastern coast and the Yangtze River Economic Belt in China (24–35°N, 100–125°E). These regions represent global hotspots of rapid urbanization, suggesting a potential link between urban development patterns and the observed advancement of CoHot event. Meanwhile, urbanization delays their onset by 0.4–0.01 days in central to northern China and midwest United States (e.g., Missouri). Overall, approximately 61% of the 722 samples (blue bars and lines in Fig. 2b), together with 81% of the 235 samples that passed the significant test, exhibits positive \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) values. When considering all samples, the observed mean \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) for China, Europe, the United States, and Australia is 0.21, 0.22, 0.26 and 0.23 days, respectively. For urban-rural pairs with significant differences in the CoHot onset dates, the regional means increased to 0.44–0.54 days (Fig. 2c). Although with some seasonal variations, the observed mean \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) keeps positive throughout the early (May-June), mid (July-August), and late (September) summer (Supplementary Fig. 3a). This pattern holds robustly across diverse climatic and geographic regions, including monsoonal versus nonmonsoonal regions that defined based on precipitation30, and coastal versus noncoastal areas (Supplementary Fig. 3). These findings indicate that from 1971 to 2017, urbanization has generally advanced the occurrence of CoHot events on a global scale.

Fig. 2: The influence of urbanization on the onset date of summertime CoHot events.
figure 2

a Spatial distributions of the average differences in Cohot events onset dates (\(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\)) between urban and rural paired stations during 1971–2017. Circles mean the differences pass the student’s t test at 90% confidence level. b Cumulative distribution function (CDF) and probability density function (PDF) of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) from panel (a). c Mean \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) for China (CA), Europe (EU), the United State (US), and Australia (AU), with error bars representing the 5–95% uncertainty. Uncertainty estimates were derived from 10,000 bootstrap resamplings, stratified by five climate zones (Supplementary Table 1) with 80% of samples in each iteration. Note that positive \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) values indicate that CoHot event occurs earlier at urban stations than at rural stations.

Subsequently, the causes of the \({{\rm{CoHot}}}_{{\rm{date}}}\) differences between urban and rural observation stations for each CoHot event are differentiated, with the results summarized in Supplementary Fig. 4. Urbanization alters the onset time of CoHot events in two main ways: (1) advancing only hot nights, and (2) advancing both hot days and hot nights. “Only hot nights” means that the onset time of hot days was the same for urban and rural stations, the onset time of hot nights is earlier at urban stations. “Both hot days and hot nights” indicates that the onset times of both hot days and hot nights are earlier at urban stations compared to rural stations. This finding is consistent with previous studies, which showed that nighttime heat extremes are more significantly influenced by urbanization, while daytime extreme heat is affected to a lesser extent8,27,31.

Driving factors of the spatial variation in CoHot onset date due to urbanization

Previous studies have suggested that, the spatial heterogeneity of urbanization’s influences on canopy temperature is modulated by numerous factors, including local climates (e.g., wind speed, humidity, and other near-surface meteorological conditions), urban fraction, 3-D urban morphology, urban greening space, and surface albedo11,21,32,33. We therefore employed regression methods including eXtreme Gradient Boosting (XGBoost)34, Random Forest (RF)35, and Multiple Linear Regression (MLR) regression methods to analyze the relationship between \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) for 722 urban–rural station pairs and various climatic and urbanization factors. Figure 3a presents the evaluation of different regression methods, with the metrics for training and validating samples shown independently in Supplementary Fig. 5. Although all methods showed a tendency to underestimate extreme positive and negative values (blue dots in Fig. 3a), XGBoost outperforms RF and MLR. Moreover, the coefficients of determination (R2) between the simulated and observed \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) for the XGBoost, RF, and MLR models are 0.67, 0.48, and 0.02, respectively, suggesting that the XGBoost model explains the majority of the observed spatial variation in \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) (up to 67%). We also repeat the model training and evaluation using only the samples with significant \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\), and the results are consistent with those using all the samples (red dots in Fig. 3a and Supplementary Fig. 5). XGBoost’s superior performance over RF is likely due to its boosting algorithm, which improves prediction accuracy by incrementally building models and focusing on correcting the errors of previous models34.

Fig. 3: The performance of different regression models and feature importances.
figure 3

a Scatter plot of observed \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) versus the simulated values based on the XGBoost, random forest (RF), and multiple linear regression (MLR) methods. b Beeswarm plot of SHAP values for various urban and climatic factors in the XGBoost model. Factors are arranged in descending order of the mean absolute SHAP value. c Heatmap of SHAP values for various factors across all model instances. The black bars on the right indicate the global importance of each factor, and the gray dashed line represents f(x) = 0.

Figure 3b provides a detailed explanation of the impact of each feature on the XGBoost model output, using SHapley Additive exPlanation (SHAP) method. Each feature, listed vertically, is associated with a distribution of SHAP values on the horizontal axis, quantifying its contribution to the model’s prediction. Specifically, the sign of a SHAP value indicates the direction of the feature’s influence: positive values suggest that higher feature values increase \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) (earlier onset of CoHot events in urban areas), while negative values indicate that higher feature values decrease \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\). The magnitude of the SHAP value reflects the strength of the feature’s influence, with larger absolute values indicating greater importance in the model’s predictions. The color gradient ranges from purple to red, representing the magnitude of the feature values, with brighter colors indicating higher values. Shortwave radiation (SW) and urban volume (UrbV) demonstrate a broad range of SHAP values, indicating their significant influence on the model output across different instances. In contrast, features like mean summertime maximum temperature (TMAX) and vegetation fraction (FVC) show smaller variations in SHAP values, indicating a relatively minor impact on the prediction. In general, high shortwave radiation, high minimum temperature and high dew point temperature are associated with higher SHAP values, suggesting that sunlight-rich, warm, humid climate background is favorable to an earlier onset of CoHot events in urban regions. This is potentially because that the warm and humid climate can greatly diminish convective efficiency, leading to localized warming in urban areas21, which in turn fosters the occurrence of heat waves. High wind speed has a negative influence on \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\), as strong winds may enhance air mixing between urban and rural regions, reducing the urban heat island effect36. Additionally, large building volume, medium building height, and high urban fraction are conducive to the earlier onset of CoHot events. Urban engineered materials typically exhibit lower albedo and higher heat capacity than natural surfaces, enhancing solar radiation absorption. The reduction in native vegetation diminishes the evaporative cooling effect of plants and soil, causing the heat stored in the land surface to transfer into urban canopy layers primarily as sensible heat. These processes can contribute to faster warming in urban areas than rural areas33,37, making an earlier onset of CoHot in urban areas. In summary, the data-driven XGBoost model reveals relationships between multiple factors and \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) that are physically interpretable and consistent with previous research findings.

Meanwhile, some stations exhibit delayed CoHot onset relative to their rural counterparts. This urban cooling effect aligns with previous studies documenting urbanization-induced surface and canopy temperature cooling26,38 and reduced CoHot frequency13. However, the physical mechanisms underlying this cooling effect remain poorly understood13. Potential processes include the shading effect of buildings38 and aerosol39, evaporative cooling from urban water bodies and green spaces, and enhanced vertical mixing between cooler upper atmospheric air and warmer canopy air due to increased surface roughness40. The complex interplay of these localized processes contributes to the high spatial heterogeneity in urbanization impact. For instance, some adjacent urban stations (within 200 km) may exhibit contrasting effects on CoHot onset date (Fig. 2a). The XGBoost model generally captures these local variations (Supplementary Fig. 6a). SHAP analysis further reveals that, the opposite signs of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) between neighboring urban stations in California and in Missouri are related to the differences in urban volume and urban greening space (Supplementary Fig. 6), respectively. Nevertheless, the XGBoost model does not fully capture all the stations with negative \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\), as the model is trained globally to reveal large-scale variations of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) and does not account for certain localized factors (e.g., aerosol and water body). Further efforts are needed to present a more in-depth investigation into the strong heterogeneity in urbanization’s impact on CoHot onset dates at local scales.

The SHAP heatmap presented in the Fig. 3c illustrates the overall influence of different features on the model output. f(x) represents the model’s predicted value equal to the total sum of SHAP values on the vertical axis, while the black bars show the total sum of SHAP values on the horizontal axis, indicating the global importance of specific factors. Overall, shortwave radiation is the most important factor, followed by building volume, minimum temperature, building height, urban fraction, and dew point temperature. Positive \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) predictions (e.g., f(x)>0) are typically associated with significant positive contributions from these factors. Other factors, such as precipitation, vegetation, wind speed, albedo, and maximum temperature, have only 1/6 to 1/3 of the importance of shortwave radiation and building volume. Therefore, the spatial variation in \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) is primarily shaped by climatic factors (e.g., solar radiation, minimum temperature, and humidity) and urban morphology.

Interdependence of urban morphology on the CoHot onset date

Considering that the impact of specific urban morphological features (e.g., building volume) on the CoHot onset date is modulated by the local climates, this study uses the K-means clustering method to categorize the 722 samples into five groups based on the top 6 environmental factors ranked in Fig. 3c. Supplementary Table 1 presents the average values of environmental factors in each of the five clusters, and the influences of environmental factors are supposed to be similar in each cluster.

Figure 4a depicts the scatter plot of urban volume and its impact on CoHot events (represented by SHAP values), with color indicating the urban fraction. Figure 4b, c display similar relationships for urban building height and urban fraction, respectively. There is a significant positive correlation between urban volume and its SHAP values (Fig. 4a). Higher building volume results in greater heat capacity (thus storing more heat), reduced wind speed, and the release of more anthropogenic heat fluxes, which enhances the urban canopy’s heat effect and favors the onset of CoHot events in urban than rural regions under the same synoptic conditions (e.g., anticyclones). Meanwhile, a significant positive correlation between building height and its SHAP values is observed only in cities with high minimum temperatures, large precipitation (therefore humid climate), but low solar radiation (Fig. 4b and Supplementary Table 1). Previous studies have shown that buildings reduce urban temperatures during the day by increasing shading25,41,42, but at night, they increase temperatures through multiple reflections and absorption of longwave radiation26. Therefore, in a cloudy day, warm night with humid climate background, the cooling effect of tall buildings is limited, while their warming effect at night is enhanced, potentially leading to an overall net warming effect. In regions with higher solar radiation, lower nighttime temperatures, and drier climates, the offset between the cooling and warming effects of tall buildings is more complex, and taller buildings do not always lead to an earlier CoHot onset date.

Fig. 4: The dependency between urban factors and their impacts on ΔCoHotdate.
figure 4

a Scatter plot of the normalized urban volume (UrbV) versus its impacts on \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) (represented by SHAP value), with colors indicating normalized urban fraction (UrbF). The five clustered climatic categories (e.g., from Category 1 to Category 5) are detailed in Supplementary Table 1. The black solid and dashed lines represent linear regression and its 5–95% uncertainty, respectively. Panels (b) and (c) Same as (a), but showing the influence of urban height (UrbH) and urban fraction (UrbF), respectively. The normalization was performed using z-score normalization: \(({\rm{X}}-{\overline{\rm{X}}})/{\rm{\sigma }},\) where \({\rm{X}}\) means a specific variable (e.g., urban fraction), and \({\overline{\rm{X}}}\) and \({\rm{\sigma }}\) are the mean value and standard deviation of \({\rm{X}}\) from all urban stations.

There is no significant positive correlation between the urban fraction and its SHAP values (Fig. 4c), but a higher urban fraction leads to significantly higher positive SHAP values when urban volume density if fixed (blue to orange dots in Fig. 4a). This suggests that the influence of urban fraction on CoHotdate depends on urban volume density. In contrast, the effects of urban volume density do not depend on the urban fraction (Fig. 4c). The observed interdependence is not attributable to collinearity between urban fraction and urban volume density (correlation coefficient of 0.23), because (1) the XGBoost model utilizes regularization techniques to address multicollinearity, and (2) there is no significant dependence of urban fraction on urban building height (Fig. 4b), despite the high correlation between urban building height and urban volume density. Instead, these findings highlight the greater importance of urban volume density over urban fraction in determining the impact of urbanization on CoHot onset.

Discussion

Compared to previous studies focused on the frequency, duration, and intensity of CoHot events, this research goes a step further by investigating the impacts of urbanization on the onset of CoHot events, which is important for heatwave early warning. The result shows a global emerging of earlier CoHot event onset in urban regions, and the sunny, warm, and humid climate background can facilitate this advancement. In addition, urban volume has the strongest positive influence on \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\), compared to urban building height and urban impervious fraction. Higher building height significantly increases \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) only in the cloudy day, warming night and humid climate, while urban impervious fractions show no significant relationship with \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\).

While the mean urban-rural differences in CoHot onset (0.21–0.23 days globally, 0.44–0.54 days for specific locations) may appear modest at first glance, they are statistically significant at both global and regional scales, demonstrating a systematic impact of urbanization. Importantly, there is considerable variability in individual events, as not all CoHot events during 1971–2017 were advanced by urbanization. For instance, when synoptic system anomalies are strong, CoHot events in rural-urban station pairs may occur on the same day, accounting for 53% of total CoHot events (Supplementary Fig. 7a). However, in the remaining cases where urbanization drives an earlier onset (35% of events), the advance is typically 1–2 days (Supplementary Fig. 7b and Supplementary Fig. 8), exceeding the operational timescales of most heatwave warning systems (e.g., 1 day). Moreover, the proportion of advanced CoHot events exceeds 50% of total events in regions including southern Europe, the southeastern and western United States, as well as along China’s southeastern coast and the Yangtze River Economic Belt (Supplementary Fig. 7b). As global urbanization intensifies and compound heatwave events become more frequent and severe in the future, the impact of urbanization on CoHot onset is likely to grow. Therefore, while synoptic-scale anomalies may sometimes synchronize CoHot onset across urban-rural pairs, incorporating urbanization-induced timing differences into warning systems is critical for enhancing urban preparedness and targeted interventions in vulnerable regions such as southeastern China, the southeastern and western U.S., and southern Europe. Additionally, our findings on the role of urban three-dimensional structure suggest that reducing urban building volumes, alongside traditional mitigation strategies like green roofs and urban greening, could effectively mitigate an earlier CoHot onset. Sustainable urban development strategies could consider reducing urban building volumes while meeting urbanization demands, thereby reducing the risk of an earlier CoHot onset.

Previous studies defined the onset date of CoHot as the time when the first CoHot event occurs in a particular summertime of a given year14,43. However, this definition is more applicable to phenomena with average state variables or long-term events (e.g., monsoons), rather than extreme events. This study focuses on CoHot events occurring simultaneously in both urban and rural stations, comparing each event individually. The selected sample represents about half of all CoHot events in global urban stations (Supplementary Fig. 9), while the remaining samples, which only occur in urban stations, are considered to reflect the influence of urbanization on CoHot frequency, possibly due to highly localized processes or observational perturbations14. The spatial threshold used to identify urban-rural station pairs should neither be too small nor too large to ensure that stations are paired under homogeneous synoptic conditions while including as many pairs as possible. Therefore, a 50 km threshold was chosen, as it can cover most medium to large-scale synoptic systems that are typically associated with heatwave extremes44,45,46,47, and the threshold falls within the practical ranges (20–100 km) commonly used in previous studies48,49. Moreover, the results are not sensitive to the spatial threshold, as the conclusions remain unchanged when the threshold changes between 10 km and 100 km (Supplementary Fig. 10).

Due to the lack of long-term observation stations, the current urban-rural pairs are mainly concentrated in China, United States, and Europe, with large cities not fully represented. For example, the average building volume in U.S. is larger than in China50, but most of U.S. urban stations identified in this study have lower building volumes than those in China. Therefore, the average earlier start date for urban CoHot events may be underestimated. Satellite-based spatial grid datasets for daily minimum and maximum surface air temperature could provide a more comprehensive urbanization impact pattern51, but how to apply satellite short-term records to extreme event analysis (e.g., CoHot) still requires further exploration.

While the XGBoost model demonstrates superior performance, some uncertainties remain. First, the reliance on static urban 3-D morphology data may not adequately capture the dynamic changes during urban development, limiting the model’s ability to accurately reflect the complex relationships between climatic and urbanization factors. In addition, neglecting this dynamic process would underestimate the importance of urban height and volume in influencing the spatial variations of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\). Second, while our predictor variables were carefully selected, they may not fully capture all relevant processes, potentially introducing biases in the predictions. Specifically, although we primarily attribute urban-rural differences in CoHot onset to urbanization effects, other small-to-meso scale ( ≤ 50 km) processes—such as regional topography and associated climate variability52, and land-sea contrasts53,—may also contribute to the differences. These factors can interact with urbanization processes, modifying local climate conditions and heatwave dynamics. However, it is important to note that dynamic urban morphology data are currently lacking, and the factors influencing urban CoHot events, along with their underlying physical processes, remain an active area of research. While these uncertainties need further investigation, they are not expected to significantly impact the validity of our current findings, as the XGBoost model accounts for the majority of the spatial variance in CoHot onset and produces physically reasonable results.

Finally, while our machine learning framework (XGBoost with SHAP analysis) robustly identifies urban morphological features as key predictors of \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\)—demonstrating consistent, physically plausible relationships – we recognize that data-driven approaches alone cannot fully elucidate the underlying physical mechanisms. The relative contributions of processes like heat storage, roughness effects, and radiation trapping in driving earlier CoHot onset remain to be quantitatively resolved through process-based modeling based on urbanized climate models (e.g., WRF-UCM) or high-resolution large eddy simulations (LES). We also acknowledge that energy use patterns and infrastructure systems (e.g., air conditioning prevalence) can modify urban thermal environments, but these dynamic variables present measurement challenges for comparative analysis across diverse cities due to data availability. Future research could productively build on our findings that mainly focus on urban morphological foundation by incorporating dynamic human-system variables where reliable data become available, particularly through city-specific case studies and event-based dynamic modelings.

Methods

Data

The daily maximum temperature (TMAX) and minimum temperature (TMIN) data for the summer months (May-September and November-March in Northern and Southern Hemispheres, respectively) from 1971 to 2017 were obtained from China Meteorological Administration (CMA) and the Global Historical Climatology Network-Daily (GHCND) database11,54. If there were 5 or more consecutive days of missing data during a summer, that season was considered a missing summer, and stations with 5 or more missing summers were excluded from the analysis11. There are no differences between GHCND and CMA datasets over China, except that the latter provides observations from more stations. The CMA dataset is necessary to ensure adequate spatial coverage in China, as the GHCND dataset contains only 177 stations over China after quality control, which is substantially fewer than the >2000 stations available in the U.S. Moreover, no urban-rural pairs can be found from GHCND dataset in China.

Urbanization data from 1985 to 2017 were derived from the 30-meter global artificial impervious areas (GAIA)55. The urbanization percentage for each station was derived by calculating the proportion of artificial impervious area within a 2-km radius buffer zone around the station8. Urban morphology data, including building height and volume, were obtained from the 1-km resolution global three-dimension building structure dataset50. Here the urban morphology was assumed to be constant due to a lack of global dataset of dynamic 3-D urban building structure. Climate background data during the summertime (1971–2017), including downward shortwave radiation (SW), longwave radiation (LW), dew point temperature (TD), total precipitation (TP), and 10-meter wind speed (U), were sourced from the European Center for Medium-Range Weather Forecasts (ECMWF) Fifth Generation Atmospheric Reanalysis (ERA5)56. Vegetation and albedo data, including leaf area index (LAI), fraction of vegetation cover (FVC), white sky albedo (WSA), and black sky albedo (BSA), were derived from the Global LAnd Surface Satellite (GLASS) dataset57. The spatial resolution of the ERA5 and GLASS datasets were 0.25° and 0.05°, respectively. The nearest neighbor interpolation method was used to map these gridded data to each urban station.

Definitions of CoHot events and the influence of urbanization

Similar to previous studies8,11,13, CoHot events are defined as extreme high-temperature events lasting at least three consecutive days, during which both the daytime maximum temperature and nighttime minimum temperature exceed their respective 90th percentile thresholds. The daily 90th percentile thresholds for maximum (TMAX) and minimum (TMIN) temperatures were estimated based on a 15-day window over the baseline period (1971–2000) using 450 samples (e.g., 15 days × 30 years). These daily-based percentile thresholds, in contrast to seasonal-fixed thresholds, incorporate the varying levels of preparedness and acclimatization of both humans and ecosystems to extreme heat throughout the season8. In addition, utilizing a 15-day moving window helps mitigate potential inconsistencies in the percentile-based indices of temperature extremes8.

A station is considered urban area if its average UrbF during 1985–2017 exceeds 0.33. Other stations are categorized as rural area9. Urban stations are paired with rural stations within a 50-km radius, assuming that both stations experience similar large-scale atmospheric processes (e.g., atmospheric circulation and climate change). The \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) between urban and rural stations was calculated through event-by-event comparison. Specifically, if a CoHot event occurs at a particular urban station, and within a 5-day window at its paired rural station, the CoHot events at both urban and rural stations are considered as the same event. The difference in their start dates (e.g., \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\)) is then calculated. However, if the CoHot event at the urban station is unrelated to the event at the rural station (e.g., due to the influence of urbanization on CoHot frequency), that event is excluded from the analysis. The mean \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) across all rural-urban paired stations is taken as the urbanization effect on the CoHot onset date. The student’s t test is used to determine whether the average \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) is significantly positive or negative. It is important to note that CoHot events at urban stations were excluded when the annual urban fraction (UrbF) ≤ 0.33, typically reflecting early urbanization stages. Similarly, events at rural stations were excluded when UrbF > 0.33, a condition often observed in later years as rural areas urbanize. This ensures all samples reflect clear urban-rural contrasts.

Regression models and Shapley additive explanation (SHAP) method

This study employed Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and Multiple Linear Regression (MLR) methods. Random Forest is a classical ensemble machine learning method that constructs multiple decision trees randomly and uses ensemble predictions to improve accuracy and stability35. XGBoost is an improved implementation of Gradient Boosting Decision Trees (GBDT). Unlike Random Forest, XGBoost generates multiple decision trees iteratively, where each new tree fits the errors made by the previous tree34. XGBoost also employs regularization techniques that help mitigate the impact of multicollinearity by penalizing overly complex models. This regularization not only reduces overfitting but also allows the model to identify and weigh the contributions of each feature more accurately, even in the presence of correlated variables. In addition, the tree-based nature of XGBoost allows it to capture complex interactions between features. Each decision tree in the ensemble can split based on different variables, effectively learning how the relationships between features influence the target variable. This structure helps the model to discern the independent effects of each feature while considering their interactions, therefore reduces potential confounding effects between different variables. In recent years, both Random Forest and XGBoost have been widely used in regression analysis, demonstrating high modeling accuracy.

Above regression models were trained and evaluated based on the cross-validation method. Specifically, the \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) of all urban-rural pairs was randomly divided into two groups, with 622 pairs (185 pairs for samples with significant \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) changes) utilized for training and 100 (50 pairs for samples with significant \(\Delta {{\rm{CoHot}}}_{{\rm{date}}}\) changes) reserved for validation. Hyperparameter optimization was performed, encompassing key parameters such as the learning rate, number of estimators (n_estimators), maximum tree depth (max_depth), and gamma factors, utilizing the GridSearchCV module in Python.

Shapley additive explanation (SHAP) is a game-theory-based method for explaining machine learning model output58. SHAP quantifies the specific contribution of each input element to the prediction outcome. Its additive attribution formula is as follows:

$$f({\rm{x}})=g({\rm{x}}^{\prime} )={{\rm{\phi }}}_{0}+\sum _{{\rm{i}}=1}^{{\rm{N}}}{{\rm{\phi }}}_{{\rm{i}}}{\rm{x}}^{\prime}$$
(1)

where \(f({\rm{x}})\) is the original model, \(g({\rm{x}}^{\prime} )\) is the explanatory model with simple input \({\rm{x}}^{\prime}\) (\({\rm{x}}^{\prime} \in {\{0,1\}}^{{\rm{N}}}\)), while \({\rm{x}}\) and \({\rm{x}}^{\prime}\) are linked by the mapping equation \({\rm{x}}={{\rm{h}}}_{{\rm{x}}}({\rm{x}}^{\prime} )\). The total number of the model input predictors is denoted by \({\rm{N}}\), while \({{\rm{\phi }}}_{{\rm{i}}}\) is the feature attribution function of the ith predictor. The explanatory model \(g({\rm{x}}^{\prime} )\) has a unique solution:

$${{\rm{\phi }}}_{{\rm{i}}}(f,{\rm{x}})=\sum _{{\rm{z}}^{\prime} \subseteq {\rm{x}}^{\prime} }\frac{|{\rm{z}}^{\prime} |!({\rm{M}}-|{\rm{z}}^{\prime} |-1)!}{{\rm{M}}!}({f}_{{\rm{x}}}({\rm{z}}^{\prime} )-{f}_{{\rm{x}}}({\rm{z}}^{\prime} {\rm{\backslash i}}))$$
(2)

where \(|{\rm{z}}^{\prime} |\) is the number of non-zero values in \({\rm{z}}^{\prime}\), and \({\rm{z}}^{\prime} \subseteq {\rm{x}}^{\prime}\), \(f({\rm{x}}^{\prime} )=f({{\rm{h}}}_{{\rm{x}}}({\rm{z}}^{\prime} ))={\rm{E}}[f({\rm{z}})|{{\rm{z}}}_{{\rm{S}}}]\), and \({\rm{S}}\) is the set of non-zero indexes in \({\rm{z}}^{\prime}\).