Abstract
Human-perceived temperature (HPT) describes the joint effects of multiple climatic factors such as temperature and humidity. Extreme HPT events may reduce labor capacity and cause thermal discomfort and even mortality. These events are becoming more frequent and more intense under global warming, posing severe threats to human and natural systems worldwide, particularly in populated areas with intensive human activities, e.g., the North China Plain (NCP). Therefore, a fine-scale HPT dataset in both spatial and temporal dimensions is urgently needed. Here we construct a daily high-resolution (~1 km) human thermal index collection over NCP from 2003 to 2020 (HiTIC-NCP). This dataset contains 12 HPT indices and has high accuracy with averaged determination coefficient, mean absolute error, and root mean squared error of 0.987, 0.970 °C, and 1.292 °C, respectively. Moreover, it exhibits high spatiotemporal consistency with ground-level observations. The dataset provides a reference for human thermal environment and could facilitate studies such as natural hazards, regional climate change, and urban planning.
Similar content being viewed by others
Background & Summary
Under global warming, the frequency and intensity of extreme temperature events (e.g., heatwaves and cold spells) are increasing in most parts of the world1,2,3,4,5. They pose severe impacts on human society and the ecological systems6,7,8,9,10, putting more people at risk of extremely hot or cold11,12. For example, heatwave led to more than 70,000 deaths in Europe in 2003 and 55,000 deaths in Russia in 201013,14. Extreme cold events also pose a great threat to human health and often cause a higher proportion of deaths than extreme heat15,16.
Human-perceived temperature (HPT) has attracted much attention in recent years due to its close connection with human health17,18. HPT is not only affected by air temperature but also by other meteorological variables (e.g., humidity and wind speed). For instance, HPT would become higher on hot days with high humidity while becoming lower on cold days under the additional influence of wind19,20,21,22. Relevant studies revealed that discomfort HPT could impact human health, such as respiratory distress and reduced skin evaporation, and significantly increase the health risks and mortality in heat- and cold-exposed populations under the joint effects of multiple meteorological factors23,24,25. Li et al.26 showed that global HPT increases faster than the actual air temperature. Faster increases in HPT were also confirmed by regional studies27,28. However, previous studies mainly quantitated the human thermal environment and human heat exposure assessment based on meteorological station data and homogeneous climatic datasets29,30,31,32. These station-based or raster-based datasets have a coarse spatial resolution (e.g., 0.5° × 0.5° or 2.5° × 2.5°) and insufficient temporal resolution (e.g., monthly), which cannot meet the requirements of long time series, full regional coverage, and fine spatial scale studies28. A seamless dataset of HPT with high temporal frequency and spatial resolution is still lacking and sorely needed.
A better understanding of the thermal environment helps prevent or mitigate heat/cold stress effectively18. Various HPT indices have been proposed to investigate human thermal stress previously. For example, the wet-bulb temperature (WBT), effective temperature (ET), and heat index (HI) have been widely used for quantitative assessment of human thermal environment research17,28,33,34,35. However, there is still a lack of unified evaluation standards or comprehensive datasets for assessing the human thermal environment and stress36. Therefore, it is imperative to develop a new collection with multiple indices for evaluating the thermal environment at a fine scale (e.g., at daily frequencies, and ~0.00833° spatial resolution in a geographical coordinate system, which is equal to around 1 km near the equator).
Previous studies have projected that the morbidity and mortality induced by extreme temperatures may continue to rise, especially in populated Asia37,38,39. In particular, the North China Plain (NCP), as one of the most populated and urbanized areas of the world, is suffering from severe thermal stress40,41. It is located in a typical East Asian monsoon region (113°E–121°E and 34°N–41°N)40,42, and spans eleven provincial units of China, including one of the regions with the most profound human activities in China, i.e., the Beijing-Tianjin-Hebei region43. NCP has a total population of 0.3 billion, which accounts for around one-fifth of the population of the country44. The population exposed to heatwaves in this area is up to 0.66 million persons/day during 1961–201545. By examining the long-term changes of major urbanization agglomerations in China, Wang et al.46 indicated that HPT in northern China grew faster than that in the south, particularly in NCP. Zhang et al.47 have generated a 1-km-resolution dataset of monthly human thermal index collection (HiTIC-Monthly) over China during 2003–2020. However, this monthly dataset is not capable of detecting extreme weather events such as heatwaves and cold spells, which are usually defined based on daily temperatures. The lack of a daily HPT dataset limits accurate heat stress-related studies in this region, and a multi-index dataset of human thermal stress at a daily scale over NCP is highly essential.
To fill the aforementioned research gap, this study aims to construct a high spatial resolution (1 km × 1 km) human thermal index collection at a daily scale over NCP from 2003 to 2020 (i.e., HiTIC-NCP). The 12 widely-used indices include Surface Air Temperature (SAT), indoor Apparent Temperature (ATin), outdoor shaded Apparent Temperature (ATout), Discomfort Index (DI), Effective Temperature (ET), Heat Index (HI), Humidex (HMI), Modified Discomfort Index (MDI), Net Effective Temperature (NET), Wet-Bulb Temperature (WBT), simplified Wet-Bulb Globe Temperature (sWBGT), and Wind Chill Temperature (WCT). Compared with our previous HiTIC-Monthly dataset, the current HiTIC-NCP dataset has improved the temporal resolution from a monthly to a daily scale. It also incorporates different predictors by including new variables such as daily precipitable water vapor, which reflect the high-frequency (day-to-day) fluctuations of atmospheric humidity that induces excessive heat stress as perceived by human body. This study is thus of great significance for regional climate change, urban planning, urban thermal environment, and public health research.
Methods
Overview
The data used in this study mainly include meteorological station observations and four gridded datasets. The Light Gradient Boosting Machine (LightGBM) algorithm proposed by Ke et al.48 is used to produce 12 human thermal indices. The workflow of this study is shown in Fig. 1. First, we derived daily HPT indices from daily values of temperature, humidity, and wind speed based on station observations, and preprocessed the daily gridded datasets. We extracted the covariate values at meteorological stations from gridded datasets to form candidate samples (which include station-level HPTs and their co-located covariate values), then randomly selected 80% of the samples as a training set and 20% of that as a validation set. Hyperparameter optimization based on a 5-fold cross-validation was used to determine the optimal parameters of the model by the training set, and a systematic evaluation of model performance was conducted based on the validation set. Finally, a collection of 12 HPTs was generated by the trained LightGBM model. The detailed data and methods are described as follows.
Framework of generating the daily human thermal index collection over the North China Plain (HiTIC-NCP). HPT: Human-perceived Temperature; LightGBM: Light Gradient Boosting Machine.
Meteorological observations
The station-observed daily data including surface air temperature, relative humidity, and wind speed from 2003 to 2020 were collected from the China Meteorological Data Service Center (CMDSC, http://data.cma.cn). All station data have been strictly quality-controlled based on the method provided by Xu et al.49, including homogenizing climate data and processing outliers in the data series. The observational network includes 374 stations across NCP (Fig. 2). In this study, we calculated daily human-perceived temperature from meteorological observations and extracted the covariate values from gridded datasets at the same locations. The data were stored in the form of one per station per day, then processed and divided into training (80%) and validation (20%) sets for model calibration. The calculation of the 12 human thermal indices is described in Table 1.
Study area and the spatial distribution of meteorological stations in the North China Plain (NCP). The shaded color indicates elevation, and the black dots denote the stations.
Gridded datasets
Four gridded datasets with a total of seven independent variables (Table 2) are used as input in the LightGBM model. These variables include land surface temperature (LST), population density, elevation, slope, aspect, precipitable water vapor, and day of the year.
LST is closely related to near-surface air temperature50. In this study, we adopted an LST dataset named MODIS-like LST51. This dataset was developed based on the Moderate Resolution Imaging Spectroradiometer (MODIS) LST dataset and the gap-filling algorithm51 to make it seamless at a global scale in 1 km grids from 2003 to 2020. The average root mean squared error (RMSE) of this LST dataset is 1.80–2.03 °C and 1.23–1.45 °C for daytime and nighttime temperatures, respectively. The daily mean LST was calculated by averaging the corresponding daytime and nighttime LSTs. In addition, population distribution and regional population density are demonstrated to affect regional temperature52,53,54,55, and we used a high-accuracy population density dataset provided by the WorldPop project team as a predictor56.
The Multi-Error-Removed Improved-Terrain (MERIT) Digital Elevation Model (DEM)57 with a spatial resolution of 90 m was used in this study to extract slope and aspect. Compared with previous DEM data (e.g., SRTM and AW3D-30m), the errors of MERIT DEM are significantly reduced, and the accuracy in the flat area has been effectively improved57,58,59. In addition, precipitation has been proven to affect air temperature and humidity60,61. Therefore, a global daily precipitable water vapor dataset with a spatial resolution of 1 km was used.
The LightGBM machine learning algorithm
LightGBM is a gradient boosting decision tree (GBDT) based machine learning algorithm. Compared with other algorithms based on GBDT such as XGBoost and pGBRT, LightGBM has improved and optimized by the Gradient-based One-Side Sampling (GOSS) and Exclusive Feature Bundling (EFB) technologies, which allow LightGBM to better retain samples with large gradients without significantly altering the original data shape and focus more on undertrained samples48. LightGBM consumes much less memory and improves training efficiency by more than 20 times while maintaining nearly the same accuracy48. Due to its superior degree of generalizability, extremely fast processing speed, and high accuracy, LightGBM has been widely applied in different fields (e.g., climatology, air pollution, social psychology)47,62,63,64,65,66,67. As shown in Fig. 1, the daily gridded datasets were fed to the trained LightGBM model for HPT prediction.
Hyperparameters tunning
The accuracy of machine learning relies heavily on parameterization68. Hyperparameter optimization configures the model by searching for various model hyperparameters and selecting a subset that achieves the best performance on the given dataset. The grid search-based hyperparameters tunning method was adopted in our study69.
Accuracy evaluation
Three statistical metrics were used to assess the performance of the training model, including the determination coefficient (R2), root mean absolute error (MAE), and RMSE. The R2, RMSE, and MAE values are computed as follows:
where N represents the number of data points, xp and xo are the ith prediction and observation values, respectively.
Data Records
The high spatial resolution (1 km) human thermal index collection at a daily scale from 2003 to 2020 over NCP (HiTIC-NCP) shows a high consistency between observations and predictions, and the product has respectable accuracies. HiTIC-NCP contains 216.rar files, with individual files decompressed into the NetCDF format. The dataset is available on the general-purpose repository Zenodo, at https://doi.org/10.5281/zenodo.752800170. This dataset is stacked by year and each stack consists of daily images by day of the year in the NetCDF format. The unit of the dataset is 0.01 degree Celsius (°C), and the values are stored in an integer data type (Int16) to save storage space, and thus need to be divided by 100 to get the values in degree Celsius when in use. The geographic coordinate system of the dataset is World Geodetic System 1984 (WGS84) Coordinate System. The spatial distribution of the 12 human thermal indices on August 13, 2013, one of the severest heat stress days in the history of NCP, and January 3, 2008, when the sharpest snowstorm struck NCP, are shown as examples in Figs. 3, 4.
Spatial distribution of the 12 human thermal indices over NCP on August 13, 2013: (a) ATin, (b) ATout, (c) DI, (d) ET, (e) HI, (f) HMI, (g) MDI, (h) SAT, (i) sWBGT, (j) WBT, (k) NET, and (l) WCT.
As Fig. 3 but on January 3, 2008.
Technical Validation
Overall accuracy of 12 predicted human thermal indices
We assessed the overall accuracy of 12 human thermal indices over NCP during 2003–2020 in terms of R2, MAE, and RMSE. All indices exhibit desirable performance, with the averaged R2, MAE, and RMSE being 0.987, 1.292 °C, and 0.970 °C, respectively. It demonstrates that the LightGBM model has good effectiveness in the prediction of human thermal indices at a daily scale. Figure 5 displays the scatter plots of observations versus predictions of 12 human thermal indices. The closer the scattered points are to the 1:1 line, the better agreement between the predicted and observed values.
Scatter plots of predicted versus observed daily human thermal indices over NCP from 2003 to 2020: (a) ATin, (b) ATout, (c) DI, (d) ET, (e) HI, (f) HMI, (g) MDI, (h) SAT, (i) sWBGT, (j) WBT, (k) NET, and (l) WCT. Colors mean the density of data points, and the black dashed line is the 1:1 line.
The R2, MAE, and RMSE values of 12 predicted thermal indices during 2003–2020 are shown in Fig. 6. The R2 values range from 0.975 to 0.990, the MAE values are all smaller than 1.4 °C, and the RMSE values are smaller than 1.9 °C (Table 3). Particularly, sWBGT has the highest R2 values of 0.990, followed by HMI, WBT, and DI, and its MAE and RMSE values are 0.698 °C and 0.926 °C, respectively. ET is with an R2 of 0.987 and has the lowest MAE and RMSE values (i.e., 0.598 °C and 0.785 °C, respectively). WCT and NET exhibit the highest MAEs (i.e., 1.17 and 1.37 °C, respectively) and RMSEs (i.e., 1.61 and 1.84 °C, respectively). It is likely caused by the involvement of wind speed in the calculation of WCT and NET (recall Table 1).
R2, MAE, and RMSE values of 12 predicted human thermal indices over NCP from 2003 to 2020.
Spatial variations of prediction accuracy
Figures 7–9 depict the distributions of R2, MAE, and RMSE values in 12 human thermal indices at individual stations of NCP during 2003–2020. As shown in Fig. 7, the high R2 values (i.e., ranging from 0.970 to 0.995) are seen at most stations across NCP, especially in the central, southwestern, and northern parts. The R2 values of NET at most stations have relatively lower accuracies, likely caused by the inclusion of wind speed. Figures 8, 9 show that ET has the best results with the lowest MAEs and RMSEs (i.e., < 0.911 °C and < 1.161 °C, respectively, see Figs. 8d, 9d). Compared with other 11 human thermal indices, NET has higher MAE and RMSE values (i.e., > 1.058 °C and 1.415 °C, respectively, see Figs. 8k, 9k). Spatially, MAEs and RMSEs exhibit an increasing tendency from low to high elevation, with smaller MAE and RMSE values in inland areas with lower elevation and relatively larger values in mountainous areas with higher elevation.
Spatial distribution of R2 of the 12 predicted human thermal indices at individual meteorological stations of NCP from 2003 to 2020: (a) ATin, (b) ATout, (c) DI, (d) ET, (e) HI, (f) HMI, (g) MDI, (h) SAT, (i) sWBGT, (j) WBT, (k) NET, and (l) WCT.
As Fig. 7 but for MAE.
As Fig. 7 but for RMSE.
Prediction accuracy in different periods
Figure 10 illustrates the averaged R2, MAE, and RMSE values of 12 indices in individual years of 2003–2020. All R2 values of 12 indices in all years exhibit well performance (i.e., > 0.955 °C, Fig. 10a). Better annual predictions appear in WBT over 2008–2013 and sWBGT over 2008–2014, with R2 exceeding 0.990. The lowest R2 is observed in NET in 2003 (i.e., 0.956). Annual averaged MAEs and RMSEs of all indices in nearly all years are less than 1.5 °C and 2.0 °C, respectively (Fig. 10b,c), demonstrating that our product is reliable in all years.
Prediction accuracies of 12 human thermal indices over NCP in individual years from 2003 to 2020.
Code availability
The HiTIC-NCP dataset generation codes are available on GitHub (https://github.com/CSLixiang/HiTIC-NCP.git), and operational under Python 3.8 or JavaScript. In the GitHub repository, we uploaded three code scripts, i.e., “Data preprocessing code.py”, “HiTIC-NCP Code.py” and “Figures code.py”. Additionally, the data samples were uploaded to the “Data Samples” folder.
References
IPCC. Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. (Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 2021).
Meehl, G. A. & Tebaldi, C. More intense, more frequent, and longer lasting heat waves in the 21st century. Science. 305, 994–997 (2004).
Alexander, L. V. et al. Global observed changes in daily climate extremes of temperature and precipitation. Journal of Geophysical Research: Atmospheres. 111, D05109 (2006).
Wu, S. et al. Local mechanisms for global daytime, nighttime, and compound heatwaves. npj Climate and Atmospheric Science. 6, 36 (2023).
Zhang, H. et al. Unequal urban heat burdens impede climate justice and equity goals. The Innovation. 4, 100488 (2023).
Donovan, M. K. et al. Local conditions magnify coral loss after marine heatwaves. Science. 372, 977–980 (2021).
Zachariah, M., Mondal, A. & AghaKouchak, A. Probabilistic assessment of extreme heat stress on indian wheat yields under climate change. Geophysical Research Letters. 48, e2021GL094702 (2021).
Markonis, Y. et al. The rise of compound warm-season droughts in Europe. Science Advances. 7, eabb9668 (2021).
Meehl, G. A. et al. Future heat waves and surface ozone. Environmental Research Letters. 13, 064004 (2018).
Basu, R. High ambient temperature and mortality: a review of epidemiologic studies from 2001 to 2008. Environmental Health. 8, 40 (2009).
Mazdiyasni, O. et al. Increasing probability of mortality during Indian heat waves. Science Advances. 3, e1700066 (2017).
Kovats, R. S. & Hajat, S. Heat stress and public health: a critical review. Annual Review of Public Health. 29, 41–55 (2008).
Barriopedro, D., Fischer, E. M., Luterbacher, J., Trigo, R. M. & Garcia-Herrera, R. The hot summer of 2010: redrawing the temperature record map of Europe. Science. 332, 220–224 (2011).
Robine, J. M. et al. Death toll exceeded 70,000 in Europe during the summer of 2003. Comptes Rendus Biologies. 331, 171–178 (2008).
Kephart, J. L. et al. City-level impact of extreme temperatures and mortality in Latin America. Nature Medicine. 28, 1700–1705 (2022).
Oudin Åström, D., Forsberg, B., Ebi, K. L. & Rocklöv, J. Attributing mortality from extreme temperatures to climate change in Stockholm, Sweden. Nature Climate Change. 3, 1050–1054 (2013).
Luo, M. & Lau, N. C. Increasing human-perceived heat stress risks exacerbated by urbanization in China: a comparative study based on multiple metrics. Earth’s Future. 9, e2020EF001848 (2021).
Nazarian, N. & Lee, J. K. W. Personal assessment of urban heat exposure: a systematic review. Environmental Research Letters. 16, 033005 (2021).
Kumar, P., Rai, A., Upadhyaya, A. & Chakraborty, A. Analysis of heat stress and heat wave in the four metropolitan cities of India in recent period. Science of The Total Environment. 818, 151788 (2022).
Oleson, K. W. et al. Interactions between urbanization, heat stress, and climate change. Climatic Change. 129, 525–541 (2013).
Chan, A. P. C. & Yi, W. Heat stress and its impacts on occupational health and performance. Indoor and Built Environment. 25, 3–5 (2016).
Kjellstrom, T., Holmer, I. & Lemke, B. Workplace heat stress, health and productivity - an increasing challenge for low and middle-income countries during climate change. Glob Health Action. 2, 2047 (2009).
Cheng, J. et al. Impacts of exposure to ambient temperature on burden of disease: a systematic review of epidemiological evidence. International Journal of Biometeorology. 63, 1099–1115 (2019).
Joubert, D., Thomsen, J. & Harrison, O. Safety in the heat: a comprehensive program for prevention of heat illness among workers in Abu Dhabi, United Arab Emirates. American Journal of Public Health. 101, 395–398 (2011).
Monteiro, A., Carvalho, V., Oliveira, T. & Sousa, C. Excess mortality and morbidity during the July 2006 heat wave in Porto, Portugal. International Journal of Biometeorology. 57, 155–167 (2013).
Li, J., Chen, Y. D., Gan, T. Y. & Lau, N.-C. Elevated increases in human-perceived temperature under climate warming. Nature Climate Change. 8, 43–47 (2018).
Luo, M. & Lau, N.-C. Characteristics of summer heat stress in China during 1979‒2014: climatology and long-term trends. Climate Dynamics. 53, 5375–5388 (2019).
Wang, Y. et al. Human-perceived temperature changes over South China: long-term trends and urbanization effects. Atmospheric Research. 215, 116–127 (2019).
Jeong, S.-J. et al. Effects of double cropping on summer climate of the North China Plain and neighbouring regions. Nature Climate Change. 4, 615–619 (2014).
Gabriel, K. M. & Endlicher, W. R. Urban and rural mortality rates during heat waves in Berlin and Brandenburg, Germany. Environmental Pollution. 159, 2044–2050 (2011).
Zhang, X., Tang, Q., Zheng, J. & Ge, Q. Warming/cooling effects of cropland greenness changes during 1982–2006 in the North China Plain. Environmental Research Letters. 8, 024038 (2013).
Conlon, K., Monaghan, A., Hayden, M. & Wilhelmi, O. Potential impacts of future warming and land sse changes on intra-urban heat exposure in Houston, Texas. PLoS One. 11, e0148890 (2016).
Bianca, W. Relative importance of dry- and wet-bulb temperatures in causing heat stress in Cattle. Nature 195, 251–252 (1962).
Luo, M. & Lau, N. C. Increasing heat stress in urban areas of eastern China: acceleration by urbanization. Geophysical Research Letters. 45, 13,060–013,069 (2018).
Mekis, É., Vincent, L. A., Shephard, M. W. & Zhang, X. Observed trends in severe weather conditions based on Humidex, Wind Chill, and heavy rainfall events in Canada for 1953–2012. Atmosphere-Ocean. 53, 383–397 (2015).
Yan, Y., Xu, Y. & Yue, S. A high-spatial-resolution dataset of human thermal stress indices over South and East Asia. Scientific Data. 8, 229 (2021).
Chen, J. et al. Global socioeconomic exposure of heat extremes under climate change. Journal of Cleaner Production. 277, 123275 (2020).
Li, T. et al. Heat-related mortality projections for cardiovascular and respiratory disease under the changing climate in Beijing, China. Scientific Reports. 5, 11441 (2015).
Chen, C. C. et al. Projection of future temperature extremes, related mortality, and adaptation due to climate and population changes in Taiwan. Science of The Total Environment 760, 143373 (2021).
Kang, S. & Eltahir, E. A. B. North China Plain threatened by deadly heatwaves due to climate change and irrigation. Nature Communications. 9, 2894 (2018).
Freychet, N., Tett, S. F. B., Yan, Z. & Li, Z. Underestimated change of wet-bulb temperatures over East and South China. Geophysical Research Letters. 47, e2019GL086140 (2020).
Ding, Y. et al. On the characteristics, driving forces and inter-decadal variability of the East Asian Summer Monsoon. Chinese Journal of Atmospheric Sciences. 42, 553–558 (2018).
Cao, W., Wu, D., Huang, L., Pan, M. & Huhe, T. Determinizing the contributions of human activities and climate change on greening in the Beijing-Tianjin-Hebei Region, China. Scientific Reports. 11, 21201 (2021).
Yang, Y. et al. Trend analysis on temperature and precipitation over North China Plain for past five decades. IOP Conference Series: Earth and Environmental Science. 675, 012014 (2021).
Zhao, G. N., Li, B. Z., Kong, P., Xia, L. J. & Zhan, M. J. Population exposure to large-scale heatwaves in China for 1961-2015. IOP Conference Series: Earth and Environmental Science. 344, 012073 (2019).
Wang, P. et al. Urbanization contribution to human perceived temperature changes in major urban agglomerations of China. Urban Climate. 38, 100910 (2021).
Zhang, H. et al. HiTIC-Monthly: a monthly high spatial resolution (1km) human thermal index collection over China during 2003–2020. Earth System Science Data. 15, 359–381 (2023).
Ke, G. et al. in Proceedings of the 31st International Conference on Neural Information Processing Systems 3149–3157 (Curran Associates Inc., Long Beach, California, USA, 2017).
Xu, W. et al. Homogenization of chinese daily surface air temperatures and analysis of trends in the extreme temperature indices. Journal of Geophysical Research: Atmospheres. 118, 9708–9720 (2013).
Jin, M., Dickinson, R. E. & Vogelmann, A. M. A comparison of CCM2–BATS skin temperature and surface-air temperature with satellite and surface observations. Journal of Climate. 10, 1505–1524 (1997).
Zhang, T., Zhou, Y., Zhu, Z., Li, X. & Asrar, G. R. A global seamless 1 km resolution daily land surface temperature dataset (2003–2020). Earth System Science Data. 14, 651–664 (2022).
Jendritzky, G. & Tinz, B. The thermal environment of the human being on the global scale. Glob Health Action. 2, 2005 (2009).
Li, L. et al. Impact of land cover and population density on land surface temperature: case study in Wuhan, China. Journal of Applied Remote Sensing. 8, 084993 (2014).
Hu, L., Wilhelmi, O. V. & Uejio, C. Assessment of heat exposure in cities: combining the dynamics of temperature and population. Science of The Total Environment. 655, 1–12 (2019).
Jaber, S. M. Is there a relationship between human population distribution and land surface temperature? Global perspective in areas with different climatic classifications. Remote Sensing Applications: Society and Environment. 20, 100435 (2020).
Gaughan, A. E., Stevens, F. R., Linard, C., Jia, P. & Tatem, A. J. High resolution population distribution maps for Southeast Asia in 2010 and 2015. PLoS One. 8, e55882 (2013).
Yamazaki, D. et al. A high-accuracy map of global terrain elevations. Geophysical Research Letters. 44, 5844–5853 (2017).
Harding, D. J. ICESat waveform measurements of within-footprint topographic relief and vegetation vertical structure. Geophysical Research Letters. 32, L21S10 (2005).
Farr, T. G. et al. The shuttle radar topography mission. Reviews of Geophysics. 45, 2005RG000183 (2007).
Reber, E. E. On the correlation of the total precipitable water in a vertical column and absolute humidity at the surface. Journal of Applied Meteorology and Climatology. 11, 1322–1325 (1972).
Tian, J., Zhang, Z., Zhao, T., Tao, H. & Zhu, B. Warmer and wetter climate induced by the continual increase in atmospheric temperature and precipitable water vapor over the arid and semi–arid regions of Northwest China. Journal of Hydrology: Regional Studies. 42, 101151 (2022).
Tian, H. et al. Estimating PM2.5 from multisource data: a comparison of different machine learning models in the Pearl River Delta of China. Urban Climate. 35, 100740 (2021).
Zhang, R., Jia, X. & Qian, Q. Analysis of lower-boundary climate factors contributing to the summer heatwave frequency over eastern Europe using a machine-learning model. Atmospheric and Oceanic Science Letters. 15, 100256 (2022).
Fan, J. et al. Light gradient boosting machine: an efficient soft computing model for estimating daily reference evapotranspiration with local and external meteorological data. Agricultural Water Management. 225, 105758 (2019).
Ma, J., Zhang, R., Xu, J. & Yu, Z. MERRA-2 PM2.5 mass concentration reconstruction in China mainland based on LightGBM machine learning. Science of The Total Environment. 827, 154363 (2022).
He, C. et al. Deriving full-coverage and fine-scale XCO2 across china based on OCO‐2 satellite retrievals and CarbonTracker output. Geophysical Research Letters. 49, e2022GL098435 (2022).
Alzamzami, F., Hoda, M. & El Saddik, A. Light gradient boosting machine for general sentiment classification on short texts: a comparative evaluation. IEEE Access. 8, 101840–101858 (2020).
Sarker, I. H. Machine Learning: algorithms, real-world applications and research directions. SN Computer Science. 2, 160 (2021).
Bergstra, J. & Bengio, Y. Random search for hyper-parameter optimization. Journal of Machine Learning Research 13, 281–305 (2012).
Li, X. et al. A daily high-resolution (1 km) human thermal index collection over the North China Plain from 2003 to 2020 [Data set]. Zenodo https://doi.org/10.5281/zenodo.7528001 (2023).
Steadman, R. G. The assessment of sultriness. part I: a temperature-humidity index based on human physiology and clothing science. Journal of Applied Meteorology and Climatology. 18, 861–873 (1979).
Steadman, R. G. A universal scale of apparent temperature. Journal of Applied Meteorology and Climatology. 23, 1674–1687 (1984).
Epstein, Y. & Moran, D. Thermal comfort and the heat stress indices. Industrial health. 44, 388–398 (2006).
Gagge, A. P., Stolwijk, J. & Nishi, Y. An effective temperature scale based on a simple model of human physiological regulatiry response. ASHRAE Transactions. 77, 247–262 (1971).
Rothfusz, L. P. & Headquarters, N. S. R. The heat index equation (or, more than you ever wanted to know about heat index). Fort Worth, Texas: National Oceanic and Atmospheric Administration, National Weather Service, Office of Meteorology. 9023, 640 (1990).
Masterton, J. M. & Richardson, F. A. Humidex: a Method of Quantifying Human Discomfort Due to Excessive Heat and Humidity. Vol. 79 (Environment Canada, Atmospheric Environment, 1979).
Moran, D., Shapiro, Y., Epstein, Y., Matthew, W. & Pandolfi, K. A modified discomfort index (MDI) as an alternative to the wet bulb globel temperature (WBGT). 77–80 (Environmental Ergonomics VIII, 1998).
Willett, K. M. & Sherwood, S. Exceedance of heat index thresholds for 15 regions under a warming climate using the wet-bulb globe temperature. International Journal of Climatology. 32, 161–177 (2012).
Stull, R. Wet-Bulb Temperature from Relative Humidity and Air Temperature. Journal of Applied Meteorology and Climatology. 50, 2267–2269 (2011).
Houghton, F. C. & Yaglou, C. P. Determining equal comfort lines. J. Am. Soc. Heating and Ventilation in England. 29, 165–176 (1923).
Osczevski, R. & Bluestein, M. The new wind chill equivalent temperature chart. Bulletin of the American Meteorological Society. 86, 1453–1458 (2005).
Lyapustin, A. & Wang, Y. MODIS/Terra+Aqua Land Aerosol Optical Depth Daily L2G Global 1km SIN Grid V061 [Data set]. Accessed 2022-03-05 from. NASA EOSDIS Land Processes DAAC. https://doi.org/10.5067/MODIS/MCD19A2.061 (2022).
Acknowledgements
This study is funded by the National Natural Science Foundation of China (42371028, 41871029, 41871306), the National Key R&D Program of China (2019YFC1510400), and the Science and Technology Program of Guangzhou (202102020489).
Author information
Authors and Affiliations
Contributions
M.L. and Y.Z. designed the study. X.L. carried out the analysis and wrote the first draft of the manuscript. All co-authors discussed the results and edited the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, X., Luo, M., Zhao, Y. et al. A daily high-resolution (1 km) human thermal index collection over the North China Plain from 2003 to 2020. Sci Data 10, 634 (2023). https://doi.org/10.1038/s41597-023-02535-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-023-02535-y












