A daily high-resolution (1 km) human thermal index collection over the North China Plain from 2003 to 2020

Li, Xiang; Luo, Ming; Zhao, Yongquan; Zhang, Hui; Ge, Erjia; Huang, Ziwei; Wu, Sijia; Wang, Peng; Wang, Xiaoyu; Tang, Yu

doi:10.1038/s41597-023-02535-y

Download PDF

Data Descriptor
Open access
Published: 18 September 2023

A daily high-resolution (1 km) human thermal index collection over the North China Plain from 2003 to 2020

Xiang Li ORCID: orcid.org/0000-0002-1815-3617¹,
Ming Luo ORCID: orcid.org/0000-0002-5474-3892^1,2,
Yongquan Zhao³,
Hui Zhang¹,
Erjia Ge⁴,
Ziwei Huang¹,
Sijia Wu¹,
Peng Wang¹,
Xiaoyu Wang¹ &
…
Yu Tang¹

Scientific Data volume 10, Article number: 634 (2023) Cite this article

4801 Accesses
6 Citations
Metrics details

Subjects

Abstract

Human-perceived temperature (HPT) describes the joint effects of multiple climatic factors such as temperature and humidity. Extreme HPT events may reduce labor capacity and cause thermal discomfort and even mortality. These events are becoming more frequent and more intense under global warming, posing severe threats to human and natural systems worldwide, particularly in populated areas with intensive human activities, e.g., the North China Plain (NCP). Therefore, a fine-scale HPT dataset in both spatial and temporal dimensions is urgently needed. Here we construct a daily high-resolution (~1 km) human thermal index collection over NCP from 2003 to 2020 (HiTIC-NCP). This dataset contains 12 HPT indices and has high accuracy with averaged determination coefficient, mean absolute error, and root mean squared error of 0.987, 0.970 °C, and 1.292 °C, respectively. Moreover, it exhibits high spatiotemporal consistency with ground-level observations. The dataset provides a reference for human thermal environment and could facilitate studies such as natural hazards, regional climate change, and urban planning.

A high-spatial-resolution dataset of human thermal stress indices over South and East Asia

Article Open access 01 September 2021

Augmented human thermal discomfort in urban centers of the Arabian Peninsula

Article Open access 17 February 2024

High-resolution climate reconstruction from historical Chinese weather records using optimized natural language processing

Article Open access 24 December 2025

Background & Summary

Under global warming, the frequency and intensity of extreme temperature events (e.g., heatwaves and cold spells) are increasing in most parts of the world^1,2,3,4,5. They pose severe impacts on human society and the ecological systems^6,7,8,9,10, putting more people at risk of extremely hot or cold^11,12. For example, heatwave led to more than 70,000 deaths in Europe in 2003 and 55,000 deaths in Russia in 2010^13,14. Extreme cold events also pose a great threat to human health and often cause a higher proportion of deaths than extreme heat^15,16.

Human-perceived temperature (HPT) has attracted much attention in recent years due to its close connection with human health^17,18. HPT is not only affected by air temperature but also by other meteorological variables (e.g., humidity and wind speed). For instance, HPT would become higher on hot days with high humidity while becoming lower on cold days under the additional influence of wind^19,20,21,22. Relevant studies revealed that discomfort HPT could impact human health, such as respiratory distress and reduced skin evaporation, and significantly increase the health risks and mortality in heat- and cold-exposed populations under the joint effects of multiple meteorological factors^23,24,25. Li et al.²⁶ showed that global HPT increases faster than the actual air temperature. Faster increases in HPT were also confirmed by regional studies^27,28. However, previous studies mainly quantitated the human thermal environment and human heat exposure assessment based on meteorological station data and homogeneous climatic datasets^29,30,31,32. These station-based or raster-based datasets have a coarse spatial resolution (e.g., 0.5° × 0.5° or 2.5° × 2.5°) and insufficient temporal resolution (e.g., monthly), which cannot meet the requirements of long time series, full regional coverage, and fine spatial scale studies²⁸. A seamless dataset of HPT with high temporal frequency and spatial resolution is still lacking and sorely needed.

A better understanding of the thermal environment helps prevent or mitigate heat/cold stress effectively¹⁸. Various HPT indices have been proposed to investigate human thermal stress previously. For example, the wet-bulb temperature (WBT), effective temperature (ET), and heat index (HI) have been widely used for quantitative assessment of human thermal environment research^{17,28,33,34,35}. However, there is still a lack of unified evaluation standards or comprehensive datasets for assessing the human thermal environment and stress³⁶. Therefore, it is imperative to develop a new collection with multiple indices for evaluating the thermal environment at a fine scale (e.g., at daily frequencies, and ~0.00833° spatial resolution in a geographical coordinate system, which is equal to around 1 km near the equator).

Previous studies have projected that the morbidity and mortality induced by extreme temperatures may continue to rise, especially in populated Asia^37,38,39. In particular, the North China Plain (NCP), as one of the most populated and urbanized areas of the world, is suffering from severe thermal stress^40,41. It is located in a typical East Asian monsoon region (113°E–121°E and 34°N–41°N)^40,42, and spans eleven provincial units of China, including one of the regions with the most profound human activities in China, i.e., the Beijing-Tianjin-Hebei region⁴³. NCP has a total population of 0.3 billion, which accounts for around one-fifth of the population of the country⁴⁴. The population exposed to heatwaves in this area is up to 0.66 million persons/day during 1961–2015⁴⁵. By examining the long-term changes of major urbanization agglomerations in China, Wang et al.⁴⁶ indicated that HPT in northern China grew faster than that in the south, particularly in NCP. Zhang et al.⁴⁷ have generated a 1-km-resolution dataset of monthly human thermal index collection (HiTIC-Monthly) over China during 2003–2020. However, this monthly dataset is not capable of detecting extreme weather events such as heatwaves and cold spells, which are usually defined based on daily temperatures. The lack of a daily HPT dataset limits accurate heat stress-related studies in this region, and a multi-index dataset of human thermal stress at a daily scale over NCP is highly essential.

To fill the aforementioned research gap, this study aims to construct a high spatial resolution (1 km × 1 km) human thermal index collection at a daily scale over NCP from 2003 to 2020 (i.e., HiTIC-NCP). The 12 widely-used indices include Surface Air Temperature (SAT), indoor Apparent Temperature (AT_in), outdoor shaded Apparent Temperature (AT_out), Discomfort Index (DI), Effective Temperature (ET), Heat Index (HI), Humidex (HMI), Modified Discomfort Index (MDI), Net Effective Temperature (NET), Wet-Bulb Temperature (WBT), simplified Wet-Bulb Globe Temperature (sWBGT), and Wind Chill Temperature (WCT). Compared with our previous HiTIC-Monthly dataset, the current HiTIC-NCP dataset has improved the temporal resolution from a monthly to a daily scale. It also incorporates different predictors by including new variables such as daily precipitable water vapor, which reflect the high-frequency (day-to-day) fluctuations of atmospheric humidity that induces excessive heat stress as perceived by human body. This study is thus of great significance for regional climate change, urban planning, urban thermal environment, and public health research.

Methods

Overview

The data used in this study mainly include meteorological station observations and four gridded datasets. The Light Gradient Boosting Machine (LightGBM) algorithm proposed by Ke et al.⁴⁸ is used to produce 12 human thermal indices. The workflow of this study is shown in Fig. 1. First, we derived daily HPT indices from daily values of temperature, humidity, and wind speed based on station observations, and preprocessed the daily gridded datasets. We extracted the covariate values at meteorological stations from gridded datasets to form candidate samples (which include station-level HPTs and their co-located covariate values), then randomly selected 80% of the samples as a training set and 20% of that as a validation set. Hyperparameter optimization based on a 5-fold cross-validation was used to determine the optimal parameters of the model by the training set, and a systematic evaluation of model performance was conducted based on the validation set. Finally, a collection of 12 HPTs was generated by the trained LightGBM model. The detailed data and methods are described as follows.

Meteorological observations

The station-observed daily data including surface air temperature, relative humidity, and wind speed from 2003 to 2020 were collected from the China Meteorological Data Service Center (CMDSC, http://data.cma.cn). All station data have been strictly quality-controlled based on the method provided by Xu et al.⁴⁹, including homogenizing climate data and processing outliers in the data series. The observational network includes 374 stations across NCP (Fig. 2). In this study, we calculated daily human-perceived temperature from meteorological observations and extracted the covariate values from gridded datasets at the same locations. The data were stored in the form of one per station per day, then processed and divided into training (80%) and validation (20%) sets for model calibration. The calculation of the 12 human thermal indices is described in Table 1.

Table 1 Equations of the 12 thermal indices.

Full size table

Gridded datasets

Four gridded datasets with a total of seven independent variables (Table 2) are used as input in the LightGBM model. These variables include land surface temperature (LST), population density, elevation, slope, aspect, precipitable water vapor, and day of the year.

Table 2 Gridded datasets and covariates used in this study.

Full size table

Table 3 Overall R², MAE, and RMSE values of the 12 human thermal indices over NCP from 2003 to 2020.

Full size table

LST is closely related to near-surface air temperature⁵⁰. In this study, we adopted an LST dataset named MODIS-like LST⁵¹. This dataset was developed based on the Moderate Resolution Imaging Spectroradiometer (MODIS) LST dataset and the gap-filling algorithm⁵¹ to make it seamless at a global scale in 1 km grids from 2003 to 2020. The average root mean squared error (RMSE) of this LST dataset is 1.80–2.03 °C and 1.23–1.45 °C for daytime and nighttime temperatures, respectively. The daily mean LST was calculated by averaging the corresponding daytime and nighttime LSTs. In addition, population distribution and regional population density are demonstrated to affect regional temperature^52,53,54,55, and we used a high-accuracy population density dataset provided by the WorldPop project team as a predictor⁵⁶.

The Multi-Error-Removed Improved-Terrain (MERIT) Digital Elevation Model (DEM)⁵⁷ with a spatial resolution of 90 m was used in this study to extract slope and aspect. Compared with previous DEM data (e.g., SRTM and AW3D-30m), the errors of MERIT DEM are significantly reduced, and the accuracy in the flat area has been effectively improved^57,58,59. In addition, precipitation has been proven to affect air temperature and humidity^60,61. Therefore, a global daily precipitable water vapor dataset with a spatial resolution of 1 km was used.

The LightGBM machine learning algorithm

LightGBM is a gradient boosting decision tree (GBDT) based machine learning algorithm. Compared with other algorithms based on GBDT such as XGBoost and pGBRT, LightGBM has improved and optimized by the Gradient-based One-Side Sampling (GOSS) and Exclusive Feature Bundling (EFB) technologies, which allow LightGBM to better retain samples with large gradients without significantly altering the original data shape and focus more on undertrained samples⁴⁸. LightGBM consumes much less memory and improves training efficiency by more than 20 times while maintaining nearly the same accuracy⁴⁸. Due to its superior degree of generalizability, extremely fast processing speed, and high accuracy, LightGBM has been widely applied in different fields (e.g., climatology, air pollution, social psychology)^{47,62,63,64,65,66,67}. As shown in Fig. 1, the daily gridded datasets were fed to the trained LightGBM model for HPT prediction.

Hyperparameters tunning

The accuracy of machine learning relies heavily on parameterization⁶⁸. Hyperparameter optimization configures the model by searching for various model hyperparameters and selecting a subset that achieves the best performance on the given dataset. The grid search-based hyperparameters tunning method was adopted in our study⁶⁹.

Accuracy evaluation

Three statistical metrics were used to assess the performance of the training model, including the determination coefficient (R²), root mean absolute error (MAE), and RMSE. The R², RMSE, and MAE values are computed as follows:

$$\begin{array}{c}{R}^{2}=1-\frac{{\sum }_{i=1}^{N}{\left({x}_{oi}-{x}_{pi}\right)}^{2}}{{\sum }_{i=1}^{N}{\left({x}_{pi}-\overline{{x}_{o}}\right)}^{2}}\end{array}$$

(1)

$$\begin{array}{c}RMSE=\sqrt{\frac{1}{N}{\sum }_{i=1}^{N}{\left({x}_{oi}-{x}_{pi}\right)}^{2}}\end{array}$$

(2)

$$\begin{array}{c}MAE=\frac{1}{N}{\sum }_{i=1}^{N}\left|{x}_{oi}-{x}_{pi}\right|\end{array}$$

(3)

where N represents the number of data points, x_p and x_o are the i_th prediction and observation values, respectively.

Data Records

The high spatial resolution (1 km) human thermal index collection at a daily scale from 2003 to 2020 over NCP (HiTIC-NCP) shows a high consistency between observations and predictions, and the product has respectable accuracies. HiTIC-NCP contains 216.rar files, with individual files decompressed into the NetCDF format. The dataset is available on the general-purpose repository Zenodo, at https://doi.org/10.5281/zenodo.7528001⁷⁰. This dataset is stacked by year and each stack consists of daily images by day of the year in the NetCDF format. The unit of the dataset is 0.01 degree Celsius (°C), and the values are stored in an integer data type (Int16) to save storage space, and thus need to be divided by 100 to get the values in degree Celsius when in use. The geographic coordinate system of the dataset is World Geodetic System 1984 (WGS84) Coordinate System. The spatial distribution of the 12 human thermal indices on August 13, 2013, one of the severest heat stress days in the history of NCP, and January 3, 2008, when the sharpest snowstorm struck NCP, are shown as examples in Figs. 3, 4.

Technical Validation

Overall accuracy of 12 predicted human thermal indices

We assessed the overall accuracy of 12 human thermal indices over NCP during 2003–2020 in terms of R², MAE, and RMSE. All indices exhibit desirable performance, with the averaged R², MAE, and RMSE being 0.987, 1.292 °C, and 0.970 °C, respectively. It demonstrates that the LightGBM model has good effectiveness in the prediction of human thermal indices at a daily scale. Figure 5 displays the scatter plots of observations versus predictions of 12 human thermal indices. The closer the scattered points are to the 1:1 line, the better agreement between the predicted and observed values.

The R², MAE, and RMSE values of 12 predicted thermal indices during 2003–2020 are shown in Fig. 6. The R² values range from 0.975 to 0.990, the MAE values are all smaller than 1.4 °C, and the RMSE values are smaller than 1.9 °C (Table 3). Particularly, sWBGT has the highest R² values of 0.990, followed by HMI, WBT, and DI, and its MAE and RMSE values are 0.698 °C and 0.926 °C, respectively. ET is with an R² of 0.987 and has the lowest MAE and RMSE values (i.e., 0.598 °C and 0.785 °C, respectively). WCT and NET exhibit the highest MAEs (i.e., 1.17 and 1.37 °C, respectively) and RMSEs (i.e., 1.61 and 1.84 °C, respectively). It is likely caused by the involvement of wind speed in the calculation of WCT and NET (recall Table 1).

Spatial variations of prediction accuracy

Figures 7–9 depict the distributions of R², MAE, and RMSE values in 12 human thermal indices at individual stations of NCP during 2003–2020. As shown in Fig. 7, the high R² values (i.e., ranging from 0.970 to 0.995) are seen at most stations across NCP, especially in the central, southwestern, and northern parts. The R² values of NET at most stations have relatively lower accuracies, likely caused by the inclusion of wind speed. Figures 8, 9 show that ET has the best results with the lowest MAEs and RMSEs (i.e., < 0.911 °C and < 1.161 °C, respectively, see Figs. 8d, 9d). Compared with other 11 human thermal indices, NET has higher MAE and RMSE values (i.e., > 1.058 °C and 1.415 °C, respectively, see Figs. 8k, 9k). Spatially, MAEs and RMSEs exhibit an increasing tendency from low to high elevation, with smaller MAE and RMSE values in inland areas with lower elevation and relatively larger values in mountainous areas with higher elevation.

Prediction accuracy in different periods

Figure 10 illustrates the averaged R², MAE, and RMSE values of 12 indices in individual years of 2003–2020. All R² values of 12 indices in all years exhibit well performance (i.e., > 0.955 °C, Fig. 10a). Better annual predictions appear in WBT over 2008–2013 and sWBGT over 2008–2014, with R² exceeding 0.990. The lowest R² is observed in NET in 2003 (i.e., 0.956). Annual averaged MAEs and RMSEs of all indices in nearly all years are less than 1.5 °C and 2.0 °C, respectively (Fig. 10b,c), demonstrating that our product is reliable in all years.

Code availability

The HiTIC-NCP dataset generation codes are available on GitHub (https://github.com/CSLixiang/HiTIC-NCP.git), and operational under Python 3.8 or JavaScript. In the GitHub repository, we uploaded three code scripts, i.e., “Data preprocessing code.py”, “HiTIC-NCP Code.py” and “Figures code.py”. Additionally, the data samples were uploaded to the “Data Samples” folder.

References

IPCC. Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. (Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 2021).
Meehl, G. A. & Tebaldi, C. More intense, more frequent, and longer lasting heat waves in the 21st century. Science. 305, 994–997 (2004).
ADS CAS PubMed Google Scholar
Alexander, L. V. et al. Global observed changes in daily climate extremes of temperature and precipitation. Journal of Geophysical Research: Atmospheres. 111, D05109 (2006).
ADS Google Scholar
Wu, S. et al. Local mechanisms for global daytime, nighttime, and compound heatwaves. npj Climate and Atmospheric Science. 6, 36 (2023).
Google Scholar
Zhang, H. et al. Unequal urban heat burdens impede climate justice and equity goals. The Innovation. 4, 100488 (2023).
PubMed PubMed Central Google Scholar
Donovan, M. K. et al. Local conditions magnify coral loss after marine heatwaves. Science. 372, 977–980 (2021).
CAS PubMed Google Scholar
Zachariah, M., Mondal, A. & AghaKouchak, A. Probabilistic assessment of extreme heat stress on indian wheat yields under climate change. Geophysical Research Letters. 48, e2021GL094702 (2021).
ADS Google Scholar
Markonis, Y. et al. The rise of compound warm-season droughts in Europe. Science Advances. 7, eabb9668 (2021).
ADS PubMed PubMed Central Google Scholar
Meehl, G. A. et al. Future heat waves and surface ozone. Environmental Research Letters. 13, 064004 (2018).
ADS Google Scholar
Basu, R. High ambient temperature and mortality: a review of epidemiologic studies from 2001 to 2008. Environmental Health. 8, 40 (2009).
PubMed PubMed Central Google Scholar
Mazdiyasni, O. et al. Increasing probability of mortality during Indian heat waves. Science Advances. 3, e1700066 (2017).
ADS PubMed PubMed Central Google Scholar
Kovats, R. S. & Hajat, S. Heat stress and public health: a critical review. Annual Review of Public Health. 29, 41–55 (2008).
PubMed Google Scholar
Barriopedro, D., Fischer, E. M., Luterbacher, J., Trigo, R. M. & Garcia-Herrera, R. The hot summer of 2010: redrawing the temperature record map of Europe. Science. 332, 220–224 (2011).
ADS CAS PubMed Google Scholar
Robine, J. M. et al. Death toll exceeded 70,000 in Europe during the summer of 2003. Comptes Rendus Biologies. 331, 171–178 (2008).
PubMed Google Scholar
Kephart, J. L. et al. City-level impact of extreme temperatures and mortality in Latin America. Nature Medicine. 28, 1700–1705 (2022).
CAS PubMed PubMed Central Google Scholar
Oudin Åström, D., Forsberg, B., Ebi, K. L. & Rocklöv, J. Attributing mortality from extreme temperatures to climate change in Stockholm, Sweden. Nature Climate Change. 3, 1050–1054 (2013).
ADS Google Scholar
Luo, M. & Lau, N. C. Increasing human-perceived heat stress risks exacerbated by urbanization in China: a comparative study based on multiple metrics. Earth’s Future. 9, e2020EF001848 (2021).
ADS Google Scholar
Nazarian, N. & Lee, J. K. W. Personal assessment of urban heat exposure: a systematic review. Environmental Research Letters. 16, 033005 (2021).
ADS Google Scholar
Kumar, P., Rai, A., Upadhyaya, A. & Chakraborty, A. Analysis of heat stress and heat wave in the four metropolitan cities of India in recent period. Science of The Total Environment. 818, 151788 (2022).
ADS CAS PubMed Google Scholar
Oleson, K. W. et al. Interactions between urbanization, heat stress, and climate change. Climatic Change. 129, 525–541 (2013).
ADS Google Scholar
Chan, A. P. C. & Yi, W. Heat stress and its impacts on occupational health and performance. Indoor and Built Environment. 25, 3–5 (2016).
Google Scholar
Kjellstrom, T., Holmer, I. & Lemke, B. Workplace heat stress, health and productivity - an increasing challenge for low and middle-income countries during climate change. Glob Health Action. 2, 2047 (2009).
Google Scholar
Cheng, J. et al. Impacts of exposure to ambient temperature on burden of disease: a systematic review of epidemiological evidence. International Journal of Biometeorology. 63, 1099–1115 (2019).
ADS PubMed Google Scholar
Joubert, D., Thomsen, J. & Harrison, O. Safety in the heat: a comprehensive program for prevention of heat illness among workers in Abu Dhabi, United Arab Emirates. American Journal of Public Health. 101, 395–398 (2011).
PubMed PubMed Central Google Scholar
Monteiro, A., Carvalho, V., Oliveira, T. & Sousa, C. Excess mortality and morbidity during the July 2006 heat wave in Porto, Portugal. International Journal of Biometeorology. 57, 155–167 (2013).
ADS PubMed Google Scholar
Li, J., Chen, Y. D., Gan, T. Y. & Lau, N.-C. Elevated increases in human-perceived temperature under climate warming. Nature Climate Change. 8, 43–47 (2018).
ADS Google Scholar
Luo, M. & Lau, N.-C. Characteristics of summer heat stress in China during 1979‒2014: climatology and long-term trends. Climate Dynamics. 53, 5375–5388 (2019).
ADS Google Scholar
Wang, Y. et al. Human-perceived temperature changes over South China: long-term trends and urbanization effects. Atmospheric Research. 215, 116–127 (2019).
ADS Google Scholar
Jeong, S.-J. et al. Effects of double cropping on summer climate of the North China Plain and neighbouring regions. Nature Climate Change. 4, 615–619 (2014).
ADS Google Scholar
Gabriel, K. M. & Endlicher, W. R. Urban and rural mortality rates during heat waves in Berlin and Brandenburg, Germany. Environmental Pollution. 159, 2044–2050 (2011).
CAS PubMed Google Scholar
Zhang, X., Tang, Q., Zheng, J. & Ge, Q. Warming/cooling effects of cropland greenness changes during 1982–2006 in the North China Plain. Environmental Research Letters. 8, 024038 (2013).
ADS Google Scholar
Conlon, K., Monaghan, A., Hayden, M. & Wilhelmi, O. Potential impacts of future warming and land sse changes on intra-urban heat exposure in Houston, Texas. PLoS One. 11, e0148890 (2016).
PubMed PubMed Central Google Scholar
Bianca, W. Relative importance of dry- and wet-bulb temperatures in causing heat stress in Cattle. Nature 195, 251–252 (1962).
ADS CAS PubMed Google Scholar
Luo, M. & Lau, N. C. Increasing heat stress in urban areas of eastern China: acceleration by urbanization. Geophysical Research Letters. 45, 13,060–013,069 (2018).
Google Scholar
Mekis, É., Vincent, L. A., Shephard, M. W. & Zhang, X. Observed trends in severe weather conditions based on Humidex, Wind Chill, and heavy rainfall events in Canada for 1953–2012. Atmosphere-Ocean. 53, 383–397 (2015).
ADS Google Scholar
Yan, Y., Xu, Y. & Yue, S. A high-spatial-resolution dataset of human thermal stress indices over South and East Asia. Scientific Data. 8, 229 (2021).
PubMed PubMed Central Google Scholar
Chen, J. et al. Global socioeconomic exposure of heat extremes under climate change. Journal of Cleaner Production. 277, 123275 (2020).
Google Scholar
Li, T. et al. Heat-related mortality projections for cardiovascular and respiratory disease under the changing climate in Beijing, China. Scientific Reports. 5, 11441 (2015).
ADS CAS PubMed PubMed Central Google Scholar
Chen, C. C. et al. Projection of future temperature extremes, related mortality, and adaptation due to climate and population changes in Taiwan. Science of The Total Environment 760, 143373 (2021).
ADS CAS PubMed Google Scholar
Kang, S. & Eltahir, E. A. B. North China Plain threatened by deadly heatwaves due to climate change and irrigation. Nature Communications. 9, 2894 (2018).
ADS PubMed PubMed Central Google Scholar
Freychet, N., Tett, S. F. B., Yan, Z. & Li, Z. Underestimated change of wet-bulb temperatures over East and South China. Geophysical Research Letters. 47, e2019GL086140 (2020).
ADS Google Scholar
Ding, Y. et al. On the characteristics, driving forces and inter-decadal variability of the East Asian Summer Monsoon. Chinese Journal of Atmospheric Sciences. 42, 553–558 (2018).
Google Scholar
Cao, W., Wu, D., Huang, L., Pan, M. & Huhe, T. Determinizing the contributions of human activities and climate change on greening in the Beijing-Tianjin-Hebei Region, China. Scientific Reports. 11, 21201 (2021).
ADS PubMed PubMed Central Google Scholar
Yang, Y. et al. Trend analysis on temperature and precipitation over North China Plain for past five decades. IOP Conference Series: Earth and Environmental Science. 675, 012014 (2021).
Google Scholar
Zhao, G. N., Li, B. Z., Kong, P., Xia, L. J. & Zhan, M. J. Population exposure to large-scale heatwaves in China for 1961-2015. IOP Conference Series: Earth and Environmental Science. 344, 012073 (2019).
Google Scholar
Wang, P. et al. Urbanization contribution to human perceived temperature changes in major urban agglomerations of China. Urban Climate. 38, 100910 (2021).
Google Scholar
Zhang, H. et al. HiTIC-Monthly: a monthly high spatial resolution (1km) human thermal index collection over China during 2003–2020. Earth System Science Data. 15, 359–381 (2023).
ADS Google Scholar
Ke, G. et al. in Proceedings of the 31st International Conference on Neural Information Processing Systems 3149–3157 (Curran Associates Inc., Long Beach, California, USA, 2017).
Xu, W. et al. Homogenization of chinese daily surface air temperatures and analysis of trends in the extreme temperature indices. Journal of Geophysical Research: Atmospheres. 118, 9708–9720 (2013).
ADS Google Scholar
Jin, M., Dickinson, R. E. & Vogelmann, A. M. A comparison of CCM2–BATS skin temperature and surface-air temperature with satellite and surface observations. Journal of Climate. 10, 1505–1524 (1997).
ADS Google Scholar
Zhang, T., Zhou, Y., Zhu, Z., Li, X. & Asrar, G. R. A global seamless 1 km resolution daily land surface temperature dataset (2003–2020). Earth System Science Data. 14, 651–664 (2022).
ADS Google Scholar
Jendritzky, G. & Tinz, B. The thermal environment of the human being on the global scale. Glob Health Action. 2, 2005 (2009).
Google Scholar
Li, L. et al. Impact of land cover and population density on land surface temperature: case study in Wuhan, China. Journal of Applied Remote Sensing. 8, 084993 (2014).
ADS Google Scholar
Hu, L., Wilhelmi, O. V. & Uejio, C. Assessment of heat exposure in cities: combining the dynamics of temperature and population. Science of The Total Environment. 655, 1–12 (2019).
ADS CAS PubMed Google Scholar
Jaber, S. M. Is there a relationship between human population distribution and land surface temperature? Global perspective in areas with different climatic classifications. Remote Sensing Applications: Society and Environment. 20, 100435 (2020).
Google Scholar
Gaughan, A. E., Stevens, F. R., Linard, C., Jia, P. & Tatem, A. J. High resolution population distribution maps for Southeast Asia in 2010 and 2015. PLoS One. 8, e55882 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Yamazaki, D. et al. A high-accuracy map of global terrain elevations. Geophysical Research Letters. 44, 5844–5853 (2017).
ADS Google Scholar
Harding, D. J. ICESat waveform measurements of within-footprint topographic relief and vegetation vertical structure. Geophysical Research Letters. 32, L21S10 (2005).
Google Scholar
Farr, T. G. et al. The shuttle radar topography mission. Reviews of Geophysics. 45, 2005RG000183 (2007).
Google Scholar
Reber, E. E. On the correlation of the total precipitable water in a vertical column and absolute humidity at the surface. Journal of Applied Meteorology and Climatology. 11, 1322–1325 (1972).
Google Scholar
Tian, J., Zhang, Z., Zhao, T., Tao, H. & Zhu, B. Warmer and wetter climate induced by the continual increase in atmospheric temperature and precipitable water vapor over the arid and semi–arid regions of Northwest China. Journal of Hydrology: Regional Studies. 42, 101151 (2022).
Google Scholar
Tian, H. et al. Estimating PM2.5 from multisource data: a comparison of different machine learning models in the Pearl River Delta of China. Urban Climate. 35, 100740 (2021).
Google Scholar
Zhang, R., Jia, X. & Qian, Q. Analysis of lower-boundary climate factors contributing to the summer heatwave frequency over eastern Europe using a machine-learning model. Atmospheric and Oceanic Science Letters. 15, 100256 (2022).
Google Scholar
Fan, J. et al. Light gradient boosting machine: an efficient soft computing model for estimating daily reference evapotranspiration with local and external meteorological data. Agricultural Water Management. 225, 105758 (2019).
Google Scholar
Ma, J., Zhang, R., Xu, J. & Yu, Z. MERRA-2 PM2.5 mass concentration reconstruction in China mainland based on LightGBM machine learning. Science of The Total Environment. 827, 154363 (2022).
ADS CAS PubMed Google Scholar
He, C. et al. Deriving full-coverage and fine-scale XCO₂ across china based on OCO‐2 satellite retrievals and CarbonTracker output. Geophysical Research Letters. 49, e2022GL098435 (2022).
ADS CAS Google Scholar
Alzamzami, F., Hoda, M. & El Saddik, A. Light gradient boosting machine for general sentiment classification on short texts: a comparative evaluation. IEEE Access. 8, 101840–101858 (2020).
Google Scholar
Sarker, I. H. Machine Learning: algorithms, real-world applications and research directions. SN Computer Science. 2, 160 (2021).
PubMed PubMed Central Google Scholar
Bergstra, J. & Bengio, Y. Random search for hyper-parameter optimization. Journal of Machine Learning Research 13, 281–305 (2012).
MathSciNet MATH Google Scholar
Li, X. et al. A daily high-resolution (1 km) human thermal index collection over the North China Plain from 2003 to 2020 [Data set]. Zenodo https://doi.org/10.5281/zenodo.7528001 (2023).
Steadman, R. G. The assessment of sultriness. part I: a temperature-humidity index based on human physiology and clothing science. Journal of Applied Meteorology and Climatology. 18, 861–873 (1979).
Google Scholar
Steadman, R. G. A universal scale of apparent temperature. Journal of Applied Meteorology and Climatology. 23, 1674–1687 (1984).
Google Scholar
Epstein, Y. & Moran, D. Thermal comfort and the heat stress indices. Industrial health. 44, 388–398 (2006).
PubMed Google Scholar
Gagge, A. P., Stolwijk, J. & Nishi, Y. An effective temperature scale based on a simple model of human physiological regulatiry response. ASHRAE Transactions. 77, 247–262 (1971).
Google Scholar
Rothfusz, L. P. & Headquarters, N. S. R. The heat index equation (or, more than you ever wanted to know about heat index). Fort Worth, Texas: National Oceanic and Atmospheric Administration, National Weather Service, Office of Meteorology. 9023, 640 (1990).
Google Scholar
Masterton, J. M. & Richardson, F. A. Humidex: a Method of Quantifying Human Discomfort Due to Excessive Heat and Humidity. Vol. 79 (Environment Canada, Atmospheric Environment, 1979).
Moran, D., Shapiro, Y., Epstein, Y., Matthew, W. & Pandolfi, K. A modified discomfort index (MDI) as an alternative to the wet bulb globel temperature (WBGT). 77–80 (Environmental Ergonomics VIII, 1998).
Willett, K. M. & Sherwood, S. Exceedance of heat index thresholds for 15 regions under a warming climate using the wet-bulb globe temperature. International Journal of Climatology. 32, 161–177 (2012).
ADS Google Scholar
Stull, R. Wet-Bulb Temperature from Relative Humidity and Air Temperature. Journal of Applied Meteorology and Climatology. 50, 2267–2269 (2011).
ADS Google Scholar
Houghton, F. C. & Yaglou, C. P. Determining equal comfort lines. J. Am. Soc. Heating and Ventilation in England. 29, 165–176 (1923).
Google Scholar
Osczevski, R. & Bluestein, M. The new wind chill equivalent temperature chart. Bulletin of the American Meteorological Society. 86, 1453–1458 (2005).
ADS Google Scholar
Lyapustin, A. & Wang, Y. MODIS/Terra+Aqua Land Aerosol Optical Depth Daily L2G Global 1km SIN Grid V061 [Data set]. Accessed 2022-03-05 from. NASA EOSDIS Land Processes DAAC. https://doi.org/10.5067/MODIS/MCD19A2.061 (2022).

Download references

Acknowledgements

This study is funded by the National Natural Science Foundation of China (42371028, 41871029, 41871306), the National Key R&D Program of China (2019YFC1510400), and the Science and Technology Program of Guangzhou (202102020489).

Author information

Authors and Affiliations

School of Geography and Planning, and Guangdong Key Laboratory for Urbanization and Geo-simulation, Sun Yat-sen University, Guangzhou, 510006, China
Xiang Li, Ming Luo, Hui Zhang, Ziwei Huang, Sijia Wu, Peng Wang, Xiaoyu Wang & Yu Tang
Institute of Environment, Energy and Sustainability, The Chinese University of Hong Kong, Hong Kong, China
Ming Luo
Key Laboratory of Watershed Geographic Sciences, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing, 210008, China
Yongquan Zhao
Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, M5T 3M7, Canada
Erjia Ge

Authors

Xiang Li
View author publications
Search author on:PubMed Google Scholar
Ming Luo
View author publications
Search author on:PubMed Google Scholar
Yongquan Zhao
View author publications
Search author on:PubMed Google Scholar
Hui Zhang
View author publications
Search author on:PubMed Google Scholar
Erjia Ge
View author publications
Search author on:PubMed Google Scholar
Ziwei Huang
View author publications
Search author on:PubMed Google Scholar
Sijia Wu
View author publications
Search author on:PubMed Google Scholar
Peng Wang
View author publications
Search author on:PubMed Google Scholar
Xiaoyu Wang
View author publications
Search author on:PubMed Google Scholar
Yu Tang
View author publications
Search author on:PubMed Google Scholar

Contributions

M.L. and Y.Z. designed the study. X.L. carried out the analysis and wrote the first draft of the manuscript. All co-authors discussed the results and edited the manuscript.

Corresponding authors

Correspondence to Ming Luo or Yongquan Zhao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, X., Luo, M., Zhao, Y. et al. A daily high-resolution (1 km) human thermal index collection over the North China Plain from 2003 to 2020. Sci Data 10, 634 (2023). https://doi.org/10.1038/s41597-023-02535-y

Download citation

Received: 23 February 2023
Accepted: 01 September 2023
Published: 18 September 2023
Version of record: 18 September 2023
DOI: https://doi.org/10.1038/s41597-023-02535-y