Vectorized rooftop area data for 90 cities in China

Zhang, Zhixin; Qian, Zhen; Zhong, Teng; Chen, Min; Zhang, Kai; Yang, Yue; Zhu, Rui; Zhang, Fan; Zhang, Haoran; Zhou, Fangzhuo; Yu, Jianing; Zhang, Bingyue; Lü, Guonian; Yan, Jinyue

doi:10.1038/s41597-022-01168-x

Download PDF

Data Descriptor
Open access
Published: 02 March 2022

Vectorized rooftop area data for 90 cities in China

Zhixin Zhang ORCID: orcid.org/0000-0002-3898-0863^1,2,3,
Zhen Qian ORCID: orcid.org/0000-0002-0423-7430^1,2,3,
Teng Zhong^1,2,3,
Min Chen ORCID: orcid.org/0000-0001-8922-8789^1,2,3,4,
Kai Zhang^1,2,3,
Yue Yang^1,2,3,
Rui Zhu⁵,
Fan Zhang ORCID: orcid.org/0000-0002-3643-018X⁶,
Haoran Zhang^7,8,9,
Fangzhuo Zhou^1,2,3,
Jianing Yu^1,2,3,
Bingyue Zhang^1,2,3,
Guonian Lü^1,2,3 &
…
Jinyue Yan^9,10

Scientific Data volume 9, Article number: 66 (2022) Cite this article

15k Accesses
131 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Reliable information on building rooftops is crucial for utilizing limited urban space effectively. In recent decades, the demand for accurate and up-to-date data on the areas of rooftops on a large-scale is increasing. However, obtaining these data is challenging due to the limited capability of conventional computer vision methods and the high cost of 3D modeling involving aerial photogrammetry. In this study, a geospatial artificial intelligence framework is presented to obtain data for rooftops using high-resolution open-access remote sensing imagery. This framework is used to generate vectorized data for rooftops in 90 cities in China. The data was validated on test samples of 180 km² across different regions with spatial resolution, overall accuracy, and F1 score of 1 m, 97.95%, and 83.11%, respectively. In addition, the generated rooftop area conforms to the urban morphological characteristics and reflects urbanization level. These results demonstrate that the generated dataset can be used for data support and decision-making that can facilitate sustainable urban development effectively.

Measurement(s)	building rooftop area
Technology Type(s)	computational modeling technique
Sample Characteristic - Environment	city
Sample Characteristic - Location	China

Building rooftop extraction from high resolution aerial images using multiscale global perceptron with spatial context refinement

Article Open access 22 February 2025

Carbon mitigation potential afforded by rooftop photovoltaic in China

Article Open access 24 April 2023

Vectorized building rooftop prints of the Qinghai-Tibetan Plateau and its neighboring regions

Article Open access 17 June 2025

Background & Summary

Rooftops of buildings have been intensively studied in fields such as sustainable urban development, building energy modeling, and urban planning and design in recent decades^1,2,3. Owing to urbanization associated with the digital age, reliable information on rooftops is in increasing demand^4,5,6. The rapid access to accurate rooftop information is important for the evaluation of urban and rural development trends. These trends are useful for formulating development strategies and protecting urban and rural ecosystems^7,8,9. However, data on rooftop areas are unavailable in many developing countries because of resource constraints. Therefore, methods suitable for generating reliable data on rooftop areas of buildings at low cost are urgently needed^10,11,12.

The automatic extraction of rooftop area data is gaining popularity in diverse fields, and studies involving varied data sources exist¹³. Three-dimensional (3D) spatial data, such as the Digital Surface Model (DSM) and Light Detection and Ranging (LiDAR), are exploited for reconstructing buildings, which includes the rooftop area representation and geometric modeling^13,14,15,16. However, the costs of acquiring 3D spatial data and of constructing the associated 3D models are costly, especially at the city scale. Due to the development of image processing algorithms, such as the edge detection and image segmentation, rooftops data can be extracted from high-resolution remote sensing imagery^17,18. Conventional image process techniques, however, involve complex empirical rules and threshold settings, and thus, exhibit limitations when applied to high-resolution remote sensing imagery in large-scale¹⁴.

Open-access data from public service providers, such as Google Earth, Baidu Map, and OpenStreetMap, provide opportunities for the acquisition of urban information associated with broad coverage, fast updating speeds, and low cost^19,20,21. However, although open-access data, for example, from a Google Earth Satellite (GES) image are valuable for obtaining information on cities, conventional processing methods hardly discover in-depth semantic information and lack flexibility when examining large data involving complex features²².

In recent years, deep learning methods have been employed for efficient feature learning and urban information acquisition^23,24. In fact, deep learning-based image semantic segmentation methods have been applied for the extraction of rooftops data^25,26. Nevertheless, regarding the optimization of rooftop data extraction applications, the data acquisition process requires the incorporation of geographic information^27,28. In addition, public rooftop area datasets that are suitable for use as training dataset in machine learning are scant²⁹. Further, naive deep learning models based on unbalanced and insufficient training samples exhibit unsatisfactory performances³⁰. Therefore, a robust high-performance rooftop extraction model remains elusive in China.

In the present study, the main objective is to extract accurate rooftop areas in China using high-resolution open-access remote sensing imagery based on a geospatial artificial intelligence (GeoAI) framework. The principal components of this framework are illustrated in Fig. 1. The following steps were employed for generating the rooftop area dataset: (1) data preparation through spatial stratified sampling involving geospatial prior knowledge and data processing pipeline to augment the representativeness and number of samples; (2) creation of a deep learning segmentation model, which is based on an ensemble learning strategy and an improved prediction method to improve the rooftop extraction performance.

Based on the framework, we developed a national-scale vectorized rooftop area dataset involving 90 cities in China. The data was validated on test samples of 180 km² across different regions with spatial resolution, overall accuracy, and F1 score of 1 m, 97.95%, and 83.11%, respectively.

Methods

Data collection

GES imagery

In the present study, open-access GES imagery served as the primary data source (Table 1). GES imagery is advantageous because of its high resolution and wide coverage. These images were downloaded in October 2020 using Python scripts in the open map service application program interface (Google Earth API) provided by Google. The spatial resolution of GES imagery varies according to its image level. The spatial resolution of the 18-level GES imagery was approximately 0.6 m/pixel in most developed countries, and this adequately displayed the geometry and structure of different rooftops. However, because the GES images originated from multiple sensors, differences in their performance induced variations in the resolution from region to region. For example, in China, images of major cities are usually obtained from the WorldView, and QuickBird satellites, and these involve an original resolution within 1 m. In contrast, images of remote areas are acquired from the SPOT series satellites, and the original resolution of which is within 5 m.

Table 1 Data type, provided information, and the source used for accessing data involved in the present study.

Full size table

FROM-GLC30

The spatial stratified sampling standard was based on a priori knowledge of the urban land cover from the global 30 m resolution land cover data (FROM-GLC30) created in 2017 (Table 1). These data include the following types: cultivated land, woodland, grassland, shrubland, wetland, waterbody, tundra, artificial surface, bare land, glacier, and permafrost. The overall accuracy of the FROM-GLC30 data is 72.43%³¹, which is based on a global all-season validation sample set from more than 36,000 locations.

Site selection

In the present study, we selected 90 cities in China (Table 2), and these were partitioned into four tiers based on the city administration hierarchy established by the government in China. Tier 1 involved municipalities with a central administration and regions linked to a special administration in China. Tier 2 comprised mainly sub-provincial cities, while Tier 3 involved provincial capitals and major prefecture-level cities. Tier 4 contained ordinary prefecture-level cities, and the locations of these 90 cities are displayed in Fig. 2. These cities involve all provincial capitals and major administrative levels in China, and their distribution covers different climate regions, and thus, highlight the economics, political, and geographic adequacy. In terms of population, the 90 selected cities cover about 40% of the entire China.

Table 2 Data for the 90 cities in China involved in the present study.

Full size table

Information about the characteristics of the 90 cities in each tier is presented in Table 3 (collated from national statistics^32,33, no data available for Hong Kong and Macau). The administrative center bias in resource allocation makes cities with higher administrative rank more advantageous in terms of expansion. Therefore, the average size of population and built district shows an increasing trend from Tier 4 to Tier 1. Cities in Tier 1 and Tier 2 are mostly the economic centers of provinces, with developed economies and high urbanization rates, and usually show a multi-core urban morphology. Cities in Tier 3 and Tier 4, on the other hand, usually have a single-core urban morphology. In addition, the shape of cities in each tier generally varies depending on the topography. Cities in the plains tend to have a clumped shape, while cities along rivers and valleys tend to have a striped shape.

Table 3 Characteristics information of the 90 cities in different tiers.

Full size table

Data preparation

GES imagery preprocessing

The quality of a GES image varies based on the imaging sensor, imaging time, and environmental factors (e.g., atmospheric condition and climate), all of which affect the model training and generalization. Therefore, standardization procedure for GES imagery is necessary, and in the present study, the Gamma Correction algorithm³⁴ and Contrast Limited Adaptive Histogram Equalization algorithm³⁵ were used to resolve brightness and sharpness issues.

Spatial stratified sampling strategy

The study area involves several land cover types, and the proportion of unbuilt areas including water, cultivated land, and forest, exceeds that of built-up areas. Therefore, regular random sampling will create the imbalanced category problem, in which the proportion of negative samples (non-rooftop samples) significantly surpasses that of positive samples (rooftop samples). Therefore, the priori knowledge of the urban land cover was utilized to partition the study area into built-up and unbuilt areas based on the FROM-GLC30 data. The built-up area contains mainly artificial surfaces, which easily yield positive samples. In contrast, the unbuilt area comprises water bodies, wetlands, grasslands, bare lands, cultivated lands, shrublands, and forests, which commonly provide negative samples.

However, based on empirical evidence from previous studies in unbuilt areas, we find farmlands, bare lands, and intersections of different land cover types can be misclassified by the rooftop extraction model. Therefore, in the present study, the unbuilt areas were manually filtered to determine confusing areas, and these areas supplied confusing negative samples.

Therefore, to obtain representative and balanced positive and negative samples, spatial stratified sampling was employed in the sample acquisition. The results of stratified sampling in the built-up and confusing areas are displayed in Fig. 3, and this approach was used to produce patches from the GES images.

Sample processing and division

In the present study, sample images were correspondingly labeled positive and negative manually. Patches of the image samples and the corresponding masks were then divided into training samples (served as input for the rooftop extraction model training) and test samples (served as ground truth for the validation of rooftop area dataset).

During the rooftop extraction model training task, the built-up area covered by the image samples and their corresponding masks was 200 km², while the confusing area was 160 km², and thus, the total training samples covering 360 km². According to previous deep learning studies, higher training data volume produce more robust models. However, in practice, available data are usually limited. To resolve this limitation, data augmentation without changing the labeled categories was conducted, thereby enhancing the generalization potential of the model. Data augmentation operations used in the present study included the following: random cropping, image rotation, image flipping, image blurring, and noise addition.

Modelling

Naive rooftop extraction model

Rooftop areas of cities were extracted using the DeepLabV3+ model and GES images. The DeepLabV3+ is an open-source image semantic segmentation model that was launched by the Google R&D team³⁶. In GES images, the rooftop areas exhibit varied sizes and shapes because of differences in architectural styles and dimensions³⁷. Rooftop edges are sometimes difficult to accurately identify because the GES image quality is affected by weather conditions³⁸. DeepLabV3 + can perceive features of different scales, thereby improving the recognition accuracy for multifarious rooftops. In addition, the DeepLabV3+ enables transformation of the feature map into a constant resolution map based on the encoder–decoder structure, and this resolves the blurred edges of the rooftop area extraction issue³⁹.

In the present study, the cross-entropy⁴⁰ and dice loss functions⁴¹ were integrated to generate a composite loss function that can simultaneously handle the imbalance categories of samples problem and accelerate the convergence of training. These functions are expressed as follows:

$$L\left({p}_{i},{p}_{i}\ast \right)=\alpha \ast {L}_{dice}\left({p}_{i},{p}_{i}\ast \right)+\beta \ast {L}_{bce}\left({p}_{i},{p}_{i}\ast \right)$$

(1)

where p_i denotes the predicted value of the i^th sample, ${p}_{i}^{\ast }$ represents the ground truth value of the ith sample, L_dice(·) is the dice loss function, L_bce(·) stands for the cross-entropy loss function, and α and β are weight coefficients of the loss function, with corresponding values of 0.2 and 0.8 in the present study.

Expansion prediction

To extract rooftop areas, standardized GES images served as input for the ensemble model. Considering that original GES images were significantly larger than the required dimension for model input, cropping into smaller patches was necessary for the prediction. However, this creates an uneven transition or stitched problem at the splicing gap of the prediction result of cropped smaller images⁴². The expansion prediction techniques are suitable for eliminating this uneven transition at the splicing gap.

Steps implemented in the expansion prediction (Fig. 4) include the following: (1) An n × n sliding window characterized by a step size of n was created. During movement, this window expanded to m × m, and the original remote sensing image was then cropped into small patches. (2) The cropped images were concatenated into a tensor. (3) The ensemble rooftop extraction model was then used to predict the tensor obtained in step 2. (4) The central portion of each n x n tensor element was extracted and split into patches. (5) Patches obtained in step 4 were stitched to produce a large predicted image, which was then cropped based on the original GES image.

Model training

Overall network parameters were fine-tuned using the MS COCO dataset⁴³ and pre-trained Xception backbone network⁴⁴, while the AdamW optimizer⁴⁵ and Cosine Annealing Warm Restarts algorithm⁴⁶ were employed for rapid convergence of the network. Four Tesla V100 GPUs were used to train the model using the mixed accuracy method, and the development was based on the Pytorch framework⁴⁷. The hyperparameter settings are presented in Table 4.

Table 4 Summary of data for hyperparameters utilized in the present study.

Full size table

After the model reaches convergence, rooftop areas were predicted using the snapshot⁴⁸ based on union ensemble rule and expansion prediction techniques, the union ensemble rule is defined in Eq. 2. Considering the trade-off between the quality and speed of prediction, two local optimal models (in this work, checkpoints of the 29^th and 55^th epochs were chosen) were integrated into the ensemble rooftop extraction model. In addition, the TensorRT framework was used to accelerate the model calculation based on the FP16 accuracy. Accordingly, the prediction speed was increased approximately 5-fold without a significant loss in the accuracy, thereby reducing the prediction time from 100 to 20 days.

$${P}_{union-ensemble}=\mathop{\bigcup }\limits_{i=1}^{N}{P}_{i}$$

(2)

where P_i denotes the matrix of the prediction category, ${P}_{union-ensemble}$ represents the result of the union integration, and N is the number of sub models.

Data Records

Data file path

The rooftop area dataset will be updated regularly corresponding to urbanization process of different cities in China. The dataset, metadata, and detailed documentation are freely available for all users at National Tibetan Plateau Data Center (https://doi.org/10.11888/Geogra.tpdc.271702)⁴⁹.

Data file format

The rooftop area dataset is arranged on the tier of cities, where rooftop area data of each city is ESRI Shapefile format⁵⁰, which is composed of .shx, .shp, .prj, .dbf, and .cpg files. The dataset is divided into the original version and the simplified version. The original version is converted by prediction results from model directly, and on which simplified version is obtained by using the Douglas–Peucker algorithm⁵¹. The total sizes of two version dataset are 118 GB and 21.3 GB without compression.

File structure

The rooftop area data of each city contains three fields (area, X and Y), as presented in Table 5. All area fields are double float format, in square meters, calculated in CGCS 2000 Albers geographic coordinates. All X and Y fields are double float format, in decimal degrees, calculated in WGS 1984 Web Mercator Auxiliary Sphere geographic coordinates.

Table 5 Field description for rooftop area dataset.

Full size table

Technical Validation

Sampling design

Based on the spatial stratified sampling method proposed in this study, the test samples used for the validation of the rooftop area dataset are obtained from GES images and manually labeled with the ground truth through visual interpretation. To better reflect the quality of the rooftop extraction results in each city tier, four tiers of 45 km² were created, and these produced a test dataset covering 180 km², in which the built-up and the confusing areas covered by the image samples and their corresponding masks were 100 and 80 km², respectively.

Analysis design

Qualitative and quantitative evaluation criteria were utilized to validate the rooftop area dataset generated. Regarding the qualitative evaluation, morphological and topological characteristics of ground truth and extracted rooftop area data for various city tiers were compared. Conversely, for the quantitative evaluation, a testing dataset covering 180 km² comprising four city tiers of 45 km² was employed. In the present study, indicators calculated based on the confusion matrix⁵² include accuracy, precision, recall, and F1 score were used. The closer the values of these indicators are to 100%, the higher the quality of the rooftop extraction. It should be mentioned that precision is equivalent to user accuracy, which is a measure of exactness, and recall is equivalent to producer accuracy, which is a measure of completeness. Precision and recall can also be communicated in terms of error, as either commission error (1-precision) or omission error (1-recall). In particular, the F1 score, which is a weighted average of the recall and precision, is an important indicator for comprehensive evaluation of rooftop extraction results. These data and indicators were then used to evaluate the relationship between the rooftop extraction results and the ground truth⁵³.

Validation results

The validation results for various city tiers are presented in Table 6. The overall accuracy for all city tiers is 97.95%, while the F1 score is 83.11%. However, the overall quality of the rooftop area dataset for high-tier cities is better than that of the lower-tier cities. Based on empirical exploration, we realized that the GES image quality for various cities varied according to the imaging sensor, imaging time, and environmental factors such as the atmospheric condition and climate. These image quality differences affect the model training and generalization. The quality of GES images for high-tier cities is better, so the results of the model extraction are generally better, which explains the quality difference between the rooftop area dataset for different city tiers. Therefore, in using this dataset for other applications, an evaluation of the impact of regional differences in quality on specific applications is necessary.

Table 6 Summarized data from the evaluation of rooftop extraction results associated with different city tiers.

Full size table

In addition, we validated the quality of the rooftop area dataset for different city tiers using two types of sampling areas, as shown in Fig. 5. Each submap corresponds to a ground extent of 1 km² and the indicators for evaluating the rooftop extraction results are given below. In addition, we visualized the elements TN (True Negative), TP (True Positive), FN (False Negative) and FP (False Positive) in the confusion matrix. In this way the commission and omission errors in the extraction results can be clearly indicated by FP (in blue) and FN (in orange), respectively. It can be seen that the rooftop extraction in the built-up area is better than that in the confusing area, which explains why we use a stratified sampling strategy when collecting samples. In general, the extraction results adequately delineate characteristics of the rooftops, and these are consistent with the GES images. Our dataset also comprises small, sparsely distributed, and irregularly structured rooftops in confusing area, which highlights that the rooftop area dataset generated by the proposed framework has good performance in finding details and effectively avoids the interference of complex background information.

Examples of rooftop area dataset for different city tiers are shown in Fig. 6. The extracted rooftop areas exhibit clustering, and this is consistent with the spatial morphology and city boundaries. Therefore, the spatial distribution of rooftop areas is supportive for understanding the urbanization level and urban planning needs of cities. For example, Shanghai is a high-density mega-city in China, and its frank and vast plains and numerous waterways and ports provide advantages for the establishment of its polycentric urban spatial system. The concept of intensive development has also resulted in a more compact urban space in Shanghai. However, the eastern and northern parts of Harbin are mountainous and hilly, and the expansion of the city is limited by natural conditions, so the urban space is loose and the urban area is mainly distributed in the central and western plains.

Specific details on the rooftop area dataset for different city tiers are displayed in Fig. 7 using Beijing, Nanjing, Jilin, and Hengshui as examples. Compared with the GES images, the rooftop extraction results display significant details for various cities, with rooftops and complex backgrounds effectively distinguished. Moreover, dense and sparse spatial distribution areas are accurately extracted. In fact, the extraction results exhibit no sign of the stitched problem because of the application of the expansion prediction.

Meanwhile, Fig. 7 reveals that the GES image used in the present study was not adequately orthorectified, and this partially accounts for the offset in positions between the extracted rooftop area and the ground truth. This offset in positions is significant (approximately 20 m based on empirical survey) for a few high-rise buildings, but it is negligible for low- and medium-rise buildings. Therefore, if the rooftop area data are used for a major city- or country-scale estimation, such position offsets will minimally impact the results. Nevertheless, the rooftop extraction results obtained in the present study provide high-quality details suitable for supporting many architecture-oriented applications.

The empirical evidence of our study in 90 Chinese cities shows that the proposed method can be quickly generalized at a large scale and shows strong robustness in regions with different characteristics. When it is to be extended to regions outside of China, we suggest fine-tuning the existing model by adding new feature samples, to enhance its applicability in the target region.

Code availability

The procedure of spatial sampling is executed in the ArcGIS Pro platform. The code of the deep learning model is available at https://github.com/ChanceQZ/RoofTopSegmatation. The program is described by Python3, packages of which are Pytroch, Numpy, and OpenCV mainly.

References

Biljecki, F., Heuvelink, G. B., Ledoux, H. & Stoter, J. Propagation of positional error in 3D GIS: estimation of the solar irradiation of building roofs. International Journal of Geographical Information Science 29, 2269–2294 (2015).
Article Google Scholar
Nouvel, R., Zirak, M., Coors, V. & Eicker, U. The influence of data quality on urban heating demand modeling using 3D city models. Computers, Environment and Urban Systems 64, 68–80 (2017).
Article Google Scholar
Chen, Y., Tang, L., Yang, X., Bilal, M. & Li, Q. Object-based multi-modal convolution neural networks for building extraction using panchromatic and multispectral imagery. Neurocomputing 386, 136–146 (2020).
Article Google Scholar
Biljecki, F., Arroyo Ohori, K., Ledoux, H., Peters, R. & Stoter, J. Population estimation using a 3D city model: A multi-scale country-wide study in the Netherlands. PloS one 11, e0156808 (2016).
Article Google Scholar
Acuto, M., Parnell, S. & Seto, K. C. Building a global urban science. Nature Sustainability 1, 2–4 (2018).
Article Google Scholar
Hu, Q., Zhen, L., Mao, Y., Zhou, X. & Zhou, G. Automated building extraction using satellite remote sensing imagery. Automation in Construction 123, 103509 (2021).
Article Google Scholar
Assouline, D., Mohajeri, N. & Scartezzini, J. L. Quantifying rooftop photovoltaic solar energy potential: A machine learning approach. Solar Energy 141, 278–296 (2017).
Article ADS Google Scholar
Assouline, D., Mohajeri, N. & Scartezzini, J. L. Large-scale rooftop solar photovoltaic technical potential estimation using Random Forests. Applied energy 217, 189–211 (2018).
Article Google Scholar
Dehwah, A. H. & Asif, M. Assessment of net energy contribution to buildings by rooftop photovoltaic systems in hot-humid climates. Renewable energy 131, 1288–1299 (2019).
Article Google Scholar
Chen, Y., Hong, T. & Piette, M. A. Automatic generation and simulation of urban building energy models based on city datasets for city-scale building retrofit analysis. Applied Energy 205, 323–335 (2017).
Article Google Scholar
Mohajeri, N. et al. A city-scale roof shape classification using machine learning for solar energy applications. Renewable Energy 121, 81–93 (2018).
Article Google Scholar
Shepero, M., Munkhammar, J., Widén, J., Bishop, J. D. & Boström, T. Modeling of photovoltaic power generation and electric vehicles charging on city-scale: A review. Renewable and Sustainable Energy Reviews 89, 61–71 (2018).
Article Google Scholar
Chen, D., Wang, R. & Peethambaran, J. Topologically aware building rooftop reconstruction from airborne laser scanning point clouds. IEEE Transactions on Geoscience and Remote Sensing 55, 7032–7052 (2017).
Article ADS Google Scholar
Huang, J., Zhang, X., Xin, Q., Sun, Y. & Zhang, P. Automatic building extraction from high-resolution aerial images and LiDAR data using gated residual refinement network. ISPRS journal of photogrammetry and remote sensing 151, 91–105 (2019).
Article ADS Google Scholar
Gooding, J., Crook, R. & Tomlin, A. S. Modelling of roof geometries from low-resolution LiDAR data for city-scale solar energy applications using a neighbouring buildings method. Applied Energy 148, 93–104 (2015).
Article Google Scholar
Poli, D., Remondino, F., Angiuli, E. & Agugiaro, G. Radiometric and geometric evaluation of GeoEye-1, WorldView-2 and Pléiades-1A stereo images for 3D information extraction. ISPRS Journal of Photogrammetry and Remote Sensing 100, 35–47 (2015).
Article ADS Google Scholar
Kabir, M. H., Endlicher, W. & Jägermeyr, J. Calculation of bright roof-tops for solar PV applications in Dhaka Megacity, Bangladesh. Renewable Energy 35, 1760–1764 (2010).
Article CAS Google Scholar
Hazelhoff, L. In Visual Information Processing and Communication II, 7882, 788208 (International Society for Optics and Photonics, 2011).
Taylor, J. R. & Lovell, S. T. Mapping public and private spaces of urban agriculture in Chicago through the analysis of high-resolution aerial images in Google Earth. Landscape and urban planning 108, 57–70 (2012).
Article Google Scholar
Yu, L. & Gong, P. Google Earth as a virtual globe tool for Earth science applications at the global scale: progress and perspectives. International Journal of Remote Sensing 33, 3966–3986 (2012).
Article ADS Google Scholar
Liang, J., Gong, J. & Li, W. Applications and impacts of Google Earth: A decadal review (2006–2016). ISPRS Journal of Photogrammetry and Remote Sensing 146, 91–107 (2018).
Article ADS Google Scholar
Ye, Y. et al. Measuring daily accessed street greenery: A human-scale approach for informing better urban planning practices. Landscape and Urban Planning 191, 103434 (2019).
Article Google Scholar
Chen, L. C., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. In Proceedings of the European conference on computer vision (ECCV). 801–818 (2018).
Chen, G. et al. Symmetrical dense-shortcut deep fully convolutional networks for semantic segmentation of very-high-resolution remote sensing images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 11, 1633–1644 (2018).
Article ADS Google Scholar
Long, J., Shelhamer, E. & Darrell, T. in Proceedings of the IEEE conference on computer vision and pattern recognition. 3431–3440 (2015).
Noh, H., Hong, S. & Han, B. in Proceedings of the IEEE international conference on computer vision. 1520–1528 (2015).
Li, P. et al. Understanding rooftop PV panel semantic segmentation of satellite and aerial images for better using machine learning. Advances in Applied Energy 4, 100057 (2021).
Article Google Scholar
Zhong, T. et al. A city-scale estimation of rooftop solar photovoltaic potential based on deep learning. Applied Energy 298, 117132 (2021).
Article Google Scholar
Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. Journal of Big Data 6, 1–48 (2019).
Article Google Scholar
Wang, Y., Yao, Q., Kwok, J. T. & Ni, L. M. Generalizing from a few examples: A survey on few-shot learning. ACM Computing Surveys (CSUR) 53, 1–34 (2020).
Google Scholar
Chen, B. et al. Stable classification with limited sample: Transferring a 30-m resolution sample set collected in 2015 to mapping 10-m resolution global land cover in 2017. Sci. Bull 64, 370–373 (2019).
Article Google Scholar
Ministry of Housing and Urban-Rural Development of the People’s Republic of China. China urban-rural construction statistical yearbook http://www.mohurd.gov.cn/ (2019).
National Bureau of Statistics of the People’s Republic of China. Bulletin of the seventh national census http://www.stats.gov.cn/tjsj/tjgb/rkpcgb/ (2021).
Peng, G. & Lin, Z. A study on gamma correction and its implementation in image processing. Electronic engineer 2 (2006).
Pizer, S. M. et al. Adaptive histogram equalization and its variations. Computer vision, graphics, and image processing 39, 355–368 (1987).
Article Google Scholar
Chen, L. C., Papandreou, G., Schroff, F. & Adam, H. Rethinking atrous convolution for semantic image segmentation. arXiv 2017. arXiv preprint arXiv:1706.05587 (2019).
Xiong, Y., Liu, J. & Kim, J. Understanding differences in thermal comfort between urban and rural residents in hot summer and cold winter climate. Building and environment 165, 106393 (2019).
Article Google Scholar
Burdziakowski, P. A Novel Method for the Deblurring of Photogrammetric Images Using Conditional Generative Adversarial Networks. Remote Sensing 12, 2586 (2020).
Article ADS Google Scholar
Du, S., Du, S., Liu, B. & Zhang, X. Incorporating DeepLabv3+ and object-based image analysis for semantic segmentation of very high resolution remote sensing images. International Journal of Digital Earth 14, 357–378 (2021).
Article ADS Google Scholar
Boudiaf, M. et al. In European Conference on Computer Vision. 548–564 (Springer, 2020).
Milletari, F., Navab, N. & Ahmadi, S.-A. In 2016 fourth international conference on 3D vision (3DV). 565–571 (IEEE, 2016).
Huang, B. et al. In IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium. 6947–6950 (IEEE, 2018).
Lin, T. Y. et al. In European conference on computer vision. 740–755 (Springer, 2014).
Chollet, F. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1251–1258 (2017).
Loshchilov, I. & Hutter, F. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017).
Loshchilov, I. & Hutter, F. Sgdr: Stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983 (2016).
Paszke, A. et al. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems 32, 8026–8037 (2019).
Google Scholar
Huang, G. et al. Snapshot ensembles: Train 1, get m for free. arXiv preprint arXiv:1704.00109 (2017).
Nanjing Normal University Lab of Smart City Sensing and Simulation. Vectorized rooftop area data for 90 cities in China. National Tibetan Plateau Data Center https://doi.org/10.11888/Geogra.tpdc.271702 (2021).
Environmental Systems Research Institute. ESRI Shapefile Technical Description, an ESRI white paper. (1998).
Saalfeld, A. Topologically consistent line simplification with the Douglas-Peucker algorithm. Cartography and Geographic Information Science 26, 7–18 (1999).
Article Google Scholar
Hay, A. The derivation of global estimates from a confusion matrix. International Journal of Remote Sensing 9, 1395–1398 (1988).
Article ADS Google Scholar
Deng, X., Liu, Q., Deng, Y. & Mahadevan, S. An improved method to construct basic probability assignment based on the confusion matrix for classification problem. Information Sciences 340, 250–261 (2016).
Article Google Scholar

Download references

Acknowledgements

We appreciate the detailed suggestions and comments from the anonymous reviewers. We express heartfelt thanks to the other members of the Smart City Sensing and Simulation lab, who undertook data collection and annotation work. This work was supported by the National Natural Science Foundation (Grant 41930648), Joint Fund Project of National Natural Science Foundation of China (Grant U1811464), and National Research Foundation Singapore. The data of this work is licensed and hosted by National Tibetan Plateau Data Center.

Author information

Authors and Affiliations

Key Laboratory of Virtual Geographic Environment (Ministry of Education of PRC), Nanjing Normal University, Nanjing, 210023, China
Zhixin Zhang, Zhen Qian, Teng Zhong, Min Chen, Kai Zhang, Yue Yang, Fangzhuo Zhou, Jianing Yu, Bingyue Zhang & Guonian Lü
State Key Laboratory Cultivation Base of Geographical Environment Evolution, Nanjing, 210023, China
Zhixin Zhang, Zhen Qian, Teng Zhong, Min Chen, Kai Zhang, Yue Yang, Fangzhuo Zhou, Jianing Yu, Bingyue Zhang & Guonian Lü
Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, 210023, China
Zhixin Zhang, Zhen Qian, Teng Zhong, Min Chen, Kai Zhang, Yue Yang, Fangzhuo Zhou, Jianing Yu, Bingyue Zhang & Guonian Lü
Jiangsu Provincial Key Laboratory for NSLSCS, School of Mathematical Science, Nanjing Normal University, Nanjing, 210023, China
Min Chen
Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China
Rui Zhu
Senseable City Lab, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
Fan Zhang
Center for Spatial Information Science, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8568, Japan
Haoran Zhang
LocationMind Inc, 3-5-2 Iwamotocho, Chiyoda-ku, Tokyo, 101-0032, Japan
Haoran Zhang
Future Energy Center, Malardalen University, 72123, Vasteras, Sweden
Haoran Zhang & Jinyue Yan
Department of Chemical Engineering, KTH Royal Institute of Technology, Stockholm, 10044, Sweden
Jinyue Yan

Authors

Zhixin Zhang
View author publications
Search author on:PubMed Google Scholar
Zhen Qian
View author publications
Search author on:PubMed Google Scholar
Teng Zhong
View author publications
Search author on:PubMed Google Scholar
Min Chen
View author publications
Search author on:PubMed Google Scholar
Kai Zhang
View author publications
Search author on:PubMed Google Scholar
Yue Yang
View author publications
Search author on:PubMed Google Scholar
Rui Zhu
View author publications
Search author on:PubMed Google Scholar
Fan Zhang
View author publications
Search author on:PubMed Google Scholar
Haoran Zhang
View author publications
Search author on:PubMed Google Scholar
Fangzhuo Zhou
View author publications
Search author on:PubMed Google Scholar
Jianing Yu
View author publications
Search author on:PubMed Google Scholar
Bingyue Zhang
View author publications
Search author on:PubMed Google Scholar
Guonian Lü
View author publications
Search author on:PubMed Google Scholar
Jinyue Yan
View author publications
Search author on:PubMed Google Scholar

Contributions

Z.X.Z. investigated study area and related work, designed the geospatial stratified sampling algorithm, collected and processed GES image and wrote manuscript. Z.Q. designed, trained and applied rooftop extraction model, developed parallel computing programs, simplified rooftop area dataset, and wrote manuscript. T.Z. assisted with validation and quality control and reviewed manuscript. M.C. designed and condensed experimental ideas, guided experiments work, reviewed manuscript, and provided project funding support. K.Z. collected and processed GES image, and published rooftop area dataset. Y.Y. converted rooftop area dataset from raster format to vector format and runed data preprocessing procedure using a supercomputer platform. R.Z., F.Z. and H.R.Z. provided technical guidance and reviewed manuscript. F.Z.Z, J.N.Y and B.Y.Z processed and quality controlled the GES image. G.N.L. and J.Y.Y. designed and condensed experimental ideas and reviewed manuscript.

Corresponding author

Correspondence to Min Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Zhang, Z., Qian, Z., Zhong, T. et al. Vectorized rooftop area data for 90 cities in China. Sci Data 9, 66 (2022). https://doi.org/10.1038/s41597-022-01168-x

Download citation

Received: 01 October 2021
Accepted: 17 January 2022
Published: 02 March 2022
DOI: https://doi.org/10.1038/s41597-022-01168-x

This article is cited by

Analysis of the impact mechanisms and driving factors of urban spatial morphology on urban heat islands
- Caiyi Huang
- Ke Liu
- Lin Li
Scientific Reports (2025)
Vectorized building rooftop prints of the Qinghai-Tibetan Plateau and its neighboring regions
- Tao Ye
- Hongyu Shan
- Lizheng Wu
Scientific Data (2025)
Unveiling deployable rooftop solar potential across Chinese cities
- Mai Shi
- Xi Lu
- Michael T. Craig
Nature Cities (2025)
CMAB: A Multi-Attribute Building Dataset of China
- Yecheng Zhang
- Huimin Zhao
- Ying Long
Scientific Data (2025)
Exploring the Impact of Park and Surrounding Environments on Violent Crime in Wuhan: An XGBoost-SHAP Approach
- Sainan Lin
- Shudi Chen
- Yao Yao
Applied Spatial Analysis and Policy (2025)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Data collection

GES imagery

FROM-GLC30

Site selection

Data preparation

GES imagery preprocessing

Spatial stratified sampling strategy

Sample processing and division

Modelling

Naive rooftop extraction model

Expansion prediction

Model training

Data Records

Data file path

Data file format

File structure

Technical Validation

Sampling design

Analysis design

Validation results

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links