A Large Crowdsourced Street View Dataset for Mapping Road Surface Types in Africa

Zhou, Qi; Duan, Jiayuan; Qiao, Jie; Liu, Zixian; Yang, Hong

doi:10.1038/s41597-025-05153-y

Download PDF

Data Descriptor
Open access
Published: 13 June 2025

A Large Crowdsourced Street View Dataset for Mapping Road Surface Types in Africa

Qi Zhou ORCID: orcid.org/0000-0001-7177-1820¹^na1,
Jiayuan Duan¹^na1,
Jie Qiao¹,
Zixian Liu¹ &
…
Hong Yang^2,3

Scientific Data volume 12, Article number: 1003 (2025) Cite this article

2122 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

Identifying road surface types is crucial for road maintenance and socio-economic development. Crowdsourced street view data, with global coverage and free access, is a valuable source for this task. However, deep learning models typically require large, manually annotated datasets, which are lacking for road surface classification. To address this, three experts initially labeled 200,000 images from the Mapillary crowdsourced street view dataset, categorizing them into “Paved,” “Unpaved,” and “Unknown” road types through a voting mechanism. Five deep learning models were then trained on this expert-labeled dataset to assess the usefulness of the labels for further analysis. The results show that all models achieved accuracy, precision, recall, and F1 scores above 0.90, with the Swin Transformer model reaching the highest accuracy of 0.924. When applied to six 10 × 10 km sub-regions in Africa, the best model exceeded 0.95 in most performance metrics. These findings demonstrate the potential of this dataset to enhance road surface identification and support infrastructure improvement in Africa.

StreetSurfaceVis: a dataset of crowdsourced street-level imagery annotated by road surface type and quality

Article Open access 16 January 2025

Mapping Road Surface Type of Kenya Using OpenStreetMap and High-resolution Google Satellite Imagery

Article Open access 03 April 2024

Self-supervised and semi-supervised learning for road condition estimation from distributed road-side cameras

Article Open access 26 December 2022

Background & Summary

Road surface type impacts driving comfort^1,2,3, traffic safety⁴, and socio-economic development^5,6. For example, improvements in road surface types have been shown to significantly boost the rural economic transformation in the Java-Bali region⁵. Some studies have also found a positive correlation between road condition improvements and GDP growth in sub-Saharan Africa⁶. Road surface type has also been included by the United Nations as one of the indicators for assessing the progress of the Sustainable Development Goals (SDGs). Specifically, SDG 9.1.1 refers to the “proportion of rural population living within 2 kilometers of a road that is passable all year-round”^7,8,9. Generally, paved roads ensure year-round accessibility, while unpaved roads may be subject to seasonal disruptions due to adverse weather conditions⁸. Therefore, identifying road surface types is crucial for improving transportation infrastructure⁹.

Various techniques have been developed for identifying road surface types. For instance, Tai et al. proposed a motorcycle-based pavement evaluation system that utilizes a smartphone’s triaxial accelerometer, employing both supervised and unsupervised machine learning methods to detect road anomalies and assess pavement quality¹⁰. Similarly, Takahashi et al. developed a bicycle-based road condition monitoring system, which classifies road surface signals captured by smartphones into categories such as damage or flatness¹¹. Menegazzo et al. developed a CNN-based deep neural network model that uses accelerometer and gyroscope data from different vehicles, drivers, and environments to classify road surfaces into categories such as asphalt and dirt roads¹². Lee et al. introduced a real-time pavement classification algorithm that uses deep neural networks and triaxial accelerometers to distinguish between asphalt, gravel, and unpaved roads¹³. Al-Mansour also developed a low-cost pavement maintenance management system, using inexpensive sensors embedded in smartphones to classify road surfaces as “Good,” “Fair,” “Poor,” or “Very Poor”¹⁴.

Remote sensing technologies have also been widely used to identify road surface types. Dobson et al. used aerial images collected from remote-controlled helicopters or manned fixed-wing aircraft to create three-dimensional models of road segments, detecting potholes and cracks in the pavement¹⁵. Brooks et al. introduced an object-based classification algorithm that uses optical aerial imagery with a resolution of 0.3 meters to classify road surfaces as either paved or unpaved¹⁶. Thiede et al. used high-resolution remote sensing imagery (0.5 meters) to identify unpaved roads in South Africa’s Gauteng and Northwest provinces¹⁷. Brewer et al. employed a convolutional neural network (CNN) algorithm to predict road surface types using 0.3-meter resolution remote sensing images¹⁸. Recently, Zhou et al. proposed a method combining OpenStreetMap data with high-resolution satellite imagery, leveraging the VGG-16 model to identify paved and unpaved roads. They also developed the first dataset of road surface types in Kenya, Africa¹⁹.

Despite these advancements, there are still some limitations in existing studies. First, using mobile devices mounted on vehicles to identify road surface types is a time-consuming and labor-intensive task, as this method can only collect road surface data for individual roads. While high-resolution remote sensing data (e.g., with a resolution greater than 1 meter) can be used to identify road surface types over large-scale road networks, such as at the national or regional level, vegetation in densely covered areas may obstruct the view, reducing the accuracy of surface type identification.

In recent years, open street view data (e.g., from Mapillary²⁰ and OpenStreetCam²¹) has become increasingly available. These crowdsourced street views, contributed by volunteers from different countries and regions^22,23, have proven to be valuable for identifying road surface types. Thanks to their open access and global coverage, crowdsourced street view data is considered one of the most important data sources for this task. For example, Lee et al. proposed an AdapNet-based semantic segmentation model that can segment road pixels from Mapillary crowdsourced street view and classify them as either paved or unpaved roads²⁴. Despite this progress, deep learning models for road surface type identification require large training datasets. To the best of our knowledge, there is currently a lack of publicly available datasets specifically designed for identifying road surface types from crowdsourced street views.

Therefore, this study aims to develop the first comprehensive Mapillary crowdsourced street view dataset that includes a large number of samples labeled with road surface types. The main contributions of this study are as follows:

1)
The first dataset containing 200,000 Mapillary street images has been developed, all of which were collected from Africa. These images are classified into three categories: ‘Paved,’ ‘Unpaved,’ and ‘Unknown.’
2)
This study demonstrates the effectiveness of deep learning models trained on the aforementioned dataset. It also shows that the models can accurately classify road surface types in six sub-regions of Africa, with classification metrics all exceeding 0.95.

This study focuses on the African region due to the generally low rate of road paving compared to other parts of the world²⁵. For example, an existing study has reported that 70% of roads in sub-Saharan Africa remain unpaved²⁶, resulting in poor transportation and inefficient trade, which in turn hinders local socio-economic development. Therefore, our classified street view dataset may provide valuable data support for improving transportation infrastructure in Africa.

Methods

Overview

This study developed a dataset consisting of 200,000 street view images classified by different road surface types. The main steps involved are as follows:

Firstly, Africa was divided into grids of a certain size, and Mapillary street view images along with their location data were randomly selected from each grid as candidates. Then, road data for Africa was obtained and matched with the street view data, filtering out those street views that corresponded to roads and included road surface type labels. Next, street view images with different road surface types were randomly and evenly sampled from the candidate set, and visually reclassified to produce the final dataset. Finally, deep learning models were trained based on the street view labeled dataset, and the trained models were applied to different sub-regions of Africa to validate the effectiveness and usability of the dataset. The workflow is shown in Fig. 1.

Data collection

Mapillary street view

In this study, 3,440 grids of 100 km × 100 km were created across Africa. The Mapillary API (https://www.mapillary.com/developer/api-documentation) was then used to randomly retrieve street view images from each grid along with their corresponding location information (represented by latitude and longitude coordinates), resulting in a total of 907,963 street view images.

The main advantages of using Mapillary street view data include: 1) The data is freely accessible. 2) In addition to street view images, Mapillary provides location information for each image, which facilitates the visualization of the spatial distribution of road surface types. 3) Mapillary data has global coverage²⁰, making it suitable for large-scale studies at national and regional levels. In contrast, Google Street View data is only available in certain countries or regions.

Road data

This study utilized OpenStreetMap (OSM) road data, specifically for roads that are accessible to motor vehicles (as shown in Table 1). The data can be obtained from the third-party platform at (http://download.geofabrik.de/index.html). OSM data has the advantages of global coverage and open accessibility^27,28. More importantly, OSM road data includes tags (e.g., surface = paved) that record the road surface type, providing a reference for classifying road surface types in Mapillary street view images.

Table 1 Road classifications based on four-wheeled vehicle passability.

Full size table

Data processing

Data matching

In theory, all Mapillary street view images could be used to create the dataset. However, some images may not have been taken on roads, and even for those captured on roadways, the corresponding OSM road data may either lack surface type tags or contain erroneous entries due to mislabeling (e.g., surface = ???, yes, or 7387.92 m²). Therefore, the OSM road data were first filtered to retain only road segments with clearly defined surface type tags. Then, for each retrieved street view image, the distance to the nearest road was calculated. If the distance was less than a certain threshold (10 meters), the street view image was considered to be on a road. Otherwise, the image was considered to be off-road. We set the threshold at 10 meters because existing studies suggest that the positioning accuracy of most mobile phones is around 10 meters²⁹. As a result, 709,463 Mapillary street view images were obtained from Africa, all of which were associated with roads that included surface type labels.

Data sampling

From the matched 709,463 Mapillary street view images, we randomly and evenly sampled images representing various road surface types. After reviewing these images, the followings were observed: (1) Paved roads in Africa are predominantly asphalt, while unpaved roads are mostly dirt roads; (2) Some street view images, due to low resolution or because they were not taken on roads, could not be visually interpreted for their surface type. Therefore, these street view images were classified into three categories: “Paved,” “Unpaved,” and “Unknown.” The specific steps are as follows:

First, the tags in the OSM road data were reclassified into two categories: “Paved” and “Unpaved” (as shown in Table 2). Then, 100,000 street view images were randomly selected from the matched “Paved” roads and 100,000 images from the matched “Unpaved” roads, for a total of 200,000 images.

Table 2 Reclassification of OSM road surface tags.

Full size table

Next, recognizing that surface type tags in the OSM road data may contain errors, three experts with backgrounds in GIS and transportation were assigned to visually interpret and classify the 200,000 extracted street view images into three categories: “Paved,” “Unpaved,” and “Unknown.” To validate the accuracy of visual interpretation, the Fleiss’ Kappa coefficient was employed for inter-rater consistency assessment³⁰. The calculated value of 0.995 indicates a high level of agreement among the three experts. Lastly, the final classification label for each street view image was determined using a majority voting mechanism. Table 3 and Fig. 2 present the number of street view images in each category after visual classification, along with representative examples of each category.

Table 3 Distribution of Mapillary Street View Images in Each Category After Visual Interpretation.

Full size table

Data usability evaluation

Although the accuracy of our dataset has been verified using the Fleiss’ Kappa coefficient, it is necessary to adjust the usability of this dataset. Therefore, deep learning models were further employed to assess the usability of this dataset for road surface classification. As a result, this study considers two scenarios:

1)
Scenario 1: Training different deep learning models using the developed dataset of 200,000 street view images and validating the model’s performance. Several classic deep learning models were employed, including VGG-16³¹, ResNet-50³², Swin Transformer³³, Yolo v7³⁴, and ConvNeXt³⁵. These models have been widely used in image recognition, object detection, and other fields, and generally outperform traditional machine learning models, such as decision trees and random forests, in classification tasks^{36,37,38,39,40,41}. Besides, existing studies have shown that these models can perform better than some traditional convolutional neural network models (e.g., AlexNet and GoogleNet) on datasets like ImageNet^42,43.

For each model, we randomly sampled 5,500 images per class (‘Paved’, ‘Unpaved’, ‘Unknown’), yielding 16,500 total images. These were split into 80% training and 20% validation sets. All models were implemented in PyTorch and trained on a workstation equipped with an NVIDIA P100 GPU and 16 GB of RAM. After extensive tuning, we set the learning rate to 0.0001, batch size to 64, and number of epochs to 100. Model performance was then evaluated using accuracy, precision, recall and F1-score^44,45.
2)
Scenario 2: Applying the trained models to sub-regions to validate their usability.

To further validate the usability of the street view dataset, the optimal model from Scenario 1 was applied to six 10 × 10 km road network sub-regions to identify the road surface types in each sub-region (as shown in Fig. 3). For each sub-region, all Mapillary street view images and their corresponding location data were first retrieved. Then, the optimal model from Scenario 1 was used to classify the road surface types of all street view images within each sub-region. The classification accuracy for each sub-region was validated using metrics such as accuracy, precision, recall, and F1-score.

The selection of these six sub-regions is based on their location in different countries and regions across Africa, covering both urban (Fig. 3a,c,d) and rural (Fig. 3b,e,f) road network patterns. This selection helps minimize subjectivity that may arise from focusing on a single type of study area. More importantly, these sub-regions provide a relatively rich set of Mapillary street view images.

Data Records

This dataset includes Mapillary street view images and corresponding location data from Africa⁴⁶. The street view images are saved in JPG format, with each image assigned a unique ID number, totaling 200,000 images. The location data is represented as point vector data in Esri Shapefile format, where each point includes: a unique street view ID, longitude, latitude, and road surface type (as shown in Table 4), with a total of 200,000 points. All data are projected using the World Geodetic System (WGS) 84 and the pseudo-Mercator coordinate system (EPSG: 3857) (Fig. 4). The dataset is publicly accessible on Figshare, which is released under a CC-BY 4.0 license. Both the dataset and metadata can be retrieved from the corresponding link⁴⁶. Additionally, all data were checked against their descriptions to ensure consistency and accuracy. This verification was performed to ensure that the data aligns with the provided metadata and classification details.

Table 4 The fields of our dataset.

Full size table

Technical Validation

Results evaluation

Results and analyses of Scenario 1

Table 5 presents a comparison of the accuracy in road surface type classification of Mapillary street view data using four different models.

Table 5 Comparison of accuracy across different models.

Full size table

As shown in Table 5, the accuracy, precision, and recall for identifying various road surface types from Mapillary street view images using different models all exceed 0.9, with F1 scores above 0.91. This indicates that the models trained on the developed Mapillary street view dataset perform well overall. The consistently high performance across multiple models demonstrates the usability of the developed dataset. The highest accuracy was achieved using the Swin Transformer model, with a value of 0.924, slightly outperforming the other four models (ResNet-50, VGG-16, Yolo v7, and ConvNeXt).

Results and analyses of Scenario 2

The results from Scenario 1 indicate that the Swin Transformer model achieved the highest accuracy. Therefore, in Scenario 2, the Swin Transformer was applied to classify all street view images from the six sub-regions. The classification accuracy for each sub-region is shown in Table 6. Figure 5 presents the classification results of road surface types for each sub-region.

Table 6 Evaluation results of applying the best model to the six tested regions (I, II, III, IV, V, VI).

Full size table

As shown in Table 6, when applying the Swin Transformer model trained in Scenario 1 to the road networks of six sub-regions in Africa, the accuracy, precision, recall, and F1 scores generally exceed 0.95. This suggests that the deep learning models trained on the street view dataset developed in this study can be applied to other regions of Africa. Notably, the performance metrics for Sub-region VI even exceed 0.99, likely because this sub-region contains only unpaved roads (Fig. 5f), leading to fewer classification errors.

Limitations

This study has some limitations as follows:

1)
Model selection: Firstly, only five classic deep learning models were selected for this study because existing studies have shown that these models perform better than some traditional convolutional neural network models (e.g., AlexNet and GoogleNet) on datasets like ImageNet^42,43. Our experiments also confirmed that the accuracy, precision, and recall using these models all exceed 0.9. However, as artificial intelligence is continuously evolving and new models are emerging, future research could explore whether incorporating modules such as attention mechanisms or introducing other models can further improve the classification accuracy of road surface types.
2)
Road surface type categories: Secondly, this study classified road surface types into just three categories: “Paved,” “Unpaved,” and “Unknown.” In reality, road surface types may be more diverse, including asphalt, concrete, and dirt roads, among others. However, by analyzing the 200,000 street view images, we observed that paved roads in Africa are predominantly asphalt, while unpaved roads are mostly dirt roads, with very few other types of surfaces. Therefore, this dataset can be effectively used for road surface type classification of Mapillary street view images in most regions of Africa. Besides, the “Unknown” category in our dataset could assist users in identifying and filtering low-quality street view data. Nonetheless, road surface types may differ across countries and regions. Thus, future research could consider adding street view images of other road surface types to enhance the dataset’s applicability.
3)
Geographical regions: Thirdly, the street view data used in this study were all sourced from Africa. Since road surface types can vary significantly across regions (such as in Asia or Europe), the models trained on this dataset may not be directly applicable to other regions of the world. Nevertheless, the methodology developed in this study could be applied to create street view datasets for road surface type classification in other regions globally.
4)
Mapillary data: The Mapillary street view data has inherent limitations, including gaps in coverage, particularly in rural or remote areas, as well as variability in image quality due to factors such as resolution, lighting, and capture angle. Furthermore, the data tends to be biased toward urban areas where street view images are more frequently captured, which may result in certain road types being underrepresented⁴⁷. Nevertheless, the crowdsourced nature of Mapillary allows for continuous updates, which could improve data quality and expand coverage in future studies.

Despite these limitations, our study offers several valuable contributions. First, a generalizable methodology for creating road surface type datasets from crowdsourced street view imagery has been proposed. By leveraging Mapillary data, this approach can be adapted to other regions worldwide and extended to additional platforms such as Google Street View⁴⁸ and OpenStreetCam²¹, thereby broadening its applicability. Second, a comprehensive dataset of 200,000 African street view images labeled by road surface type has been developed. These images serve as rich training samples for deep learning models, facilitating automated classification of road surfaces across diverse African environments. Third, the dataset can support assessment of Sustainable Development Goal 9.1.1, which measures the proportion of the rural population living within 2 km of all-season roads. National-and regional-scale road surface maps generated from this dataset (as shown in Fig. 5) can provide crucial indicators for monitoring progress toward this goal. Last but not least, the dataset can be employed to evaluate and enhance OpenStreetMap (OSM) road surface tags. By comparing “Paved” and “Unpaved” labels with existing OSM tags, inconsistencies can be identified. Furthermore, deep learning models trained on these labeled images can automatically infer and supplement missing or erroneous surface tags in OSM, improving overall data quality.

Code availability

The python scripts used for model training are available online through GitHub respository: https://github.com/leishelaoshi/RoadSurfaceType.git.

References

Chen, S., Saeed, T. U. & Labi, S. Impact of road-surface condition on rural highway safety: A multivariate random parameters negative binomial approach. Anal. Methods Accid. Res. 16, 75–89, https://doi.org/10.1016/j.amar.2017.09.001 (2017).
Article Google Scholar
Lee, J., Nam, B. & Abdel-Aty, M. Effects of pavement surface conditions on traffic crash severity. J. Transp. Eng. 141, 04015020, https://doi.org/10.1061/(ASCE)TE.1943-5436.0000785 (2015).
Article Google Scholar
Sari, Y. & Yudhistira, M. H. Bad light, bad road, or bad luck? The associations of road lighting and road surface quality on road crash severities in Indonesia. Case Stud. Transp. Policy 9, 1407–1417, https://doi.org/10.1016/j.cstp.2021.07.005 (2021).
Article Google Scholar
Pérez-Fortes, A. P. & Giudici, H. A recent overview of the effect of road surface properties on road safety, environment, and how to monitor them. Environ. Sci. Pollut. Res. 29, 65993–66009, https://doi.org/10.1007/s11356-022-21847-x (2022).
Article Google Scholar
Kamaludin, A. S. & Qibthiyyah, R. M. Village road quality and accessibility on transforming rural development. AGRARIS: J. Agribusiness Rural Dev. Res. 8, 160–180, https://doi.org/10.18196/agraris.v8i2.13618 (2022).
Article Google Scholar
Berg, C.N., Blankespoor, B. & Selod, H. Roads and rural development in Sub-Saharan Africa. In The Transformation of Rural Africa 80–100. https://doi.org/10.4324/9780429450365-5 (Routledge, 2020).
Mikou, M., Rozenberg, J., Koks, E.E., Fox, C.J.E. & Peralta Quiros, T. Assessing rural accessibility and rural roads investment needs using open source data. World Bank Policy Res. Working Paper. https://doi.org/10.1596/1813-9450-8746 (2019).
Workman, R. & McPherson, K. Measuring rural access for SDG 9.1.1. Trans. GIS 25, 721–734, https://doi.org/10.1111/tgis.12721 (2021).
Article Google Scholar
Li, W., Zhou, Q., Zhang, Y. & Chen, Y. Visualising rural access index and not served rural population in Africa. Environ. Plann. A: Econ. Space 54, 215–218, https://doi.org/10.1177/0308518x211035786 (2022).
Article Google Scholar
Tai, Y., Chan, C. & Hsu, J.Y. Automatic road anomaly detection using smart mobile device. In Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, Hsinchu, Taiwan (2010).
Takahashi, J., Kobana, Y., Tobe, Y. & Lopez, G. Classification of steps on road surface using acceleration signals. EAI Endorsed Trans. Cogn. Commun. 1, 229–234, https://doi.org/10.4108/eai.22-7-2015.2260293 (2015).
Article Google Scholar
Menegazzo, J. & Von Wangenheim, A. Road surface type classification based on inertial sensors and machine learning: A comparison between classical and deep machine learning approaches for multi-contextual real-world scenarios. Computing 103, 2143–2170, https://doi.org/10.1007/s00607-021-00914-0 (2021).
Article MathSciNet Google Scholar
Lee, D., Kim, J.-C., Kim, M. & Lee, H. Intelligent tire sensor-based real-time road surface classification using an artificial neural network. Sensors 21, 3233, https://doi.org/10.3390/s21093233 (2021).
Article ADS PubMed PubMed Central Google Scholar
Al-Mansour, A. I. & Al-Qaili, A. H. An application of Android sensors and Google Earth in pavement maintenance management systems for developing countries. Appl. Sci. 12, 5636, https://doi.org/10.3390/app12115636 (2022).
Article CAS Google Scholar
Dobson, R. J. et al. Collecting decision support system data through remote sensing of unpaved roads. Transp. Res. Rec. 2433, 108–115, https://doi.org/10.3141/2433-12 (2014).
Article Google Scholar
Brooks, C. N. et al. Identification of unpaved roads in a regional road network using remote sensing. Photogramm. Eng. Remote Sens. 83, 377, https://doi.org/10.14358/PERS.83.5.377 (2017).
Article Google Scholar
Thiede, R. N., Fabris-Rotelli, I. N., Debba, P., Lib, M. & Stein, A. Uncertainty quantification for the extraction of informal roads from remote sensing images of South Africa. South Afr. Geogr. J. 102, 249–272, https://doi.org/10.1080/03736245.2019.1685404 (2020).
Article Google Scholar
Brewer, E., Lin, J., Kemper, P., Hennin, J. & Runfola, D. Predicting road quality using high resolution satellite imagery: A transfer learning approach. PLOS ONE 16, e0253370, https://doi.org/10.1371/journal.pone.0253370 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Q., Liu, Z. & Huang, Z. Mapping road surface type of Kenya using OpenStreetMap and high-resolution Google satellite imagery. Sci. Data 11, 331, https://doi.org/10.1038/s41597-024-03158-7 (2024).
Article PubMed PubMed Central Google Scholar
Ma, D. et al. The state of Mapillary: An exploratory analysis. ISPRS Int. J. Geo-Inf. 9, 10, https://doi.org/10.3390/ijgi9010010 (2019).
Article Google Scholar
Alvarez Leon, L. F. & Quinn, S. The value of crowdsourced street-level imagery: Examining the shifting property regimes of OpenStreetCam and Mapillary. GeoJournal 84, 395–414, https://doi.org/10.1007/s10708-018-9865-4 (2019).
Article Google Scholar
Solís, P., McCusker, B., Menkiti, N., Cowan, N. & Blevins, C. Engaging global youth in participatory spatial data creation for the UN Sustainable Development Goals: The case of open mapping for malaria prevention. Appl. Geogr. 98, 143–155, https://doi.org/10.1016/j.apgeog.2018.07.013 (2018).
Article Google Scholar
Quinn, S. & Alvarez León, L. Every single street? Rethinking full coverage across street-level imagery platforms. Trans. GIS 23, 1251–1272, https://doi.org/10.1111/tgis.12571 (2019).
Article Google Scholar
Lee, D., et al. Paved and unpaved road segmentation using deep neural network. In Pattern Recognition: ACPR 2019 Workshops (pp. 20–28). https://doi.org/10.1007/978-981-15-3651-9_3 Auckland, New Zealand: Springer. (2020).
Peden, M., Kobusingye, O. & Monono, M. E. Africa’s roads—the deadliest in the world. South Afr. Med. J. 103, 228–229, https://doi.org/10.7196/samj.6866 (2013).
Article CAS Google Scholar
Greening, T. & O’Neill, P. Traffic generated dust from unpaved roads: An overview of impacts and options for control. In 1st AFCAP Practitioners Conference (2010).
Barrington-Leigh, C. & Millard-Ball, A. The world’s user-generated road map is more than 80% complete. PLOS ONE 12, e0180698, https://doi.org/10.1371/journal.pone.0180698 (2017).
Article CAS PubMed PubMed Central Google Scholar
Seto, T. & Nishimura, Y. Analysis of the spatiotemporal accumulation process of Mapillary data and its relationship with OSM road data: A case study in Japan. ISPRS Arch. Photogrammetry Remote Sens. Spatial Inf. Sci. 48, 403–410, https://doi.org/10.5194/isprs-archives-XLVIII-4-W1-2022-403-2022 (2022).
Article Google Scholar
Astarita, V. et al. A mobile application for road surface quality control: UNIquALroad. Procedia-Soc. Behav. Sci. 54, 1135–1144, https://doi.org/10.1016/j.sbspro.2012.09.828 (2012).
Article Google Scholar
Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychol. Bull. 76, 378, https://doi.org/10.1037/h0031619 (1971).
Article Google Scholar
Tammina, S. Transfer learning using VGG-16 with deep convolutional neural network for classifying images. Int. J. Sci. Res. Publ. https://doi.org/10.29322/IJSRP.9.10.2019.p9420 (2019).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778. https://doi.org/10.1109/CVPR.2016.90 (2016).
Liu, Z., et al Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 10012–10022 (2021).https://doi.org/10.1109/ICCV48922.2021.00986
Wang, C.-Y., Bochkovskiy, A. & Liao, H.-Y. M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7464–7475. https://doi.org/10.1109/CVPR52729.2023.00721 (2023).
Liu, Z., et al A ConvNet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11976–11986. https://doi.org/10.1109/CVPR52688.2022.01167 (2022).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations, 1–14 (2015).
Li, W. et al. A perspective survey on deep transfer learning for fault diagnosis in industrial scenarios: Theories, applications and challenges. Mech. Syst. Signal Process. 167, 108487, https://doi.org/10.1016/j.ymssp.2021.108487 (2022).
Article ADS Google Scholar
Deng, J., et al ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255. https://doi.org/10.1109/CVPR.2009.5206848 (2009).
Kang, J., Körner, M., Wang, Y., Taubenböck, H. & Zhu, X. X. Building instance classification using street view images. ISPRS J. Photogramm. Remote Sens. 145, 44–59, https://doi.org/10.1016/j.isprsjprs.2018.02.006 (2018).
Article ADS Google Scholar
Hoffmann, E. J., Wang, Y., Werner, M., Kang, J. & Zhu, X. X. Model fusion for building type classification from aerial and street view images. Remote Sensing 11, 1259, https://doi.org/10.3390/rs11111259 (2019).
Article ADS Google Scholar
Saeed, N., Nyberg, R. G. & Alam, M. Gravel road classification based on loose gravel using transfer learning. Int. J. Pavement Eng. 24, 2138879, https://doi.org/10.1080/10298436.2022.2138879 (2023).
Article Google Scholar
Yuan, L., et al. Tokens-to-token ViT: Training vision transformers from scratch on ImageNet. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 558–567. https://doi.org/10.1109/ICCV48922.2021.00060 (2021).
Maurício, J., Domingues, I. & Bernardino, J. Comparing vision transformers and convolutional neural networks for image classification: A literature review. Appl. Sci. 13, 5521, https://doi.org/10.3390/app13095521 (2023).
Article CAS Google Scholar
Wang, Y., Gan, W., Yang, J., Wu, W. & Yan, J. Dynamic curriculum learning for imbalanced data classification. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 5017–5026. https://doi.org/10.1109/ICCV.2019.00512 (2019).
Shahinfar, S., Meek, P. & Falzon, G. “How many images do I need?” Understanding how sample size per class affects deep learning model performance metrics for balanced designs in autonomous wildlife monitoring. Ecological Informatics 57, 101085, https://doi.org/10.1016/j.ecoinf.2020.101085 (2020).
Article Google Scholar
Zhou, Q., Duan, J., Qiao, J., Liu, Z. & Yang, H. A Large Crowdsourced Street View Dataset for Mapping Road Surface Types in Africa. Figshare https://doi.org/10.6084/m9.figshare.27719577 (2025).
Mahabir, R. et al. Crowdsourcing street view imagery: A comparison of Mapillary and OpenStreetCam. ISPRS Int. J. Geo-Inf. 9, 341, https://doi.org/10.3390/ijgi9060341 (2020).
Article Google Scholar
Anguelov, D. et al. Google street view: Capturing the world at street level. Computer 43, 32–38, https://doi.org/10.1109/MC.2010.170 (2010).
Article Google Scholar

Download references

Acknowledgements

The project was supported by the National Natural Science Foundation of China (Grant No.42471492; No.42371478) and the National Key R&D Program of China (Grant No.2021YFB390050).

Author information

These authors contributed equally: Qi Zhou and Jiayuan Duan.

Authors and Affiliations

School of Geography and Information Engineering, China University of Geosciences, Wuhan, 430074, P. R. China
Qi Zhou, Jiayuan Duan, Jie Qiao & Zixian Liu
School of Electronics and Information, Northwestern Polytechnical University, Xi’an, 710129, P. R. China
Hong Yang
Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, 100190, P. R. China
Hong Yang

Authors

Qi Zhou
View author publications
Search author on:PubMed Google Scholar
Jiayuan Duan
View author publications
Search author on:PubMed Google Scholar
Jie Qiao
View author publications
Search author on:PubMed Google Scholar
Zixian Liu
View author publications
Search author on:PubMed Google Scholar
Hong Yang
View author publications
Search author on:PubMed Google Scholar

Contributions

Qi Zhou: Conceptualization, Writing-review&editing; Jiayuan Duan: Data collection, Data curation, Quality Assessment, Writing-original draft; Jie Qiao: Data collection, Data curation, Quality Assessment; Zixian Liu: Writing-review&editing; Hong Yang: Project Administration.

Corresponding authors

Correspondence to Qi Zhou or Hong Yang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Response to Editor and Reviewers

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhou, Q., Duan, J., Qiao, J. et al. A Large Crowdsourced Street View Dataset for Mapping Road Surface Types in Africa. Sci Data 12, 1003 (2025). https://doi.org/10.1038/s41597-025-05153-y

Download citation

Received: 18 December 2024
Accepted: 08 May 2025
Published: 13 June 2025
DOI: https://doi.org/10.1038/s41597-025-05153-y