Abstract
Identifying road surface types is crucial for road maintenance and socio-economic development. Crowdsourced street view data, with global coverage and free access, is a valuable source for this task. However, deep learning models typically require large, manually annotated datasets, which are lacking for road surface classification. To address this, three experts initially labeled 200,000 images from the Mapillary crowdsourced street view dataset, categorizing them into “Paved,” “Unpaved,” and “Unknown” road types through a voting mechanism. Five deep learning models were then trained on this expert-labeled dataset to assess the usefulness of the labels for further analysis. The results show that all models achieved accuracy, precision, recall, and F1 scores above 0.90, with the Swin Transformer model reaching the highest accuracy of 0.924. When applied to six 10 × 10 km sub-regions in Africa, the best model exceeded 0.95 in most performance metrics. These findings demonstrate the potential of this dataset to enhance road surface identification and support infrastructure improvement in Africa.
Similar content being viewed by others
Background & Summary
Road surface type impacts driving comfort1,2,3, traffic safety4, and socio-economic development5,6. For example, improvements in road surface types have been shown to significantly boost the rural economic transformation in the Java-Bali region5. Some studies have also found a positive correlation between road condition improvements and GDP growth in sub-Saharan Africa6. Road surface type has also been included by the United Nations as one of the indicators for assessing the progress of the Sustainable Development Goals (SDGs). Specifically, SDG 9.1.1 refers to the “proportion of rural population living within 2 kilometers of a road that is passable all year-round”7,8,9. Generally, paved roads ensure year-round accessibility, while unpaved roads may be subject to seasonal disruptions due to adverse weather conditions8. Therefore, identifying road surface types is crucial for improving transportation infrastructure9.
Various techniques have been developed for identifying road surface types. For instance, Tai et al. proposed a motorcycle-based pavement evaluation system that utilizes a smartphone’s triaxial accelerometer, employing both supervised and unsupervised machine learning methods to detect road anomalies and assess pavement quality10. Similarly, Takahashi et al. developed a bicycle-based road condition monitoring system, which classifies road surface signals captured by smartphones into categories such as damage or flatness11. Menegazzo et al. developed a CNN-based deep neural network model that uses accelerometer and gyroscope data from different vehicles, drivers, and environments to classify road surfaces into categories such as asphalt and dirt roads12. Lee et al. introduced a real-time pavement classification algorithm that uses deep neural networks and triaxial accelerometers to distinguish between asphalt, gravel, and unpaved roads13. Al-Mansour also developed a low-cost pavement maintenance management system, using inexpensive sensors embedded in smartphones to classify road surfaces as “Good,” “Fair,” “Poor,” or “Very Poor”14.
Remote sensing technologies have also been widely used to identify road surface types. Dobson et al. used aerial images collected from remote-controlled helicopters or manned fixed-wing aircraft to create three-dimensional models of road segments, detecting potholes and cracks in the pavement15. Brooks et al. introduced an object-based classification algorithm that uses optical aerial imagery with a resolution of 0.3 meters to classify road surfaces as either paved or unpaved16. Thiede et al. used high-resolution remote sensing imagery (0.5 meters) to identify unpaved roads in South Africa’s Gauteng and Northwest provinces17. Brewer et al. employed a convolutional neural network (CNN) algorithm to predict road surface types using 0.3-meter resolution remote sensing images18. Recently, Zhou et al. proposed a method combining OpenStreetMap data with high-resolution satellite imagery, leveraging the VGG-16 model to identify paved and unpaved roads. They also developed the first dataset of road surface types in Kenya, Africa19.
Despite these advancements, there are still some limitations in existing studies. First, using mobile devices mounted on vehicles to identify road surface types is a time-consuming and labor-intensive task, as this method can only collect road surface data for individual roads. While high-resolution remote sensing data (e.g., with a resolution greater than 1 meter) can be used to identify road surface types over large-scale road networks, such as at the national or regional level, vegetation in densely covered areas may obstruct the view, reducing the accuracy of surface type identification.
In recent years, open street view data (e.g., from Mapillary20 and OpenStreetCam21) has become increasingly available. These crowdsourced street views, contributed by volunteers from different countries and regions22,23, have proven to be valuable for identifying road surface types. Thanks to their open access and global coverage, crowdsourced street view data is considered one of the most important data sources for this task. For example, Lee et al. proposed an AdapNet-based semantic segmentation model that can segment road pixels from Mapillary crowdsourced street view and classify them as either paved or unpaved roads24. Despite this progress, deep learning models for road surface type identification require large training datasets. To the best of our knowledge, there is currently a lack of publicly available datasets specifically designed for identifying road surface types from crowdsourced street views.
Therefore, this study aims to develop the first comprehensive Mapillary crowdsourced street view dataset that includes a large number of samples labeled with road surface types. The main contributions of this study are as follows:
-
1)
The first dataset containing 200,000 Mapillary street images has been developed, all of which were collected from Africa. These images are classified into three categories: ‘Paved,’ ‘Unpaved,’ and ‘Unknown.’
-
2)
This study demonstrates the effectiveness of deep learning models trained on the aforementioned dataset. It also shows that the models can accurately classify road surface types in six sub-regions of Africa, with classification metrics all exceeding 0.95.
This study focuses on the African region due to the generally low rate of road paving compared to other parts of the world25. For example, an existing study has reported that 70% of roads in sub-Saharan Africa remain unpaved26, resulting in poor transportation and inefficient trade, which in turn hinders local socio-economic development. Therefore, our classified street view dataset may provide valuable data support for improving transportation infrastructure in Africa.
Methods
Overview
This study developed a dataset consisting of 200,000 street view images classified by different road surface types. The main steps involved are as follows:
Firstly, Africa was divided into grids of a certain size, and Mapillary street view images along with their location data were randomly selected from each grid as candidates. Then, road data for Africa was obtained and matched with the street view data, filtering out those street views that corresponded to roads and included road surface type labels. Next, street view images with different road surface types were randomly and evenly sampled from the candidate set, and visually reclassified to produce the final dataset. Finally, deep learning models were trained based on the street view labeled dataset, and the trained models were applied to different sub-regions of Africa to validate the effectiveness and usability of the dataset. The workflow is shown in Fig. 1.
Data collection
Mapillary street view
In this study, 3,440 grids of 100 km × 100 km were created across Africa. The Mapillary API (https://www.mapillary.com/developer/api-documentation) was then used to randomly retrieve street view images from each grid along with their corresponding location information (represented by latitude and longitude coordinates), resulting in a total of 907,963 street view images.
The main advantages of using Mapillary street view data include: 1) The data is freely accessible. 2) In addition to street view images, Mapillary provides location information for each image, which facilitates the visualization of the spatial distribution of road surface types. 3) Mapillary data has global coverage20, making it suitable for large-scale studies at national and regional levels. In contrast, Google Street View data is only available in certain countries or regions.
Road data
This study utilized OpenStreetMap (OSM) road data, specifically for roads that are accessible to motor vehicles (as shown in Table 1). The data can be obtained from the third-party platform at (http://download.geofabrik.de/index.html). OSM data has the advantages of global coverage and open accessibility27,28. More importantly, OSM road data includes tags (e.g., surface = paved) that record the road surface type, providing a reference for classifying road surface types in Mapillary street view images.
Data processing
Data matching
In theory, all Mapillary street view images could be used to create the dataset. However, some images may not have been taken on roads, and even for those captured on roadways, the corresponding OSM road data may either lack surface type tags or contain erroneous entries due to mislabeling (e.g., surface = ???, yes, or 7387.92 m2). Therefore, the OSM road data were first filtered to retain only road segments with clearly defined surface type tags. Then, for each retrieved street view image, the distance to the nearest road was calculated. If the distance was less than a certain threshold (10 meters), the street view image was considered to be on a road. Otherwise, the image was considered to be off-road. We set the threshold at 10 meters because existing studies suggest that the positioning accuracy of most mobile phones is around 10 meters29. As a result, 709,463 Mapillary street view images were obtained from Africa, all of which were associated with roads that included surface type labels.
Data sampling
From the matched 709,463 Mapillary street view images, we randomly and evenly sampled images representing various road surface types. After reviewing these images, the followings were observed: (1) Paved roads in Africa are predominantly asphalt, while unpaved roads are mostly dirt roads; (2) Some street view images, due to low resolution or because they were not taken on roads, could not be visually interpreted for their surface type. Therefore, these street view images were classified into three categories: “Paved,” “Unpaved,” and “Unknown.” The specific steps are as follows:
First, the tags in the OSM road data were reclassified into two categories: “Paved” and “Unpaved” (as shown in Table 2). Then, 100,000 street view images were randomly selected from the matched “Paved” roads and 100,000 images from the matched “Unpaved” roads, for a total of 200,000 images.
Next, recognizing that surface type tags in the OSM road data may contain errors, three experts with backgrounds in GIS and transportation were assigned to visually interpret and classify the 200,000 extracted street view images into three categories: “Paved,” “Unpaved,” and “Unknown.” To validate the accuracy of visual interpretation, the Fleiss’ Kappa coefficient was employed for inter-rater consistency assessment30. The calculated value of 0.995 indicates a high level of agreement among the three experts. Lastly, the final classification label for each street view image was determined using a majority voting mechanism. Table 3 and Fig. 2 present the number of street view images in each category after visual classification, along with representative examples of each category.
Examples to illustrate the classification (including ‘Paved’, ‘Unpaved’ and ‘Unknown’) of street views (Data source: https://www.mapillary.com).
Data usability evaluation
Although the accuracy of our dataset has been verified using the Fleiss’ Kappa coefficient, it is necessary to adjust the usability of this dataset. Therefore, deep learning models were further employed to assess the usability of this dataset for road surface classification. As a result, this study considers two scenarios:
-
1)
Scenario 1: Training different deep learning models using the developed dataset of 200,000 street view images and validating the model’s performance. Several classic deep learning models were employed, including VGG-1631, ResNet-5032, Swin Transformer33, Yolo v734, and ConvNeXt35. These models have been widely used in image recognition, object detection, and other fields, and generally outperform traditional machine learning models, such as decision trees and random forests, in classification tasks36,37,38,39,40,41. Besides, existing studies have shown that these models can perform better than some traditional convolutional neural network models (e.g., AlexNet and GoogleNet) on datasets like ImageNet42,43.
For each model, we randomly sampled 5,500 images per class (‘Paved’, ‘Unpaved’, ‘Unknown’), yielding 16,500 total images. These were split into 80% training and 20% validation sets. All models were implemented in PyTorch and trained on a workstation equipped with an NVIDIA P100 GPU and 16 GB of RAM. After extensive tuning, we set the learning rate to 0.0001, batch size to 64, and number of epochs to 100. Model performance was then evaluated using accuracy, precision, recall and F1-score44,45.
-
2)
Scenario 2: Applying the trained models to sub-regions to validate their usability.
To further validate the usability of the street view dataset, the optimal model from Scenario 1 was applied to six 10 × 10 km road network sub-regions to identify the road surface types in each sub-region (as shown in Fig. 3). For each sub-region, all Mapillary street view images and their corresponding location data were first retrieved. Then, the optimal model from Scenario 1 was used to classify the road surface types of all street view images within each sub-region. The classification accuracy for each sub-region was validated using metrics such as accuracy, precision, recall, and F1-score.
The selection of these six sub-regions is based on their location in different countries and regions across Africa, covering both urban (Fig. 3a,c,d) and rural (Fig. 3b,e,f) road network patterns. This selection helps minimize subjectivity that may arise from focusing on a single type of study area. More importantly, these sub-regions provide a relatively rich set of Mapillary street view images.
Data Records
This dataset includes Mapillary street view images and corresponding location data from Africa46. The street view images are saved in JPG format, with each image assigned a unique ID number, totaling 200,000 images. The location data is represented as point vector data in Esri Shapefile format, where each point includes: a unique street view ID, longitude, latitude, and road surface type (as shown in Table 4), with a total of 200,000 points. All data are projected using the World Geodetic System (WGS) 84 and the pseudo-Mercator coordinate system (EPSG: 3857) (Fig. 4). The dataset is publicly accessible on Figshare, which is released under a CC-BY 4.0 license. Both the dataset and metadata can be retrieved from the corresponding link46. Additionally, all data were checked against their descriptions to ensure consistency and accuracy. This verification was performed to ensure that the data aligns with the provided metadata and classification details.
Technical Validation
Results evaluation
Results and analyses of Scenario 1
Table 5 presents a comparison of the accuracy in road surface type classification of Mapillary street view data using four different models.
As shown in Table 5, the accuracy, precision, and recall for identifying various road surface types from Mapillary street view images using different models all exceed 0.9, with F1 scores above 0.91. This indicates that the models trained on the developed Mapillary street view dataset perform well overall. The consistently high performance across multiple models demonstrates the usability of the developed dataset. The highest accuracy was achieved using the Swin Transformer model, with a value of 0.924, slightly outperforming the other four models (ResNet-50, VGG-16, Yolo v7, and ConvNeXt).
Results and analyses of Scenario 2
The results from Scenario 1 indicate that the Swin Transformer model achieved the highest accuracy. Therefore, in Scenario 2, the Swin Transformer was applied to classify all street view images from the six sub-regions. The classification accuracy for each sub-region is shown in Table 6. Figure 5 presents the classification results of road surface types for each sub-region.
As shown in Table 6, when applying the Swin Transformer model trained in Scenario 1 to the road networks of six sub-regions in Africa, the accuracy, precision, recall, and F1 scores generally exceed 0.95. This suggests that the deep learning models trained on the street view dataset developed in this study can be applied to other regions of Africa. Notably, the performance metrics for Sub-region VI even exceed 0.99, likely because this sub-region contains only unpaved roads (Fig. 5f), leading to fewer classification errors.
Limitations
This study has some limitations as follows:
-
1)
Model selection: Firstly, only five classic deep learning models were selected for this study because existing studies have shown that these models perform better than some traditional convolutional neural network models (e.g., AlexNet and GoogleNet) on datasets like ImageNet42,43. Our experiments also confirmed that the accuracy, precision, and recall using these models all exceed 0.9. However, as artificial intelligence is continuously evolving and new models are emerging, future research could explore whether incorporating modules such as attention mechanisms or introducing other models can further improve the classification accuracy of road surface types.
-
2)
Road surface type categories: Secondly, this study classified road surface types into just three categories: “Paved,” “Unpaved,” and “Unknown.” In reality, road surface types may be more diverse, including asphalt, concrete, and dirt roads, among others. However, by analyzing the 200,000 street view images, we observed that paved roads in Africa are predominantly asphalt, while unpaved roads are mostly dirt roads, with very few other types of surfaces. Therefore, this dataset can be effectively used for road surface type classification of Mapillary street view images in most regions of Africa. Besides, the “Unknown” category in our dataset could assist users in identifying and filtering low-quality street view data. Nonetheless, road surface types may differ across countries and regions. Thus, future research could consider adding street view images of other road surface types to enhance the dataset’s applicability.
-
3)
Geographical regions: Thirdly, the street view data used in this study were all sourced from Africa. Since road surface types can vary significantly across regions (such as in Asia or Europe), the models trained on this dataset may not be directly applicable to other regions of the world. Nevertheless, the methodology developed in this study could be applied to create street view datasets for road surface type classification in other regions globally.
-
4)
Mapillary data: The Mapillary street view data has inherent limitations, including gaps in coverage, particularly in rural or remote areas, as well as variability in image quality due to factors such as resolution, lighting, and capture angle. Furthermore, the data tends to be biased toward urban areas where street view images are more frequently captured, which may result in certain road types being underrepresented47. Nevertheless, the crowdsourced nature of Mapillary allows for continuous updates, which could improve data quality and expand coverage in future studies.
Despite these limitations, our study offers several valuable contributions. First, a generalizable methodology for creating road surface type datasets from crowdsourced street view imagery has been proposed. By leveraging Mapillary data, this approach can be adapted to other regions worldwide and extended to additional platforms such as Google Street View48 and OpenStreetCam21, thereby broadening its applicability. Second, a comprehensive dataset of 200,000 African street view images labeled by road surface type has been developed. These images serve as rich training samples for deep learning models, facilitating automated classification of road surfaces across diverse African environments. Third, the dataset can support assessment of Sustainable Development Goal 9.1.1, which measures the proportion of the rural population living within 2 km of all-season roads. National-and regional-scale road surface maps generated from this dataset (as shown in Fig. 5) can provide crucial indicators for monitoring progress toward this goal. Last but not least, the dataset can be employed to evaluate and enhance OpenStreetMap (OSM) road surface tags. By comparing “Paved” and “Unpaved” labels with existing OSM tags, inconsistencies can be identified. Furthermore, deep learning models trained on these labeled images can automatically infer and supplement missing or erroneous surface tags in OSM, improving overall data quality.
Code availability
The python scripts used for model training are available online through GitHub respository: https://github.com/leishelaoshi/RoadSurfaceType.git.
References
Chen, S., Saeed, T. U. & Labi, S. Impact of road-surface condition on rural highway safety: A multivariate random parameters negative binomial approach. Anal. Methods Accid. Res. 16, 75–89, https://doi.org/10.1016/j.amar.2017.09.001 (2017).
Lee, J., Nam, B. & Abdel-Aty, M. Effects of pavement surface conditions on traffic crash severity. J. Transp. Eng. 141, 04015020, https://doi.org/10.1061/(ASCE)TE.1943-5436.0000785 (2015).
Sari, Y. & Yudhistira, M. H. Bad light, bad road, or bad luck? The associations of road lighting and road surface quality on road crash severities in Indonesia. Case Stud. Transp. Policy 9, 1407–1417, https://doi.org/10.1016/j.cstp.2021.07.005 (2021).
Pérez-Fortes, A. P. & Giudici, H. A recent overview of the effect of road surface properties on road safety, environment, and how to monitor them. Environ. Sci. Pollut. Res. 29, 65993–66009, https://doi.org/10.1007/s11356-022-21847-x (2022).
Kamaludin, A. S. & Qibthiyyah, R. M. Village road quality and accessibility on transforming rural development. AGRARIS: J. Agribusiness Rural Dev. Res. 8, 160–180, https://doi.org/10.18196/agraris.v8i2.13618 (2022).
Berg, C.N., Blankespoor, B. & Selod, H. Roads and rural development in Sub-Saharan Africa. In The Transformation of Rural Africa 80–100. https://doi.org/10.4324/9780429450365-5 (Routledge, 2020).
Mikou, M., Rozenberg, J., Koks, E.E., Fox, C.J.E. & Peralta Quiros, T. Assessing rural accessibility and rural roads investment needs using open source data. World Bank Policy Res. Working Paper. https://doi.org/10.1596/1813-9450-8746 (2019).
Workman, R. & McPherson, K. Measuring rural access for SDG 9.1.1. Trans. GIS 25, 721–734, https://doi.org/10.1111/tgis.12721 (2021).
Li, W., Zhou, Q., Zhang, Y. & Chen, Y. Visualising rural access index and not served rural population in Africa. Environ. Plann. A: Econ. Space 54, 215–218, https://doi.org/10.1177/0308518x211035786 (2022).
Tai, Y., Chan, C. & Hsu, J.Y. Automatic road anomaly detection using smart mobile device. In Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, Hsinchu, Taiwan (2010).
Takahashi, J., Kobana, Y., Tobe, Y. & Lopez, G. Classification of steps on road surface using acceleration signals. EAI Endorsed Trans. Cogn. Commun. 1, 229–234, https://doi.org/10.4108/eai.22-7-2015.2260293 (2015).
Menegazzo, J. & Von Wangenheim, A. Road surface type classification based on inertial sensors and machine learning: A comparison between classical and deep machine learning approaches for multi-contextual real-world scenarios. Computing 103, 2143–2170, https://doi.org/10.1007/s00607-021-00914-0 (2021).
Lee, D., Kim, J.-C., Kim, M. & Lee, H. Intelligent tire sensor-based real-time road surface classification using an artificial neural network. Sensors 21, 3233, https://doi.org/10.3390/s21093233 (2021).
Al-Mansour, A. I. & Al-Qaili, A. H. An application of Android sensors and Google Earth in pavement maintenance management systems for developing countries. Appl. Sci. 12, 5636, https://doi.org/10.3390/app12115636 (2022).
Dobson, R. J. et al. Collecting decision support system data through remote sensing of unpaved roads. Transp. Res. Rec. 2433, 108–115, https://doi.org/10.3141/2433-12 (2014).
Brooks, C. N. et al. Identification of unpaved roads in a regional road network using remote sensing. Photogramm. Eng. Remote Sens. 83, 377, https://doi.org/10.14358/PERS.83.5.377 (2017).
Thiede, R. N., Fabris-Rotelli, I. N., Debba, P., Lib, M. & Stein, A. Uncertainty quantification for the extraction of informal roads from remote sensing images of South Africa. South Afr. Geogr. J. 102, 249–272, https://doi.org/10.1080/03736245.2019.1685404 (2020).
Brewer, E., Lin, J., Kemper, P., Hennin, J. & Runfola, D. Predicting road quality using high resolution satellite imagery: A transfer learning approach. PLOS ONE 16, e0253370, https://doi.org/10.1371/journal.pone.0253370 (2021).
Zhou, Q., Liu, Z. & Huang, Z. Mapping road surface type of Kenya using OpenStreetMap and high-resolution Google satellite imagery. Sci. Data 11, 331, https://doi.org/10.1038/s41597-024-03158-7 (2024).
Ma, D. et al. The state of Mapillary: An exploratory analysis. ISPRS Int. J. Geo-Inf. 9, 10, https://doi.org/10.3390/ijgi9010010 (2019).
Alvarez Leon, L. F. & Quinn, S. The value of crowdsourced street-level imagery: Examining the shifting property regimes of OpenStreetCam and Mapillary. GeoJournal 84, 395–414, https://doi.org/10.1007/s10708-018-9865-4 (2019).
Solís, P., McCusker, B., Menkiti, N., Cowan, N. & Blevins, C. Engaging global youth in participatory spatial data creation for the UN Sustainable Development Goals: The case of open mapping for malaria prevention. Appl. Geogr. 98, 143–155, https://doi.org/10.1016/j.apgeog.2018.07.013 (2018).
Quinn, S. & Alvarez León, L. Every single street? Rethinking full coverage across street-level imagery platforms. Trans. GIS 23, 1251–1272, https://doi.org/10.1111/tgis.12571 (2019).
Lee, D., et al. Paved and unpaved road segmentation using deep neural network. In Pattern Recognition: ACPR 2019 Workshops (pp. 20–28). https://doi.org/10.1007/978-981-15-3651-9_3 Auckland, New Zealand: Springer. (2020).
Peden, M., Kobusingye, O. & Monono, M. E. Africa’s roads—the deadliest in the world. South Afr. Med. J. 103, 228–229, https://doi.org/10.7196/samj.6866 (2013).
Greening, T. & O’Neill, P. Traffic generated dust from unpaved roads: An overview of impacts and options for control. In 1st AFCAP Practitioners Conference (2010).
Barrington-Leigh, C. & Millard-Ball, A. The world’s user-generated road map is more than 80% complete. PLOS ONE 12, e0180698, https://doi.org/10.1371/journal.pone.0180698 (2017).
Seto, T. & Nishimura, Y. Analysis of the spatiotemporal accumulation process of Mapillary data and its relationship with OSM road data: A case study in Japan. ISPRS Arch. Photogrammetry Remote Sens. Spatial Inf. Sci. 48, 403–410, https://doi.org/10.5194/isprs-archives-XLVIII-4-W1-2022-403-2022 (2022).
Astarita, V. et al. A mobile application for road surface quality control: UNIquALroad. Procedia-Soc. Behav. Sci. 54, 1135–1144, https://doi.org/10.1016/j.sbspro.2012.09.828 (2012).
Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychol. Bull. 76, 378, https://doi.org/10.1037/h0031619 (1971).
Tammina, S. Transfer learning using VGG-16 with deep convolutional neural network for classifying images. Int. J. Sci. Res. Publ. https://doi.org/10.29322/IJSRP.9.10.2019.p9420 (2019).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778. https://doi.org/10.1109/CVPR.2016.90 (2016).
Liu, Z., et al Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 10012–10022 (2021).https://doi.org/10.1109/ICCV48922.2021.00986
Wang, C.-Y., Bochkovskiy, A. & Liao, H.-Y. M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7464–7475. https://doi.org/10.1109/CVPR52729.2023.00721 (2023).
Liu, Z., et al A ConvNet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11976–11986. https://doi.org/10.1109/CVPR52688.2022.01167 (2022).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations, 1–14 (2015).
Li, W. et al. A perspective survey on deep transfer learning for fault diagnosis in industrial scenarios: Theories, applications and challenges. Mech. Syst. Signal Process. 167, 108487, https://doi.org/10.1016/j.ymssp.2021.108487 (2022).
Deng, J., et al ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255. https://doi.org/10.1109/CVPR.2009.5206848 (2009).
Kang, J., Körner, M., Wang, Y., Taubenböck, H. & Zhu, X. X. Building instance classification using street view images. ISPRS J. Photogramm. Remote Sens. 145, 44–59, https://doi.org/10.1016/j.isprsjprs.2018.02.006 (2018).
Hoffmann, E. J., Wang, Y., Werner, M., Kang, J. & Zhu, X. X. Model fusion for building type classification from aerial and street view images. Remote Sensing 11, 1259, https://doi.org/10.3390/rs11111259 (2019).
Saeed, N., Nyberg, R. G. & Alam, M. Gravel road classification based on loose gravel using transfer learning. Int. J. Pavement Eng. 24, 2138879, https://doi.org/10.1080/10298436.2022.2138879 (2023).
Yuan, L., et al. Tokens-to-token ViT: Training vision transformers from scratch on ImageNet. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 558–567. https://doi.org/10.1109/ICCV48922.2021.00060 (2021).
Maurício, J., Domingues, I. & Bernardino, J. Comparing vision transformers and convolutional neural networks for image classification: A literature review. Appl. Sci. 13, 5521, https://doi.org/10.3390/app13095521 (2023).
Wang, Y., Gan, W., Yang, J., Wu, W. & Yan, J. Dynamic curriculum learning for imbalanced data classification. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 5017–5026. https://doi.org/10.1109/ICCV.2019.00512 (2019).
Shahinfar, S., Meek, P. & Falzon, G. “How many images do I need?” Understanding how sample size per class affects deep learning model performance metrics for balanced designs in autonomous wildlife monitoring. Ecological Informatics 57, 101085, https://doi.org/10.1016/j.ecoinf.2020.101085 (2020).
Zhou, Q., Duan, J., Qiao, J., Liu, Z. & Yang, H. A Large Crowdsourced Street View Dataset for Mapping Road Surface Types in Africa. Figshare https://doi.org/10.6084/m9.figshare.27719577 (2025).
Mahabir, R. et al. Crowdsourcing street view imagery: A comparison of Mapillary and OpenStreetCam. ISPRS Int. J. Geo-Inf. 9, 341, https://doi.org/10.3390/ijgi9060341 (2020).
Anguelov, D. et al. Google street view: Capturing the world at street level. Computer 43, 32–38, https://doi.org/10.1109/MC.2010.170 (2010).
Acknowledgements
The project was supported by the National Natural Science Foundation of China (Grant No.42471492; No.42371478) and the National Key R&D Program of China (Grant No.2021YFB390050).
Author information
Authors and Affiliations
Contributions
Qi Zhou: Conceptualization, Writing-review&editing; Jiayuan Duan: Data collection, Data curation, Quality Assessment, Writing-original draft; Jie Qiao: Data collection, Data curation, Quality Assessment; Zixian Liu: Writing-review&editing; Hong Yang: Project Administration.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhou, Q., Duan, J., Qiao, J. et al. A Large Crowdsourced Street View Dataset for Mapping Road Surface Types in Africa. Sci Data 12, 1003 (2025). https://doi.org/10.1038/s41597-025-05153-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-025-05153-y