Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Comparative analysis of machine learning models with SHAP interpretation for causes of highway flood-damage blocking
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 13 January 2026

Comparative analysis of machine learning models with SHAP interpretation for causes of highway flood-damage blocking

  • Bin Li1,
  • Lingyi Wu2,
  • Jian Gao2,
  • Feng Yang3,
  • Yuqi Guo2 &
  • …
  • Mingyue Yan3 

Scientific Reports , Article number:  (2026) Cite this article

  • 679 Accesses

  • 1 Altmetric

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Climate sciences
  • Environmental sciences
  • Hydrology
  • Natural hazards

Abstract

Highway flood-damage blocking poses a critical threat to transportation system resilience, yet risk assessment is often limited by insufficient modeling of temporal disaster evolution, highly imbalanced data, and weak model interpretability. To address these challenges, this study proposes an integrated modeling framework that combines temporal data augmentation, machine learning, and interpretable mechanism analysis. Three data-balancing strategies—Time-series Generative Adversarial Network (TimeGAN) augmentation, undersampling, and a hybrid approach—were systematically compared to handle imbalanced temporal data. Six machine learning models (Logistic Regression, Support Vector Machine, Random Forest, Decision Tree, eXtreme Gradient Boosting, and Multilayer Perceptron) were evaluated, and SHapley Additive exPlanations (SHAP) was used to quantify factor contributions and explore nonlinear effects and interaction patterns. Results show that the Multilayer Perceptron trained on TimeGAN-augmented sequences achieved the highest performance, with an F1 score of 49.81% and PR-AUC of 49.46%. SHAP analysis identified key drivers and their threshold effects: Daily precipitation exceeding 2.8 mm, 7-day effective precipitation (EP 7) exceeding 22 mm, temperature above 21 °C, and average road-stream distance within 1 km (ARSD) above 0.15 km significantly increase the risk of highway flood-damage blocking. High temperature conditions are more likely to coincide with heavy precipitation and elevated EP 7, and their combined effects further amplify blocking risk. Factor contributions also varied across methods, reflecting SHAP’s ability to capture nonlinear effects and reveal interaction patterns, whereas linear regression mainly reflects independent linear effects. By integrating temporal generation, systematic model evaluation, and interpretable analysis, this study enhances the accuracy and reliability of highway flood-damage blocking prediction, providing quantitative guidance for flood damage prevention and resilience improvement of highway systems.

Similar content being viewed by others

Predicting spatiotemporal changes in flood prone regions using PSO-ML coupling under climate change scenarios

Article Open access 02 December 2025

Leveraging SHapley Additive exPlanations (SHAP) and fuzzy logic for efficient rainfall forecasts

Article Open access 20 October 2025

Mapping at-risk transportation infrastructure assets using statistical and machine learning methods

Article Open access 29 January 2026

Data availability

The public data that support the findings of this study are available on request from the corresponding author.

References

  1. Zheng, J. et al. Construction of an intelligent risk identification system for highway flood damage based on multimodal large models. Appl. Sci. 15, 12782 (2025).

    Google Scholar 

  2. Du, B. et al. Assessing the impact of precipitation variability on landslide hazards in urbanized regions. Int. J. Appl. Earth Obs. Geoinf. 136, 104360 (2025).

    Google Scholar 

  3. Xin, Z. et al. The relationship between geological disasters with land use change, meteorological and hydrological factors: A case study of Neijiang City in Sichuan Province. Ecol. Ind. 154, 110840 (2023).

    Google Scholar 

  4. Liu, D. et al. A new method for calculating highway blocking due to high-impact weather conditions. Nat. Hazard. 25, 493–513 (2025).

    Google Scholar 

  5. Pan, J. et al. Characterizing China’s road network development from a spatial entropy perspective. J. Transp. Geogr. 116, 103848 (2024).

    Google Scholar 

  6. Ministry of Transport of the People’s Republic of China. Statistical Communique on the Development of the Transportation Industry (2024). https://xxgk.mot.gov.cn/2020/jigou/zhghs/202506/t20250610_4170228.html (2025).

  7. Zhang, W. et al. A year marked by extreme precipitation and floods: Weather and climate extremes in 2024. Adv. Atmos. Sci. 42, 1045–1063 (2025).

    Google Scholar 

  8. Smyl, D., Ghasemzadeh, F. & Pour-Ghaz, M. Modeling water absorption in concrete and mortar with distributed damage. Constr. Build. Mater. 125, 438–449 (2016).

    Google Scholar 

  9. Kringos, N. & Scarpas, A. Raveling of asphaltic mixes due to water damage. Transp. Res. Record: J. Transp. Res. Board 1929, 79–87 (2005).

    Google Scholar 

  10. Zeng, Y., Wang, Q., Cao, J. & Liu, G. (2020) Study on Water Damage Mechanism and Emergency Restore of Fill Subgrade upon Squashy Slope Foundation in Mountain Area. IOP Conf. Ser.: Earth Environ. Sci. 455: 012181.

  11. Sun, D., Wen, H., Zhang, Y. & Xue, M. An optimal sample selection-based logistic regression model of slope physical resistance against rainfall-induced landslide. Nat Hazards 105, 1255–1279 (2021).

    Google Scholar 

  12. Mahmoud, A. A. et al. Synergizing machine learning and experimental analysis to predict post-heating compressive strength in waste concrete. Struct. Concr. 26, 2916–2950 (2025).

    Google Scholar 

  13. Li, Z. et al. Spatiotemporal assessment of water damage susceptibility in China’s road infrastructure: a machine learning and SHAP approach using social media data. J. Hydrol. 664, 134539 (2026).

    Google Scholar 

  14. Wang, L. et al. Time series prediction of reservoir bank landslide failure probability considering the spatial variability of soil properties. J. Rock Mech. Geotech. Eng. 16, 3951–3960 (2024).

    Google Scholar 

  15. Bashir, N. et al. Enhancing seismic activity classification in augmented soil gas radon time series data through computational intelligence techniques. J. Atmos. Solar Terr. Phys. 274, 106560 (2025).

    Google Scholar 

  16. EskandariNasab, M., Hamdi, S. M. & Filali Boubrahimi, S. Impacts of data preprocessing and sampling techniques on solar flare prediction from multivariate time series data of photospheric magnetic field parameters. ApJS 275, 6 (2024).

    Google Scholar 

  17. Wang, K., Yang, T., Kong, S. & Li, M. Air quality index prediction through TimeGAN data recovery and PSO-optimized VMD-deep learning framework. Appl. Soft Comput. 170, 112626 (2025).

    Google Scholar 

  18. Ali, R., Muayad, M., Mohammed, A. S. & Asteris, P. G. Analysis and prediction of the effect of Nanosilica on the compressive strength of concrete with different mix proportions and specimen sizes using various numerical approaches. Struct. Concr. 24, 4161–4184 (2022).

    Google Scholar 

  19. Zeyad, A. M. et al. Compressive strength of nano concrete materials under elevated temperatures using machine learning. Sci. Rep. 14, 24246 (2024).

    Google Scholar 

  20. Mahmoud, A. A., El-Sayed, A. A., Aboraya, A. M., Fathy, N. & Nabil, I. M. Enhancing predictive accuracy of nano-additive concrete gamma ray attenuation at high temperatures using AI-based models. Neural Comput. Applic. 37, 21833–21866 (2025).

    Google Scholar 

  21. Liu, Y. et al. Regional sustainable development strategy based on the coordination between ecology and economy: A case study of Sichuan Province China. Ecol. Indicators 134, 108445 (2022).

    Google Scholar 

  22. Department of Transportation of Sichuan Province. Statistical Bulletin on the Development of the Transportation Sector in Sichuan Province (2023). https://jtt.sc.gov.cn/jtt/c101520/2024/10/15/5f5e139f8e8c4c4397440e0805e04bd9.shtml (2024).

  23. Li, H. et al. A daily gap-free normalized difference vegetation index dataset from 1981 to 2023 in China. Sci Data 11, 527 (2024).

    Google Scholar 

  24. Bandara, C. M. M. Drainage density and effective precipitation. J. Hydrol. 21, 187–190 (1974).

    Google Scholar 

  25. China Meteorological Administration. Meteorological risk early warning levels of geological disaster induced by torrential rain. Standard QX/T 487–2019 (2019).QX/T 487—2019. (2019).

  26. Yan, Y. et al. Spatial Distribution-based Imbalanced Undersampling. IEEE Trans. Knowl. Data Eng. 35(6), 6376–6391. https://doi.org/10.1109/tkde.2022.3161537 (2022).

    Google Scholar 

  27. Chen, C., Wang, F., Wang, Z., Zhang, D. & Xiang, L. A novel flood forecasting model based on TimeGAN for data-sparse basins. Stoch. Env. Res. Risk Assess. 39, 2267–2280 (2025).

    Google Scholar 

  28. Zhang, C. & Ji, D. HAL-Net: A historical analogy learning network for adaptive and interpretable pandemic forecasting. Expert Syst. Appl. 299, 130038 (2026).

    Google Scholar 

  29. Wang, L., Xiao, M., Lv, J. & Liu, J. Analysis of influencing factors of traffic accidents on urban ring road based on the SVM model optimized by Bayesian method. PLoS ONE 19, e0310044 (2024).

    Google Scholar 

  30. Bai, J. et al. Multinomial random forest. Pattern Recogn. 122, 108331 (2022).

    Google Scholar 

  31. He, Z., Wu, Z., Xu, G., Liu, Y. & Zou, Q. Decision Tree for Sequences. IEEE Trans. Knowl. Data Eng. 35(1), 251–263. https://doi.org/10.1109/tkde.2021.3075023 (2021).

    Google Scholar 

  32. Yan, X. et al. Driving risk prediction of urban arterial and collector roads using multi-dimensional real-time data. Eng. Appl. Artif. Intell. 138, 109386 (2024).

    Google Scholar 

  33. Betkier, I. Estimating travel time in transport network with a combined multi-attributed graph convolutional neural network and multilayer perceptron model. Eng. Appl. Artif. Intell. 142, 109898 (2025).

    Google Scholar 

  34. Wang, X., Jin, Y., Schmitt, S. & Olhofer, M. Recent Advances in Bayesian Optimization. ACM Comput. Surv. 55, 1–36 (2023).

    Google Scholar 

  35. Fujiwara, K. Knowledge distillation with resampling for imbalanced data classification: Enhancing predictive performance and explainability stability. Results Eng. 24, 103406 (2024).

    Google Scholar 

  36. López-Oriona, Á. & Vilar, J. A. The bootstrap for testing the equality of two multivariate time series with an application to financial markets. Inf. Sci. 616, 255–275 (2022).

    Google Scholar 

  37. Fathy, I. N., Dahish, H. A., Alkharisi, M. K., Mahmoud, A. A. & Fouad, H. E. E. Predicting the compressive strength of concrete incorporating waste powders exposed to elevated temperatures utilizing machine learning. Sci. Rep. 15, 25275 (2025).

    Google Scholar 

  38. Pradhan, S., Toll, D. G., Rosser, N. J. & Brain, M. J. An investigation of the combined effect of rainfall and road cut on landsliding. Eng. Geol. 307, 106787 (2022).

    Google Scholar 

  39. Ashland, F. X. Critical shallow and deep hydrologic conditions associated with widespread landslides during a series of storms between February and April 2018 in Pittsburgh and vicinity, western Pennsylvania, USA. Landslides https://doi.org/10.1007/s10346-021-01665-x (2021)

  40. Sun, W., Li, J., Yu, R., Li, N. & Zhang, Y. Exploring changes of precipitation extremes under climate change through global variable-resolution modeling. Science Bulletin 69, 237–247 (2024).

    Google Scholar 

  41. Ran, Q. et al. The relative importance of antecedent soil moisture and precipitation in flood generation in the middle and lower Yangtze River basin. Hydrol. Earth Syst. Sci. 26, 4919–4931 (2022).

    Google Scholar 

  42. Ye, S. et al. From rainfall to runoff: The role of soil moisture in a mountainous catchment. J. Hydrol. 625, 130060 (2023).

    Google Scholar 

  43. Zhang, L., Zhang, X., Gao, S. & Gu, X. Revealing nonlinear relationships and thresholds of human activities and climate change on ecosystem services in Anhui Province based on the XGBoost–SHAP model. Sustainability 17, 8728 (2025).

    Google Scholar 

Download references

Funding

This research was generously funded by the National Key R&D Program of China (Grant No. 2024YFB4303100) on Integrated Technology Application for Autonomous Transportation Systems.

Author information

Authors and Affiliations

  1. State Key Laboratory of Intelligent Transportation System, Highway Monitoring and Emergency Response Center, Ministry of Transport of China, Beijing, 100029, China

    Bin Li

  2. State Key Laboratory of Intelligent Transportation System, Research Institute of Highway Ministry of Transport, Beijing, 100088, China

    Lingyi Wu, Jian Gao & Yuqi Guo

  3. Highway Monitoring and Emergency Response Center, Ministry of Transport of China, Beijing, 100029, China

    Feng Yang & Mingyue Yan

Authors
  1. Bin Li
    View author publications

    Search author on:PubMed Google Scholar

  2. Lingyi Wu
    View author publications

    Search author on:PubMed Google Scholar

  3. Jian Gao
    View author publications

    Search author on:PubMed Google Scholar

  4. Feng Yang
    View author publications

    Search author on:PubMed Google Scholar

  5. Yuqi Guo
    View author publications

    Search author on:PubMed Google Scholar

  6. Mingyue Yan
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Bin Li: Conceptualization, Validation, Supervision, Funding acquisition, Writing—review & editing; Lingyi Wu: Writing—review & editing, Writing—original draft, Methodology, Formal analysis, Data curation; Jian Gao: Resources, Project administration; Feng Yang: Resources, Project administration; Yuqi Guo: Validation, Investigation; Mingyue Yan: Writing—review & editing.

Corresponding author

Correspondence to Lingyi Wu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, B., Wu, L., Gao, J. et al. Comparative analysis of machine learning models with SHAP interpretation for causes of highway flood-damage blocking. Sci Rep (2026). https://doi.org/10.1038/s41598-026-35074-8

Download citation

  • Received: 10 November 2025

  • Accepted: 02 January 2026

  • Published: 13 January 2026

  • DOI: https://doi.org/10.1038/s41598-026-35074-8

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Machine learning
  • Highway flood damage
  • Highway blocking
  • SHAP
  • Time-series data
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing Anthropocene

Sign up for the Nature Briefing: Anthropocene newsletter — what matters in anthropocene research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: Anthropocene