Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Communications
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. nature communications
  3. articles
  4. article
Test case sampling optimization for safety validation of automated driving systems
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 24 February 2026

Test case sampling optimization for safety validation of automated driving systems

  • Chen Qian  ORCID: orcid.org/0000-0002-0968-49691 na1,
  • Jingbin Xu  ORCID: orcid.org/0000-0003-3392-25142 na1,
  • Xin Xing  ORCID: orcid.org/0000-0001-9121-00863 &
  • …
  • Feng Guo  ORCID: orcid.org/0000-0002-2572-481X3,4 

Nature Communications , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Civil engineering
  • Statistics

Abstract

Testing and validating automated driving systems require carefully designed test cases that capture the complexity of real-world driving conditions. However, the inherent complexity of driving environments and the rarity of safety-critical situations pose significant challenges to developing reliable and efficient validation frameworks. This paper addresses these issues by selecting appropriate test cases from the largest-scale naturalistic driving study. We introduce a Kernel Test Case Sampling method, which selects cases satisfying two key criteria: representativeness, ensuring alignment with real-world scenarios, and coverage, capturing high-risk corner cases. To demonstrate the proposed method, it is applied to large-scale naturalistic driving study data. By selecting a limited number of cases, the method effectively captures long-tailed scenarios while approximating the distribution of naturalistic driving conditions. The sampling framework also enables robust accident-rate estimation, thereby ensuring fair comparisons across human driving performance and multiple systems. The proposed method supports standardized and scalable automated driving system safety validation, facilitating accelerated development and deployment while building public trust and regulatory confidence.

Similar content being viewed by others

World model-based end-to-end scene generation for accident anticipation in autonomous driving

Article Open access 05 August 2025

A matched case-control analysis of autonomous vs human-driven vehicle accidents

Article Open access 18 June 2024

A Dataset on Takeover during Distracted L2 Automated Driving

Article Open access 31 March 2025

Data availability

The scenario feature data and non-PII Strategic Highway Research Program Naturalistic Driving Study data are available under restricted access. Researchers may obtain access by submitting a Data Use License application, which requires specification of the requested data, documentation of Institutional Review Board (IRB) approval, institutional authorization, and a data security plan. Detailed instructions for data access are available at the following links: https://www.trb.org/StrategicHighwayResearchProgram2SHRP2/SHRP2DataSafetyAccess.aspxhttps://insight.shrp2nds.us/. The data underlying the figures presented in this paper are provided in the accompanying Source Data file. Source data are provided with this paper.

Code availability

The source code and implementation details are publicly available at Code Ocean through https://doi.org/10.24433/CO.9203840.v1

References

  1. Feng, S., Yan, X., Sun, H., Feng, Y. & Liu, H. X. Intelligent driving intelligence test for autonomous vehicles with naturalistic and adversarial environment. Nat. Commun. 12, 748 (2021).

    Google Scholar 

  2. Thorn, E., Kimmel, S. C., Chaka, M., Hamilton, B. A. et al. A framework for automated driving system testable cases and scenarios. Tech. Rep., United States. Department of Transportation. National Highway Traffic Safety Administration, DOT HS 812 623 (2018).

  3. Weber, H. et al. A framework for definition of logical scenarios for safety assurance of automated driving. Traffic Inj. Prev. 20, S65–S70 (2019).

    Google Scholar 

  4. Albrecht, H., Barickman, F. S., Schnelle, S. C. et al. Advanced test tools for ADAS and ADS. Tech. Rep., (2021).

  5. Yan, X. et al. Learning a naturalistic driving environment with statistical realism. Nat. Commun. 14, 2037 (2023).

    Google Scholar 

  6. Chen, R., Arief, M., Zhang, W. & Zhao, D. How to evaluate proving grounds for self-driving? A quantitative approach. IEEE Trans. Intell. Trans. Syst. 22, 5737–5748 (2020).

    Google Scholar 

  7. Rahmani, S. et al. A systematic review of edge case detection in automated driving: Methods, challenges and future directions. arXiv preprint arXiv:2410.08491 (2024).

  8. Abdel-Aty, M. & Ding, S. A matched case-control analysis of autonomous vs human-driven vehicle accidents. Nat. Commun. 15, 4931 (2024).

    Google Scholar 

  9. Venkatraman, V., Richard, C. M., Magee, K., Johnson, K. et al. Countermeasures that work: A highway safety countermeasure guide for state highway safety offices, 2020. Tech. Rep., United States. Department of Transportation. National Highway Traffic Safety Administration, DOT HS 813 097 (2021).

  10. Harb, M., Stathopoulos, A., Shiftan, Y. & Walker, J. L. What do we (not) know about our future with automated vehicles? Trans. Res. Part C: Emerg. Technol. 123, 102948 (2021).

    Google Scholar 

  11. Wang, Y., Yu, R., Qiu, S., Sun, J. & Farah, H. Safety performance boundary identification of highly automated vehicles: a surrogate model-based gradient descent searching approach. IEEE Trans. Intell. Trans. Syst. 23, 23809–23820 (2022).

    Google Scholar 

  12. Liu, H. X. & Feng, S. Curse of rarity for autonomous vehicles. Nat. Commun. 15, 4808 (2024).

    Google Scholar 

  13. Scanlon, J. M. et al. Waymo simulated driving behavior in reconstructed fatal crashes within an autonomous vehicle operating domain. Accid. Anal. Prev. 163, 106454 (2021).

    Google Scholar 

  14. Dingus, T. A. et al. Driver crash risk factors and prevalence evaluation using naturalistic driving data. Proc. Natl. Acad. Sci. 113, 2636–2641 (2016).

    Google Scholar 

  15. Guo, F. Statistical methods for naturalistic driving studies. Annu. Rev. Stat. Appl. 6, 309–328 (2019).

    Google Scholar 

  16. Antin, J., Stulce, K., Eichelberger, L. & Hankey, J. Naturalistic driving study: descriptive comparison of the study sample with national data.Tech. Rep., Virginia Tech Transportation Institute (2015).

  17. Ali, G., McLaughlin, S. & Ahmadian, M. The surface accelerations reference—a large-scale, interactive catalog of passenger vehicle accelerations. IEEE Trans. Intell. Trans. Syst. 24, 9031–9040 (2023).

    Google Scholar 

  18. Chatalic, A., Schreuder, N., Rosasco, L. & Rudi, A. Nyström kernel mean embeddings. In International Conference on Machine Learning, 3006–3024 (PMLR, 2022).

  19. Mak, S. & Joseph, V. R. Support points. Ann. Stat. 46, 2562–2592 (2018).

    Google Scholar 

  20. Zhang, J. et al. An optimal transport approach for selecting a representative subsample with application in efficient kernel density estimation. J. Comput. Graph. Stat. 32, 329–339 (2023).

    Google Scholar 

  21. Kim, B., Khanna, R. & Koyejo, O. O. Examples are not enough, learn to criticize. Criticism for interpretability. Advances in Neural Information Processing Systems 29, (2016).

  22. Joseph, V. R., Dasgupta, T., Tuo, R. & Wu, C. J. Sequential exploration of complex surfaces using minimum energy designs. Technometrics 57, 64–74 (2015).

    Google Scholar 

  23. Principe, J. C. Information theoretic learning: Rényi’s entropy and kernel perspectives (Springer Science & Business Media, 2010).

  24. Gretton, A., Borgwardt, K. M., Rasch, M. J., Schölkopf, B. & Smola, A. A kernel two-sample test. J. Mach. Learn. Res. 13, 723–773 (2012).

    Google Scholar 

  25. Vaswani, A. et al. Attention is all you need. Advances in Neural Information Processing Systems. 30, (2017).

  26. Chai, J. & Wang, X. Fairness with adaptive weights. In International Conference on Machine Learning, 2853–2866 (PMLR, 2022).

  27. Hankey, J. M., Perez, M. A. & McClafferty, J. A. Description of the shrp 2 naturalistic database and the crash, near-crash, and baseline data sets. Tech. Rep., Virginia Tech Transportation Institute (2016).

  28. Hammit, B. E., Ghasemzadeh, A., James, R. M., Ahmed, M. M. & Young, R. K. Evaluation of weather-related freeway car-following behavior using the shrp2 naturalistic driving study database. Trans. Res. Part F: Traffic Psychol. Behav. 59, 244–259 (2018).

    Google Scholar 

  29. Kumar, S., Mohri, M. & Talwalkar, A. Sampling methods for the nyström method. J. Mach. Learn. Res. 13, 981–1006 (2012).

    Google Scholar 

  30. Zhong, W., Qian, C., Liu, W., Zhu, L. & Li, R. Feature screening for interval-valued response with application to study association between posted salary and required skills. J. Am. Stat. Assoc. 118, 805–817 (2023).

    Google Scholar 

  31. Wang, Y., Sun, F. & Xu, H. On design orthogonality, maximin distance, and projection uniformity for computer experiments. J. Am. Stat. Assoc. 117, 375–385 (2022).

    Google Scholar 

  32. Kalra, N. & Paddock, S. M. Driving to safety: How many miles of driving would it take to demonstrate autonomous vehicle reliability? Trans. Res. Part A: Policy Pract. 94, 182–193 (2016).

    Google Scholar 

  33. Feng, S. et al. Dense reinforcement learning for safety validation of autonomous vehicles. Nature 615, 620–627 (2023).

    Google Scholar 

  34. Russell, L. et al. Gaia-2: A controllable multi-view generative world model for autonomous driving. Preprint at https://doi.org/10.48550/arXiv.2503.20523 (2025).

  35. Guan, Y. et al. World models for autonomous driving: An initial survey. IEEE Transactions on Intelligent Vehicles. (2024).

  36. ASAM. Open scenario. https://www.asam.net/standards/detail/openscenario/v200/ (2024).

  37. Si, S., Hsieh, C.-J. & Dhillon, I. S. Memory efficient kernel approximation. J. Mach. Learn. Res. 18, 1–32 (2017).

    Google Scholar 

  38. Teymur, O., Gorham, J., Riabiz, M. & Oates, C. Optimal quantisation of probability measures using maximum mean discrepancy. In International Conference on Artificial Intelligence and Statistics, 1027–1035 (PMLR, 2021).

  39. Pourahmadi, M. High-dimensional covariance estimation: with high-dimensional data. (John Wiley & Sons, 2013).

  40. Li, C.-L., Chang, W.-C., Cheng, Y., Yang, Y. & Póczos, B. Mmd gan: towards deeper understanding of moment matching network. Advances in Neural Information Processing Systems. 30, (2017).

  41. Han, R., Shi, P. & Zhang, A. R. Guaranteed functional tensor singular value decomposition. J. Am. Stat. Assoc. 119, 995–1007 (2024).

    Google Scholar 

  42. Joseph, V. R. Space-filling designs for computer experiments: a review. Qual. Eng. 28, 28–35 (2016).

    Google Scholar 

  43. Wolfer, G. & Alquier, P. Variance-aware estimation of kernel mean embedding. J. Mach. Learn. Res. 26, 1–48 (2025).

    Google Scholar 

  44. Simon-Gabriel, C.-J. & Schölkopf, B. Kernel distribution embeddings: universal kernels, characteristic kernels and kernel metrics on distributions. J. Mach. Learn. Res. 19, 1–29 (2018).

    Google Scholar 

  45. Pronzato, L. Performance analysis of greedy algorithms for minimising a maximum mean discrepancy. Stat. Comput. 33, 14 (2023).

    Google Scholar 

  46. Tillé, Y. Sampling and estimation from finite populations (John Wiley & Sons, 2020).

Download references

Acknowledgments

We thank Drs. Miguel Perez, Jon Hankey, Kevin Kefauver and Zac Doerzaph for their valuable guidance on naturalistic driving study data and automated driving systems testing and validation.

Author information

Author notes
  1. These authors contributed equally: Chen Qian, Jingbin Xu.

Authors and Affiliations

  1. Dalian University of Technology, School of Economics and Management, Dalian, China

    Chen Qian

  2. Dalian University of Technology, School of Mechanical Engineering, Dalian, China

    Jingbin Xu

  3. Virginia Tech, Department of Statistics, Blacksburg, VA, USA

    Xin Xing & Feng Guo

  4. Virginia Tech Transportation Institute, Blacksburg, VA, USA

    Feng Guo

Authors
  1. Chen Qian
    View author publications

    Search author on:PubMed Google Scholar

  2. Jingbin Xu
    View author publications

    Search author on:PubMed Google Scholar

  3. Xin Xing
    View author publications

    Search author on:PubMed Google Scholar

  4. Feng Guo
    View author publications

    Search author on:PubMed Google Scholar

Contributions

F. Guo. conceptualized the study and provided the data. C. Qian, J.Xu. and F. Guo developed the methodology. X. Xing validated the findings. C. Qian. and J.Xu performed the formal analysis. C. Qian. wrote the original draft, while all authors revising the editing the manuscript. F. Guo and X. Xing provided supervision.

Corresponding author

Correspondence to Feng Guo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Salvatore Cuomo, Francesco Finazzi, Marcos Nieto and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Qian, C., Xu, J., Xing, X. et al. Test case sampling optimization for safety validation of automated driving systems. Nat Commun (2026). https://doi.org/10.1038/s41467-026-69675-8

Download citation

  • Received: 28 February 2025

  • Accepted: 03 February 2026

  • Published: 24 February 2026

  • DOI: https://doi.org/10.1038/s41467-026-69675-8

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Videos
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Editors
  • Journal Information
  • Open Access Fees and Funding
  • Calls for Papers
  • Editorial Values Statement
  • Journal Metrics
  • Editors' Highlights
  • Contact
  • Editorial policies
  • Top Articles

Publish with us

  • For authors
  • For Reviewers
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Communications (Nat Commun)

ISSN 2041-1723 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics