The internal and external cost of motor vehicle crashes

Dai, Shian; Yu, Liqiang; Liu, Zhaoran; Cui, Mengying; Levinson, David

doi:10.1038/s41598-025-89058-1

Article
Open access
Published: 14 February 2025

Scientific Reports volume 15, Article number: 5441 (2025)

Abstract

Crash cost estimates are essential for evaluating road safety management policies and assessing the economic benefits of safety improvements. Existing studies often rely on aggregated crash data, assuming an even distribution of incidents, which overlooks significant spatial variations influenced by road characteristics and traffic conditions. This research presents a methodological framework for link-based crash cost analysis that considers both internal and external costs, enabling detailed quantification at a localized level. By employing safety performance functions and ordered probit models, we estimate on-road crash rates by crash type and injury severity, distinguishing between internal costs borne by individuals involved in crashes and external costs that impact victims, insurers, and government agencies. This framework is applied to the Minneapolis-St. Paul metropolitan area for a proof-of-concept. Our findings reveal that the costs incurred by drivers are higher than those imposed on others, and that highways are generally safer than surface streets. However, these crash costs are too low compared to the value of travel time to significantly influence route choices, even when drivers are aware of these costs. To enhance effective decision-making, related policies should consider offering incentives for safe driving practices. Future research on the practical applications of this framework is encouraged to maintain a dynamic dataset that reflects ongoing changes in road safety conditions.

Introduction

Crash cost estimates reflect the effects of road safety management policies and allow the appraisal of the economic benefits of road safety improvement projects^1,2. However, estimating crash costs is complex due to the involvement of various factors, including direct cost (such as property damage, medical, and legal costs), indirect cost (such as congestion, productivity loss for work and family, and tax losses), and intangible cost (such as loss of life or degradation in quality of life, and the pain and suffering for both victims and their families)^3,4,5,6,7. External cost estimation is particularly challenging, as it requires consideration of multiple stakeholders, including victims, insurance companies, government agencies, and affected individuals such as families and friends⁸.

Previous studies on crash cost estimates have largely focused on magnitude, reporting aggregated values from large-scale crash data and providing an overall compilation of the estimates without exploring the finer details. For instance, Blincoe et al.⁹ and Wijnen et al.¹ estimated that national-level crash costs range from 0.4% to 4.1% of Gross Domestic Product (GDP) in European countries and the United States. Similarly, a global comparison of crash fatality costs showed the same trends, but further indicated that less-developed countries have a lower percentage of GDP attributed to crash costs¹⁰. Levinson et al.¹¹ calculated crash costs per vehicle-kilometer-traveled (vkt), reporting $0.04/vkt for rural highways and $0.02/vkt for urban highways. This supports previous observations that rural areas suffer higher crash costs^12,13. Parry¹⁴ examined external costs borne by others, finding an average of 2.2–6.6 cents per mile annually for crashes with and without fatalities during 1998–2000. However, these studies often assume an even distribution of crashes and fail to capture spatial variations across road networks.

In reality, crash costs are influenced by road characteristics and traffic conditions, leading to significant fluctuations at a fine geographical scale^15,16,17. The principles of “Sustainably-Safe Traffic” emphasize the importance of encouraging drivers to select safer routes to reduce crash casualties^18,19, which requires more localized estimates of crash cost differences. Unfortunately, this aspect has not been adequately addressed in previous research.

To address these gaps, this study proposes a methodological framework for link-based crash cost analysis that considers both internal and external cost factors and explains how to quantify the average and marginal values at a microscopic level. It allows for an examination of spatial variations in crash costs, identifying which road segments are safer for travel.

As a proof-of-concept, this framework is applied to the Minneapolis-St. Paul (Twin Cities) metropolitan area to assess crash costs across the region’s road network. The data, methodology, results, and conclusion of this study are discussed in sections Methodology–Conclusions in turn.

Methodology

The National Safety Council (NSC) introduced the “KABCO” injury scale to classify crash severity levels: “K” for fatal crashes, “A” for incapacitating injuries, “B” for non-incapacitating injuries, “C” for pain complaints, and “O” for property-damage-only crashes. This scale has been widely adopted for reporting crash records and establishing crash cost estimates²⁰. Previous studies typically estimated overall crash frequency without distinguishing crash types initially, and then analyzed crash severity levels separately^21,22,23,24. These estimates can be further used for crash cost analysis by applying unit crash cost specifications to monetize the results. These steps form the line of reasoning followed in this study, as outlined in Fig. 1.

This framework can be applied to any region where the necessary data, including crash records and link-related variables, are accessible. Detailed methodologies are presented in the following subsections.

Crash frequency

Safety Performance Functions (SPFs), as defined in the Highway Safety Manual (HSM)²⁵, are statistical base models that can be used to estimate the average crash frequency based on specific roadway features under existing conditions or to predict crash frequency under projected future conditions²⁶. SPFs are applicable at both the micro level (e.g., individual road segments or intersections) and the macro level (e.g., transportation analysis zones or counties)²⁷.

In this study, micro-level estimation is performed, considering individual link segments as observations. The resulting models are applied to all links across the metropolitan area. Notably, conventional variables used in SPFs include Annual Average Daily Traffic (AADT) and segment length. However, additional variables, such as those representing driving speed or speed variance, are also tested to improve model performance.

We use the Negative Binomial Distribution for the SPFs, which is an extension of the Poisson Distribution that effectively models count data with overdispersion (where variance exceeds the mean)²⁶. This overdispersion often arises from the nature of crash occurrences, as many roads report a frequency of zero crashes, while only a few locations experience multiple incidents. Traditional models may underestimate the variability in such cases, leading to inaccurate representations of crash patterns.
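A quick way to check for overdispersion is to compare the sample mean and variance of link-level crash counts. The sketch below does this for a made-up list of counts; the values are purely illustrative and are not taken from the study's data.

```python
from statistics import mean, pvariance

# Hypothetical crash counts per link (illustrative only, not study data):
# most links report zero crashes, a few report many.
crash_counts = [0, 0, 0, 0, 0, 0, 1, 0, 2, 0, 0, 7, 0, 1, 12]

m = mean(crash_counts)
v = pvariance(crash_counts)

# A Poisson model implies variance == mean. A ratio well above 1
# points to overdispersion and motivates a Negative Binomial model.
dispersion_ratio = v / m
print(f"mean={m:.2f}, variance={v:.2f}, ratio={dispersion_ratio:.2f}")
```

A ratio close to 1 would suggest that a plain Poisson specification might be adequate for that sample.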

The Negative Binomial model for SPFs is expressed as follows,

$$\begin{aligned} \ln (y)=\beta _{0}+\sum _{k=1}^{K}\beta _{k}x_{k} \end{aligned}$$

(1)

where y is the dependent variable (the number of crashes), $x_{k}$ are the independent variables, K is the number of independent variables, and $\beta _{k}$ are the coefficients to be estimated.
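To make the log-linear form of Eq. (1) concrete, the following minimal sketch evaluates an SPF for one link. The function name `expected_crashes`, the choice of predictors, and every coefficient value are assumptions made for illustration; they are not the estimates reported in this paper.

```python
import math

def expected_crashes(aadt, length_km, speed_var, urban, coef):
    """Evaluate ln(y) = b0 + sum_k b_k * x_k and return y."""
    log_y = (
        coef["intercept"]
        + coef["ln_aadt"] * math.log(aadt)          # AADT enters in log form
        + coef["ln_length"] * math.log(length_km)   # segment length in log form
        + coef["speed_var"] * speed_var             # speed-variance indicator
        + coef["urban"] * urban                     # 1 = urban, 0 = rural
    )
    return math.exp(log_y)

# Hypothetical coefficients, chosen only to give plausible magnitudes.
coef = {"intercept": -7.0, "ln_aadt": 0.8, "ln_length": 0.6,
        "speed_var": 0.01, "urban": 0.3}

n = expected_crashes(aadt=20000, length_km=1.0, speed_var=15.0, urban=1, coef=coef)
print(f"expected crashes over the analysis period: {n:.2f}")
```

Because the model is linear in the logarithm, each coefficient on a log-transformed variable acts as an elasticity of crash frequency with respect to that variable.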

Crash severity

The ordered probit model is widely used for crash severity analysis and is suitable for cases involving categorical dependent variables^28,29,30. Specifically, it recognizes the ordinal nature of the data and operates under the assumption of a continuous underlying variable with normally distributed errors, enabling more efficient parameter estimation while reducing the number of required parameters. Moreover, this model effectively addresses the limitations found in traditional approaches, such as the independence of irrelevant alternatives and the lack of a closed-form likelihood^30,31.

Its general specification is expressed as:

$$\begin{aligned} y^{*}_{j}=X_{j}\beta +\varepsilon _{j} \end{aligned}$$

(2)

where $y^{*}_{j}$ is a latent variable describing the crash severity of the $j^{th}$ crash, $X_{j}$ is a vector of independent variables, $\beta$ is a vector of coefficients, $\varepsilon _{j}$ is the random error term.

The observed variable $y_{j}$ is resolved by the following model,

$$\begin{aligned} y_{j}= \left\{ \begin{array}{l l} 1 & \quad \text{if } -\infty < y^{*}_{j} \le \mu _{1} \\ 2 & \quad \text{if } \mu _{1} < y^{*}_{j} \le \mu _{2}\\ 3 & \quad \text{if } \mu _{2} < y^{*}_{j} \le \mu _{3}\\ 4 & \quad \text{if } \mu _{3} < y^{*}_{j} \le \mu _{4}\\ 5 & \quad \text{if } \mu _{4} < y^{*}_{j} < \infty \\ \end{array}\right. \end{aligned}$$

(3)

where $y_{j}=(1,2,3,4,5)$ stands for the different severity levels, namely property damage only, complaint of pain, non-incapacitating injury, incapacitating injury, and fatal crashes, respectively; $\mu _{1}$, $\mu _{2}$, $\mu _{3}$ and $\mu _{4}$ are the threshold values to be estimated.
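Equations (2) and (3) imply that the probability of each severity level is the difference of two normal CDF values evaluated at adjacent thresholds. The sketch below illustrates this, assuming a standard normal error term; the linear index `xb` and the threshold values are hypothetical, not estimates from this study.

```python
import math

def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def severity_probabilities(xb, thresholds):
    """Return P(y = k) for k = 1..len(thresholds)+1 given the index X*beta."""
    # P(y* <= mu_k) = Phi(mu_k - xb) for each threshold mu_k
    cuts = [normal_cdf(mu - xb) for mu in thresholds]
    bounds = [0.0] + cuts + [1.0]
    return [bounds[k + 1] - bounds[k] for k in range(len(bounds) - 1)]

# Hypothetical thresholds mu_1..mu_4 and linear index (illustrative only).
thresholds = [0.5, 1.2, 2.0, 2.8]
probs = severity_probabilities(xb=0.3, thresholds=thresholds)

for level, p in zip(["O", "C", "B", "A", "K"], probs):
    print(f"{level}: {p:.3f}")
```

A larger value of the index shifts probability mass toward the more severe categories, which is how a positive coefficient should be read in this type of model.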

Unit crash cost specifications

From a traveler’s perspective, crash costs include both internal and external components. Internal costs refer to the expenses borne by each traveler involved in a crash, while external costs account for the impact on others, such as victims, insurance companies, and government agencies⁸.

Blincoe et al.⁹ evaluated the economic costs of motor vehicle crashes by examining various cost factors based on the severity of incidents. These cost factors include direct expenses, such as medical expenses, property damage, legal fees, and vehicle repairs, as well as indirect costs like lost productivity, insurance premiums, and administrative expenses. However, a key question remains: to what extent are these cost factors external to individual travelers?

Vickrey³² defined crash externality as the increase in crash costs experienced by existing drivers due to additional vehicles on the roads. This definition simplifies the analysis by bypassing the complexities of joint effects, such as driving behavior, insurance policies, and traffic laws. Empirical studies have shown that marginal changes in crash costs can sometimes be negative, as increased congestion raises crash rates but lowers severity due to slower speeds^33,34.

Parry¹⁴ quantified the external portion of various cost factors for single- and multi-vehicle crashes, providing a foundation for subsequent crash cost studies^8,35,36. In our study, we define unit crash cost specifications using estimates from Blincoe et al.⁹ as a reference, see Table 1. This table outlines the cost factors considered in our crash cost estimates and the external proportion of each cost factor based on Parry¹⁴’s study.

In summary, in our estimates,

The internal costs of crashes fully account for lost productivity, both in the market and household settings, and partially include medical expenses, property damage, emergency service costs, insurance administration, and legal fees. The proportions of these costs are determined by individual health and vehicle insurance policies; however, we use an aggregated average here. The loss of quality of life is considered as an internal cost if the incident involves a single vehicle; in the case of multi-vehicle crashes, it is only partially accounted for as an internal cost.
The external costs of crashes include the remaining medical expenses, property damage, emergency service costs, insurance administration, legal fees, and the loss of quality of life in multi-vehicle incidents. This category also fully accounts for workplace disruptions due to employee loss or absence, as well as congestion costs resulting from vehicle crashes.

Based on the unit crash cost specifications, the average internal and external crash costs can be estimated as follows:

$$\begin{aligned} C_{\bar{s},i_{f,Q}}=\sum _{z} \frac{N_{s,i_{f,Q}}\times R_{s,i_{f,Q},z}\times u_{s_{z}}}{N_{Y}\times N_{D}\times Q} \end{aligned}$$

(4)

where $C_{\bar{s},i_{f,Q}}$ is the average crash cost on link $i_{f}$, with f denoting the functional road classification and Q the annual average daily traffic (AADT), which defines the traffic condition; $N_{s,i_{f,Q}}$ is the expected crash frequency on link $i_{f,Q}$; $R_{s,i_{f,Q},z}$ is the probability that a crash on link $i_{f,Q}$ is of severity level z; $u_{s_{z}}$ is the unit crash cost, internal or external, for severity level z; and $N_{Y}$ and $N_{D}$ describe the duration of the analysis period, representing the number of years of crash counts and the number of days in a year, respectively.
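As an illustration of Eq. (4), the sketch below combines an expected crash count, a severity distribution, and unit costs into an average cost per vehicle using the link. All numbers, including the unit costs, are hypothetical placeholders rather than the values in Table 1, and the function name is an assumption.

```python
def average_crash_cost(expected_crashes, severity_probs, unit_costs,
                       n_years, aadt, days_per_year=365):
    """Average crash cost per vehicle on a link, following Eq. (4)."""
    # Severity-weighted crash cost accumulated over the analysis period
    total_cost = sum(expected_crashes * p * u
                     for p, u in zip(severity_probs, unit_costs))
    # Divide by the number of vehicles that used the link in that period
    return total_cost / (n_years * days_per_year * aadt)

# Hypothetical inputs, ordered O, C, B, A, K (illustrative only).
severity_probs = [0.75, 0.15, 0.07, 0.025, 0.005]
unit_costs = [5_000, 30_000, 80_000, 400_000, 5_000_000]  # $ per crash

cost = average_crash_cost(expected_crashes=24, severity_probs=severity_probs,
                          unit_costs=unit_costs, n_years=12, aadt=20_000)
print(f"average crash cost: ${cost:.4f} per vehicle")
```

Running the same function once with internal unit costs and once with external unit costs would give the two cost components for the link.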

Marginal crash costs are calculated by introducing one additional vehicle on each link to evaluate the combined effects of changes in crash frequency ($N_{s,i_{f,Q}}$) and severity ($R_{s,i_{f,Q},z}$), expressed as:

$$\begin{aligned} C_{\hat{s},i_{f,Q}}=(C_{\bar{s},i_{f,Q+1}}-C_{\bar{s},i_{f,Q}})\times Q \end{aligned}$$

(5)
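Equation (5) is a finite difference: the average cost is re-evaluated with one more vehicle and the change is scaled by the existing traffic Q. The sketch below applies it to a toy average-cost function; the SPF coefficients, the fixed cost per crash, and the 12-year period are assumptions made for illustration only.

```python
import math

def avg_cost(q):
    """Hypothetical average crash cost ($ per vehicle) at AADT q."""
    crashes = math.exp(-9.0 + 1.3 * math.log(q))   # toy SPF in AADT only
    cost_per_crash = 50_000.0                       # toy severity-weighted cost
    return crashes * cost_per_crash / (12 * 365 * q)

def marginal_cost(q):
    """Eq. (5): change in average cost from one extra vehicle, times Q."""
    return (avg_cost(q + 1) - avg_cost(q)) * q

q = 20_000
print(f"average:  {avg_cost(q):.5f}")
print(f"marginal: {marginal_cost(q):.5f}")
```

In this toy setup the marginal cost is positive because crash frequency grows faster than traffic. With an SPF elasticity below one, or with severity falling as traffic rises, the same formula can return a negative value.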

Table 1 Unit crash cost specification and their external portion.

Data

This study incorporates multiple datasets to implement the proposed framework in a real-world scenario, as outlined in the process shown in Fig. 1. The geographical scope of the study area, the seven-county Minneapolis-St. Paul Metropolitan Area, is shown in Fig. 2.

Crash records

Crash records from 2003 to 2014 were obtained from the Minnesota Department of Transportation (MnDOT). Note that these records track police-reported crashes, which are more reliable for documenting severe crashes but likely under-report minor ones. Throughout the remainder of the text, crash data refers only to reported crashes.

The records for each year include GIS attributes, e.g., route numbers, reference points, and coordinates, along with crash-related details like type, severity, weather conditions, and lighting. These records are provided as GIS shapefiles, enabling precise mapping onto the road network. The data can be aggregated by link segment to calculate crash counts for use as dependent variables in safety performance functions or analyzed individually to examine crash severity for ordered probit models.
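The aggregation step described above can be sketched as a simple count of crash records per link and crash type. The record layout, field names, and link identifiers below are hypothetical and do not reflect the actual MnDOT schema.

```python
from collections import Counter

# Hypothetical crash records already matched to network links (illustrative only).
crash_records = [
    {"link_id": "A12", "severity": "O", "vehicles": 2},
    {"link_id": "A12", "severity": "C", "vehicles": 2},
    {"link_id": "A12", "severity": "O", "vehicles": 1},
    {"link_id": "B07", "severity": "K", "vehicles": 1},
]

# Crash counts per (link, crash type): the dependent variable for an SPF.
counts = Counter(
    (rec["link_id"], "single" if rec["vehicles"] == 1 else "multi")
    for rec in crash_records
)

for (link_id, crash_type), n_crashes in sorted(counts.items()):
    print(link_id, crash_type, n_crashes)
```

The individual records, kept unaggregated, would serve as observations for the severity model.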

Table 2 summarizes the number of crashes, categorized by crash type, severity, and Functional Road Classification, over the 12-year period.

Table 2 Total number of crashes (12 years) by crash type, severity, and functional road classification.

Link variables

The TomTom road network, sourced from the Metropolitan Council, is drawn as a polyline shapefile that provides spatial details of roadways within the Twin Cities network³⁷. Variables such as link length and road type are derived directly from this dataset.

TomTom speed data offers speed estimates aggregated from millions of GPS records and linked to the TomTom road network. These estimates are stratified by time periods, dividing a day into seven parts to account for peak and non-peak hours, as well as by speed percentiles, ranging from the fastest 5% to the slowest 5% of recorded speeds. For this study, the 50th percentile (median) speed during morning peak hours (7 a.m. to 9 a.m.) is selected to represent travel speed, while the difference between the 10th and 90th percentiles is used as an indicator of speed variance. These measures serve as independent variables in crash frequency estimation models. Notably, the speed variance reflects an aggregated yearly index across all vehicles using the link, rather than intra-day or intra-vehicle variations.

The MnDOT Traffic Volume Program³⁸ provides Annual Average Daily Traffic (AADT) estimates for Minnesota, based on data collected from approximately 33,000 count locations on trunk highways, county state aid highways (CSAH), county roads (CR), and municipal state aid streets (MSAS). Traffic counts, typically recorded over short durations (e.g., 48 hours), are adjusted using seasonal and axle correction factors (for trunk highways). As a standard independent variable in safety performance functions, AADT data for the Twin Cities metro region was extracted and integrated with the TomTom road network.

The Federal Urban/Rural GIS Shapefile, obtained from MnDOT’s Transportation Data and Analysis division³⁹, delineates roadways in Minnesota by Federal Adjusted Urban Area boundaries into Urban, Small Urban, and Rural classifications. This dataset is also linked to the TomTom road network for further analysis.

Results

Separate models are developed for estimating crash frequency and severity, specific to crash type and functional road classifications, for two main reasons: first, these estimates are used in crash cost analysis, where internal and external cost assignments differ by crash type and by single- versus multi-vehicle crashes; second, using specialized models for different road types is statistically validated, as road types have distinct attributes influencing crash characteristics^23,40.

Safety performance function

The selected independent variables for the SPFs are described in Table 3. Note that $V_{\text {Var}}$ is not the typical measure of speed variance, which reflects the dispersion of space-mean speeds among drivers within or across lanes at the same time^41,42. Instead, it more likely represents the dispersion of time-mean speeds, calculated as the difference between the fastest 5% and the slowest 5% speeds over a specific time period, such as morning peak hours across a year.

Table 3 Definitions and descriptive statistics of independent variables selected for the safety performance functions.

Table 4 Safety performance function results for single-vehicle and multi-vehicle crashes by roadway class.

The regression results for the SPFs are presented in Table 4. Note that speed (V) is excluded from all models to avoid multicollinearity, as it is highly correlated with AADT (Q). Both AADT (Q) and segment length (L) are transformed into their natural logarithmic forms for better model performance. Alternative functional forms were tested but did not improve the model fits.

The conventional variables (Q and L) have significant positive effects on crash counts for both single- and multi-vehicle crashes across all road classifications. As expected, links with higher AADT or longer lengths experience more crashes, regardless of the crash type. Speed variance, which indicates on-road shockwaves, is positively correlated with crash counts, highlighting that more severe stop-and-go driving conditions are associated with higher collision rates, particularly for multi-vehicle crashes.

Additionally, urban roadways tend to have higher crash counts than rural ones. This pattern may be attributed to network structure features, such as higher road density in urban areas.

Ordered probit model

The same link property attributes are used as independent variables in the ordered probit models to estimate the probability of each injury severity category for a given crash. Additional dummy variables are included to capture road surface conditions, including $W_{\text {Wet}}$, $W_{\text {Snow}}$, $W_{\text {Iced}}$, and $W_{\text {Others}}$, with dry road surface serving as the baseline. To account for light conditions, two dummy variables are added to identify the road light-on ($D_{\text {light-on}}$) and light-off ($D_{\text {light-off}}$) scenarios, using daylight as the reference group. The descriptive statistics for these variables are summarized in Table 5.

As in the safety performance functions, the natural logarithmic transformations of Q and L are used, as they yield lower AIC values and higher pseudo $R^{2}$, compared to the untransformed variables. The regression results for the ordered probit models are shown in Table 6.

The analysis reveals that AADT is negatively correlated with crash severity, indicating that higher traffic volumes are associated with less severe crashes, likely because drivers tend to exercise greater caution on busier roadways. Segment length shows a positive relationship with crash severity, as longer segments are associated with more severe crashes. Speed variance has some impact, particularly in cases such as multi-vehicle crashes on primary and minor arterials; however, the coefficients are too small to significantly affect the probability of each injury severity category. This finding is surprising, as greater speed variance was expected to have a stronger positive effect, particularly for multi-vehicle crashes. Future studies should consider this question. Additionally, urban roadways are associated with less severe crashes, likely due to lower speeds and shorter travel distances in urban environments.

The road surface condition variables show that crashes on wet, snowy, and icy roads are associated with lower injury severity compared to crashes on dry roads. These results align with previous findings that adverse winter conditions, such as snow and ice, tend to reduce injury severity due to lower average speeds and more cautious driving. However, property damage-only crashes are more frequent during winter, as noted in other studies^43,44,45.

Link-based crash frequency and severity estimates

Link-based crash frequency and severity, along with their marginal changes, are estimated under current traffic conditions. The mean values across different link types are summarized in Table 7. As expected, highways experience a higher number of crashes compared to surface roadways for both single- and multi-vehicle crashes; however, the proportion of injury crashes is comparatively lower, with property damage-only (PDO) crashes accounting for over 70% of the total. Urban roads generally have a higher crash frequency than rural roads, while roads in core cities, Minneapolis and St. Paul, exhibit lower crash frequencies compared to suburban areas (urbanized areas outside the core cities). The severity distribution across area types indicates that fatal crashes, for both single- and multi-vehicle types, represent a higher percentage in rural areas. Additionally, the introduction of more vehicles on roads increases crash frequency but reduces injury severity, consistent with the regression results.

Table 5 Descriptive statistics of independent variables selected for the ordered probit models.

Table 6 Regression results of ordered probit models to estimate crash severity for single-vehicle and multi-vehicle crashes by roadway class.

Table 7 Estimates of average crash frequency and severity and their marginal changes.

Crash cost estimates

The internal and external crash costs are estimated for each link in the Twin Cities road network, considering both average and marginal costs. Weighted average costs are calculated to summarize the estimates using the following formula:

$$\begin{aligned} \bar{C}_{w,s}=\frac{\sum _{i}C_{s,i}\times Q_{i}\times L_{i}}{\sum _{i}Q_{i}\times L_{i}} \end{aligned}$$

(6)

where $\bar{C}_{w,s}$: Weighted average crash cost; $C_{s,i}$: Crash cost on link i ($/veh); $Q_{i}$: AADT on link i; $L_{i}$: Segment length of link i (km).
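Equation (6) is a traffic-weighted mean, with each link weighted by its vehicle-kilometers ($Q_{i}L_{i}$). The sketch below applies it to three made-up links; the cost, AADT, and length values are hypothetical, and the cost is taken in whatever unit the link-level estimate uses.

```python
def weighted_average_cost(links):
    """Eq. (6): average of link costs weighted by AADT * segment length."""
    numerator = sum(cost * aadt * length for cost, aadt, length in links)
    denominator = sum(aadt * length for _, aadt, length in links)
    return numerator / denominator

# Hypothetical links: (crash cost, AADT, length in km). Illustrative only.
links = [
    (0.10, 50_000, 2.0),   # busy highway segment
    (0.20, 10_000, 1.0),   # arterial
    (0.15, 5_000, 4.0),    # long, lightly used road
]

c_w = weighted_average_cost(links)
print(f"weighted average crash cost: {c_w:.4f}")
```

Weighting by vehicle-kilometers means that heavily traveled links dominate the summary, so the result reflects the cost faced by a typical unit of travel rather than by a typical link.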

The results, presented in Table 8, show that the weighted average internal crash cost across all links in the Twin Cities is approximately $0.13/veh-km, with 94.5% of links having a value below $1.00/veh-km. While costs are similar by area type, roads in core cities generally exhibit higher internal crash costs than suburban roads, and rural roads are slightly more hazardous. Highways, however, are significantly safer than other surface roadways, with a weighted average cost of about one-third to one-half of that for other roads.

Table 8 Link-based Internal and External Crash Cost Estimates ($/vkt): This table shows that the internal crash costs incurred by travelers themselves are higher than the external costs imposed on others, and that highway crash costs are lower compared to those for other road types.

The external average crash cost imposed on others is much lower than the internal costs borne by travelers. The weighted average external cost is approximately $0.08/veh-km, with 98% of links having an external average crash cost below $1.00/veh-km. The trends for external costs by area type and road type are the same as those for internal crash costs.

Marginal internal and external crash costs are lower than their average counterparts, but the patterns remain consistent: internal crash costs borne by travelers exceed external costs imposed on others, and highways and suburban roads (outside core cities) are safer than other roadways.

Our estimates reveal no significant conflicts with previous studies in terms of magnitude (see Notes). For instance, Levinson and Gillen⁴⁶ reported an average crash cost of $0.048 per veh-km for intercity highways. Although this estimate does not specify the proportions of internal and external costs, we believe it primarily reflects the internal costs of crashes, as the factors considered mainly pertain to the functional years lost due to personal injury severity. Additionally, several studies have estimated external crash costs, e.g., Lemp and Kockelman³⁶, who reported $0.077 per veh-km, and Parry and Small⁴⁷, who reported $0.026 per veh-km in the United States. However, to the best of our knowledge, no previous studies provide fine-grained crash cost estimates that differentiate between internal and external costs by urban area and road type. Therefore, no further comparisons can be made at this stage.

Figure 3 illustrates the spatial distribution of crash cost estimates across the network. It confirms that internal crash costs exceed external costs for both average and marginal calculations (Fig. 3a vs. 3b and Fig. 3c vs. 3d). Additionally, highways are notable for being safer than surface roadways, as indicated by the blue shape in the network maps.

Conclusion

This study develops a framework for link-based crash cost analysis, which quantifies the internal and external costs of vehicle crashes at a detailed, microscopic level and implements these costs across a metropolitan road network. To be more specific, internal costs refer to those borne by travelers involved in crashes, while external costs are those imposed on others, such as victims, insurance companies, or government agencies.

The framework is applied to the Twin Cities region as a case study. The results indicate that the weighted average internal crash cost is approximately $0.130/veh-km, while the average external crash cost is about $0.079/veh-km. This demonstrates that travelers bear significantly higher crash costs themselves compared to what they impose on others. Importantly, highways exhibit lower average internal and external crash costs compared to surface roadways, reinforcing the conclusion that they are safer due to their effectiveness in reducing crash-related costs, in line with 2022 statistics from the Insurance Institute for Highway Safety⁴⁸. This finding suggests that superior design standards of highways, such as separation of traffic, access control, and surface configurations, enhance both travel efficiency and safety. Therefore, allocating resources to upgrade existing roadway infrastructure to higher standards could effectively reduce the incidence of crashes and enhance overall safety in urban areas. However, specific investments should be guided by detailed benefit-cost evaluations, consideration of government budgets, and alignment with strategic plans, particularly if other safety management projects need to take priority for greater efficiency.

Given a value of time at $18.30/hr ($0.305/min)⁴⁹, which is about twice the average internal crash cost and three times the external cost (based on the network’s mean 50th percentile speed of 62.58 km/hr), crash costs appear to play a smaller role than travel time in route decisions, even if travelers were aware of these costs⁵⁰. However, insurance companies could play a crucial role by offering incentives for safe driving practices. For instance, reduced premiums or rewards for drivers who maintain a clean driving record could encourage more cautious behavior behind the wheel. This approach nonetheless raises important technical questions about how to keep link-based datasets dynamically updated and how to communicate this information effectively to drivers. If these challenges can be addressed, a positive feedback loop could be established: as dangerous routes are improved and become safer, the updated estimates would reflect these changes, potentially altering drivers’ route preferences even further. Policymakers may also benefit from addressing these challenges, as doing so will help identify areas that require targeted safety interventions. Additionally, these datasets could be particularly valuable for the decision-making processes of intelligent automated vehicles, which, while generally safer, will likely still face risks in mixed environments with human-driven vehicles, pedestrians, and bicyclists.
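The comparison between the value of time and crash costs can be reproduced by converting the per-kilometer costs into per-hour costs at the mean network speed. The sketch below shows one plausible reading of that conversion, using only figures quoted in the text; the paper does not spell out the calculation, so the method itself is an assumption.

```python
# Figures quoted in the text
value_of_time_per_hr = 18.30    # $/hr
mean_speed_kmh = 62.58          # km/hr, mean 50th percentile speed
internal_cost_per_km = 0.130    # $/veh-km
external_cost_per_km = 0.079    # $/veh-km

# Crash cost accumulated during one hour of driving at the mean speed
internal_per_hr = internal_cost_per_km * mean_speed_kmh
external_per_hr = external_cost_per_km * mean_speed_kmh

ratio_internal = value_of_time_per_hr / internal_per_hr
ratio_external = value_of_time_per_hr / external_per_hr

print(f"internal: ${internal_per_hr:.2f}/hr, ratio {ratio_internal:.2f}")
print(f"external: ${external_per_hr:.2f}/hr, ratio {ratio_external:.2f}")
```

Under this reading the ratios come out somewhat above the rounded figures quoted in the text, which is consistent with the conclusion that travel time outweighs crash costs in route choice.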

Notably, this study uses historical crash records, constrained by limited data access, to validate the practicality of the theoretical framework for link-based crash cost estimates. While we do not anticipate that our key findings would change significantly with the latest updated dataset, given the recent statistics from crash reports and the consistency of previous studies over the past decade, we highly encourage future research to keep the data updated, preferably in a dynamic manner. This will better inform safety-related policy adjustments and enhance policy implementation.

Additionally, the estimates of crash frequency and severity, which serve as the foundation for the link-based crash cost analysis, are derived using negative binomial regression and ordered probit models. Currently, these models lack several important link attributes, such as the number of lanes, speed limits, curvature, and slope, factors that are anticipated to enhance predictive accuracy. While this limitation is not significant for the case of the Twin Cities, which are predominantly flat with well-maintained roads and effective lighting, it would be beneficial for future studies, especially those applying our framework in areas with varying slopes and road conditions, to collect these data for more reliable estimates.

Finally, for the unit cost specification, the framework employs Parry¹⁴’s settings for the external proportion of crash costs, which are derived from plausible yet heuristic methods. Refinements to these settings would improve accuracy, but would require more detailed data, including information on driver behavior, insurance policies, crash records, responsibility distribution, vehicle types, and other relevant factors.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Notes

All estimates from previous studies are converted to 2014 U.S. dollars.

References

Wijnen, W. et al. An analysis of official road crash cost estimates in European countries. Saf. Sci. 113, 318–327. https://doi.org/10.1016/j.ssci.2018.12.004 (2019).
Wijnen, W. & Stipdonk, H. Social costs of road crashes: An international analysis. Accid. Anal. Prev. 94, 97–106. https://doi.org/10.1016/j.aap.2016.05.005 (2016).
Goodchild, M., Sanderson, K. & Nana, G. Measuring the total cost of injury in New Zealand. Dept of Labour publication https://doi.org/10.1136/INJURYPREV-2012-040580A.31 (2002).
Zhang, A. et al. Towards estimating the social and environmental costs of transportation in Canada (The University of British Columbia, Tech. Rep., 2004).
Harmon, T. et al. Crash costs for highway safety analysis (Tech. Rep, Federal Highway Administration, 2018).
Elvik, R., Vaa, T., Hoye, A. & Sorensen, M. The Handbook of Road Safety Measures (Emerald Group Publishing, 2009).
Bougna, T., Hundal, G. & Taniform, P. Quantitative analysis of the social costs of road traffic crashes literature. Accid. Anal. Prev. 165, 106282. https://doi.org/10.1016/j.aap.2021.106282 (2022).
Small, K. A., Verhoef, E. T. & Lindsey, R. The Economics of Urban Transportation (Routledge, 2007).
Blincoe, L., Miller, T. R., Zaloshnja, E. & Lawrence, B. A. The economic and societal impact of motor vehicle crashes, 2010 (revised) (Tech. Rep, US Department of Transportation, National Highway Traffic Safety Administration, 2015).
Jacobs, G., Aeron-Thomas, A. & Astrop, A. Estimating global road fatalities (Tech. Rep, Department for International Development, 2000).
Levinson, D., Mathieu, J. M., Gillen, D. & Kanafani, A. The full cost of high-speed rail: An engineering approach. Ann. Reg. Sci. 31, 189–215. https://doi.org/10.1007/s001680050045 (1997).
Marshall, W. E. & Ferenchak, N. N. Assessing equity and urban/rural road safety disparities in the US. J. Urban.: Int. Res. Placemak. Urban Sustain. 10, 422–441. https://doi.org/10.1080/17549175.2017.1310748 (2017).
Zwerling, C. et al. Fatal motor vehicle crashes in rural and urban areas: Decomposing rates into contributing factors. Injury Prev. 11, 24–28. https://doi.org/10.1136/ip.2004.005959 (2005).
Parry, I. W. Comparing alternative policies to reduce traffic accidents. J. Urban Econ. 56, 346–368. https://doi.org/10.1016/j.jue.2004.04.004 (2004).
Bíl, M., Andrášik, R. & Sedoník, J. Which curves are dangerous? A network-wide analysis of traffic crash and infrastructure data. Transp. Res. Part A: Policy Pract. 120, 252–260. https://doi.org/10.1016/j.tra.2019.01.001 (2019).
Noland, R. B. & Adediji, Y. Are estimates of crash modification factors mis-specified?. Accid. Anal. Prev. 118, 29–37. https://doi.org/10.1016/j.aap.2018.05.017 (2018).
Wang, L., Abdel-Aty, M. & Lee, J. Safety analytics for integrating crash frequency and real-time risk modeling for expressways. Accid. Anal. Prev. 104, 58–64. https://doi.org/10.1016/j.aap.2017.04.009 (2017).
Dijkstra, A. & Drolenga, H. Safety effects of route choice in a road network: Simulation of changing route choice (Tech. Rep, SWOV Institute for Road Safety Research, 2008).
Wegman, F., Aarts, L. & Bax, C. Advancing sustainable safety: National road safety outlook for the Netherlands for 2005–2020. Saf. Sci. 46, 323–343. https://doi.org/10.1016/j.ssci.2007.06.013 (2008).
Federal Highway Administration. Highway safety improvement program. https://safety.fhwa.dot.gov/hsip/ (2021).
Lord, D. & Mannering, F. The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives. Transp. Res. Part A: Policy Pract. 44, 291–305. https://doi.org/10.1016/j.tra.2010.02.001 (2010).
Lee, J. & Mannering, F. Impact of roadside features on the frequency and severity of run-off-roadway accidents: An empirical analysis. Accid. Anal. Prev. 34, 149–161. https://doi.org/10.1016/S0001-4575(01)00009-4 (2002).
Carson, J. & Mannering, F. The effect of ice warning signs on ice-accident frequencies and severities. Accid. Anal. Prev. 33, 99–109. https://doi.org/10.1016/s0001-4575(00)00020-8 (2001).
Mahmud, A., Gayah, V. V. & Paleti, R. Estimation of crash type frequency accounting for misclassification in crash data. Accid. Anal. Prev. 184, 106998. https://doi.org/10.1016/j.aap.2023.106998 (2023).
American Association of State Highway and Transportation Officials (AASHTO). Highway Safety Manual. Tech. Rep. (2010).
Brimley, B., Saito, M. & Schultz, G. Calibration of highway safety manual safety performance function: Development of new models for rural two-lane two-way highways. Transp. Res. Rec.: J. Transp. Res. Board https://doi.org/10.3141/2279-10 (2012).
Cai, Q., Lee, J., Eluru, N. & Abdel-Aty, M. Macro-level pedestrian and bicycle crash analysis: Incorporating spatial spillover effects in dual state count models. Accid. Anal. Prev. 93, 14–22. https://doi.org/10.1016/j.aap.2016.04.018 (2016).
Quddus, M. A., Noland, R. B. & Chin, H. C. An analysis of motorcycle injury and vehicle damage severity using ordered probit models. J. Saf. Res. 33, 445–462. https://doi.org/10.1016/S0022-4375(02)00051-8 (2002).
Duncan, C., Khattak, A. & Council, F. Applying the ordered probit model to injury severity in truck-passenger car rear-end collisions. Transp. Res. Rec.: J. Transp. Res. Board https://doi.org/10.3141/1635-09 (1998).
Kockelman, K. M. & Kweon, Y.-J. Driver injury severity: An application of ordered probit models. Accid. Anal. Prev. 34, 313–321. https://doi.org/10.1016/S0001-4575(01)00028-8 (2002).
Ye, F. & Lord, D. Comparing three commonly used crash severity models on sample size requirements: Multinomial logit, ordered probit and mixed logit models. Analyt. Methods Accid. Res. 1, 72–85. https://doi.org/10.1016/j.amar.2013.03.001 (2014).
Vickrey, W. Automobile accidents, tort law, externalities, and insurance: An economist’s critique. Law Contemp. Probl. 33, 464–487. https://doi.org/10.2307/1190938 (1968).
Lindberg, G. Traffic insurance and accident externality charges. J. Transp. Econ. Policy (JTEP) 35, 399–416 (2001).
Fridstrøm, L. & Ingebrigtsen, S. An aggregate accident model based on pooled, regional time-series data. Accid. Anal. Prev. 23, 363–378. https://doi.org/10.1016/0001-4575(91)90057-C (1991).
Parry, I. W., Walls, M. & Harrington, W. Automobile externalities and policies. J. Econ. Lit. 45, 373–399. https://doi.org/10.1257/jel.45.2.373 (2007).
Lemp, J. D. & Kockelman, K. M. Quantifying the external costs of vehicle use: Evidence from America’s top-selling light-duty models. Transp. Res. Part D: Transp. Environ. 13, 491–504. https://doi.org/10.1016/j.trd.2008.09.005 (2008).
TomTom International BV. Speed profiles (Tech. Rep., 2013).
Minnesota Department of Transportation. Traffic forecasting & analysis data collection methods, traffic volume program (2017).
Transportation Data and Analysis, MnDOT. Federal urban/rural GIS shapefile (2016).
Carson, J. L. The effect of ice warning signs on ice accident frequencies and severities: An investigation using advanced econometric modeling methods. Ph.D. thesis, University of Washington (1998). https://doi.org/10.1016/s0001-4575(00)00020-8.
Wang, H., Li, Z., Hurwitz, D. & Shi, J. Parametric modeling of the heteroscedastic traffic speed variance from loop detector data. J. Adv. Transp. 49, 279–296. https://doi.org/10.1002/atr.1258 (2015).
Kockelman, K. K. & Ma, J. Freeway speeds and speed variations preceding crashes, within and across lanes. J. Transp. Res. Forum https://doi.org/10.22004/ag.econ.206875 (2010).
Brown, B. & Baass, K. Seasonal variation in frequencies and rates of highway accidents as function of severity. Transp. Res. Rec.: J. Transp. Res. Board https://doi.org/10.3141/1581-08 (1997).
Brorsson, B., Ifver, J. & Rydgren, H. Injuries from single-vehicle crashes and snow depth. Accid. Anal. Prev. 20, 367–377. https://doi.org/10.1016/0001-4575(88)90019-X (1988).
Maze, T., Agarwal, M. & Burchett, G. Whether weather matters to traffic demand, traffic safety, and traffic operations and flow. Transp. Res. Rec.: J. Transp. Res. Board https://doi.org/10.1177/0361198106194800119 (2006).
Levinson, D. M. & Gillen, D. The full cost of intercity highway transportation. Transp. Res. Part D: Transp. Environ. 3, 207–223 (1998).
Parry, I. W. H. & Small, K. A. Does Britain or the United States have the right gasoline tax?. Am. Econ. Rev. 95, 1276–1289 (2005).
Insurance Institute for Highway Safety. Fatality facts 2022: Urban/rural comparison. https://www.iihs.org/topics/fatality-statistics (2024).
Minnesota Department of Transportation. Benefit-cost analysis for transportation projects (2015).
Cui, M. & Levinson, D. Shortest paths, travel costs, and traffic. Environ. Plan. B: Urban Analyt. City Sci. 48, 828–844. https://doi.org/10.1177/2399808319897619 (2021).

Acknowledgements

This study is supported by the National Natural Science Foundation of China [Grant number: 52402399].

Author information

Authors and Affiliations

Intelligent Construction and Environment College, Xi’an Jiaotong University City College, Xi’an, Shaanxi Province, China
Shian Dai
China Northwest Architectural Design and Research Institute Co., Ltd, Xi’an, Shaanxi Province, China
Liqiang Yu
Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Beijing Jiaotong University, Beijing, China
Zhaoran Liu
Institute of Comprehensive Transportation, National Development and Reform Commission, Beijing, China
Zhaoran Liu
School of Transportation Engineering, Chang’an University, Xi’an, China
Mengying Cui
School of Civil Engineering, The University of Sydney, Sydney, Australia
David Levinson

Authors

Shian Dai
Liqiang Yu
Zhaoran Liu
Mengying Cui
David Levinson

Contributions

Mengying Cui conceived and conducted the experiments and was responsible for data curation. Shian Dai and Liqiang Yu analyzed the results. Shian Dai, Liqiang Yu, and Zhaoran Liu prepared the original draft of the manuscript. Zhaoran Liu and Mengying Cui acquired funding. Mengying Cui and David Levinson reviewed and edited the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Mengying Cui.

Ethics declarations

Competing interests

The authors declare that there are no conflicts of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dai, S., Yu, L., Liu, Z. et al. The internal and external cost of motor vehicle crashes. Sci Rep 15, 5441 (2025). https://doi.org/10.1038/s41598-025-89058-1

Received: 24 November 2024
Accepted: 03 February 2025
Published: 14 February 2025
Version of record: 14 February 2025
DOI: https://doi.org/10.1038/s41598-025-89058-1