Table 5 Summary of Evaluation Metrics for Vascular Segmentation: TP = True Positive, TN = True Negative, FP = False Positive, FN = False Negative, \(V_p\) = Predicted Volume, \(V_g\) = Ground Truth Volume, A, B = Sets of Points for Spatial Distance, \(T_{\text {prec}}\) = Topological Precision, \(T_{\text {sens}}\) = Topological Sensitivity, RI = Rand Index, x = Predicted Segmentation Value, y = Ground Truth Value, p(xy) = Joint Probability Distribution of x and y, p(x) = Marginal Probability Distribution of x, p(y) = Marginal Probability Distribution of y, TPR = True Positive Rate, FPR = False Positive Rate, d(FPR) = Differential of the False Positive Rate.

From: Deep learning for 3D vascular segmentation in hierarchical phase contrast tomography: a case study on kidney

Metric category

Metric

Formula

Strengths

Limitations

Overlap-based

Dice Similarity Coefficient (DSC)

\(DSC = \frac{2TP}{FP + FN + 2TP}\)123

Easy to implement - widely used (26/31 studies). Good for overlap assessment.

Sensitive to class imbalance and lacks connectivity assessment.

Sensitivity (Recall or TPR)

\(Sensitivity = \frac{TP}{TP + FN}\)124

Measures true positive rates.

Ignores structural/topological features.

Specificity (TNR)

\(Specificity = \frac{TN}{FP + TN}\)124

Measures true negative rates.

Ignores structural/topological features.

Precision

\(Precision = \frac{TP}{TP + FP}\)124

Accounts for false positives/negatives.

Rarely used in medical image segmentation.

F1-Score

\(F1 = 2 \cdot \frac{\text {Precision} \cdot \text {Recall}}{\text {Precision} + \text {Recall}}\)124

Balances precision and recall.

Rarely used in medical image segmentation.

Topology-preserving

Centerline Dice (clDice)

\(clDice = \frac{2T_{\text {prec}}T_{\text {sens}}}{T_{\text {prec}} + T_{\text {sens}}}\)125

Preserves connectivity of thin, tubular structures like vessels.

Computationally expensive - less commonly used (2/31 studies).

Volume-based

Relative Volume Difference (RVD)

\(RVD = \frac{|V_p - V_g|}{V_g}\)123

Quantifies the disparity in volume.

Sensitive to small structures

Spatial distance-based

95th Percentile Hausdorff Distance (95HD)

percentile(h(AB), 95)126

Captures boundary accuracy.

Sensitive to outliers or small discrepancies.

Average Symmetric Surface Distance (ASSD)

\(ASSD = \frac{1}{2} \left( \frac{1}{|A|} \sum _{a \in A} \min _{b \in B} d(a,b) + \frac{1}{|B|} \sum _{b \in B} \min _{a \in A} d(b,a) \right)\)127

Provides average boundary accuracy.

Less sensitive to topology, focuses on boundary alignment.

Probabilistic-based

Area Under the Curve (AUC)

\(AUC = \int _0^1 TPR(FPR) \, d(FPR)\)128

Common in binary classification models, reflects overall performance.

Rarely used (3/31 studies).

Pair-counting-based

Adjusted Rand Index (ARI)

\(ARI = \frac{\text {RI} - \text {Expected\_RI}}{\text {Max\_RI} - \text {Expected\_RI}}\)128

Adjusts for chance in clustering assessments.

Not widely used in vascular segmentation.

Information-theoretic

Mutual Information (MI)

\(MI(X,Y) = \sum _{x \in X} \sum _{y \in Y} p(x,y) \log \left( \frac{p(x,y)}{p(x)p(y)} \right)\)128

Captures shared info between segmented and true data.

Complex to compute and rarely used in vascular segmentation.