Table 8 Summary of evaluation metrics used for classification, uncertainty quantification, and cross-domain adaptation

From: Uncertainty-aware and causal test-time adaptive foundation model for robust colorectal cancer pathology diagnosis

Category

Metrics

Significance

Classification

AUROC, Accuracy, F1-score

Discriminative power, overall correctness, class balance

Uncertainty quantification

ECE, Brier Score, NLL

Calibration, probabilistic reliability, confidence quality

Cross-domain adaptation

ΔAcc, ΔAUROC

Robustness gains under distributional shift