Table 2 Main results on the primary TCGA-LIHC test set

	Segmentation		Characterization
Model	DSC (%) ↑	HD95 (mm) ↓	AUC ↑	Acc. (%) ↑	F1 ↑
CNN-based Methods
3D U-Net	78.4	11.23	0.812	75.7	0.751
V-Net	79.1	10.55	0.819	76.1	0.758
nnU-Net	83.5	7.14	0.865	80.0	0.795
Transformer-based Methods
UNETR	82.6	7.98	0.851	78.6	0.780
Swin UNETR	84.0	6.81	0.870	81.4	0.811
MedNeXt	84.7	6.45	0.882	82.9	0.825
SegMamba	84.3	6.60	0.876	82.1	0.818
Foundation Model-based Methods
Medical SAM	80.5	9.87	–	–	–
SAM-Med3D	81.8	8.41	–	–	–
Advanced Fusion & Temporal Methods
Cross-Attn Fusion	83.8	7.02	0.871	81.4	0.809
I3D	80.2	10.11	0.840	78.6	0.782
Timesformer	81.5	9.15	0.859	80.0	0.798
ST-Adapter	82.1	8.80	0.863	80.7	0.801
LoGoFormer	82.5	8.54	0.868	81.4	0.810
CF-Net (2024)	84.9	6.72	0.878	82.8	0.822
MVFusion (2024)	85.0	6.51	0.881	83.1	0.826
CoCa-DR	85.1	6.33	0.886	83.6	0.832
Ours
STD-Net	87.2	4.12	0.924	87.1	0.868

This table compares the multi-task performance for both segmentation and characterization. All models were trained and evaluated using the same three-phase (arterial, portal, delayed) inputs and identical preprocessing for fairness. For architectures not natively supporting multiphasic input (e.g., nnU-Net, MedNeXt), early channel concatenation was applied following standard practice. Best results are in bold, second-best are underlined.

Quick links

Search