Table 1 Overview of the performance of our model, the pathologists, and previous models.

Model / Pathologist	TP	FP	FN	TN	Sensitivity	Specificity	MCC
Our model	263	12	32	849	0.8915(0.8503–0.9246)	0.9861(0.9758–0.9928)	0.8986(0.8686–0.9269)
Pathologist S.-C.H.	289	2	6	859	0.9797(0.9563–0.9925)	0.9977(0.9916–0.9997)	0.9818(0.9681–0.9932)
Pathologist 1	279	2	16	859	0.9458(0.9134–0.9687)	0.9977(0.9916–0.9997)	0.9589(0.9392–0.9767)
Pathologist 2	286	11	9	850	0.9695(0.9429–0.9860)	0.9872(0.9773–0.9936)	0.9546(0.9340–0.9731)
Pathologist 3	275	2	20	859	0.9322(0.8972–0.9581)	0.9977(0.9916–0.9997)	0.9497(0.9284–0.9690)
Pathologist 1with partial AI assistance	289	2	6	859	0.9797(0.9563–0.9925)	0.9977(0.9916–0.9997)	0.9818(0.9682–0.9932)
Pathologist 2with partial AI assistance	291	2	4	859	0.9864(0.9656–0.9963)	0.9977(0.9916–0.9997)	0.9863(0.9742–0.9956)
Pathologist 3with partial AI assistance	288	2	7	859	0.9763(0.9517–0.9904)	0.9977(0.9916–0.9997)	0.9795(0.9648–0.9912)
Hu et al.	159	11	21	1025	0.8833(0.8272–0.9263)	0.9894(0.9811–0.9947)	0.8937(0.8566–0.9283)
Wang et al.	5217	391	82	9544	0.9845 (0.9808–0.9877)	0.9606 (0.9566–0.9644)	0.9334(0.9275–0.9391)

The confusion matrices were calculated for our model (at a threshold of 0.4) and the pathologists, including the number of true-positive (TP), false-positive (FP), false-negative (FN), and true-negative (TN) LN images under the main test set (n = 1156). Three pathologists (J.L., H.-C.C., and T.-Y.H.) relabeled the 38 equivocal LN images with AI assistance (denoted as partial AI assistance). The data on model performance reported in the bottom two rows of the table were directly retrieved from the publications in question. Considering the between-study discrepancies in test slide distributions, the results may contain bias. MCC is an abbreviation for Matthews correlation coefficient. Supplementary Table 1 provides extended information, including additional metrics and model performance results on the micrometastasis and ITC test subsets.

Quick links

Search