Table 2 Comparison of model performance across the three datasets and the three statistical approaches. HJ23 had the highest model performance but the smallest cohort, whereas the opposite was true for eICU-CRD. For MIMIC and eICU-CRD, model performance was similar across all three methods. AUROC, area under the receiver operating characteristic curve. 95% confidence intervals are given in parentheses.

Dataset	Cohort size	Method	Accuracy (%)	AUROC (%)
eICU-CRD	3394	Logistic regression	78.65 (75.56–81.73)	87.36 (84.86–89.86)
		Random forest	77.29 (74.13–80.44)	84.36 (81.65–87.11)
		Partial least squares	77.32 (74.17–80.47)	83.70 (80.93–86.48)
MIMIC	1295	Logistic regression	80.31 (75.47–85.15)	87.41 (83.37–91.45)
		Random forest	80.23 (75.38–85.08)	87.11 (83.03–91.19)
		Partial least squares	79.54 (74.62–84.45)	87.06 (82.97–91.15)
HJ23	172	Logistic regression	97.14 (91.54–100)	98.69 (94.87–100)
		Random forest	80.00 (66.55–93.45)	89.87 (79.73–100)
		Partial least squares	88.57 (77.88–99.27)	97.67 (92.59–100)
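
The table reports each accuracy with a 95% confidence interval, and several upper bounds stop at 100. The source does not state how the intervals were computed, so the following is only a minimal sketch of one common approach: a normal-approximation interval on held-out test predictions, clipped to the valid range. The function name `accuracy_ci`, the use of the sample standard deviation, and the test-set size of 35 in the usage line are all assumptions made for illustration, not details taken from the study.

```python
import math


def accuracy_ci(correct, total, z=1.96):
    """Return (accuracy, lower, upper) in percent.

    Assumed method (not confirmed by the source): normal-approximation
    interval using the sample standard deviation of the 0/1 correctness
    indicators, with the bounds clipped to [0, 100].
    """
    if total < 2 or not 0 <= correct <= total:
        raise ValueError("need total >= 2 and 0 <= correct <= total")
    p = correct / total
    # Sample variance of a 0/1 indicator with an (n - 1) denominator.
    variance = p * (1 - p) * total / (total - 1)
    half_width = z * math.sqrt(variance / total)
    lower = max(0.0, p - half_width)
    upper = min(1.0, p + half_width)
    return 100 * p, 100 * lower, 100 * upper


# Hypothetical test set: 28 of 35 predictions correct.
acc, lo, hi = accuracy_ci(28, 35)
print(f"{acc:.2f} ({lo:.2f}-{hi:.2f})")  # prints "80.00 (66.55-93.45)"
```

With so few test cases the interval is wide, and for accuracies close to 100% the unclipped upper bound exceeds 100, which is why clipping is needed. Intervals built for small samples, such as the Wilson score interval, would behave differently near the boundary.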
