Table 4 Instability prediction performance of APRICOT-M compared with the acuity baseline (SOFA) at the episode and step levels

Cohort	Model	Prevalence (episode)	AUROC (episode)	AUPRC (episode)	Prevalence (step)	AUROC (step)	AUPRC (step)
Development	SOFA	0.116	0.51 (0.50–0.52)	0.13 (0.13–0.14)	0.012	0.60 (0.60–0.61)	0.02 (0.02–0.02)
	SOFA (≥2 points)		0.49 (0.48–0.50)	0.28 (0.27–0.29)		0.54 (0.54–0.55)	0.17 (0.16–0.17)
	APRICOT-M		0.73 (0.72–0.74)**	0.32 (0.31–0.34)**		0.74 (0.73–0.75)**	0.10 (0.09–0.11)**
External	SOFA	0.131	0.46 (0.46–0.47)	0.13 (0.13–0.13)	0.013	0.55 (0.55–0.56)	0.02 (0.02–0.02)
	SOFA (≥2 points)		0.46 (0.46–0.46)	0.23 (0.22–0.23)		0.51 (0.51–0.51)	0.13 (0.12–0.13)
	APRICOT-M		0.74 (0.74–0.75)**	0.47 (0.47–0.48)**		0.75 (0.74–0.75)**	0.24 (0.23–0.25)**
Prospective	SOFA	0.242	0.61 (0.54–0.68)	0.39 (0.29–0.49)	0.011	0.68 (0.64–0.71)	0.02 (0.02–0.03)
	SOFA (≥2 points)		0.52 (0.46–0.58)	0.40 (0.33–0.48)		0.53 (0.50–0.57)	0.12 (0.09–0.16)
	APRICOT-M		0.61 (0.53–0.68)	0.41 (0.27–0.48)		0.69 (0.64–0.74)*	0.07 (0.04–0.10)**

AUPRC area under the precision-recall curve, AUROC area under the receiver operating characteristic curve, SOFA sequential organ failure assessment. Performance is the median AUROC and AUPRC across a 100-iteration bootstrap with replacement, with 95% confidence intervals in parentheses. P values are based on pairwise two-sided Wilcoxon rank sum tests.
Bold values represent performance metrics that are significantly higher based on statistical analysis.
*p value < 0.05 compared to one of SOFA or SOFA (≥2 points).
**p value < 0.05 compared to SOFA and SOFA (≥2 points).
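
The bootstrap summary described in the footnote can be illustrated with a minimal sketch. It is not the authors' code: the function names (`auroc`, `percentile`, `bootstrap_summary`), the seed, and the use of a percentile interval for the 95% confidence bounds are assumptions made for illustration, since the footnote does not state how the interval was derived. AUROC is computed from the pairwise (Mann-Whitney) definition so the example needs only the standard library.

```python
import random
import statistics


def auroc(labels, scores):
    """AUROC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def percentile(sorted_values, q):
    """Linearly interpolated quantile q (0..1) of an already sorted list."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] * (1 - fraction) + sorted_values[upper] * fraction


def bootstrap_summary(labels, scores, metric=auroc, n_iter=100, seed=0):
    """Median and assumed percentile 95% interval of a metric over bootstrap resamples."""
    rng = random.Random(seed)
    n = len(labels)
    values = []
    while len(values) < n_iter:
        # Sample n indices with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        resampled_labels = [labels[i] for i in idx]
        if len(set(resampled_labels)) < 2:
            continue  # skip resamples that contain a single class
        values.append(metric(resampled_labels, [scores[i] for i in idx]))
    values.sort()
    return statistics.median(values), percentile(values, 0.025), percentile(values, 0.975)


if __name__ == "__main__":
    # Toy data only; these are not values from the study.
    toy_labels = [0, 0, 1, 0, 1, 1, 0, 1, 0, 1]
    toy_scores = [0.1, 0.3, 0.35, 0.4, 0.6, 0.7, 0.2, 0.9, 0.5, 0.45]
    median, low, high = bootstrap_summary(toy_labels, toy_scores)
    print(f"AUROC {median:.2f} ({low:.2f}-{high:.2f})")
```

Under the same assumptions, the per-iteration values for two models could then be compared with a two-sided rank sum test (for example `scipy.stats.ranksums`), which is the kind of test the footnote names.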
