Fig. 3: Comparison of receiver operating characteristic curve and precision-recall curve.

Comparison of a receiver operating characteristic curves and b precision-recall curves for the baseline ESI model, triage data-only model, video-only model, and late fusion video and triage data model. We observe that each uni-modal model is able to achieve performance over the baseline (p < 0.01), with the late fusion multi-modal model achieving higher performance than the uni-modal ones (p < 0.01). The square bracket indicates 95% confidence intervals.