Fig. 2: The deep-learning system’s AUCs.
From: Assessment of a deep-learning system for fracture detection in musculoskeletal radiographs

Error bars represent 95% confidence intervals calculated using bootstrap sampling (m = 1000). n indicates the number of radiographs tested.