Fig. 2: Comparisons of reliability.

Observed HMT reliability was higher than human reliability (A), but not higher than machine reliability (B) or the reliability of the better of humans and machines (C), and was lower than ideal HMT reliability (D). Condition-level data are used. Weights are the number of clinicians multiplied by the number of diagnostic cases.