Table 1 Inter-annotator agreement across skin tone scales
From: Validity of two subjective skin tone scales and its implications on healthcare model fairness
Analysis | Annotator pair | Fitzpatrick | Monk |
|---|---|---|---|
Primary inter-annotator agreement measure | |||
Intraclass Correlation coefficient (ICC[2,k]) | All annotators | 0.66 (95% CI [0.02–0.87]) | 0.64 (95% CI [0.02–0.85]) |
Secondary inter-annotator agreement measures | |||
Weighted Cohen’s Kappa | 1 vs. 2 | 0.63 | 0.64 |
1 vs. 3 | 0.39 | 0.36 | |
2 vs. 3 | 0.29 | 0.30 | |
Kendall’s W | All Annotators | 0.90 | 0.85 |
Krippendorff’s Alpha | All Annotators | 0.41 | 0.41 |