Extended Data Fig. 8: Error rates across hairstyle pairs for face verification models. | Nature

Extended Data Fig. 8: Error rates across hairstyle pairs for face verification models.

From: Fair human-centric image dataset for ethical AI benchmarking

Extended Data Fig. 8: Error rates across hairstyle pairs for face verification models.

This figure shows the percentage of incorrect predictions for face verification using (a) ArcFace60, (b) CurricularFace61, and (c) FaceNet62 models. For He/Him/His pronouns, errors are concentrated in cases with non-stereotypical hairstyles, whereas for She/Her/Hers pronouns, errors remain high whenever hairstyle variation within the pair is large. The number on top of each bar in black denotes the ratio of incorrect samples within that subgroup, while the number in red denotes the percentage of individuals with that pronoun who exhibit the corresponding hairstyle combination. This pattern highlights that hairstyle diversity disproportionately impacts error rates for She/Her/Hers pronouns. Error rates are conditioned on hairstyle changes and pronoun groups, underscoring variability in model performance.

Back to article page