Fig. 4: Correlation heatmaps of inequity categories in CTM and MQA tasks.
From: Mitigating the risk of health inequity exacerbated by large language models

The left panel shows the correlations between inequity categories in the clinical trial matching (CTM) tasks, indicating how strongly queries modified with different inequity attributes led the models to similar trial rankings or selections. The right panel shows the corresponding correlations for the medical question answering (MQA) tasks, indicating how often different inequity-modified queries led to the same answers or error patterns. Together, the heatmaps show which inequity categories tend to affect model behavior in similar ways, and therefore how inequities across categories are interconnected in their effect on model fairness.
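As a rough illustration of how a category-by-category correlation matrix of this kind could be computed, the sketch below applies Pearson correlation to per-query outcomes. It is not the authors' pipeline: the category names, the 0/1 outcome encoding (1 = model answered correctly), and the toy values are all assumptions made for the example, and the figure itself may be based on a different statistic.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences.

    Assumes both sequences have non-zero variance; a constant
    sequence would cause a division by zero.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def correlation_matrix(outcomes_by_category):
    """Return a nested dict: matrix[a][b] = correlation of categories a and b."""
    names = list(outcomes_by_category)
    return {
        a: {b: pearson(outcomes_by_category[a], outcomes_by_category[b]) for b in names}
        for a in names
    }


# Hypothetical toy data: one entry per base query, recording whether the model
# was correct (1) or not (0) when the query was modified for that category.
outcomes_by_category = {
    "race": [1, 0, 1, 0, 1, 0],
    "sex": [1, 0, 1, 0, 0, 1],
    "low_income": [0, 1, 1, 0, 1, 0],
}

matrix = correlation_matrix(outcomes_by_category)

# Print the matrix row by row; a heatmap is a colored rendering of these values.
for row_name, row in matrix.items():
    print(row_name, [round(value, 2) for value in row.values()])
```

A positive entry means the two categories tend to succeed or fail on the same queries, and a negative entry means they tend to diverge. For ranking-based outputs such as CTM, a rank correlation would be a natural substitute for the 0/1 encoding assumed here.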