Table 7 Race composition across datasets (count and percentage)

From: Mitigating the risk of health inequity exacerbated by large language models

Dataset

Not mentioned

White

African-Am.

Hispanic

Total

 

Count (%)

Count (%)

Count (%)

Count (%)

 

MedQA

1000 (72.1%)

185 (13.3%)

90 (6.5%)

50 (3.6%)

1387

MedMCQA

6065 (98.6%)

8 (0.1%)

1 (0.02%)

75 (1.2%)

6149

SIGIR 2016

40 (69.0%)

10 (17.2%)

4 (6.9%)

2 (3.4%)

58

TREC 2021

45 (60.0%)

15 (20.0%)

8 (10.7%)

4 (5.3%)

75

TREC 2022

35 (70.0%)

8 (16.0%)

3 (6.0%)

2 (4.0%)

50