Fig. 3: LLM bias evaluation. | npj Digital Medicine

From: Large language models to identify social determinants of health in electronic health records

The proportion of synthetic sentence pairs, one sentence with demographics injected and one without, that led to a classification mismatch, meaning that the model predicted a different SDoH label for each sentence in the pair. Results are shown across race/ethnicity and gender for (a) the any SDoH mention task and (b) the adverse SDoH mention task. Asterisks indicate statistical significance (P ≤ 0.05), assessed with chi-squared tests for multi-class comparisons and 2-proportion z tests for binary comparisons. LLM, large language model; SDoH, social determinants of health.
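
The mismatch proportion and the binary comparison described in the caption can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the function names `mismatch_rate` and `two_proportion_z_test`, the pooled-variance form of the z test, and the example labels are all assumptions made for this sketch.

```python
import math


def mismatch_rate(pairs):
    """Count pairs whose two predicted labels differ.

    `pairs` is a list of (label_without_demographics, label_with_demographics)
    tuples. The layout is hypothetical and chosen only for this sketch.
    Returns (number_of_mismatches, number_of_pairs).
    """
    mismatches = sum(1 for original, injected in pairs if original != injected)
    return mismatches, len(pairs)


def two_proportion_z_test(x1, n1, x2, n2):
    """Two-sided, pooled two-proportion z test.

    x1/n1 and x2/n2 are the mismatch counts and pair totals of two groups.
    Returns (z, p_value). Assumes n1 > 0 and n2 > 0.
    """
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        # Both groups have identical degenerate rates (all 0 or all 1).
        return 0.0, 1.0
    z = (x1 / n1 - x2 / n2) / se
    # Two-sided p-value from the standard normal: 2 * (1 - Phi(|z|)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    # Made-up predictions for two hypothetical demographic groups.
    group_a = [("housing", "housing"), ("housing", "none"), ("none", "none")]
    group_b = [("employment", "employment"), ("none", "none"), ("none", "none")]
    xa, na = mismatch_rate(group_a)
    xb, nb = mismatch_rate(group_b)
    z, p = two_proportion_z_test(xa, na, xb, nb)
    print(f"mismatch rates: {xa / na:.2f} vs {xb / nb:.2f}, z={z:.2f}, p={p:.3f}")
```

For comparisons across more than two groups, the caption indicates a chi-squared test instead; a contingency table of mismatch versus match counts per group would be the natural input for that.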