Identifying a complex panel of bias dimensions to be evaluated, a framework is proposed to assess how prone large language models are to biased reasoning, with possible consequences on equity-related harms, and is applied to a large-scale and diverse user survey on Med-PaLM 2.
- Stephen R. Pfohl
- Heather Cole-Lewis
- Karan Singhal