In scenarios involving 1,000 acute-pain clinical vignettes, varied across 34 socio-demographic features, a panel of ten large language models is found to provide inconsistent recommendations, which also present disparities with respect to marginalized populations.
- Mahmud Omar
- Shelly Soffer
- Eyal Klang