Fig. 2: Probability distributions of 17 WEPs elicited from humans and two LLMs under different gender-specific (male and female) contexts.
From: An evaluation of estimative uncertainty in large language models

Graphs on the left and right cover different probability ranges on the x-axis. Outliers are omitted from the plots, and - indicates zero variability in responses.