Fig. 4
From: Large language models predict human sensory judgments across six modalities

Color naming experiment using 330 Munsell colors from the World Color Survey (top, color space). (a) Adjusted Rand index quantifying the alignment between human and LLM experiments (95% CIs). The dashed lines for English correspond to the lab-based free-naming and forced-choice naming experiments of Lindsey and Brown (ref. 35; data reproduced with permission). (b) Comparison of human and LLM data in Russian and English. Participants and LLMs were shown colors and asked to choose from the same 15-color list. The number of colors for which each option was chosen is given in parentheses. The color of a response cluster in the maps represents its average color (see Supplementary Fig. S3 for all maps). Colors for which the dominant color term was selected less than 50% of the time are marked with “−”; colors for which it was selected less than 90% of the time (but not less than 50%) are marked with “*”. Colors for which the dominant term was selected more than 90% of the time are unmarked.
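
The two quantities in this caption, the adjusted Rand index in (a) and the agreement markers in (b), can be illustrated with a minimal Python sketch. This is not the authors' analysis code: the function names `adjusted_rand_index` and `dominance_marker` are hypothetical, the index is the standard pair-counting form, and the handling of a share of exactly 90% is an assumption because the caption does not specify it.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Standard pair-counting adjusted Rand index between two labelings.

    Each list holds one label per color (e.g. the dominant color term),
    so term names may differ between the lists (English vs. Russian).
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same colors")
    n = len(labels_a)
    if n < 2:
        return 1.0  # no pairs to compare; treated as perfect agreement

    # Pairs of colors grouped together in both, in A only, and in B only.
    together_both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    together_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    together_b = sum(comb(c, 2) for c in Counter(labels_b).values())

    expected = together_a * together_b / comb(n, 2)  # chance-level agreement
    maximum = (together_a + together_b) / 2
    if maximum == expected:
        return 1.0  # degenerate case: both labelings are trivial
    return (together_both - expected) / (maximum - expected)


def dominance_marker(responses):
    """Return (dominant term, marker) for the responses to one color.

    ASSUMPTION: a share of exactly 90% is left unmarked; the caption
    only specifies "less than 90%" and "more than 90%".
    """
    term, count = Counter(responses).most_common(1)[0]
    share = count / len(responses)
    if share < 0.5:
        return term, "-"  # ASCII stand-in for the caption's minus sign
    if share < 0.9:
        return term, "*"
    return term, ""


# Toy data (invented for illustration, not from the paper).
human = ["red", "red", "blue", "blue"]
llm = ["красный", "красный", "синий", "синий"]
print(adjusted_rand_index(human, llm))
print(dominance_marker(["blue"] * 8 + ["green"] * 2))
```

Because the index depends only on which colors are grouped together, it can compare partitions whose labels come from different languages or term lists.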