Table 3 Correlations of chatbots and expert ratings.
From: Large language models can outperform humans in social situational judgments
Expert ratings | Copilot | ChatGPT | Claude | Gemini | Gemini | |
---|---|---|---|---|---|---|
Copilot | 0.80 | 1 | 0.98 | 0.92 | 0.90 | 0.91 |
ChatGPT | 0.79 | 1 | 0.92 | 0.91 | 0.89 | |
Claude | 0.87 | 1 | 0.92 | 0.84 | ||
Gemini | 0.78 | 1 | 0.81 | |||
you.com | 0.82 | 1 |