Table 10 Comparison of human and o1-preview performance on LogiQA (accuracy %).
From: Comparative evaluation of OpenAI O1 and human performance in higher order cognition
Model | Sample size | Accuracy (%) |
|---|---|---|
Human | 651 | 86.00 ± 6.50 |
o1-Preview | 10 | 90.00 ± 10.00 |