Table 8 Performance of o1-preview and human participants on Chen et al.’s data literacy assessment.

From: Comparative evaluation of OpenAI O1 and human performance in higher order cognition

Data literacy

Human (mean ± SD)

o1-Preview (mean ± SD)

o1-Preview Z-score

Data management (3 items)

0.17 ± 0.44

2.00 ± 0.30

4.16

Data visualization (6 items)

3.56 ± 1.46

6.00 ± 0.00

1.67

Basic data analysis (9 items)

5.38 ± 2.22

9.00 ± 0.00

1.63

  1. Human sample size: \(N = 555\) (from the original study). AI results reflect the mean across 10 stateless trials. Scores represent (raw / normalized / percentage) values as indicated