Table 1 Comparison of Phy-Q with related physics benchmarks and competitions
From: Phy-Q as a measure for physical reasoning intelligence
Test | Generalization | Categorization | Procedurally | Destructible | Observe outcome | Human |
|---|---|---|---|---|---|---|
environment | to individual | of tasks to | generated | objects | of a desired | player |
physical scenario/s | physical scenarios | tasks/variations | physical action | data | ||
PHYRE12 | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
Virtual Tools14 | ✗ | ✓ | ✗ | ✗ | ✓ | ✓ |
OGRE13 | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
IntPhys 2019 24 | ✓ | ✓ | ✓ | ✗ | ✗ | ✓ |
CLEVERER25 | ✓ | ✗ | ✓ | ✗ | ✗ | ✗ |
CATER31 | ✓ | ✓ | ✓ | ✗ | ✗ | ✗ |
Physion27 | ✓ | ✓ | ✓ | ✗ | ✗ | ✓ |
COPHY26 | ✓ | ✓ | ✓ | ✗ | ✗ | ✓ |
CausalWorld15 | ✓ | ✓ | ✓ | ✗ | ✓ | ✗ |
RLBench32 | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
Computational Pool33 | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ |
Geometry Friends34 | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
AIBIRDS35 | ✗ | ✗ | ✓ | ✓ | ✓ | ✓ |
Phy-Q (this study) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |