Table 1 Descriptive statistics of reported results over time for specific benchmarks and AI tasks

From: Mapping global dynamics of benchmark creation and saturation in artificial intelligence

 

NLP

Computer vision

Total

Benchmarks with ≥1 reported result

1318

2447

3765

Benchmarks with ≥3 results at different time points (% of above)

661 (50%)

1274 (52%)

1935 (51%)

AI tasks with ≥1 reported result

346

601

947

AI tasks with ≥3 results at different time points (% of above)

197 (57%)

386 (64%)

583 (62%)

  1. A single task can be represented through several benchmarks.
  2. NLP natural language processing.