Table 3. Progressive filtering of the corpus, showing the number of abstracts retained at each classification step.

From: Large-scale transformer-based topic graphs identify thematic links between engineering and biology

| Pass | Threshold | Abstracts retained |
|---|---|---|
| Interdisciplinarity | \(p_{\textrm{ID}}\ge 0.50\) | 21,844,908 |
| Engineering | \(p_{\textrm{ENG}}\ge 0.47\) | 6,237,432 |
| Biology | \(p_{\textrm{BIO}}\ge 0.37\) | 63,006 |
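
The filtering summarized in Table 3 can be read as a chain of threshold tests, each applied to the abstracts that survived the previous pass. The following is a minimal sketch of that reading, not the authors' pipeline. It assumes that every abstract already carries three classifier scores and that the passes run in the order listed. The dictionary keys `p_ID`, `p_ENG`, and `p_BIO`, the function name `progressive_filter`, and the toy records are hypothetical; only the three threshold values come from the table.

```python
from typing import Dict, Iterable, List, Tuple

# Thresholds copied from Table 3, in the order the passes are listed.
# The score keys ("p_ID", "p_ENG", "p_BIO") are hypothetical field names.
PASSES: List[Tuple[str, str, float]] = [
    ("Interdisciplinarity", "p_ID", 0.50),
    ("Engineering", "p_ENG", 0.47),
    ("Biology", "p_BIO", 0.37),
]


def progressive_filter(
    abstracts: Iterable[Dict[str, float]],
) -> List[Tuple[str, int]]:
    """Apply each pass to the survivors of the previous one.

    Returns (pass name, number of abstracts retained) for every pass.
    Assumes sequential filtering; the source table does not spell this out.
    """
    survivors = list(abstracts)
    counts: List[Tuple[str, int]] = []
    for name, key, threshold in PASSES:
        # Keep an abstract only if its score meets the threshold (>=).
        survivors = [a for a in survivors if a[key] >= threshold]
        counts.append((name, len(survivors)))
    return counts


# Toy records with made-up scores, for illustration only.
toy_abstracts = [
    {"p_ID": 0.90, "p_ENG": 0.80, "p_BIO": 0.40},  # passes all three
    {"p_ID": 0.60, "p_ENG": 0.50, "p_BIO": 0.10},  # dropped at Biology
    {"p_ID": 0.55, "p_ENG": 0.20, "p_BIO": 0.90},  # dropped at Engineering
    {"p_ID": 0.30, "p_ENG": 0.90, "p_BIO": 0.90},  # dropped at Interdisciplinarity
]

for pass_name, retained in progressive_filter(toy_abstracts):
    print(f"{pass_name}: {retained}")
```

Because each pass only sees the survivors of the one before it, the retained counts can never increase from one row to the next, which matches the monotone decrease in the table.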