Table 6 Semantic Structural Congruence (SSC) across datasets for different document layout models with input embeddings: Image and Layout Only.
From: Representation learning approach for understanding structured documents
Dataset / Model | LayoutLMv36 | DocLayout-YOLO30 | DocSAM28 | LayoutLLM36 | DocLayLLM37 | D-REEL |
|---|---|---|---|---|---|---|
PRImA Newspaper Dataset34 | 59.34 | 61.27 | 60.18 | 66.41 | 67.58 | 75.42 |
German-Brazilian Newspapers (GBN)35 | 61.05 | 62.13 | 61.59 | 66.52 | 67.04 | 74.38 |
S2-VL Dataset10 | 85.53 | 87.42 | 88.16 | 89.03 | 89.57 | 91.49 |
IIIT AR 13K33 | 86.51 | 87.36 | 88.28 | 89.24 | 89.54 | 90.37 |
Publaynet32 | 87.11 | 88.44 | 88.53 | 89.57 | 90.19 | 90.58 |