Fig. 4

Ablation study of the effect of R-GCN layer numbers on F1 score (%) for relation extraction on CDR and GDA datasets. For each dataset, three relation types (overall, intra-sentence, and inter-sentence) are shown across five settings (L1–L5). CDR bars are solid; GDA bars are hatched. The results show that deeper R-GCNs (up to 3 or 4 layers) generally improve performance, with consistent trends observed for both datasets and all relation categories.