Fig. 2: Comparative performance of pum6a and baseline models across diverse datasets.

a, b ROC AUC scores of pum6a and baseline models on 20 different datasets. a Bag-wise ROC AUC; b Instance-wise ROC AUC, showing pum6a’s consistent performance across various biological data. c, d Average rank of each method across all experiments and different label frequencies. c Bag level; d Instance level ranking, with pum6a outperforming other methods in predictive accuracy and model robustness.