Fig. 2

Receiver Operating Characteristic (ROC) curve for the random forest classifier trained on de novo identified plasmid presence-absence data. The model was evaluated using fivefold cross-validation. The solid blue line represents the mean ROC curve across all folds, and the shaded area indicates ± 1 standard deviation. The individual ROC curves for each fold are shown in lighter colors, with their corresponding AUC values indicated.