Fig. 3: PhyloTune’s performance in identifying the smallest taxonomic unit.

a PhyloTune’s performance in identifying the smallest taxonomic unit on simulated datasets with varying training sequences. Top: Taxonomic classification metrics for known taxa. Bottom: Novelty detection metrics for unknown taxa. Line charts show the mean ± 95% confidence interval (CI, computed from the SEM; n = 30 independent experiments). b Comparison of taxonomic classification between PhyloTune and MMseqs2 (using the training data as the reference). Line charts show the mean ± 95% CI (n = 10 independent experiments). c Comparative analysis of novelty detection scores for PhyloTune and the baseline, using in-distribution (ID) and out-of-distribution (OOD) test sequences from the Plant dataset (n = 15,000 sequences). Source data are provided as a Source Data file.
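
As a rough illustration of how a mean ± 95% CI band can be derived from the SEM across independent experiments, the sketch below may help. The caption does not state which multiplier was applied to the SEM, so the normal-approximation value z = 1.96 is an assumption here, and the function name `mean_ci95` is hypothetical rather than taken from the PhyloTune code.

```python
import math
import statistics


def mean_ci95(values, z=1.96):
    """Return (mean, lower, upper) for a 95% CI built from the SEM.

    Assumption: normal-approximation multiplier z = 1.96. The caption
    does not say which multiplier the authors used.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two independent experiments")
    mean = statistics.fmean(values)
    # SEM = sample standard deviation / sqrt(n)
    sem = statistics.stdev(values) / math.sqrt(n)
    half_width = z * sem
    return mean, mean - half_width, mean + half_width


# Hypothetical metric values from three repeated experiments.
mean, lower, upper = mean_ci95([0.90, 0.92, 0.94])
print(f"{mean:.3f} [{lower:.3f}, {upper:.3f}]")
```

With small numbers of runs, a Student's t multiplier is a common alternative and gives a slightly wider band (about 2.045 for n = 30 and 2.262 for n = 10); it can be passed through the `z` parameter.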