Extended Data Fig. 8: Feature distributions.

Each plot shows the mean value of a given predictor across the gold-standard datasets (y-axis), separately for gold-standard positive genes (GSP, green) and gold-standard negative genes (GSN, yellow). GSP genes are more easily distinguished from GSN genes by distance in the manually curated datasets (especially Progem, Fauman_twitter, and T2D).