Fig. 3

Functional classification of pan-gene categories. Gene Ontology (GO) biological process categories enriched in the core a and shell b pan-gene subsets, showing the distribution of all genes in that category across the pan-genome. Note that the “neg. (negative) translation” category includes defensive peptides that inhibit translation. c Percent of respective GO biological process categories comprised of non-reference genes. Total number of genes in each category is listed after the category label on the y axis. d The ratio of non-synonymous to synonymous mutations indicates that shell genes are evolving faster than core genes within B. distachyon (p < 2.2e−16, t-test). e Core genes are expressed at higher levels than shell genes (p < 2.2e-16, Wilcoxon signed rank). f Core genes are more broadly expressed within multiple tissues than shell genes and g are more likely to be identified as conserved in rice or sorghum. Whiskers in the above plots extend to the most extreme data point which is no more than 1.5 times the IQR