Fig. 4: Identification of selection signals during G. barbadense improvement.

a Diversity and FST plots detecting putative regions of selection in G. barbadense, with purple lines indicating the 5% threshold. The approximate position of functional genes known to be associated with fiber development is indicated by their respective gene names. b Scatter plots showing Pan-SV frequencies in landrace and cultivar (adj p-value computed using two-sided Fisher’s exact test). c Frequency pattern of improvement-related Pan-SVs. Lines in red and green indicate impSVs during improvement. d Venn plot of improvement genes that were identified by the Pan-SVs and SNPs. e Pan-SV frequency captures some genes that were not identified by SNPs. The distribution of FST values (purple curve) and πlandrace/πcultivar (yellow curve), the dotted line (purple and yellow) represents the top 5% value of FST and πlandrace/πcultivar, respectively (Upper). The red blocks represent the SVs that were potentially selected during in G. barbadense improvement (Middle). SV frequency for each group is shown in the bar charts (Lower).