Fig. 4

Distribution of the best linear unbiased estimations (BLUEs) across experiments for outlier-corrected yellow rust (YR, Puccinia striiformis f. sp. tritici) infections of plant genetic resources (PGR or SSD-PGR) and elite cultivars (Elite) tested in precision (boxplot, upper left corner), large-scale screening (boxplot, lower right) or both types of field experiments (scatter plot, upper right). YR infections were scored using an ordinal rating scale between 1 and 9, where 1 means complete absence of YR leaf symptoms and 9 denotes fully infected leaves. BLUEs that lie outside of the 1–9 parametric space are due to the unorthogonal structure of unbalanced experiments. In total, 19 field experiments were conducted between harvest years 2015 and 2020 considering five German locations. Large-scale screenings fully relied on natural YR infections, while five out of seven precision experiments were artificially inoculated. The exact numbers of genotypes according to each category are included within brackets []. In boxplots, boxes enclose 50% of the central data, including median (black bold line) and mean (black diamond), while whiskers are ± 1.5 × interquartile range and dots represent extreme values. In the scatter plot, ** denotes the significance [-log10(p-value) = 128.4] of the correlation between YR scores from precision and large-scale screening experiments.