Figure 7
From: Methods and open-source toolkit for analyzing and visualizing challenge results

Significance maps for visualizing ranking stability based on statistical significance. They depict incidence matrices of pairwise significant test results e.g. for the one-sided Wilcoxon signed rank test at 5% significance level with adjustment for multiple testing according to Holm. Yellow shading indicates that metric values of the algorithm on the x-axis are significantly superior to those from the algorithm on the y-axis, blue color indicates no significant superiority.