Fig. 2: The mAP framework evaluation on simulated data.
From: A versatile information retrieval framework for evaluating profile strength and similarity

Benchmarking retrieval performance of mAP p-value (orange), mp-value (blue), MMD p-value (green), and k-means clustering (purple) for retrieving phenotypic activity on simulated data, where unperturbed and perturbed features are sampled from \({{{\mathscr{N}}}}\) (0,1) and \({{{\mathscr{N}}}}\) (1,1), correspondingly. Recall indicates the percentage of 100 simulated perturbations under each condition that were called accurately by each method (as distinguishable from negative controls, or not). The horizontal axis probes what proportion of the features in the profile were different from controls (note the binary exponential scaling). Marker and line styles indicate different numbers of replicates per perturbation (# replicates of 2, 3, and 4). Columns correspond to the different number of controls (# controls of 12, 24, and 36). Rows correspond to different profile sizes (# features being 100, 200, 500, 1000, 2500, and 5000). mAP, mp-value, and MMD used a one-sided permutation test to obtain p-values without adjusting for multiple comparisons; no statistical test was performed for k-means. Source data are provided as a Source Data file.