Fig. 4: Mapping and reduction to common identifier space of gene names.

a Distribution of the number of gene names mapped from protein groups per dataset. Ten protein IDs shown in the table as source IDs (first column) do not have an official gene name in UniProt and are, therefore, filtered out. b Reduction of the resulting mapped gene names, upon removal of the 10 protein IDs from (a), based on mappability in Ensembl ID space.