Fig. 6 | Nature Communications

Fig. 6

From: Capturing variation impact on molecular interactions in the IMEx Consortium mutations data set

Fig. 6

Literature biases in IMEx mutations data set. a, b: Scatter plot of number of publications (in a logarithmic scale) in which a protein is reported vs a the number of annotations and b the number of variants reported in the IMEx mutations data set. Disease-related annotations are highlighted in blue, unrelated to disease in yellow. Regression lines are coloured accordingly. Correlation coefficients and p-values for the linear model fit are indicated. 95% Confidence Interval (CI) is shown as greyed areas over the regression lines; c Overlap of significantly enriched pathways (q < 0.01) across different sets of proteins and word enrichment analysis (using Wordle on enriched pathway names) for the overlapping set (any mutational effect), the set of proteins annotated with no effect and the remaining proteins in IMEx (non-mutated). In ‘No effect’ word enrichment analysis, the words ‘pathway’ and ‘action’ have been removed to make remaining words more visible (original word cloud available as Supplementary Figure 6a), while in ‘Overlapping mut’ word cloud the words ‘pathway’ and ‘signalling’ have been removed (original word cloud available as Supplementary Figure 6b). The analysis in this figure was performed considering human proteins only. Source data for Fig. 6a and 6b are provided as a Source Data file, source data for Fig. 6c is available as Supplementary Data 4 and 5

Back to article page