Extended Data Fig. 7: DisP-seq signals and DisP islands in different cell lines.
From: DisP-seq reveals the genome-wide functional organization of DNA-associated disordered proteins

a, Graph showing the median length of IDRs found in b-isox precipitated nuclear proteins identified by mass spectrometry in MRC5 cells compared to permutations of equal numbers of proteins selected from the human proteome. The P-value was calculated by permutation test. b, Bar plot depicting the percentage of proteins with large IDRs (greater than 100 amino acids) in the human proteome and in b-isox precipitated nuclear proteins identified by mass spectrometry in MRC5 cells. c, Scatter plot showing DisP-seq signals for two biological replicates in MRC5 cells. The corresponding Pearson correlation coefficient is shown. d, Top: Percent amino acid composition for IDRs found in b-isox precipitated nuclear proteins identified by mass spectrometry in each cell line compared with all IDRs in the human proteome (n = 2 biologically independent experiments). Bottom: Percent differences between the human proteome values and the average cell line values shown in the top graph. Median value is shown as a line within the boxplot, which spans from the 25th to 75th percentiles. Whiskers indicate a 1.5× inter-quartile range. e, Representative example of DisP island (highlighted in light gray). f, Venn diagram depicting the overlap among DisP-seq peaks in SKNMC, NCI-H446 and MRC5 cell lines. g, Heatmaps depicting DisP-seq signal intensities at unique DisP islands from each cell line. Two biological replicates are shown for each cell lines. h, Numbers of DisP islands in each cell line.