Fig. 5: InstaNovo increases protein coverage, identifies novel organisms, and detects semi- and non-tryptic peptides. | Nature Machine Intelligence

Fig. 5: InstaNovo increases protein coverage, identifies novel organisms, and detects semi- and non-tryptic peptides.

From: InstaNovo enables diffusion-powered de novo peptide sequencing in large-scale proteomics experiments

Fig. 5

a, Protein coverage and peptide sequences for UniProt ID P01859 - IGHG2 (immunoglobulin heavy constant gamma 2 chain) in human wound fluids, where database search peptides and novel predictions with IN are shown. b, Correct PSMs for different precision thresholds in the ‘Candidatus Scalindua brodae’ proteome. c, Phylogenetic tree of a representative sample of additional organisms identified in the co-culture. d, Venn diagram of database search and novel IN predictions of peptide sequences at 5% FDR from snake venom proteomics that map to the proteomes database used. e, Venn diagram of database search, IN and IN+ predictions at 5% FDR peptide sequences matching the proteome database used from immunopeptidomics dataset. f, Shannon information content of residues in sequence positions of immunopeptidomics experiments. g, PRM monitoring of fully GluC-generated peptide ATVWIHGDNEENKE, and its abundance in the two conditions (n = 3, median as centre line, 25th to 75th percentiles as bounds of the box, whiskers extending to 1.5 times the interquartile range from the bounds of the box, with minima and maxima beyond the whiskers plotted as individual points). RT, retention time in minutes. h, GluC specificity profile from statistically significant predicted PSMs matching database search results.

Back to article page