Fig. 6: Comparison of de novo sequencing and database search results.

a De novo sequencing and database search results on the E. coli datasets. b De novo sequencing and database search results on the yeast datasets. c Venn diagrams of amino acids sequenced by DiNovo and database search engines on the combined results of four proteases. Database search engines for comparison include pFind, MSFragger, and MS-GF+. d The consistency of peptide sequences of spectra identified by both DiNovo and pFind. Gray indicates that the DiNovo sequence is consistent with the pFind sequence; otherwise, it is shown in yellow. To ensure a fair comparison, only pFind was included here, as it used the same set of spectra (mgf format) as DiNovo (exported by the pParse algorithm with the co-eluted precursors detected), while other engines directly used the raw spectra. Source data are provided as a Source Data file.