Extended Data Figure 2: Length distribution of SMRT subreads and FALCON parameter optimization for assembly.

a, The y axis on the left shows the number of subreads with given length (bin size = 100 bp) on the x axis, whereas the y axis on the right shows the sum of the length of subreads longer than or equal to the given length on the x axis. b, Effects of length cutoff parameters on contig N50 in de novo assembly by FALCON is shown on the right. The contig N50 depends on the two parameters, related to the amount of error-corrected reads for final assembly, length_cutoff and length_cutoff_pr, respectively, where the former was fixed at 10 kb but the latter varied from 10 to 16 kb. Black and green lines indicate the changes of N50 for 72× and 101× sequencing dataset, respectively.