Table 2 The number of reads and predicted transcripts. We show: (1) maximum RT-PCR product length per family (including primers), (2) the number of original PacBio CCS reads, with the number of distinct read sequences in parentheses, (3) the number of Illumina reads, (4) the number of distinct proovread Illumina-corrected CCS reads, (5) the number of ICE-predicted transcripts, and (6) the number of IsoCon-predicted transcripts. We use s1 and s2 to indicate sample 1 and sample 2, respectively
From: Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon
Family | Max length (nt) | Original PacBio CCS reads (s1) | Illumina reads (s1) | Illumina-corrected CCS (s1) | ICE (s1) | IsoCon (s1) | Original PacBio CCS reads (s2) | Illumina reads (s2) | Illumina-corrected CCS (s2) | ICE (s2) | IsoCon (s2) |
---|---|---|---|---|---|---|---|---|---|---|---|
BPY | 321 | 36 (22) | 6854 | 22a | 1 | 2 | 37 (15) | 9916 | 15a | 1 | 1 |
CDY_1 b | 1660 | 1110 (1090) | 55,228 | 508 | 72 | 28 | 453 (439) | 41,434 | 184 | 18 | 5 |
CDY_2 c | 1623 | 442 (440) | 75,862 | 322 | 19 | 11 | 1766 (1670) | 74,080 | 630 | 28 | 28 |
DAZ | 2235 | 495 (487) | 49,500 | 208 | 14 | 34 | 530 (519) | 39,318 | 291 | 16 | 34 |
HSFY | 1163 | 933 (877) | 14,832 | 350 | 26 | 25 | 205 (181) | 26,408 | 59 | 5 | 2 |
PRY | 421 | 177 (126) | 40,904 | 121 | 8 | 8 | 25 (20) | 6864 | 20 | 4 | 3 |
RBMY | 1483 | 6615 (6365) | 85,068 | 3698 | 105 | 162 | 6939 (6284) | 65,284 | 2840 | 90 | 181 |
TSPY | 916 | 2121 (1955) | 27,428 | 903 | 32 | 133 | 1418 (1249) | 8756 | 772 | 36 | 80 |
VCY | 378 | 50 (23) | 11,820 | 23 | 2 | 2 | 53 (47) | 3328 | 47 | 1 | 7 |
XKRY | 340 | 53 (28) | 15,722 | 28a | 2 | 1 | 55 (39) | 2890 | 39a | 1 | 3 |
Total | N/A | 12,032 (11,413) | 383,218 | 6183 | 281 | 406 | 11,481 (10,463) | 278,278 | 4897 | 200 | 344 |