Table 2 The number of reads and predicted transcripts. We show: (1) maximum RT-PCR product length per family (including primers), (2) the number of original PacBio CCS reads, with the number of distinct read sequences in parentheses, (3) the number of Illumina reads, (4) the number of distinct proovread Illumina-corrected CCS reads, (5) the number of ICE-predicted transcripts, and (6) the number of IsoCon-predicted transcripts. We use s1 and s2 to indicate sample 1 and sample 2, respectively

From: Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon

Family

Max length (nt)

Original PacBio CCS reads (s1)

Illumina reads (s1)

Illumina-corrected CCS (s1)

ICE (s1)

IsoCon (s1)

Original PacBio CCS reads (s2)

Illumina reads (s2)

Illumina-corrected CCS (s2)

ICE (s2)

IsoCon (s2)

BPY

321

36 (22)

6854

22a

1

2

37 (15)

9916

15a

1

1

CDY_1 b

1660

1110 (1090)

55,228

508

72

28

453 (439)

41,434

184

18

5

CDY_2 c

1623

442 (440)

75,862

322

19

11

1766 (1670)

74,080

630

28

28

DAZ

2235

495 (487)

49,500

208

14

34

530 (519)

39,318

291

16

34

HSFY

1163

933 (877)

14,832

350

26

25

205 (181)

26,408

59

5

2

PRY

421

177 (126)

40,904

121

8

8

25 (20)

6864

20

4

3

RBMY

1483

6615 (6365)

85,068

3698

105

162

6939 (6284)

65,284

2840

90

181

TSPY

916

2121 (1955)

27,428

903

32

133

1418 (1249)

8756

772

36

80

VCY

378

50 (23)

11,820

23

2

2

53 (47)

3328

47

1

7

XKRY

340

53 (28)

15,722

28a

2

1

55 (39)

2890

39a

1

3

Total

N/A

12,032 (11,413)

383,218

6183

281

406

11,481 (10,463)

278,278

4897

200

344

  1. aproovread exited with an error that the sequences are too short and was not able to correct any reads
  2. bCDY transcripts captured using the first primer pair (see Supplementary Fig. 11 for details)
  3. cCDY transcripts captured using the second primer pair (see Supplementary Fig. 11 for details)