Table 3 IsoCon-predicted transcripts of the nine ampliconic gene families. Columns show predictions shared between samples, separated by gene family and categorized as coding or non-coding. The calculated number of groups is shown, in comparison to known copy numbers from the reference genome and observed in human populations. Novel transcripts are those that do not have a perfect alignment to the NCBI transcript reference database, the numbers in parentheses indicate additional transcripts that have a perfect alignment only to ESTs, synthetic constructs, or in silico-predicted transcripts

From: Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon

Gene family

Number of coding members annotated in the ref. 67

Range of copy numbers observed in human populations68,69

IsoCon transcripts shared between samples, coding

IsoCon transcripts shared between samples, non-coding

Total number of inferred groups

Number of inferred coding groups

Novel transcripts

Predicted novel splice variants, coding

Predicted novel splice variants, non-coding

BPY

3

2–3

1

0

1

1

0 (0)

–

–

CDY

4

2–4

3

2

2

2

1(1)

–

–

DAZ

4

2–5

5

1

3

3

5(0)

4

1

HSFY

2

2

0

0

–

–

–

–

–

PRY

2

2–3

1

2

1

1

2(0)

–

2

RBMY

6

6–18

26

35

18

14

49(3)

2

10

TSPY

35

12–38

34

9

20

14

37(3)

4

2

VCY

2

2–3

1

0

1

1

0(0)

–

–

XKRY

2

2–3

1a

0

1

1

0(0)

–

–

Total

32

32–79

72

49

51

38

94

10

15

  1. aThe XKRY transcript shared by two samples for XKRY-based primers had a better alignment to XKR3 than to XKRY and thus might not be Y-specific, but we did find a Y-specific transcript unique to sample 2