Table 1 Characteristics of the Trichobilharzia szidati and T. regenti transcriptomes and annotation of their predicted proteomes.
T. szidati | T. regenti 18 | |
|---|---|---|
Transcriptome | ||
Total sequences | 13,007 | 12,705 |
Minimum and maximum sequence lengths; N50 (bp) | 160-39,264; 3,402 | 115-41,111; 3,682 |
Predicted proteome | ||
Total sequences | 13,007 | 12,705 |
Minimum and maximum sequence lengths; N50 (bp) | 30-11,108; 671 | 30-8,133; 805 |
Matches to conserved metazoan BUSCO genes | ||
Complete single copy orthologous groupsa | 603 (61.7) | 659 (67.4) |
Complete, duplicated orthologous groupsa | 97 (9.9) | 81 (8.3) |
Fragmented orthologous groupsa | 87 (8.9) | 57 (5.8) |
Total BUSCO orthologous groupsa | 978 (100) | 978 (100) |
Protein annotation | ||
NCBI nr databaseb | 10,457 (80.4) | 10,900 (85.8) |
SwissProtb | 7,633 (58.7) | 8,120 (63.9) |
MEROPS peptidasesb | 357 (2.7) | 392 (3.1) |
MEROPS inhibitorsb | 249 (1.9) | 259 (2.0) |
KEGG BRITE protein familiesc | 5,663 (43.5; 3,065) | 5,935 (46.7; 3,275) |
KEGG pathwayc | 3,452 (26.5; 1,837) | 3,611 (28.4; 1,934) |
InterProScanb | 8,171 (62.8) | 9,642 (75.9) |
Gene ontology annotation (InterProScan)b | 6,730 (51.7) | 6,961 (54.8) |
Proteins predicted to be excreted/secreted | 642 (4.9) | 468 (3.7) |