Table 2 Host mapping statistics of RNA sequencing datasets.

From: Bronchoalveolar lavage fluid metagenomic datasets: a multidimensional clinical biomolecular resource

 

| | Bacteria | Fungi | Cancer | TB |
|---|---|---|---|---|
| Number of datasets | 114 | 79 | 123 | 86 |
| Assigned (M reads) | 5.86 ± 3.28 | 5.86 ± 3.66 | 5.89 ± 3.12 | 5.41 ± 3.19 |
| Unassigned Unmapped (M reads) | 3.25 ± 4.06 | 2.35 ± 2.02 | 3.23 ± 3.58 | 2.34 ± 2.21 |
| Unassigned MultiMapping (M reads) | 8.66 ± 4.88 | 8.54 ± 5.49 | 8.86 ± 4.06 | 7.81 ± 4.62 |
| Unassigned NoFeatures (M reads) | 9.03 ± 4.23 | 9.13 ± 5.4 | 9.05 ± 4.45 | 8.99 ± 5.04 |
| Unassigned Ambiguity (M reads) | 0.28 ± 0.16 | 0.25 ± 0.15 | 0.28 ± 0.15 | 0.23 ± 0.15 |

  1. Assigned: Reads that map uniquely and completely to a single annotated feature according to the GFF file.
  2. Unassigned_Unmapped: Reads that cannot be mapped to the reference genome and therefore cannot be assigned to any annotated feature.
  3. Unassigned_MultiMapping: Reads that map to multiple annotated features.
  4. Unassigned_NoFeatures: Reads that map to the reference genome, but the mapped genomic location lacks any annotation (absence of features).
  5. Unassigned_Ambiguity: Reads where the alignment identity falls below 90%.
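
The entries in the table are reported in millions of reads (M reads) as a central value ± a spread. As a minimal sketch of how one such entry could be derived from per-dataset read counts, the Python snippet below assumes that the values are a mean ± sample standard deviation across the datasets of a group; the table itself does not state this. The function name `summarize_millions` and the example counts are hypothetical and are not taken from the paper.

```python
from statistics import mean, stdev


def summarize_millions(read_counts):
    """Format per-dataset read counts as 'mean ± SD' in millions of reads.

    Assumption: the spread is the sample standard deviation (n - 1 denominator).
    Requires at least two datasets, because stdev() is undefined for one value.
    """
    millions = [count / 1_000_000 for count in read_counts]
    return f"{mean(millions):.2f} ± {stdev(millions):.2f}"


# Hypothetical read counts for one category (e.g. Assigned) in three datasets.
example_counts = [4_000_000, 6_000_000, 8_000_000]
print(summarize_millions(example_counts))
```

Applying the same function to each category and each group (Bacteria, Fungi, Cancer, TB) would yield one table cell per call, under the stated assumption about the ± values.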