Fig. 1: Study design and the frequency of pneumococcal serotypes within the host.
From: Pneumococcal within-host diversity during colonization, transmission and treatment

a, A schematic of the study sampling design. b, A barplot indicating the number of times each serotype was observed across all deep-sequenced samples. The distribution of the corresponding within-host frequencies of these serotypes is given in the adjacent plot, with overlapping points separated to indicate the density at each position along the x axis. Lineages with ambiguous serotype calls were excluded from this plot. Serotypes found at significantly lower frequencies, as assessed by the Kolmogorov-Smirnov test, are coloured red. c, Histograms indicating the distribution of the number of unique serotypes observed using either PDS or latex sweeps. d, Comparisons between the estimated GPSC lineage frequencies in 192 samples that were sequenced in replicate. The vertical red line indicates the minimum frequency required for consideration in the mSWEEP pipeline. e, Barplots indicating the differences in the representation of serotypes between mothers and infants. f, Boxplots indicating the distribution of the mean number of serotypes (excluding non-typables) observed in 107 mothers and 450 of their infants. The median and interquartile range are given by the horizontal lines, with the whiskers indicating the largest and smallest values, excluding those that lie more than 1.5 times the interquartile range beyond the box.