Fig. 3: Transcriptomic signatures obtained from the discovery cohort.

Density plots (left panels) of the AUC values computed on the 999 training resamples using the optimal n-transcript signature; red dashed vertical lines indicate the median values. ROC curves and AUC values (central panels) for the total cohort, comparing i) M. pneumoniae pneumonias (n = 30) vs. viral pneumonias (n = 77) (black line; AUCTO|M-V), ii) M. pneumoniae pneumonias (n = 30) vs. bacterial pneumonias (n = 5) (red line; AUCTO|M-B), and iii) M. pneumoniae pneumonias (n = 30) vs. all non-mycoplasma pneumonias (including virus, non-mycoplasma bacteria, and non-mycoplasma co-infections; n = 92) (violet line; AUCTO|M-VBC). Boxplots (right panels) of the values predicted by each optimal model in the total cohort, with two-sided Wilcoxon rank-sum test P-values; the red dashed line marks the optimal cutpoint. The boxes are bounded by the lower and upper quartiles (Q1 and Q3); the median is shown as a bold-colored horizontal line; whiskers extend to the most extreme data point that lies no more than 1.5 times the interquartile range (IQR) from the box. VBC all non-mycoplasma pneumonias (including virus, non-mycoplasma bacteria, and non-mycoplasma co-infections), B bacterial, M M. pneumoniae, TO total sample, V viral.
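
The quantities summarized in this caption can be illustrated with a minimal sketch. It is not the study's pipeline. It assumes a plain bootstrap of predicted scores as the resampling scheme (the caption does not specify one), uses the rank-based definition of the AUC, and applies the 1.5 × IQR whisker rule described above. All function names are hypothetical and the scores are synthetic; only the group sizes (30 and 77) come from the caption.

```python
import random
import statistics

def auc_mann_whitney(pos, neg):
    """AUC as the fraction of (pos, neg) pairs where pos scores higher; ties count 0.5."""
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def resampled_aucs(pos, neg, n_resamples=999, seed=0):
    """Assumed scheme: resample each group with replacement and recompute the AUC."""
    rng = random.Random(seed)
    aucs = []
    for _ in range(n_resamples):
        p = rng.choices(pos, k=len(pos))
        n = rng.choices(neg, k=len(neg))
        aucs.append(auc_mann_whitney(p, n))
    return aucs

def boxplot_whiskers(values):
    """Most extreme data points lying within 1.5 * IQR of the box (Q1 to Q3)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low = min(v for v in values if v >= q1 - 1.5 * iqr)
    high = max(v for v in values if v <= q3 + 1.5 * iqr)
    return low, high

if __name__ == "__main__":
    # Synthetic predicted scores; group sizes mirror the caption, values do not.
    rng = random.Random(1)
    mycoplasma = [rng.gauss(1.0, 1.0) for _ in range(30)]
    viral = [rng.gauss(0.0, 1.0) for _ in range(77)]

    aucs = resampled_aucs(mycoplasma, viral)
    print("median resampled AUC:", round(statistics.median(aucs), 3))
    print("whiskers (mycoplasma):", boxplot_whiskers(mycoplasma))
```

The pairwise AUC used here equals the Mann-Whitney U statistic divided by the number of positive-negative pairs, so it is tied to the same ranks as the Wilcoxon rank-sum test reported in the right panels.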