Supplementary Figure 1: Association between Mtb genetic distance and spatial proximity of patients.

For all pairs of Mtb isolates, the probability of both originating from patients living in the same district was calculated (y-axis) and stratified according to genetic distance between the Mtb genomes (expressed in number of SNPs, x-axis). P-values indicate result of testing for difference in proportions of isolate pairs originating from same district, for isolate pairs with SNP distance <10 vs >20, calculated separately for Beijing and non-Beijing isolates. (Note it was not possible to further stratify non-Beijing pairs into L1 pairs vs L4 pairs for statistical tests, due to low sample size.).