Fig. 3: Association of the LARGE-LRH haplotype with susceptibility to Lassa fever.

a, K-means clustering of haplotypes in the LARGE1 region. Rows are phased haplotypes; columns are individual variants, with reference alleles shown in purple, alternate alleles shown in yellow and K-means clusters separated. b, Scatter plot of q values for allelic skew in the MPRA, coloured by the absolute value of the Pearson correlation with the haplotype. c,d, Scatter plots of GWAS association P values over the LARGE1 region for Nigeria (c) and Sierra Leone (d), coloured by the Pearson correlation of the protective allele in the GWAS with the LARGE-LRH. P values in c and d are based on SAIGE. e, Contingency table of LARGE-LRH genotype counts in cases and controls for Nigeria (NG, top) and Sierra Leone (SL, bottom). f, Ecologically estimated Lassa fever prevalence from Fichet-Calvet et al. (ref. 70), with pie charts indicating the frequency of the LARGE1 haplotype in 1000 Genomes populations (YRI, Yoruba; ESN, Esan; MSL, Mende; LWK, Luhya; GWD, Gambian Mandinka; ref. 51) or our GWAS cohorts (NG, SL). Stars indicate towns, villages or hospitals that encountered outbreaks, as detailed in Fichet-Calvet et al. (ref. 70).