Host control of persistent Epstein–Barr virus infection

Schmidt, Axel; Alawathurage, T. Madhusankha; David, Friederike S.; Ogawa, Yosuke; Frach, Leonard; Richter, Sylvia; Schaefer, Merle; Mathey, Carina M.; Henne, Sabrina K.; Forstner, Andreas J.; Dilthey, Alexander T.; Pröbstel, Anne-Katrin; Boztug, Kaan; Nöthen, Markus M.; Namkoong, Ho; Okada, Yukinori; Beins, Eva C.; Ludwig, Kerstin U.

doi:10.1038/s41586-026-10274-4

Article
Open access
Published: 19 February 2026

Nature (2026)

Abstract

Epstein–Barr virus (EBV) infects approximately 90–95% of the global population^1,2 and persists in B cells as a lifelong infection³. Previous EBV infection is associated with autoimmune and neoplastic disease⁴. Still, the biological basis of host control during EBV persistence remains unclear. Here we report the identification of non-genetic and genetic factors that are associated with EBV control during persistent infection. Using blood-based genome sequence data from 486,315 UK Biobank and 336,123 All of Us participants, we identified short-read pairs mapping to the EBV genome in 16.2% and 21.8% of individuals, respectively. EBV read detection (EBVread⁺) reflects increased viral load in blood cells, as shown by orthogonal measurements, and was associated with HIV infection, immunosuppressive drug intake and current smoking. Genome-wide analyses of EBVread⁺ identified strong associations at the major histocompatibility complex (MHC), including 54 independent human leukocyte antigen (HLA) alleles of MHC classes I and II, and at 27 genomic regions outside MHC. Epistasis with distinct HLA alleles of MHC class I was observed at the ERAP2 locus. Analysis of individuals with EBV-associated diseases⁴ revealed a higher polygenic burden of EBVread⁺ for HLA alleles at MHC class I in multiple sclerosis (driven by HLA-A*02:01) and at MHC class II in rheumatoid arthritis. Phenome-wide analyses identified a polygenic overlap of EBVread⁺ with inflammatory bowel disease, hypothyroidism and type 1 diabetes. Our study establishes by-products of human genome sequencing as a surrogate marker of EBV viral load. This will facilitate investigation and treatment for EBV and other persistent viral infections.

Main

EBV (human herpesvirus 4) is a DNA virus that infects approximately 90–95% of the global population^1,2. Primary EBV infection usually occurs in childhood and remains asymptomatic or mild. From adolescence onwards, it can cause infectious mononucleosis⁵. EBV enters the host via the oropharyngeal epithelium and infects naive B cells. These differentiate into long-lived memory B cells that become part of the circulation, thereby establishing persistent infection^3,6. Occasionally, EBV-infected memory B cells reactivate to produce new infectious virions⁷.

EBV infection is a risk factor for various neoplasms (for example, Hodgkin and non-Hodgkin lymphoma) and autoimmune diseases (for example, multiple sclerosis)^4,8,9. Although EBV seropositivity is a prerequisite for multiple sclerosis¹⁰, only some individuals infected with EBV develop the disease, following a prodromal phase¹¹. Furthermore, although multiple sclerosis risk is significantly elevated post-infectious mononucleosis, many patients with multiple sclerosis did not have a severe primary EBV infection¹². Thus, multiple sclerosis may arise secondary to inefficient EBV immune control during the prodromal phase, as indicated by high EBV viral load¹¹. Similar mechanisms might be implicated in other EBV-associated autoimmune disorders, as suggested by elevated EBV viral loads in systemic lupus erythematosus¹³ and rheumatoid arthritis¹⁴. In EBV-associated cancers, the importance of proper EBV immune control has been demonstrated by studies of inborn errors of immunity (IEIs): patients with IEIs involving impaired T and natural killer (NK) cell cytotoxicity have elevated EBV viral loads in blood¹⁵, and an increased risk for B cell-derived EBV-positive lymphomas¹⁶. Individuals with human immunodeficiency virus (HIV) or immunosuppression also show impaired EBV control¹⁷ and an increased incidence of EBV-positive lymphomas^18,19. Still, despite its presumed clinical relevance, data on immune control during persistent EBV infection are limited.

Research into the biological basis of immune control of persistent EBV infection is hampered by a lack of direct measurements of EBV viral load in large immunocompetent cohorts, and limited knowledge regarding the role of serological factors in the control of EBV²⁰.

To address this, we exploited the fact that EBV DNA in memory B cells is sequenced as a by-product of genome sequencing (GS) of human peripheral blood²¹. Using blood-based GS data from the UK Biobank (UKB)²² and All of Us (AoU)²³ together with orthogonal data, we demonstrated that short-read pairs mapping to the EBV genome (EBV reads) in GS data are a surrogate measure for increased EBV viral load. EBV read prevalence was increased in immunosuppressed individuals; in current smokers; and in samples obtained in winter. Strong genetic associations were found for the MHC locus and 27 loci outside MHC, which were broadly consistent across the two biobanks. Downstream analyses suggested candidate genes, and highlighted pathways and cell types relevant for EBV immunity. Investigations of EBV-associated diseases generated novel hypotheses regarding mechanisms in multiple sclerosis and rheumatoid arthritis, and phenome-wide analyses identified novel diseases for which host control of EBV viral load might be pathophysiologically relevant.

EBV reads are present in GS data from biobanks

We retrieved EBV reads from the GS data of 490,293 UKB participants²⁴ (Methods; Fig. 1a and Supplementary Notes 1 and 2). During quality control (QC), 51 library-preparation plates showed evidence of contamination and were excluded (Methods; Extended Data Fig. 1 and Supplementary Fig. 1). Aggregated EBV reads of the remaining 486,315 individuals (UKB QC cohort) were evenly distributed across the EBV genome (Fig. 1b). EBV read distribution was zero inflated, that is, no EBV reads were observed in n = 407,544 individuals (83.8%, denoted as ‘EBVread⁻’; Fig. 1c). Of the 78,771 individuals with detected EBV reads (‘EBVread⁺’, 16.2%), 61.9% had EBV read count = 1. Further analysis of coverage and sequence data (Methods) confirmed that EBV reads from this group reflect true signals (Extended Data Fig. 2 and Supplementary Table 1).

**Fig. 1: Analysis of EBV reads in blood-based GS data.**

EBV reads were also extracted from the blood-based GS data of 336,123 ethnically diverse individuals from AoU²³ (AoU QC cohort; Methods). EBV read distribution was similar to that in UKB, but a lower fraction of individuals had EBV read count = 1 (n = 37,901 out of 73,137 EBVread⁺ individuals, 51.8%; Extended Data Fig. 3). Overall, 21.8% of the AoU QC cohort were EBVread⁺, although this varied across ancestries (Supplementary Table 2). For European (EUR) cohorts, the fraction of EBVread⁺ individuals was comparable in AoU (17.6%) and UKB (15.8%; UKB EUR cohort; Fig. 1a; Methods). Whether the residual difference is due to ancestry-specific mechanisms of EBV control or characteristics such as a higher average GS coverage in AoU (Supplementary Table 2) awaits elucidation. In our data, the EBVread⁺ fraction is higher than in smaller GS (14.0%)²¹ or diagnostic quantitative PCR (qPCR; 11.03%)¹⁸ studies of immunocompetent individuals. This might be attributable to differences in cohort composition and/or strict cut-offs used in clinical settings.

EBVread⁺ status reflects increased EBV viral load in blood cells

We then assessed the relevance of GS-based EBVread⁺ to EBV biology. First, we investigated how well EBVread⁺ matches EBV seropositivity. In a UKB subcohort with available serology data (UKB serology cohort; n = 9,281), 491 individuals were EBVsero⁻ and 8,790 EBVsero⁺, based on previous definitions¹. EBV reads were observed in 0.61% of EBVsero⁻ and 16.38% of EBVsero⁺ individuals (sensitivity of 16.4% and specificity of 99.4%; Fig. 1d). Second, we investigated whether EBV read detection reflects high viral load in blood cells. To this end, we (1) simulated GS and compared modelled versus observed outcomes; (2) measured viral load via qPCR in samples from two small, independent cohorts with GS data^25,26; and (3) correlated EBV read counts with EBV gene expression from blood-based RNA sequencing (RNA-seq; Japan COVID-19 Task Force²⁶ (JCTF); Methods).
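
The sensitivity and specificity quoted above follow directly from the serology cross-tabulation. The following is a minimal Python sketch of that arithmetic. The group sizes (491 and 8,790) come from the text; the EBVread⁺ counts within each group (3 and 1,440) are assumptions, back-calculated from the reported 0.61% and 16.38%, and are not stated in the article.

```python
# Serostatus group sizes are taken from the text; the EBVread+ counts within
# each group are ASSUMED, back-calculated from the reported 0.61% and 16.38%.
n_sero_neg, n_sero_pos = 491, 8790
read_pos_in_sero_neg = 3      # assumption: 3 / 491 is about 0.61%
read_pos_in_sero_pos = 1440   # assumption: 1440 / 8790 is about 16.38%


def sensitivity(true_pos: int, condition_pos: int) -> float:
    """Fraction of seropositive individuals who are EBVread+."""
    return true_pos / condition_pos


def specificity(false_pos: int, condition_neg: int) -> float:
    """Fraction of seronegative individuals who are EBVread-."""
    return (condition_neg - false_pos) / condition_neg


sens = sensitivity(read_pos_in_sero_pos, n_sero_pos)
spec = specificity(read_pos_in_sero_neg, n_sero_neg)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```

Under these assumed counts, EBVread⁺ behaves as a marker with low sensitivity but very high specificity for seropositivity, which matches the interpretation in the text.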

The simulation reproduced the EBV read distribution observed in UKB or AoU, including the zero inflation (Extended Data Fig. 4), and was compatible with an underlying log-normal distribution, as reported for HIV-1 viral load²⁷. In the qPCR analysis, EBV read counts showed a positive correlation with EBV DNA detection, and a negative correlation with Cp (crossing point) values (Fig. 1e,f and Extended Data Fig. 2). In 1,010 individuals from the JCTF, the fraction of individuals with detected EBV transcripts was higher among EBVread⁺ than among EBVread⁻ samples (Fig. 1g). Together, this provides evidence that EBVread⁺ represents an approximation of elevated EBV viral load within human blood cells.
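
The idea that a log-normal viral load produces a zero-inflated read-count distribution can be illustrated with a small simulation. This is a sketch under stated assumptions, not the simulation described in the Methods: the expected number of EBV read pairs per sample is drawn from a log-normal distribution, observed counts are modelled as Poisson draws around that expectation, and the parameters `mu` and `sigma` are hypothetical values chosen only for illustration.

```python
import math
import random


def poisson(lam: float, rng: random.Random) -> int:
    """Draw one Poisson variate with Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > threshold:
        k += 1
        prod *= rng.random()
    return k


def simulate_read_counts(n: int, mu: float, sigma: float, seed: int = 0) -> list:
    """Simulate EBV read counts for n samples.

    The expected read count per sample is log-normal(mu, sigma); the observed
    count is a Poisson draw around it. mu and sigma are hypothetical.
    """
    rng = random.Random(seed)
    return [poisson(rng.lognormvariate(mu, sigma), rng) for _ in range(n)]


counts = simulate_read_counts(n=20_000, mu=-3.0, sigma=1.5)
zero_fraction = counts.count(0) / len(counts)
positives = [c for c in counts if c > 0]
single_read_fraction = positives.count(1) / len(positives)
print(f"EBVread- fraction: {zero_fraction:.3f}")
print(f"read count = 1 among EBVread+: {single_read_fraction:.3f}")
```

With a low expected read count per sample, most simulated individuals have zero reads even though all of them carry a non-zero underlying load, which is the qualitative pattern the text describes.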

EBVread⁺ is associated with decreased EBV control during persistence

To determine which phase of the EBV life cycle is reflected by EBVread⁺, we investigated correlations between EBV read counts and (1) individual EBV transcript counts from the JCTF cohort; and (2) four individual EBV antibody levels (EA-D, EBNA-1, ZEBRA and VCA-p18, all IgG; median fluorescence intensity (MFI) values)¹. In step one, the strongest EBVread⁺ correlations were with transcripts of A73 (ρ = 0.47, Spearman’s rank correlation), BARF0 (ρ = 0.42) and RPMS1 (ρ = 0.43; Fig. 1g). All three belong to the BART gene cluster that is associated with latency⁴. We also observed correlations with transcripts of some lytic genes, particularly from the same genomic region.

Step two was performed in 7,338 EUR EBVsero⁺ individuals from the UKB serology cohort, with presumed persistent (not primary) EBV infection given their age at recruitment (Methods). The strongest correlation was observed with IgG levels to VCA-p18 (ρ = 0.12, P < 2.2 × 10⁻¹⁶), followed by IgG levels to ZEBRA and EBNA-1 (Extended Data Fig. 5). Although VCA-p18 is a lytic-phase antigen, IgG to VCA-p18 is detectable during persistent EBV infection²⁸ and increased titres are found in individuals with high EBV viral load in blood^29,30. Thus, higher viral load in blood cells, as measured by EBVread⁺, might correlate with ongoing lytic activity. This aligns with the ‘germinal centre model of EBV persistence’⁷, in which the latently infected memory B cell pool in blood is maintained in equilibrium by lytic reactivation events in lymphoid tissues (Extended Data Fig. 2). However, our data suggest an extension to this model, as some reactivation might occur within blood, as recently also demonstrated in individuals with systemic lupus erythematosus³¹.

Non-genetic factors and sex contribute to EBVread⁺

Next, we investigated the influence of non-genetic factors and sex on EBVread⁺, with the aim to (1) identify those factors; (2) enable exclusion from further analysis of all individuals whose EBV read count was probably determined exogenously; and (3) control for these factors in subsequent analyses. Whenever possible, we minimized overfitting by using one biobank for discovery and the other for replication (Supplementary Note 1).

First, we assessed 11,111 SNOMED concept IDs and their association with EBVread⁺ in the AoU QC cohort (Methods). Initial test statistics were highly inflated, with HIV positivity and smoking showing the strongest associations (Supplementary Table 3 and Supplementary Fig. 2). When the analysis was conditioned on these two traits, inflation was largely resolved, although some residual associations with several immune-related SNOMED concepts remained (Supplementary Note 3).

To quantify the effect of HIV or immunosuppression on EBVread⁺ and identify additional contributors, we investigated non-related individuals of EUR ancestry. Individuals with outlier blood count measurements and those in the top EBV read count percentile were excluded, given the high prevalence of pathophysiological processes in this group, which probably drive EBV abundance (Supplementary Fig. 3 and Supplementary Note 4). In this UKB no outlier cohort, 48,771 of 313,387 individuals were EBVread⁺ (that is, 15.6%; with an expected standard deviation (s.d.) of 0.1% based on bootstrapping). HIV infection and immune-modulatory drugs significantly increased the likelihood of EBVread⁺. The highest probability was for reported HIV infection (39.7%, s.d. = 3.5%), followed by intake of glucocorticoids (19.4%, s.d. = 0.7%) or other immunosuppressive drugs (18.3%, s.d. = 0.5%).
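
The bootstrapped s.d. of a prevalence estimate is closely approximated by the analytic binomial standard error, sqrt(p(1 − p)/n). The sketch below applies this formula to the cohort counts given in the text. It is an approximation swapped in for the bootstrap procedure, which is not reproduced here.

```python
import math


def proportion_se(successes: int, n: int) -> float:
    """Analytic binomial standard error of a proportion."""
    p = successes / n
    return math.sqrt(p * (1.0 - p) / n)


# Counts from the text: EBVread+ individuals in the UKB no outlier cohort.
n_total, n_read_pos = 313_387, 48_771
p_hat = n_read_pos / n_total
se = proportion_se(n_read_pos, n_total)
print(f"EBVread+ = {p_hat:.1%}, analytic s.e. = {se:.2%}")
```

The analytic value is roughly 0.06%, which agrees with the reported 0.1% once rounded to one decimal place.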

We then excluded from the UKB no outlier cohort individuals with reported HIV infection, or current use of glucocorticoids or other immunosuppressive drugs (‘UKB no immune supp. cohort’; Fig. 1a), and performed variable selection on a set of predefined covariates to identify further contributing factors in immunocompetent individuals (non-related individuals; 47,234 EBVread⁺ and 257,899 EBVread⁻; Methods; Supplementary Table 4). EBV reads were more frequent in male individuals than in female individuals (17.1%, s.d. = 0.1% versus 14.1%, s.d. = 0.1%) and in current smokers than in current non-smokers (22.1%, s.d. = 0.3% versus 14.7%, s.d. = 0.1%; Fig. 1i). Former smoking status alone was not identified as a relevant predictor of EBVread⁺. Other selected variables were increasing age, GS yield and lymphocyte percentage, all of which were positively correlated with EBVread⁺ (Fig. 1j). EBV read detection was also more probable in samples collected in winter (Fig. 1j). This seasonality effect was confirmed in AoU (Extended Data Fig. 3) and requires further investigation. A plausible hypothesis is that seasonal infections during winter, such as co-infections with respiratory viruses, drive EBVread⁺. This would be consistent with observations of a higher prevalence of EBVread⁺ in the JCTF, whose participants were infected with SARS-CoV-2 around the time of sampling (39.2% EBVread⁺; Supplementary Table 5, Supplementary Fig. 4 and Supplementary Note 5). Together, the identified factors might also contribute to cross-biobank and cross-ancestry differences in EBVread⁺ prevalence.

Common variants in and outside of MHC contribute to EBVread⁺

To identify associations between common genetic variants and EBVread⁺, we performed a genome-wide association study (GWAS) using related individuals from the UKB no immune supp. cohort (Fig. 1a) and imputed data (Methods). Variants at 28 loci showed genome-wide significance (Fig. 2a), including a long-range association at the MHC locus and additional associations at 27 non-MHC loci (Methods; Table 1 and Supplementary Tables 6 and 7). The heritability estimate for EBVread⁺ for all common variants outside the MHC region was 2.04% (standard error of the mean (s.e.m.) = 0.44%; linkage disequilibrium score regression³²).

**Fig. 2: Genetic analyses of EBVread⁺.**

Table 1 Overview of 27 non-MHC loci associated with EBVread⁺ in UKB


At the non-MHC loci, gene prioritization approaches (Methods) highlighted genes implicated in immune processes (for example, ERAP2 and EOMES), known IEIs (for example, CD70, IKZF3 and CTLA4) and genes of pharmacological relevance (for example, SLAMF7, inhibited by elotuzumab; Supplementary Table 8). Non-MHC lead variants were also associated with a broad range of phenotypes in OpenTargets (Supplementary Table 9), although the extent varied across loci. While some loci showed high pleiotropy (more than 100 associated phenotypes, for example, loci including SH2B3, PTPN22 and IRF1), other lead variants had only a few associations at the same significance threshold, suggesting a more specific role in EBV control (for example, ILDR1 and CMC1). Fine-mapping with SuSiE³³ identified potentially causative variants at four loci (Table 1 and Supplementary Table 10), including three missense variants with posterior inclusion probability (PIP) scores > 0.1, and one non-coding variant, rs531660643, with PIP > 0.95. The latter is a splice quantitative trait locus (QTL) for BCL3 (whole blood, GTEx v8), which is involved in B cell fate and NF-κB regulation³⁴.

At the MHC region, the immunologically relevant variants are alleles of HLA genes (‘HLA alleles’), which determine the repertoire of antigens that can be presented to the immune system. On the basis of the imputed HLA alleles²², 116 different classical HLA alleles were associated with EBVread⁺ (Methods; Supplementary Table 7). The lowest P value was for the MHC class II (MHC-II) allele HLA-DRB1*04:04 (beta = 0.79, s.e.m. = 0.02), which is associated with increased rheumatoid arthritis risk³⁵. The next most significant HLA alleles were HLA-A*02:01 (beta = −0.31, s.e.m. = 0.01), which decreases risk for multiple sclerosis³⁶, EBV⁺ Hodgkin lymphoma³⁷ and endemic Burkitt lymphoma³⁸, and HLA-B*14:02 (beta = −0.68, s.e.m. = 0.02). After iterative conditional analyses, 54 independent alleles from MHC-I and MHC-II remained with genome-wide significance (Methods; Fig. 2b and Supplementary Table 7).

Given previous evidence for epistatic effects between HLA alleles and genes involved in antigen processing, for example, ERAP2 (ref. ³⁹) and ERAP1 (ref. ⁴⁰), we conducted an interaction analysis between the 54 conditionally independent HLA alleles and the top three non-MHC loci (Methods). After correction for multiple testing, three significant interactions were identified between the ERAP2 lead variant rs2548225 and HLA alleles of MHC-I (that is, HLA-A*02:01, HLA-B*40:01 and HLA-B*15:01; Fig. 2c and Supplementary Table 11). This is functionally plausible, as ERAP2 encodes an aminopeptidase that trims peptides within the endoplasmic reticulum before loading onto MHC-I⁴¹. The rs2548225 risk allele tags ERAP2 haplotypes that are characterized by splice variants, which render ERAP2 mRNA non-functional⁴¹.

Finally, we aimed to replicate the UKB-based EBVread⁺ GWAS results in 184,948 individuals of EUR ancestry from the AoU no outlier cohort. Of the 116 associated HLA alleles, 106 were matched to HLA alleles in AoU (Methods). Of these, 100 showed P < 0.05 and a consistent effect direction in both datasets (Supplementary Table 7). For the 54 conditionally independent HLA alleles, 46 of the 52 that were available in AoU were replicated, as were lead variants at 25 of the 27 non-MHC loci (at P < 0.05; Supplementary Table 6). No meta-analysis was performed due to missing or different covariates, for example, the lack of blood count data in AoU (Supplementary Note 4).
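
The replication criterion used above (P < 0.05 in the replication cohort and the same effect direction as in discovery) can be written as a small predicate. This is an illustrative sketch: the function name and the example effect sizes are hypothetical and do not come from the supplementary tables.

```python
def is_replicated(beta_discovery: float, beta_replication: float,
                  p_replication: float, alpha: float = 0.05) -> bool:
    """True if the replication effect is nominally significant and has the
    same sign as the discovery effect."""
    same_direction = beta_discovery * beta_replication > 0
    return same_direction and p_replication < alpha


# Hypothetical (discovery beta, replication beta, replication P) triples.
examples = {
    "allele_A": (0.30, 0.25, 1e-6),    # same direction, significant
    "allele_B": (-0.20, -0.05, 0.20),  # same direction, not significant
    "allele_C": (0.15, -0.10, 0.01),   # significant, opposite direction
}
replicated = {name: is_replicated(*vals) for name, vals in examples.items()}
print(replicated)
```

Only the first hypothetical allele would count as replicated under this rule, since it satisfies both conditions.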

Associated GWAS loci for EBVread⁺ are specific for increased EBV viral load

To explore whether the identified loci are specific for EBV viral load, we compared effect sizes of lead variants from the EBVread⁺ GWAS to GWAS data for memory B cell abundance⁴² and human herpesvirus 7 (HHV7). For memory B cell abundance, no significant Spearman’s correlation was observed for non-MHC loci (Extended Data Fig. 6; no MHC data provided). However, a genome-wide significant association was observed for the EBVread⁺ lead variant at the 13q33.3 locus comprising TNFSF13B, which is implicated in memory B cell survival⁴³ (Supplementary Table 12). For HHV7, we extracted reads from UKB and calculated effect sizes as for EBV (Methods; Supplementary Fig. 5 and Supplementary Note 6). No significant Spearman’s correlations were found for the EBVread⁺ non-MHC loci or HLA alleles (Fig. 2d), although six of the non-MHC loci had P < 0.05 and a consistent direction of effect. For two of these (SLC8A1 and PTPN22), colocalization analyses indicated shared causal variants (posterior probability (H4) > 0.5; Supplementary Table 13).

We then created a case–control definition in UKB that captures viral load rather than viral susceptibility, by excluding individuals with EBV read count = 0 (that is, almost all seronegative individuals). Effect sizes from an analysis of EBV read count = 1 versus EBV read count ≥ 2 were highly correlated with those of our main GWAS (non-MHC loci: Spearman’s ρ = 0.93, P = 6.2 × 10⁻⁷; HLA alleles: ρ = 0.94, P < 2.2 × 10⁻¹⁶; Fig. 2e). Similar results were obtained in other case–control definitions within EBVread⁺ individuals and in additional comparisons of (1) EBV read count = 0 versus EBV read count = 1, and (2) female and male participants (Extended Data Fig. 6 and Supplementary Table 12).
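
The effect-size comparisons in this section rest on Spearman's rank correlation between the lead-variant effect sizes of two analyses. The sketch below implements it from first principles (average ranks for ties, then Pearson correlation of the ranks). The two effect-size vectors are hypothetical toy values, not the study's estimates.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical lead-variant effect sizes from two case-control definitions.
beta_main = [0.10, -0.05, 0.20, 0.03, -0.12]
beta_alt = [0.08, -0.02, 0.25, 0.09, -0.10]
rho = spearman(beta_main, beta_alt)
print(f"Spearman rho = {rho:.2f}")
```

Because the statistic depends only on rank order, it is insensitive to differences in the overall scale of effect sizes between two case–control definitions.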

Finally, we analysed GWAS summary statistics of four EBV antibody levels⁴⁴. Consistent with the aforementioned correlation of EBV read counts with IgG antibody levels, effect sizes of lead variants for EBVread⁺ and VCA-p18 IgG levels were strongly correlated, particularly for the HLA alleles (ρ = 0.64, P = 1.55 × 10⁻⁵; Fig. 2f). These findings suggest that the genetic associations with EBVread⁺ reflect specific EBV viral load-associated factors.

Gene-based analyses suggest an enrichment of IEI genes

We then performed gene-based analyses to capture additional biology and enable systematic downstream analyses, using EBVread⁺ summary statistics for common variants and exome sequencing data for rare variants.

Common variants were assigned to individual genes, and gene-based P values were calculated (MAGMA⁴⁵; see Methods; without MHC region). Of 63 genes that remained significant after Bonferroni correction (Supplementary Table 14), ten were located outside of genome-wide significant loci and thus represent additional candidate genes. Nine of the 63 genes were IEI genes, including four (IKZF3, NFKB1, CTLA4 and CD70) that predispose to severe clinical phenotypes post-EBV infection, including persistent EBV viraemia, EBV-associated lymphoproliferation and/or EBV-driven lymphoma (Fig. 3a and Supplementary Note 7). Formal testing using MAGMA gene set enrichment (Methods) showed that IEI genes (n = 456) were strongly enriched for association with EBVread⁺ (P = 4.66 × 10⁻⁶, beta = 0.19, s.e.m. = 0.04). When considering 14 genes that cause monogenic EBV-driven lymphoproliferative diseases¹⁵, the effect size increased (beta = 0.35, s.e.m. = 0.22, P = 0.055; Supplementary Table 14).

**Fig. 3: Characterization of non-MHC risk loci associated with EBVread⁺.**

The aggregate effect of rare variants was captured by gene-based collapsing analyses, based on exome sequencing data (minor allele frequency < 0.01; gene-based association analysis of rare variants (RVAS_gene); Methods). Twenty-eight genes within the MHC locus and one non-MHC gene (TNFRSF13B) were test-wide significant in at least one of four variant pathogenicity definitions (P_gene < 8.86 × 10⁻⁷; Methods; Supplementary Table 15). The TNFRSF13B signal was driven by p.Cys104Arg (P_{without p.Cys104Arg} = 0.087), which is associated with common variable immunodeficiency, tonsillectomy and ear surgery^46,47.

Intersecting both analyses (MAGMA and RVAS_gene, each at P < 0.01) showed 24 genes with evidence from common and rare variants (Supplementary Table 16). These included seven genes whose rare variant enrichment was driven by putative loss-of-function variants (PTPN22, GP1BA, CD226, C6orf222, ZNF284, CHD4 and HKR1), all of which are strong novel candidate genes for host control of persistent EBV infection.

Identification of candidate pathways and effector cell types

We then used the gene-based association statistics for common variants to obtain insights into effector pathways, tissues and cell types⁴⁸. Using Gene Ontology Biological Processes, we identified 30 test-wide significant pathways (Fig. 3b and Supplementary Table 17). These encompassed various immune processes, for example, T cell activation and differentiation, thus supporting the established role of T cells in EBV control⁴⁹. In expression data from 54 tissues available in GTEx v8, five (that is, spleen, whole blood, EBV-transformed lymphocytes, lung and terminal ileum) were identified as potential effector tissues (Fig. 3c). For non-blood tissues, we hypothesize that tissue-resident leukocytes are partially responsible for the observed enrichments. For blood, the enrichment was further elucidated using a gene expression dataset from peripheral blood mononuclear cells (PBMCs)⁵⁰ and the single-cell disease relevance score approach⁵¹ (scDRS; Methods). Within eight major cell types (annotation level 1), we observed significant enrichments in CD8⁺ T cells, consistent with their role in eliminating EBV-infected B cells⁴⁹, and in NK cells (Fig. 3d,e). At a more fine-grained annotation (level 2, 21 cell types; Methods), the highest average scDRS was observed in the small cell cluster annotated as NK_bright cells. Furthermore, support was generated for NK_dim and memory CD8⁺ T cells, both of which have similar enrichment P values, albeit for much larger cell numbers (Extended Data Fig. 7).

We also mapped lead variants (or proxies thereof) to cell-type-specific cis-expression QTL (eQTL) data from PBMCs⁵² (OneK1K project; Methods), and identified 18 variant–gene–cell-type associations. Most were for ERAP2, with consistent direction of effect in multiple cell types, including CD8⁺ T and NK cells. Additional cell-type-specific eQTL effects were found for CTLA4 and CMC1 in S100B-positive CD8⁺ T cells and SLC22A5 in NK cells (Supplementary Table 18).

EBVread⁺ has a polygenic architecture

We then evaluated whether an aggregated genetic risk score (GRS) improves risk prediction for EBVread⁺ compared with a baseline model (including age and sex), and is transferable across cohorts and ancestries. First, we assigned individuals from the UKB no outlier cohort (EUR) to one of three cohorts: (1) UKB serology target cohort (individuals for whom serology data were available), (2) UKB disease target cohort (individuals with EBV-associated diseases⁴), or (3) UKB base cohort (remaining individuals; Methods). In the UKB base cohort, we generated six GRSs, using either imputed HLA alleles (three GRSs: HLA all, HLA MHC-I and HLA MHC-II) or genotyped single-nucleotide polymorphisms (SNPs; three GRSs: all SNPs, SNPs in MHC and SNPs outside of MHC; Methods).

We then applied these GRSs to the UKB serology target cohort and found that the GRSs encompassing all HLA alleles (HLA all) best explained EBVread⁺ according to Nagelkerke R² (improvement over the base model: ΔR² = 0.080 ± 0.009 s.d.). HLA MHC-I and HLA MHC-II GRS, which represent uncorrelated predictors (Extended Data Fig. 8), performed similarly well when compared to each other (Fig. 4a). The three GRSs based on HLA alleles outperformed SNP-based GRSs, although the GRSs using SNPs outside of MHC (SNP wo MHC) captured independent genetic risk (Fig. 4a). We therefore proceeded with HLA all, HLA MHC-I, HLA MHC-II and SNP wo MHC, none of which differed between EBVsero⁺ and EBVsero⁻ groups (Fig. 4b) and which were positively correlated with observed EBV read counts in the serology cohort (Fig. 4c and Extended Data Fig. 8).
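
A GRS of this kind is, in its simplest form, a weighted sum of allele dosages, and model improvement can be summarized as the difference in Nagelkerke R² between a model with the GRS and the baseline model. The sketch below shows both calculations. It makes several assumptions: the weights reuse the three HLA effect sizes quoted earlier in the text purely as illustrative values (the actual GRS weights were derived in the UKB base cohort), the dosages are hypothetical, and the log-likelihoods are made-up numbers standing in for fitted logistic models.

```python
import math


def genetic_risk_score(dosages: dict, weights: dict) -> float:
    """Weighted sum of allele dosages (0, 1 or 2) over alleles with a weight."""
    return sum(weights[a] * d for a, d in dosages.items() if a in weights)


def nagelkerke_r2(loglik_null: float, loglik_model: float, n: int) -> float:
    """Nagelkerke R^2: Cox-Snell R^2 rescaled by its maximum attainable value."""
    cox_snell = 1.0 - math.exp(2.0 * (loglik_null - loglik_model) / n)
    max_cox_snell = 1.0 - math.exp(2.0 * loglik_null / n)
    return cox_snell / max_cox_snell


# Illustrative weights: effect sizes quoted in the text for three HLA alleles.
weights = {"HLA-DRB1*04:04": 0.79, "HLA-A*02:01": -0.31, "HLA-B*14:02": -0.68}
# Hypothetical individual carrying one copy each of two of the alleles.
dosages = {"HLA-A*02:01": 1, "HLA-DRB1*04:04": 1, "HLA-B*14:02": 0}
grs = genetic_risk_score(dosages, weights)

# Hypothetical log-likelihoods: intercept-only, baseline (age + sex), and
# baseline + GRS models fitted to n = 1000 individuals.
n = 1000
r2_base = nagelkerke_r2(loglik_null=-430.0, loglik_model=-425.0, n=n)
r2_grs = nagelkerke_r2(loglik_null=-430.0, loglik_model=-400.0, n=n)
delta_r2 = r2_grs - r2_base
print(f"GRS = {grs:.2f}, delta R2 = {delta_r2:.3f}")
```

The quantity `delta_r2` corresponds to the ΔR² values reported in the text, that is, the gain in explained variation from adding the GRS to the age-and-sex baseline.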

**Fig. 4: GRS analyses in UKB and AoU.**

To analyse transferability, we applied similar GRSs within the AoU no outlier cohort, which was stratified by genetic ancestry (Methods). In the EUR subcohort, which had the highest genetic similarity to the UKB base cohort, improvements in Nagelkerke R² values compared with the baseline model were similar to our results from UKB, with HLA all best explaining EBVread⁺ (ΔR² = 0.072 ± 0.002 s.d.; Fig. 4d and Extended Data Fig. 8). Similarly, HLA all showed the largest improvements in Nagelkerke R² in each of the five non-EUR ancestry groups, despite differences in absolute values (Fig. 4d). In the African (ΔR² = 0.055 ± 0.002 s.d.) and admixed American (ΔR² = 0.065 ± 0.002 s.d.) groups, predictive performance was similar to that of the AoU EUR subcohort (Fig. 4d). This demonstrates some degree of transferability for the GRS comprising all HLA alleles. In all ancestry groups, the SNP-based GRS was least predictive, but was again similar between the EUR subcohorts of UKB and AoU (Extended Data Fig. 8). These results provide evidence for a polygenic component to EBV viral load that is largely driven by the MHC region and can be transferred across ancestries when calculated based on HLA alleles.

GRSs associate with EBV-associated and novel diseases

The four selected GRSs were then applied to the UKB disease target cohorts (infectious mononucleosis, Hodgkin lymphoma, multiple sclerosis, rheumatoid arthritis, non-Hodgkin lymphoma, systemic lupus erythematosus and/or Sjögren disease; see above). Highly significant associations were found for an elevated HLA MHC-I GRS in multiple sclerosis and an elevated HLA MHC-II GRS in rheumatoid arthritis (Fig. 4e). For multiple sclerosis, this effect was attenuated when HLA-A*02:01 was excluded from the GRS (P_{HLA MHC-I} = 3.09 × 10⁻⁵, P_{without HLA-A*02:01} = 0.031). By contrast, exclusion of HLA-DRB1*04:04, which is a risk factor for rheumatoid arthritis³⁵ and was the most significant HLA allele in the EBVread⁺ GWAS, from the HLA MHC-II GRS did not attenuate the association of this GRS with rheumatoid arthritis. At P < 0.1, we also observed a lower HLA all GRS in individuals with non-Hodgkin lymphoma, and a lower HLA MHC-I GRS in rheumatoid arthritis (Fig. 4e).

We then conducted a phenome-wide association study (PheWAS) in the EUR AoU QC cohort using 1,751 PheCodes. With the exception of Sjögren disease, these PheCodes included all of the aforementioned EBV-associated diseases (Methods; Fig. 4f and Extended Data Fig. 8). At P < 0.001, the PheWAS replicated all four significant associations identified in UKB. This approach also identified novel candidate diseases associated with EBV host control: the strongest associations were found for type 1 diabetes (beta = 0.176, s.e. = 0.023 for HLA MHC-II), inflammatory bowel disease (beta = −0.14, s.e. = 0.018 for HLA all, and beta = −0.112, s.e. = 0.018 for HLA MHC-II) and hypothyroidism (beta = −0.043, s.e. = 0.008 for HLA MHC-I, and beta = 0.037, s.e. = 0.007 for SNP wo MHC; Supplementary Table 19).

Suggestive causal effects of EBVread⁺ are driven by variants in MHC region

To investigate whether EBVread⁺ as an exposure has a causal effect, we performed two-sample Mendelian randomization (2SMR; Methods) for the five diseases with strong evidence for epidemiological association (that is, multiple sclerosis, rheumatoid arthritis, Hodgkin lymphoma, non-Hodgkin lymphoma and systemic lupus erythematosus)⁴, and three diseases identified by our PheWAS (that is, inflammatory bowel disease, type 1 diabetes and hypothyroidism). For multiple sclerosis, we tested both case–control status and disease course severity (Supplementary Table 20). We found suggestive evidence for causal effects of EBVread⁺ on rheumatoid arthritis (beta_wMed = 0.192, s.e. = 0.053) and type 1 diabetes (beta_wMed = 0.620, s.e. = 0.062), which were consistent across six estimators including two that are robust to pleiotropy (Methods; Supplementary Table 21, Supplementary Fig. 6 and Supplementary Note 8). However, the effects on both outcomes were driven by variants in the MHC region (Supplementary Table 22). Attributing causality is thus problematic, given the unknown extent of pleiotropic effects of MHC variants, and the limited heritability of EBVread⁺ attributed to non-MHC variants. No evidence for an EBVread⁺ causal effect was found for the other seven tested outcomes (Supplementary Table 21) or the negative control trait (Methods).
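
To make the logic of 2SMR concrete, the sketch below combines variant-level summary statistics into a causal effect estimate. The text reports weighted-median estimates (beta_wMed); this example instead uses per-variant Wald ratios and the simpler fixed-effect inverse-variance-weighted (IVW) estimator, swapped in purely for illustration. All numbers are hypothetical toy values, not results from the study.

```python
import math


def wald_ratios(beta_exposure, beta_outcome):
    """Per-variant causal estimates: outcome effect divided by exposure effect."""
    return [by / bx for bx, by in zip(beta_exposure, beta_outcome)]


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW estimate and its standard error.

    Equivalent to a weighted regression of outcome effects on exposure
    effects through the origin, with weights 1 / se_outcome^2.
    """
    num = sum(bx * by / se ** 2
              for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome))
    den = sum(bx ** 2 / se ** 2
              for bx, se in zip(beta_exposure, se_outcome))
    return num / den, math.sqrt(1.0 / den)


# Hypothetical instruments: effects on the exposure (EBVread+) and on a disease.
beta_exposure = [0.10, 0.20, -0.15]
beta_outcome = [0.02, 0.04, -0.03]
se_outcome = [0.01, 0.01, 0.02]

ratios = wald_ratios(beta_exposure, beta_outcome)
beta_ivw, se_ivw = ivw_estimate(beta_exposure, beta_outcome, se_outcome)
print(f"Wald ratios: {[round(r, 3) for r in ratios]}")
print(f"IVW estimate = {beta_ivw:.3f} (s.e. = {se_ivw:.3f})")
```

In this toy setting every instrument implies the same ratio, so the IVW estimate equals that ratio. With real data, heterogeneity among the ratios (for example, from pleiotropic MHC variants) is what motivates the robust estimators mentioned in the text.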

Discussion

This study is one of the first to demonstrate that GS-based EBVreads are a highly specific proxy for elevated EBV viral load in blood cells. Using this measure, we identified associations between EBVread⁺ and several non-genetic factors, including current smoking as well as sex. Smoking is also a risk factor for several EBV-associated diseases^53,54,55, although the underlying mechanisms remain largely unknown. Current smoking affects both adaptive and innate immunity, with the latter normalizing upon smoking cessation⁵⁶. This suggests an interaction of the innate immune system with current smoking status in EBV host control. The increased prevalence of EBVread⁺ in male sex encourages investigations into sex-specific factors, especially in the light of the contrary female predisposition of autoimmune diseases, including multiple sclerosis⁵⁷.

We found that EBVread⁺ is polygenic and characterized by a major (and largely equal) contribution of alleles at MHC-I and MHC-II, which supports previous observations that CD8⁺ cytotoxic T and NK cells⁴⁹ (MHC-I) as well as CD4⁺ helper T cells^49,58 (MHC-II) are important in EBV control. Some genes implicated by common variants underlie monogenic IEIs with increased susceptibility to severe EBV infections, often associated with a pronounced risk of EBV-associated diseases including lymphoma (for example, CD70)^59,60. Our results thus probably harbour novel candidate genes for IEIs, such as CD226, which is a member of the immunoglobulin superfamily that contributes to NK and CD8⁺ T cell regulation⁶¹ and impairs CD8⁺ T cell response in chronic HIV when downregulated⁶².

Using genetically predicted EBV viral load, we identified genetic overlap with multiple sclerosis and rheumatoid arthritis. Although EBV is a prerequisite for multiple sclerosis, HLA-A*02:01, which reduces multiple sclerosis risk, was among our most significant findings and was associated with better EBV control. By contrast, no consistent effect on EBVread⁺ was found for the major multiple sclerosis risk allele HLA-DRB1*15:01 (ref. ³⁶), suggesting a pathomechanism distinct from EBV viral load control. This could include a stronger antibody response through preferential EBV peptide presentation^63,64, expansion of specific B cell subsets⁶⁵ or molecular mimicry. In support of this, detailed analysis of HLA-DRB1*15:01 (Extended Data Fig. 9) found that the strongest effect size was with antibody levels of IgG EBNA-1, in line with previous findings that antibodies to EBNA-1 cross-react with the central nervous system protein GlialCAM⁸. In rheumatoid arthritis, alleles at MHC-I and MHC-II were associated with lower and higher EBV viral load, respectively. This suggests a specific dysregulation of the immune response to EBV, rather than a generic loss of EBV immune control. Although further research is required to determine whether the effect of EBV viral load is causal, the 2SMR results support this hypothesis. Our analyses also revealed a genetic overlap between EBV control and type 1 diabetes, inflammatory bowel disease or ulcerative colitis, and hypothyroidism, suggesting that the pathophysiological relevance of EBV host control may be broader than currently assumed.

Our study had several limitations. First, owing to the standard depth of human GS, most individuals had an EBV read count of zero, and many had an EBV read count of exactly 1. For statistical analyses, we binarized the phenotype into low or high EBV viral load, based on absolute EBV read count numbers, and compared EBV read count 0 versus 1 and higher. Given the limited resolution, some individuals with presumed high viral load might actually have low viral load. However, this potential misclassification is unlikely to have impacted the overall conclusions, which are supported by our sensitivity analyses and are similar to those of a recent study, which used a different definition for increased viral load⁶⁶. If specific quantitative measures or deeper GS data become available, statistical power will probably increase. Second, unobserved factors may have confounded associations with EBVread⁺, although we mitigated this risk by replicating findings across biobanks. Third, despite the partial transferability of HLA-based GRS across ancestries, the discovery analyses mainly involved EUR individuals. This might have influenced the identity of associated HLA alleles, and might limit the generalizability of the findings with respect to different EBV strains and EBV-associated diseases, which vary in terms of global distribution and prevalence. Thus, replication of the GWAS findings and downstream analyses in non-EUR ancestries is required. Finally, given the biological complexity of the MHC region and current challenges in HLA allele imputation⁶⁷, some HLA associations might have been missed or mimicked by extended regions of LD.

This work has established EBV viral sequence traces from blood-based human GS data as the basis for future investigations into functional, mechanistic and epidemiological aspects of persistent EBV infection. Quantification of viral load using host GS data could be extended to other human pathogens, and facilitate investigation of interactions between chronic infections and the host immune system in health and disease.

Methods

Analysis of UKB data

UKB data, accessed based on application ID 135122, were used as the primary discovery cohort, unless stated otherwise (Supplementary Note 1). Individual-level data analyses were conducted within the UKB Research Analysis Platform (RAP).

Extraction of high-quality EBV reads

All individuals with available GS data (n = 490,293)²⁴ were included in the initial stage of analysis (UKB cohort). During the course of the project, 208 individuals (0.04%) withdrew their consent from UKB, which explains the slightly lower sample counts in some follow-up analyses (n = 490,085). DNA extraction, library preparation, sequencing and alignment have been described elsewhere^68,69 and are summarized in Supplementary Note 2. Reads mapping to the EBV genome (NC_007605.1) were accessed in CRAM files (field 24048), which had been previously generated by aligning fastq data to a GRCh38 graph genome (including the contig chrEBV), and were extracted using samtools (v1.20). Only read pairs in which both the forward and the reverse read mapped to NC_007605.1 were retained. Within pairs, reads were removed if they had more than 20 soft-clip bases, fewer than 120 bases matching the reference or were duplicates (see Supplementary Note 2). Finally, if at least one read of a read pair remained, this was counted as one EBV read. We also generated a similar dataset for HHV7 for the purpose of comparison, as described in Supplementary Note 6.
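The read-level filters described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline (which used samtools); the function names are hypothetical, and "bases matching the reference" is approximated here by the summed lengths of CIGAR `M` and `=` operations, which is an assumption.

```python
import re

# One CIGAR operation: a length followed by an operation code (SAM format).
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

def cigar_stats(cigar):
    """Return (soft_clipped_bases, aligned_bases) for a CIGAR string."""
    soft = aligned = 0
    for length, op in CIGAR_OP.findall(cigar):
        if op == "S":
            soft += int(length)
        elif op in "M=":  # assumption: M/= approximate 'bases matching the reference'
            aligned += int(length)
    return soft, aligned

def read_passes(cigar, is_duplicate, max_soft_clip=20, min_aligned=120):
    """Keep a read unless it is a duplicate, has >20 soft-clipped bases or <120 aligned bases."""
    soft, aligned = cigar_stats(cigar)
    return (not is_duplicate) and soft <= max_soft_clip and aligned >= min_aligned

def count_ebv_reads(pairs):
    """Count read pairs with at least one passing mate.

    pairs: iterable of ((cigar1, dup1), (cigar2, dup2)); both mates are assumed
    to have mapped to the EBV contig already.
    """
    return sum(1 for m1, m2 in pairs if read_passes(*m1) or read_passes(*m2))
```

Each retained pair contributes one EBV read, regardless of whether one or both mates pass the filters.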

Quality control

We calculated the fraction of individuals with EBV reads per library preparation plate (field 32056). Fifty-one plates had excessively high proportions of EBVread⁺ individuals, probably due to contamination, and were excluded (Extended Data Fig. 1 and Supplementary Note 1). We also excluded individuals with low GS data quality (field 32064), sex chromosome aneuploidies (array-based genotyping data, field 22019) or discrepancies between reported and genetic sex (fields 31, 22001), resulting in the UKB QC cohort. For analyses limited to EUR ancestry, individuals were selected based on UKB field 22006. Applying a high-quality set of common genotyped variants for principal component analysis and for regenie step 1 (Supplementary Note 9) led to the exclusion of an additional 180 individuals (Supplementary Note 2), leaving n = 403,014 individuals for analyses (UKB EUR cohort).

We also generated a subcohort of the UKB QC cohort, comprising individuals for whom serology measurements were available (UKB serology cohort; n = 9,281, based on data field 23053). In this cohort, EBV seropositivity was defined based on the detection of at least two out of four EBV-related IgG antibodies (EA-D, ZEBRA, EBNA-1 and VCA-p18), as previously suggested^44,70.
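As a small illustration of the seropositivity rule, the sketch below assumes that each antibody measurement has already been dichotomized into positive or negative (the antigen-specific cut-offs are not given here); the function name is hypothetical.

```python
# The four EBV-related IgG antibodies named in the text.
EBV_ANTIGENS = ("EA-D", "ZEBRA", "EBNA-1", "VCA-p18")

def is_ebv_seropositive(antibody_positive, min_positive=2):
    """Return True if at least `min_positive` of the four antibodies are positive.

    antibody_positive: dict mapping antigen name -> bool (already dichotomized;
    missing antigens are treated as negative, which is an assumption).
    """
    n_positive = sum(bool(antibody_positive.get(a, False)) for a in EBV_ANTIGENS)
    return n_positive >= min_positive
```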

Processing of covariates

For individuals of the UKB EUR cohort, potentially important confounders of EBV read detection were retrieved based on ref. ⁷¹, including information on sequencing, technical aspects, blood composition and demographics. On the basis of the SNOMED associations identified in the AoU cohort, we additionally considered smoking status, pack years of smoking, number of cigarettes smoked per day (or previously smoked in cigar and/or pipe smokers) and number of weekly alcoholic drinks. Extracted values were processed to finally obtain transformed values for each covariate (Supplementary Note 4). Correlated covariates were identified by calculating Pearson correlations (one of each pair removed if correlation > 0.7; n = 4). Together with covariates age × sex and age × age, this resulted in 28 potential covariates, which were further reduced to a final set of 18 covariates by forwards and backwards selection with Bayesian information criterion (Supplementary Table 4 and Supplementary Note 4).
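The correlation-based pruning step can be sketched as follows. This is an illustration only: it uses the absolute Pearson correlation and keeps the first covariate of each correlated pair, both of which are assumptions, and the function names are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def prune_correlated(covariates, threshold=0.7):
    """Greedily keep covariates whose |r| with every kept covariate is <= threshold.

    covariates: dict mapping covariate name -> list of values (insertion order
    decides which member of a correlated pair is kept; this is an assumption).
    """
    kept = []
    for name, values in covariates.items():
        if all(abs(pearson(values, covariates[k])) <= threshold for k in kept):
            kept.append(name)
    return kept
```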

Immunosuppressive and EBV-associated conditions

Immunosuppressed individuals were identified as those reported with (1) taking immunosuppressive drugs (including glucocorticoids) at the time of visiting the UKB assessment centre (verbal interview, field 20003; n = 9,681), or (2) HIV infection (UKB fields 130204, 130206, 130208, 130210 and 130212; n = 230). Individuals affected by EBV-associated diseases were identified based on self-reporting in the assessment centre, International Statistical Classification of Diseases and Related Health Problems, 10th revision codes or codes for operative procedures (OPCS4). Full lists are given in Supplementary Table 23.

Association analyses

For common variants and HLA alleles, the main GWAS on EBVread⁺ was conducted with two-step regenie (v3.2.4)⁷², on related individuals of the UKB no immune supp. cohort. Common variants had been previously imputed using the Haplotype Reference Consortium and UK10K haplotype resource²² (UKB field 22828; 29,865,259 variants with info-score > 0.8; 481 individuals lacked imputation data). Individual HLA alleles were obtained from field 22182, based on previous imputation with HLA*IMP:02 (ref. ⁷³). Variants were included if they had a predicted minor allele count of ≥ 25. Non-classical HLA alleles were not included due to the lack of established standards for imputing these alleles. For compatibility with regenie step 2, the provided dosages were converted to plink2 pgen files. In the statistical analysis, the 18 selected covariates and 20 principal components were used as covariates, and saddle point approximation was applied to account for case–control imbalance (see Supplementary Fig. 7 and Supplementary Notes 9 and 10).

For conditional analysis of HLA alleles, we applied a forwards-stepwise regression approach to identify HLA alleles that are independently associated with the trait, using the following procedure: (1) an initial single-variant test for all HLA alleles, as described above for common variants and HLA alleles; and (2) iterative conditioning, in which we (i) identified the allele with the lowest P value from the previous step; (ii) added this allele to the set of alleles to condition on; and (iii) reran the association analysis conditioned on this set (regenie v3.2.4). Step 2 was repeated until the most significant allele in the current iteration had a P value greater than the commonly used genome-wide significance threshold of 5 × 10⁻⁸.
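The control flow of this forwards-stepwise procedure can be sketched in Python. The association test itself is abstracted as a hypothetical callback, `assoc_test(allele, conditioned_on)`, which in practice would wrap a conditioned regenie run; nothing here reproduces regenie's interface.

```python
def forward_stepwise(alleles, assoc_test, p_threshold=5e-8):
    """Return alleles selected by forwards-stepwise conditional analysis.

    assoc_test(allele, conditioned_on) -> P value of `allele` when the model is
    conditioned on the tuple `conditioned_on` (hypothetical callback).
    """
    selected = []
    remaining = list(alleles)
    while remaining:
        # Test every remaining allele, conditioning on all alleles selected so far.
        pvals = {a: assoc_test(a, tuple(selected)) for a in remaining}
        best = min(pvals, key=pvals.get)
        # Stop once the top allele no longer passes genome-wide significance.
        if pvals[best] > p_threshold:
            break
        selected.append(best)
        remaining.remove(best)
    return selected
```

The first iteration, with an empty conditioning set, corresponds to the initial single-variant test.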

For epistatic analyses, the lead variants of the three top non-MHC loci for EBVread⁺ were tested for interaction with conditionally independent HLA alleles, based on data from non-related individuals of the UKB no immune supp. cohort (n = 304,523 with complete data). Likelihood-ratio tests (LRTs; 1 d.f.) were used, comparing an additive logistic regression model with a model that additionally included an interaction term between the non-MHC SNP and the HLA allele (see Supplementary Note 11). LRT P values were Bonferroni corrected for multiple testing.
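For illustration, the 1-d.f. likelihood-ratio test and the Bonferroni correction can be written in a few lines of Python. The sketch assumes that the log-likelihoods of the additive model and of the model with the interaction term have already been obtained from fitted logistic regressions; the function names are hypothetical.

```python
import math

def lrt_pvalue_1df(loglik_null, loglik_alt):
    """P value of a likelihood-ratio test with 1 degree of freedom.

    For a chi-square variable with 1 d.f., P(X > x) = erfc(sqrt(x / 2)).
    """
    stat = max(0.0, 2.0 * (loglik_alt - loglik_null))
    return math.erfc(math.sqrt(stat / 2.0))

def bonferroni(pvalues):
    """Bonferroni-adjusted P values, capped at 1."""
    m = len(pvalues)
    return [min(1.0, p * m) for p in pvalues]
```

Here `loglik_null` belongs to the additive model and `loglik_alt` to the model that additionally contains the SNP × HLA allele interaction term.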

For rare variants, RVAS_gene was performed as described for common variants (identical phenotypes and covariates, same procedure for regenie step 1), but based on exome variants and annotations as provided by the UKB⁷⁴ (field 23158; Supplementary Note 9). This resulted in a slight reduction of the overall sample number (based on no immune supp. cohort; n = 54,259 EBVread⁺ cases and n = 293,834 EBVread⁻ controls). For regenie step 2, SKAT-O was used as a test (parameter: ‘--vc-tests skato’) and we restricted the analysis to rare variants with an alternative allele frequency below 1% (parameter ‘--vc-maxAAF 0.01’). The following definitions of variant pathogenicity (masks) were used: (1) M1: predicted loss-of-function variants; (2) strong coding: variants from (1) and likely deleterious missense variants; (3) medium coding: variants from (2) plus possibly deleterious missense variants; and (4) all coding: variants from (3) plus likely benign missense variants (Supplementary Note 9). Overall, this analysis comprised rare variants in 18,796 protein-coding genes.

Additional case–control definitions and subcohorts

In addition to the main analysis of EBVread⁺, in which we compared individuals with EBV reads (1–18) to those without any EBV reads (0), we generated modified case–control definitions. These included GWAS analyses of 0 versus 1 read counts, 0 versus 2–18 read counts, and a ‘within EBVread⁺’ analysis comparing individuals with 1 read count versus 2–18 read counts. We also performed sex-restricted analyses, that is, on male or female participants only. Sample numbers are provided in Supplementary Table 12.
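The alternative case–control definitions can be expressed as a simple mapping from EBV read count to phenotype. The sketch below is illustrative; the definition labels are hypothetical, and treating counts above 18 as excluded is an assumption based on the read-count range stated in the text.

```python
def case_control_status(read_count, definition="main", max_count=18):
    """Return 1 (case), 0 (control) or None (excluded) for an EBV read count."""
    if read_count < 0 or read_count > max_count:
        return None  # assumption: counts outside 0-18 are not analysed
    if definition == "main":          # 0 versus 1-18
        return int(read_count >= 1)
    if definition == "0_vs_1":        # 0 versus exactly 1
        return {0: 0, 1: 1}.get(read_count)
    if definition == "0_vs_2plus":    # 0 versus 2-18
        return None if read_count == 1 else int(read_count >= 2)
    if definition == "within_pos":    # 1 versus 2-18 (within EBVread+)
        return None if read_count == 0 else int(read_count >= 2)
    raise ValueError(f"unknown definition: {definition}")
```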

Analysis in the AoU cohort

We used release 8 (C2024Q3R3) of the AoU Research Program, which included array and GS data from blood-based DNA samples of 365,931 individuals (AoU cohort). The AoU resource, including data generation, processing and quality control of genomic data, is described in ref. ²³ and accompanying documents.

Generation of EBV read data and cohort from GS data

First, EBV reads were extracted from CRAM files as described for UKB participants. At the individual level, we restricted our analyses to unrelated individuals with plausible time points of DNA sampling (between 11:00 and 23:59), without mismatch between reported and genetic sex and who were not flagged as population outliers (‘flagged samples’) in accompanying documents (AoU QC cohort, n = 336,123). For population-specific analyses, precomputed genetically predicted population backgrounds were used, which assigned each individual to one of six continental populations (Extended Data Fig. 3; see ‘Genomic research data quality report’).

Phenome-wide association analysis of EBVread⁺

We retrieved individuals from the AoU QC cohort who had electronic health record data available. For SNOMED concept IDs annotated in 250 or more individuals (n = 11,111), associations with the presence of EBV reads were tested as follows: we first applied logistic regression models with the presence of EBV reads as outcome and the presence of a SNOMED ID as predictor, including age, sex, age × sex and 16 precomputed principal components as covariates. In a second step, we also included HIV and smoking status as covariates (see Supplementary Note 3). P values were calculated using LRT.

Replication of associated loci and HLA alleles

Details of the variant sets, the generation of principal components, and the imputation and quality control of HLA alleles are described in Supplementary Fig. 8 and Supplementary Notes 3 and 12. Association analyses were performed in the EUR subcohort of AoU using regenie (v2.0.2), but without using step 1. We selected similar covariates as in the analysis within UKB, that is, sex, age, age × sex, mean sequencing coverage, hour as well as the week and time of biosample collection, nicotine usage, sequencing site and 20 principal components. However, certain covariates (including blood count traits) are not directly available in AoU and therefore could not be included (see Supplementary Table 4), which prevented a meta-analysis between UKB and AoU (Supplementary Note 4).

Validation cohorts

Two non-UKB/non-AoU cohorts were used for validation (Supplementary Note 13). For each of them, EBV reads were extracted from short-read GS data, in analogy to the analysis in UKB:

(1) Validation 1, qPCR. This cohort was recruited to study ACE inhibitor-induced angiooedema and consisted of 110 participants for whom GS data and DNA samples were available (blood or saliva derived²⁵). To quantify EBV viral load, qPCR was performed on 72 individuals, including all EBVread⁺ and a random subset of EBVread⁻ individuals, using the clinically validated GeneProof EBV PCR Kit (TaqPath Menu, Applied Biosystems; four technical replicates per sample), with the target gene EBNA1.

(2) Validation 2, qPCR and RNA-seq. Partially overlapping subsets of JCTF participants with SARS-CoV-2 infection^26,75 were used for qPCR for EBV viral load and reanalysis of RNA-seq data (n = 1,010), respectively. GS was obtained from whole-blood-derived DNA. For qPCR (n = 262 individuals, 3 technical replicates each), an in-house developed qPCR assay was run, targeting EBNA1 (Supplementary Note 13; sequences available on request). Full-length RNA-seq data were reanalysed for the expression of 94 EBV genes. In short, reads were aligned against the GRCh38 reference genome, which included the EBV sequence NC_007605.1, and EBV transcripts were quantified using RSEM (v1.3.0). Given the high prevalence of EBVread⁺ in the JCTF subcohort, we investigated whether a more severe COVID-19 disease course drives EBVread⁺, but did not observe a strong effect (Supplementary Table 5 and Supplementary Note 5).

Genetic risk loci associated with EBVread⁺

Annotation of non-MHC risk loci

Regional association plots were generated with LocusZoom⁷⁶ (see Supplementary Fig. 9 and Supplementary Note 14). Genome-wide significance was defined as P < 5 × 10⁻⁸, and independent risk loci were defined in FUMA (v1.6.3)⁴⁸, based on 1000Gv3 (EUR population; r² threshold of 0.6) lead SNPs (merging distance of 250 kb). For each locus, we reported (1) closest gene (based on distance of lead SNP to the transcription start site); (2) linkage disequilibrium genes (that is, genes located within the associated region, defined through variants with r² > 0.2 to the lead variant); (3) eGenes from GTEx (based on Adult GTEx v10, with genome-wide significant single-tissue eQTL effects (P < 5 × 10⁻⁸) in any tissue); and (4) V2G scores from Open Targets (v22.10; based on a cut-off > 0.1). To identify pleiotropic effects of lead variants, we retrieved from Open Targets (v22.10) all traits at P < 0.005 that were reported in either GWAS Catalog, UKB or FinnGen. To identify potential targets for drug repurposing, approved drugs (clinical phase IV) targeting the identified genes were retrieved from Open Targets (v25.3).

To investigate potential regulatory effects on transcription in specific blood cell types, lead variants (or proxies thereof; r² > 0.7 based on 1000Gv3, EUR subset) were retrieved from the OneK1K dataset⁵², and we reported eQTLs with FDR < 0.05 in the original dataset.

Generation of credible SNP sets

Fine mapping for each non-MHC locus was performed with SuSiE (sum of single effects regression)³³, using a 1-Mb window size (except for 12q24.12_SH2B3 (3 Mb) and 5q31.1_SLC22A5 (5 Mb) due to extended local linkage disequilibrium). Linkage disequilibrium matrices were generated from the imputed genotype data of the unrelated UKB EUR cohort (see above; n = 339,539; without principal component filter) using plink2 (v2.0.0-a.6). Coding variants within the credible SNP sets (cumulative PIP: 0.95) were annotated using Ensembl Variant Effect Predictor⁷⁷ (VEP; release 113), ClinVar (version June, 2023)⁷⁸ and AlphaMissense prediction scores⁷⁹.

Correlation of effect sizes

We retrieved association statistics for lead variants at the 27 genome-wide significant non-MHC risk loci as well as for 54 conditionally independent HLA alleles, from additional GWAS. These included four different case–control definitions based on EBV read counts and female-only or male-only GWAS (see above), as well as from three external datasets: memory B cell absolute counts (GCST90001407 (ref. ⁴²); no MHC data available) and EBV antibody titres⁴⁴. We additionally calculated effect sizes at these loci for HHV7read⁺ (Supplementary Note 6) and recalculated effect sizes for the main EBVread⁺ GWAS (0 versus 1–18) using different sets of covariates (Supplementary Note 4). Variants in linkage disequilibrium with the lead variant were used if they increased the overlap between datasets. We then calculated the correlation of effect sizes (betas) using Spearman’s correlation for non-MHC risk variants as well as HLA alleles. For HHV7, we investigated loci with potentially shared causal variants using coloc (v5.2.3)⁸⁰ in R (v4.4.2).

Gene-level analyses

Gene-based association testing as well as enrichment analyses were conducted using MAGMA (v1.08)⁴⁵, using default settings unless stated otherwise. Variants were assigned to 19,736 genes using the MAGMA gene boundaries Ensembl v102 file (excluding the extended MHC region as previously suggested⁷¹ (25–36 Mb)), and a window of 10 kb upstream and 1.5 kb downstream. Gene sets for IEIs were defined based on literature¹⁵ (n = 14 genes) or the IEI classification (available at https://iuis.org/committees/iei/, accessed 6 May 2025; n = 456 genes available in our data). Gene ontology biological processes (n = 7,743 terms) and tissue types (n = 54, GTEx v8) were provided by FUMA (v1.6.3).

Cell-type identification was performed using scDRS⁵¹ (v1.0.3) and single-cell RNA-seq data from the 1M-scBloodNL project, published by the sc-eQTLGen consortium⁵⁰ (samples processed with 10x Genomics (v3); broader level of cell-type annotations with 10 cell types; see Supplementary Note 15). Following data processing using the Seurat package (v5.2.1) in R (v4.3.2), 37,033 cells annotated to 8 cell types remained for the scDRS analysis. The top 1,000 EBVread⁺ MAGMA genes and their z-scores were used as weights in the scDRS analysis, with otherwise default parameters. Subsequent group analyses (that is, cell-type association and heterogeneity) were conducted with default parameters. Multiple testing correction of P values for the number of cell types was performed using the Benjamini–Hochberg procedure.
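As a reminder of how the Benjamini–Hochberg adjustment works, a minimal Python sketch is given below. It is a generic implementation for illustration (the function name is hypothetical), not the code used in the scDRS analysis.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

With one P value per cell type, `m` is the number of cell types tested.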

GRS analyses in UKB

Analysis of polygenic contribution

To study the joint contribution of common variants associated with EBVread⁺ to EBV-associated diseases within EUR individuals of the UKB no outlier cohort, we generated an independent base cohort plus additional target cohorts, which encompassed (1) individuals with EBV-associated diseases (see results, data fields given in Supplementary Table 23), or (2) individuals for whom serology data were available. We additionally removed all individuals that were related to any other individual of the target or the base cohort. Individuals of the target cohorts were required to pass all filters applied to the UKB no outlier cohort, except that individuals within the top 1% EBV read counts were kept.

For SNP-based GRS, common variant association analysis was performed within the base cohort as described above, except that only autosomal and genotyped variants with a minor allele count > 25 were used for regenie step 2 (n = 739,066). Polygenic risk scoring was performed on non-ambiguous SNPs using PRS-CS (v1.0.0) in combination with a linkage disequilibrium matrix derived from EUR individuals of the 1000 Genomes Project⁸¹. To generate separate GRS for the MHC regions and the non-MHC regions, the summary statistics generated using the base dataset were split to only contain the required regions (that is, MHC region and non-MHC regions). Scores of individuals of the target cohort were obtained using the score function of plink (v1.90b6.21). For HLA allele-based GRS, GRS based on imputed HLA alleles were calculated by fitting a multivariable logistic regression model to the base dataset, where EBVread⁺ was the outcome. As predictors, we used the 18 covariates and the 20 principal components (see above), plus 178 HLA alleles that had a minor allele frequency > 0.1%. After model fit, the coefficient estimates of the 178 HLA alleles were retrieved. Scores for individuals in target cohorts were generated by multiplying HLA allele dosages with coefficient estimates and summing up these values. To obtain MHC-I-specific or MHC-II-specific scores, we calculated the GRS using the same coefficient estimates, but considered only the HLA alleles belonging to the respective MHC class. All risk scores were normalized to means of 0 and standard deviations of 1 within the combined target cohorts.
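The HLA allele-based scoring step amounts to a weighted sum of allele dosages followed by standardization. The Python sketch below illustrates this; the function names are hypothetical, and any coefficient values passed in by a user of this sketch would come from the fitted base model, which is not reproduced here. The population standard deviation is used for normalization, which is an assumption.

```python
import statistics

def hla_grs(dosages, coefficients, alleles=None):
    """Weighted sum of HLA allele dosages.

    dosages: dict allele -> dosage (0-2) for one individual.
    coefficients: dict allele -> coefficient estimate from the base model.
    alleles: optional subset (e.g. MHC-I alleles only) for class-specific scores.
    """
    if alleles is None:
        use = coefficients
    else:
        use = {a: coefficients[a] for a in alleles if a in coefficients}
    return sum(beta * dosages.get(a, 0.0) for a, beta in use.items())

def standardize(scores):
    """Normalize scores to mean 0 and standard deviation 1."""
    mean = statistics.fmean(scores)
    sd = statistics.pstdev(scores)  # assumption: population standard deviation
    return [(s - mean) / sd for s in scores]
```

Class-specific scores reuse the same coefficients and only restrict the set of alleles that enter the sum.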

Evaluation of GRS performance

To evaluate GRS performance, we used logistic regression models, where the group membership was the outcome (for example, EBVread⁺ versus EBVread⁻ or serology cohort versus a cohort of individuals with EBV-associated disease). As predictors, we used age, sex, age × sex, 20 principal components (base model) or the predictors of the base model as well as the respective GRS (GRS model). We then calculated Nagelkerke R² for the base and the GRS models, where variability of Nagelkerke R² estimates was evaluated using bootstrapping (n = 1,000). To test for statistical significance, we compared base models and GRS models using LRT.
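For reference, Nagelkerke R² can be computed from model log-likelihoods as sketched below. The sketch assumes that the log-likelihoods of an intercept-only model and of the fitted model (base or GRS model) are already available; the function name is hypothetical, and the bootstrap step is not shown.

```python
import math

def nagelkerke_r2(loglik_intercept_only, loglik_model, n):
    """Nagelkerke R^2 from log-likelihoods of a logistic regression.

    Cox-Snell R^2 = 1 - exp(2 * (LL0 - LL1) / n), rescaled by its maximum
    attainable value, 1 - exp(2 * LL0 / n).
    """
    cox_snell = 1.0 - math.exp(2.0 * (loglik_intercept_only - loglik_model) / n)
    max_r2 = 1.0 - math.exp(2.0 * loglik_intercept_only / n)
    return cox_snell / max_r2
```

The contribution of a GRS can then be read off as the difference between the values for the GRS model and the base model.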

GRS analyses in AoU

Transferability of GRS

For GRS analyses across biobanks and populations, we used our EBVread⁺ summary statistics from the UKB no immune supp. cohort as the base dataset. SNP-based GRS were calculated based on 1,509,024 genotyped variants within AoU, which had a Hardy–Weinberg equilibrium P ≥ 1 × 10⁻¹⁰ and a variant-level missingness < 0.05 in each continental ancestry. Polygenic risk scoring was performed using PRS-CS and plink (v1.9.0-b.7.7) as described above. To obtain HLA-based GRS, we used coefficient estimates of the 178 HLA alleles from the UKB (see above) and multiplied them with the estimated dosage for each HLA allele in AoU. Mapping of HLA allele names between HLA*IMP:02 and HLA-TAPAS was performed manually, which resulted in a successful mapping for 166 of 178 alleles (no clear mapping for one MHC-I allele and 11 MHC-II alleles).

Phenome-wide association of EBVread⁺ GRS with PheCodes

We used the software package PheTK (v0.1.47)⁸² to assign PheCodes (v1.2) to individuals of the AoU QC cohort (EUR subset, n = 189,658). Individuals were considered as having a certain PheCode if the PheCode was annotated at least twice to the individual, as suggested by PheTK. We used logistic regression models with the presence of a PheCode as outcome and the respective GRS as predictor. Age, sex, age × sex and 20 population-specific principal components were used as covariates. P values were calculated using LRT comparing models with and without the respective GRS. To comply with AoU publishing guidelines, we only report PheCodes annotated to more than 20 individuals, do not supply count-related data, and give proportions only when all underlying groups contain more than 20 individuals. In total, 1,751 PheCodes were compliant with these parameters.
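The rule that a PheCode must be annotated at least twice to count as present can be illustrated with a short Python sketch. This does not use or reproduce the PheTK interface; the function name and the input layout (one tuple per annotation) are assumptions.

```python
from collections import Counter

def phecode_cases(records, min_count=2):
    """Map each PheCode to the set of individuals with >= min_count annotations.

    records: iterable of (person_id, phecode) tuples, one per annotation
    (hypothetical input layout).
    """
    counts = Counter(records)
    cases = {}
    for (person, code), n in counts.items():
        if n >= min_count:
            cases.setdefault(code, set()).add(person)
    return cases
```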

2SMR analysis

Selection of outcome traits

We retrieved publicly available summary statistics for EUR ancestry cohorts for (1) known EBV-associated diseases: multiple sclerosis case–control⁸³, multiple sclerosis severity⁸⁴, Hodgkin disease⁸⁵, non-Hodgkin lymphoma⁸⁵, systemic lupus erythematosus⁸⁶ and rheumatoid arthritis⁸⁷; and (2) candidate diseases based on significant PheWAS results: hypothyroidism⁸⁵, type 1 diabetes⁸⁸ and inflammatory bowel disease⁸⁹. Of note, none of the nine outcome GWAS included samples from UKB. We also used ‘red hair colour’⁹⁰ (including UKB) as a negative control outcome. Summary statistics were retrieved from the GWAS Catalog except for multiple sclerosis severity, where summary statistics were shared by the authors. Further details are provided in Supplementary Table 20. Analyses were performed in R (v4.5.0) using the packages ieugwasr (v1.0.3) and TwoSampleMR (v0.6.15)^91,92.

Selection of instrumental variables

First, we applied quality control to the exposure and outcome GWAS summary statistics, retaining autosomal and non-duplicate variants with minor allele frequency > 0.01 and info-score > 0.8. Linkage disequilibrium-independent genome-wide significant variants from the GWAS on EBVread⁺ (the exposure) were identified using linkage disequilibrium clumping (ld_clump function of the ieugwasr package, standard parameters) and harmonized with the outcome GWAS summary statistics. The remaining genome-wide significant variants of the exposure showed no indication of weak-instrument bias (I² statistic = 0.99). We further used Steiger filtering⁹² to exclude potentially invalid instruments (that is, variants showing stronger associations with the outcome than with the exposure). Together, this resulted in a reduction of the number of variants available for 2SMR.

2SMR

2SMR was performed using four methods⁹³: the inverse variance-weighted estimator, MR-Egger, weighted median and weighted mode. For traits in which the exposure–outcome associations were nominally significant in all four estimators and reached test-wide significance (P < 0.05/9) in two out of four (Supplementary Table 20), we applied the outlier-robust and pleiotropy-robust estimators MR-RAPS⁹⁴ and MR-PRESSO⁹⁵. Outcomes that were also significant in these two additional tests were then subjected to further sensitivity analyses, specifically heterogeneity (Cochran’s Q statistic⁹⁶), pleiotropy tests (for example, MR-Egger intercept⁹⁷) and leave-one-out analyses. For these outcomes, we also performed 2SMR after excluding variants in the MHC region.
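To make the first of these estimators concrete, the sketch below implements a fixed-effect inverse variance-weighted estimate from per-variant summary statistics, together with the Bonferroni threshold for nine outcomes. It is a textbook illustration with hypothetical function names, not the TwoSampleMR implementation used in the analysis.

```python
import math

def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse variance-weighted causal estimate and standard error.

    Each variant contributes the ratio beta_outcome / beta_exposure, weighted by
    beta_exposure^2 / se_outcome^2.
    """
    weights = [bx * bx / (se * se) for bx, se in zip(beta_exposure, se_outcome)]
    ratios = [by / bx for bx, by in zip(beta_exposure, beta_outcome)]
    total = sum(weights)
    beta = sum(w * r for w, r in zip(weights, ratios)) / total
    se = math.sqrt(1.0 / total)
    return beta, se

def bonferroni_threshold(alpha=0.05, n_outcomes=9):
    """Test-wide significance threshold for the number of outcomes tested."""
    return alpha / n_outcomes
```

The other estimators (MR-Egger, weighted median, weighted mode, MR-RAPS and MR-PRESSO) relax different assumptions about pleiotropy and are not sketched here.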

Ethics declaration

This study used de-identified data from the UKB and AoU, which were accessed through the respective computing platforms. UKB has approval from the North West Multi-centre Research Ethics Committee (MREC) as a Research Tissue Bank. This approval means that researchers do not require separate ethical clearance and can operate under the Research Tissue Bank approval. The data collection of the AoU Research Program was conducted under centralized Institutional Review Board (IRB) approval, with informed consent being obtained from the participants. Further ethical approvals were obtained from the Ethics Committee of the Medical Faculty Bonn (no. 101/16; for analysis of validation cohort 1) and from the ethics committees of the affiliated institutes (Keio IRB approval 20200061, Osaka University IRB approval 734-14 and University of Tsukuba IRB approval H29-294) for the JCTF.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All genetic and phenotype data from the biobanks are available upon application and approved data access from the UKB study and AoU projects. All interested readers will be able to access the data in the same manner that the authors did, including usage of the UKB Research Analysis Platform and AoU workbench environments for the analysis of de-identified individual-level data. GWAS summary statistics are available through the GWAS Catalog (main EBVread⁺ GWAS: GCST90809298; GWAS for additional case–control definitions: GCST90809299–GCST90809306). The main EBVread⁺ GWAS is also available at LocusZoom (https://my.locuszoom.org/gwas/968885/?token=b74ac20f6ad94a88a5ea27b6ac214645). All additional data are either provided in Supplementary Tables or through Zenodo⁹⁸. Data access for the two validation cohorts is described in their respective original articles^25,26. Complementary data used for secondary analyses were obtained from: OneK1K (https://onek1k.org/), eQTLgen 1M-scBloodNL (https://www.eqtlgen.org/sc/datasets/1m-scbloodnl-dataset.html), GTEx (https://www.gtexportal.org/home/), OpenTargets (https://platform.opentargets.org/), IUIS (https://iuis.org/committees/iei/), GWAS Catalog (https://www.ebi.ac.uk/gwas/) and the International Multiple Sclerosis Genomics consortium (https://imsgc.net/).

Code availability

References

Mentzer, A. J. et al. Identification of host-pathogen-disease relationships using a scalable multiplex serology platform in UK Biobank. Nat. Commun. 13, 1818 (2022).
Zamora, M. R. DNA viruses (CMV, EBV, and the herpesviruses). Semin. Respir. Crit. Care Med. 32, 454–470 (2011).
Souza, T. A., Stollar, B. D., Sullivan, J. L., Luzuriaga, K. & Thorley-Lawson, D. A. Peripheral B cells latently infected with Epstein-Barr virus display molecular hallmarks of classical antigen-selected memory B cells. Proc. Natl Acad. Sci. USA 102, 18093–18098 (2005).
Damania, B., Kenney, S. C. & Raab-Traub, N. Epstein-Barr virus: biology and clinical disease. Cell 185, 3652–3670 (2022).
Cohen, J. I. Epstein-Barr virus infection. N. Engl. J. Med. 343, 481–492 (2000).
Houldcroft, C. J. & Kellam, P. Host genetics of Epstein-Barr virus infection, latency and disease. Rev. Med. Virol. 25, 71–84 (2015).
Thorley-Lawson, D. A. EBV persistence — introducing the virus. Curr. Top. Microbiol. Immunol. 390, 151–209 (2015).
Lanz, T. V. et al. Clonally expanded B cells in multiple sclerosis bind EBV EBNA1 and GlialCAM. Nature 603, 321–327 (2022).
Robinson, W. H., Younis, S., Love, Z. Z., Steinman, L. & Lanz, T. V. Epstein-Barr virus as a potentiator of autoimmune diseases. Nat. Rev. Rheumatol. 20, 729–740 (2024).
Bjornevik, K. et al. Longitudinal analysis reveals high prevalence of Epstein-Barr virus associated with multiple sclerosis. Science 375, 296–301 (2022).
Münz, C. Altered EBV specific immune control in multiple sclerosis. J. Neuroimmunol. 390, 578343 (2024).
Goldacre, R. Risk of multiple sclerosis in individuals with infectious mononucleosis: a national population-based cohort study using hospital records in England, 2003-2023. Mult. Scler. 30, 489–495 (2024).
Draborg, A. H., Duus, K. & Houen, G. Epstein-Barr virus and systemic lupus erythematosus. Clin. Dev. Immunol. 2012, 370516 (2012).
Lünemann, J. D. et al. Increased frequency of EBV-specific effector memory CD8⁺ T cells correlates with higher viral load in rheumatoid arthritis. J. Immunol. 181, 991–1000 (2008).
Tangye, S. G. Genetic susceptibility to EBV infection: insights from inborn errors of immunity. Hum. Genet. 139, 885–901 (2020).
Tangye, S. G. & Latour, S. Primary immunodeficiencies reveal the molecular requirements for effective host defense against EBV infection. Blood 135, 644–655 (2020).
Niller, H.-H. & Bauer, G. Epstein-Barr virus: clinical diagnostics. Methods Mol. Biol. 1532, 33–55 (2017).
Kanakry, J. A. et al. The clinical significance of EBV DNA in the plasma and peripheral blood mononuclear cells of patients with or without EBV diseases. Blood 127, 2007–2017 (2016).
Verdu-Bou, M., Tapia, G., Hernandez-Rodriguez, A. & Navarro, J.-T. Clinical and therapeutic implications of Epstein-Barr virus in HIV-related lymphomas. Cancers 13, 5534 (2021).
Latour, S. Human immune responses to Epstein-Barr virus highlighted by immunodeficiencies. Annu. Rev. Immunol. 43, 723–749 (2025).
Moustafa, A. et al. The blood DNA virome in 8,000 humans. PLoS Pathog. 13, e1006292 (2017).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
The All of Us Research Program Genomics Investigators et al. Genomic data in the All of Us Research Program. Nature 627, 340–346 (2024).
UK Biobank Whole-Genome Sequencing Consortium. Whole-genome sequencing of 490,640 UK Biobank participants. Nature 645, 692–701 (2025).
Mathey, C. M. et al. Molecular genetic screening in patients with ACE inhibitor/angiotensin receptor blocker-induced angioedema to explore the role of hereditary angioedema genes. Front. Genet. 13, 914376 (2022).
Wang, Q. S. et al. Statistically and functionally fine-mapped blood eQTLs and pQTLs from 1,405 humans reveal distinct regulation patterns and disease relevance. Nat. Genet. 56, 2054–2067 (2024).
Fraser, C., Hollingsworth, T. D., Chapman, R., de Wolf, F. & Hanage, W. P. Variation in HIV-1 set-point viral load: epidemiological analysis and an evolutionary hypothesis. Proc. Natl Acad. Sci. USA 104, 17441–17446 (2007).
De Paschale, M. & Clerici, P. Serological diagnosis of Epstein-Barr virus infection: problems and solutions. World J. Virol. 1, 31–43 (2012).
Dias, M. H. F. et al. Impact of Epstein-Barr virus co-infection on natural acquired Plasmodium vivax antibody response. PLoS Negl. Trop. Dis. 16, e0010305 (2022).
Stevens, S. J. C., Blank, B. S. N., Smits, P. H. M., Meenhorst, P. L. & Middeldorp, J. M. High Epstein-Barr virus (EBV) DNA loads in HIV-infected patients: correlation with antiretroviral therapy and quantitative EBV serology. AIDS 16, 993–1001 (2002).
Younis, S. et al. Epstein-Barr virus reprograms autoreactive B cells as antigen-presenting cells in systemic lupus erythematosus. Sci. Transl. Med. 17, eady0210 (2025).
Bulik-Sullivan, B. K. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B Stat. Methodol. 82, 1273–1300 (2020).
Seaton, G., Smith, H., Brancale, A., Westwell, A. D. & Clarkson, R. Multifaceted roles for BCL3 in cancer: a proto-oncogene comes of age. Mol. Cancer 23, 7 (2024).
Raychaudhuri, S. et al. Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis. Nat. Genet. 44, 291–296 (2012).
Moutsianas, L. et al. Class II HLA interactions modulate genetic risk for multiple sclerosis. Nat. Genet. 47, 1107–1113 (2015).
Hjalgrim, H. et al. HLA-A alleles and infectious mononucleosis suggest a critical role for cytotoxic T-cell response in EBV-related Hodgkin lymphoma. Proc. Natl Acad. Sci. USA 107, 6400–6405 (2010).
Kirimunda, S. et al. Variation in the human leukocyte antigen system and risk for endemic Burkitt lymphoma in northern Uganda. Br. J. Haematol. 189, 489–499 (2020).
Al-Kaabi, M. et al. Epistatic interaction between ERAP2 and HLA modulates HIV-1 adaptation and disease outcome in an Australian population. PLoS Pathog. 20, e1012359 (2024).
Evans, D. M. et al. Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat. Genet. 43, 761–767 (2011).
Raja, A. & Kuiper, J. J. W. Evolutionary immuno-genetics of endoplasmic reticulum aminopeptidase II (ERAP2). Genes Immun. 24, 295–302 (2023).
Orrù, V. et al. Complex genetic signatures in immune cells underlie autoimmunity and inform therapy. Nat. Genet. 52, 1036–1045 (2020).
Müller-Winkler, J. et al. Critical requirement for BCR, BAFF, and BAFFR in memory B cell survival. J. Exp. Med. 218, e20191393 (2021).
Butler-Laporte, G. et al. Genetic determinants of antibody-mediated immune responses to infectious diseases agents: a genome-wide and HLA association study. Open Forum Infect. Dis. 7, ofaa450 (2020).
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
Salzer, U. & Grimbacher, B. TACI deficiency — a complex system out of balance. Curr. Opin. Immunol. 71, 81–88 (2021).
Karczewski, K. J. et al. Systematic single-variant and gene-based association testing of thousands of phenotypes in 394,841 UK Biobank exomes. Cell Genom. 2, 100168 (2022).
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Rickinson, A. B., Long, H. M., Palendira, U., Münz, C. & Hislop, A. D. Cellular immune controls over Epstein-Barr virus infection: new lessons from the clinic and the laboratory. Trends Immunol. 35, 159–169 (2014).
van der Wijst, M. G. P. et al. Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs. Nat. Genet. 50, 493–497 (2018).
Zhang, M. J. et al. Polygenic enrichment distinguishes disease associations of individual cells in single-cell RNA-seq data. Nat. Genet. 54, 1572–1580 (2022).
Yazar, S. et al. Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease. Science 376, eabf3041 (2022).
Chang, K. et al. Smoking and rheumatoid arthritis. Int. J. Mol. Sci. 15, 22279–22295 (2014).
Kamper-Jørgensen, M. et al. Cigarette smoking and risk of Hodgkin lymphoma and its subtypes: a pooled analysis from the International Lymphoma Epidemiology Consortium (InterLymph). Ann. Oncol. 24, 2245–2255 (2013).
Wingerchuk, D. M. Smoking: effects on multiple sclerosis susceptibility and disease progression. Ther. Adv. Neurol. Disord. 5, 13–22 (2012).
Saint-André, V. et al. Smoking changes adaptive immunity with persistent effects. Nature 626, 827–835 (2024).
Fairweather, D., Beetler, D. J., McCabe, E. J. & Lieberman, S. M. Mechanisms underlying sex differences in autoimmunity. J. Clin. Invest. 134, e180076 (2024).
Liu, M., Wang, R. & Xie, Z. T cell-mediated immunity during Epstein-Barr virus infections in children. Infect. Genet. Evol. 112, 105443 (2023).
Abolhassani, H. et al. Combined immunodeficiency and Epstein-Barr virus-induced B cell malignancy in humans with inherited CD70 deficiency. J. Exp. Med. 214, 91–106 (2017).
Izawa, K. et al. Inherited CD70 deficiency in humans reveals a critical role for the CD70-CD27 pathway in immunity to Epstein-Barr virus infection. J. Exp. Med. 214, 73–89 (2017).
Huang, Z., Qi, G., Miller, J. S. & Zheng, S. G. CD226: an emerging role in immunologic diseases. Front. Cell Dev. Biol. 8, 564 (2020).
Cella, M. et al. Loss of DNAM-1 contributes to CD8⁺ T-cell exhaustion in chronic HIV-1 infection. Eur. J. Immunol. 40, 949–954 (2010).
Drosu, N. et al. CD4 T cells restricted to DRB1*15:01 recognize two Epstein-Barr virus glycoproteins capable of intracellular antigen presentation. Proc. Natl Acad. Sci. USA 121, e2416097121 (2024).
Lanz, T. V. & Robinson, W. H. Connecting the dots: presentation of EBV antigens on HLA class II risk alleles connects the two main risk factors of multiple sclerosis. Proc. Natl Acad. Sci. USA 121, e2420070121 (2024).
Läderach, F. et al. EBV induces CNS homing of B cells attracting inflammatory T cells. Nature 646, 171–179 (2025).
Nyeo, S. S. et al. Population-scale sequencing resolves determinants of persistent EBV DNA. Nature 650, 664–672 (2026).
Prodanov, T. et al. Locityper enables targeted genotyping of complex polymorphic genes. Nat. Genet. 57, 2901–2908 (2025).
Welsh, S., Peakman, T., Sheard, S. & Almond, R. Comparison of DNA quantification methodology used in the DNA extraction protocol for the UK Biobank cohort. BMC Genomics 18, 26 (2017).
Halldorsson, B. V. et al. The sequences of 150,119 genomes in the UK Biobank. Nature 607, 732–740 (2022).
Brenner, N. et al. Validation of multiplex serology detecting human herpesviruses 1-5. PLoS ONE 13, e0209379 (2018).
Gupta, R. et al. Nuclear genetic control of mtDNA copy number and heteroplasmy in humans. Nature 620, 839–848 (2023).
Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097–1103 (2021).
Dilthey, A. et al. Multi-population classical HLA type imputation. PLoS Comput. Biol. 9, e1002877 (2013).
Backman, J. D. et al. Exome sequencing and analysis of 454,787 UK Biobank participants. Nature 599, 628–634 (2021).
Namkoong, H. et al. DOCK2 is involved in the host genetics and biology of severe COVID-19. Nature 609, 754–760 (2022).
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337 (2010).
McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. 17, 122 (2016).
Landrum, M. J. et al. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42, D980–D985 (2014).
Cheng, J. et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science 381, eadg7492 (2023).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
Tran, T. C. et al. PheWAS analysis on large-scale biobank data with PheTK. Bioinformatics 41, btae719 (2024).
International Multiple Sclerosis Genetics Consortium (IMSGC). Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat. Genet. 45, 1353–1360 (2013).
International Multiple Sclerosis Genetics Consortium & MultipleMS Consortium. Locus for severity implicates CNS resilience in progression of multiple sclerosis. Nature 619, 323–331 (2023).
Verma, A. et al. Diversity and scale: genetic architecture of 2068 traits in the VA Million Veteran Program. Science 385, eadj1182 (2024).
Langefeld, C. D. et al. Transancestral mapping and genetic load in systemic lupus erythematosus. Nat. Commun. 8, 16021 (2017).
Ishigaki, K. et al. Multi-ancestry genome-wide association analyses identify novel genetic mechanisms in rheumatoid arthritis. Nat. Genet. 54, 1640–1651 (2022).
Robertson, C. C. et al. Fine-mapping, trans-ancestral and genomic analyses identify causal variants, cells, genes and drug targets for type 1 diabetes. Nat. Genet. 53, 962–971 (2021).
de Lange, K. M. et al. Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat. Genet. 49, 256–261 (2017).
Jiang, L., Zheng, Z., Fang, H. & Yang, J. A generalized linear mixed model association tool for biobank-scale data. Nat. Genet. 53, 1616–1621 (2021).
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Hemani, G., Tilling, K. & Davey Smith, G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. 13, e1007081 (2017).
Sanderson, E. et al. Mendelian randomization. Nat. Rev. Methods Primer 2, 6 (2022).
Zhao, Q., Wang, J., Hemani, G., Bowden, J. & Small, D. S. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. Ann. Stat. 48, 1742–1769 (2020).
Verbanck, M., Chen, C.-Y., Neale, B. & Do, R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 50, 693–698 (2018).
Bowden, J. et al. Improving the accuracy of two-sample summary-data Mendelian randomization: moving beyond the NOME assumption. Int. J. Epidemiol. 48, 728–742 (2019).
Burgess, S. & Thompson, S. G. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur. J. Epidemiol. 32, 377–389 (2017).
Schmidt, A. & Ludwig, K. U. Host control of persistent Epstein-Barr virus infection. Zenodo https://doi.org/10.5281/ZENODO.18417294 (2026).

Acknowledgements

We thank A. Vyvers and S. Heilmann-Heimbach for critical discussions; H. Schrage for laboratory support; C. Schmäl for manuscript editing; the AoU participants for their contributions, without whom this research would not have been possible; the US National Institutes of Health’s AoU Research Program for making available the participant data examined in this study; and the International Multiple Sclerosis Genetics Consortium for providing summary statistics on multiple sclerosis. We acknowledge the access granted to the Bonna and Marvin HPC clusters hosted by the University of Bonn. K.B., A.-K.P., M.M.N. and K.U.L. are members of the Excellence Cluster ImmunoSensation³ (EXC2151), which is funded by the German Research Foundation (DFG) under 390873048. A.S. was supported by the BONFOR program of the Medical Faculty of the University of Bonn (O-149.0134). Y. Okada was supported by JSPS KAKENHI (25H01057); AMED (JP24km0405217, JP24ek0109594, JP24ek0410113, JP24kk0305022, JP223fa627001, JP223fa627002, JP223fa627010, JP223fa627011, JP22zf0127008, JP24tm0524002, JP24wm0625504 and JP24gm1810011); JST Moonshot R&D (JPMJMS2021 and JPMJMS2024); Takeda Science Foundation; Ono Pharmaceutical Foundation for Oncology, Immunology, and Neurology; Bioinformatics Initiative of Osaka University Graduate School of Medicine; Institute for Open and Transdisciplinary Research Initiatives; Center for Infectious Disease Education and Research (CiDER); and Center for Advanced Modality and DDS, Osaka University, and RIKEN TRIP initiative (AGIS). H.N. was supported by AMED (JP24tm0524008, JP22fk0108510 and JP22fk0108537), JST PRESTO (JPMJPR21R7) and Takeda Science Foundation. UKB analyses were performed under application 135122. This work uses data provided by patients and collected by the NHS as part of their care and support, and we thank the participants and coordinators of the UKB study. This publication was supported by the Open Access Publication Fund of the University of Bonn.

Author information

Authors and Affiliations

Institute of Human Genetics, School of Medicine, University of Bonn and University Hospital Bonn, Bonn, Germany
Axel Schmidt, T. Madhusankha Alawathurage, Friederike S. David, Leonard Frach, Sylvia Richter, Merle Schaefer, Carina M. Mathey, Sabrina K. Henne, Andreas J. Forstner, Markus M. Nöthen, Eva C. Beins & Kerstin U. Ludwig
Department of Psychiatry and Psychotherapy, University of Marburg, Marburg, Germany
Friederike S. David
Department of Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Yosuke Ogawa, Kyuto Sonehara, Tatsuhiko Naito, Shinichi Namba, Noah Sasa & Yukinori Okada
Department of Pediatrics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Yosuke Ogawa
Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Yosuke Ogawa, Ryuya Edahiro, Yuya Shirai, Kyuto Sonehara, Tatsuhiko Naito, Shinichi Namba, Noah Sasa & Yukinori Okada
Department of Clinical, Educational and Health Psychology, Division of Psychology and Language Sciences, Faculty of Brain Sciences, University College London, London, UK
Leonard Frach
Institute of Neuroscience and Medicine (INM-1), Research Center Jülich, Jülich, Germany
Andreas J. Forstner
Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Alexander T. Dilthey
Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Alexander T. Dilthey
Center of Neurology, Department of Neuroimmunology, University Hospital and University Bonn, Bonn, Germany
Anne-Katrin Pröbstel
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany
Anne-Katrin Pröbstel
Department of Neurology, University Hospital of Basel and University of Basel, Basel, Switzerland
Anne-Katrin Pröbstel
Department of Biomedicine, University Hospital of Basel and University of Basel, Basel, Switzerland
Anne-Katrin Pröbstel
Department of Clinical Research, University Hospital of Basel and University of Basel, Basel, Switzerland
Anne-Katrin Pröbstel
Research Center for Clinical Neuroimmunology and Neuroscience Basel, University Hospital of Basel and University of Basel, Basel, Switzerland
Anne-Katrin Pröbstel
Clinic for Pediatric Immunology and Rheumatology, Center for Pediatrics and Adolescent Medicine, University Hospital Bonn, Bonn, Germany
Kaan Boztug
St. Anna Children’s Cancer Research Institute, Vienna, Austria
Kaan Boztug
CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
Kaan Boztug
Department of Pediatrics and Adolescent Medicine, Medical University of Vienna, Vienna, Austria
Kaan Boztug
Department of Infectious Diseases, Keio University School of Medicine, Tokyo, Japan
Sho Uchida, Shunsuke Uno, Tomoyasu Nishimura, Naoki Hasegawa & Ho Namkoong
Department of Statistical Genetics, Graduate School of Medicine, The University of Osaka, Suita, Japan
Ryuya Edahiro, Yuya Shirai, Kyuto Sonehara, Tatsuhiko Naito, Kenichi Yamamoto, Qingbo S. Wang, Shinichi Namba, Ken Suzuki, Toshihiro Kishikawa, Noah Sasa & Yukinori Okada
Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), The University of Osaka, Suita, Japan
Yukinori Okada
Premium Research Institute for Human Metaverse Medicine (WPI-PRIMe), The University of Osaka, Suita, Japan
Yukinori Okada
Division of Pulmonary Medicine, Department of Medicine, Keio University School of Medicine, Tokyo, Japan
Genta Nagao, Hiromu Tanaka, Shuhei Azekawa, Ko Lee, Naoki Fukunaga, Junko Hamamoto, Hiroki Kabata, Katsunori Masaki, Hirofumi Kamata, Shinnosuke Ikemura, Shotaro Chubachi, Satoshi Okamori, Hideki Terai, Atsuho Morita, Takanori Asakura, Makoto Ishii & Koichi Fukunaga
Department of Laboratory Medicine, Keio University School of Medicine, Tokyo, Japan
Yoshifumi Uwamino
Keio University Health Center, Tokyo, Japan
Tomoyasu Nishimura
Genomics Unit, Keio Cancer Center, Keio University Hospital, Tokyo, Japan
Emmy Yanagita & Hiroshi Nishihara
Department of Emergency and Critical Care Medicine, Keio University School of Medicine, Tokyo, Japan
Junichi Sasaki
Department of Anesthesiology, Keio University School of Medicine, Tokyo, Japan
Hiroshi Morisaki
Department of Organoid Medicine, Keio University School of Medicine, Tokyo, Japan
Toshiro Sato
Department of Surgery, Keio University School of Medicine, Tokyo, Japan
Yuko Kitagawa
Division of Gastroenterology and Hepatology, Department of Medicine, Keio University School of Medicine, Tokyo, Japan
Yuta Matsubara, Yohei Mikami, Kosaku Nanki & Takanori Kanai
Department of Respiratory Medicine and Clinical Immunology, Graduate School of Medicine, The University of Osaka, Suita, Japan
Ryuya Edahiro, Yuya Shirai, Yasuhiro Kato, Takayoshi Morita, Takayuki Shiroyama, Yuichi Maeda, Takuro Nii, Yoshimi Noda, Takayuki Niitsu, Yuichi Adachi, Takatoshi Enomoto, Saori Amiya, Reina Hara, Yuta Yamaguchi, Teruaki Murakami, Tomoki Kuge, Kinnosuke Matsumoto, Yuji Yamamoto, Makoto Yamamoto, Midori Yoneda, Haruhiko Hirata, Yoshito Takeda & Atsushi Kumanogoh
Single Cell Genomics, Human Immunology, WPI Immunology Frontier Research Center, The University of Osaka, Suita, Japan
Daisuke Okuzaki, Yu-Chen Liu & Ayako Takuwa
Genome Information Research Center, Research Institute for Microbial Diseases, The University of Osaka, Suita, Japan
Daisuke Motooka & Yoko Naito
Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Masahiro Kanai
Laboratory of Children’s Health and Genetics, Division of Health Science, Graduate School of Medicine, The University of Osaka, Suita, Japan
Kenichi Yamamoto
Department of Immunopathology, Immunology Frontier Research Center (WPI-IFReC), The University of Osaka, Suita, Japan
Yasuhiro Kato, Takayoshi Morita, Yuta Yamaguchi, Teruaki Murakami & Atsushi Kumanogoh
Core Instrumentation Facility, Immunology Frontier Research Center and Research Institute for Microbial Diseases, The University of Osaka, Suita, Japan
Fuminori Sugihara
Laboratory of Human Immunology (Single Cell Immunology), Immunology Frontier Research Center, The University of Osaka, Suita, Japan
James B. Wing
Laboratory of Immune Regulation, Immunology Frontier Research Center, The University of Osaka, Suita, Japan
Shuhei Sakakibara
Department of Pulmonary Medicine, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
Nobuyuki Hizawa
Department of Neurosurgery, Faculty of Medicine, The University of Tokyo, Tokyo, Japan
Satoru Miyawaki
Department of Integrative Physiology and Bio-Nano Medicine, National Defense Medical College, Tokorozawa, Japan
Yusuke Kawamura, Akiyoshi Nakayama & Hirotaka Matsuo
Department of Otorhinolaryngology-Head and Neck Surgery, Graduate School of Medicine, The University of Osaka, Suita, Japan
Toshihiro Kishikawa, Noah Sasa, Yuya Ueno, Motoyuki Suzuki, Norihiko Takemoto, Hirotaka Eguchi, Takahito Fukusumi, Takao Imai, Munehisa Fukushima & Hidenori Inohara
Department of Head and Neck Surgery, Aichi Cancer Center Hospital, Nagoya, Japan
Toshihiro Kishikawa
Department of Neurosurgery, Graduate School of Medicine, The University of Osaka, Suita, Japan
Shuhei Yamada, Shuhei Kawabata, Noriyuki Kijima, Masatoshi Takagaki & Haruhiko Kishima
Department of Otolaryngology and Head and Neck Surgery, Kansai Rosai Hospital, Hyogo, Japan
Munehisa Fukushima
Division of Infection Control and Prevention, The University of Osaka Hospital, Suita, Japan
Kazunori Tomono
Department of Biomedical Ethics and Public Policy, Graduate School of Medicine, The University of Osaka, Suita, Japan
Kazuto Kato
Center for Genomic Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
Meiko Takahashi & Fumihiko Matsuda
Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, The University of Osaka, Suita, Japan
Atsushi Kumanogoh
Center for Infectious Disease Education and Research (CiDER), The University of Osaka, Suita, Japan
Atsushi Kumanogoh
M&D Data Science Center, Institute of Integrated Research, Institute of Science Tokyo, Tokyo, Japan
Takanori Hasegawa, Kunihiko Takahashi, Tatsuhiko Anzai, Satoshi Ito & Satoru Miyano
Department of Medical Informatics, Institute of Science Tokyo Hospital, Tokyo, Japan
Yuji Uchimura
Clinical Research Center, Institute of Science Tokyo Hospital, Tokyo, Japan
Akifumi Endo
Respiratory Medicine, Institute of Science Tokyo Hospital, Tokyo, Japan
Yasunari Miyazaki, Takayuki Honda & Tomoya Tateishi
Clinical Laboratory, Institute of Science Tokyo Hospital, Tokyo, Japan
Shuji Tohda, Naoya Ichimura, Kazunari Sonobe, Chihiro Tani Sassa & Jun Nakajima
Department of Insured Medical Care Management, Institute of Science Tokyo Hospital, Tokyo, Japan
Masumi Ai
Health Science Research and Development Center (HeRD), Institute of Science Tokyo, Tokyo, Japan
Ryuji Koike
Institute of Science Tokyo, Tokyo, Japan
Akinori Kimura
Laboratory of Veterinary Infectious Disease, School of Veterinary Medicine, Kitasato University, Aomori, Japan
Tomomi Takano
Laboratory of Viral Infection, Department of Infection Control and Immunology, Omura Satoshi Memorial Institute and Graduate School of Infection Control Sciences, Kitasato University, Tokyo, Japan
Kazuhiko Katayama
Department of Pathology, Saitama Medical University, Saitama, Japan
Koji Okudela
Department of Pathology and Tumor Biology, Kyoto University, Kyoto, Japan
Ryunosuke Saiki, Yasuhito Nannya & Seishi Ogawa
Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
Seishi Ogawa
Division of Health Medical Intelligence, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
Takayoshi Hyugaji, Eigo Shimizu, Kotoe Katayama & Seiya Imoto
Genome Medical Science Project (Toyama), National Center for Global Health and Medicine, Tokyo, Japan
Yosuke Omae & Katsushi Tokunaga
Department of Biomolecular Engineering, Graduate School of Tokyo Institute of Technology, Tokyo, Japan
Takafumi Ueno
Division of Immunogenetics, Department of Immunobiology and Neuroscience, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Japan
Yoshinori Fukui
Division of Pathology, Yokohama Municipal Citizen’s Hospital, Yokohama, Japan
Hiroyuki Hayashi
Division of Infectious Disease, Yokohama Municipal Citizen’s Hospital, Yokohama, Japan
Yukihiro Yoshimura & Natsuo Tachikawa
Department of Respiratory Medicine, Juntendo University Faculty of Medicine and Graduate School of Medicine, Tokyo, Japan
Kazuhisa Takahashi, Norihiro Harada, Yuki Tanabe, Haruhi Takagi, Ai Nakamura, Sonoko Harada & Hitoshi Sasano
Department of General Medicine, Juntendo University Faculty of Medicine and Graduate School of Medicine, Tokyo, Japan
Toshio Naito
Department of Emergency and Disaster Medicine, Juntendo University Faculty of Medicine and Graduate School of Medicine, Tokyo, Japan
Makoto Hiki
Department of Cardiovascular Biology and Medicine, Juntendo University Faculty of Medicine and Graduate School of Medicine, Tokyo, Japan
Makoto Hiki
Department of Internal Medicine and Rheumatology, Juntendo University Faculty of Medicine and Graduate School of Medicine, Tokyo, Japan
Yasushi Matsushita
Department of Nephrology, Juntendo University Faculty of Medicine and Graduate School of Medicine, Tokyo, Japan
Ryousuke Aoki
Atopy (Allergy) Research Center, Juntendo University Graduate School of Medicine, Tokyo, Japan
Sonoko Harada
Department of Respiratory Medicine, Saitama Cardiovascular and Respiratory Center, Kumagaya, Japan
Takashi Ishiguro, Taisuke Isono, Shun Shibata, Yuma Matsui, Chiaki Hosoda, Kenji Takano, Takashi Nishida, Yoichi Kobayashi, Yotaro Takaku & Noboru Takayanagi
Internal Medicine, Japan Community Healthcare Organization Saitama Medical Center, Saitama, Japan
Soichiro Ueda, Natsumi Yazaki, Ai Tada, Masayoshi Miyawaki, Masaomi Yamamoto, Eriko Yoshida, Reina Hayashi, Tomoki Nagasaka, Sawako Arai, Yutaro Kaneko & Kana Sasaki
Department of Respiratory Medicine, Tokyo Women’s Medical University, Tokyo, Japan
Etsuko Tagaya & Ken Arimura
Department of General Medicine, Tokyo Women’s Medical University, Tokyo, Japan
Masatoshi Kawana
Kawasaki Municipal Ida Hospital, Department of Internal Medicine, Kawasaki, Japan
Yasushi Nakano, Yukiko Nakajima, Ryusuke Anan, Ryosuke Arai, Yuko Kurihara, Yuko Harada & Kazumi Nishio
Department of Respiratory Medicine, Osaka Saiseikai Nakatsu Hospital, Osaka, Japan
Tetsuya Ueda, Masanori Azuma, Ryuichi Saito, Toshikatsu Sado, Yoshimune Miyazaki, Ryuichi Sato, Yuki Haruta, Tadao Nagasaki, Yoshinori Hasegawa, Akihiro Noda, Yusei Fukushima & Reina Kitagawa
Department of Infection Control, Osaka Saiseikai Nakatsu Hospital, Osaka, Japan
Yoshinori Yasui
Department of Infectious Diseases, Tosei General Hospital, Seto, Japan
Yoshikazu Mutoh
Department of Respiratory, Allergic Diseases Internal Medicine, Tosei General Hospital, Seto, Japan
Tomoki Kimura, Tomonori Sato, Reoto Takei, Satoshi Hagimoto, Yoichiro Noguchi, Yasuhiko Yamano, Hajime Sasano & Sho Ota
Department of Emergency and Critical Care Medicine, Kansai Medical University General Medical Center, Moriguchi, Japan
Yasushi Nakamori, Kazuhisa Yoshiya, Fukuki Saito, Tomoyuki Yoshihara, Daiki Wada, Hiromu Iwamura, Syuji Kanayama & Shuhei Maruyama
Fukujuji Hospital, Kiyose, Japan
Takashi Yoshiyama, Ken Ohta, Hiroyuki Kokuto, Hideo Ogata, Yoshiaki Tanaka, Kenichi Arakawa, Masafumi Shimoda & Takeshi Osawa
Department of Pulmonary Medicine, Saitama City Hospital, Saitama, Japan
Hiroki Tateno, Isano Hase, Shuichi Yoshida & Shoji Suzuki
Department of Infectious Diseases, Saitama City Hospital, Saitama, Japan
Miki Kawada
Department of General Thoracic Surgery, Saitama City Hospital, Saitama, Japan
Hirohisa Horinouchi
Department of Pulmonary Medicine, Eiju General Hospital, Tokyo, Japan
Fumitake Saito & Junichi Ochi
Division of Infection Control, Eiju General Hospital, Tokyo, Japan
Keiko Mitamura
Department of Hematology, Eiju General Hospital, Tokyo, Japan
Masao Hagihara & Tomoyuki Uchida
Saiseikai Utsunomiya Hospital, Utsunomiya, Japan
Rie Baba, Daisuke Arai, Takayuki Ogura, Hidenori Takahashi, Shigehiro Hagiwara, Shunichiro Konishi & Ichiro Nakachi
Department of Respiratory Medicine, Tohoku University Graduate School of Medicine, Sendai, Japan
Koji Murakami, Mitsuhiro Yamada, Hisatoshi Sugiura, Hirohito Sano, Shuichiro Matsumoto, Nozomu Kimura & Yoshinao Ono
Department of Infectious Diseases, Tohoku University Graduate School of Medicine, Sendai, Japan
Hiroaki Baba
Department of Respiratory Medicine, Kitasato University Kitasato Institute Hospital, Tokyo, Japan
Yusuke Suzuki, Sohei Nakayama & Keita Masuzawa
Tachikawa Hospital, Tachikawa, Japan
Hidefumi Koh, Tadashi Manabe, Yohei Funatsu, Fumimaro Ito, Takahiro Fukui, Keisuke Shinozuka, Sumiko Kohashi & Masatoshi Miyazaki
Department of Emergency and Critical Care Medicine, Tokyo Women’s Medical University Adachi Medical Center, Tokyo, Japan
Tomohisa Shoko
Internal Medicine, Sano Kosei General Hospital, Sano, Japan
Takashi Inoue, Takahiro Asami, Toshiyuki Hirano, Keigo Kobayashi & Hatsuyo Takaoka
Japan Community Healthcare Organization Kanazawa Hospital, Kanazawa, Japan
Kazuyoshi Watanabe
Department of Respiratory Medicine, Saiseikai Yokohamashi Nanbu Hospital, Yokohama, Japan
Naoki Miyazawa, Yasuhiro Kimura, Reiko Sado & Hideyasu Sugimoto
Department of Clinical Laboratory, Saiseikai Yokohamashi Nanbu Hospital, Yokohama, Japan
Akane Kamiya
Internal Medicine, Internal Medicine Center, Showa University Koto Toyosu Hospital, Tokyo, Japan
Naota Kuwahara, Akiko Fujiwara, Tomohiro Matsunaga, Yoko Sato & Takenori Okada
Department of Respiratory Medicine, Japan Organization of Occupational Health and Safety, Kanto Rosai Hospital, Kawasaki, Japan
Yoshihiro Hirai, Hidetoshi Kawashima & Atsuya Narita
Department of General Internal Medicine, Japan Organization of Occupational Health and Safety, Kanto Rosai Hospital, Kawasaki, Japan
Kazuki Niwa
Division of Infectious Diseases, Japanese Red Cross Musashino Hospital, Tokyo, Japan
Yoshiyuki Sekikawa
Ishikawa Prefectural Central Hospital, Kanazawa, Japan
Koichi Nishi, Masaru Nishitsuji, Mayuko Tani, Junya Suzuki & Hiroki Nakatsumi
Kanagawa Cardiovascular and Respiratory Center, Yokohama, Japan
Takashi Ogura, Hideya Kitamura, Eri Hagiwara, Kota Murohashi & Hiroko Okabayashi
Department of Respiratory Medicine, National Hospital Organization Tokyo Medical Center, Tokyo, Japan
Takao Mochimaru, Shigenari Nukaga, Ryosuke Satomi & Yoshitaka Oyamada
Department of Allergy, National Hospital Organization Tokyo Medical Center, Tokyo, Japan
Takao Mochimaru & Yoshitaka Oyamada
Division of Clinical Infectious Diseases, Department of Medicine, Showa University School of Medicine, Tokyo, Japan
Nobuaki Mori
Department of Respiratory Medicine, Toyohashi Municipal Hospital, Toyohashi, Japan
Tomoya Baba, Yasutaka Fukui, Mitsuru Odate, Shuko Mashimo & Yasushi Makino
Keiyu Hospital, Yokohama, Japan
Kazuma Yagi, Mizuha Hashiguchi, Junko Kagyo & Tetsuya Shiomi
Department of Respiratory Medicine, KKR Sapporo Medical Center, Sapporo, Japan
Satoshi Fuke & Hiroshi Saito
Division of General Internal Medicine, Department of Internal Medicine, St. Marianna University School of Medicine, Kawasaki, Japan
Tomoya Tsuchida
Department of Emergency and Critical Care Medicine, St. Marianna University School of Medicine, Kawasaki, Japan
Shigeki Fujitani, Mumon Takita, Daiki Morikawa & Toru Yoshida
Japanese Red Cross Medical Center, Tokyo, Japan
Takehiro Izumo, Minoru Inomata, Naoyuki Kuse, Nobuyasu Awano & Mari Tone
Matsumoto City Hospital, Matsumoto, Japan
Akihiro Ito
Department of Emergency and Critical Care Medicine, Faculty of Medicine, Fukuoka University, Fukuoka, Japan
Yoshihiko Nakamura, Kota Hoshino, Junichi Maruyama & Hiroyasu Ishikura
Department of Infection Control, Fukuoka University Hospital, Fukuoka, Japan
Tohru Takata
Department of Rheumatology, National Hospital Organization Hokkaido Medical Center, Sapporo, Japan
Toshio Odani
Department of Respiratory Medicine, National Hospital Organization Hokkaido Medical Center, Sapporo, Japan
Masaru Amishima & Takeshi Hattori
Department of Emergency and Critical Care Medicine, National Hospital Organization Hokkaido Medical Center, Sapporo, Japan
Yasuo Shichinohe
NHO Kanazawa Medical Center, Kanazawa, Japan
Takashi Kagaya, Toshiyuki Kita, Kazuhide Ohta, Satoru Sakagami & Kiyoshi Koshida
Department of Internal Medicine, Division of Respiratory Medicine, School of Medicine, Nihon University, Tokyo, Japan
Kentaro Hayashi, Tetsuo Shimizu, Yutaka Kozu, Hisato Hiranuma & Yasuhiro Gon
Musashino Red Cross Hospital, Musashino, Japan
Namiki Izumi, Kaoru Nagata, Ken Ueda, Reiko Taki & Satoko Hanada
Division of Respiratory Medicine, Social Welfare Organization Saiseikai Imperial Gift Foundation, Inc., Saiseikai Kumamoto Hospital, Kumamoto, Japan
Kodai Kawamura, Kazuya Ichikado, Kenta Nishiyama, Hiroyuki Muranaka & Kazunori Nakamura
Department of Respiratory Medicine, Nagoya University Graduate School of Medicine, Nagoya, Japan
Naozumi Hashimoto, Keiko Wakahara, Sakamoto Koji, Norihito Omote & Akira Ando
Department of Internal Medicine, Fukuoka Tokushukai Hospital, Kasuga, Japan
Nobuhiro Kodama, Yasunari Kaneyama & Shunsuke Maeda
Respiratory Medicine, Fukuoka Tokushukai Hospital, Kasuga, Japan
Takashige Kuraki & Takemasa Matsumoto
Department of Endocrinology, Hematology and Gerontology, Chiba University Graduate School of Medicine, Chiba, Japan
Koutaro Yokote
Department of Emergency and Critical Care Medicine, Chiba University Graduate School of Medicine, Chiba, Japan
Taka-Aki Nakada, Ryuzo Abe, Taku Oshima & Tadanaga Shimada
National Hospital Organization Kumamoto Medical Center, Kumamoto, Japan
Masahiro Harada, Takeshi Takahashi, Hiroshi Ono, Toshihiro Sakurai & Takayuki Shibusawa
Division of Infectious Diseases and Respiratory Medicine, Department of Internal Medicine, National Defense Medical College, Tokorozawa, Japan
Yoshifumi Kimizuka, Akihiko Kawana, Tomoya Sano, Chie Watanabe & Ryohei Suematsu
Sapporo City General Hospital, Sapporo, Japan
Hisako Sageshima
Department of Internal Medicine, Tokyo Saiseikai Central Hospital, Tokyo, Japan
Ayumi Yoshifuji & Kazuto Ito
Department of Pulmonary Medicine, Tokyo Saiseikai Central Hospital, Tokyo, Japan
Saeko Takahashi & Kota Ishioka
National Hospital Organization Kanagawa Hospital, Hadano, Japan
Morio Nakamura
Department of Respiratory Medicine, Fujisawa City Hospital, Fujisawa, Japan
Makoto Masuda, Aya Wakabayashi, Hiroki Watanabe, Suguru Ueda & Masanori Nishikawa
Uji-Tokushukai Medical Center, Uji, Japan
Yusuke Chihara, Mayumi Takeuchi, Keisuke Onoi, Jun Shinozuka & Atsushi Sueyoshi
Fukuoka Tokushukai Hospital, Kasuga, Japan
Atsushi Sueyoshi
Department of Infectious Disease, NHO Kyushu Medical Center, Fukuoka, Japan
Yoji Nagasaki, Sayoko Ishihara & Masatoshi Shimo
Department of Respirology, NHO Kyushu Medical Center, Fukuoka, Japan
Masaki Okamoto & Yoshihisa Tokunaga
Division of Respirology, Rheumatology, and Neurology, Department of Internal Medicine, Kurume University School of Medicine, Kurume, Japan
Masaki Okamoto & Yoshihisa Tokunaga
Ome Medical Center, Ome, Japan
Yu Kusaka, Takehiko Ohba & Susumu Isogai
Research Institute for Diseases of the Chest, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
Satoru Fukuyama, Keiko Kan-o & Koichiro Matsumoto
Department of Medicine and Biosystemic Science, Kyushu University Graduate School of Medical Sciences, Fukuoka, Japan
Yoshihiro Eriguchi & Akiko Yonekawa
Daini Osaka Police Hospital, Osaka, Japan
Kensuke Kanaoka, Shoichi Ihara & Kiyoshi Komuta
Department of Emergency and Critical Care Medicine, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
Yoshiaki Inoue
Department of Hematology, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
Shigeru Chiba
Department of Nephrology, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
Kunihiro Yamagata & Hirayasu Kai
Department of Cardiovascular Surgery, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan
Yuji Hiramatsu
Division of Pulmonary Medicine, Department of Medicine, Tokai University School of Medicine, Isehara, Japan
Koichiro Asano, Tsuyoshi Oguma & Yoko Ito
Department of Anesthesiology and Intensive Care Medicine, Kyoto Prefectural University of Medicine, Kyoto, Japan
Satoru Hashimoto & Masaki Yamasaki
Department of Infection Control and Laboratory Medicine, Kyoto Prefectural University of Medicine, Kyoto, Japan
Yu Kasamatsu
Department of Respiratory Internal Medicine, St Marianna University School of Medicine, Yokohama-City Seibu Hospital, Yokohama, Japan
Yuko Komase, Naoya Hida, Takahiro Tsuburai & Baku Oyama
KINSHUKAI Hanwa The Second Hospital, Osaka, Japan
Minoru Takada & Hidenori Kanda
Emergency and Disaster Medicine, Gifu University School of Medicine Graduate School of Medicine, Gifu, Japan
Yuichiro Kitagawa, Tetsuya Fukuta, Takahito Miyake & Shozo Yoshida
School of Health Sciences, Asahi University, Gifu, Japan
Shinji Ogura
Department of Respiratory Medicine, Tokyo Medical University Hospital, Tokyo, Japan
Shinji Abe, Yuta Kono, Yuki Togashi, Hiroyuki Takoi & Ryota Kikuchi
JA Toride Medical Hospital, Toride, Japan
Shinichi Ogawa, Tomouki Ogata & Shoichiro Ishihara
Okayama Rosai Hospital, Okayama, Japan
Arihiko Kanehiro, Shinji Ozaki, Yasuko Fuchimoto, Sae Wada & Nobukazu Fujimoto
Himeji St. Mary’s Hospital, Himeji, Japan
Arihiko Kanehiro
Emergency and Critical Care, Niigata University, Niigata, Japan
Kei Nishiyama
Emergency and Critical Care Center, National Hospital Organization Kyoto Medical Center, Kyoto, Japan
Mariko Terashima, Satoru Beppu & Kosuke Yoshida
National Hospital Organization Tokyo Hospital, Kiyose, Japan
Osamu Narumoto, Hideaki Nagai & Nobuharu Ooshima
Fujioka General Hospital, Fujioka, Japan
Mitsuru Motegi
Department of General Medicine, School of Medicine, International University of Health and Welfare Shioya Hospital, Yaita, Japan
Akira Umeda
Department of Pharmacology, School of Pharmacy, International University of Health and Welfare Shioya Hospital, Ohtawara, Japan
Kazuya Miyagawa
Department of Respiratory Medicine, International University of Health and Welfare Shioya Hospital, Ohtawara, Japan
Hisato Shimada
Department of Clinical Laboratory, International University of Health and Welfare Shioya Hospital, Ohtawara, Japan
Mayu Endo
Department of General Medicine, School of Medicine, International University of Health and Welfare Shioya Hospital, Ohtawara, Japan
Yoshiyuki Ohira
Department of Cardiology, Pulmonology, and Nephrology, Yamagata University Faculty of Medicine, Yamagata, Japan
Masafumi Watanabe, Sumito Inoue, Akira Igarashi & Masamichi Sato
Division of Respiratory Medicine and Allergology, Department of Medicine, School of Medicine, Showa University, Tokyo, Japan
Hironori Sagara, Akihiko Tanaka, Shin Ohta & Tomoyuki Kimura
Department of Pulmonary Medicine, Fukushima Medical University, Fukushima, Japan
Yoko Shibata, Yoshinori Tanino, Takefumi Nikaido, Hiroyuki Minemura & Yuki Sato
Kansai Electric Power Hospital, Osaka, Japan
Yuichiro Yamada, Takuya Hashino & Masato Shinoki
Division of Infectious Diseases, Kumamoto City Hospital, Kumamoto, Japan
Hajime Iwagoe
Department of Respiratory Medicine, Kumamoto City Hospital, Kumamoto, Japan
Hiroshi Takahashi, Kazuhiko Fujii & Hiroto Kishi
Department of Emergency and Critical Care Medicine, Tokyo Metropolitan Police Hospital, Tokyo, Japan
Masayuki Kanai, Tomonori Imamura & Tatsuya Yamashita
Department of Respiratory Medicine, Gunma University Graduate School of Medicine, Maebashi, Japan
Masakiyo Yatomi & Toshitaka Maeno
National Hospital Organization Saitama Hospital, Wako, Japan
Shinichi Hayashi, Mai Takahashi, Mizuki Kuramochi, Isamu Kamimaki & Yoshiteru Tominaga
Tokyo Medical University Ibaraki Medical Center, Inashiki, Japan
Tomoo Ishii
Department of Internal Medicine, Kiryu Kosei General Hospital, Kiryu, Japan
Mitsuyoshi Utsugi & Akihiro Ono
Department of Pulmonary Medicine and Oncology, Graduate School of Medicine, Nippon Medical School, Tokyo, Japan
Toru Tanaka, Takeru Kashiwada, Kazue Fujita, Yoshinobu Saito & Masahiro Seike
Division of Respiratory Medicine, Tsukuba Kinen General Hospital, Tsukuba, Japan
Hiroko Watanabe
Division of Respiratory Medicine, Department of Internal Medicine, Toho University Ohashi Medical Center, Tokyo, Japan
Hiroto Matsuse, Norio Kodaka, Chihiro Nakano, Takeshi Oshio & Takatomo Hirouchi
Division of Anesthesiology, Department of Surgery Related, Kobe University Graduate School of Medicine, Kobe, Japan
Shohei Makino & Moritoki Egi

Authors

Axel Schmidt
T. Madhusankha Alawathurage
Friederike S. David
Yosuke Ogawa
Leonard Frach
Sylvia Richter
Merle Schaefer
Carina M. Mathey
Sabrina K. Henne
Andreas J. Forstner
Alexander T. Dilthey
Anne-Katrin Pröbstel
Kaan Boztug
Markus M. Nöthen
Ho Namkoong
Yukinori Okada
Eva C. Beins
Kerstin U. Ludwig

Consortia

Japan COVID-19 Task Force

Genta Nagao, Hiromu Tanaka, Shuhei Azekawa, Ko Lee, Naoki Fukunaga, Junko Hamamoto, Hiroki Kabata, Katsunori Masaki, Hirofumi Kamata, Shinnosuke Ikemura, Shotaro Chubachi, Satoshi Okamori, Hideki Terai, Atsuho Morita, Takanori Asakura, Makoto Ishii, Koichi Fukunaga, Yoshifumi Uwamino, Sho Uchida, Shunsuke Uno, Tomoyasu Nishimura, Ho Namkoong, Naoki Hasegawa, Emmy Yanagita, Hiroshi Nishihara, Junichi Sasaki, Hiroshi Morisaki, Toshiro Sato, Yuko Kitagawa, Yuta Matsubara, Yohei Mikami, Kosaku Nanki, Takanori Kanai, Ryuya Edahiro, Yuya Shirai, Kyuto Sonehara, Daisuke Okuzaki, Daisuke Motooka, Masahiro Kanai, Tatsuhiko Naito, Kenichi Yamamoto, Qingbo S. Wang, Yasuhiro Kato, Takayoshi Morita, Shinichi Namba, Ken Suzuki, Yoko Naito, Yu-Chen Liu, Ayako Takuwa, Fuminori Sugihara, James B. Wing, Shuhei Sakakibara, Nobuyuki Hizawa, Takayuki Shiroyama, Satoru Miyawaki, Yusuke Kawamura, Akiyoshi Nakayama, Hirotaka Matsuo, Yuichi Maeda, Takuro Nii, Yoshimi Noda, Takayuki Niitsu, Yuichi Adachi, Takatoshi Enomoto, Saori Amiya, Reina Hara, Yuta Yamaguchi, Teruaki Murakami, Tomoki Kuge, Kinnosuke Matsumoto, Yuji Yamamoto, Makoto Yamamoto, Midori Yoneda, Toshihiro Kishikawa, Shuhei Yamada, Shuhei Kawabata, Noriyuki Kijima, Masatoshi Takagaki, Noah Sasa, Yuya Ueno, Motoyuki Suzuki, Norihiko Takemoto, Hirotaka Eguchi, Takahito Fukusumi, Takao Imai, Munehisa Fukushima, Haruhiko Kishima, Hidenori Inohara, Kazunori Tomono, Kazuto Kato, Meiko Takahashi, Fumihiko Matsuda, Haruhiko Hirata, Yoshito Takeda, Atsushi Kumanogoh, Yukinori Okada, Takanori Hasegawa, Kunihiko Takahashi, Tatsuhiko Anzai, Satoshi Ito, Yuji Uchimura, Akifumi Endo, Yasunari Miyazaki, Takayuki Honda, Tomoya Tateishi, Shuji Tohda, Naoya Ichimura, Kazunari Sonobe, Chihiro Tani Sassa, Jun Nakajima, Masumi Ai, Ryuji Koike, Akinori Kimura, Satoru Miyano, Tomomi Takano, Kazuhiko Katayama, Koji Okudela, Ryunosuke Saiki, Yasuhito Nannya, Seishi Ogawa, Takayoshi Hyugaji, Eigo Shimizu, Kotoe Katayama, Seiya Imoto, Yosuke Omae, Katsushi Tokunaga, Takafumi Ueno, Yoshinori Fukui, Hiroyuki Hayashi, Yukihiro Yoshimura, Natsuo Tachikawa, Kazuhisa Takahashi, Norihiro Harada, Yuki Tanabe, Toshio Naito, Makoto Hiki, Yasushi Matsushita, Haruhi Takagi, Ryousuke Aoki, Ai Nakamura, Sonoko Harada, Hitoshi Sasano, Takashi Ishiguro, Taisuke Isono, Shun Shibata, Yuma Matsui, Chiaki Hosoda, Kenji Takano, Takashi Nishida, Yoichi Kobayashi, Yotaro Takaku, Noboru Takayanagi, Soichiro Ueda, Natsumi Yazaki, Ai Tada, Masayoshi Miyawaki, Masaomi Yamamoto, Eriko Yoshida, Reina Hayashi, Tomoki Nagasaka, Sawako Arai, Yutaro Kaneko, Kana Sasaki, Etsuko Tagaya, Masatoshi Kawana, Ken Arimura, Yasushi Nakano, Yukiko Nakajima, Ryusuke Anan, Ryosuke Arai, Yuko Kurihara, Yuko Harada, Kazumi Nishio, Tetsuya Ueda, Masanori Azuma, Ryuichi Saito, Toshikatsu Sado, Yoshimune Miyazaki, Ryuichi Sato, Yuki Haruta, Tadao Nagasaki, Yoshinori Yasui, Yoshinori Hasegawa, Akihiro Noda, Yusei Fukushima, Reina Kitagawa, Yoshikazu Mutoh, Tomoki Kimura, Tomonori Sato, Reoto Takei, Satoshi Hagimoto, Yoichiro Noguchi, Yasuhiko Yamano, Hajime Sasano, Sho Ota, Yasushi Nakamori, Kazuhisa Yoshiya, Fukuki Saito, Tomoyuki Yoshihara, Daiki Wada, Hiromu Iwamura, Syuji Kanayama, Shuhei Maruyama, Takashi Yoshiyama, Ken Ohta, Hiroyuki Kokuto, Hideo Ogata, Yoshiaki Tanaka, Kenichi Arakawa, Masafumi Shimoda, Takeshi Osawa, Hiroki Tateno, Isano Hase, Shuichi Yoshida, Shoji Suzuki, Miki Kawada, Hirohisa Horinouchi, Fumitake Saito, Keiko Mitamura, Masao Hagihara, Junichi Ochi, Tomoyuki Uchida, Rie Baba, Daisuke Arai, Takayuki Ogura, Hidenori Takahashi, Shigehiro Hagiwara, Shunichiro Konishi, Ichiro Nakachi, Koji Murakami, Mitsuhiro Yamada, Hisatoshi Sugiura, Hirohito Sano, Shuichiro Matsumoto, Nozomu Kimura, Yoshinao Ono, Hiroaki Baba, Yusuke Suzuki, Sohei Nakayama, Keita Masuzawa, Hidefumi Koh, Tadashi Manabe, Yohei Funatsu, Fumimaro Ito, Takahiro Fukui, Keisuke Shinozuka, Sumiko Kohashi, Masatoshi Miyazaki, Tomohisa Shoko, Takashi Inoue, Takahiro Asami, Toshiyuki Hirano, Keigo Kobayashi, Hatsuyo Takaoka, Kazuyoshi Watanabe, Naoki Miyazawa, Yasuhiro Kimura, Reiko Sado, Hideyasu Sugimoto, Akane Kamiya, Naota Kuwahara, Akiko Fujiwara, Tomohiro Matsunaga, Yoko Sato, Takenori Okada, Yoshihiro Hirai, Hidetoshi Kawashima, Atsuya Narita, Kazuki Niwa, Yoshiyuki Sekikawa, Koichi Nishi, Masaru Nishitsuji, Mayuko Tani, Junya Suzuki, Hiroki Nakatsumi, Takashi Ogura, Hideya Kitamura, Eri Hagiwara, Kota Murohashi, Hiroko Okabayashi, Takao Mochimaru, Shigenari Nukaga, Ryosuke Satomi, Yoshitaka Oyamada, Nobuaki Mori, Tomoya Baba, Yasutaka Fukui, Mitsuru Odate, Shuko Mashimo, Yasushi Makino, Kazuma Yagi, Mizuha Hashiguchi, Junko Kagyo, Tetsuya Shiomi, Satoshi Fuke, Hiroshi Saito, Tomoya Tsuchida, Shigeki Fujitani, Mumon Takita, Daiki Morikawa, Toru Yoshida, Takehiro Izumo, Minoru Inomata, Naoyuki Kuse, Nobuyasu Awano, Mari Tone, Akihiro Ito, Yoshihiko Nakamura, Kota Hoshino, Junichi Maruyama, Hiroyasu Ishikura, Tohru Takata, Toshio Odani, Masaru Amishima, Takeshi Hattori, Yasuo Shichinohe, Takashi Kagaya, Toshiyuki Kita, Kazuhide Ohta, Satoru Sakagami, Kiyoshi Koshida, Kentaro Hayashi, Tetsuo Shimizu, Yutaka Kozu, Hisato Hiranuma, Yasuhiro Gon, Namiki Izumi, Kaoru Nagata, Ken Ueda, Reiko Taki, Satoko Hanada, Kodai Kawamura, Kazuya Ichikado, Kenta Nishiyama, Hiroyuki Muranaka, Kazunori Nakamura, Naozumi Hashimoto, Keiko Wakahara, Sakamoto Koji, Norihito Omote, Akira Ando, Nobuhiro Kodama, Yasunari Kaneyama, Shunsuke Maeda, Takashige Kuraki, Takemasa Matsumoto, Koutaro Yokote, Taka-Aki Nakada, Ryuzo Abe, Taku Oshima, Tadanaga Shimada, Masahiro Harada, Takeshi Takahashi, Hiroshi Ono, Toshihiro Sakurai, Takayuki Shibusawa, Yoshifumi Kimizuka, Akihiko Kawana, Tomoya Sano, Chie Watanabe, Ryohei Suematsu, Hisako Sageshima, Ayumi Yoshifuji, Kazuto Ito, Saeko Takahashi, Kota Ishioka, Morio Nakamura, Makoto Masuda, Aya Wakabayashi, Hiroki Watanabe, Suguru Ueda, Masanori Nishikawa, Yusuke Chihara, Mayumi Takeuchi, Keisuke Onoi, Jun Shinozuka, Atsushi Sueyoshi, Yoji Nagasaki, Masaki Okamoto, Sayoko Ishihara, Masatoshi Shimo, Yoshihisa Tokunaga, Yu Kusaka, Takehiko Ohba, Susumu Isogai, Satoru Fukuyama, Yoshihiro Eriguchi, Akiko Yonekawa, Keiko Kan-o, Koichiro Matsumoto, Kensuke Kanaoka, Shoichi Ihara, Kiyoshi Komuta, Yoshiaki Inoue, Shigeru Chiba, Kunihiro Yamagata, Yuji Hiramatsu, Hirayasu Kai, Koichiro Asano, Tsuyoshi Oguma, Yoko Ito, Satoru Hashimoto, Masaki Yamasaki, Yu Kasamatsu, Yuko Komase, Naoya Hida, Takahiro Tsuburai, Baku Oyama, Minoru Takada, Hidenori Kanda, Yuichiro Kitagawa, Tetsuya Fukuta, Takahito Miyake, Shozo Yoshida, Shinji Ogura, Shinji Abe, Yuta Kono, Yuki Togashi, Hiroyuki Takoi, Ryota Kikuchi, Shinichi Ogawa, Tomouki Ogata, Shoichiro Ishihara, Arihiko Kanehiro, Shinji Ozaki, Yasuko Fuchimoto, Sae Wada, Nobukazu Fujimoto, Kei Nishiyama, Mariko Terashima, Satoru Beppu, Kosuke Yoshida, Osamu Narumoto, Hideaki Nagai, Nobuharu Ooshima, Mitsuru Motegi, Akira Umeda, Kazuya Miyagawa, Hisato Shimada, Mayu Endo, Yoshiyuki Ohira, Masafumi Watanabe, Sumito Inoue, Akira Igarashi, Masamichi Sato, Hironori Sagara, Akihiko Tanaka, Shin Ohta, Tomoyuki Kimura, Yoko Shibata, Yoshinori Tanino, Takefumi Nikaido, Hiroyuki Minemura, Yuki Sato, Yuichiro Yamada, Takuya Hashino, Masato Shinoki, Hajime Iwagoe, Hiroshi Takahashi, Kazuhiko Fujii, Hiroto Kishi, Masayuki Kanai, Tomonori Imamura, Tatsuya Yamashita, Masakiyo Yatomi, Toshitaka Maeno, Shinichi Hayashi, Mai Takahashi, Mizuki Kuramochi, Isamu Kamimaki, Yoshiteru Tominaga, Tomoo Ishii, Mitsuyoshi Utsugi, Akihiro Ono, Toru Tanaka, Takeru Kashiwada, Kazue Fujita, Yoshinobu Saito, Masahiro Seike, Hiroko Watanabe, Hiroto Matsuse, Norio Kodaka, Chihiro Nakano, Takeshi Oshio, Takatomo Hirouchi, Shohei Makino, Moritoki Egi & The Biobank Japan Project

Contributions

A.S., M.M.N. and K.U.L. conceptualized the study. A.S., T.M.A., F.S.D., L.F., S.K.H. and A.T.D. provided the methodology. A.S., T.M.A., F.S.D., L.F., S.R., M.S. and Y. Ogawa performed the formal analysis. C.M.M., Japan COVID-19 Task Force, A.J.F., H.N. and Y. Okada provided resources. Y. Ogawa, H.N. and E.C.B. conducted the investigation. A.S., F.S.D., Y. Ogawa, L.F., H.N., E.C.B. and K.U.L. wrote the original draft of the manuscript. T.M.A., S.R., M.S., C.M.M., S.K.H., A.J.F., A.T.D., A.-K.P., K.B., Y. Okada and M.M.N. reviewed and edited the manuscript. A.S., T.M.A., F.S.D., Y. Ogawa, L.F., S.R., M.S. and E.C.B. performed the visualization. A.S., M.M.N., Y. Okada and K.U.L. provided supervision. A.S., Y. Okada and K.U.L. acquired funding.

Corresponding authors

Correspondence to Axel Schmidt or Kerstin U. Ludwig.

Ethics declarations

Competing interests

K.U.L. is a co-founder of LAMPseq Diagnostics. A.T.D. is a co-founder of Peptide Groove, a company that commercializes statistical HLA-typing approaches. A.-K.P. (institution) has received speaker honoraria from Biogen, Novartis, Roche and UCB. M.M.N. has received fees for membership in the advisory board from HMG Systems Engineering, for membership in the Medical-Scientific Editorial Office of the Deutsches Ärzteblatt, for review activities from the European Research Council, and for serving as a consultant for EVERIS Belgique SPRL in a project of the European Commission (REFORM/SC2020/029); and receives salary payments from Life & Brain and holds shares in Life & Brain. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature thanks Paul McLaren, Cristina Venturini and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Identification of library plate outliers in UK Biobank.

a) The distribution of EBVread+ individuals per 96-well library plate was used to identify 51 library plates with high rates of EBVread+ individuals. Different colors are assigned to regular plates (grey), intermediate plates (above 28.8%, i.e., 2 standard deviations above the mean, up to 80%; yellow), and outlier plates (orange). b) Allele frequencies of common EBV variants were determined in each group, based on aggregated reads (see Supplementary Note 2). c) When library plate barcode IDs were sorted alphabetically, the outlier plates with very high rates of EBVread+ were clustered, along with some of the intermediate plates, further supporting potential batch effects. Two representative library plates (one regular plate containing one sample with a high EBV read count, and one outlier plate with a high number of EBVread+ individuals) are highlighted by circles (left) and shown as examples (right), with plate positions colored as follows: white, no EBV read; blue, at least one EBV read; light blue, highest EBV read count on plate. NA = empty positions.
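The plate grouping in panel a can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function names and the input layout (a mapping from plate ID to 0/1 EBVread+ flags) are hypothetical, and the two cutoffs are simply the values quoted in the caption (28.8% and 80%).

```python
def plate_rates(plates):
    """Fraction of EBVread+ samples per plate.

    `plates` maps a plate ID to a list of 0/1 flags (hypothetical layout);
    empty plates are skipped.
    """
    return {pid: sum(flags) / len(flags) for pid, flags in plates.items() if flags}


def classify_plate(rate, intermediate_cutoff=0.288, outlier_cutoff=0.80):
    """Assign a plate to one of the three groups named in the caption.

    Intermediate plates lie above 28.8% and up to (including) 80%;
    anything above 80% is treated as an outlier plate.
    """
    if rate > outlier_cutoff:
        return "outlier"
    if rate > intermediate_cutoff:
        return "intermediate"
    return "regular"


if __name__ == "__main__":
    demo = {"plate_A": [0] * 80 + [1] * 16, "plate_B": [1] * 90 + [0] * 6}
    for pid, rate in plate_rates(demo).items():
        print(pid, round(rate, 3), classify_plate(rate))
```

The cutoffs are passed as parameters so that a different cohort could supply its own mean-plus-two-standard-deviations threshold.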

Extended Data Fig. 2 Technical validation of GS-based EBV-reads.

Multiple lines of evidence support that individuals with EBV read count =1 are true positives. a) The plotted rank distributions of the allele frequency data (AF; described in Extended Data Fig. 1) illustrate separate trajectories for contaminated outlier plates (orange) and regular plates (grey). b-d) Comparison of individuals with EBV read counts =1 and ≥2, regarding rank distribution of allele frequencies (b), average coverage values across 94 EBV genes (c) and allele frequencies of common EBV variants (d). Note that the rank distribution plotted in (b) is different from that of the outlier plates in (a). e, f) Distribution of EBV read counts in individuals of the two validation cohorts, i.e. validation 1 ((e); GS of 110 European individuals; 26.3% EBVread+) and validation 2 ((f), JCTF, GS of 1,010 East Asian individuals; 39.2% EBVread+). Samples from these cohorts were used for qPCR analyses as shown in Fig. 1. g) Proposed model of EBV life cycle and the correlation with EBVread+ as determined in our study. Figure adapted from ref. ⁷, Springer Nature Ltd.

Extended Data Fig. 3 Analysis of EBVread+ in All of Us cohort.

a) Flow chart showing the generation of different All of Us (AoU) cohorts that were used for subsequent steps of the analysis. Details are provided in the Methods section. The numbers of individuals in each of the six population backgrounds are given for the AoU no outlier cohort. b) Cumulative read coverage across the EBV genome (line smoothed, 500 bp rolling window), for all individuals of the AoU QC cohort. c) Number of individuals within EBV read count groups. d) The first two principal components (PCs) of common genotypes as provided by AoU are displayed for each of the six population backgrounds. e) EBVread+ in relation to the week of the year in which blood samples were collected. Abbreviations: AFR: African, AMR: Admixed American, EAS: East Asian, EUR: European, MID: Middle Eastern, SAS: South Asian.
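The smoothing mentioned for panel b can be illustrated with a rolling mean over per-base coverage. The caption only states the window size (500 bp); the centered window and the truncation at the genome ends used below are assumptions, and `rolling_mean` is a hypothetical helper name.

```python
def rolling_mean(values, window=500):
    """Centered rolling mean over a per-base coverage vector.

    Assumption: the window is centered on each position and is truncated
    at the two ends of the sequence instead of being padded.
    """
    n = len(values)
    # Prefix sums make each window average an O(1) lookup.
    prefix = [0.0]
    for v in values:
        prefix.append(prefix[-1] + v)
    half = window // 2
    smoothed = []
    for i in range(n):
        lo = max(0, i - half)
        hi = min(n, i + (window - half))  # exclusive upper bound
        smoothed.append((prefix[hi] - prefix[lo]) / (hi - lo))
    return smoothed


if __name__ == "__main__":
    coverage = [0, 3, 6, 9]
    print(rolling_mean(coverage, window=3))
```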

Extended Data Fig. 4 Simulation of EBV viral load and the generation of EBV-reads from genome sequencing (GS).

a) We modeled EBV viral load in 500,000 individuals using a log-normal distribution (“ground truth”). This distribution was informed by prior observations on measured viral load in HIV²⁷. The x-axis reflects theoretical units, which could be transferred to biological units if quantified standards were available. The numbers of individuals per unit are plotted on the y-axis. Individuals were assigned to 20% percentile groups (color coded). b) From the simulated viral loads, we sampled “reads” for each individual, using a binomial distribution, with 400 million trials (approximately the average number of sequencing reads available per individual in our study). The probability values for successfully drawing EBV reads were proportional to the viral load of the respective individual. The success rate of the binomial distribution and the parameters of the log-normal distribution shown in a) were manually fitted to match the observed read count distribution in our data (cf. Fig. 1c). c) Zoomed-in view of panel b. d) Within our simulation, EBV viral load increased with increasing numbers of observed EBV reads (read counts of 3 or above are aggregated).
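The simulation described in this caption can be sketched with the Python standard library. Several points are assumptions rather than the authors' settings: the log-normal parameters (`mu`, `sigma`) and the per-unit success probability (`p_per_unit`) are arbitrary placeholders, since the fitted values are not given here; the cohort is reduced to 20,000 simulated individuals to keep the run short; and the binomial draw with 400 million trials is replaced by its Poisson approximation, which is close when the success probability is tiny.

```python
import math
import random


def draw_read_count(rng, expected_reads):
    """Poisson draw (Knuth's method) standing in for Binomial(n, p) with huge n, tiny p."""
    if expected_reads > 50:
        # Normal approximation for rare, very high loads; avoids long loops
        # and floating-point underflow of exp(-expected_reads).
        return max(0, round(rng.gauss(expected_reads, math.sqrt(expected_reads))))
    threshold = math.exp(-expected_reads)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate(n_individuals=20_000, mu=-1.0, sigma=1.5,
             p_per_unit=1e-9, n_reads=400_000_000, seed=0):
    """Return (viral loads, EBV read counts); all parameter values are placeholders."""
    rng = random.Random(seed)
    loads = [rng.lognormvariate(mu, sigma) for _ in range(n_individuals)]
    # Per-read success probability is proportional to the individual's load.
    counts = [draw_read_count(rng, n_reads * min(1.0, p_per_unit * load))
              for load in loads]
    return loads, counts


def mean_load_by_read_count(loads, counts, cap=3):
    """Mean simulated load per read-count group; counts >= cap are aggregated."""
    sums, sizes = {}, {}
    for load, count in zip(loads, counts):
        key = min(count, cap)
        sums[key] = sums.get(key, 0.0) + load
        sizes[key] = sizes.get(key, 0) + 1
    return {key: sums[key] / sizes[key] for key in sorted(sums)}


if __name__ == "__main__":
    loads, counts = simulate()
    positive = sum(1 for c in counts if c > 0) / len(counts)
    print("fraction with >=1 read:", round(positive, 3))
    print("mean load by read count:", mean_load_by_read_count(loads, counts))
```

Under these placeholder settings, the grouped means give a quick check of the qualitative claim in panel d, namely that individuals with more observed reads tend to have higher simulated loads.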

Extended Data Fig. 5 Correlation of GS-based EBV-reads and individual measurements of four EBV-related antibodies.

7,338 individuals of the UKB EUR cohort were seropositive for EBV, based on the detection of at least 2 out of 4 EBV-related antibodies. For IgG antibodies against a) EA-D, b) EBNA-1, c) VCA-p18 and d) ZEBRA, individuals were assigned to deciles based on median fluorescence intensity (MFI), and the deciles were tested for significant correlation with the 0/1-encoded EBVread+ status using Spearman correlation coefficients (ρ) and two-sided P values (P). **Exact P value not available due to computational limits. Bar sizes indicate overall fractions of EBVread+ individuals within the respective deciles (left y-axis), with colors representing different EBV read count groups (legend at bottom). Dots represent average MFI values per decile (right y-axis labels). Analysis was performed on raw measurement data, without adjustment for covariates.
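As an illustration of the decile-based test described in this legend, the following sketch assigns individuals to MFI deciles and computes Spearman's ρ against a 0/1 status vector. It is an assumption-laden example rather than the analysis code used in the study: the function names are hypothetical, ties receive average ranks, and no P value is computed.

```python
import math


def assign_deciles(mfi_values):
    """Assign each individual to a decile (1-10) by rank of MFI."""
    n = len(mfi_values)
    order = sorted(range(n), key=lambda i: mfi_values[i])
    deciles = [0] * n
    for position, index in enumerate(order):
        deciles[index] = position * 10 // n + 1
    return deciles


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        tied_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = tied_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns NaN if either input has zero variance."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return float("nan")
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

With a list of MFI values `mfi` and a matching list of 0/1 statuses `ebv_read_positive` (both hypothetical names), `spearman(assign_deciles(mfi), ebv_read_positive)` gives the coefficient. Because the status is binary and deciles are heavily tied, average ranks matter for the result.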

Extended Data Fig. 6 Correlation of effect sizes of EBVread+ GWAS lead variants.

We compared the results of the main analysis (EBVread+: controls: 0 EBV reads vs. cases: 1–18 EBV reads) to (i) three different case-control definitions based on EBV read counts in UKB (1 read vs. 2–18 reads, 0 reads vs. 1 read, 0 reads vs. 2–18 reads), (ii) different sets of covariates in UKB (“basic”, “no blood”, “w_hla”; see Supplementary Table 4), (iii) male- and female-specific analyses in UKB, (iv) HHV7read+ in UKB and (v) external GWAS: memory B cell absolute counts (from GCST90001407³⁰, no MHC-data), and EBV-related serology data for four antibodies (Methods). Point estimates of effect sizes (beta) are color-coded for (a) 54 conditionally independent HLA-alleles and (b) the lead variants at 27 non-MHC loci. +/− illustrates the direction of effect and the +/− font is faded grey if the individual association was not nominally significant. Grey boxes indicate missing data. In (c), Spearman’s correlations and respective P values (two-sided) were calculated between all pairs of traits, based on effect sizes and alleles. Correlation coefficients (ρ) are shown for HLA-alleles (bottom triangle) and non-MHC loci (upper triangle). * P < 0.05; ** P < 0.001; NA, not available. Numbers of individuals as well as association statistics used to calculate correlation of effect sizes are given for each trait in Supplementary Table 12.

Extended Data Fig. 7 scDRS analysis as in Fig. 3, using more fine-grained cell annotation.

a) UMAP representation plot of the 1M-scBloodNL data (v3) colored according to cluster labels of cell type annotation level 2. b) Distribution of normalized single-cell disease relevance scores (scDRS) across cell types of annotation level 2, sorted according to the largest average score. White bars indicate the median scDRS. c) Results of the Monte Carlo (MC)-based statistical inference of cell type association (left) and within-cell type heterogeneity (right) with scDRS based on EBVread+. Bar colors represent significance, with purple color indicating a multiple comparison-adjusted false discovery rate (FDR) < 0.05. Further information is provided in Methods and Fig. 3.

Extended Data Fig. 8 Prediction of EBVread+ using Genetic Risk Scores and Phenome-wide association studies (PheWAS).

a) Individuals from the UKB serology target cohort (n = 6,063, unrelated) were stratified according to EBV read counts in the GS data, and the distributions of specific GRSs within these groups are shown as boxplots (median (thick line), 25th and 75th percentile (box) and largest/smallest value no further from the box than 1.5 times the interquartile range (whiskers)). b) Scatter plots of individual GRSs (indicated by the axis labels) illustrate the correlation structures between HLA all, HLA MHC-I, and HLA MHC-II. Only weak correlation was observed between the GRS encompassing HLA-alleles from MHC class I vs those from MHC class II (Pearson correlation). c) In analogy to Fig. 4e, improvements in Nagelkerke’s R² relative to base models within the UKB serology target cohort (extreme left bar of each GRS category), and the six continental ancestries in AoU for the indicated GRSs are given (abbreviations as in Extended Data Fig. 3). Sample sizes are provided within the panel and error bars correspond to standard deviation derived from n = 1,000 bootstrap iterations. d) PheWAS using HLA all and SNP wo MHC in analogy to Fig. 4f. In addition to annotating all significant PheWAS associations, the association identified in UKB with NHL (HLA all) is encircled. P values were calculated using logistic regression, with adjustment for covariates and likelihood ratio tests (Methods).
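For readers unfamiliar with the metric in panel c, the sketch below computes Nagelkerke's R² from model log-likelihoods and a bootstrap standard deviation for an arbitrary statistic. It is a generic illustration under stated assumptions: "improvement relative to base models" is taken here to mean the difference in R² between a model with and without the GRS, and the function names and the resampling of whole records are hypothetical choices, not details confirmed by the legend.

```python
import math
import random
import statistics


def nagelkerke_r2(loglik_null, loglik_model, n):
    """Nagelkerke's R^2 from the log-likelihoods of the intercept-only and fitted models."""
    cox_snell = 1.0 - math.exp(2.0 * (loglik_null - loglik_model) / n)
    max_cox_snell = 1.0 - math.exp(2.0 * loglik_null / n)  # upper bound of Cox-Snell R^2
    return cox_snell / max_cox_snell


def r2_improvement(loglik_null, loglik_base, loglik_full, n):
    """Assumed definition: R^2 of the model with the GRS minus R^2 of the base model."""
    return nagelkerke_r2(loglik_null, loglik_full, n) - nagelkerke_r2(loglik_null, loglik_base, n)


def bootstrap_sd(records, statistic, n_boot=1000, seed=0):
    """Standard deviation of a statistic over bootstrap resamples of the records."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        resample = rng.choices(records, k=len(records))  # sample with replacement
        estimates.append(statistic(resample))
    return statistics.stdev(estimates)
```

In practice, `statistic` would refit the base and full logistic models on each resample and return `r2_improvement(...)`. The fitting step is omitted here because it depends on the covariates described in the Methods.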

Extended Data Fig. 9 Analysis of HLA-DRB1*15:01 associations across datasets.

Point estimates of effect sizes (beta) and 95% confidence intervals (unadjusted) for the major multiple sclerosis (MS) risk allele HLA-DRB1*15:01, across different EBV-associated traits and multiple case-control definitions in the present study. For comparisons, we also extracted these values from a recent MS GWAS in which HLA-alleles were present³⁶. Highlighted in bold are analyses in which the association of HLA-DRB1*15:01 reached genome-wide significance. Sample sizes are given within the panel with case-control numbers for binary traits and total numbers for continuous traits.

Supplementary information

Supplementary Information

This file contains Supplementary Notes 1–15, including Supplementary Figs 1–9.

Reporting Summary

Supplementary Tables

Supplementary Tables 1–23.

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Schmidt, A., Alawathurage, T.M., David, F.S. et al. Host control of persistent Epstein–Barr virus infection. Nature (2026). https://doi.org/10.1038/s41586-026-10274-4

Received: 16 July 2025
Accepted: 12 February 2026
Published: 19 February 2026
Version of record: 01 April 2026
DOI: https://doi.org/10.1038/s41586-026-10274-4