Genome-wide association study of long COVID

Lammi, Vilma; Nakanishi, Tomoko; Jones, Samuel E.; Andrews, Shea J.; Karjalainen, Juha; Cortés, Beatriz; O’Brien, Heath E.; Ochoa-Guzman, Ana; Fulton-Howard, Brian E.; Broberg, Martin; Haapaniemi, Hele H.; Kanai, Masahiro; Pirinen, Matti; Schmidt, Axel; Mitchell, Ruth E.; Mousas, Abdou; Mangino, Massimo; Huerta-Chagoya, Alicia; Sinnott-Armstrong, Nasa; Cirulli, Elizabeth T.; Vaudel, Marc; Kwong, Alex S. F.; Maiti, Amit K.; Marttila, Minttu M.; Posner, Daniel C.; Rodriguez, Alexis A.; Batini, Chiara; Minnai, Francesca; Dearman, Anna R.; Warmerdam, C. A. Robert; Sequeros, Celia B.; Winkler, Thomas W.; Jordan, Daniel M.; Rešcenko, Raimonds; Miano, Lorenzo; Lane, Jacqueline M.; Chung, Ryan K.; Guillen-Guio, Beatriz; Leavy, Olivia C.; Carvajal-Silva, Laura; Aguilar-Valdés, Kevin; Frangione, Erika; Guare, Lindsay; Vergasova, Ekaterina; Marouli, Eirini; Striano, Pasquale; Zainulabid, Ummu Afeera; Kumar, Ashutosh; Ahmad, Hajar Fauzan; Edahiro, Ryuya; Azekawa, Shuhei; Luoh, Shiuh-Wen; Erikstrup, Christian; Pedersen, Ole B. V.; Lerner-Ellis, Jordan; Colombo, Alicia; Grzymski, Joseph J.; Ishii, Makoto; Okada, Yukinori; Beckmann, Noam D.; Kumari, Meena; Wagner, Ralf; Heid, Iris M.; John, Catherine; Short, Patrick J.; Magnus, Per; Ansone, Laura; Valenti, Luca V. C.; Lee, Sulggi A.; Wain, Louise V.; Verdugo, Ricardo A.; Banasik, Karina; Geller, Frank; Franke, Lude H.; Rakitko, Alexander; Duncan, Emma L.; Renieri, Alessandra; Tsilidis, Konstantinos K.; de Cid, Rafael; Niavarani, Ahmadreza; Abner, Erik; Tusié-Luna, Teresa; Verma, Shefali S.; Smith, George Davey; Timpson, Nicholas J.; Madduri, Ravi K.; Cho, Kelly; Daly, Mark J.; Ganna, Andrea; Schulte, Eva C.; Richards, J. Brent; Ludwig, Kerstin U.; Marks-Hultström, Michael; Zeberg, Hugo; Ollila, Hanna M.

doi:10.1038/s41588-025-02100-w

Download PDF

Article
Open access
Published: 21 May 2025

Genome-wide association study of long COVID

Nature Genetics volume 57, pages 1402–1417 (2025)Cite this article

70k Accesses
32 Citations
584 Altmetric
Metrics details

Subjects

Abstract

Infections can lead to persistent symptoms and diseases such as shingles after varicella zoster or rheumatic fever after streptococcal infections. Similarly, severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) infection can result in long coronavirus disease (COVID), typically manifesting as fatigue, pulmonary symptoms and cognitive dysfunction. The biological mechanisms behind long COVID remain unclear. We performed a genome-wide association study for long COVID including up to 6,450 long COVID cases and 1,093,995 population controls from 24 studies across 16 countries. We discovered an association of FOXP4 with long COVID, independent of its previously identified association with severe COVID-19. The signal was replicated in 9,500 long COVID cases and 798,835 population controls. Given the transcription factor FOXP4’s role in lung physiology and pathology, our findings highlight the importance of lung function in the pathophysiology of long COVID.

Epidemiology, clinical presentation, pathophysiology, and management of long COVID: an update

Article 25 July 2023

The long-term health outcomes, pathophysiological mechanisms and multidisciplinary management of long COVID

Article Open access 01 November 2023

Long COVID: major findings, mechanisms and recommendations

Article 13 January 2023

Main

The coronavirus disease 2019 (COVID-19) pandemic has led to the recognition of a new condition known as postacute sequelae of COVID-19 (PASC), post-COVID-19 condition or long COVID. The World Health Organization’s definition includes any symptoms that present typically within three months after COVID-19 and persist for at least two months¹. Common symptoms include fatigue, pulmonary dysfunction, muscle and chest pain, dysautonomia and cognitive disturbances^2,3,4,5,6. The incidence of long COVID varies widely, with estimates in severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2)-infected individuals ranging from 10% to 70%⁷. Long COVID is more common in individuals who have been hospitalized or treated at the intensive care unit due to COVID-19 (refs. ^7,8). However, long COVID can also occur in those with initially mild COVID-19 symptoms⁹. Moreover, several mechanisms may contribute to long COVID, including alterations of the serotonin system that may be related to cognitive changes¹⁰, mitochondrial mechanisms to fatigue¹¹ and mechanisms involving complement and platelet activation to vascular disease observed in patients with long COVID¹².

The COVID-19 Host Genetics Initiative (COVID-19 HGI) was launched to investigate host genetics in COVID-19 susceptibility, hospitalization and critical illness^13,14,15,16. These findings implicate canonical pathways involved in viral entry, mucosal airway defense and type I interferon response^15,16,17,18.

To elucidate biological mechanisms behind long COVID, we conducted a genome-wide association study (GWAS) and replication in 33 cohorts across 19 countries, totaling 15,950 individuals with long COVID and 1,892,830 controls (Fig. 1).

**Fig. 1: Geographic overview of studies contributing to the Long COVID HGI.**

Results

Genetic variants in FOXP4 locus associated with long COVID

We performed a meta-analysis of 24 independent GWAS of long COVID using two case definitions and two control definitions. A strict long COVID case definition required having an earlier test-verified SARS-CoV-2 infection (strict case definition), while a broader long COVID case definition also included self-reported or clinician-diagnosed SARS-CoV-2 infection (broad case definition). The broad definition included all contributing studies, whereas the strict definition included 11 studies (Supplementary Tables 11 and 12). Controls were either population controls, or participants that had recovered from SARS-CoV-2 infection without long COVID (strict control definition; Fig. 1 and Supplementary Tables 11 and 12). Data were obtained from 16 countries, representing populations from six genetic ancestries. The most common symptoms in the questionnaire-based studies were fatigue, shortness of breath and problems with memory and concentration. However, there was some heterogeneity in the frequency of symptoms (Supplementary Fig. 1).

The GWAS meta-analysis using the strict case definition (n = 3,018) and the broad control definition (n = 994,582) identified a genome-wide significant association within the FOXP4 locus (chr6: 41,515,652 G > C, Genome Reference Consortium Human Build 38 (GRCh38), rs9367106, as the lead variant; P = 1.8 × 10⁻¹⁰; Fig. 2 and Supplementary Table 13). The C allele at rs9367106 was associated with an increased risk of long COVID (odds ratio (OR) = 1.63, 95% confidence interval (CI) = 1.40–1.89, risk allele frequency = 4.2%). The association replicated in an independent sample from eight additional contributing cohorts with 5,226 individuals with long COVID and 260,036 population controls (P = 0.025, OR = 1.13, 95% CI = 1.02–1.25; Supplementary Fig. 3d). Furthermore, the lead variants rs9367106 and rs12660421 replicated in the VA Million Veteran Program (MVP) in the strict case analyses with the broad control definition (P = 1 × 10⁻⁴, OR = 1.21, 95% CI = 1.10–1.34, long COVID cases, n = 4,274 and controls, n = 538,799; Supplementary Fig. 3e,f) and with the strict control definition (P = 0.0018, OR = 1.17, 95% CI = 1.06–1.29, long COVID cases, n = 4,274 and controls, n = 73,739; Supplementary Fig. 3g,h).

**Fig. 2: Meta-analysis of 11 GWAS studies of long COVID shows an association at the *FOXP4* locus.**

We observed an association, albeit not genome-wide significant, with rs9367106-C and long COVID also in all other three meta-analyses, including our largest meta-analysis with the broad case definition (n = 6,450) and the broad control definition (n = 1,093,995) from 24 studies (OR = 1.34, 95% CI = 1.20–1.49, P = 1.1 × 10⁻⁷; Supplementary Figs. 2 and 3). Analyses with the strict case definition (n = 2,964) and strict control definition (n = 37,935; OR = 1.30, 95% CI = 1.09–1.56, P = 3.8 × 10⁻³), and with the broad case definition (n = 6,396) and strict control definition (n = 46,208; OR = 1.16, 95% CI = 1.02–1.32, P = 0.023), further supported our findings (Supplementary Fig. 3).

To examine the consistency of the FOXP4 signal across the contributing studies, we investigated the effect in each study (Fig. 2b). Genetic variants in the meta-analysis had varying statistical power due to missingness, due to genotyping and imputation quality, and due to differences in allele frequency differences between populations. Therefore, the genetic variant that was present in majority of the studies was the most statistically significant variant, not necessarily because it is the causal variant but because it had the best statistical power. We, therefore, examined the effect size of variants within 30 kb around the lead variant (rs9367106, r² > 0.01 in individuals of Europeans in the Human Genome Diversity Project¹⁹ and 1000 Genomes Project^20,21) and effective sample size of at least one-third the sample size of the lead variant. Through this analysis, we identified a haplotype spanning the genomic region chr6:41,512,355–41,537,458 located upstream of FOXP4 gene (Fig. 3d), for which variants had P values less than 5 × 10⁻⁷ (Fig. 3a) and effect sizes similar to the lead variant across ancestries (Fig. 3b,c). This analysis identified 15 variants (Supplementary Table 14). Relying on linkage disequilibrium (LD) in the 1000 Genomes Project across African, East Asian European, admixed American and South Asian populations, we found 18 variants cosegregating with the lead variant with tightest LD at the end of the haplotype (r² > 0.5; Supplementary Table 15). Nine variants overlapped between these two analyses.

**Fig. 3: The chromosome 6 region (chr6: 41,490,001–41,560,000 (70 kb); *FOXP4* locus) in the long COVID GWAS meta-analysis.**

Frequency of long COVID variants varies across ancestries

The allele frequency of rs9367106-C at the FOXP4 locus varied across the study populations ranging from 1.6% in non-Finnish Europeans to 7.1% in Finnish, 19% in admixed Americans and 36% in East Asians (Supplementary Fig. 4; https://gnomad.broadinstitute.org/variant/6-41515652-G-C?dataset=gnomad_r3). Most of the contributing studies comprised individuals of European ancestry (Supplementary Fig. 5). Despite smaller sample sizes, we observed significant associations in admixed American, East Asian and Finnish ancestries (Fig. 2b), owing to the higher allele frequency, and thus larger statistical power to detect an association with the rs9367106 variant in these cohorts.

Risk variants, FOXP4 expression and COVID-19 severity

We next investigated whether the long COVID variants were associated with differential expression of any of the surrounding genes within a 100-kb window (FOXP4, FOXP4-AS1, LINC01276 and MIR4641). We found that rs12660421-A is associated with an increase in FOXP4 expression in the lung (P = 5.3 × 10⁻⁹, normalized effect size (NES) = 0.56) and in the hypothalamus (P = 2.6 × 10⁻⁶, NES = 1.4; Fig. 4a and Supplementary Fig. 6; GTEx, https://gtexportal.org/home/snp/rs12660421). Furthermore, there were no additional expression quantitative trait loci (eQTL) or colocalization with the expression of FOXP4-AS1 (Supplementary Table 16). FOXP4 (HUGO Gene Nomenclature Committee ID: 20842) is a transcription factor gene that has a broad tissue expression pattern and is expressed in nearly all tissues, with the highest expression in the cervix, the thyroid, the vasculature, the stomach and the testis²². The expression also spans a broad set of cell types, including endothelial lung cells, immune cells and myocytes²³. A colocalization analysis suggested that the association signal of long COVID is the same signal that associates with the differential expression of FOXP4 in the lung (posterior probability = 0.91; Supplementary Fig. 7a,b and Supplementary Table 17).

**Fig. 4: *FOXP4* expression in the lung.**

Furthermore, variants in the FOXP4 region have also been identified as risk factors for COVID-19 hospitalization, colocalizing with FOXP4 expression eQTL in the COVID-19 HGI meta-analyses and follow-up studies^16,24 (Supplementary Fig. 8 and Supplementary Table 18). Our colocalization analysis demonstrated the FOXP4 association identified here as the same association identified for COVID-19 severity (posterior probability > 0.97; Supplementary Fig. 7e,f and Supplementary Table 17).

FOXP4 expression in blood is associated with long COVID

To understand whether higher FOXP4 expression was seen in long COVID, we collected blood samples from participants with or without active SARS-CoV-2 infection. We discovered that the higher FOXP4 levels in nonacute COVID-19 samples were associated with increased risk of long COVID (OR = 2.31 per 1 s.d. increase in FOXP4 expression, 95% CI = 1.27–4.22, P = 0.0063; Supplementary Fig. 9), while FOXP4 levels in acute COVID-19 samples were not associated with long COVID (P = 0.62). This is orthogonal evidence to the genetic signal that higher FOXP4 levels may lead to long COVID.

FOXP4 expression in alveolar and immune cells in the lung

As lung tissue consists of several cell types, we wanted to elucidate the relevant cells that express FOXP4 and may contribute to long COVID. We analyzed single-cell sequencing data from the Tabula Sapiens, a previously published atlas of single-cell sequencing data in healthy individuals free of COVID-19²⁵. FOXP4 expression was the highest in type 2 alveolar cells in individuals without SARS-CoV-2 infection (Fig. 4c) and during active infection (Supplementary Fig. 10), suggesting that SARS-CoV-2 infection was not required for FOXP4 expression. Furthermore, type 2 alveolar cells are capable of mounting robust innate immune responses, thus participating in the immune regulation in the lung. Additionally, type 2 alveolar cells secrete surfactant, keep the alveoli free from fluid, and serve as progenitor cells repopulating damaged epithelium after injury²⁶. In addition, we observed nearly equally high expression of FOXP4 in granulocytes that similarly participate in the regulation of innate immune responses. Overall, the findings suggest a possible role of both immune and alveolar cells in the lung and higher expression of FOXP4 in long COVID.

FOXP4 variants located at active chromatin in the lung

To understand the possible causal variation at the FOXP4 locus, we performed statistical fine mapping using SLALOM²⁷ (Supplementary Note). There were nine variants within the 95% credible set with the maximum posterior probability of 0.28 for rs9381074 (Supplementary Fig. 11). Given the strong LD pattern among the nine variants within the credible set, fine mapping alone might not be able to pinpoint a single causal variant in this locus. Therefore, to understand possible functional regulatory effects behind the variant association, we used the data from the Regulome database^28,29, ENCODE³⁰ and VannoPortal³¹. While the majority of the long COVID variants were at active enhancer or transcription factor binding sites, four variants had direct evidence of transcription factor binding based on chromatin immunoprecipitation sequencing experiments (Supplementary Tables 19 and 20). One of these variants (rs9381074) was directly located on a region that had DNA methylation marks across multiple tissues, including immune and lung cells (H3K27me3 and H3K4me1, H3K4me3, H3K27ac, H3K4me2 and H3K4me3), and had evidence of transcriptional activity from 49 different transcription factors, of which we saw the most consistent direct binding of FOXA1 across 55 experiments. Furthermore, we downloaded DNase sequencing data from the ENCODE project and observed that rs9381074 was directly positioned on a DNase hypersensitivity site in the lung (Supplementary Note). Finally, this variant is the same variant implicated by statistical fine mapping, suggesting the rs9381074 variant as the causal variant for association at the FOXP4 locus.

FOXP4 variant associated with lung cancer

To understand the role of FOXP4 and its associations across diseases, we performed phenome-wide association analysis. We first focused on Biobank Japan³², as the long COVID risk allele frequency is highest in East Asia. Phenome-wide association study (PheWAS) between rs9367106 and all phenotypes in Biobank Japan (n = 262) revealed that long COVID risk allele was associated with lung cancer (P = 1.2 × 10⁻⁶, Bonferroni P = 3.1 × 10⁻⁴, OR = 1.13, 95% CI = 1.07–1.18; Supplementary Fig. 8 and Supplementary Table 18). Furthermore, the long COVID risk allele is in LD with the known risk variants for non-small cell lung carcinoma in Chinese and European populations³³ (rs1853837, r² = 0.88 in East Asians³⁴) and for lung cancer in never-smoking Asian women³⁵ (rs7741164, r² = 0.98 in East Asians³⁴). Colocalization analysis supported that the associations in this locus (within 500 kb of rs9367106) for long COVID and lung cancer shared the same genetic signal (colocalization posterior probability = 0.98; Supplementary Fig. 7c,d). COVID-19 phenotypes and lung cancer traits were the only associations found with linked variants in the GWAS Catalog (Supplementary Table 21).

We then broadened the analysis to other cohorts. Using data from FinnGen and Open Targets, we observed a robust gene level PheWAS association with prostate cancer, immune traits including reticulocytes and chronotype (Supplementary Tables 22–24). Moreover, colocalization analysis provided by Open Targets showed that FOXP4 expression and FOXP4 splice QTLs colocalized with blood count traits specifically in the blood and the thyroid, but the blood count traits did not colocalize with the expression in the lung (Supplementary Table 25). These findings suggest that separate regulatory variation may contribute to tissue-specific expression and the control of otherwise ubiquitously expressed FOXP4 and contribute to trait associations in a tissue-specific manner.

Long COVID and other phenotypes

We investigated the relationship between long COVID and cardiometabolic, behavioral and psychiatric traits³⁶ (Fig. 5 and Supplementary Table 26). We found positive genetic correlations between long COVID and insomnia symptoms, depression, risk tolerance, asthma, diabetes and SARS-CoV-2 infection, while we saw negative correlations with red and white blood cell counts (Fig. 5a). However, identified correlations were only nominally significant without multiple testing correction (P < 0.05; Supplementary Table 27). The observed scale heritability estimates of long COVID ranged from 0.97% to 12.36% (s.e. = 0.0362), with the highest heritability in the strict case and strict control definitions (Supplementary Table 28).

**Fig. 5: Genetic correlations and MR causal estimates between long COVID and potential risk factors, biomarkers and diseases.**

We used Mendelian randomization (MR) to estimate potential risk factors by analyzing the same traits mentioned above (Supplementary Table 26). Genetically predicted earlier smoking initiation (P = 0.022), more cigarettes consumed per day (P = 0.046), higher levels of high-density lipoproteins (P = 0.029) and higher body mass index (P = 0.046) were nominally significant causal risk factors of long COVID (Fig. 5b and Supplementary Table 29). However, none of these associations survived correction for multiple comparisons.

FOXP4 signal not explained simply by COVID-19 severity

Earlier research has suggested that COVID-19 severity is a risk factor for long COVID^8,37,38,39 and FOXP4 variants have earlier been implicated in COVID-19 severity⁶. Our initial GWAS and robust replication across different cohorts show FOXP4 variants also associated with long COVID. However, the results pose an interesting question of whether the mechanism of FOXP4 association with long COVID is the same mechanism that contributes to COVID-19 severity. We thus investigated the relationship between COVID-19 hospitalization and long COVID by performing a two-sample MR (Supplementary Table 30). In terms of causality, we caution that COVID-19 hospitalization as causal exposure is difficult to interpret because both long COVID and COVID-19 hospitalization are two outcomes of the same underlying infection. Nevertheless, the relationship between the effect size for long COVID versus the effect size for COVID-19 severity can shed some light on the role of COVID-19 severity in long COVID. To perform two-sample MR without overlapping samples, we have excluded the studies that contributed to the current long COVID freeze 4 and computed a meta-analysis of SARS-CoV-2 infection susceptibility and COVID-19 hospitalization of the remaining cohorts in the COVID-19 HGI. We observed a causal relationship of susceptibility and hospitalization on long COVID (strict case and broad control definition; inverse variance-weighted (IVW) MR, P = 1.8 × 10⁻⁷ for infection and P = 4.8 × 10⁻⁸ for hospitalization) with no evidence of pleiotropy (MR–Egger intercept P = 0.47 and 0.83, respectively; Fig. 5c,d and Supplementary Table 30). Furthermore, sensitivity analysis by leaving one variant out (Supplementary Table 31), or by including long COVID cohorts with European-ancestry only (Supplementary Table 32), both supported a robust causal association between COVID hospitalization and long COVID. Nevertheless, the Wald ratio of long COVID to COVID-19 hospitalization for the FOXP4 variant is 1.97 (95% CI = 1.36–2.57), which is significantly greater than the slope of the MR-estimated relationship between COVID-19 hospitalization and long COVID (0.35, 95% CI = 0.12–0.57). Furthermore, adjusting or stratifying the long COVID GWAS for hospitalization did not explain the association between FOXP4 and long COVID (Supplementary Table 33a).

Thus, the FOXP4 signal demonstrates a stronger association with long COVID than expected, meaning that it cannot simply be explained by its association with either susceptibility or severity of the acute disease alone (Fig. 5c,d). A recent systematic review of epidemiological data found a positive association between COVID-19 hospitalization and long COVID with a relationship on a log-odds scale of 0.91 (95% CI = 0.68–1.14)⁴⁰. Even assuming this stronger relationship between COVID-19 hospitalization and long COVID, the observed effect of the FOXP4 variant on long COVID still exceeds what would be expected based on the association with severity alone.

When SARS-CoV-2 infection is required for COVID-19 disease, and for severe COVID-19, an important question is whether all genetic variants that increase COVID-19 susceptibility or severity are equally large risk factors for long COVID. Bayesian methods provide an opportunity to estimate whether some variants that affect COVID-19 susceptibility or severity systematically contribute to the risk of long COVID more than the other variants. To answer this question, we estimated the posterior probabilities for all susceptibility and severity variants for long COVID using four models—susceptibility/severity only, long COVID only and two models for joint effects that differed in their slopes. We observed that for COVID-19 susceptibility, the 3p21.31 locus and the ABO locus contributed to both susceptibility and long COVID with a high posterior probability (Fig. 5e and Supplementary Table 34). Moreover, while many severity variants are also likely to contribute to long COVID, their slope between long COVID and severity effects was smaller than that of FOXP4 (Fig. 5f and Supplementary Table 35).

Finally, previous studies have shown a potential effect of vaccination, strain and severity on long COVID^{5,7,41,42,43,44}. To clarify these factors with long COVID, we used data from additional cohorts, including FinnGen. We observed that, while adjusting for severity or vaccination status did not remove the signal, there was a possible stronger risk of FOXP4 risk alleles before vaccination and with wild-type and Alpha strains (Supplementary Table 33 b,c). A significant association of the FOXP4 locus with long COVID in individuals before vaccination was observed. Although the effect remained positive postvaccination (OR = 1.3), the lack of significant association in these cases may be influenced by the relatively small sample size of individuals diagnosed with long COVID after vaccination (n = 40; Supplementary Table 33 b). Earlier epidemiological studies have shown that immunization against COVID-19 is associated with a reduced risk of long COVID^43,44,45. Our data are in line with these earlier observations. Furthermore, we sought replication for the strain association in the Estonian Biobank, where higher risk was also observed with earlier strains, particularly the Alpha strain (P = 0.0138).

The possible time-dependent association with strain prompted us to explore the temporal relationship between FOXP4 and long COVID from the start of the year 2020 till the spring of 2023. Using data from 3,684 individuals with long COVID from FinnGen, we observed a significant temporal association with the Cox proportional hazards model (HR = 1.3, 95% CI = 1.1–1.7, P = 0.005, n_{population controls} = 496,664; Supplementary Fig. 12). Moreover, particularly homozygosity for the FOXP4 risk allele increased the risk for long COVID (recessive P = 2.3 × 10⁻⁴, OR = 5.64, 95% CI = 2.25–14.17). Moreover, we observed a consistently higher risk allele homozygosity among long COVID cases in the Estonian Biobank and MexGene-COVID (Supplementary Note). Overall, these results indicate a temporal relationship with FOXP4 risk variants on long COVID and higher risk with homozygosity and earlier viral strains. In all these analyses, FOXP4 stood out as an independent risk factor for long COVID.

FOXP4 associates with multiple symptoms of long COVID

We aimed to investigate the symptomatic associations between FOXP4 and long COVID. We focused on well-established components of long COVID as documented in earlier literature⁷. Using symptom data from the two largest cohorts, FinnGen and MVP, we re-examined the association of FOXP4 with long COVID, requiring lifetime symptoms from any of the previously identified subtypes. Our analysis revealed consistent associations across both MVP and FinnGen cohorts, with fatigue and asthma diagnoses, and β-adrenergic and proton pump inhibitor medication showing significant associations in the meta-analysis of the two cohorts (Supplementary Fig. 13 and Supplementary Table 36). The replication of these associations in datasets from two different countries, with distinct healthcare settings and patient populations, strengthens the robustness of the link between FOXP4 and the plethora of manifestations of long COVID.

Discussion

In this study, we aimed to understand the host genetic factors that contribute to long COVID, using data from 24 studies across 16 countries and replicating in independent cohorts. Our analysis identified genetic variants within the FOXP4 locus as a risk factor for long COVID. The FOXP4 gene is expressed in the lung and the genetic variants associated with long COVID are also associated with differential expression of FOXP4 and with lung cancer and COVID-19 severity. Additionally, using MR, we characterized COVID-19 severity as a causal risk factor for long COVID. Overall, our findings provide genomic evidence consistent with previous epidemiological and clinical reports of long COVID, indicating that long COVID, similarly to other postviral conditions, is a heterogeneous disease entity where likely both individual genetic variants and the environmental risk factors contribute to disease risk.

Our analysis revealed a connection between long COVID and pulmonary endpoints through both individual variants at FOXP4, a transcription factor-coding gene previously linked to lung cancer and COVID-19 severity²⁴, and MR analysis identifying smoking and COVID-19 severity as risk factors. Furthermore, expression analysis of the lung, and cell type-specific single-cell sequencing analysis, showed FOXP4 expression in both alveolar cell types and immune cells of the lung.

FOXP4 belongs to the subfamily P of the forkhead box transcription factor family genes and is expressed in various tissues, including the lungs and the gut^45,46. Moreover, it is highly expressed in mucus-secreting cells of the stomach and intestines⁴⁷, as well as in naïve B, natural killer and memory T_reg cells⁴⁸, and required for normal T cell memory function following infection⁴⁹. FOXP1/FOXP2/FOXP4 are also required for promoting lung endoderm development by repressing expression of nonpulmonary transcription factors⁵⁰, and the loss of FOXP1/FOXP4 adversely affects airway epithelial regeneration⁵¹. Furthermore, FOXP4 has been implicated in airway fibrosis⁵² and the promotion of lung cancer growth and invasion⁵³. We find that the variants associated with long COVID are also associated with lung cancer in Biobank Japan³². These observations together with the present study may suggest that the connection between FOXP4 and long COVID may be rooted in both lung function and immunology. Furthermore, FOXP4 expression in both alveolar and immune cells in the lung, and the association with severe COVID-19 and pulmonary diseases such as cancer, suggests that FOXP4 may participate in local immune responses in the lung.

Our functional analysis further implicated FOXP4 as a risk factor for long COVID, irrespective of the genotype status of the here-identified risk variant. FOXP4 expression levels were higher in individuals with long COVID than controls. Furthermore, we observed a consistent effect of FOXP4 risk variants across ancestries. Moreover, having multiple ancestries enabled us to fine-map a likely causal variant at rs9381074, which was further supported by functional methylation and expression data.

We also discovered a causal relationship between SARS-CoV-2 infection and long COVID, as expected, and an additional causal risk between severe, hospital treatment-requiring COVID-19 and long COVID. This finding is in agreement with earlier epidemiological observations^8,37,38,39. The relationship between COVID-19 severity and long COVID raises an interesting question—when SARS-CoV-2 infection is required for both COVID-19 and severe COVID-19, are all genetic variants that increase COVID-19 susceptibility or severity equally large risk factors for long COVID? In the present study, we aimed to answer this question by examining variant effect sizes between SARS-CoV-2 infection susceptibility, COVID-19 severity and long COVID using stratified and adjusted analyses, and by Bayesian modeling. Among the known SARS-CoV-2 susceptibility loci, ABO and 3p21.31 had a high probability of also contributing to long COVID. Moreover, the FOXP4 variants had higher effect sizes for long COVID than expected based on the other severity variants, suggesting an independent role of FOXP4 for long COVID that was not observed among the other COVID-19 severity variants. Such observation offers clues on biological mechanisms, such as FOXP4 affecting pulmonary function and immunity, which then contribute to the development of long COVID. Overall, our study elucidates genetic risk factors for long COVID, the relationship between long COVID and severe COVID-19, and finally possible mechanisms of how FOXP4 contributes to the risk of long COVID.

Moreover, while several lines of evidence from the original GWAS association, replication, stratified analyses to Bayesian analysis and the significance of individual variants suggest that FOXP4 contributes to long COVID in a stronger way than expected, the mechanism that FOXP4 associates with long COVID may be the same mechanism that contributes to COVID-19 severity. Future studies and iterations of this work will likely grow the number of observed genetic variants and further clarify the biological mechanisms underlying long COVID. We also caution that the genetic predisposition to long COVID might be dependent on SARS-CoV-2 variation and vaccination status, and that a large portion of our data was collected before the omicron wave and widespread vaccination (Supplementary Table 12), which might have an impact on the genetic associations.

The contribution of genetic factors to COVID-19 phenotypes is intriguing. As heritability in general is defined as the proportion of phenotypic variation attributable to genetic differences within a specific environment, in a hypothetical world where every environmental factor would be similar, heritability would theoretically approach 100%. However, as the heritability in infections can be shaped by exposure, viral strain, prophylactics, earlier immunity, for example, through vaccination efforts, or differences in diagnostic criteria, reporting or local recommendations, estimating heritability requires relatively large samples for precise estimates. Similarly, heritability in earlier studies of COVID-19 phenotypes was initially less than 1% for COVID-19 susceptibility, severity and critical illness even with over 46,000 COVID-19 cases and 2 million controls⁶. However, all COVID-19 traits showed robust genetic correlations with the known COVID-19 epidemiological risk factors. In our study, we similarly see low heritability with long COVID, which is a limitation in the current study. Nonetheless, the estimate provides a tool to understand between-trait correlations and will likely become more precise with larger sample sizes.

We recognize that the symptomatology of long COVID is variable and includes, in addition to lung symptoms, also other symptom domains such as fatigue and cognitive dysfunction^7,37,54. In addition, the long-term effects of COVID-19 are still being studied, and more research is needed to understand the full extent of the long-term damage caused by SARS-CoV-2 and long COVID disease. We also recognize that the long COVID diagnosis is still evolving. Nevertheless, our study provides direct genetic evidence that lung pathophysiology can have an integral part in the development of long COVID.

Methods

Contributing studies

Participants of each of the contributing 33 studies provided written informed consent to participate in each respective study, with recruitment and ethics following study-specific protocols approved by their respective institutional review boards (details are provided in Supplementary Table 12).

For the initial discovery analysis, we used data from the following 24 studies: Avon Longitudinal Study of Parents and Children (ALSPAC), Bonn Study of COVID Genetics (BoSCO), Banque québécoise de la COVID-19 (BQC19), Danish Blood Donor Study (DBDS), Extended Cohort for E-health, Environment and DNA (EXCEED), FinnGen, GCAT | Genomes for life, Genetic Bases of COVID-19 Clinical Variability (GEN-COVID), Genotek, Genetics of Long COVID (GOLD), Helix Exome+ and Healthy Nevada Project COVID-19 Phenotypes (Helix), MexGen-COVID Initiative, COVID-19 Ioannina Biobank (Ioannina), Genome-wide assessment of the gene variants associated with severe COVID-19 phenotype in Iran (IrCovid), Japan COVID-19 Task Force (JapanTaskForce), Lifelines, Norwegian Mother, Father and Child Cohort Study (MoBa), Mount Sinai COVID Biobank (MSCIC), Penn Medicine BioBank (PMBB), Follow-UP study of patients with critical COVID-19/COVID-19 Cohort Study of the University Hospital of the Technical University Munich (SweCovid/COMRI), Tirschenreuth Study (TiKoCo), TwinsUK, UK Biobank and Understanding Society—UK Household Longitudinal Study. The total sample size of this Long COVID HGI data freeze 4 was 6,450 long COVID cases, 46,208 COVID-19-positive controls and 1,093,955 population controls (Supplementary Table 12). For the replication of the FOXP4 lead variants, we obtained data from the following nine additional studies: COVID-19 cohort at LGDB (LatviaGDB), COVID-19 Genomics Network (C19-GenoNet), COVID-19 Host Immune Response Pathogenesis Study (CHIRP), Estonian Biobank (EstBB), Fondazione Genomics SARS-CoV-2 Study (FoGS), GENCOV Study (GENCOV), Mass General Brigham Biobank (MGB), The Post-hospitalization COVID-19 study (PHOSP-COVID) and VA MVP. The replication datasets together comprised 9,500 individuals with long COVID and 798,835 population controls (Supplementary Fig. 3d,e and Supplementary Table 12).

The effective sample sizes for each study shown in Fig. 1 were calculated for display using the given formula: (4 × n_case × n_control)/(n_case + n_control). The Long COVID HGI is a global and ongoing collaboration, open to all studies around the world that have data to run long COVID GWAS using our phenotypic criteria described below.

Phenotype definitions

We used the following criteria for assigning case–control status for long COVID aligning with the World Health Organization guidelines¹ (Supplementary Note; https://github.com/long-covid-hg/LongCovidTools/blob/main/PhenotypeDefinitions_LongCOVID_v1.docx). Study participants were defined as long COVID cases if, at least three months since SARS-CoV-2 infection or COVID-19 onset, they met any of the following criteria:

1.
Presence of one or more self-reported COVID-19 symptoms that cannot be explained by an alternative diagnosis
2.
Report of ongoing substantial impact on day-to-day activities
3.
Any diagnosis codes of long COVID (for example, post-COVID-19 condition, ICD-10 code U09(.9))

Criteria 1 and 2 were applied only to questionnaire-based cohorts, whereas 3 was used in studies with electronic health records (EHR). Detailed phenotyping criteria and diagnosis codes of each study are provided in Supplementary Table 12.

We used two long COVID case definitions, a strict definition requiring a test-verified SARS-CoV-2 infection and a broad definition including self-reported or clinician-diagnosed SARS-CoV-2 infection (any long COVID).

We applied two control definitions. First, we used population controls, that is, everybody that is not the case. Population controls were genetic ancestry-matched individuals who were not defined as long COVID cases using the above-mentioned questionnaire or EHR-based definition. In the second analysis, we compared long COVID cases to individuals who had had SARS-CoV-2 infection but who did not meet the criteria of long COVID, that is, had fully recovered within three months from the infection.

We used in total four different case–control definitions to generate four GWASs as below:

1.
Long COVID cases after test-verified SARS-CoV-2 infection versus population controls (the strict case definition versus the broad control definition)
2.
Long COVID within test-verified SARS-CoV-2 infection (the strict case definition versus the strict control definition)
3.
Any long COVID cases versus population controls (the broad case definition versus the broad control definition)
4.
Long COVID within any SARS-CoV-2 infection (the broad case definition versus the strict control definition)

To further investigate the effect of FOXP4 locus on the different manifestations of long COVID⁷ in the FinnGen and MVP datasets, we used combined criteria of any long COVID diagnosis (BB: ICD-10 diagnosis code: U09* (where * can be empty or any string, referring to subdiagnoses)) with lifetime occurrence of specific symptom diagnoses: diabetes (ICD-10: E10*, E11*, E12*, E13*, E14*), fatigue and malaise (ICD-10: R53*, G93.3), asthma (ICD-10: J45*), skin paresthesia (ICD-10: R20.2), β-adrenergic inhalants (Anatomical Therapeutic Chemical (ATC) drug code: R03AC*), headache (ICD-10: R51*), proton pump inhibitors (ATC: A02BC*) or cardiac arrhythmia/abnormalities of heartbeat (ICD-10: I49*, R00*; Supplementary Fig. 13 and Supplementary Table 36). The effect of the risk variant rs9367106-C on long COVID with each symptom or medication was estimated separately using logistic regression, adjusting for age, sex and ten principal components. Finnish ancestry from FinnGen and African, Admixed American and European ancestries from the MVP were first analyzed separately, followed by a meta-analysis and test for heterogeneity.

GWAS

We largely applied the GWAS analysis plans used in the COVID-19 HGI⁶. Each study performed its own sample collection, genotyping, genotype and sample quality control, imputation and association analyses independently, according to our central analysis plan (https://github.com/long-covid-hg/LongCovidTools/blob/main/COVID19HostGenetics_AnalysisPlan_LongCOVID_v1.docx), before submitting the GWAS summary statistic level results for meta-analysis (details are provided in Supplementary Table 12). We recommended that GWASs were run using REGENIE⁵⁷ on chromosomes 1–22 and X, although a minority of the contributing studies used SAIGE⁵⁸ or PLINK2 (ref. ⁵⁹; Supplementary Table 12). The minimum set of covariates to be included at runtime were age, age², sex, age × sex and the first ten genetic principal components. We advised studies to include any additional study-specific covariates where needed, such as those related to genotype batches or other demographic and technical factors that could lead to stratification within the cohort. Studies (n = 2) performing the GWAS using software that does not account for sample relatedness (such as PLINK) were advised to exclude related individuals.

GWAS meta-analyses

The meta-analysis pipeline was also adopted from the COVID-19 HGI flagship paper¹⁶. The code is available at Long COVID HGI GitHub (https://github.com/long-covid-hg/META_ANALYSIS/) and is a modified version of the pipeline developed for the COVID-19 HGI (https://github.com/covid19-hg/META_ANALYSIS). To ensure that individual study results did not suffer from excessive inflation, deflation and false positives, we manually investigated plots of the reported allele frequencies against aggregated gnomAD v3.0 (ref. ⁵⁵) allele frequencies in the same population. We also evaluated whether the association standard errors were excessively small, given the calculated effective sample size, to identify studies deviating from the expected trend. Where these issues were detected, the studies were contacted to reperform the association analysis, if needed, and resubmit their results.

Before the meta-analysis itself, the summary statistics were standardized, filtered (excluding variants with allele frequency <0.1% or imputation INFO score <0.6), lifted over to reference genome build GRCh38 (in studies imputed to GRCh37) and harmonized to gnomAD v3.0 through matching by chromosome, position and alleles (Supplementary Note).

The meta-analysis was performed using a fixed-effects IVW method on variants that were present in at least two studies contributing to the specific phenotype being analyzed. To assess whether one study was primarily driving any associations, we simultaneously ran a leave-most-significant-study-out (LMSSO) meta-analysis for each variant (based on the variant’s study-level P value). Heterogeneity between studies was estimated using Cochran’s Q test⁶⁰. Each set of meta-analysis results was then filtered to exclude variants whose total effective sample size (in the non-LMSSO analysis) was less than one-third of the total effective sample size of all studies contributing to that meta-analysis. We report significant loci that pass the genome-wide significance threshold (P ≤ 5 × 10⁻⁸/4 = 1.25 × 10⁻⁸) accounting for the number of GWAS meta-analyses we performed.

Principal component projection

In a similar fashion to the COVID-19 HGI, we asked each study to project their cohort onto a multiethnic genetic principal component space (Supplementary Fig. 5), by providing studies with precomputed PC loadings and reference allele frequencies from unrelated samples from the 1000 Genomes Project^20,21 and the Human Genome Diversity Project. The loadings and frequencies were generated for a set of 117,221 autosomal, common (minor allele frequency (MAF) ≥ 0.1%) and LD-pruned (r² < 0.8; 500-kb window) SNPs that would be available in the imputed data of most studies. Access to the projecting and plotting scripts was made available to the studies at https://github.com/long-covid-hg/pca_projection.

eQTL, PheWAS and colocalization

For the single (Bonferroni-corrected) genome-wide significant lead variant, rs9367106, we used the GTEx portal (https://gtexportal.org/)^22,23 to understand whether this variant had any tissue-specific effects on gene expression. As rs9367106 was not available in the GTEx database, we first identified a proxy variant, rs12660421 (r² = 0.90) using all individuals from the 1000 Genomes Project^20,21 and then performed a lookup in the portal’s GTEx v8 dataset²³.

To identify other phenotypes associated with rs9367106, we used the Biobank Japan PheWeb portal (https://pheweb.jp/)⁹ to perform a phenome-wide association analysis, as the MAF of rs9367106 is highest in East Asia. Furthermore, we explored variant and locus-level associations in Estonian Biobank, FinnGen and Open Targets.

To assess whether the FOXP4 association is shared between long COVID, and tissue-specific eQTLs, lung cancer and COVID-19 hospitalization, we extracted a 1-Mb region centered on rs9367107 (chr6: 41,015,652–42,015,652) from the lung cancer and COVID-19 hospitalization summary statistics and the GTEx v8 data and performed colocalization analyses using the R package coloc (v5.1.0.1)^61,62 in R v4.2.2. Colocalization locus zoom plots were created using the LocusCompareR R package v1.0.0 (ref. ⁶³), with LD r² estimated using 1000 Genomes European-ancestry individuals^20,21.

Genetic correlation and MR

We assessed the genetic overlap and causal associations between long COVID outcomes and the same set of risk factors, biomarkers and disease liabilities as in the COVID-19 HGI flagship paper¹⁶. Additionally, we tested the overlap and causal impact of COVID-19 susceptibility and hospitalization risk. Genetic correlations were assessed using Linkage Disequilibrium Score Regression v1.0.1 (ref. ⁶⁴). Where there were sufficient genome-wide significant variants, the causal impact was tested in a two-sample MR framework using the TwoSampleMR (v0.5.6) R package⁶⁵ with R v4.0.3. To avoid sample overlap between exposure GWASs (here COVID-19 hospitalization and SARS-CoV-2 reported infection) and outcome GWASs (here long COVID phenotypes), we performed meta-analyses of COVID-19 hospitalization and SARS-CoV-2 reported infection using data freeze 7 of the COVID-19 HGI by excluding studies that participated in the long COVID (data freeze 4) effort. Independent significant exposure variants with P ≤ 5 × 10⁻⁸ were identified by LD-clumping the full set of summary statistics using an LD r² threshold of 0.001 (based on the 1000 Genomes European-ancestry reference samples^20,21) and a 10-Mb clumping window. For each exposure–outcome pair, these variants were then harmonized to remove variants with mismatched alleles and ambiguous palindromic variants (MAF > 45%). Fixed-effects IVW meta-analysis was used as the primary MR method, with MR–Egger, weighted median estimator, weighted mode-based estimator and MR-PRESSO used in sensitivity analyses. Heterogeneity was assessed using the MR-PRESSO global test and pleiotropy using the MR–Egger intercept. The genetic correlation and MR analyses were implemented as a Snakemake Workflow made available at https://github.com/marcoralab/MRcovid. Leave-one-variant-out-MR and European-only long COVID analyses were run as sensitivity analyses to test the robustness of MR results with COVID hospitalization as exposure and long COVID as outcome.

Summaries of the exposure GWAS are provided in Supplementary Table 26, and the association statistics for all exposure variants are provided in Supplementary Data.

Bayesian clustering of effects based on linear relationships

We compared effect size estimates between long COVID and COVID severity, and similarly, between long COVID and SARS-CoV-2 infection. COVID-19 hospitalization was used as a proxy for severity. For this purpose, we selected those variants that had earlier association evidence at the genome-wide significant level for COVID-19 severity or SARS-CoV-2 infection and examined whether these variants had joint or higher effect than expected for long COVID. The linemodels R package was utilized for comparing linear relationships (https://github.com/mjpirinen/linemodels)⁶⁶. This line model method performs probabilistic clustering of variables based on their observed effect sizes on two outcomes (Supplementary Note).

Statistics and reproducibility

To maximize the statistical power for detecting genetic variants associated with long COVID, we used data from as many cohorts as possible with information on long COVID and study participants without long COVID. Moreover, to ensure reproducibility, we examined the robustness and replication of the signal across nine independent cohorts that joined the Long COVID HGI after data freeze 4 where the association was initially discovered.

For additional methodological details, see Supplementary Note.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

We have made the results of these GWAS meta-analyses publicly available for variants passing post-meta-analysis filtering for MAF ≥ 1% and effective sample size >1/3 of the maximum effective sample size for each meta-analysis. The results from the four meta-analyses have been deposited to GWAS Catalog⁶⁷ and LocusZoom⁶⁸, where the associations can be visually explored and the summary statistics exported for further scientific discovery.

Strict case definition (long COVID after test-verified SARS-CoV-2 infection) versus broad control definition (population control):

https://www.ebi.ac.uk/gwas/studies/GCST90454540

https://my.locuszoom.org/gwas/192226/

Broad case definition (long COVID after any SARS-CoV-2 infection) versus broad control definition:

https://www.ebi.ac.uk/gwas/studies/GCST90454541

https://my.locuszoom.org/gwas/826733/

Strict case definition versus strict control definition (individuals that had SARS-CoV-2 but did not develop long COVID):

https://www.ebi.ac.uk/gwas/studies/GCST90454542

https://my.locuszoom.org/gwas/793752/

Broad case definition versus strict control definition:

https://www.ebi.ac.uk/gwas/studies/GCST90454543

https://my.locuszoom.org/gwas/91854/

Code availability

Instructions and example code for phenotyping, sample collection, genotyping, genotype and sample quality control, imputation and association analyses are shared in our central analysis plan (https://github.com/long-covid-hg/LongCovidTools/blob/main/COVID19HostGenetics_AnalysisPlan_LongCOVID_v1.docx, https://github.com/long-covid-hg/LongCovidTools/blob/main/PhenotypeDefinitions_LongCOVID_v1.docx). Furthermore, we have used GitHub public repositories for providing code for GWAS summary statistics lift-over and meta-analyses (https://github.com/long-covid-hg/META_ANALYSIS, modified from the previously published COVID-19 HGI pipeline^15,16), for PCA projecting and plotting (https://github.com/long-covid-hg/pca_projection) and for MR and genetic correlation (https://github.com/marcoralab/MRcovid). Code used for fine mapping (https://github.com/mkanai/slalom)²⁷ and Bayesian clustering of effects based on linear relationships (https://github.com/mjpirinen/linemodels)⁶⁶ is also publicly available and has been previously published.

References

Soriano, J. B., Murthy, S., Marshall, J. C., Relan, P. & Diaz, J. V. A clinical case definition of post-COVID-19 condition by a Delphi consensus. Lancet Infect. Dis. 22, e102–e107 (2022).
Article CAS PubMed Google Scholar
Desai, A. D., Lavelle, M., Boursiquot, B. C. & Wan, E. Y. Long-term complications of COVID-19. Am. J. Physiol. Cell Physiol. 322, C1–C11 (2022).
Article CAS PubMed Google Scholar
Mehandru, S. & Merad, M. Pathological sequelae of long-haul COVID. Nat. Immunol. 23, 194–202 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hugon, J., Msika, E.-F., Queneau, M., Farid, K. & Paquet, C. Long COVID: cognitive complaints (brain fog) and dysfunction of the cingulate cortex. J. Neurol. 269, 44–46 (2022).
Article CAS PubMed Google Scholar
Ceban, F. et al. Fatigue and cognitive impairment in post-COVID-19 syndrome: a systematic review and meta-analysis. Brain Behav. Immun. 101, 93–135 (2022).
Article CAS PubMed Google Scholar
Sykes, D. L. et al. Post-COVID-19 symptom burden: what is long-COVID and how should we manage it? Lung 199, 113–119 (2021).
Article CAS PubMed PubMed Central Google Scholar
Davis, H. E., McCorkell, L., Vogel, J. M. & Topol, E. J. Long COVID: major findings, mechanisms and recommendations. Nat. Rev. Microbiol. 21, 133–146 (2023).
Article CAS PubMed PubMed Central Google Scholar
Global Burden of Disease Long COVID Collaborators. et al. Estimated global proportions of individuals with persistent fatigue, cognitive, and respiratory symptom clusters following symptomatic COVID-19 in 2020 and 2021. JAMA 328, 1604–1615 (2022).
Article PubMed Central Google Scholar
Mizrahi, B. et al. Long COVID outcomes at one year after mild SARS-CoV-2 infection: nationwide cohort study. BMJ 380, e072529 (2023).
Article PubMed Google Scholar
Wong, A. C. et al. Serotonin reduction in post-acute sequelae of viral infection. Cell 186, 4851–4867 (2023).
Article CAS PubMed PubMed Central Google Scholar
Appelman, B. et al. Muscle abnormalities worsen after post-exertional malaise in long COVID. Nat. Commun. 15, 17 (2024).
Article CAS PubMed PubMed Central Google Scholar
Cervia-Hasler, C. et al. Persistent complement dysregulation with signs of thromboinflammation in active long COVID. Science 383, eadg7942 (2024).
Article CAS PubMed Google Scholar
The COVID-19 Host Genetics Initiative The COVID-19 Host Genetics Initiative, a global initiative to elucidate the role of host genetic factors in susceptibility and severity of the SARS-CoV-2 virus pandemic. Eur. J. Hum. Genet. 28, 715–718 (2020).
Nakanishi, T. et al. Age-dependent impact of the major common genetic risk factor for COVID-19 on severity and mortality. J. Clin. Invest. 131, e152386 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kanai, M. et al. A second update on mapping the human genetic architecture of COVID-19. Nature 621, E7–E26 (2023).
Article Google Scholar
COVID-19 Host Genetics Initiative Mapping the human genetic architecture of COVID-19. Nature 600, 472–477 (2021).
Article Google Scholar
Ellinghaus, D. et al. Genomewide association study of severe COVID-19 with respiratory failure. N. Engl. J. Med. 383, 1522–1534 (2020).
Article CAS PubMed Google Scholar
Pairo-Castineira, E. et al. Genetic mechanisms of critical illness in COVID-19. Nature 591, 92–98 (2021).
Article PubMed Google Scholar
Bergström, A. et al. Insights into human genetic variation and population history from 929 diverse genomes. Science 367, eaay5012 (2020).
Article PubMed PubMed Central Google Scholar
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article PubMed Google Scholar
Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016).
Article CAS PubMed PubMed Central Google Scholar
GTEx Consortium The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
GTEx Consortium The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Article Google Scholar
D’Antonio, M. et al. SARS-CoV-2 susceptibility and COVID-19 disease severity are associated with genetic variants affecting gene expression in a variety of tissues. Cell Rep. 37, 110020 (2021).
Article PubMed PubMed Central Google Scholar
Tabula Sapiens Consortium The Tabula Sapiens: a multiple-organ, single-cell transcriptomic atlas of humans. Science 376, eabl4896 (2022).
Mason, R. J. Biology of alveolar type II cells. Respirology 11, S12–S15 (2006).
Article PubMed Google Scholar
Kanai, M. et al. Meta-analysis fine-mapping is often miscalibrated at single-variant resolution. Cell Genom. 2, 100210 (2022).
Article CAS PubMed PubMed Central Google Scholar
Boyle, A. P. et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 22, 1790–1797 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dong, S. et al. Annotating and prioritizing human non-coding variants with RegulomeDB v.2. Nat. Genet. 55, 724–726 (2023).
Article CAS PubMed PubMed Central Google Scholar
ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome Nature 489, 57–74 (2012).
Huang, D. et al. VannoPortal: multiscale functional annotation of human genetic variants for interrogating molecular mechanism of traits and diseases. Nucleic Acids Res. 50, D1408–D1416 (2022).
Article CAS PubMed Google Scholar
Nagai, A. et al. Overview of the BioBank Japan Project: study design and profile. J. Epidemiol. 27, S2–S8 (2017).
Article PubMed PubMed Central Google Scholar
Dai, J. et al. Identification of risk loci and a polygenic risk score for lung cancer: a large-scale prospective cohort study in Chinese populations. Lancet Respir. Med. 7, 881–891 (2019).
Article PubMed PubMed Central Google Scholar
Machiela, M. J. & Chanock, S. J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 31, 3555–3557 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wang, Z. et al. Meta-analysis of genome-wide association studies identifies multiple lung cancer susceptibility loci in never-smoking Asian women. Hum. Mol. Genet. 25, 620–629 (2016).
Article CAS PubMed PubMed Central Google Scholar
COVID-19 Host Genetics Initiative A first update on mapping the human genetic architecture of COVID-19. Nature 608, E1–E10 (2022).
Article Google Scholar
Sudre, C. H. et al. Attributes and predictors of long COVID. Nat. Med. 27, 626–631 (2021).
Article CAS PubMed PubMed Central Google Scholar
Subramanian, A. et al. Symptoms and risk factors for long COVID in non-hospitalized adults. Nat. Med. 28, 1706–1714 (2022).
Article CAS PubMed PubMed Central Google Scholar
Resendez, S. et al. Defining the subtypes of long COVID and risk factors for prolonged disease: population-based case-crossover study. JMIR Public Health Surveill. 10, e49841 (2024).
Article PubMed PubMed Central Google Scholar
Tsampasian, V. et al. Risk factors associated with post-COVID-19 condition: a systematic review and meta-analysis. JAMA Intern. Med. 183, 566–580 (2023).
Article PubMed PubMed Central Google Scholar
Al-Aly, Z., Bowe, B. & Xie, Y. Long COVID after breakthrough SARS-CoV-2 infection. Nat. Med. 28, 1461–1467 (2022).
Article CAS PubMed PubMed Central Google Scholar
Antonelli, M. et al. Risk factors and disease profile of post-vaccination SARS-CoV-2 infection in UK users of the COVID Symptom Study app: a prospective, community-based, nested, case-control study. Lancet Infect. Dis. 22, 43–55 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ayoubkhani, D. et al. Trajectory of long covid symptoms after covid-19 vaccination: community based cohort study. BMJ 377, e069676 (2022).
Article PubMed Google Scholar
Du, M., Ma, Y., Deng, J., Liu, M. & Liu, J. Comparison of long COVID-19 caused by different SARS-CoV-2 strains: a systematic review and meta-analysis. Int. J. Environ. Res. Public Health 19, 16010 (2022).
Article PubMed PubMed Central Google Scholar
Lu, M. M., Li, S., Yang, H. & Morrisey, E. E. Foxp4: a novel member of the Foxp subfamily of winged-helix genes co-expressed with Foxp1 and Foxp2 in pulmonary and gut tissues. Mech. Dev. 119, S197–S202 (2002).
Article PubMed Google Scholar
Takahashi, K., Liu, F.-C., Hirokawa, K. & Takahashi, H. Expression of Foxp4 in the developing and adult rat forebrain. J. Neurosci. Res. 86, 3106–3116 (2008).
Article CAS PubMed Google Scholar
Uhlén, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article PubMed Google Scholar
Schmiedel, B. J. et al. Impact of genetic polymorphisms on human immune cell gene expression. Cell 175, 1701–1715.e16 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wiehagen, K. R. et al. Foxp4 is dispensable for T cell development, but required for robust recall responses. PLoS ONE 7, e42273 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, S. et al. Foxp transcription factors suppress a non-pulmonary gene expression program to permit proper lung development. Dev. Biol. 416, 338–346 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, S. et al. Foxp1/4 control epithelial cell fate during lung development and regeneration through regulation of anterior gradient 2. Development 139, 2500–2509 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chen, Y. et al. Downregulation of microRNA‑423‑5p suppresses TGF‑β1‑induced EMT by targeting FOXP4 in airway fibrosis. Mol. Med. Rep. 26, 242 (2022).
Article CAS PubMed PubMed Central Google Scholar
Yang, T. et al. FOXP4 modulates tumor growth and independently associates with miR-138 in non-small cell lung cancer cells. Tumour Biol. 36, 8185–8191 (2015).
Article CAS PubMed Google Scholar
Castanares-Zapatero, D. et al. Pathophysiology and mechanism of long COVID: a comprehensive review. Ann. Med. 54, 1473–1487 (2022).
Article CAS PubMed PubMed Central Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cunningham, F. et al. Ensembl 2022. Nucleic Acids Res. 50, D988–D995 (2022).
Article CAS PubMed Google Scholar
Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097–1103 (2021).
Article CAS PubMed Google Scholar
Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central Google Scholar
Neupane, B., Loeb, M., Anand, S. S. & Beyene, J. Meta-analysis of genetic association studies under heterogeneity. Eur. J. Hum. Genet. 20, 1174–1181 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B Stat. Methodol. 82, 1273–1300 (2020).
Article PubMed PubMed Central Google Scholar
Wallace, C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet. 17, e1009440 (2021).
Article CAS PubMed PubMed Central Google Scholar
Liu, B., Gloudemans, M. J., Rao, A. S., Ingelsson, E. & Montgomery, S. B. Abundant associations with gene expression complicate GWAS follow-up. Nat. Genet. 51, 768–769 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hemani, G., Tilling, K. & Davey Smith, G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. 13, e1007081 (2017).
Article PubMed PubMed Central Google Scholar
Pirinen, M. linemodels: clustering effects based on linear relationships. Bioinformatics 39, btad115 (2023).
Article CAS PubMed PubMed Central Google Scholar
Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).
Article CAS PubMed Google Scholar
Boughton, A. P. et al. LocusZoom.js: interactive and embeddable visualization of genetic association study results. Bioinformatics 37, 3017–3018 (2021).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We are extremely grateful to all the participants, healthcare professionals, interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers and everyone participating in making possible the collection and analysis of datasets contributing to this study. We acknowledge the funding and research infrastructure support in Supplementary Note (see also the full Long COVID HGI author information in Supplementary Table 2).

Funding

Open access funding provided by Max Planck Society.

Author information

These authors contributed equally: Vilma Lammi, Tomoko Nakanishi, Samuel E. Jones.
These authors jointly supervised this work: Hugo Zeberg, Hanna M. Ollila.
Lists of members and their affiliations appears in the Supplementary Information.

Authors and Affiliations

Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
Vilma Lammi, Tomoko Nakanishi, Samuel E. Jones, Juha Karjalainen, Martin Broberg, Hele H. Haapaniemi, Matti Pirinen, Mari E. K. Niemi, Mattia Cordioli, Mark J. Daly, Andrea Ganna & Hanna M. Ollila
Department of Human Genetics, McGill University, Montreal, Quebec, Canada
Tomoko Nakanishi & J. Brent Richards
Centre for Clinical Epidemiology, Department of Medicine, Lady Davis Institute, Jewish General Hospital, McGill University, Montreal, Quebec, Canada
Tomoko Nakanishi & J. Brent Richards
Kyoto-McGill International Collaborative Program in Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Tomoko Nakanishi
Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan
Tomoko Nakanishi & Yukinori Okada
Research Fellow, Japan Society for the Promotion of Science, Tokyo, Japan
Tomoko Nakanishi
Department of Psychiatry and Behavioral Sciences, University of California San Francisco, San Francisco, CA, USA
Shea J. Andrews
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Juha Karjalainen
Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Juha Karjalainen
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Juha Karjalainen
Genomes for Life-GCAT Lab, CORE Program, Germans Trias i Pujol Research Institute (IGTP), Badalona, Spain
Beatriz Cortés, Rafael de Cid, Susana Iraola-Guzmán, Natalia Blay, Xavier Farré & Rafael de Cid
Grup de REcerca en Impacte de les Malalties Cròniques i les seves Trajectòries (GRIMTra), Barcelona, Spain
Beatriz Cortés, Rafael de Cid, Susana Iraola-Guzmán, Natalia Blay, Xavier Farré & Rafael de Cid
Sano Genetics Limited, London, UK
Heath E. O’Brien, Thompson Hannah & Patrick J. Short
Unidad de Biología Molecular y Medicina Genómica, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
Ana Ochoa-Guzman & Michelle Duran-Gomez
Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Brian E. Fulton-Howard
Broad Institute, Cambridge, MA, USA
Masahiro Kanai, Mark J. Daly & Andrea Ganna
Analytical and Translational Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Masahiro Kanai, Mark J. Daly & Andrea Ganna
Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
Matti Pirinen & Mari E. K. Niemi
Department of Public Health, University of Helsinki, Helsinki, Finland
Matti Pirinen & Mari E. K. Niemi
Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
Axel Schmidt, Daniella Balla, Julia Heggemann, Sonja Schultz, Pari Behzad, Markus M. Nöthen, Abigail Miller & Kerstin U. Ludwig
MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
Ruth E. Mitchell, George Davey Smith & Nicholas J. Timpson
Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
Ruth E. Mitchell, George Davey Smith & Nicholas J. Timpson
Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
Abdou Mousas, Evangelos Evangelou, Evangelia Ntzani & Konstantinos K. Tsilidis
Department of Twin Research, King’s College London, London, UK
Massimo Mangino & J. Brent Richards
Program in Metabolism and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Alicia Huerta-Chagoya
Center for Genomic Medicine and Diabetes Unit, Endocrine Division, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
Alicia Huerta-Chagoya
Departamento de Medicina Genómica y Toxicología Ambiental, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Mexico City, Mexico
Alicia Huerta-Chagoya
Unidad de Biología Molecular y Medicina Genómica, Instituto Nacional de Ciencias Médicas y Nutrición, Mexico City, Mexico
Alicia Huerta-Chagoya
Herbold Computational Biology Program, Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
Nasa Sinnott-Armstrong
Department of Genome Sciences, University of Washington, Seattle, WA, USA
Nasa Sinnott-Armstrong
Finnish Institute of Molecular Medicine, University of Helsinki, Helsinki, Finland
Nasa Sinnott-Armstrong
Helix, San Mateo, CA, USA
Elizabeth T. Cirulli, Francisco Tanudjaja, Efren Sandoval, Nicole L. Washington, Simon White, Alexandre Bolze & Kelly M. Schiabor Barrett
Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen, Norway
Marc Vaudel
Department of Genetics and Bioinformatics, Health Data and Digitalization, Norwegian Institute of Public Health, Oslo, Norway
Marc Vaudel
Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
Marc Vaudel
Centre for Clinical Brain Sciences, Division of Psychiatry, University of Edinburgh, Edinburgh, UK
Alex S. F. Kwong
Department of Genetics and Genomics, Mydnavar, Southfield, MI, USA
Amit K. Maiti
University of Helsinki, Helsinki, Finland
Minttu M. Marttila
Helsinki University Central Hospital, Helsinki, Finland
Minttu M. Marttila
VA Boston Healthcare System, Boston, MA, USA
Daniel C. Posner
Data Science and Learning, Argonne National Laboratory, Lemont, IL, USA
Alexis A. Rodriguez
Department of Population Health Sciences, University of Leicester, Leicester, UK
Chiara Batini, Beatriz Guillen-Guio, Olivia C. Leavy, Anna L. Guyatt, Catherine John & Louise V. Wain
University Hospitals of Leicester NHS Trust, Leicester, UK
Chiara Batini
Institute for Biomedical Technologies—National Research Council, Segrate, Italy
Francesca Minnai
Department of Medical Biotechnology and Translational Medicine (BioMeTra), Università degli Studi di Milano, Milan, Italy
Francesca Minnai
Institute for Social and Economic Research, University of Essex, Colchester, UK
Anna R. Dearman, Benedict Hignell & Meena Kumari
Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
C. A. Robert Warmerdam & Lude H. Franke
Oncode Investigator, Utrecht, the Netherlands
C. A. Robert Warmerdam & Lude H. Franke
Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Celia B. Sequeros, Søren Brunak, David Westergaard & Karina Banasik
Department of Genetic Epidemiology, University of Regensburg, Regensburg, Germany
Thomas W. Winkler & Iris M. Heid
Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Daniel M. Jordan, Ryan C. Thompson, Alexander W. Charney, Laura G. Sloofman, Nicole W. Simons & Noam D. Beckmann
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Daniel M. Jordan, Ryan C. Thompson, Laura G. Sloofman & Nicole W. Simons
Latvian Biomedical Research and Study Centre, Riga, Latvia
Raimonds Rešcenko, Vita Rovite, Peculis Raitis, Monta Briviba, Janis Klovinš & Laura Ansone
Università degli Studi di Milano, Milan, Italy
Lorenzo Miano, Flora Peyvandi, Francesco Malvestiti, Nicola Montano, Alessandra Bandera & Francesco Bruno Arturo Blasi
Brigham and Women’s Hospital Division of Sleep and Circadian Disorders, Boston, MA, USA
Jacqueline M. Lane & Jakob M. Cherry
Massachusetts General Hospital, Center for Genomic Medicine, Boston, MA, USA
Jacqueline M. Lane
Broad Institute, Molecular and Population Genetics Program, Cambridge, MA, USA
Jacqueline M. Lane
Center for Computational Biology, University of California Berkeley, Berkeley, CA, USA
Ryan K. Chung
The Institute for Lung Health, NIHR Leicester Biomedical Research Centre, University of Leicester, Leicester, UK
Beatriz Guillen-Guio, Olivia C. Leavy & Louise V. Wain
Departamento de Oncología Básico Clínica, Facultad de Medicina, Universidad de Chile, Santiago, Chile
Laura Carvajal-Silva, Kevin Aguilar-Valdés, Leslie C. Cerpa, Tamara V. Arévalo, Eduardo Lamoza, Alicia Colombo & Ricardo A. Verdugo
Mount Sinai Hospital, Sinai Health, Toronto, Ontario, Canada
Erika Frangione, Xu Xinyi, Jennifer Taher & Jordan Lerner-Ellis
Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
Lindsay Guare & Shefali S. Verma
Genotek Ltd, Moscow, Russia
Ekaterina Vergasova, Michil Trofimov, Layal Shaheen, Nikolay Plotnikov, Anna Kim, Dmitrii Kharitonov, Alexei Kamelin & Alexander Rakitko
William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
Eirini Marouli
IRCCS G Gaslini, Genoa, Italy
Pasquale Striano
Department of Internal Medicine, Kulliyyah of Medicine, International Islamic University Malaysia, Pahang, Malaysia
Ummu Afeera Zainulabid
Department of Anatomy, All India Institute of Medical Sciences—Patna, Patna, India
Ashutosh Kumar
Faculty of Industrial Sciences and Technology, Universiti Malaysia Pahang Al Sultan Abdullah, Pahang, Malaysia
Hajar Fauzan Ahmad
Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
Ryuya Edahiro & Yukinori Okada
Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, Japan
Ryuya Edahiro
Division of Pulmonary Medicine, Department of Medicine, Keio University School of Medicine, Tokyo, Japan
Shuhei Azekawa & Makoto Ishii
Department of Respiratory Medicine, Nagoya University Graduate School of Medicine, Nagoya, Japan
Shuhei Azekawa & Makoto Ishii
VA Portland Health Care System, Portland, Portland, OR, USA
Kelly Cho & Shiuh-Wen Luoh
Division of Hematology and Medical Oncology, Knight Cancer Institute, Oregon Health and Science University, Portland, OR, USA
Shiuh-Wen Luoh
Department of Clinical Immunology, Aarhus University Hospital, Aarhus, Denmark
Christian Erikstrup
Department of Clinical Immunology, Zealand University Hospital—Køge, Køge, Denmark
Ole B. V. Pedersen
University of Toronto, Toronto, Ontario, Canada
Jennifer Taher & Jordan Lerner-Ellis
Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
Jordan Lerner-Ellis
Departamento de Anatomía Patológica, Facultad de Medicina, Universidad de Chile, Santiago, Chile
Pamela Bocchieri, Iskra A. Signore & Alicia Colombo
Servicio de Anatomía Patológica, Hospital Clínico de la Universidad de Chile, Santiago, Chile
Alicia Colombo
Department of Internal Medicine, University of Nevada Reno, School of Medicine, Reno, NV, USA
Joseph J. Grzymski
Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Yukinori Okada
Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
Yukinori Okada
Division of Data Driven Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Noam D. Beckmann
Institute of Medical Microbiology and Hygiene, Molecular Microbiology (Virology), University of Regensburg, Regensburg, Germany
Ralf Wagner
Institute of Clinical Microbiology and Hygiene, University Hospital Regensburg, Regensburg, Germany
Ralf Wagner
Leicester National Institute for Health and Care Research, Biomedical Research Centre, Glenfield Hospital, Leicester, UK
Anna L. Guyatt & Catherine John
Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
Per Magnus
Department of Pathophysiology and Transplantation, Università degli Studi di Milano, Milan, Italy
Luca V. C. Valenti
Biological Resource Center, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy
Luca V. C. Valenti
Department of Medicine, Division of HIV, Infectious Diseases and Global Medicine, University of California, San Francisco, CA, USA
Maria Sophia Donaire, Sannidhi Sarvadhavabhatla & Sulggi A. Lee
Instituto de Investigación Interdisciplinaria y Facultad de Medicina, Universidad de Talca, Talca, Chile
Ricardo A. Verdugo
Statens Serum Institute, Copenhagen, Denmark
Bjarke Feenstra & Frank Geller
Department of Endocrinology, Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Tom Hemming Karlsen & Emma L. Duncan
Department of Twin Research and Genetic Epidemiology, King’s College London, London, UK
Emma L. Duncan
Medical Genetics, University of Siena, Siena, Italy
Margherita Baldassarri & Alessandra Renieri
Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Siena, Italy
Margherita Baldassarri & Alessandra Renieri
Genetica Medica, Azienda Ospedaliero-Universitaria Senese, Siena, Italy
Margherita Baldassarri & Alessandra Renieri
Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK
Evangelos Evangelou & Konstantinos K. Tsilidis
Digestive Oncology Research Center, Digestive Disease Research Institute, Shariati Hospital, Tehran University of Medical Sciences, Tehran, Iran
Bahareh Sharififard & Ahmadreza Niavarani
Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia
Arne Kukkonen & Erik Abner
Instituto de Investigaciones Biomédicas, UNAM, Mexico City, Mexico
Teresa Tusié-Luna
Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
Teresa Tusié-Luna
Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA
Ravi K. Madduri
Department of Medicine, Harvard Medical School and Mass General Brigham, Boston, MA, USA
Kelly Cho & Kelly Cho
Department of Psychiatry, University of Munich, Munich, Germany
Eva C. Schulte
Institute of Human Genetics, University Hospital, Faculty of Medicine, University of Bonn, Bonn, Germany
Eva C. Schulte
Institute of Virology, Technical University of Munich/Helmholtz Munich, Munich, Germany
Ulrike Protzer & Eva C. Schulte
Institute of Psychiatric Phenomics and Genomics, University of Munich, Munich, Germany
Eva C. Schulte
Department of Psychiatry, University Hospital, Faculty of Medicine, University of Bonn, Bonn, Germany
Eva C. Schulte
5 Prime Sciences Inc, Montreal, Quebec, Canada
Vince Forgetta & J. Brent Richards
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada
J. Brent Richards & Michael Marks-Hultström
Lady Davis Institute of Medical Research, Jewish General Hospital, McGill University, Montreal, Quebec, Canada
Michael Marks-Hultström
Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
Ewa Wallin, Robert Frithiof, Miklos Lipcsey, Ing-Marie Larsson & Michael Marks-Hultström
Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Tomislav Maricic & Hugo Zeberg
Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
Hugo Zeberg
Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Richa Saxena, Matthew Maher & Hanna M. Ollila
Anesthesia, Critical Care, and Pain Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
Richa Saxena, Matthew Maher & Hanna M. Ollila
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Richa Saxena, Matthew Maher & Hanna M. Ollila
Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University and Department of Human Genetics, McGill University, Montreal, Quebec, Canada
Janick St-Cyr, David Bujold, Guillaume Bourque, Ariane Boisclair, Daniel Auld, Solomia Yanishevsky, G. Mark Lathrop, Jiannis Ragoussis, Danielle Perley & Vincent Mooser
Lady Davis Institute, Jewish General Hospital, McGill University, Montreal, Quebec, Canada
Darin Adra, Laetitia Laurent, Fangyi Shi & David R. Morrison
Research Centre of the Centre Hospitalier de l’Université de Montréal (CRCHUM), Montreal, Quebec, Canada
Madeleine Durand
Centre hospitalier de l’Université de Montréal (CHUM), Montreal, Quebec, Canada
Madeleine Durand
Institut universitaire de cardiologie et de pneumologie de Québec, Université Laval, Quebec, Quebec, Canada
Mylene Bertrand
The Meakins-Christie Laboratories at the Research Institute of the McGill University Heath, Centre Research Institute, and Department of Medicine, Faculty of Medicine, McGill University, Montreal, Quebec, Canada
Simon Rousseau
Department of Psychiatry and Psychotherapy, University of Bonn, Bonn, Germany
Max C. Pensel
Center for Human Genetics, University Hospital of Marburg, Marburg, Germany
Carlo Maj
Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Tianxi Cai
Louis Stokes Cleveland VA Medical Center, Cleveland, Ohio, USA
Sudha K. Iyengar
Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA
Sudha K. Iyengar
Instituto Nacional de Ciencias Medicas y Nutricion, Ciudad de México, Mexico
Carlos A. Aguilar Salinas
Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA
Seung Hyuk T. Lee & Päivi Pajukanta
Universidad Autonoma Metropolitana, Mexico City, Mexico
Hortensia Moreno-Macias
Institute for Precision Health, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA
Päivi Pajukanta
Division of Infection Control, Norwegian Institute of Public Health, Oslo, Norway
Lill Trogstad
Department of Medicine, Department of Genetics, Division of Translational Medicine and Human Genetics, Institute for Translational Medicine and Therapeutics, University of Pennsylvania, Philadelphia, PA, USA
Daniel J. Rader
Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
Marylyn D. Ritchie
Department of Medicine, Division of Translational Medicine and Human Genetics, Institute for Translational Medicine and Therapeutics, University of Pennsylvania, Philadelphia, PA, USA
Anurag Verma & Colleen M. Kripke
Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
Sergi Papiol, Thomas G. Schulze, Fanny Senner, Janos L. Kalman, Urs Heilbronner & Monika Budde
Max-Planck Institute of Psychiatry, Munich, Germany
Sergi Papiol
Department of Psychiatry and Psychotherapy, University Medical Center Goettingen, Goettingen, Germany
Jens Wiltfang
German Center for Neurodegenerative Diseases (DZNE), Goettingen, Germany
Jens Wiltfang
Neurosciences and Signaling Group, Institute of Biomedicine (iBiMED), Department of Medical Sciences, University of Aveiro, Aveiro, Portugal
Jens Wiltfang
Department of Internal Medicine II, University Hospital rechts der Isar, Technical University of Munich, School of Medicine, Munich, Germany
Jochen Schneider, Christoph D. Spinner & Johanna Erber
Department of Psychiatry and Behavioral Sciences, Johns Hopkins University, Baltimore, MD, USA
Thomas G. Schulze
Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
Thomas G. Schulze
Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
Thomas G. Schulze
Institute of Clinical Chemistry and Pathobiochemistry, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
Christof Winter
TranslaTUM, Center for Translational Cancer Research, Technical University of Munich, Munich, Germany
Christof Winter
German Center for Infection Research (DZIF), Munich Site, Braunschweig, Germany
Ulrike Protzer
Institute of Computational Biology, Helmholtz Center Munich, Oberschleissheim, Germany
Nikola S. Mueller
Department of Psychosomatic Medicine and Psychotherapy, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
Andreas Dinkel
Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
Janos L. Kalman
Institute of Psychiatric Phenomics and Genomics (IPPG), LMU University Hospital, LMU Munich, Munich, Germany
Kristina Adorjan
Department of Psychiatry and Psychotherapy, University of Bern, Bern, Switzerland
Kristina Adorjan
Department of Internal Medicine II, Klinikum Rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
Lisa Fricke
Department of Twin Research and Epidemiology, King’s College London, London, UK
Nicholas R. Harvey
Genómica Evolutiva y Médica de Magallanes (GEMMa), Centro Asistencial, Docente e Investigación (CADI-UMAG), Punta Arenas, Chile
Yolanda Espinosa-Parrilla, Daniela Zapata-Contreras & Paula Zuñiga-Pacheco
Escuela de Medicina, Universidad de Magallanes, Punta Arenas, Chile
Yolanda Espinosa-Parrilla, Daniela Zapata-Contreras & Paula Zuñiga-Pacheco
Interuniversity Center for Healthy Aging, Santiago, Chile
Yolanda Espinosa-Parrilla
Departamento de Ciencias de la Computación, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Santiago, Chile
Juan M. Saez Hidalgo
Molecular and Translational Immunology Laboratory, Department of Clinical Biochemistry and Immunology, Pharmacy Faculty, University of Concepción, Concepción, Chile
Estefania Nova-Lamperti, Camilo Cabrera, Romina Quiroga & Sergio Sanhueza
Departamento de Tecnología Médica, Facultad de Ciencias de la Salud, Universidad de Antofagasta, Antofagasta, Chile
Scarlett Gutiérrez-Richards & Christian A. Muñoz
Servicio de Anatomía, Hospital Clínico de la Universidad de Chile, Santiago, Chile
Gerardo Donoso
ATACAMA OMICS, Laboratorio de Biología Molecular y Genómica, Facultad de Medicina, Universidad de Atacama, Copiapó, Chile
Cesar A. Echeverria
Departamento de Tecnología Médica, Universidad de Tarapacá, Arica, Chile
Macarena Fuentes-Guajardo
Instituto de Investigación Interdisciplinaria y Escuela de Medicina, Universidad de Talca, Talca, Chile
Karen Y. Oróstica
AUSTRAL-omics, Vicerrectoría de Investigación Desarrollo y Creación Artística, Universidad Austral de Chile, Valdivia, Chile
Alvaro Figueroa & Héctor Valenzuela-Jorquera
Unidades de Diagnóstico Fundación Arturo López Pérez, Providencia, Chile
Lissette G. Guajardo, Teresa A. Alarcon & Carolina S. Selman
Facultad de Medicina, Universidad de Atacama, Copiapó, Chile
Iskra A. Signore
Laboratorio Clínico del Área Técnica de Biología Molecular, Hospital del Salvador, Santiago, Chile
Virginia A. Monardes-Ramírez
Programa de Genética Humana del Instituto de Ciencias Biomédicas (ICBM), Facultad de Medicina, Universidad de Chile, Santiago, Chile
Eduardo A. Tobar-Calfucoy, Cristian E. Yáñez & Rocío Retamales-Ortega
Departamento de Oncología Básico Clínica, Facultad de Medicina and Departamento de Ciencias y Tecnología Farmacéutica, Universidad de Chile, Santiago, Chile
Luis A. Quiñones
Departamento de Ciencias y Tecnología Farmacéutica, Universidad de Chile, Santiago, Chile
Matías F. Martínez
AUSTRAL-omics, Vicerrectoría de Investigación Desarrollo y Creación Artística, Valdivia, Chile
Andrea X. Silva
Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
Andrea X. Silva
Department of Clinical Immunology, Copenhagen University Hospital—Rigshospitalet, Copenhagen, Denmark
Sisse R. Ostrowski
Department of Medical Endocrinology and Metabolism, Copenhagen University Hospital (Rigshospitalet), Copenhagen, Denmark
Anne Sofie B. Mortensen
ISGlobal, Hospital Clínic - Universitat de Barcelona, Barcelona, Spain
Gemma Moncunill & Carlota Dobaño
CIBER de Enfermedades Infecciosas (CIBERINFEC), Barcelona, Spain
Gemma Moncunill & Carlota Dobaño
Genomes for Life-GCAT lab, Barcelona, Spain
Alba Blasco & Anna Carreras
Germans Trias i Pujol Research Institute (IGTP), Badalona, Spain
Alba Blasco & Anna Carreras
Grup de REcerca en Impacte de les Malalties Cròniques i les seves Trajectòries (GRIMTra), (2021 SGR 01537), Badalona, Spain
Alba Blasco & Anna Carreras
ISGlobal, Barcelona, Spain
Judith Garcia-Aymerich, Manolis Kogevinas & Gemma Castaño-Vinyals
Universitat Pompeu Fabra (UPF), Barcelona, Spain
Judith Garcia-Aymerich, Manolis Kogevinas & Gemma Castaño-Vinyals
CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
Manolis Kogevinas & Gemma Castaño-Vinyals
IMIM (Hospital del Mar Medical Research Institute), Barcelona, Spain
Manolis Kogevinas
Department of Electrical, Electronic and Information Engineering ‘Guglielmo Marconi’, University of Bologna, Cesena, Italy
Simone Furini
Department of Medical Biotechnologies, Med Biotech Hub and Competence Center, University of Siena, Siena, Italy
Chiara Fallerini
Medical Genetics Unit, University of Siena, Policlinico Le Scotte, Siena, Italy
Chiara Fallerini
Med Biotech Hub and Competence Centre, Department of Medical Biotechnologies, University of Siena, Siena, Italy
Kristina Zguro
Institute for Biomedical Technologies, National Reasearch Council, Segrate, Italy
Francesca Colombo
Eligens SIA, Riga, Latvia
Anna Ilinskaya & Valery Ilinsky
University of Nevada, School of Medicine, Reno, NV, USA
Iva Neveux
Renown Health, Reno, NV, USA
Shaun Dabe
First Department of Internal Medicine and Infectious Diseases Unit, University Hospital of Ioannina, Ioannina, Greece
Eirini Christaki, Haralampos Milionis & Angelos Liontos
Biomedical Research Foundation Academy of Athens, Athens, Greece
Ioanna Tzoulaki
Center for Evidence-Based Medicine, Department of Health Services, Policy and Practice, School of Public Health, Brown University, Providence, RI, USA
Evangelia Ntzani
Department of Pulmonary and Critical Care, School of Medicine, Shariati Hospital, Tehran University of Medical Sciences, Tehran, Iran
Rasoul Aliannejad
General Intensive Care Unit, Department of Anesthesiology, School of Medicine, Shariati Hospital, Tehran University of Medical Sciences, Tehran, Iran
Vahideh Zarei
Intensive Care Unit, Department of Emergency, School of Medicine, Shariati Hospital, Tehran University of Medical Sciences, Tehran, Iran
Nastaran Soltani & Hengameh Ansari Tadi
Department of Critical Care Medicine, Noorafshar Hospital, Tehran, Iran
Ali Amirsavadkouhi
Department of Infectious Diseases, Keio University School of Medicine, Tokyo, Japan
Ho NamKoong
Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Alexander W. Charney
Simon Fraser University, Burnaby, British Columbia, Canada
Olga Vishnyakova & Lloyd T. Elliott
Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Angus C. Burns
Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Mauro Tettamanti & Alessandro Nobili
Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy
Luisa Ronzoni, Daniele Prati, Flora Peyvandi, Rossana Carpani, Antonio Muscatello, Sara Margarita, Giuseppe Lamorte, Marco Mantero, Nathalie Iannotti, Alessandra Bandera, Fabio Blandini & Francesco Bruno Arturo Blasi
University of Oslo, Oslo, Norway
Andre Franke, David Ellinghaus & Frauke Degenhardt

Authors

Vilma Lammi
View author publications
Search author on:PubMed Google Scholar
Tomoko Nakanishi
View author publications
Search author on:PubMed Google Scholar
Samuel E. Jones
View author publications
Search author on:PubMed Google Scholar
Shea J. Andrews
View author publications
Search author on:PubMed Google Scholar
Juha Karjalainen
View author publications
Search author on:PubMed Google Scholar
Beatriz Cortés
View author publications
Search author on:PubMed Google Scholar
Heath E. O’Brien
View author publications
Search author on:PubMed Google Scholar
Ana Ochoa-Guzman
View author publications
Search author on:PubMed Google Scholar
Brian E. Fulton-Howard
View author publications
Search author on:PubMed Google Scholar
Martin Broberg
View author publications
Search author on:PubMed Google Scholar
Hele H. Haapaniemi
View author publications
Search author on:PubMed Google Scholar
Masahiro Kanai
View author publications
Search author on:PubMed Google Scholar
Matti Pirinen
View author publications
Search author on:PubMed Google Scholar
Axel Schmidt
View author publications
Search author on:PubMed Google Scholar
Ruth E. Mitchell
View author publications
Search author on:PubMed Google Scholar
Abdou Mousas
View author publications
Search author on:PubMed Google Scholar
Massimo Mangino
View author publications
Search author on:PubMed Google Scholar
Alicia Huerta-Chagoya
View author publications
Search author on:PubMed Google Scholar
Nasa Sinnott-Armstrong
View author publications
Search author on:PubMed Google Scholar
Elizabeth T. Cirulli
View author publications
Search author on:PubMed Google Scholar
Marc Vaudel
View author publications
Search author on:PubMed Google Scholar
Alex S. F. Kwong
View author publications
Search author on:PubMed Google Scholar
Amit K. Maiti
View author publications
Search author on:PubMed Google Scholar
Minttu M. Marttila
View author publications
Search author on:PubMed Google Scholar
Daniel C. Posner
View author publications
Search author on:PubMed Google Scholar
Alexis A. Rodriguez
View author publications
Search author on:PubMed Google Scholar
Chiara Batini
View author publications
Search author on:PubMed Google Scholar
Francesca Minnai
View author publications
Search author on:PubMed Google Scholar
Anna R. Dearman
View author publications
Search author on:PubMed Google Scholar
C. A. Robert Warmerdam
View author publications
Search author on:PubMed Google Scholar
Celia B. Sequeros
View author publications
Search author on:PubMed Google Scholar
Thomas W. Winkler
View author publications
Search author on:PubMed Google Scholar
Daniel M. Jordan
View author publications
Search author on:PubMed Google Scholar
Raimonds Rešcenko
View author publications
Search author on:PubMed Google Scholar
Lorenzo Miano
View author publications
Search author on:PubMed Google Scholar
Jacqueline M. Lane
View author publications
Search author on:PubMed Google Scholar
Ryan K. Chung
View author publications
Search author on:PubMed Google Scholar
Beatriz Guillen-Guio
View author publications
Search author on:PubMed Google Scholar
Olivia C. Leavy
View author publications
Search author on:PubMed Google Scholar
Laura Carvajal-Silva
View author publications
Search author on:PubMed Google Scholar
Kevin Aguilar-Valdés
View author publications
Search author on:PubMed Google Scholar
Erika Frangione
View author publications
Search author on:PubMed Google Scholar
Lindsay Guare
View author publications
Search author on:PubMed Google Scholar
Ekaterina Vergasova
View author publications
Search author on:PubMed Google Scholar
Eirini Marouli
View author publications
Search author on:PubMed Google Scholar
Pasquale Striano
View author publications
Search author on:PubMed Google Scholar
Ummu Afeera Zainulabid
View author publications
Search author on:PubMed Google Scholar
Ashutosh Kumar
View author publications
Search author on:PubMed Google Scholar
Hajar Fauzan Ahmad
View author publications
Search author on:PubMed Google Scholar
Ryuya Edahiro
View author publications
Search author on:PubMed Google Scholar
Shuhei Azekawa
View author publications
Search author on:PubMed Google Scholar
Shiuh-Wen Luoh
View author publications
Search author on:PubMed Google Scholar
Christian Erikstrup
View author publications
Search author on:PubMed Google Scholar
Ole B. V. Pedersen
View author publications
Search author on:PubMed Google Scholar
Jordan Lerner-Ellis
View author publications
Search author on:PubMed Google Scholar
Alicia Colombo
View author publications
Search author on:PubMed Google Scholar
Joseph J. Grzymski
View author publications
Search author on:PubMed Google Scholar
Makoto Ishii
View author publications
Search author on:PubMed Google Scholar
Yukinori Okada
View author publications
Search author on:PubMed Google Scholar
Noam D. Beckmann
View author publications
Search author on:PubMed Google Scholar
Meena Kumari
View author publications
Search author on:PubMed Google Scholar
Ralf Wagner
View author publications
Search author on:PubMed Google Scholar
Iris M. Heid
View author publications
Search author on:PubMed Google Scholar
Catherine John
View author publications
Search author on:PubMed Google Scholar
Patrick J. Short
View author publications
Search author on:PubMed Google Scholar
Per Magnus
View author publications
Search author on:PubMed Google Scholar
Laura Ansone
View author publications
Search author on:PubMed Google Scholar
Luca V. C. Valenti
View author publications
Search author on:PubMed Google Scholar
Sulggi A. Lee
View author publications
Search author on:PubMed Google Scholar
Louise V. Wain
View author publications
Search author on:PubMed Google Scholar
Ricardo A. Verdugo
View author publications
Search author on:PubMed Google Scholar
Karina Banasik
View author publications
Search author on:PubMed Google Scholar
Frank Geller
View author publications
Search author on:PubMed Google Scholar
Lude H. Franke
View author publications
Search author on:PubMed Google Scholar
Alexander Rakitko
View author publications
Search author on:PubMed Google Scholar
Emma L. Duncan
View author publications
Search author on:PubMed Google Scholar
Alessandra Renieri
View author publications
Search author on:PubMed Google Scholar
Konstantinos K. Tsilidis
View author publications
Search author on:PubMed Google Scholar
Rafael de Cid
View author publications
Search author on:PubMed Google Scholar
Ahmadreza Niavarani
View author publications
Search author on:PubMed Google Scholar
Erik Abner
View author publications
Search author on:PubMed Google Scholar
Teresa Tusié-Luna
View author publications
Search author on:PubMed Google Scholar
Shefali S. Verma
View author publications
Search author on:PubMed Google Scholar
George Davey Smith
View author publications
Search author on:PubMed Google Scholar
Nicholas J. Timpson
View author publications
Search author on:PubMed Google Scholar
Ravi K. Madduri
View author publications
Search author on:PubMed Google Scholar
Kelly Cho
View author publications
Search author on:PubMed Google Scholar
Mark J. Daly
View author publications
Search author on:PubMed Google Scholar
Andrea Ganna
View author publications
Search author on:PubMed Google Scholar
Eva C. Schulte
View author publications
Search author on:PubMed Google Scholar
J. Brent Richards
View author publications
Search author on:PubMed Google Scholar
Kerstin U. Ludwig
View author publications
Search author on:PubMed Google Scholar
Michael Marks-Hultström
View author publications
Search author on:PubMed Google Scholar
Hugo Zeberg
View author publications
Search author on:PubMed Google Scholar
Hanna M. Ollila
View author publications
Search author on:PubMed Google Scholar

Consortia

Long COVID Host Genetics Initiative

Vilma Lammi
, Tomoko Nakanishi
, Samuel E. Jones
, Hugo Zeberg
, Hanna M. Ollila
, Shea J. Andrews
, Juha Karjalainen
, Brian E. Fulton-Howard
, Amit K. Maiti
, Minttu M. Marttila
, Eirini Marouli
, Pasquale Striano
, Ummu Afeera Zainulabid
, Ashutosh Kumar
& Hajar Fauzan Ahmad

FinnGen

Vilma Lammi
, Samuel E. Jones
, Hanna M. Ollila
, Martin Broberg
, Hele H. Haapaniemi
, Matti Pirinen
, Nasa Sinnott-Armstrong
, Mark J. Daly
, Andrea Ganna
, Mari E. K. Niemi
, Masahiro Kanai
Avon Longitudinal Study of Parents and Children (ALSPAC)
- Ruth E. Mitchell
- , Alex S. F. Kwong
- , George Davey Smith
- & Nicholas J. Timpson
Banque québécoise de la COVID-19 (BQC19)
- Tomoko Nakanishi
- , J. Brent Richards
- , Janick St-Cyr
- , Darin Adra
- , Madeleine Durand
- , David Bujold
- , Guillaume Bourque
- , Ariane Boisclair
- , Mylene Bertrand
- , Daniel Auld
- , Laetitia Laurent
- , Solomia Yanishevsky
- , G. Mark Lathrop
- , Fangyi Shi
- , Simon Rousseau
- , Jiannis Ragoussis
- , Danielle Perley
- , Vincent Mooser
- & David R. Morrison
Bonn Study of COVID Genetics (BoSCO)
- Axel Schmidt
- , Kerstin U. Ludwig
- , Daniella Balla
- , Julia Heggemann
- , Sonja Schultz
- , Pari Behzad
- , Markus M. Nöthen
- , Abigail Miller
- , Max C. Pensel
- & Carlo Maj

VA Million Veteran Program

Daniel C. Posner
, Alexis A. Rodriguez
, Shiuh-Wen Luoh
, Ravi K. Madduri
, Kelly Cho
, Tianxi Cai
& Sudha K. Iyengar

MexGen-COVID Initiative

Teresa Tusié-Luna
, Ana Ochoa-Guzman
, Alicia Huerta-Chagoya
, Carlos A. Aguilar Salinas
, Seung Hyuk T. Lee
, Hortensia Moreno-Macias
, Päivi Pajukanta
, Michelle Duran-Gomez
Norwegian Mother Father and Child Cohort Study (MoBa)
- Marc Vaudel
- , Per Magnus
- & Lill Trogstad
Penn Medicine BioBank (PMBB)
- Lindsay Guare
- , Shefali S. Verma
- , Daniel J. Rader
- , Marylyn D. Ritchie
- , Anurag Verma
- & Colleen M. Kripke
Follow-UP study of patients with critical COVID-19 (SweCovid) and COVID-19 Cohort Study of the University Hospital of the Technical University Munich (Muenchen rechts der Isar) (COMRI)
- Eva C. Schulte
- , Michael Marks-Hultström
- , Hugo Zeberg
- , Sergi Papiol
- , Jens Wiltfang
- , Jochen Schneider
- , Thomas G. Schulze
- , Christof Winter
- , Ewa Wallin
- , Robert Frithiof
- , Fanny Senner
- , Christoph D. Spinner
- , Ulrike Protzer
- , Mattia Cordioli
- , Nikola S. Mueller
- , Andreas Dinkel
- , Janos L. Kalman
- , Tomislav Maricic
- , Kristina Adorjan
- , Miklos Lipcsey
- , Lisa Fricke
- , Ing-Marie Larsson
- , Urs Heilbronner
- , Monika Budde
- & Johanna Erber
Tirschenreuth Study (TiKoCo)
- Thomas W. Winkler
- , Ralf Wagner
- & Iris M. Heid
TwinsUK
- Massimo Mangino
- , Emma L. Duncan
- & Nicholas R. Harvey
UK Biobank (UKB)
- Tomoko Nakanishi
- , J. Brent Richards
- & Vince Forgetta
UnderstandingSociety: UK Household Longitudinal Study
- Anna R. Dearman
- , Meena Kumari
- & Benedict Hignell
COVID-19 Genomics Network (C19-GenoNet)
- Ricardo A. Verdugo
- , Laura Carvajal-Silva
- , Kevin Aguilar-Valdés
- , Alicia Colombo
- , Yolanda Espinosa-Parrilla
- , Juan M. Saez Hidalgo
- , Estefania Nova-Lamperti
- , Scarlett Gutiérrez-Richards
- , Gerardo Donoso
- , Leslie C. Cerpa
- , Cesar A. Echeverria
- , Camilo Cabrera
- , Pamela Bocchieri
- , Macarena Fuentes-Guajardo
- , Christian A. Muñoz
- , Karen Y. Oróstica
- , Alvaro Figueroa
- , Lissette G. Guajardo
- , Iskra A. Signore
- , Virginia A. Monardes-Ramírez
- , Eduardo A. Tobar-Calfucoy
- , Luis A. Quiñones
- , Cristian E. Yáñez
- , Daniela Zapata-Contreras
- , Paula Zuñiga-Pacheco
- , Romina Quiroga
- , Matías F. Martínez
- , Teresa A. Alarcon
- , Andrea X. Silva
- , Carolina S. Selman
- , Sergio Sanhueza
- , Rocío Retamales-Ortega
- , Tamara V. Arévalo
- , Eduardo Lamoza
- & Héctor Valenzuela-Jorquera
COVID-19 Host Immune Response Pathogenesis Study (CHIRP)
- Ryan K. Chung
- , Sulggi A. Lee
- , Maria Sophia Donaire
- & Sannidhi Sarvadhavabhatla

DBDS Genomic Consortium

Celia B. Sequeros
, Christian Erikstrup
, Ole B. V. Pedersen
, Karina Banasik
, Frank Geller
, Sisse R. Ostrowski
, Søren Brunak
, David Westergaard
, Bjarke Feenstra
, Anne Sofie B. Mortensen
Extended Cohort for E-health, Environment and DNA (EXCEED)
- Chiara Batini
- , Louise V. Wain
- , Catherine John
- & Anna L. Guyatt
Genomes for Life (GCAT) and Cohort COVID in Catalonia (COVICAT study)
- Rafael de Cid
- , Beatriz Cortés
- , Susana Iraola-Guzmán
- , Gemma Moncunill
- , Alba Blasco
- , Judith Garcia-Aymerich
- , Natalia Blay
- , Carlota Dobaño
- , Anna Carreras
- , Xavier Farré
- , Manolis Kogevinas
- & Gemma Castaño-Vinyals

GEN-COVID Multicenter Study

Francesca Minnai
, Alessandra Renieri
, Simone Furini
, Chiara Fallerini
, Kristina Zguro
, Margherita Baldassarri
, Francesca Colombo
Genetics of Long Covid (GOLD)
- Heath E. O’Brien
- , Patrick J. Short
- & Thompson Hannah
Genotek
- Alexander Rakitko
- , Ekaterina Vergasova
- , Anna Ilinskaya
- , Michil Trofimov
- , Layal Shaheen
- , Nikolay Plotnikov
- , Anna Kim
- , Dmitrii Kharitonov
- , Valery Ilinsky
- & Alexei Kamelin
Helix–Helix Exome+ and Healthy Nevada Project COVID-19 Phenotypes
- Elizabeth T. Cirulli
- , Joseph J. Grzymski
- , Francisco Tanudjaja
- , Efren Sandoval
- , Nicole L. Washington
- , Simon White
- , Iva Neveux
- , Shaun Dabe
- , Alexandre Bolze
- & Kelly M. Schiabor Barrett
Covid-19 Ioannina Biobank
- Abdou Mousas
- , Konstantinos K. Tsilidis
- , Eirini Christaki
- , Haralampos Milionis
- , Ioanna Tzoulaki
- , Angelos Liontos
- , Evangelos Evangelou
- & Evangelia Ntzani
Genome-wide assessment of the gene variants associated with severe COVID-19 phenotype in Iran (IrCovid)
- Ahmadreza Niavarani
- , Rasoul Aliannejad
- , Vahideh Zarei
- , Nastaran Soltani
- , Bahareh Sharififard
- , Hengameh Ansari Tadi
- & Ali Amirsavadkouhi
Japan COVID-19 Task Force
- Ryuya Edahiro
- , Shuhei Azekawa
- , Makoto Ishii
- , Yukinori Okada
- , Ho NamKoong
- & Masahiro Kanai
Lifelines
- C. A. Robert Warmerdam
- & Lude H. Franke
Mount Sinai COVID Biobank (MSCIC)
- Daniel M. Jordan
- , Noam D. Beckmann
- , Ryan C. Thompson
- , Alexander W. Charney
- , Laura G. Sloofman
- & Nicole W. Simons

PHOSP-COVID Collaborative Group

Beatriz Guillen-Guio
, Olivia C. Leavy
& Louise V. Wain

GENCOV Study

Erika Frangione
, Jordan Lerner-Ellis
, Olga Vishnyakova
, Xu Xinyi
, Jennifer Taher
, Lloyd T. Elliott
Genome Database of the Latvian Population (LGDB)
- Raimonds Rešcenko
- , Laura Ansone
- , Vita Rovite
- , Peculis Raitis
- , Monta Briviba
- & Janis Klovinš
MassGeneralBrigham (MGB)
- Jacqueline M. Lane
- , Richa Saxena
- , Angus C. Burns
- , Jakob M. Cherry
- , Matthew Maher
- & Hanna M. Ollila

Estonian Biobank Research Team

Erik Abner
, Arne Kukkonen
Fondazione COVID-19 Genomic Study (FOGS)
- Lorenzo Miano
- , Luca V. C. Valenti
- , Mauro Tettamanti
- , Luisa Ronzoni
- , Daniele Prati
- , Flora Peyvandi
- , Rossana Carpani
- , Antonio Muscatello
- , Sara Margarita
- , Francesco Malvestiti
- , Giuseppe Lamorte
- , Marco Mantero
- , Andre Franke
- , David Ellinghaus
- , Nathalie Iannotti
- , Nicola Montano
- , Alessandro Nobili
- , Frauke Degenhardt
- , Alessandra Bandera
- , Fabio Blandini
- , Francesco Bruno Arturo Blasi
- & Tom Hemming Karlsen

Contributions

V.L., T.N., S.E.J., H.Z. and H.M.O. contributed to scientific leadership, project management, experimental design and conception, ethics and governance, and bioinformatics. V.L., T.N., S.E.J., H.Z., H.M.O. and the Long COVID HGI were members of the steering committee. V.L., S.E.J., T.N., H.Z., A.A.R., A.H.-C., A.M., A.N., A.R.D., A.S., A.S.F.K., B.C., B.G.-G., C.B., C.B.S., C.A.R.W., D.C.P., D.M.J., E.A., E.F., E.T.C., E.V., F.M., H.E.O., J.M.L., K.A.-V., K.B., L.C.-S., L.G., L.M., M.M., M.V., O.C.L., R.E., R.E.M., R.K.C., R.R., S.A., S.S.V., T.W.W., M.B., M.M.-H. and N.S.-A. performed primary cohort data analyses. V.L., T.N., S.E.J., M.B. and J.K. performed GWAS meta-analyses. S.E.J., T.N., V.L., H.Z., S.J.A., M. Kanai, A.O.-G., B.E.F.-H., H.H.H., M.P., A.K.M. and N.S.-A. performed follow-up analyses. A. Renieri, A. Rakitko, M. Kumari, A.C., A.N., C.E., C.J., E.C.S., E.L.D., F.G., G.D.S., H.M.O., I.M.H., J.B.R., J.J.G., J.L.-E., K.C., K.K.T., K.U.L., L.A., L.H.F., L.V.C.V., L.V.W., M.I., M.M.-H., N.D.B., N.J.T., O.B.V.P., P.J.S., P.M., R.A.V., R.d.C., R.K.M., R.W., S.A.L., S.L., S.S.V., T.T.-L., Y.O., A.O.-G., M.B., A.S. and H.Z. contributed to data/sample collection. Data for initial discovery GWASs (Long COVID HGI data freeze 4) was collected by DBDS, EstBB, FinnGen, GEN-COVID, GENCOV, MexGen-COVID (Supplementary Tables 1, 3–8 and 12), ALSPAC, BoSCO, BQC19, EXCEED, GCAT (COVICAT), Genotek, GOLD, Helix, Ioannina, IrCovid, JapanTaskForce, Lifelines, MoBa, MSCIC, PMBB, SweCovid, COMRI, TiKoCo, TwinsUK, UKB and Understanding Society (Supplementary Tables 11 and 12). Replication datasets were provided by PHOSP-COVID, MVP (Supplementary Tables 9, 10 and 12), LatviaGDB, C19-GenoNet, CHIRP, EstBB, FoGS and MGB (Supplementary Table 12). V.L., S.E.J., T.N., H.Z., A.G., A.K., A.N., E.L.D., E.M., H.F.A., M.J.D., M.M.-H., M.M.M., N.S.-A., P.S., U.A.Z., A. Renieri, A. Rakitko, M. Kumari, A.C., C.E., C.J., E.C.S., F.G., G.D.S., H.M.O., I.M.H., J.B.R., J.J.G., J.L.-E., K.C., K.K.T., K.U.L., L.A., L.H.F., L.V.C.V., L.V.W., M.I., N.D.B., N.J.T., O.B.V.P., P.J.S., P.M., R.A.V., R.d.C., R.K.M., R.W., S.A.L., S.L., S.S.V., T.T.-L., Y.O., A.A.R., A.H.-C., A.M., A.R.D., A.S., A.S.F.K., B.C., B.G.G., C.B., C.B.S., C.A.R.W., D.C.P., D.M.J., E.A., E.F., E.T.C., E.V., F.M., H.E.O., J.M.L., K.A.-V., K.B., L.C.-S., L.G., L.M., M.M., M.V., O.C.L., R.E., R.E.M., R.K.C., R.R., S.A. and T.W.W. wrote and reviewed the manuscript. All other authors were involved in the design, management, coordination or analysis of contributing studies. See Supplementary Tables 2–10 for more detailed information on author contributions and roles.

Corresponding authors

Correspondence to Hugo Zeberg or Hanna M. Ollila.

Ethics declarations

Competing interests

S.B. has ownerships in Intomics A/S, Hoba Therapeutics Aps, Novo Nordisk A/S, Lundbeck A/S, ALK abello A/S, Eli Lilly and Co and is managing board memberships in Proscion A/S and Intomics A/S. A.B., K.M.S.B., S.W., N.L.W., F.T., E.S. and E.T.C. are employees of Helix. A.D. received an honorarium from Gilead Sciences. A.L.G. and C.J. have funded research collaborations with Orion for collaborative research projects outside the submitted work. T.H. and H.E.O.B. have options in Sano Genetics. P.J.S. is a shareholder of Sano Genetics. T.H.K. has received consulting fees from Albireo, Boehringer Ingelheim, MSD and Falk Pharma. K.U.L. is cofounder and member of the scientific board of LAMPseq Diagnostics GmbH. T.N. has received speaking fee from Boehringer Ingelheim for talks unrelated to this research. M.E.K.N. is a current employee of Novartis Pharma AG. J.B.R.’s institution has received investigator-initiated grant funding from Eli Lilly, GlaxoSmithKline and Biogen for projects unrelated to this research. He is the CEO of 5 Prime Sciences (www.5primesciences.com), which provides research services for biotech, pharma and venture capital companies for projects unrelated to this research. V.F. is an employee of 5 Prime Sciences. C.D.S. reports grants and personal fees from AstraZeneca, Janssen-Cilag and ViiV Healthcare, personal fees and nonfinancial support from BBraun Melsungen, grants, personal fees and nonfinancial support from Gilead Sciences, personal fees from BioNtech, Eli Lilly, Formycon, Pfizer, Roche, Apeiron, GSK, Molecular partners, SOBI, AbbVie, MSD and Synairgen and grants from Cepheid. L.V.W. reports research funding from GlaxoSmithKline, Genentech and Orion Pharma, and consultancy for Galapagos and GlaxoSmithKline, outside of the submitted work. J.W. is a consultant for Roboscreen GmbH, Biogen GmbH, Immungenetics AG, Noselab GmbH, Roche Diagnostics International, Roche Pharma AG, Janssen-Cilag GmbH, Eisai GmbH, Boehringer Ingelheim and Lilly Deutschland GmbH and has received honoraries from Eisai GmbH, Biogen GmbH, AGNP e. V., Veranex, Med Update GmbH, Guangzhou Gloryren Medical Technology (China), Pfizer Pharma GmbH, Fachverband Rheumatologische Fachassistenz e. V., AWO Psychiatrie Akademie gGmbH, Neuroakademie E. V., Beijing Yibai Science und Technology Ltd., Abbott Laboratories GmbH, Lilly Deutschland GmbH, Simon & Kucher and streamedup! GmbH. The other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–13 and Supplementary Note (Supplementary Methods and Acknowledgements).

Reporting Summary

Supplementary Tables

Supplementary Tables 1–36.

Supplementary Data

Harmonized association statistics for MR exposures and outcomes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lammi, V., Nakanishi, T., Jones, S.E. et al. Genome-wide association study of long COVID. Nat Genet 57, 1402–1417 (2025). https://doi.org/10.1038/s41588-025-02100-w

Download citation

Received: 15 June 2024
Accepted: 27 January 2025
Published: 21 May 2025
Version of record: 21 May 2025
Issue date: June 2025
DOI: https://doi.org/10.1038/s41588-025-02100-w

This article is cited by

Pre-pandemic disease trajectories and genetic insights into long COVID susceptibility
- Natalia Blay
- Xavier Farré
- Rafael de Cid
BMC Medicine (2025)
Human genetics implicate thromboembolism in the pathogenesis of long COVID in individuals of European ancestry
- Art Schuermans
- Andreas Verstraete
- Peter Verhamme
Nature Cardiovascular Research (2025)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Genetic variants in FOXP4 locus associated with long COVID

Frequency of long COVID variants varies across ancestries

Risk variants, FOXP4 expression and COVID-19 severity

FOXP4 expression in blood is associated with long COVID

FOXP4 expression in alveolar and immune cells in the lung

FOXP4 variants located at active chromatin in the lung

FOXP4 variant associated with lung cancer

Long COVID and other phenotypes

FOXP4 signal not explained simply by COVID-19 severity

FOXP4 associates with multiple symptoms of long COVID

Discussion

Methods

Contributing studies

Phenotype definitions

GWAS

GWAS meta-analyses

Principal component projection

eQTL, PheWAS and colocalization

Genetic correlation and MR

Bayesian clustering of effects based on linear relationships

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

Long COVID Host Genetics Initiative

FinnGen

Avon Longitudinal Study of Parents and Children (ALSPAC)

Banque québécoise de la COVID-19 (BQC19)

Bonn Study of COVID Genetics (BoSCO)

VA Million Veteran Program

MexGen-COVID Initiative

Norwegian Mother Father and Child Cohort Study (MoBa)

Penn Medicine BioBank (PMBB)

Follow-UP study of patients with critical COVID-19 (SweCovid) and COVID-19 Cohort Study of the University Hospital of the Technical University Munich (Muenchen rechts der Isar) (COMRI)

Tirschenreuth Study (TiKoCo)

TwinsUK

UK Biobank (UKB)

UnderstandingSociety: UK Household Longitudinal Study

COVID-19 Genomics Network (C19-GenoNet)

COVID-19 Host Immune Response Pathogenesis Study (CHIRP)

DBDS Genomic Consortium

Extended Cohort for E-health, Environment and DNA (EXCEED)

Genomes for Life (GCAT) and Cohort COVID in Catalonia (COVICAT study)

GEN-COVID Multicenter Study

Genetics of Long Covid (GOLD)

Genotek

Helix–Helix Exome+ and Healthy Nevada Project COVID-19 Phenotypes

Covid-19 Ioannina Biobank

Genome-wide assessment of the gene variants associated with severe COVID-19 phenotype in Iran (IrCovid)

Japan COVID-19 Task Force

Lifelines

Mount Sinai COVID Biobank (MSCIC)