Genome-wide association study of post COVID-19 syndrome in a population-based cohort in Germany

Ruß, Anne-Kathrin; Schreiber, Stefan; Lieb, Wolfgang; Vehreschild, J. Janne; Heuschmann, Peter U.; Illig, Thomas; Appel, Katharina S.; Vehreschild, Maria J. G. T.; Krefting, Dagmar; Reinke, Lennart; Viebke, Alin; Poick, Susanne; Störk, Stefan; Reese, Jens-Peter; Zoller, Thomas; Krist, Lilian; Ellinghaus, David; Foesel, Bärbel U.; Gieger, Christian; Lorenz-Depiereux, Bettina; Witzenrath, Martin; Anton, Gabriele; Krawczak, Michael; Heyckendorf, Jan; Bahmer, Thomas

doi:10.1038/s41598-025-00945-z

Download PDF

Article
Open access
Published: 06 May 2025

Genome-wide association study of post COVID-19 syndrome in a population-based cohort in Germany

Anne-Kathrin Ruß^1,2,
Stefan Schreiber³,
Wolfgang Lieb²,
J. Janne Vehreschild^4,5,
Peter U. Heuschmann^6,7,8,
Thomas Illig^9,10,
Katharina S. Appel⁴,
Maria J. G. T. Vehreschild¹¹,
Dagmar Krefting^12,13,
Lennart Reinke³,
Alin Viebke³,
Susanne Poick^2,3,
Stefan Störk^14,15,
Jens-Peter Reese^6,7,16,
Thomas Zoller¹⁷,
Lilian Krist¹⁸,
David Ellinghaus¹⁹,
Bärbel U. Foesel²⁰,
Christian Gieger²⁰,
Bettina Lorenz-Depiereux²⁰,
Martin Witzenrath^21,22,
Gabriele Anton²³,
Michael Krawczak¹^na1,
Jan Heyckendorf^3,24,25^na1 &
…
Thomas Bahmer^3,24^na1

Scientific Reports volume 15, Article number: 15791 (2025) Cite this article

4919 Accesses
3 Citations
50 Altmetric
Metrics details

Subjects

Abstract

If health impairments due to coronavirus disease 2019 (COVID-19) persist for 12 weeks or longer, patients are diagnosed with Post-COVID Syndrome (PCS), or Long-COVID. Although the COVID-19 pandemic has largely subsided in 2024, PCS is still a major health burden worldwide, and identifying potential genetic modifiers of PCS remains of great clinical and scientific interest. We therefore performed a case-control type genome-wide association study (GWAS) of three recently developed PCS (severity) scores in 2,247 participants of COVIDOM, a prospective, multi-centre, population-based cohort study of SARS-CoV-2-infected individuals in Germany. Each PCS score originally represented the weighted sum of the binary indicators of all, or a subset, of 12 PCS symptom complexes, assessed six months or later after the PCR test-confirmed SARS-CoV-2 infection of a participant. For various methodical reasons, however, the PCS scores were dichotomized along their respective median values in the present study, prior to the GWAS. Of the 6,383,167 single nucleotide polymorphisms included, various variants were found to be associated with at least one of the PCS scores, although not at the stringent genome-wide statistical significance level of 5 × 10^− 8. With p = 6.6 × 10^− 8, however, the genotype-phenotype association of SNP rs9792535 at position chr9:127,166,653 narrowly missed this threshold. The SNP is located in a region including the NEK6, PSMB7 and ADGRD2 genes which, however, does not immediately suggest an etiological connection to PCS. As regards functional plausibility, variants of a possible effect mapped to the olfactory receptor gene region (lead SNP rs10893121 at position chr11:123,854,744; p = 2.5 × 10^− 6). Impairment of smell and taste is a pathognomonic feature of both, acute COVID-19 and PCS, and our results suggest that this connection may have a genetic basis. Three other genotype-phenotype associations pointed towards a possible etiological role in PCS of cellular virus repression (CHD6 gene region), activation of macrophages (SLC7A2) and the release of virus particles from infected cells (ARHGAP44). All other gene regions highlighted by our GWAS did not relate to pathophysiological processes currently discussed for PCS. Therefore, and because the genotype-phenotype associations observed in our GWAS were generally not very strong, the complexity of the genetic background of PCS appears to be as high as that of most other multifactorial traits in humans.

Self-reported symptom severity, general health, and impairment in post-acute phases of COVID-19: retrospective cohort study of Swedish public employees

Article Open access 17 November 2022

Cross-sectional study of health impairment related to post COVID-19 condition among participants of a large population-based cohort in Germany

Article Open access 16 July 2025

A genome-wide association study for survival from a multi-centre European study identified variants associated with COVID-19 risk of death

Article Open access 06 February 2024

Introduction

The global pandemic of coronavirus disease 2019 (COVID-19), a potentially fatal condition caused by infection with the newly identified coronavirus SARS-CoV-2, started in China in late 2019. While respiratory symptoms such as cough and dyspnoea have since been found to predominate in COVID-19 patients, the disease also affects multiple other organs to a varying degree^1,2. Severe cases might ultimately suffer from acute respiratory distress syndrome with a potentially fatal course. If COVID-19-associated health impairments persist until 12 weeks or later after the initial infection, and if other causes can be ruled out, patients are diagnosed with Post-COVID Syndrome (PCS), or Long-COVID³ (https://www.nice.org.uk/Guidance/NG188, accessed 20 March 2025). Although the COVID-19 pandemic has lost most of its immediate threat in 2025, PCS still poses a significant public health burden worldwide because recovery from COVID-19 is often slow and the risk of chronic manifestation of symptoms is high.

The course of COVID-19 is a prominent example of the human response to viral infections, which results from the complex interaction of a variety of internal and external factors. In addition to viral properties and environmental conditions, the physiological characteristics of the host contribute to this coping process as well. While the innate and adaptive immune systems clearly play a central role in this regard, the general physical condition and the psychological resilience of the affected individual are also likely to determine whether, and how well, the infection is overcome⁴. Notably, as with most complex human traits, differences regarding such host factors can be assumed to be influenced, at least in part, by genetic variation.

Over the past 25 years, most research into the genetic basis of complex human traits has been conducted in the form of case-control studies that exploit the statistical association, at the population level, between genetic variants that are truly causative of the trait of interest and the genotypes of, usually functionless, single nucleotide polymorphisms (SNPs). These so-called ‘genome-wide association studies’ (GWAS) have led to the identification of genotypic associations with over 5,000 human traits, according to the European Bioinformatics Institute GWAS catalogue (http://www.ebi.ac.uk/gwas, accessed 20 March 2025).

While a considerable number of GWAS were performed for acute COVID-19 shortly after the outbreak of the pandemic^5,6,7, only two such studies have been published for PCS to date. The study by Lammi et al.⁸, which relied upon resources from the COVID-19 Host Genetics Initiative, comprised > 6,000 cases and > 1 million population controls from 24 studies in Europe, North America and East Asia. Through meta-analysis, a single genome-wide significant association with PCS was identified, namely of the FOXP4 locus that had been connected before to acute COVID-19 severity as well. Although acute disease severity is a known risk factor for PCS⁴, the authors of the GWAS emphasized that this connection was not sufficient to explain the association between PCS and FOXP4 genetic variation. The second GWAS, by Taylor et al.⁹, was conducted in two sub-cohorts of the Sano Genetics Long-COVID GOLD study, where > 90% participants had self-reported “white” ethnicity. Taking a combinatorial approach that allowed for gene-gene interactions, the study identified 73 genes to be associated with PCS severity in at least one of the two sub-cohorts. Notably, nine of these genes had also been connected to acute COVID-19 severity before.

The power of a GWAS and the validity of its results critically depend upon the definition of the underlying phenotype. A score developed by our group in 2022 specifically for the purpose of phenotyping PCS patients allows their overall COVID-19-related health problems to be severity-graded on the basis of 12 self-assessed symptom complexes⁴. This PCS score was also shown to have two main predictors, namely acute COVID-19 severity and individual resilience, both of which are rather differently associated with the 12 symptom complexes underlying the score⁴. This specificity of association inspired the definition of two additional PCS scores that encompass only subsets of the 12 symptom complexes, and that allow for more accurate assessment of PCS severity in the context of both research and clinical care¹⁰.

Despite the subsiding of the pandemic, identifying genetic modifiers of the long-term consequences of COVID-19 still remains of great practical and scientific interest, not least with a view to the development of efficient clinical treatment and patient management strategies. Therefore, we performed a GWAS of the three abovementioned PCS scores in participants of COVIDOM, a prospective, multi-centre, population-based cohort study of SARS-CoV-2-infected individuals in Germany.

Materials and methods

The COVIDOM study

The COVIDOM study of the long-term health effects of COVID-19 started in Germany in October 2020 and was funded by the German Federal Ministry of Education and Research as part of the National Pandemic Cohort Network (NAPKON, grant number: 01KX2121). Participants of this prospective, multi-centre, population-based cohort study were recruited six months or later after their PCR test-confirmed SARS-CoV-2 infection. Between November 2020 and May 2023, a total of 3,632 probands took part in an online survey and were subsequently examined during their visit at one of the COVIDOM study sites in Kiel (n = 2,557), Berlin (n = 469) and Würzburg (n = 606). For further details about the COVIDOM study, see Horn et al.¹¹ and Bahmer et al.⁴.

Study protocol

Data acquisition in COVIDOM took place in two phases. First, a questionnaire-based online interview was carried out to collect relevant socio-demographic data of the participants as well as self-reported information about the acute phase of their infection. During the study site visit, every participant underwent examinations covering various medical areas, including anthropometry, pulmonology, olfactory and gustatory functions, cardiology, neurology, geriatrics and hepatology. Validated tools such as the PHQ-8 and FACIT-F questionnaires were used for additional data acquisition. These procedures are currently repeated annually as part of a long-term follow-up of COVIDOM participants.

Phenotype definition

The PCS score developed by Bahmer et al.⁴ differentially weighs the presence of 12 clinical symptom complexes to quantify the severity of the COVID-19-related long-term health problems of patients. In the present GWAS, a median split according to the PCS score served to divide COVIDOM participants into two PCS severity groups. The same approach was taken for two recently proposed PCS sub-scores that are more specific than the original score for the two main risk factors for severe PCS¹⁰, namely low individual resilience and severe acute illness. Both risk factors were themselves considered as outcome variables in additional GWAS after dichotomization along the median value of the BRS Brief Resilience Scale¹², or employing a threshold of four for the number of self-reported severe/life threatening acute phase symptoms of COVID-19.

While the dichotomization of quantitative measurements is a popular means in medical research to reduce the complexity of the analysis and interpretation of study data, it is also known to reduce statistical power and to potentially introduce bias¹³. However, these drawbacks primarily concern continuous measurements with ‘well-behaved’ statistical distributions, mostly normal or log-normal, for which the highly implemented, usually model-based analysis methods available to researchers are known to be valid. For measurements with more irregular properties such as strong skewness, multi-modality or non-linear relations to predictors and covariates, in contrast, dichotomization may be justified in order to balance power loss and methodological validity, i.e. correctness of p values¹⁴. The three PCS scores studied here definitely belong to the latter category of outcome measure, not least because they are weighted sums of a few binary variables themselves and were designed specifically to rank the severity of a complex phenotype most efficiently.

DNA extraction, SNP genotyping and quality control

The SNP genotype data used in the present GWAS were generated and quality-controlled separately for Kiel and Würzburg/Berlin COVIDOM participants. DNA extraction from Kiel blood samples (n = 1,691) was performed at the Institute of Clinical Molecular Biology (IKMB), Kiel University, Germany, on a Chemagic 360 machine (PerkinElmer, Waltham, Massachusetts, U.S.) using low volume kit CMG-1491 and buffy coat kit CMG-714 (Chemagen, Baesweiler, Germany) according to the manufacturer protocols. Due to low buffy coat volumes, DNA extraction from the Würzburg/Berlin blood samples (n = 962) was performed at Helmholtz Munich (HMGU), Germany, in a manual process using standard methodology.

Genotyping was carried out at IKMB and HMGU with Illumina (Illumina Inc., San Diego, U.S.) Global Screening Array-24 Multi Disease (GSA) Version 3.0, following the Illumina Infinium HTS Assay Auto 3-day Workflow (Document #15045738v0). Genotype quality was controlled using thresholds of 0.1 for the missingness per SNP per individual and 10^− 5 for the Hardy-Weinberg equilibrium (HWE) test p value. The latter reflects common practice to limit the effects of HWE filtering to the detection of genotyping errors while avoiding the exclusion of rare, geographically localized variants. Linkage disequilibrium-based SNP pruning was performed with a window size of 50 SNPs, a shift of five SNPs per step, and an R² threshold of 0.2. The threshold for the identity-by-descent probability was 0.125.

Genotype imputation

Missing SNPs were imputed with IMPUTE2 (version 2.3.2) using the 1000 Genomes phase 3 reference dataset divided into lots of 5 Mb each. The threshold for the Info Score was 0.7. Genotype imputation was carried out separately for the Kiel and Würzburg/Berlin data.

Genome-wide association analysis (GWAS)

A logistic regression model as implemented in PLINK v2.00a3 SSE4.2 was used for analyzing the SNP profiles (6,383,167 variants) and various dichotomized phenotypes of 2,247 COVIDOM participants. Age, sex, virus variant (surrogate: infection before or after 27 December 2021), study site, time between infection (surrogate: PCR test) and study site visit, acute COVID-19 severity, individual resilience and the 10 first principal components of the SNP genotype profile were considered as covariates, as appropriate, and an additive SNP genotype model was employed throughout.

Inspection of the first two principal components (PCs) revealed a strong genetic similarity between COVIDOM participants and the EUR population from the 1000 Genomes project, thereby confirming the Northern and Western European ancestry of the vast majority of the former (Supplementary Fig. 1). A total of 38 participants (21 Kiel, 17 Würzburg/Berlin) were identified as outliers because of a PC1 or PC2 value that was more than five interquartile ranges below or above the 1st or 3rd quartile of the CEU population, respectively. We also assessed the potential level of genetic stratification of COVIDOM participants with regard to each of the dichotomized PCS scores, using the so-called ‘genomic inflation factor’ λ_G¹⁵ as calculated with R functions median and qchisq (http://genometoolbox.blogspot.com/2014/08/how-to-calculate-genomic-inflation.html, accessed 20 March 2025). A value of λ_G close to unity indicates sufficient control for genetic confounding when comparing two samples of participants in a GWAS.

No formal statistical testing of a genome-wide null hypothesis (i.e. no genetic effect at all) was involved in the interpretation of the analytical results. Instead, the association signals were ranked according to their respective p values, and the threshold for ‘suggestive’ statistical significance and, hence, further consideration of an association was set to 10^− 5 in order to balance the chances of false positive and negative findings. This notwithstanding, the statistical power of the available data was assessed with the UCSF Sample Size Calculator assuming a dominant effect of the minor SNP allele and a significance level of 10^− 5. For a minor allele frequency (MAF) of 0.2, the data provided 90% power to detect an allelic odds ratio (OR) of 1.75 or larger, and 80% power to detect an OR of 1.65 or larger. If the MAF was 0.3, the corresponding OR limits were 1.65 (90%) and 1.55 (80%), respectively.

In addition to the SNP-wise analyses, we assessed the polygenic background of PCS by way of polygenic scores constructed for the three dichotomized PCS scores from the GWAS summary statistics. For this, we used the LDpred2 method implemented in R package bigsnpr¹⁶ with the default HapMap3+-based LD correlation matrix as provided by the package. The performance of the polygenic scores was evaluated by the Area Under Curve (AUC) obtained when treating each polygenic score as a predictor of the corresponding (dichotomized) PCS score in the underlying GWAS dataset. If one of the PCS scores had a notable polygenic background, an overperformance of the corresponding polygenic score would have been expected in this regard.

Combinatorial analysis

For the 118 SNP genotype combinations (“disease signatures”) found by Taylor et al.⁹ to be associated with more severe or fatigue-dominant phenotypes of long COVID syndrome, carriers in COVIDOM were compared to non-carriers for their PCS severity as quantified by one of the three PCS scores, individual resilience or acute disease severity, using a Mann-Whitney test. Since each disease signature was considered five times, we applied a per-signature Bonferroni correction for multiple testing by setting the p value threshold for statistical significance to 0.05/5 = 0.01. This threshold reflects our primary intention to determine the strength of association between the PCS phenotypes considered and each individual disease signature, rather than formally testing the stringent global null hypothesis that none of the 118 disease signatures was associated with PCS.

Results

Data quality control

The 1,691 Kiel DNA samples were genotyped for 730,059 SNPs. Subsequent quality control led to the exclusion of 169 individuals because of unclear sex (n = 30), excess chance of relatedness (n = 85), reduced heterozygosity (n = 32) or likely non-European origin (n = 22), leaving 1,522 individuals for SNP genotype imputation. Several SNPs had to be excluded from further analyses because of excess LD to other variants (n = 450,139), missingness (n = 7,915) or lack of Hardy-Weinberg equilibrium (n = 3,022). After imputation and filtering with an Info Score threshold of 0.7, genotype data of Kiel COVIDOM participants were available for 5,954,344 SNPs.

For the Würzburg/Berlin sub-cohort, SNP genotypes of 836 of 962 individuals were subjected to imputation after quality control. Of the 654,027 initially included SNPs, some 265,307 remained after excluding variants for excess LD (n = 378,116), missingness (n = 10,596) or lack of Hardy-Weinberg equilibrium (n = 310). After imputation and filtering, genotypes of 5,792,334 SNPs from Würzburg/Berlin participants were available for further analysis.

Characteristics of COVIDOM GWAS cohort

A total of 2,247 COVIDOM participants had sufficient genotype and phenotype data available for inclusion into at least one of the subsequent GWAS (Table 1; for the individual frequencies of the PCS score-defining symptom complexes, see Supplementary Table 1). This cohort was characterized by a mean age of 45.8 years (SD: 15.4), a proportion of females of 56.0%, and a mean BMI of 26.8 kg/m² (SD: 5.4). Some 1,112 participants (51.15%) were self-reported current or former smokers, and 138 (6.2%) required hospitalization due to severe acute COVID-19.

Table 1 Characteristics of COVIDOM GWAS cohort (n = 2,247).

Full size table

GWAS

Original PCS score

A PCS score-based median split (at 15.5 points) of the combined Kiel and Würzburg/Berlin dataset yielded 1,048 cases and 1,067 controls, with no evidence of systematic genetic confounding (λ_G = 1.016). With this sample size, only unrealistically strong genotype-phenotype associations would have been detectable at the so-called ‘genome-wide significance level’ of 5 × 10^− 8, which in turn would have led to an unacceptably high false negative rate. Based upon preceding power calculations, we therefore set the p value threshold for further consideration of an association as ‘suggestively’ significant to 10^− 5.

Under this condition, nine loci showed a suggestively significant genotypic association with the dichotomized original PCS score (Fig. 1A). All genes located within 500 kb on either side of the respective lead SNPs are shown in Fig. 1B. The most significant association, with SNP rs10893121 (p = 2.5 × 10^− 6) at position chr11:123,854,744, is located in a region harbouring genes from olfactory receptor families 4, 6 and 10. The minor allele of rs10893121 increases the risk of developing more severe PCS by approximately 50% (OR = 1.49). The strongest association (OR = 2.47) was observed for SNP rs61739314 located at position chr20:39,990,377 in a region that contains the ZHX3 (zinc fingers and homeoboxes 3), LPIN3 (lipin 3), EMILIN3 (elastin microfibril interfacer 3) and CHD6 (chromodomain helicase DNA-binding protein 6) genes. Notably, the region around the FOXP4 gene (highlighted in green in Fig. 1A), the site of the sole genome-wide significant association with PCS reported so far, by Lammi et al.⁸, was not found to be associated (at the ‘suggestive’ significance level of p < 10^− 5) with the dichotomized original PCS score in our data. Moreover, none of the associations highlighted in the present GWAS achieved nominal significance in the study by Lammi et al.⁸.

To assess the polygenic component of the original PCS score, a polygenic score was constructed from the GWAS summary statistics obtained for individual SNPs. However, treatment of the polygenic score as a predictor of the dichotomized original PCS score in the underlying GWAS dataset yielded an AUC value of only 0.523, suggesting that the polygenic contribution to the PCS score was rather small.

Acute COVID-19 severity-specific PCS score (PCS-S)

The median of the PCS-S score equalled 2.5, and the corresponding median split resulted in 1,124 controls and 1,092 cases (λ_G = 1.012). The most significant genotypic association with the dichotomized score was observed for SNP rs9792535 (p = 6.6 × 10^− 8) located at position chr9:127,166,653 (Fig. 2A). Carriership of the minor allele of this SNP increased the risk for the more severe acute disease-specific sub-type of PCS by a factor of 3.5. Genes located in this region include NEK6, PSMB7 and ADGRD2 (Fig. 2B). Other genes around SNPs that achieved p < 10^− 5 were MAST4 on chromosome 5, FBXO42 and SZRD1 on chromosome 1, and pseudogene RNU6-1230P on chromosome 4 (Fig. 2B). The AUC of the PCS-S score-specific polygenic score equalled 0.503.

Individual resilience-specific PCS score (PCS-R)

The median of the individual resilience-specific PCS-R score (comprising fatigue, neurological symptoms and sleep disturbance) equalled 91, and the according median split resulted in 1,095 controls and 1,076 cases (λ_G = 1.012). Associations between the dichotomized phenotype and the lead SNPs at eight loci achieved p < 10^− 5 (Fig. 3A). Genes in the surrounding 500 kb regions (Fig. 3B) include SLC7A2 and PDGFRL (chromosome 8), MYOCD and ARHGAP44 (chromosome 17), CHST11 (chromosome 12), TMEM200A (chromosome 6) and CADM2 (chromosome 3). The AUC of the PCS-R score-specific polygenic score equalled 0.522.

Individual resilience and acute COVID-19 severity

In addition to the PCS-R and PCS-S scores, we also treated their main predictors, individual resilience and acute COVID-19 severity, as outcomes in GWAS to rule out indirect SNP genotype associations with either of the two scores. In both cases, however, no overlap was observed between regions of suggestively significant SNP genotype associations (p < 10^− 5) with either a score or its specific predictor.

A suggestively significant association (p < 10^− 5) with dichotomized acute COVID-19 severity (< 4 or ≥ 4 severe or life-threatening symptoms) was identified at 18 loci (Supplementary Fig. 2), with lead SNP rs78692815 at position chr11:108,617,211 narrowly missing genome-wide significance (p = 6.5 × 10^− 8). Four loci showed a suggestively significant association (p < 10^− 5) with dichotomized individual resilience (Supplementary Fig. 3). The most significant association was observed for SNP rs2780980 at position chr9:86,234,596 (p = 2.6 × 10^− 6).

Combinatorial analysis

In a combinatorial analysis, Taylor et al.⁹ previously identified various SNP genotype combinations (“disease signatures”) to be associated with severe PCS. We compared carriers and non-carriers of these signatures in terms of their PCS scores, individual resilience and acute COVID-19 severity. Of the 118 disease signatures tested, 12 showed a nominally significant association (p < 0.05) with at least one of the three dichotomized PCS scores (Table 2). However, only the association between disease signature 12 and the original PCS score remained significant after Bonferroni correction (p < 0.01). Notably, only six carriers of disease signature 12 were present in our dataset, five of which had an original PCS score above the median. For the remaining disease signatures, none of the associations with any of the three PCS scores was statistically significant (Supplementary Table 2).

When considering individual resilience and acute COVID-19 severity as outcome measures, nine signatures were found to be associated with at least one of the two phenotypes at the nominal significance level of 0.05 (Supplementary Table 3). However, only the association between signature 110 and individual resilience remained significant after Bonferroni correction (p < 0.01). None of the associations between the remaining signatures and the two phenotypes was statistically significant (Supplementary Table 4).

Table 2 Disease signatures (Taylor et al. 2023) showing a nominally significant association (p < 0.05) with at least one dichotomized PCS score.

Full size table

Discussion

Our study revealed that genetic variation at several loci throughout the human genome is potentially related to the severity of the long-term sequelae of COVID-19, which was quantified here by one of three PCS scores previously developed by our group for the work-up of the SARS-CoV-2 pandemic. As can be inferred from the Manhattan plots illustrating the GWAS results (Figs. 1, 2 and 3), however, none of the observed associations reached the stringent level of so-called ‘genome-wide significance’ (5 × 10^− 8). Similarly, the corresponding QQ plots do not suggest strong genetic effects upon the three PCS scores studied (Supplementary Fig. 4). Although adjusting the analyses for additional covariates would reduce the p values of individual genotype-phenotype associations further, particularly when considering BMI (Supplementary Table 5), the main results would however remain unchanged in that the allelic odds ratios estimated for the minor alleles of all lead SNPs virtually stayed the same (Supplementary Table 6).

In our view, this outcome does not detract from the scientific value of the GWAS. On the contrary: Not only would non-reporting of a lack of genome-wide significant results contribute to publication bias, the information that PCS, when regarded as a composite phenotype, does not appear to have a strong genetic basis is very important to guide future research in this field.

While we could not reproduce, even at a less stringent ‘suggestive’ significance level of 10^− 5, the single genome-wide genotype-phenotype association reported so far for PCS, namely with the FOXP4 gene on chromosome 6⁸, we nevertheless identified SNPs in other genomic regions that are at least suggestive of a functional involvement into PCS etiology and that point towards biologically meaningful links to the phenotype in question. This particularly includes olfactory impairment in the case of rs10893121, the SNP most significantly associated (p = 2.5 × 10^− 6) with the original PCS score.

Previous findings that acute COVID-19 severity and individual resilience are the sole main predictors of PCS severity previously encouraged us to develop two additional PCS scores, PCS-S and PCS-R, to address different sub-domains of PCS that are related to either predictor. While the PCS-S score comprises a number of rather different symptom complexes (impairment of smell and taste, fatigue, joint or muscle pain, general signs of infection, and exercise intolerance), the PCS-R score only included two complexes, in addition to fatigue, namely neurological ailments and sleep disturbances. Interestingly, it turned out that the potential genotypic associations with the two predictor-specific scores did not overlap with one another, providing evidence that they may indeed reflect different clinical entities. Since their constituent symptom complexes were also non-overlapping, it was not surprising that the two sub-scores were associated with different SNPs, and even although the observed associations were comparatively weak, they may be worth follow-up research to shed more light on the etiology of PCS.

PCS undoubtedly is a heterogeneous clinical condition, and the major hypotheses about PCS pathophysiology currently include (I) immune dysregulation, (II) microbiota dysbiosis, (III) autoimmunity and immune imprinting, (IV) blood clotting and endothelial abnormalities and (V) dysfunctional neurological signalling³. In the following, we will assess whether the potential genetic associations identified for the three PCS scores in the present GWAS would be consistent with the above hypotheses, considering possible functional links to genes located within or near the associated regions.

PCS score

rs10893121: olfactory receptors OR4D5, OR6T1, OR10S1, OR10G4, OR10G7, OR10G8 and OR10G9

The perception of smell is triggered by odorant molecules in the nasal tissue that initiate a neural response. Olfactory receptors are members of a large family of G protein-coupled receptors that are responsible for the G protein-mediated transduction of olfactory sensory signals. OR4D5, OR6T1, OR10S1, OR10G4, OR10G7, OR10G8 and OR10G9 all belong to this protein family^17,18. While their role in olfactory function is clear-defined, no particular disease has so far been associated with SNPs in the corresponding gene regions. Important in the present context, impairment of smell and taste contributes only 3.5 points to the PCS score (range: 0 to 59) and hence adds only little to the overall severity of PCS. Moreover, while impaired smell is a pathognomonic feature of acute COVID-19 that may persist for a long time, its impact on health-related quality of life is not very strong. The fact that SNP rs10893121 showed the most significant association of all with the original PCS score indicates that variation in olfactory receptor function may nevertheless be a genetic cause of longstanding olfactory impairment after (even mild) SARS-CoV-2 infection.

rs61739314: zinc fingers and homeoboxes 3 (ZHX3), lipin 3 (LPIN3), Elastin microfibril interfacer 3 (EMILIN3), chromodomain helicase DNA-binding protein 6 (CHD6)

The products of the ZHX gene family, including ZHX3, comprise two C2H2-typic zinc fingers and other proteins that may function as transcriptional repressors^19,20. While there is evidence that these gene products represent (unfavourable) prognostic markers of renal, urothelial, endometrial and thyroid cancer, no associations have been reported so far to cardiovascular or inflammatory diseases^21,22,23. Furthermore, no ZHX gene-encoded mRNA is detectable in immune cells, and the corresponding proteins are not found in blood. Therefore, a causal role of these genes in PCS pathophysiology is not very likely.

The protein encoded by the CHD6 gene can function as a transcriptional repressor and is involved in the cellular repression of influenza virus replication²⁴. This role in host-virus interaction, and the capability of the protein to activate gene transcription in response to oxidative stress through an interaction with NFE2L2, may be a plausible explanation for the association between local SNP rs61739314 and PCS²⁵.

The LPIN3 gene is ubiquitously expressed in duodenum, skin and more than 20 other tissues. One of the functions of lipin complexes is to contribute to gene regulation by acting as transcriptional co-activators in the nucleus^26,27. In addition, lipin complexes process precursors of triglycerides and phospholipids in the cytoplasm²⁸. However, protein functions that might be relevant in PCS pathophysiology are not immediately apparent.

Finally, the EMILIN3 gene product is part of the collagen-containing extra-cellular matrix. While the gene is highly expressed in connective tissue in both female and male reproductive organs, a connection to pathophysiological processes of PCS is also not obvious²⁹.

PCS-S score

rs9792535 – NIMA related kinase 6 (NEK6), proteasome 20 S subunit beta 7 (PSMB7) and adhesion G protein-coupled receptor D2 (ADGRD2)

The ‘never in mitosis A’ (NIMA) gene of Aspergillus nidulans encodes a serine/threonine kinase that controls initiation of mitosis. Human NIMA-related kinases (NEKs), like NEK6, are homologues of fungal NIMA and perform similar functions. Inhibition of the proteins can lead to apoptosis^30,31. The NEK6 gene is most abundantly expressed in gallbladder but is neither specific to immune cells nor expressed in blood. While there are some connections between the gene and tumorigenesis, e.g. by suppressing p53-induced cancer cell senescence³², no link to PCS pathophysiology is obvious.

Proteasome 20 S subunit beta 7 (PSMB7) is a multi-catalytic proteinase complex that is distributed throughout eukaryotic cells and cleaves peptides in a non-lysosomal pathway. Gamma interferon may downregulate this proteosomal catalytic subunit. The PSMB7 complex plays many essential roles in the cell by associating with different regulatory particles. Removing misfolded or damaged proteins that could impair cellular function is one of the major capacities of the gene product^33,34. Adhesion G protein-coupled receptor D2 (ADGRD2) is predicted to be involved in adenylate cyclase-activating G protein-coupled receptor signaling pathway and is an integral component of the membrane^35,36. Therefore, an involvement of PSMB7 or ADRG2 also in processes hypothesized to cause PCS appears plausible.

rs58659000, rs10465914 – MAST4, FBXO42 and SZRD1

The MAST4 gene encodes a protein belonging to the microtubule-associated serine/threonine protein kinases, which are mainly expressed in cytoplasm, predominantly in esophagus and urinary bladder³⁷. FBXO42 is a member of the F-box protein family and is characterized by a 40 amino acid F-box motif. Full-length cloning of FBXO42 in a mammary library identified the so-called ‘Just one F-box and Kelch domain-containing protein’ (JFK), a critical negative regulator of p53³⁸. Both FBXO42 and SZRD1 are negative markers of liver cancer. Finally, SZRD1 suppresses cell proliferation by inducing cell cycle arrest and apoptosis³⁹. All three genes are thus not obviously connected to pathophysiological processes relevant for PCS onset or severity.

PCS-R score

SLC7A2, PDGFRL, MYOCD, ARHGAP44, CHST11, TMEM200A and CADM2

The protein encoded by the SLC7A2 (solute carrier family 7 member 2) gene is a cationic amino acid transporter belonging to the APC (amino acid-polyamine-organocation) family of transporters⁴⁰. Located in the cell membrane, it is responsible for the cellular uptake of arginine, lysine and ornithine. Three transcript variants encoding different isoforms have been detected for this gene⁴⁰. SLC7A2 may play a role in macrophage activation through its role, as a member of the cationic amino acid transporter protein family, in L-arginine transport.

The PDGFRL gene encodes a protein with significant amino acid sequence similarity to the ligand binding domain of platelet-derived growth factor receptor beta⁴¹. Mutations in this gene, and the deletion of a chromosomal segment containing this gene, are both associated with sporadic hepatocellular carcinomas, colorectal cancers and non-small cell lung cancers, suggesting that the gene product may function as a tumor suppressor^42,43.

The protein encoded by the MYOCD gene is found in smooth muscle cells and cardiac muscle cells. Through forming a complex with serum response factor, it functions as a transcriptional activator of CArG box-dependent cardiac promotors⁴⁴. The two proteins play a crucial role in cardiogenesis, urinary bladder development and the differentiation of smooth muscle cells (myogenesis)⁴⁴. Potential clinical phenotypes associated with variation in the MYOCD gene include congenital megabladder, but no association to inflammatory, auto-immune or other processes relevant to PCS are known.

ARHGAP44 enables phospholipid binding activity and is involved in actin cytoskeleton dynamics for filopodia protrusion and cell migration⁴⁵. It is predicted to play a role in several processes, including modification of dendritic spine, negative regulation of Rac protein signal transduction and regulation of plasma membrane-bounded cell projection organization. The protein is not detected in immune cells or blood. Previous studies suggested that ARHGAP44 has an indirect influence on viral-induced tetherin signaling⁴⁶. There is also growing evidence that ORF7a and Spike act as tetherin antagonists in SARS-CoV-2 infections^47,48, and another study suggested that SARS-CoV-2 inhibits ORF3a tetherin by trapping it in late endocytic organelles⁴⁹.

The protein encoded by the CHST11 gene belongs to the sulfotransferase 2 family. A chromosomal translocation, t(12;14)(q23;q32), involving this gene and IgH has been reported in a patient with B cell chronic lymphocytic leukemia³⁴. Furthermore the protein is a negative marker of renal and urothelial cancer⁵⁰. TMEM200A is predicted to be an integral component of the membrane, but its detailed function is still unknown.

The protein encoded by the CADM2 gene belongs to the synaptic cell adhesion molecule 1 (SynCAM) family, which is part of the immunoglobulin superfamily⁵¹. It has a cytosolic binding site for members of the protein 4.1 family, known to interact with cytoskeletal proteins. Variants in the CADM2 gene are associated with post-bronchodilator FEV1 and FEV1/FVC ratio, alcohol consumption, general risk-taking tendency and adventurousness, and BMI^52,53,54.

Strengths and limitations

The major strength of our GWAS is that it was based upon a prospective population-based cohort study of PCS with deep phenotyping of participants. Although the PCS scores used in our study drew upon patient self-reports, clinical characteristics of study participants, including laboratory measurements, vital signs, lung function, neurological testing, and echocardiography are available for possible follow-up analyses.

Compared to GWAS of other complex human phenotypes, the sample size of the present study (maximum n = 2,216) was modest, which may have resulted in limited power to detect weak to moderate genetic effects. However, by lowering the significance level accordingly, we aimed to reduce the risk of false negative results to a level that, we believe, ensured that clinically and biologically truly significant associations were unlikely to have been missed. Indeed, inspection of the PCS score-specific QQ plots obtained in the three GWAS reveals that an enrichment of such associations is conceivable at least for the acute severity-sensitive PCS-S score (Supplementary Fig. 4).

Conclusions

We found various SNPs to be potentially associated with PCS severity, first and foremost variants in the olfactory receptor gene region. Impairment of smell and taste is a pathognomonic feature of both, acute COVID-19 and PCS, and our results suggest that this connection may have a genetic basis. Three other genotype-phenotype associations pointed towards an association between PCS and cellular virus repression (CHD6), activation of macrophages (SLC7A2) and the release of virus particles from infected cells (ARHGAP44). All other gene regions highlighted by our GWAS, however, did not relate to pathophysiological processes currently discussed for PCS. Therefore, and because the genotype-phenotype associations observed in our GWAS were generally not very strong, the complexity of the genetic background of PCS and its sub-domains appears to be rather high, and thus comparable to that of many other multifactorial traits in humans.

Data availability

All data of this study are available upon request from the NAPKON Data Use and Access Committee. For information on the NAPKON data governance and for submission of research proposals, access https://proskive.napkon.de (accessed 20 March 2025). Summary statistics of all GWAS conducted in this study are publicly available at locuszoom.org under study label ‘NAPKON POP’ (reference numbers 769746, 372109, 667841, 447780, 114850).

References

Huang, C. et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395, 497–506 (2020).
Article CAS PubMed PubMed Central Google Scholar
Guan, W. et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl. J. Med. 382, 1708–1720 (2020).
Article CAS PubMed Google Scholar
Davis, H. E., McCorkell, L., Vogel, J. M. & Topol, E. J. Long COVID: major findings, mechanisms and recommendations. Nat. Rev. Microbiol. 21, 133–146 (2023).
Article CAS PubMed PubMed Central Google Scholar
Bahmer, T. et al. Severity, predictors and clinical correlates of Post-COVID syndrome (PCS) in Germany: A prospective, multi-centre, population-based cohort study. eClinicalMedicine 51, 101549 (2022).
Article PubMed PubMed Central Google Scholar
The Severe Covid-19 GWAS Group. Genomewide association study of severe Covid-19 with respiratory failure. N Engl. J. Med. 383, 1522–1534 (2020).
Article Google Scholar
Degenhardt, F. et al. Detailed stratified GWAS analysis for severe COVID-19 in four European populations. Hum. Mol. Genet. 31, 3945–3966 (2022).
Article CAS PubMed PubMed Central Google Scholar
Lin, S. et al. Genome-wide epistasis study highlights genetic interactions influencing severity of COVID-19. Eur. J. Epidemiol. 38, 883–889 (2023).
Article PubMed Google Scholar
Lammi, V. et al. Genome-wide association study of long COVID. Preprint at (2023). https://doi.org/10.1101/2023.06.29.23292056
Taylor, K. et al. Genetic risk factors for severe and fatigue dominant long COVID and commonalities with ME/CFS identified by combinatorial analysis. J. Transl Med. 21, 775 (2023).
Article CAS PubMed PubMed Central Google Scholar
Ballhausen, S. et al. Subdomains of Post-COVID-Syndrome (PCS) - A population-based study. Preprint At. https://doi.org/10.48550/ARXIV.2503.07283 (2025).
Article Google Scholar
Horn, A. et al. Long-term health sequelae and quality of life at least 6 months after infection with SARS-CoV-2: design and rationale of the COVIDOM-study as part of the NAPKON population-based cohort platform (POP). Infection 49, 1277–1287 (2021).
Article CAS PubMed PubMed Central Google Scholar
Smith, B. W. et al. The brief resilience scale: assessing the ability to bounce back. Int. J. Behav. Med. 15, 194–200 (2008).
Article PubMed Google Scholar
Fedorov, V., Mannino, F. & Zhang, R. Consequences of dichotomization. Pharm. Stat. 8, 50–61 (2009).
Article PubMed Google Scholar
Streiner, D. L. Breaking up is hard to do: the heartbreak of dichotomizing continuous data. Can. J. Psychiatry. 47, 262–266 (2002).
Article PubMed Google Scholar
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Article CAS PubMed Google Scholar
Privé, F., Aschard, H., Ziyatdinov, A. & Blum, M. G. B. Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and Bigsnpr. Bioinformatics 34, 2781–2787 (2018).
Article PubMed PubMed Central Google Scholar
Buck, L. & Axel, R. A novel multigene family May encode odorant receptors: A molecular basis for odor recognition. Cell 65, 175–187 (1991).
Article CAS PubMed Google Scholar
Mombaerts, P. Seven-transmembrane proteins as odorant and chemosensory receptors. Science 286, 707–711 (1999).
Article CAS PubMed Google Scholar
Barthelemy, I. et al. zhx-1: A novel mouse homeodomain protein containing two zinc-fingers and five homeodomains. Biochem. Biophys. Res. Commun. 224, 870–876 (1996).
Article CAS PubMed Google Scholar
Igata, T. et al. Loss of the transcription repressor ZHX3 induces senescence-associated gene expression and mitochondrial-nucleolar activation. PLoS ONE. 17, e0262488 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kwon, R. J. et al. Expression and prognostic significance of zinc fingers and homeoboxes family members in renal cell carcinoma. PLoS ONE. 12, e0171036 (2017).
Article PubMed PubMed Central Google Scholar
Deng, M. et al. ZHX3 promotes the progression of urothelial carcinoma of the bladder via repressing of RGS2 and is a novel substrate of TRIM21. Cancer Sci. 112, 1758–1771 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. ZHX2 inhibits thyroid cancer metastasis through transcriptional Inhibition of S100A14. Cancer Cell. Int. 22, 76 (2022).
Article CAS PubMed PubMed Central Google Scholar
Alfonso, R. et al. CHD6 chromatin remodeler is a negative modulator of influenza virus replication that relocates to inactive chromatin upon infection: viral recruitment of CHD6 to inactive chromatin. Cell. Microbiol. 13, 1894–1906 (2011).
Article CAS PubMed Google Scholar
Nioi, P., Nguyen, T., Sherratt, P. J. & Pickett, C. B. The carboxy-terminal Neh3 domain of Nrf2 is required for transcriptional activation. Mol. Cell. Biol. 25, 10895–10906 (2005).
Article CAS PubMed PubMed Central Google Scholar
Finck, B. N. et al. Lipin 1 is an inducible amplifier of the hepatic PGC-1α/PPARα regulatory pathway. Cell. Metab. 4, 199–210 (2006).
Article CAS PubMed Google Scholar
Chen, Y., Rui, B. B., Tang, L. Y. & Hu, C. M. Lipin family proteins - key regulators in lipid metabolism. Ann. Nutr. Metab. 66, 10–18 (2015).
Article CAS PubMed Google Scholar
Wang, F. et al. Haploinsufficiency of Lipin3 leads to hypertriglyceridemia and obesity by disrupting the expression and nucleocytoplasmic localization of Lipin1. Front. Med. 18, 180–191 (2024).
Article CAS PubMed Google Scholar
Corallo, D. et al. EMILIN 3, an extracellular matrix molecule with restricted distribution in skin. Exp. Dermatol. 26, 435–438 (2017).
Article CAS PubMed Google Scholar
Yin, M. J., Shao, L., Voehringer, D., Smeal, T. & Jallal, B. The serine/threonine kinase Nek6 is required for cell cycle progression through mitosis. J. Biol. Chem. 278, 52454–52460 (2003).
Article CAS PubMed Google Scholar
Minoguchi, S., Minoguchi, M. & Yoshimura, A. Differential control of the NIMA-related kinases, Nek6 and Nek7, by serum stimulation. Biochem. Biophys. Res. Commun. 301, 899–906 (2003).
Article CAS PubMed Google Scholar
Jee, H. J. et al. Nek6 overexpression antagonizes p53-induced senescence in human cancer cells. Cell. Cycle. 9, 4703–4710 (2010).
Article CAS PubMed Google Scholar
Rivett, J. A., Bose, S., Brooks, P. & Broadfoot, K. I. Regulation of proteasome complexes by γ-interferon and phosphorylation. Biochimie 83, 363–366 (2001).
Article CAS PubMed Google Scholar
Davies, K. J. A. Degradation of oxidized proteins by the 20S proteasome. Biochimie 83, 301–310 (2001).
Article CAS PubMed Google Scholar
Bockaert, J. Molecular tinkering of G protein-coupled receptors: an evolutionary success. EMBO J. 18, 1723–1729 (1999).
Article CAS PubMed PubMed Central Google Scholar
Bjarnadóttir, T. K. et al. The human and mouse repertoire of the adhesion family of G-protein-coupled receptors. Genomics 84, 23–33 (2004).
Article PubMed Google Scholar
Fagerberg, L. et al. Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Mol. Cell. Proteom. 13, 397–406 (2014).
Article CAS Google Scholar
Sun, L. et al. JFK, a Kelch domain-containing F-box protein, links the SCF complex to p53 regulation. Proc. Natl. Acad. Sci. USA. 106, 10195–10200 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhao, N. et al. SZRD1 is a novel protein that functions as a potential tumor suppressor in cervical cancer. J. Cancer. 8, 2132–2141 (2017).
Article PubMed PubMed Central Google Scholar
Chaturvedi, R. et al. Polyamines impair immunity to Helicobacter pylori by inhibiting L-arginine uptake required for nitric oxide production. Gastroenterol 139, 1686–1698e6 (2010).
Article CAS Google Scholar
Komiya, A. et al. PRLTS gene alterations in human prostate cancer. Jpn J. Cancer Res. 88, 389–393 (1997).
Article CAS PubMed PubMed Central Google Scholar
Xu, M. et al. An integrative approach to characterize disease-specific pathways and their coordination: a case study in cancer. BMC Genom. 9, S12 (2008).
Article Google Scholar
Guo, F. J. Expression and functional characterization of platelet-derived growth factor receptor-like gene. WJG 16, 1465 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wang, D. Z. et al. Activation of cardiac gene expression by myocardin, a transcriptional cofactor for serum response factor. Cell 105, 851–862 (2001).
Article CAS PubMed Google Scholar
Galic, M. et al. Dynamic recruitment of the curvature-sensitive protein ArhGAP44 to nanoscale membrane deformations limits exploratory filopodia initiation in neurons. eLife 3, e03116 (2014).
Galão, R. P., Pickering, S., Curnock, R. & Neil, S. J. D. Retroviral retention activates a Syk-dependent hemitam in human tetherin. Cell. Host Microbe. 16, 291–303 (2014).
Article PubMed PubMed Central Google Scholar
Martin-Sancho, L. et al. Functional landscape of SARS-CoV-2 cellular restriction. Mol. Cell. 81, 2656–2668e8 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, S., Huang, K. & Wang, C. Severe acute respiratory syndrome coronavirus Spike protein counteracts BST2-mediated restriction of virus‐like particle release. J. Med. Virol. 91, 1743–1750 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stewart, H. et al. Tetherin antagonism by SARS-CoV ‐2 ORF3a and Spike protein enhances virus release. EMBO Rep. 24, e57224 (2023).
Article CAS PubMed PubMed Central Google Scholar
Zhang, P., Cui, H. & Luo, Q. High expression of CHST11 correlates with poor prognosis and tumor immune infiltration of pancreatic cancer. Clin. Lab. 68, 107754 (2022).
Article Google Scholar
Fogel, A. I. et al. SynCAMs organize synapses through heterophilic adhesion. J. Neurosci. 27, 12516–12530 (2007).
Article CAS PubMed PubMed Central Google Scholar
Lutz, S. M. et al. A genome-wide association study identifies risk loci for spirometric measures among smokers of European and African ancestry. BMC Genet. 16, 138 (2015).
Article PubMed PubMed Central Google Scholar
Karlsson Linnér, R. et al. Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nat. Genet. 51, 245–257 (2019).
Article PubMed Google Scholar
Clifton, E. A. D. et al. Genome–wide association study for risk taking propensity indicates shared pathways with body mass index. Commun. Biol. 1, 36 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the German Federal Ministry of Education and Research through funding of the COVIDON study and the Epidemiological Core Unit (ECU) of the National Pandemic Cohort Network NAPKON (grant number: 01KX2121). Additional support was provided by the Kiel University Computing Centre, Kiel, Germany, through access to its high-performance computing resources.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Michael Krawczak, Jan Heyckendorf and Thomas Bahmer contributed equally to this work.

Authors and Affiliations

Institute of Medical Informatics and Statistics, University Medical Center Schleswig-Holstein, Kiel University, Brunswiker Straße 10, 24113, Kiel, Germany
Anne-Kathrin Ruß & Michael Krawczak
Institute of Epidemiology, University Medical Center Schleswig-Holstein, Kiel University, Kiel, Germany
Anne-Kathrin Ruß, Wolfgang Lieb & Susanne Poick
Department of Internal Medicine I, University Medical Center Schleswig-Holstein, Kiel, Germany
Stefan Schreiber, Lennart Reinke, Alin Viebke, Susanne Poick, Jan Heyckendorf & Thomas Bahmer
Institute of Digital Medicine and Clinical Data Science, Faculty of Medicine, Goethe University Frankfurt, Frankfurt, Germany
J. Janne Vehreschild & Katharina S. Appel
Department I of Internal Medicine, Faculty of Medicine, University Hospital CologneUniversity of Cologne, Cologne, Germany
J. Janne Vehreschild
Institute of Clinical Epidemiology and Biometry, University of Würzburg, Würzburg, Germany
Peter U. Heuschmann & Jens-Peter Reese
Institute of Medical Data Science, University Hospital Würzburg, Würzburg, Germany
Peter U. Heuschmann & Jens-Peter Reese
Clinical Trial Center, University Hospital Würzburg, Würzburg, Germany
Peter U. Heuschmann
Hannover Unified Biobank, Hannover Medical School, Hannover, Germany
Thomas Illig
Biomedical Research in Endstage and Obstructive Lung Disease Hannover, German Center for Lung Research, Hannover, Germany
Thomas Illig
Medical Department 2, Center for Internal Medicine, University Hospital Frankfurt, Goethe University Frankfurt, Frankfurt, Germany
Maria J. G. T. Vehreschild
Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany
Dagmar Krefting
Campus Institute Data Sciences, Göttingen, Germany
Dagmar Krefting
Department of Clinical Research and Epidemiology, Comprehensive Heart Failure Center, University Hospital Würzburg, Würzburg, Germany
Stefan Störk
Department of Internal Medicine I, University Hospital Würzburg, Würzburg, Germany
Stefan Störk
Faculty of Health Sciences, THM University of Applied Sciences, Gießen, Germany
Jens-Peter Reese
Department of Infectious Diseases, Respiratory and Critical Care Medicine, Charité - Universitätsmedizin Berlin, Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin, Germany
Thomas Zoller
Institute of Social Medicine, Epidemiology and Health Economics, Charité - Universitätsmedizin Berlin, Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin, Germany
Lilian Krist
Institute of Clinical Molecular Biology, University Medical Center Schleswig-Holstein, Kiel University, Kiel, Germany
David Ellinghaus
Institute of Epidemiology, Research Unit of Molecular Epidemiology, Helmholtz Munich - German Research Center for Environmental Health, Neuherberg, Germany
Bärbel U. Foesel, Christian Gieger & Bettina Lorenz-Depiereux
Department of Infectious Diseases and Critical Care Medicine, Charité - Universitätsmedizin Berlin, Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin, Germany
Martin Witzenrath
CAPNETZ Stiftung, Hannover, Germany
Martin Witzenrath
Medical School OWL, Bielefeld University, Bielefeld, Germany
Gabriele Anton
Airway Research Center North (ARCN), German Center for Lung Research (DZL), Großhansdorf, Germany
Jan Heyckendorf & Thomas Bahmer
Leibniz Lung Clinic, Kiel, Germany
Jan Heyckendorf

Authors

Anne-Kathrin Ruß
View author publications
Search author on:PubMed Google Scholar
Stefan Schreiber
View author publications
Search author on:PubMed Google Scholar
Wolfgang Lieb
View author publications
Search author on:PubMed Google Scholar
J. Janne Vehreschild
View author publications
Search author on:PubMed Google Scholar
Peter U. Heuschmann
View author publications
Search author on:PubMed Google Scholar
Thomas Illig
View author publications
Search author on:PubMed Google Scholar
Katharina S. Appel
View author publications
Search author on:PubMed Google Scholar
Maria J. G. T. Vehreschild
View author publications
Search author on:PubMed Google Scholar
Dagmar Krefting
View author publications
Search author on:PubMed Google Scholar
Lennart Reinke
View author publications
Search author on:PubMed Google Scholar
Alin Viebke
View author publications
Search author on:PubMed Google Scholar
Susanne Poick
View author publications
Search author on:PubMed Google Scholar
Stefan Störk
View author publications
Search author on:PubMed Google Scholar
Jens-Peter Reese
View author publications
Search author on:PubMed Google Scholar
Thomas Zoller
View author publications
Search author on:PubMed Google Scholar
Lilian Krist
View author publications
Search author on:PubMed Google Scholar
David Ellinghaus
View author publications
Search author on:PubMed Google Scholar
Bärbel U. Foesel
View author publications
Search author on:PubMed Google Scholar
Christian Gieger
View author publications
Search author on:PubMed Google Scholar
Bettina Lorenz-Depiereux
View author publications
Search author on:PubMed Google Scholar
Martin Witzenrath
View author publications
Search author on:PubMed Google Scholar
Gabriele Anton
View author publications
Search author on:PubMed Google Scholar
Michael Krawczak
View author publications
Search author on:PubMed Google Scholar
Jan Heyckendorf
View author publications
Search author on:PubMed Google Scholar
Thomas Bahmer
View author publications
Search author on:PubMed Google Scholar

Contributions

Study conception and design: M.K., A.-K.R., T.B., J.H. Data generation: T.B., L.R., A.V., D.E., B.U.F., C.G., B.L.-D., G.A. Data analysis and visualization: A.-K.R., M.K. Manuscript preparation: M.K., A.-K.R., T.B., J.H. Study cohort maintenance: T.B., S.Sc., W.L., J.J.V., P.U.H., T.I., K.S.A., M.J.G.T.V., D.K., L.R., A.V., S.P., S.St., J.-P. R., T.Z., L.K., M.W., G.A. Funding acquisition: T.B., W.L., M.W., J. J.V., D.K., J.-P.R., P.U.H., M.K., S.Sc. All authors commented on different versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Michael Krawczak.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ruß, AK., Schreiber, S., Lieb, W. et al. Genome-wide association study of post COVID-19 syndrome in a population-based cohort in Germany. Sci Rep 15, 15791 (2025). https://doi.org/10.1038/s41598-025-00945-z

Download citation

Received: 25 October 2024
Accepted: 02 May 2025
Published: 06 May 2025
Version of record: 06 May 2025
DOI: https://doi.org/10.1038/s41598-025-00945-z

Keywords

This article is cited by

Subdomains of Post-COVID Syndrome (PCS) – a population-based study
- Sabrina Ballhausen-Lübcker
- Anne-Kathrin Ruß
- Thomas Bahmer
BMC Infectious Diseases (2025)

Subjects

Abstract

Similar content being viewed by others

Self-reported symptom severity, general health, and impairment in post-acute phases of COVID-19: retrospective cohort study of Swedish public employees

Cross-sectional study of health impairment related to post COVID-19 condition among participants of a large population-based cohort in Germany

A genome-wide association study for survival from a multi-centre European study identified variants associated with COVID-19 risk of death

Introduction

Materials and methods

The COVIDOM study

Study protocol

Phenotype definition

DNA extraction, SNP genotyping and quality control

Genotype imputation

Genome-wide association analysis (GWAS)

Combinatorial analysis

Results

Data quality control

Characteristics of COVIDOM GWAS cohort

GWAS

Original PCS score

Acute COVID-19 severity-specific PCS score (PCS-S)

Individual resilience-specific PCS score (PCS-R)

Individual resilience and acute COVID-19 severity

Combinatorial analysis

Discussion

PCS score

rs10893121: olfactory receptors OR4D5, OR6T1, OR10S1, OR10G4, OR10G7, OR10G8 and OR10G9

rs61739314: zinc fingers and homeoboxes 3 (ZHX3), lipin 3 (LPIN3), Elastin microfibril interfacer 3 (EMILIN3), chromodomain helicase DNA-binding protein 6 (CHD6)

PCS-S score

rs9792535 – NIMA related kinase 6 (NEK6), proteasome 20 S subunit beta 7 (PSMB7) and adhesion G protein-coupled receptor D2 (ADGRD2)

rs58659000, rs10465914 ﻿– MAST4, FBXO42 and SZRD1

PCS-R score

SLC7A2, PDGFRL, MYOCD, ARHGAP44, CHST11, TMEM200A and CADM2

Strengths and limitations

Conclusions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1 (download DOCX )

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Subdomains of Post-COVID Syndrome (PCS) – a population-based study

Search

Quick links

rs58659000, rs10465914 – MAST4, FBXO42 and SZRD1