Abstract
Effective prevention strategies for post-COVID complications are crucial for patients, clinicians, and policy makers to mitigate their cumulative burden. This study evaluated the association of modifiable lifestyle factors (smoking, alcohol intake, BMI, physical activity, sedentary time, sleep duration, and dietary habits) with COVID-19 multisystem sequelae, death, and hospitalization in the UK Biobank cohort (nā=ā68,896). A favorable lifestyle (6-10 healthy factors; 46.4%) was associated with a 36% lower risk of multisystem sequelae (HR, 0.64; 95% CI, 0.58-0.69; ARR at 210 days, 7.08%; 95% CI, 5.98-8.09) compared to an unfavorable lifestyle (0-4 factors; 12.3%). Risk reductions spanned all 10 organ systems, including cardiovascular, coagulation, metabolic, gastrointestinal, kidney, mental health, musculoskeletal, respiratory disorders, and fatigue. This beneficial effect was largely attributable to direct lifestyle impacts independent of corresponding pre-infection comorbidities (71% for any sequelae). A favorable lifestyle was also related to the risk of post-COVID death (HR 0.59, 0.52-0.66) and hospitalization (HR 0.78, 0.73-0.84). These associations persisted across acute and post-acute infection phases, irrespective of hospitalization status, vaccination, or SARS-CoV-2 variant. These findings underscore the clinical and public health importance of adhering to a healthy lifestyle in mitigating long-term COVID-19 adverse impacts and enhancing future pandemic preparedness.
Similar content being viewed by others
Introduction
COVID-19 cases and deaths have decreased globally, yet the long-term health consequences of SARS-CoV-2 infection, termed as post-COVID-19 conditions or long COVID, are still being managed as a global public health crisis1,2. These conditions or symptoms can involve pulmonary and multiple extrapulmonary organ systems, and may occur or extend beyond the acute infection of varying severity, with significant impact on daily functioning and quality of life3. Increased risk and burden of cardiovascular, pulmonary, neuropsychiatric, and metabolic disorders were reported during the 6 to 12 months following SARS-CoV-2 infection4,5, with persistent risk observed for several diseases up to 2 years6,7.
Despite long COVID has been characterized, however, evidence-based strategies for its prevention or treatment are not yet available8,9. Previous studies on its prevention have mainly focused on vaccination and pharmaceutical approaches, including antivirals (e.g., molnupiravir and nirmatrelvir) and other drugs repurposed for long COVID (e.g., metformin). Increasing evidence suggests that vaccination before infection and use of antivirals during acute phase in selected high-risk patients only partially mediate the risk of long COVID at 6 to 12 months following infection (by 15ā51% for vaccination10,11,12, by 26% for nirmatrelvir13, and by 14% for molnupiravir14). Several potential drugs for long COVID are still under investigation without yielding reliable results8,9. Evidence for the non-pharmaceutical management strategies is also lacking. Effective prevention and intervention strategies are needed to inform patients, clinicians and policy makers, and to reduce the cumulative burden of post-COVID conditions.
Modifiable lifestyle factors such as physical activity and healthy diet are potential targets for the prevention of major non-communicable diseases15,16,17, and are associated with lower risk of severe COVID-19 and related mortality18,19, possibly through protection against inflammation17,18, autoimmunity20,21, and clotting abnormality22,23. These mechanisms overlap with the hypothesized pathogenesis of long COVID, and other postviral conditions, such as chronic fatigue syndrome3. When examined individually, factors such as smoking and obesity, have been reported to be related to increased risk of post-COVID symptoms mainly in hospitalized patients12. Nevertheless, association between combinations of multiple lifestyle factors, which are known to interact synergistically24,25, and risk of COVID-19 sequelae across multiple organ systems remains unclear.
This major knowledge gap should be urgently addressed to inform the prevention and care strategies of long COVID. Based on a large-scale, prospective population-based cohort, we evaluated the relationship between composite healthy lifestyle (including 10 modifiable factors) that predated the pandemic and subsequent risk of COVID-19 sequelae in 10 organ systems, death and hospital admission, while considering the phase of infection (acute or post-acute), severity of infection (tested positive in community/outpatient setting vs inpatient setting), vaccination status (fully vaccinated vs unvaccinated or partially vaccinated), and variants (alpha [B.1.1.7], delta [B.1.617.2], vs omicron [B.1.1.529]), that differ in transmissibility, disease course, and disease severity26.
Results
Baseline characteristics
Out of 472,977 eligible UK Biobank participants, 68,896 participants with a positive SARS-CoV-2 test result between March 1, 2020 and March 1, 2022 were included in the current study. The demographic and health characteristics of the eligible participants, and participants with COVID-19 overall and by healthy lifestyle category are provided in TableĀ 1. Of the COVID-19 cohort, the mean (SD) age was 66.6 (8.4) years, 53.4% were male and 82.1% were White. For composite healthy lifestyle prior to the infection, 12.3% followed an unfavorable lifestyle, 41.3% followed an intermediate lifestyle, and 46.4% followed a favorable lifestyle. The median [IQR] number of healthy lifestyle factors participants engaged in was 7 [6ā8]. For prespecified COVID-19 sequalae, 5.5% and 7.8% had sequelae in at least one organ system during the acute and post-acute phase of infection, respectively.
Risk of multisystem sequelae
Overall, the risk of multisystem COVID-19 sequelae decreased monotonically across healthy lifestyle categories during both the acute and post-acute phases of infection. Compared with those with an unfavorable lifestyle, participants with an intermediate (HR, 0.80; 95% CI, 0.74ā0.87; ARR at 210 days, 3.89%; 95% CI, 2.56ā5.12) and favorable (HR, 0.64; 95% CI, 0.58ā0.69; ARR at 210 days, 7.08%; 95% CI, 5.98ā8.09) lifestyle were at significantly lower risk of multisystem sequelae of COVID-19 (Fig.Ā 1), with similar trends observed in both the acute and post-acute phases of COVID-19 (Fig.Ā 2). The number of healthy lifestyle factors (range, 0ā10) was associated with risk of sequelae in a dose-dependent manner (Fig.Ā 1).
Healthy lifestyle (composite or number) and risk of multisystem sequelae (composite or by organ systems), death, and hospitalization during the overall phase (0ā210 days) of SARS-CoV-2 infection. Adjusted HRs and 95% CI are presented for composite/individual multisystem sequelae, death, and hospitalization. Absolute risk reduction (ARR) per 100 persons at 210 days and 95% CIs were calculated. Solid squares represent HRs with the area inversely proportional to the variance of the log HR. Hollow square represents ARR. The horizontal lines indicate 95% CIs, with black line representing statistically significant results and gray line representing non-significant results. Intermediate lifestyle category are in orange, favorable lifestyle category in green.
Composite healthy lifestyle and risk of multisystem sequelae, death, and hospitalization during the acute phase (first 30 days) and post-acute (30ā210 days) phases of SARS-CoV-2 infection. Adjusted HRs and 95% CIs are presented for composite/individual multisystem sequelae, death, and hospitalization. Absolute risk reduction (ARR) per 100 persons at 30 days and 30ā210 days and 95% CIs were calculated. Solid square represents HRs with the area inversely proportional to the variance of the log HR. Hollow square represents ARR. The horizontal lines indicate 95% CIs, with black line representing statistically significant results and gray line representing non-significant results. Intermediate lifestyle category are in orange, favorable lifestyle category in green.
The inverse associations with multisystem sequelae were largely attributable to the direct protective effect of a healthy lifestyle (proportion of direct effect on any sequela: 71%), with proportion of direct effect ranging from 44% to 93% across organ systems (Fig.Ā 3a). Pre-infection medical conditions were associated with substantially increased risk of COVID-19 sequelae, particularly history of cardiovascular diseases, diabetes and mental disorders (Fig.Ā 3b). Number of participants with medical conditions between baseline and infection is provided in Supplementary TableĀ 4.
a Proportion of the direct and indirect effect of a healthy lifestyle on multisystem sequelae (intermediate/favorable vs unfavorable lifestyle). Direct associations were accounted for pre-infection medical conditions (mediator), identified as any relevant event recorded between baseline measurement and infection date. b Association of corresponding pre-infection medical conditions with the risk of sequelae following SARS-CoV-2 infection. Outcomes were ascertained 0ā210 days after SARS-CoV-2 infection. The horizontal bars indicate HR and lines indicate 95% CIs. The sample size was 68,892. 7975 incident events for any sequela, 354 for general fatigue, 923 for coagulation diseases, 1938 for neurologic diseases, 800 for pulmonary diseases, 2023 for Kidney diseases, 2064 for gastrointestinal diseases, 1739 for mental disorders, 1895 for musculoskeletal diseases, 2077 for cardiovascular diseases, and 2152 for diabetes.
For individual components of the healthy lifestyle, each of the 10 studied behavioral and dietary factors was associated with a lower or non-differential risk of sequelae, with smoking, physical activity, obesity, and sleep duration contributing most (Fig.Ā 4).
Blue square represents risk estimates from models fully adjusted for age, sex, education level, ethnicity, IMD, and mutually for all lifestyle factors. The purple square represents risk estimates from models partially adjusted for age, sex, education level, ethnicity, and IMD. The horizontal lines indicate 95% CIs, with black line representing statistically significant results and the gray line representing non-significant results. The sample sizes were 60,561 for any sequela (4792 events), 55,106 for hospitalization (6958 events), and 68,887 for death (1203 events).Ā The HRĀ for each lifestyle factor was calculated by comparing the healthy category withĀ theĀ unhealthy category (e.g., past or never smoker versusĀ current smoker).
Risk of death and hospitalization
Adherence to a healthy lifestyle was associated with lower risk of death and hospitalization following COVID-19 during both the acute and post-acute phases of infection. Compared with those with an unfavorable lifestyle, participants with a favorable lifestyle were at significantly lower risk of death (HR, 0.59; 95% CI, 0.52ā0.66; ARR at 210 days, 1.99%; 95% CI, 1.61ā2.32) and hospitalization (HR, 0.78; 95% CI, 0.73ā0.84; ARR at 210 days, 6.14%; 95% CI, 4.48ā7.68) following COVID-19 (Fig.Ā 1), with similar trend observed in both the acute and post-acute phases (Fig.Ā 2). Each of 10 lifestyle factors was associated with a lower or non-differential risk of death and hospitalization (Fig.Ā 4).
Risk of system-specific sequelae
Compared with those following an unfavorable lifestyle, participants with a favorable lifestyle had a significantly lower risk of sequelae in all 10 organ systems examined, including cardiovascular, coagulation and hematologic, metabolic and endocrine, gastrointestinal, kidney, mental health, musculoskeletal, neurologic, and respiratory disorders, as well as general symptoms of fatigue and malaise, with overall HRs ranging from 0.38 to 0.76 (Fig.Ā 1). The associations with intermediate lifestyle were consistently in the same protective direction across system-specific sequelae (Fig.Ā 1). Similar trend were observed in both the acute and post-acute phases (Fig.Ā 2).
Risk of outcomes by subgroups
The inverse associations between healthy lifestyle and risk of multisystem sequelae, death, and hospitalization held across the different subgroups of clinical interest, including those by age, sex, and ethnicity, vaccine status, test setting, and variants of infection (TableĀ 2). The reduced risk of outcomes was observed in participants who received two doses of vaccine (breakthrough infection) and those who were unvaccinated or partially 1-dose vaccinated (non-breakthrough infection). The reduced risk was evident in participants tested positive in inpatient setting and in those tested positive in community/outpatient settings. The reduced risk was consistently observed across predominant variants of SARS-CoV-2 infection during study period, including wildtype, Alpha, Delta, and Omicron BA.1. Notably, a composite healthy lifestyle was associated with decreased risk of outcomes following infection of Omicron variant, which remains currently dominant variant worldwide. No significant interaction was observed in any of the subgroups across outcomes, except for age, ethnicity, and vaccination. The observed association between a favorable lifestyle and a reduced risk of outcomes was more evident for people aged <ā65, in white subjects, and in those fully vaccinated (for mortality only).
Sensitivity analyses
A similar pattern of associations was observed in multiple sensitivity analyses, including assigning weight to individual sequela and using zero-inflated poisson regression to estimate the association, excluding participants with history of related outcomes in the past two years rather than one year, defining post-acute outcomes 90 days after infection rather than 30 days, and restricting the identification of outcomes to the first three ICD diagnoses (Supplementary TableĀ 5). Stronger associations were identified after accounting for potential misclassification of lifestyle factors, suggesting that the observed associations may have been underestimated (Supplementary TableĀ 5).
Risk of outcomes in the uninfected group
The associations between healthy lifestyle and risk of three predefined main outcomes were largely similar among participants with SARS-CoV-2 infection and those with no evidence of infection during the overall or 30ā210 days of follow-up (Fig.Ā 5 and Supplementary TableĀ 6). However, the inverse associations between healthy lifestyle and long COVID-associated complications and hospitalization were more evident during 0-30 days in the infected group (acute phase) compared to those in the uninfected group; whereas the association with death was stronger in the uninfected group (Supplementary TableĀ 6).
a Participants with SARS-CoV-2 infection. b Participants with no evidence of SARS-CoV-2 infection. Outcomes were ascertained 0ā210 days after SARS-CoV-2 infection. Event rates are presented for the unfavorable lifestyle category (red), the intermediate lifestyle category (orange), and the favorable lifestyle category (green). The shadow of cumulative incidence curves represents 95% CIs.
Discussion
Based on a large, prospective population-based cohort, this study provides a comprehensive assessment of the health effects of multiple lifestyle factors on a systematic range of disease outcomes following COVID-19. Adherence to a healthy lifestyle prior to infection was associated with significantly lower risk of COVID-19 multisystem sequelae, death, and hospital admission, during both the acute and post-acute phases of SARS-CoV-2 infection. The reduced risk was evident across 10 prespecified organ systems, including cardiovascular, coagulation and hematologic, metabolic and endocrine, gastrointestinal, kidney, mental health, musculoskeletal, neurologic, and respiratory disorders, as well as general symptoms of fatigue and malaise. The reduced risk of multisystem sequelae associated with a healthy lifestyle was consistently observed across participants, regardless of their vaccination status (unvaccinated/partially vaccinated or fully vaccinated), disease severity (testing positive in community/outpatient settings or inpatient settings), and major SARS-CoV-2 variants, including Omicron variants, whose subvariants are currently dominant. Moreover, the benefits of healthy lifestyle on sequelae were largely independent of pathways related to pre-existing relevant disease conditions. Overall, the findings suggest that adherence to a healthy lifestyle prior to infection was consistently and directly associated with reduced risk of adverse health outcomes following COVID-19.
We found that a favorable lifestyle, in comparison with an unhealthy one, was associated with a 36% lower risk of multisystem sequelae, a 41% lower risk of death, and a 22% lower risk of hospitalization, which corresponded to an absolute risk reduction of 7.08, 1.99, and 6.14 fewer cases per 100 people at 210 days after infection. This association was even larger than those observed in previous studies of pharmaceutical interventions in non-hospitalized patients, which reported a 14% risk reduction in post-acute sequelae at 180 days for vaccination before infection, and 14% and 26% risk reductions at 180 days for the use of molnupiravir and nirmatrelvir during acute phase of infection, respectively10,13,14. It is important to note that participants with breakthrough infection were still at risk of sequelae compared with those without infection10. In addition, only selected patients at risk of progression to severe COVID-19 are qualified for antivirals during the acute infection13,14, and their benefit-risk profile in wider population with milder infection, or when used during the post-acute stage, remains unclear. These previous findings highlighted the restricted scope of currently available therapies and limited efficacy of vaccination in preventing long COVID10,13,14,27. Our results are consistent with a cross-sectional study of 1981 women suggesting an inverse association between composite healthy lifestyle (mainly driven by BMI and sleep duration) and self-reported symptoms following infection of non-Omicron variants28. However, outcomes purely based on self-report symptoms are less clinically relevant and the inclusion of only women may limit the generalization of findings to other populations and settings.
The mechanisms underlying the benefit of adhering to a healthy lifestyle for the alleviation of sequelae are likely multifaceted. Previous research has established causal links between several individual lifestyle factors, such as smoking, obesity, and physical inactivity, and increased susceptibility and severity in relation to COVID-1929. Smoking and high BMI were also risk factors for long COVID symptoms mainly in hospitalized patients12. Indeed, we observed that these factors and additionally sleep duration, and sedentary behavior were significant contributors to the reduced risk of sequelae. Also, we found that the inverse associations with the risk of long COVID-related complications and hospitalization were more pronounced in the infected group during the first 30 days than in the uninfected group, suggesting the particular benefits of a healthy lifestyle in preventing acute outcomes that could be directly caused by viral infection. However, these pathways are unlikely to fully explain the protective effects on post-infection adverse outcomes conferred by a healthy lifestyle. In our study, all participants had a confirmed infection, and the protective associations persisted even among those who were hospitalized. In addition, it has been suggested that individuals with an unhealthy lifestyle are more likely to have prevalent chronic conditions, such as cardiovascular diseases and diabetesāwhich are strong risk factors for severe COVID-19āand are therefore more vulnerable to post-acute complications. Through mediation analysis, our study supported this hypothesis and, for the first time, further demonstrated that the healthy lifestyleās direct protection accounts for the majority of the overall associations with COVID-19 sequelae. Notably, varying proportions of indirect effects from healthy lifestyle were observed depending on the specific sequela of interest. For example, pre-existing diabetes had the strongest association with post-infection diabetes sequela, therefore lifestyle is more likely to confer its protection indirectly through this pathway. In contrast, thrombotic events are generally more acute and transient, making it less likely for healthy lifestyle to confer indirect protection. Biologically, the overlapping mechanisms between unhealthy lifestyle and viral infection and post-infection conditions may also be involved. Favorable lifestyle factors, such as physical activity and healthy diet, confer health benefits including protection against inflammation17,18, autoimmunity21,30, and clotting abnormality23, which are implicated in the potential pathogenesis of long COVID3.
Although our findings align with previous evidence on the broader benefits of healthy lifestyle on chronic disease prevention and life expectancy15,16,17, this potentially beneficial effect should not be interpreted as changing behaviors around the time of acute infection or during post-acute infection. The current practical guide recommends that patients without long COVID should gradually and safely return to pre-infection physical activity when appropriate, although direct evidence is lacking31. Previous study also characterized long COVID as a multifactorial condition determined by pathogen, host response, external pandemic-associated factors, and supported a multidisciplinary treatment including both pharmacological and rehabilitation approaches, but also social and welfare support to promote healthy lifestyle habits32,33. Future research is warranted to assess the effect of composite lifestyle interventions in prevention of long COVID or alleviating associated symptoms among patients with long COVID. Adherence to a healthy lifestyle, in combination with vaccination and, if necessary, potential medications, may be a viable and practical approach to further reduce the long-term health consequences of SARS-CoV-2 infection. These strategies hold significant public health and scientific importance in mitigating the overall burden of post-COVID complications and enhancing preparedness for future pandemics.
This study has several limitations. First, the UK Biobank participants are likely to be older than the general population of the UK and are mostly of European ancestry, which may limit the generalizability of study findings to the younger population and other ethnic groups. Second, the majority of participants (87%) were classified as intermediate or favorable lifestyle category, suggesting that the study population appears to be healthier than the wider general population. However, the exact distribution of lifestyle categories based on the similar 10 modifiable factors in another population remains unknown. A previous study in Australia (nā=ā231,048)34 reported 68% of participants had ā¤ā1 unhealthy lifestyle factor out of 7 factors assessed (smoking, alcohol consumption, physical activity, sitting time, sleeping duration, and diet), which partly in line with our results. Assuming participants are healthier than the UK general population, the absolute risk estimates such as ARR should be interpreted with caution. Nevertheless, the relative associations of risk factors with disease outcomes in the UK Biobank were tested to be generalizable and comparable to those from other representative cohort of general population35,36. In addition, this high proportion could also suggest potential misreporting bias in self-reported lifestyle data. However, healthy reporting bias may be more common in socioeconomically deprived individuals37 and tends to bias any genuine association towards null. Self-reported lifestyle data, such as sleep duration38 and physical activity39, have been shown to be highly correlated with accelerometer-derived measures in UK Biobank. Third, residual confounding and reverse causality cannot be ruled out in this observational study. Fourth, we assumed the baseline lifestyle unchanged over years until the time of infection, which may be subject to exposure misclassification and underestimated any genuine associations. However, reassuringly, there was high consistency of lifestyle measures between cohort baseline and repetitive interviews after a median of 8 years, and consistent associations were observed after accounting for potential changes in lifestyle factors. Fifth, as sequelae outcomes were based on inpatient records, milder long COVID symptoms were less likely to be detected. Sixth, given the potential non-linear effects of lifestyle factors, such as alcohol consumption, caution is warranted when interpreting associations between binarized lifestyle factors and outcomes. The health effects and recommended targets of several individual lifestyle factors, such as alcohol consumption and red meat intake, are inconsistent in previous epidemiological studies and guidelines40,41, and may potentially vary by disease outcome of interest. Findings on the association of such individual factors with post-infection complications should be considered in the wider context of chronic disease, whether directly related to infection or not. Seventh, itās important to acknowledge that some participants classified as uninfected may have had undiagnosed or untested COVID-19. However, by linking participants to official national databases for COVID-19 testing and hospitalization, the likelihood of misclassifying infected and uninfected participants was minimized. Finally, despite we included a range of sequelae across organ systems, it is difficult to link these outcomes directly to the infection, especially given the lack of consensus standard for diagnosis of long COVID. This limitation is applied to all related studies in the field42,43,44,45,46,47,48,49,50. Nevertheless, the sequelae prespecified were most relevant to long COVID based on prior evidence, with increased risk and burden consistently reported beyond the acute phase in the currently largest electronic dataset42,43,44,45,46,47,48, the same UK Biobank50, and other nationwide cohorts49,50.
Adherence to a healthy lifestyle that predated the pandemic was associated with substantially lower risk of sequelae across organ systems, death, and hospitalization following COVID-19, regardless of phases of infection, vaccination status, test setting, and SARS-CoV-2 variants, and independent of relevant comorbidities. These findings suggest the benefit of population adhering to a healthy lifestyle to reduce the potential long-term adverse health consequences of COVID-19.
Methods
Data sources and study cohorts
UK Biobank is a large-scale population-based prospective cohort study with deep phenotyping and genomic data, as detailed elsewhere51. Briefly, between 2006 and 2010, over 500,000 individuals aged 40ā69 years were recruited from 22 assessment centers across the United Kingdom at baseline, with collection of socio-demographic, lifestyle and health-related factors, a range of physical measures, and blood samples51. Follow-up information is obtained by linking health and medical records, including national primary and secondary care, disease and mortality registries52, with validated reliability, accuracy and completeness53. To identify cases of SARS-CoV-2 infection, polymerase chain reaction (PCR)-based test results were obtained by linking all participants to the Public Health Englandās Second Generation Surveillance System, with dates of specimen collection and healthcare settings of testing54. Outbreak dynamics were validated to be broadly similar between UK Biobank participants and the general population of England54.
In this study, we included participants who were alive by March 1, 2020 and had a positive SARS-CoV-2 PCR test result between March 1, 2020 (date of the first recorded case in the UK Biobank), and March 1, 2022, with the date of first infection considered as index date (T0). For those diagnosed with COVID-19 in hospital, we defined T0 as the date of hospital admission minus a random number of 7 days. The major prevalent variants during the study period included wildtype, Alpha (B.1.1.7), Delta (B.1.617.2), and Omicron (B.1.1.529 BA.1). The calendar periods of dominant variants in the UK were based on pandemic data from the Office for National Statistics26. Participants with missing data on study exposures at baseline were excluded. We addressed missing data on covariates using the following approaches: (1) participants with missing values in age and sex (ā<ā0.1%) were excluded. (2) participants with missing values in ethnicity were classified as āother ethnic groupsā. (3) participants with missing values in education level (0.9%) were classified as ācategory Iā, which includes ānone of the aboveā and āprefer not to answerā. (4) missing values in IMD (13.8%) were imputed with the mean value of the entire UK Biobank cohort. All participants included in this study provided written informed consent at recruitment. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines and received ethical approval from the UKBB ethics advisory committee. Study design, cohort construction, and timeline are provided in Supplementary Fig.Ā 1. All participants provided written informed consent at the UK Biobank cohort recruitment. This study received ethical approval from UK Biobank Ethics Advisory Committee (EAC) and was performed under the application of 65397.
Lifestyle factors
Ten prespecified potentially modifiable lifestyle factors were assessed, including smoking, alcohol consumption, body mass index (BMI), physical activity, sedentary time, sleep duration, intake of fruit and vegetable, intake of oily fish, intake of red meat, and intake of processed meat. Selection and categorization of lifestyle factors was based on literature review, previous knowledge, and UK national health service guidelines55,56. Multiple lifestyle factors were measured by validated questionnaire for all participants at baseline recruitment. Detailed definitions on measurement and classification of lifestyle factors are provided in Supplementary TableĀ 1. Briefly, healthy lifestyle components including past or never smoker, moderate alcohol intake (āā¤ā4 times week), BMIā<ā30ākg/m2, at least 150āmin of moderate or 75āmin of vigorous physical activity per week, less sedentary time (ā<ā4āh per day), healthy sleep duration (7ā9āh per day), adequate intake of fruit and vegetables (āā„ā400āg/day), adequate oily fish intake (āā„ā1 portion/week), moderate intake of red meat (āā¤ā4 portion week) and processed meat (āā¤ā4 portion week) were defined, in accordance with previous evidence or UK national health service guidelines55,56.
A binary variable was created for each of the 10 factors, with 1 point assigned for those meeting the healthy criteria and 0 otherwise. A composite lifestyle score was then calculated for each participant by summing the total number of healthy lifestyle factors, ranging from 0 to 10. Based on the composite score, participants were classified into three lifestyle categories: unfavorable (0ā5), intermediate (6ā7), and favorable (8ā10). The lifestyle score was also used as a continuous variable of number of healthy lifestyle factors. Similar methods of defining lifestyle score have been used in the same UK Biobank cohort57 as well as external cohorts16,28. Distributions of lifestyle score and categories are provided in Supplementary TableĀ 2.
The median [IQR] duration between baseline assessment of lifestyle factors and the date of infection was 12.5 [11.8ā13.3] years. Part of participants took part in up to two further touchscreen interviews with lifestyle and health-related factors similarly measured. There were generally stable responses to lifestyle factors between baseline assessment and the latest repeat assessment (median time difference from baseline, 8 years) as shown in Supplementary Fig.Ā 2. 34.9% of participants with an unfavorable lifestyle, 48.6% with an intermediate lifestyle, and 73.7% with a favorable lifestyle at baseline remained in the same corresponding lifestyle category at the latest repeat assessment following a median of 8 years. Overall, the proportion of stable lifestyle categories is 60.6%.
Outcomes
The outcomes after COVID-19 were prespecified, including a set of multisystem sequelae, death, and hospital admission following the SARS-CoV-2 infection. The multisystem sequelae were selected and defined based on previous evidence of the long COVID, including 75 systemic diseases or symptoms in 10 organ systems: cardiovascular46, coagulation and hematologic46, metabolic and endocrine44, gastrointestinal48, kidney43, mental health45, musculoskeletal47, neurologic47, and respiratory disorders10,13,14, and general symptoms of fatigue and malaise3,4,42,49. Detailed definitions of multisystem sequelae are listed in the Supplementary TableĀ 3. Outcomes were identified as follows: individual sequela from the hospital inpatient ICD-10 (International Classification of Diseases 10th Revision) diagnosis codes, deaths from the records of national death registry, and hospital admission from hospital inpatient data from the Hospital Episode Statistics. Incident outcomes were assessed in participants with no history of the related outcome within one year before the date of the first infection.
As SARS-CoV-2 infection has been associated with both multisystem manifestations during its acute phase and with sequelae during its post-acute phase7,49, we conducted analyses stratified by phase of infection. We reported risk of each outcome during the acute phase (T0 to T0ā+ā30d), post-acute phase (T0ā+ā30d to T0ā+ā210d), and overall period following infection (T0 to T0ā+ā210d) to reflect the full spectrum of post-COVID conditions. The end of follow-up for the overall cohort was September 30, 2022, with the maximum follow-up period censored to 210 days.
Covariates
We prespecified a list of covariates for adjustment or stratification based on literature review and prior knowledge: socio-demographic characteristics including age, sex, education level (mapped to the international standard for classification of education), index of multiple deprivation (IMD, a summary measure of crime, education, employment, health, housing, income, and living environment)58, and race and ethnicity; and infection related factors including healthcare settings of the testing (community/outpatient vs inpatient setting as proxy of severity of infection), COVID-19 vaccination status, and SARS-CoV-2 variants.
Statistical analysis
Baseline characteristics of the overall cohort of participants with SARS-CoV-2 infection and by composite healthy lifestyle categories were reported as mean and standard deviation or frequency and percentage, when appropriate. Multivariable cox proportional hazard (PH) model was used to assess the association between composite healthy lifestyle and risk of multisystem sequelae (composite or by organ systems), death, and hospital admission, with adjustment for age, sex, ethnicity, education level, and IMD. PH assumption across lifestyle categories was tested by Schoenfeld residuals with no violations observed for outcomes. Hazard ratio (HR) and absolute risk reduction (ARR, difference in incidence rate between lifestyle groups per 100 persons during the corresponding follow-up period) were estimated from the Cox model. We also assessed the association between individual lifestyle factor instead of composite categories (each component as a categorical variable with or without mutual adjustment for others, or the number of factors as continuous variables) and risk of outcomes.
We conducted causal mediation analysis59,60 to quantify the extent to which the habitual healthy lifestyle may affect COVID-19 sequelae through the potential pathway of relevant pre-infection medical conditions (mediator), with the proportion of direct and indirect effects estimated by quasi-Bayesian Monte Carlo methods with 1000 simulations for each. Detailed modeling procedures and a directed acyclic graph are provided in Supplementary Methods.
We examined the association between composite healthy lifestyle and the overall risk of multisystem sequelae in prespecified clinical subgroups by demographic and infection-related factors. The demographic factors included age (āā¤ā65 and >65 years), sex (male and female), and ethnicity (White and other ethnic groups). As the risk profile of COVID sequelae was related to vaccination and severity of infection, and may change with the evolving pandemic, infection-related factors including vaccine status (no or one-dose partial vaccination and two-dose full vaccination), test setting (inpatient and outpatient or community), dominant variants during the study period (wildtype, Alpha, Delta, and Omicron BA.1) were assessed. Multiplicative interactions between the composite healthy lifestyle and the stratification variables were tested, with P-value reported.
We conducted multiple sensitivity analyses to assess the robustness of primary findings. First, to reflect the multisystem and potentially comorbid nature of COVID sequelae, accounting for both the number of sequelae by an individual and the relative health impact of each sequela. Weights based on Global Burden of Disease study data and methodologies for general diseases and long COVID were assigned to each sequela (Supplementary TableĀ 1)61,62. The weighted score was calculated for each participant by summing the weights of all incident sequelae during the follow-up period. Zero inflated Poisson regression was then used to calculate relative risk (RR), with follow-up time set as the offset of the model and adjustment for covariates. Second, to further account for potential reverse causality and more accurately define incident cases, extending the washout period for outcomes from one year to two years. Third, defining events of post-acute sequelae 90 days after infection (follow-up period T0ā+ā90d to T0ā+ā210d), instead of 30 days in the main analyses. The adjustment was made as there is no uniform definition for long COVID, which is currently described as conditions occurring 30ā90 days after infection in existing guidelines27. Fourth, restricting the identification of outcomes to the first three ICD diagnoses, which are the main causes for each hospital admission. Fifth, reconstructing a composite lifestyle index without BMI and assessed its association with outcomes. Finally, we conducted quantitative sensitivity analysis to adjust for changes in lifestyle factors over time since the baseline assessment. We used odds ratios to quantify associations and assumed a sensitivity and specificity of 90% for each lifestyle component (Supplementary Methods).
As a healthy lifestyle is associated with a lower risk of chronic diseases and mortality among the general population predated pandemic, we conduct exploratory analysis to compare the effects of healthy lifestyle on adverse outcomes following COVID-19 with the effects among participants without infection. A random index date was assigned to the participants without infection based on the distribution of T0 among those with infection, and we repeated the main analyses with the maximum follow-up period censored to 210 days.
Statistical significance was determined by a 95% confidence interval (CI) that excluded 1 for ratios and 0 for rate differences. All analyses and data visualizations were conducted using R statistical software (version 4.2.2).
Reporting summary
Further information on research design is available in theĀ Nature Portfolio Reporting Summary linked to this article.
Data availability
The data that support the findings of this study are available from the UK Biobank. Researchers can apply to use the UK Biobank dataset by registering and applying at https://ukbiobank.ac.uk/register-apply/. The aggregated data supporting the findings of this study are available within the paper and its supplementary information files.
Code availability
Analysis code used for this study is available atĀ https://github.com/xjq8065524/Lifestyle_Long_COVD_prevention.git.
References
Parotto, M. et al. Post-acute sequelae of COVID-19: understanding and addressing the burden of multisystem manifestations. Lancet Respir. Med 11, 739ā754 (2023).
Munblit, D. et al. A core outcome set for post-COVID-19 condition in adults for use in clinical practice and research: an international Delphi consensus study. Lancet Respir. Med 10, 715ā724 (2022).
Davis, H. E., McCorkell, L., Vogel, J. M. & Topol, E. J. Long COVID: major findings, mechanisms and recommendations. Nat. Rev. Microbiol. 21, 133ā146 (2023).
Al-Aly, Z., Agarwal, A., Alwan, N. & Luyckx, V. A. Long COVID: long-term health outcomes and implications for policy and research. Nat. Rev. Nephrol. 19, 1ā2 (2023).
Wang, Y., Su, B., Xie, J., Garcia-Rizo, C. & Prieto-Alhambra, D. Long-term risk of psychiatric disorder and psychotropic prescription after SARS-CoV-2 infection among UK general population. Nat. Hum. Behav. 8, 1076ā1087(2024).
Bowe, B., Xie, Y. & Al-Aly, Z. Postacute sequelae of COVID-19 at 2 years. Nature Medicine, 1-11 (2023).
Ballouz, T. et al. Recovery and symptom trajectories up to two years after SARS-CoV-2 infection: population based, longitudinal cohort study. BMJ 381, e074425 (2023).
Ledford, H. Long-COVID treatments: why the world is still waiting. Nature 608, 258ā260 (2022).
Shaffer, L. Lots of long COVID treatment leads, but few are proven. Proc. Natl Acad. Sci. USA 119, e2213524119 (2022).
Al-Aly, Z., Bowe, B. & Xie, Y. Long COVID after breakthrough SARS-CoV-2 infection. Nat. Med. 28, 1461ā1467 (2022).
CatalĆ , M. et al. The effectiveness of COVID-19 vaccines to prevent long COVID symptoms: staggered cohort study of data from the UK, Spain, and Estonia. Lancet Respir. Med. 12, 225ā236 (2024).
Tsampasian, V. et al. Risk factors associated with Postā COVID-19 condition: a systematic review and meta-analysis. JAMA Intern. Med. 183, 566ā580 (2023).
Xie, Y., Choi, T. & Al-Aly, Z. Association of treatment with nirmatrelvir and the risk of postāCOVID-19 condition. JAMA Intern. Med. 183, 554ā564 (2023).
Xie, Y., Choi, T. & Al-Aly, Z. Molnupiravir and risk of post-acute sequelae of covid-19: cohort study. BMJ 381, e074572 (2023).
Patnode, C. D., Redmond, N., Iacocca, M. O. & Henninger, M. Behavioral counseling interventions to promote a healthy diet and physical activity for cardiovascular disease prevention in adults without known cardiovascular disease risk factors: updated evidence report and systematic review for the us preventive services task force. Jama 328, 375ā388 (2022).
Nyberg, S. T. et al. Association of healthy lifestyle with years lived without major chronic diseases. JAMA Intern. Med. 180, 760ā768 (2020).
Firth, J. et al. A metaāreview of ālifestyle psychiatryā: the role of exercise, smoking, diet and sleep in the prevention and treatment of mental disorders. World Psychiatry 19, 360ā380 (2020).
Hamer, M., KivimƤki, M., Gale, C. R. & Batty, G. D. Lifestyle risk factors, inflammatory mechanisms, and COVID-19 hospitalization: A community-based cohort study of 387,109 adults in UK. Brain Behav. Immun. 87, 184ā187 (2020).
Ahmadi, M. N., Huang, B.-H., Inan-Eroglu, E., Hamer, M. & Stamatakis, E. Lifestyle risk factors and infectious disease mortality, including COVID-19, among middle aged and older adults: Evidence from a community-based cohort study in the United Kingdom. Brain Behav. Immun. 96, 18ā27 (2021).
Perricone, C. et al. Smoke and autoimmunity: The fire behind the disease. Autoimmun. Rev. 15, 354ā374 (2016).
Sharif, K. et al. Physical activity and autoimmune diseases: Get moving and manage the disease. Autoimmun. Rev. 17, 53ā72 (2018).
Ghouse, J. et al. Genome-wide meta-analysis identifies 93 risk loci and enables risk prediction equivalent to monogenic forms of venous thromboembolism. Nat. Genet. 55, 399ā409 (2023).
Gregson, J. et al. Cardiovascular risk factors associated with venous thromboembolism. JAMA Cardiol. 4, 163ā173 (2019).
Loef, M. & Walach, H. The combined effects of healthy lifestyle behaviors on all cause mortality: a systematic review and meta-analysis. Prev. Med. 55, 163ā170 (2012).
Xie, J. et al. Genetic risk, adherence to healthy lifestyle and acute cardiovascular and thromboembolic complications following SARS-COV-2 infection. Nat. Commun. 14, 4659 (2023).
Office for National Statistics. Coronavirus (COVID-19) latest insights: Infections, <https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/articles/coronaviruscovid19latestinsights/infections>
Greenhalgh, T., Sivan, M., Delaney, B., Evans, R. & Milne, R. Long covidāan update for primary care. BMJ 378, e072117 (2022).
Wang, S. et al. Adherence to healthy lifestyle prior to infection and risk of postāCOVID-19 condition. JAMA Intern. Med. 183, 232ā241 (2023).
Luo, S., Liang, Y., Wong, T. H. T., Schooling, C. M. & Au Yeung, S. L. Identifying factors contributing to increased susceptibility to COVID-19 risk: a systematic review of Mendelian randomization studies. Int J. Epidemiol. 51, 1088ā1105 (2022).
Tsampasian, V. et al. Cardiovascular disease as part of Long COVID: A systematic review. Eur. J. Prev. Cardiol. https://doi.org/10.1093/eurjpc/zwae070 (2024).
Salman, D. et al. Returning to physical activity after covid-19. BMJ 372, m4721 (2021).
Florencio, L. L. & FernĆ”ndez-de-Las-PeƱas, C. Long COVID: systemic inflammation and obesity as therapeutic targets. Lancet Respir. Med. 10, 726ā727 (2022).
Clinical characteristics with inflammation profiling of long COVID and association with 1-year recovery following hospitalisation in the UK: a prospective observational study. Lancet. Respir. Med. 10, 761-775 (2022).
Ding, D., Rogers, K., van der Ploeg, H., Stamatakis, E. & Bauman, A. E. Traditional and Emerging Lifestyle Risk Behaviors and All-Cause Mortality in Middle-Aged and Older Adults: Evidence from a Large Population-Based Australian Cohort. PLoS Med. 12, e1001917 (2015).
Batty, G. D., Gale, C. R., KivimƤki, M., Deary, I. J. & Bell, S. Comparison of risk factor associations in UK Biobank against representative, general population based studies with conventional response rates: prospective cohort study and individual participant meta-analysis. BMJ 368, m131 (2020).
Allen, N. E. et al. Prospective study design and data analysis in UK Biobank. Sci. Transl. Med. 16, eadf4428 (2024).
Sabia, S. et al. Association between questionnaire- and accelerometer-assessed physical activity: the role of sociodemographic factors. Am. J. Epidemiol. 179, 781ā790 (2014).
Dashti, H. S. et al. Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nat. Commun. 10, 1100 (2019).
Doherty, A. et al. Large Scale Population Assessment of Physical Activity Using Wrist Worn Accelerometers: The UK Biobank Study. PLoS One 12, e0169649 (2017).
Bryazka, D. et al. Population-level risks of alcohol consumption by amount, geography, age, sex, and year: a systematic analysis for the Global Burden of Disease Study 2020. Lancet 400, 185ā235 (2022).
Lescinsky, H. et al. Health effects associated with consumption of unprocessed red meat: a Burden of Proof study. Nat. Med. 28, 2075ā2082 (2022).
Al-Aly, Z., Xie, Y. & Bowe, B. High-dimensional characterization of post-acute sequelae of COVID-19. Nature 594, 259ā264 (2021).
Bowe, B., Xie, Y., Xu, E. & Al-Aly, Z. Kidney outcomes in long COVID. J. Am. Soc. Nephrol. 32, 2851ā2862 (2021).
Xie, Y. & Al-Aly, Z. Risks and burdens of incident diabetes in long COVID: a cohort study. The lancet. Diab. Endocrinol. 10, 311ā321 (2022).
Xie, Y., Xu, E. & Al-Aly, Z. Risks of mental health outcomes in people with covid-19: cohort study. BMJ 376, e068993 (2022).
Xie, Y., Xu, E., Bowe, B. & Al-Aly, Z. Long-term cardiovascular outcomes of COVID-19. Nat. Med. 28, 583ā590 (2022).
Xu, E., Xie, Y. & Al-Aly, Z. Long-term neurologic outcomes of COVID-19. Nat. Med. 28, 2406ā2415 (2022).
Xu, E., Xie, Y. & Al-Aly, Z. Long-term gastrointestinal outcomes of COVID-19. Nat. Commun. 14, 983 (2023).
Daugherty, S. E. et al. Risk of clinical sequelae after the acute phase of SARS-CoV-2 infection: retrospective cohort study. BMJ 373, n1098 (2021).
Lam, I. C. H. et al. Long-term post-acute sequelae of COVID-19 infection: a retrospective, multi-database cohort study in Hong Kong and the UK. EClinicalMedicine 60, 102000 (2023).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203ā209 (2018).
Denaxas, S. et al. UK phenomics platform for developing and validating electronic health record phenotypes: CALIBER. J. Am. Med. Inform. Assoc. 26, 1545ā1559 (2019).
Kuan, V. et al. A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service. Lancet Digit Health 1, e63āe77 (2019).
Armstrong, J. et al. Dynamic linkage of COVID-19 test results between Public Health Englandās second generation surveillance system and UK Biobank. Microb. Genom. 6, mgen000397 (2020).
NHS. NHS advice about healthy living, including eating a balanced diet, healthy weight, exercise, quitting smoking and drinking less alcohol., <https://www.nhs.uk/live-well/> (2023).
Office for Health Improvement & Disparities. Healthy eating: applying All Our Health, <https://www.gov.uk/government/publications/healthy-eating-applying-all-our-health/healthy-eating-applying-all-our-health> (2023).
Foster, H. M. et al. The effect of socioeconomic deprivation on the association between an extended measurement of unhealthy lifestyle factors and health outcomes: a prospective analysis of the UK Biobank cohort. Lancet Public Health 3, e576āe585 (2018).
Smith, T. et al. The English indices of deprivation 2015. London: Department for Communities and Local Government, 1-94 (2015).
Cashin, A. G., McAuley, J. H., VanderWeele, T. J. & Lee, H. Understanding how health interventions or exposures produce their effects using mediation analysis. BMJ 382, e071757 (2023).
Lee, H. et al. A Guideline for Reporting Mediation Analyses of Randomized Trials and Observational Studies: The AGReMA Statement. JAMA 326, 1045ā1056, (2021).
Vos, T. et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990ā2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 396, 1204ā1222 (2020).
Hanson, S. W. et al. Estimated global proportions of individuals with persistent fatigue, cognitive, and respiratory symptom clusters following symptomatic COVID-19 in 2020 and 2021. Jama 328, 1604ā1615 (2022).
Acknowledgements
This study was based on data from UK Biobank. Dr Wang is funded through the Clarendon Fund Scholarship. Dr Xie is funded through Jardine-Oxford Graduate Scholarship and a titular Clarendon Fund Scholarship. The research was partially supported by the Oxford National Institute for Health and Care Research (NIHR) Biomedical Research Centre. Dr Prieto-Alhambra is funded through an NIHR Senior Research Fellowship (grant SRF-2018-11-ST2-004). We thank Dr Hao Chen for reviewing the revised manuscript. The funding organizations had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. The views expressed in this publication are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.
Author information
Authors and Affiliations
Contributions
Y.W. and J.X. had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analyses. Y.W., J.X. and D.P.-A. conceptualized and designed the study, and acquired, analyzed or interpreted data. Y.W. drafted the manuscript. B.S, M.A.-H., N.L.-B, Y.T, C.L., N.J.-W., and R.P., participated in the discussions and interpretation of the results. All authors critically revised the manuscript for important intellectual content. J.X. conducted statistical analysis. D.P.-A. obtained funding, provided administrative, technical or material support, and supervised the project.
Corresponding author
Ethics declarations
Competing interests
D.P.-A. reported grants from Amgen, UCB Biopharma, Les Laboratoires Servier, Novartis, and Chiesi-Taylor, as well as speaker fees and advisory board membership with AstraZeneca and Johnson and Johnson outside the submitted work, in addition to research support from Janssen. R.P. has participated in advisory boards for Gilead, MSD, ViiV Healthcare, Theratechnologies and Lilly. His institution has received research support from Gilead, MSD, and ViiV Healthcare. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks George Ploubidis, Vassilios Vassiliou and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisherās note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, Y., Su, B., Alcalde-Herraiz, M. et al. Modifiable lifestyle factors and the risk of post-COVID-19 multisystem sequelae, hospitalization, and death. Nat Commun 15, 6363 (2024). https://doi.org/10.1038/s41467-024-50495-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-50495-7
This article is cited by
-
Sociodemographic factors, biomarkers and comorbidities associated with post-acute COVID-19 sequelae in UK Biobank
Nature Communications (2025)