Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), responsible for causing coronavirus disease 2019 (COVID-19), has directly or indirectly affected the global population and caused a total of 2.9 million deaths to date. In the early stages of the pandemic, research and health services have predominantly focused on the immediate direct (through infection) and indirect (changes in behaviours and effect on health systems) effects of the virus1.

While COVID-19 predominantly manifests as respiratory disease, there is increasing evidence of other organ involvement, including the heart2. A prospective cohort study of 201 young low-risk individuals showed that almost 70% of participants had ongoing impairment of multiple organ systems3. In Wuhan, a sample of 1733 patients were followed up after discharge and reported mainly muscle fatigue and lethargy4.

Viral load measurements from nasopharyngeal swabs indicate active viral replication5,6. Previous studies indicated that higher SARS-CoV-2 viral loads at presentation were independently associated with in-hospital mortality, or intubation5,6,7,8,9. Furthermore, a higher viral load is associated with an increased risk of post-COVID-19 syndrome10. A previous systematic review found that respiratory virus real-time polymerase chain reaction cycle threshold (ct) values are associated with disease severity and outcomes11. However, few studies investigated the association between SARS-CoV-2 viral loads at presentation and long-term outcomes or hospitalization in mild populations. Therefore, we studied the association of SARS-CoV-2 viral load with long-term outcomes in patients with COVID-19 in Hong Kong.

Methods

Data source

COVID-19 patients were identified using the Clinical Data Analysis and Reporting System (CDARS). CDARS is an electronic healthcare database developed by the Hospital Authority. The Hospital Authority is a statutory body that manages all public hospitals and public clinics in Hong Kong, covering over 80% of Hong Kong’s population. In Hong Kong, all people testing positive for COVID-19 are admitted to hospital for isolation. Therefore, the included study population in CDARS represent all patients with proven COVID-19, regardless of disease severity. Electronic patient records include demographics, diagnoses date and disease coding, hospital admission and discharge dates, drug prescriptions and dispensing records, laboratory test results, and date of registered death for research and audit purposes. All patients were anonymised in this system, thus sensitive information such as name and ID were not accessible. The coding accuracy of CDARS has been well-validated and published previously12,13,14,15,16,17.

The study was approved by the Institutional Review Board of the University of Hong Kong/Hospital Authority Hong Kong West Cluster (reference number: UW 21-436) and the need for informed consent was waived by the Institutional Review Board of the University of Hong Kong/Hospital Authority Hong Kong West Cluster. This study was conducted under the principles of the Declaration of Helsinki.

Study design and patients selection

This is a population-based retrospective cohort study. COVID patients were identified based on a positive SARS-CoV-2 RT-PCR test or disease classification (International Classification of Diseases, Ninth Revision, Clinical Modification [ICD-9-CM] code: 079.89, 480.8, 519.8) between January 1, 2020, and August 31, 2020. Patients who tested positive on ART, but negative on RT-PCR were excluded. Demographic information, date of the first positive lab result, baseline medication, baseline comorbidities, baseline laboratory result, admission date and principal diagnosis during follow up, and other clinical information were extracted from CDARS.

The index date was defined as the first date of positive PCR test. Follow-up started from the date of discharge until the occurrence of outcome, death, or study end (2021-04-13). Patients aged < 18 years at the index date or who died during hospitalisation were excluded.

Outcome

The main outcome of this study were all-cause mortality and all-cause re-hospitalisation after COVID discharge. Secondary outcomes included cause specific hospitalizations and new-onset medication use. Cause specific hospitalizations included hospitalization for respiratory reasons, gastro-intestinal reasons, mental/psychiatric reasons and cardiovascular reasons. Supplementary Table 1 shows details on the ICD codes used for each outcome. New onset medication use included angiotensin receptor blocker (ARB), beta-blockers (BBs), calcium channel blockers (CCBs), statins, anti-depressants, aspirin and non-steroidal anti-inflammatory drugs (NSAIDs). Supplementary Table 2 provides a detailed overview of the coding used to identify medications.

Statistical analysis

Baseline characteristics of all included patients were presented. Mean and standard deviation (SD) were used to present continuous variables, frequencies and percentages were used to present categorical variables. Normality of continuous variables was evaluated by visualizing histograms and inspecting the distribution. The incidence was presented as the rate per 100 person-year with the 95% confidence interval. We modelled the association of baseline viral load (CT values) as continuous variables with the risk of death, total- and cause-specific hospitalizations using restricted cubic splines and Cox regression analyses. We fitted models with 2–5 knots and chose the optimal number of knots for each spline using the Akaike Information Criterion. (AIC). Because we found a non-linear increase in risk with lower CT values for most outcomes, we dichotomized the CT values based on a cut-off point identified from the restricted cubic splines. To account for factors associated with higher viral loads, we weighed the hazards by balancing weights calculated from the propensity scores using the CBPS package18,19. We dichotomized the CT values according to the median and calculated propensity scores using the dichotomized CT values as the dependent variable. Supplementary Table 3 provides a full list of variables used for weighting, including age, sex, medical history, baseline medication and intensive care unit (ICU) stay. Variables for the multivariable models were included based on clinical relevance and data availability. All analyses were performed using R, version 4.0.5 (2021-03-31) and a two-sided P value of < 0.05 was considered statistically significant.

Results

Baseline characteristics

Figure 1 depicts the consort diagram on patient inclusion. A total of 4,602 patients with COVID-19 were identified in CDARS between 1 Jan 2020 to 31 Aug 2020. Patients aged < 18 years (n = 440) and who died before discharge (n = 108) were excluded. Out of the remaining 4054 patients, 3433 (85%) patients had CT values.

Figure 1
figure 1

CONSORT diagram.

Table 1 shows the baseline characteristics of the 4054 patients with COVID-19 discharged alive. The median age was 47 (interquartile range [IQR] 32–60) years and 49.2% (n = 1995) were men. Most patients had a low prevalence of comorbidities. Diabetes was the most common comorbidity, followed by a history of ischemic heart disease. Calcium channel blockers (CCBs) and statins were the most common medication at baseline. 4.1% of patients were admitted to ICU. The median ICU stay was 9 (IQR 5–15) days.

Table 1 Baseline characteristics.

Outcomes and association with viral load

Table 2 shows the event rates of patients discharged alive. During a median follow up of 251 (IQR 240–279) days, 16 patients died, and 408 patients were rehospitalized. The incidence rate of death was 0.47 (95% confidence interval [CI] 0.27–0.764) per 100 patient years (py) and 11.88 (95% CI 10.76–13.1) per 100 py for hospitalization for any reason. The most common reason for rehospitalization was respiratory, followed by gastro-intestinal, mental and cardiovascular reasons.

Table 2 Event rates.

Figure 2 depicts the restricted cubic splines for the association of CT values at presentation with death, total- and cause-specific hospitalization. We found a non-linear association of higher of lower CT values with a higher relative risk of death, total hospitalization and cardiovascular hospitalization (P < 0.001 for all), but not for mental, respiratory and GI hospitalizations. The risk for death, total and CV and respiratory hospitalizations decreased linearly until 20 repeats, after which the relative risk fell to 1.

Figure 2
figure 2

Splines showing the association between CT values (viral load) and the relative risk of death (A), any hospitalization (B), cardiovascular hospitalization (C), hospitalization for mental reasons (D), respiratory hospitalization (E) or hospitalization for gastro-intestinal reasons (F).

Table 3 shows results of the association between dichotomized CT values (< 20 repeats versus ≥ 20 repeats) and the different outcomes, weighed for probability weights using propensity scores. We observed that patients with high viral loads (< 20 CT repeats) were at a higher risk for death, total hospitalization, and CV hospitalization than those with lower viral loads (≥ 20 CT repeats). Patients with high viral loads more likely started ACE-inhibitors than patients with lower viral loads (≥ 20 repeats).

Table 3 Cox regression analyses for outcomes in patients with higher (< 20 CT values) compared to lower (≥ 20 CT values) viral loads.

Discussion

In our study, we found that post-discharge rehospitalization for patients with COVID-19 in Hong Kong were common. Higher SARS-CoV-2 viral loads at admission were associated with an increased likelihood of post-discharge death or rehospitalization, particularly for myocardial infarction or stroke, independent of initial ICU stay. Early risk stratification for long-term dismal outcomes after COVID-19 infection remains a challenge. Our results suggest that patients with higher viral loads at admission might benefit from close follow-up, early initiation of treatment and a holistic and multidisciplinary long-term care plan.

In Hong Kong, one in ten patients with COVID-19 were rehospitalized during a median follow-up of 251 days. In the United Kingdom, over a third of patients admitted for COVID-19 were readmitted and more than 1 in 10 died after discharge20. Strict lockdown measures and quarantine orders in Hong Kong have kept COVID-19 numbers lower in Hong Kong than most other countries21. Therefore, all COVID-19 patients in Hong Kong who test positive on RT-PCR, are isolated in hospitals or community facilities. The CDARs data represents all patients with COVID-19 in Hong Kong registered with the Hospital Authority, which includes a substantial number of asymptomatic individuals. This explains the lower numbers of post-discharge death and rehospitalization in Hong Kong than the UK. However, in a large study from the Veterans Health Administration among 73,435 uses with COVID-19 who were not hospitalized and survived the first 30 days after diagnosis, the risk of long-term rehospitalization and death was still higher than matched negative-controls, particularly for respiratory signs and symptoms and incident hypertension22.

Various studies indicated that higher SARS-CoV-2 viral loads at presentation was independently associated with in-hospital mortality, or intubation5,6,7,8,9. Respiratory and plasma viral loads of SARS-CoV-2 predicted the risk of in-hospital death, predominantly due to pneumonia5. This was confirmed in a 3,014 patients hospitalized in 3 large New York City hospitals6. All of these previous studies5,6,7,8,9 were performed in countries with a more significant burden of the epidemic and consequently higher admission threshold than Hong Kong and did not include long-term outcome. The current study is unique, due to its large representative sample size, inclusion of long-term outcomes and many asymptomatic patients. To our knowledge, we are the first to report on the association of viral load and long-term outcomes in patients with COVID-19.

We found that higher viral loads (lower CT-values) independently predicted death, total rehospitalization and rehospitalization for stroke or myocardial infarction, independent of ICU stay. In a previous study from the United States22, patients discharged after hospitalization for COVID-19 had an increased risk of CV conditions after discharge22. Similar results were observed in the United Kingdom in patients after discharge for hospitalization with more severe COVID-1920. A recent study in Sweden confirmed the increased risk of incident stroke and myocardial infarction in patients who recovered from COVID-1923. In our study, a higher viral load was associated with an increased risk of hospitalization for myocardial infarction, chest pain, and stroke. We also found that higher viral loads were associated with an increased risk of incident ACEi use. However, these associations were non-linear, such that the risk for death, hospitalization and medication use decreased linearly until 20 repeats, after which the relative risk remained stable at 1. Literature on cut-off values for CT repeats is not conclusive9, but many studies considered a viral load < 25 CT values as high9. Patients with a high viral load commonly also have more severe disease5,6,7,8,9. However, we did not find a significant interaction between viral load and ICU stay for any of the outcomes.

A discussion on the potential mechanisms of the association between higher viral loads and increased myocardial infarction and stroke risk is warranted. COVID-19 infection creates a pro-thrombotic state24. Several previous publications suggested that this prothrombotic phenotype might be mediated by ACE225,26. SARS-CoV2 enters the host cell by binding to ACE2, similar to other Corona viruses25, leading to enhanced shedding of ACE2 and release of soluble protein. Decreased ACE2 activity is potentially thrombogenic through angiotensin-2 activity and plasminogen activator inhibitor-1 expression27. Increased ACE2 plasma levels, potentially reflecting excess shedding, were associated with an increased risk of myocardial infarction and stroke in the PURE study28. Our study suggests that increased viral loads in patients with COVID-19 are associated with an increased risk of myocardial infarction and stroke, potentially mediated through reduced ACE2 activity. The results of our study might have several important clinical implications. First, our results suggest that viral load should not only be used for increased risk stratification but might also be a good indicator for the risk of long-term outcomes. Second, in the post-vaccination era, more patients will have milder disease. Therefore, future studies should investigate whether COVID. While our results indicate the association between COVID-19 viral load and long-term clinical outcomes, future prospective studies, including those accounting for vaccination, should confirm our findings-19 that viral loads are related to long-term risk in those with milder disease post-vaccination.

Limitations

The strength of our study was the inclusion of all patients with COVID-19 in Hong Kong, from a real-world cohort with a wide spectrum of disease severity. A limitation of the current analyses is that viral load measurements were missing in 16% of patients. Ascertainment bias might have affected the reliability of the study due to inaccurate entry of certain diagnosis codes for comorbidities, namely DM, and hypertension. Socioeconomic status was not available for many of the patients. However, because all patients with COVID-19 were mandated to be hospitalized, this will likely not have influenced our results. Finally, some patients might have had laboratory tests performed outside of the public hospitals. Patient-reported outcome measurements (PROMs) and cognitive status measurements were not available for this study, which can provide valuable insights into patient outcomes29. It is unclear how results from Hong Kong validate in other geographic regions. Unfortunately, costing variables were not available for this paper and not included in the analyses.

Conclusion

In Hong Kong, rehospitalization after discharge for COVID-19 infection was common, especially for respiratory and gastro-intestinal reasons. A higher viral load at diagnosis is an independent risk factor for death, all-cause rehospitalization and CV rehospitalization, particularly in patients with CT values < 20. Our results can inform global discussion on the post-acute manifestations of COVID-19 and risk stratification of patients based on their viral load.