Background

The relationship between obesity and infection was frequently highlighted during the COVID-19 pandemic [1], with numerous studies observing strong associations in adults of all ages [2]. With obesity prevalence rising [3], the potential public health importance of this association has been increasing, though the evidence base for many infections was limited [4]. Recently, there is more evidence emerging from observational studies that obesity, estimated from body mass index (BMI), is associated with infection risk [5,6,7,8]. Further, it has also been observed that for some infections, such as pneumonia, the relationship is U-shaped, such that a lower BMI as well as a higher one is also significantly related to higher infection risk [9,10,11]. However, the shape of the association with BMI may not be consistent across different infection types, with skin and urinary tract infections not necessarily being related to being underweight [9].

Obesity is also strongly associated with type 2 diabetes (T2D) [12], which has itself been shown to be related to greater risk of future infections [13]. However, it is not known if T2D mediates the association between obesity and infection, and whether the shape of the association with BMI is similar between people with T2D and people without diabetes. Recent evidence suggests that infection risk among people with T2D is heterogenous, with abdominal obesity being highly influential [14]. Given the high global prevalence of both T2D and obesity [15], with the recent emergence of novel weight loss therapies [16, 17], a better understanding of the interaction between BMI, diabetes and infection would be desirable and informative for patients, clinicians and policy-makers.

This study investigated whether (i) the relationship between BMI and infection risk is similar between people with T2D and people without diabetes, (ii) the magnitude and shape of the associations differs by age, sex and infection type. By utilising a large English cohort of electronic patient records of people with T2D, the study summarises findings on the associations with different infections at more extreme levels of BMI than have been previously reported. It also makes use of a matched cohort without diabetes who have been matched by age, sex and ethnicity to compare the associations across the same range of BMI values.

Methods

Data sources

CPRD (Clinical Practice Research Datalink) is a UK primary care database jointly sponsored by the Medicines and Healthcare products Regulatory Agency and the National Institute for Health and Care Research [18]. It provides a pseudonymised longitudinal medical record for all registered patients (>99% of the UK population are registered with a General Practitioner), with diagnoses and other clinical information recorded using Read Codes. This study used an extract from the CPRD Aurum database that included approximately 16 million currently registered patients from 1447 general practices (England only). Over 90% of contributing practices in Aurum have consented to their data being linked to external sources, and researchers have no access to geographical identifiers such as residential postcode [19]. These data sources include the ONS (Office for National Statistics) mortality data which includes cause of death and the Index of Multiple Deprivation (IMD), a composite small-area (approximately 1500 people) measure used in England for allocation of resources, which provides a good proxy for individual socio-economic deprivation.

Study design and participants

The study was a matched cohort design comparing people with diabetes to a group without diabetes, described previously [13]. Briefly, all patients aged 18 to 90 with a Read code for diabetes who were active in CPRD on 1st January 2015 and had been registered for at least one year were first identified, and classified into Type 1, Type 2 or unknown based on their diagnosis codes and anti-diabetes medication. For this analysis, the main focus was on 525,812 people with T2D (Fig. S1) who had been randomly matched to at least one person without known prediabetes or diabetes, of the same age, sex and ethnicity (White, South Asian, Black, mixed/other or not recorded). A maximum of two people without diabetes was selected for each person with T2D resulting initially in a total of 1,008,898 people (with 94% of match-sets having two comparators, 6% having one).

Body mass index

For each person in the study, the last recorded BMI during 2011–14 was used. If only a weight measurement was present, a BMI was manually estimated if a height measurement was available anywhere in their record (using the closest height measurement to 1/1/2015 when there were multiple measurements). Only BMI measurements between 12 and 85 kg/m2 were accepted as plausible. To compare BMI between people with T2D and without diabetes, and explore the non-linearity at fine detail, 12 BMI categories were created (≤19, >19–≤22, >22–≤24, >24–≤26, >26–≤28, >28–≤30, >30–≤32, >32–≤35, >35–≤39, >39–≤43, >43–≤48, >48) with sufficient numbers in the extreme categories for both the T2D and non-diabetes groups. Thus, ≤19 was chosen as the extreme underweight category rather than the conventional 18.5 because there were too few people with T2D with BMIs in this range. The BMI category >24–≤26, was chosen a priori as the reference category for both within group comparisons. This category has an approximate mean of 25 kg/m2, which was the value with lowest infection risk in a recent large population study from Denmark [9]. While over 98% of people with T2D had a BMI measured during 2011–14 (n = 516,935), this was true for only 75% of people without diabetes (n = 751,909) (Fig. S1). Although this resulted in some people with T2D no longer having a matched person without diabetes, since they are not directly being compared to each other here, these groups were used in the main analysis to maximise the total number. However, a sensitivity analysis considered only match-sets with 1:1 matching (n = 471,474 people with T2D).

Infection outcomes and covariates

Infections were collated using an extensive list of Read codes (primary care) and ICD-10 codes (hospital data), previously described [13]. Briefly, both data sources were searched electronically over a 5-year period (2015–19) for the following: (i) any infection with a prescription in primary care for an antibiotic, antifungal or antiviral within ±14 days of the diagnosis; (ii) any new hospital episode where an infection was the primary diagnosis. In the UK, hospital data is organised into finished consultant episodes and assigned a primary diagnosis [20]. Subsequent episodes can be assigned to the same admission, with a different primary diagnosis e.g., a hospital acquired infection. For each summary group, only one event was counted within a 90-day period, with multiple codes assumed to be the same event. A further analysis considered type of infection from either source, using the 8 most common groups - cellulitis, gastrointestinal tract (GIT), genitourinary (GUI), lower respiratory tract (LRT), mycoses, sepsis, skin (other) and upper respiratory tract (URT). Read code lists for the infection groups are available in the repository https://doi.org/10.24376/rd.sgul.21565557.v1.

Information was also extracted on patient smoking history, and co-morbidities as of 1st January 2015. We selected 12 chronic conditions routinely collected as part of the Quality and Outcomes Framework (QOF), a UK wide system for performance management and payment of GPs in primary care [21]. These were atrial fibrillation, cancer, chronic obstructive pulmonary disease, coronary heart disease, chronic kidney disease, dementia, epilepsy, heart failure, hypertension, peripheral vascular disease, serious mental Illness and stroke.

Statistical methods

Age-sex standardised incidence rates for each infection outcome were estimated, using the distribution of patients with T2D as the standard population. Poisson regression compared infection rates during follow-up for patients with T2D and without diabetes in separate models, with an offset fitted for total days of follow-up time in the study (SAS version 9.4). The 12 BMI categories were fitted, with Incidence Rate Ratios (IRRs) estimating infection risk compared to the >24–≤26 reference category. These were initially fitted adjusting for age (10-year age groups) and sex. Further adjustment added ethnicity (5 categories including not recorded), IMD (quintile), smoking and a count of co-morbidities. For each model an estimated attributable risk fraction (AF) was estimated [22], which assume the associations are causal and that all patients are shifted to the reference category (BMI >24–≤26). All analyses were stratified by sex and age group (18–50, 51–70, 71–90). The following sensitivity analyses were carried out: (i) restricting the analysis to never smokers and those without any co-morbidity, (ii) repeating the main analysis but only using 1:1 match-sets.

Results

Summary of BMI

The mean age of the 516,935 people with T2D in 2015 was 66.3 years (SD = 12.9) with 56% of them males (Table 1); similar to the non-diabetes group (66.8 years, 56% respectively). People with T2D were more likely to be living in more deprived areas and have more co-morbidities than people without diabetes. The mean BMI was substantially higher among people with T2D compared to the non-diabetes group (31.0 vs. 27.2 kg/m2). The distributions of BMI was more spread out among people with T2D at all ages (Fig. S2), with approximately 50% of people with T2D had a recorded BMI of 30 or greater compared to 24% of the non-diabetes group. In both groups there was a clear trend of descending age with increasing BMI (Fig. S3), though the difference in age between the extreme categories of BMI was greater in people with T2D (approximately 16 years vs. 10 years for people without diabetes).

Table 1 Baseline characteristics of people with and without type 2 diabetes with a BMI recorded in last 5 years.

Incidence of overall infection by BMI

Figure 1 summarises age-sex standardised incidence rates for the primary care and hospitalisation related infections in people with T2D and without diabetes separately by different age groups. People with T2D are at a higher risk for infections, both from primary care and those requiring hospitalisation, than people without diabetes at all BMI levels. The lowest incidence rates are observed at approximately 23 kg/m2 for primary care and 25 kg/m2 for hospitalisation related infections. Both outcomes show a consistent positive association with increasing levels of BMI, which are similar in people with T2D and without diabetes. For hospitalisation related infections there is also strong evidence of a U-shaped association with BMI at all ages, which is seen in both groups. However, for primary care infections, the higher incidence of infection at the lowest BMI level is much less apparent. Stratifying the overall analysis by sex showed similar trends with BMI, with women having consistently higher primary care infection rates (Fig. S4).

Fig. 1: Incidence rates for infections during 2015–2019 across BMI categories by age group in people with and without type 2 diabetes (T2D) separately.
figure 1

a Primary care infections (T2D, n=516,935). b Primary care infections (non-diabetes, n=751,909). c Hospitalisation-related infections (T2D, n=516,935). d Hospitalisation-related infections (non-diabetes, n=751,909). Incidence rates were age-standardised to T2D population. Error bars show 95% confidence intervals.

Relative risks of infection with elevated or low BMI

Table 2 compares adjusted incidence rates for the two summary infection groups, in people with T2D and without diabetes separately, using a BMI of >24–≤26 as a reference category, overall and by age group. The elevated relative risks at extreme high and low levels of BMI are much more apparent for hospitalisation related infections than for those seen in primary care. People in the highest category (BMI > 48) had a more than doubling of this risk compared to those with a BMI of >24– ≤ 26 (T2D IRR = 2.35, 95%CI 2.26–2.44, non-diabetes IRR = 2.52, 95%CI 2.30–2.75), and this pattern was seen at all ages. People in the most underweight category (BMI ≤ 19) had elevated risks of hospitalisation related infections (T2D IRR = 1.92 95%CI 1.83–2.02, non-diabetes IRR = 2.11 95%CI 2.05–2.17) compared to a BMI of >24–≤26. For primary care infections, the small increase in risk among underweight was only observed among ages 71–90 (Fig. S5). When the analysis was stratified by sex (Table S1), men tended to have higher IRRs at the extreme BMI categories in both people with T2D and without diabetes. Attributable risk estimates (Table S2) suggested that the combined contribution of under- and over- weight on primary care infections was greater in people with T2D (11.4%) compared to people without diabetes (5.9%), but for hospitalisation related infections it was more similar due to a greater proportion of people without diabetes being underweight (11.5% vs. 10.0% respectively).

Table 2 Adjusted incidence rate ratios for infections during 2015–19 by BMI in people with T2D and without diabetes.

Sensitivity analyses adjusting only for age and sex or restricting to 1:1 match-sets produced similar IRRs to the main analysis (Table S3). When the overall analysis was restricted to never smokers, those without co-morbidity, or both, the IRRs observed with high BMI levels were unchanged, and the association between underweight BMI levels and hospitalisation related infections remained similar (Table S4).

Risks with specific infection groups

Figure 2 summarises age-sex standardised incidence rates by infection type in people with T2D and without diabetes. The pattern of infection incidence with BMI varied greatly by infection type but were comparable between people with and without T2D (Fig. 2). Cellulitis had a distinctive J-shape, while lower respiratory tract was the most U-shaped with similar raised incidence at both the lowest and highest BMI levels. In the adjusted regression models (Table 3), people in the highest category (BMI > 48) had >6 times risk of cellulitis compared to those with a BMI of >24–≤26 (T2D IRR = 6.19, 95%CI 5.94–6.45, non-diabetes IRR = 6.43, 95%CI 5.98–6.92) and >3 times risk of sepsis (T2D IRR = 3.23, 95%CI 2.99–3.50, non-diabetes IRR = 3.30, 95% CI 2.71–4.03), with smaller increases in risk observed for all other types. For people in the underweight category (BMI ≤ 19), the risks compared to those with a BMI of >24–≤26 were approximately 1.6 times for both LRT infections and sepsis (IRR = 1.60, 95% CI 1.53–1.67 and IRR = 1.63, 95% CI 1.44–1.84 for people with T2D respectively), whereas there was no evidence of an increase in risk at low BMI for skin or URT infections.

Fig. 2: Incidence rates by infection type during 2015–2019 across BMI categories in people with and without type 2 diabetes (T2D) separately.
figure 2

a T2D (n=516,935). b Non-diabetes (n=751,909). Incidence rates were age-standardised to T2D population. Error bars show 95% confidence intervals.

Table 3 Adjusted incidence rate ratios for infections during 2015–19 by BMI in people with T2D and without diabetes, by type.

Discussion

This study used large electronic health databases to describe the associations between body mass index and infection risk, both within a cohort of people with T2D, and a comparison group without diabetes. Despite people with T2D having higher BMI levels, and being at greater risk of infections overall, it found that the relationship between BMI and infection was similar in people with and without diabetes. People classed as having obesity based on their BMI, regardless of whether they had T2D or not, were at a consistently higher risk of acquiring an infection over a 5-year period compared to persons of normal weight, particularly cellulitis. Additionally, very low BMI levels were also associated with a higher risk of infections requiring hospitalisation, particularly lower respiratory tract infections and sepsis, though not with infections treated in primary care.

Strengths and limitations

One of the main strengths of the present analysis is the large sample size that comprised over half a million people with T2D matched to at least one person of the same age, sex and ethnicity without diabetes. By studying a period prior to 2020, it assessed infection risks before the disruption of the COVID-19 pandemic, after which the relationship between BMI and COVID-19 infection risks became much more widely appreciated [1, 2]. The focus on adults with T2D had two advantages. Firstly, since they are generally seen regularly in UK primary care, it ensured a patient cohort with a high completeness of body mass index measurements (over 98%). Secondly, it allowed for an analysis of BMI values across a wide range, including appreciable numbers of participants with extremely high values (almost 25,000 people with T2D with a BMI > 43) which very few population studies have reported on.

The inclusion of the group without diabetes allowed a comparison of results across the same BMI categories, however they were less likely to have their weight measured recently, with only 75% of the originally matched cohort with an eligible BMI recorded during 2011–14. Although this resulted in unbalanced match-sets in the main analysis, the impact of this was minimal as the sensitivity analysis using 1:1 match-sets produced identical results. It might still be assumed that patients without diabetes and no BMI measurement are a healthier subset and might have more normal weights. However, the estimated prevalence of obesity (BMI ≥ 30) in the non-diabetes group with a recorded BMI (24%) was marginally lower than what was reported in the 2015 Health Survey for England (27%) [23]. A recent study investigating BMI and mortality using CPRD estimated a similar prevalence of obesity among adults with a measurement compared to national estimates [24].

Although the time since the last measurement will vary between patients, one might expect that sudden or significant weight changes are being captured, and using the last recorded BMI over a 4-year period may be a reasonable estimate of their BMI during the following 5-year follow-up period. Although the analysis could have used measurements from 2015 onwards to update a patient’s BMI this may increase the possibility of reverse causality where the infection had a resulted in a change in weight. Finally, it is important to acknowledge the potential limitation of BMI as a measure of clinical obesity [25], but despite this, BMI can still provide utility as a predictive measure of risk in large-scale epidemiological studies where it is routinely recorded.

Comparisons with previous studies

The recent evidence base linking obesity to infection risk throughout the life course has been observed in several large epidemiological studies, mainly using BMI measurements in adult populations in developed countries [26]. The evidence regarding underweight is more sparse, although a 2013 systematic review concluded that being underweight was potentially a more serious risk factor for community acquitted pneumonia than obesity [27]. Instead, many studies have focused only on a presumed linear association between a range of infections and BMI. For example, a large Swedish cohort study of 39,163 adults found higher risk of many infections in participants with a BMI > 30 compared to normal weight (BMI = 18.5–25), but the study excluded underweight (BMI < 18.5) adults [8]. More recent studies from Denmark [9, 10] and Taiwan [11] have reported on the heterogenous shapes of the associations with different infections across a wider range of BMI values including underweight. Harpsøe et al [10], found that within the Danish National Birth Cohort (DNBC) of 75,001 middle age-women the relationship between any hospitalisation for infection over 12 years and their baseline BMI was U-shaped, with a particularly high risk among underweight women (BMI <18.5), largely driven by respiratory tract infections. The study of 101,447 adults in the Copenhagen General Population Study (CGPS) also found U-shaped associations between BMI and hospitalisation for any infection, in particular pneumonia [9]. Finally, a study of 120,864 adults in Taiwan showed U-shaped associations between BMI and hospitalisation for all infections, particularly LRT infections and septicaemia, and estimated that underweight people were at a greater risk than people who had obesity [11]. Our analysis similarly found (i) U-shape associations overall, particularly for hospitalisation related infections, which were not explained by smoking or co-morbidity, and (ii) these associations were strongest for both LRT infections and sepsis, which had the greatest relative risks among underweight diabetes and non-diabetes patients (BMI < 19) compared to their reference group. A recent study using the same CPRD dataset as utilised here but into 2020, found that among 584,854 people with T2D there was a higher risk of hospitalisation for COVID-19 among those with a BMI < 18.5 compared to 25–29.9, comparable to the more reported risks associated with a BMI of 40 or more [28].

A positive association between skin infections and obesity has been consistently observed [8,9,10,11, 29], with no evidence of an increase in risk among lower BMI levels [9,10,11, 29]. When the study from Copenhagen analysed their data using genetic instrumental variables, the only associations that remained significant from their initial analysis were with skin infections [9]. Obesity has been recognised as a predisposing factor for lower limb cellulitis, with a number of proposed mechanisms such as venous insufficiency, impaired lymphatic flow, increased skin fragility, and poorer hygiene [30]. A recent meta-analysis of case-control studies found that patients with cellulitis had 2.67 times odds of being classed as having obesity when compared with controls [31]. Our analysis was found that among all the infection types, the strongest positive association with increasing levels of BMI was with cellulitis. Similarly, in the Danish birth cohort study the risk of hospitalisation for erysipelas or cellulitis was five-fold higher among women with obesity (BMI ≥ 30) compared to normal levels of BMI [10].

We have previously reported on the increased risk of infection among people with diabetes and, due to the high prevalence of diabetes, estimated that almost 10% of all infection related hospitalisations among the adult population in England may be attributable to T2D [13]. Given the very high current prevalence of obesity, there is interest in understanding the relationship between these conditions. People living with either T2D or obesity share many of the same characteristics, and it has been suggested that the poor outcomes experienced by both groups may be due to an immune system dysfunction being triggered by chronic low-grade inflammation present in both T2D and obesity [15]. We found no evidence that diabetes mediated the relationship between BMI and infection risk. Although people with T2D were at higher risk of infections at every BMI category, their relative risk compared to a person with T2D and a BMI of 24–26 kg/m2 was very similar to that estimated among the non-diabetes group. Few population-based analyses have focused on people with T2D or compared the relationship with common infections and BMI to a non-diabetes population. A Taiwanese study of obesity, diabetes and tuberculosis found that the relationship was complex, such that participants who had obesity and diabetes had similar or lower risk of tuberculosis compared to participants without diabetes of normal weight [32].

Implications

The current obesity epidemic has huge cost implications for healthcare and populations globally [33]. In the UK, it has been estimated that obesity accounts for approximately 3% of Gross Domestic Product [34]. Data from Australia has shown that among middle-aged adults, there is a strong dose response relationship between BMI and hospital costs and length of stay [35]. Similarly, rising BMI is inversely related to health-related quality of life measures [33]. While the health consequences of obesity predominately focus on non-communicable disease, our study suggests infections may represent another important consequence to consider. The association with a wide burden of disease, has led to the suggestion that obesity could be an “amenable target for complex multimorbidity prevention” [29]. A recent retrospective cohort study in the UK found that higher BMI categories were associated with increased healthcare expenditure, particularly when obesity-related complications were present [36]. The introduction of GLP-1 agonists as novel weight loss therapies, such as semaglutide, may have significant impact on population trends in obesity, and could represent a turnaround in the effectiveness of weight loss public health strategies [37]. With trial evidence additionally suggesting lower rates of infection while on semaglutide [38], their increased usage could play a part in reducing infection risk in the population, as well as the non-infectious consequences of obesity. However, the roll out of these weight-loss drugs is unlikely to be fast and may increase inequities that already exist regarding obesity in the population [39]. Although a recent study using CPRD data found that while all-cause mortality rates had significantly declined in people with obesity (BMI ≥ 30) from 2004-2019, no significant reduction was seen for infection specific deaths [24]. Therefore, more targeted infection prevention strategies in higher risk patients based on their BMI such as appropriate vaccination, and increased patient and clinician awareness of the need for early assessment of infections, could result in prompter diagnosis and effective treatment, with the potential to reduce infection-related hospitalisations.

Conclusions

In conclusion, our study has emphasised the impact of obesity on infection risk, and that this relationship appears to act independently of Type 2 diabetes. In addition, some infections such as pneumonia and sepsis are more common in underweight people and are not simply explained by other health conditions.