Introduction

Obesity is a major public health issue and an established risk factor for many other chronic health conditions such as type 2 diabetes, coronary heart disease, neurological disorders, and multiple cancers. In the UK, 26% of the adult population were living with obesity in 2021, with projections suggesting this number will rise to almost 37% by 2034 [1, 2]. The estimated annual cost of obesity to the UK’s National Health Service (NHS) is £6.5 billion [3].

Obesity is associated with many interrelated lifestyle behaviours including a sedentary and physically inactive lifestyle, and excessive caloric intake [1, 4,5,6,7,8]. The World Health Organisation recommends at least 150 min of moderate- or 75 min of vigorous-intensity physical activity weekly to reduce obesity risk [9,10,11]. Likewise, evidence suggests that consuming at least five portions of fruits and vegetables daily is associated with lower BMI and obesity risk [12, 13]. Other modifiable behaviours, such as sleep duration, alcohol intake, and smoking, also contribute to obesity risk [13,14,15,16,17,18]. Both short and long sleep durations outside the NHS recommended 7–9 h/day are associated with greater BMI and obesity risk [16, 17, 19, 20]. Furthermore, exceeding the NHS alcohol intake guidelines (no more than 14 units per week) and being an ex-smoker, are linked to higher BMI [15, 16, 21,22,23,24,25].

While the associations between individual lifestyle behaviours and obesity risk are well studied, these behaviours do not occur in isolation. There may be important interactions (i.e. multiplicative effects) between behaviours which are not captured when studied separately [26,27,28]. For instance, the weight gain associated with smoking cessation may be mitigated by regular physical activity [27]. In another study, Kaufman et al., highlighted the synergistic effect of smoking and sedentary behaviour on obesity risk [29]. Nurwanti et al. have also demonstrated that the positive association between sedentary behaviour and obesity risk is amplified by greater carbohydrate intake [30]. Additionally, Kim et al. further illustrated that obesity risk was highest among individuals who reported insufficient sleep and frequent binge alcohol consumption [31]. Furthermore, there is evidence suggesting that when behaviours are examined in isolation, the impact on obesity risk is not fully explained compared to when studying multiple or synergistic effects of behaviours [32, 33]. The combined and interactive associations of multiple (>2) lifestyle behaviours with BMI and obesity risk remain inadequately understood [27, 28, 34]. This is because of challenges in modelling health outcomes based on multiple lifestyle behaviours, while accounting for the high number of possible interactions. While various approaches exist to address this, one such method is Latent Class Analysis (LCA), which categorizes individuals into distinct subgroups based on behaviour patterns [33, 35]. However, LCA and similar approaches may still fail to fully capture the complex, individual-level interactions between behaviours that influence obesity risk, as they primarily focus on grouping individuals rather than modelling the dynamic interrelationships between multiple behaviours.

Multilevel analysis of individual heterogeneity and discriminatory accuracy (MAIHDA) is a method predominantly used to investigate intersectional inequalities [36, 37]. It involves creating intersectional groups or strata (e.g. one stratum might be Black ethnicity, female sex, aged 40–60 years, high education, living in a deprived area) and fitting a multilevel model, with individuals at level one nested within strata at level two. This approach addresses many limitations of conventional methods (e.g. a standard regression model with all possible interactions), with advantages including model parsimony, precision weighted estimates, and accurate estimates for strata with small sample sizes [34, 38]. MAIHDA efficiently allows the researcher to 1) estimate the predicted outcome for each user-defined group/stratum of interest, 2) compare between strata vs. between individual within strata variation, and 3) separate additive vs. multiplicative effects. Extensive details about the MAIHDA method have been described elsewhere [36, 38, 39].

While MAIHDA has gained serious traction in social epidemiology to study intersectional inequalities, it has not yet been used widely in other fields. We intended to demonstrate for the first time how MAIHDA can be used in behavioural obesity epidemiology. This study aimed to investigate the combined, interactive associations of five lifestyle behaviours (diet, physical activity, sleeping, smoking, and alcohol intake) with BMI and obesity risk.

Research design and methods

Study and sample

We conducted a cross-sectional study using baseline data from the UK Biobank – a large, prospective cohort study of over 500,000 adults aged 40–69 years between 2006 and 2010. Written informed consent was obtained from all participants. More details on the UK Biobank study are available online [40,41,42]. This research was part of project number 80843.

After excluding 237,361 participants due to missing data (233,866 for lifestyle behaviours, 2506 for confounders, and 989 for BMI), 264,995 participants (139,540 males, 125,455 females) were selected for the final analysis (Supplementary Fig. 1).

Lifestyle variables

At the UK Biobank baseline assessment visit, lifestyle variables were assessed through a touchscreen questionnaire. Four of the five lifestyle variables were re-coded according to meeting vs. not meeting the following UK national public health guidelines: sleep duration (7–9 h per day), fruit and vegetable intake (at least 5 portions per day), alcohol intake (no more than 14 units per week), and physical activity (at least 150 min of moderate- or 75 min of vigorous-intensity activity weekly). The fifth lifestyle variable, smoking status, was coded as current, previous, and never smoker. More details on the lifestyle variables and recoding are in Supplementary Table 1.

Lifestyle strata

Forty-eight unique groups or strata (i.e. 2 × 2 × 2 × 2 × 3) were created based on combinations of these lifestyle behaviours. Each stratum was assigned a 5-digit number, with each digit representing whether the participant met (1) or did not meet (0) the relevant lifestyle behaviour recommendation. Smoking was coded as current (1), previous (0), and never (2). For example, a 5-digit number of 01112 would represent participants who did not meet the sleep duration guideline (first digit = 0), met the fruit and vegetable intake guideline (second digit = 1), met the alcohol intake guideline (third digit = 1), met the physical activity guideline (fourth digit = 1), and were never smokers (fifth digit = 2). Further details on how the strata were constructed are provided in Supplementary Fig. 2.

By stratifying participants into these lifestyle profile groups, the MAIHDA approach enables us to quantify whether there is any additional strata-level effect beyond the sum of individual behaviours. This also allows us to understand whether certain lifestyle combinations have additional influence on BMI, and obesity risk compared to considering the behaviours individually.

Outcomes

Standing height and weight of participants were measured by trained staff using a non-stretchable sprung tape measure and standardized scales [39, 40]. The outcomes of interest were BMI (kg/m2) and weight status (underweight <18.5 kg/m2, normal weight 18.5–24.9 kg/m2, overweight 25.0–29.9 kg/m2, and obesity >30 kg/m2) [39]. The underweight category was excluded due to low participant numbers (N = 1149).

Confounders

Age, ethnicity, and socioeconomic position were considered as potential confounders. Age and ethnicity were self-reported at the time of recruitment. Ethnicity was self-reported and categorized as white and non-white (mixed, Asian or Asian British, Black or Black British, and other). Indicators of socioeconomic position included the Townsend Deprivation Index (TDI), where participants were asked to report their home postcode, and self-reported employment status (currently employed vs. retired or unemployed).

Statistical analysis

All analyses were stratified by sex. Descriptive statistics were utilised to describe the participants and variables of interest, and linear and binary logistic MAIHDA models were fit to BMI and weight status, respectively. To evaluate potential multicollinearity among the predictors, we examined pairwise correlations and calculated the variance inflation factors (VIFs) for all lifestyle behaviours and confounders, separately for males and females.

Although we initially attempted to use a multinomial logistic model to examine both overweight and obesity status in one model, this approach resulted in convergence issues with the models, prompting us to use two separate binary logistic models instead.

Three models were fit for each outcome:

Model 1: Null model

The first model included only a random intercept with no predictor variables.

$${\rm{Linear}}:{{bmi}}_{{ij}}={\beta }_{0{ij}}$$
$${\beta }_{0{ij}}={\beta }_{0}+{u}_{0j}+{e}_{0{ij}}$$

where \({y}_{{ij}}\) denotes the BMI for individual \({i}\) in stratum \(j\), \({\beta }_{0}\) represents the precision-weighted grand mean BMI, \({u}_{j}\) signifies the stratum random effect, and \({e}_{{ij}}\) indicates the deviation of the observed BMI for individual \(i\) in stratum \(j\) from the stratum mean BMI estimate.

$${Logistic}:{logit}\left({\pi }_{j}\right)=\log \left(\frac{{\pi }_{j}}{1-{\pi }_{j}}\right)={\beta }_{0}+{u}_{j}$$

where \({logit}\left({\pi }_{j}\right)\) denotes the log odds of obesity (vs. normal weight), \(\beta\)0 represents the precision-weighted grand mean, and \({u}_{j}\) signifies the stratum random effect.

Model 2: Main effects model

In the second models, the main effects (lifestyle variables used to construct the strata) were added to the fixed part of model 1.

Model 3: Main effects model with confounders

Model 3 is an expansion of model 2 by the addition of confounders (age and TDI were centred at their means for meaningful interpretation of the intercept).

For each model, the variance partition coefficient (VPC) was calculated as the proportion of the total variance attributable to between strata variance and expressed in percentage:

$${VPC}=\frac{{Variance\; between\; strata}}{{Variance\; between\; strata}+{Variance\; within\; strata}}\,\times 100$$

The VPC in logistic models is computed differently than in general linear models as the individual-level variance is not estimated. The \({\sigma }_{e}^{2}\) is therefore set at the variance of the standard logistic distribution or \(\frac{{\pi }^{2}}{3}=3.29,\) where π signifies the constant 3.142. We expected a reduction in the VPC in models 2 and 3 compared to model 1 as some between-stratum variance would be explained by the inclusion of predictor variables.

We also calculated the proportional change in variance (PCV) as a measure of the amount of between stratum variance due to additive effects, expressed as a percentage.

$${PCV}=\,\frac{{\sigma }_{\mu 2}^{2}-\,{\sigma }_{\mu 1}^{2}}{{\sigma }_{\mu 1}^{2}}\,\times 100$$

where \({\sigma }_{\mu 1}^{2}\) is the between stratum variance in model 1 and \({\sigma }_{\mu 2}^{2}\) is the between stratum variance in models 2 or 3.

We also calculated the predicted BMI and obesity risk for each individual strata and ranked them in ascending order. We also identified strata where predicted BMI or obesity odds were influenced by interactive effects, both positive and negative. This was achieved by isolating each stratum’s residuals and comparing the 95% confidence intervals for estimates with and without the random effect. No overlap of the intervals in a stratum was indicative of the presence of a multiplicative effect for that stratum.

All procedures were performed in Stata 17 (StataCorp LP, College Station, TX, USA). The command runmlwin was used for the multilevel models [43].

Results

Participant characteristics

As shown in Table 1, median BMI values were in the overweight range, and 23% of males and 17% of females had obesity. While most men and women met the recommendations for physical activity and sleep duration, few met those for fruit and vegetable and alcohol intake.

Table 1 Descriptive statistics.

General linear MAIHDA models for BMI

Pairwise correlations between the lifestyle behaviours and confounders were generally low ( < 0.2), and the VIFs for all predictors were below 1.6 for both males and females, indicating that multicollinearity was unlikely to bias the estimates.

Results for the general linear MAIHDA models for BMI are given in Table 2. In model 1, the level 2 (between-stratum) variance was 0.7 for males and females, while the level 1 (between-individual, within-strata) variance was 15% for males and 20% for females. The VPC was 4% for both sexes, indicating that only 4% of the variances in BMI were attributable to differences between strata as opposed to differences between individuals, within strata.

Table 2 MAIHDA models for BMI.

After adding the main effects for predictor variables in model 2, the VPCs reduced to 0.3% for males and 0.2% for females. The small VPCs indicated that most of the between-strata variance was explained by additive effects, with minimal contribution from multiplicative effects. This was further supported by the model 2 PCV estimates, which showed that 93% of the between-strata variance for males and 96% for females were explained by additive main effects, highlighting the minimal role of interactive effects. On adjusting for the confounders in model 3, VPCs and PCVs remained similar.

Predictions from the linear models

Table 3 shows the five highest and lowest strata ranked by their predicted mean BMI from model 3. The full list of strata ranked by their predicted mean BMI is given in Supplementary Tables 2 and 3. For both sexes, most of the lifestyle recommendations were met by the strata that had the lowest predicted mean BMI. For instance, among males, the strata of never smokers who met all four lifestyle behaviour recommendations had the lowest predicted mean BMI of 26.7 kg/m2. Conversely, the strata that did not meet most of the recommendations were among the highest predicted mean BMI. For example, the strata with males who were previous smokers and did not meet any of the lifestyle behaviour recommendations except for daily fruit and vegetable intake, had the highest predicted mean BMI of 30.0 kg/m2. Additionally, adherence to the physical activity and sleep guidelines was observed in all five strata with the lowest predicted mean BMI for both males and females.

Table 3 The five strata with the lowest and highest BMI.

While there was little to no evidence of multiplicative effects across most strata, interactive effects were observed for one specific combination of lifestyle behaviours only – male current smokers meeting only the physical activity recommendations. Non-overlap of the 95% CIs for predicted BMI when including (26.9–27.5 kg/m2) vs. excluding (27.5–28.0 kg/m2) the random intercept in the stratum indicated the presence of a multiplicative effect.

Binary logistic MAIHDA models

The results of the binary logistic models for obesity vs. normal weight is given in Table 4. The VPCs from model 1 indicated that 6% of the total variance for females and 7% for males were explained by differences between strata as opposed to between individuals, within strata. These reduced to near zero values (0.004% for males and 0.010% for females) in model 2 on addition of the fixed effects. The model 2 PCVs of 98% for males and 93% for females indicated that most of the between-strata variances were explained by additive main effects compared to multiplicative effects. VPCs and PCVs remained largely unchanged in model 3 on further adjustment for the confounders, indicating only a very minor influence of the confounding variables.

Table 4 MAIHDA models for obesity status.

Predictions from the logistic models

Table 5 shows the top five and bottom five strata ranked by their predicted obesity probabilities from Model 3. For the strata with the lowest predicted probability of obesity, most lifestyle recommendations were followed, and this trend was consistent for both males and females. For example, among males, the strata of never smokers who adhered to all four lifestyle recommendations had the lowest predicted obesity probability (33%). Conversely, the strata that did not follow most of the lifestyle recommendations had the highest predicted probability of obesity. For instance, among males, the strata of previous smokers who did not meet any of the lifestyle recommendations had the highest predicted obesity probability estimates (79%). The full list of strata ranked by their predicted obesity probabilities are given in Supplementary Tables 4 and 5. Furthermore, as observed with BMI, all five strata with the lowest predicted probability of obesity for both sexes consistently followed the physical activity and sleep guidelines. We also observed that the same strata of male current smokers meeting only the physical activity recommendations displayed a non-overlap of 95% CIs (0.40–0.49, 0.49–0.56), indicating the presence of interactive effects on predicted obesity probability.

Table 5 The five strata with the lowest and highest predicted probability of obesity.

Given that the manuscript focuses on obesity as the primary clinical outcome, we have presented only the main results for the linear BMI models and the logistic models comparing obesity with normal weight. Logistic models comparing overweight with normal weight are reported separately in Supplementary Tables 6–8.

Discussion

Lifestyle behaviours such as physical activity, diet, smoking, alcohol intake, and sleep influence a range of health outcomes including obesity [4, 5, 12,13,14,15]. While prior studies have explored associations between individual behaviours and obesity risk, this study advances knowledge by investigating their combined relationship with BMI and obesity risk [13,14,15, 27, 28]. This is important as these modifiable lifestyle behaviours do not occur in isolation and may interact. Using MAIHDA, our key findings were that 1) most variation in BMI and obesity risk was due to differences between individuals (within the same strata) rather than between strata, 2) the influence of lifestyle behaviours on BMI and obesity risk was largely additive, with minimal multiplicative effects, 3) practically, this manifested as the strata meeting more number of lifestyle recommendations had lower predicted BMI and obesity risk, and 4) meeting the physical activity and sleep guidelines appeared most consistently in the strata with the lowest predicted BMI and obesity risk.

To our knowledge, this is the first study to apply MAIHDA to behavioural obesity epidemiology to examine the combined and interactive associations of multiple lifestyle behaviours with obesity risk. As highlighted by many studies, obesity often has multifactorial lifestyle risk factors [44,45,46,47]. Therefore, MAIHDA is particularly relevant in the context of obesity epidemiology, as it allows the investigation of multiple behaviours and their interactions. The present study found that most of the total variation in BMI and probability of obesity was explained by between-individual, within-strata differences as opposed to between-strata differences, as shown by VPC estimates of 4% for males and 7% for females. For reference, the VPCs of MAIHDA null models are typically around 5%, rarely exceeding 10% [34, 45, 48]. This demonstrates the consistency of the MAIHDA approach across different contexts. Similar patterns have been observed in previous MAIHDA research reporting comparable variance parameters (VPC ~ 5%, PCV ~ 95%) [48,49,50,51,52,53]. The primarily within-strata effects suggest that while combined healthy lifestyle behaviours play an important role in lowering BMI and obesity risk, most variance in mean BMI and obesity risk is explained by other factors not examined here (e.g. neighbourhood, access to parks, job nature, and wider dietary intake).

Our findings showed that the effects of meeting vs. not meeting lifestyle behaviour recommendations on BMI and obesity risk were predominantly additive rather than multiplicative. This means that the influence of meeting or not meeting each lifestyle factor on obesity risk was largely independent and added to one another, rather than being dependent on each other and their combined effect being the product of their individual effects. This was demonstrated by the model 2 and 3 PCVs and the substantial reduction in VPCs after adding the main effects. This finding aligns with previous research in which an unhealthy diet and physical inactivity combined additively increased several cardio-metabolic outcomes such as Type 2 Diabetes, dyslipidemia, and hypertension [54, 55]. Similarly, Layman et al., highlighted the additive effect of physical activity and diet on improving body composition and lowering of body fat [56]. These findings suggest additional benefits from following more individual lifestyle recommendations, rather than synergistic effects between their combinations. In other words, meeting each additional lifestyle behaviour recommendation is associated with lower BMI and obesity risk. For example, even if three out of the five guidelines are met, meeting one or both of the remaining guidelines may further decrease risk.

Predictions from the final models indicated that strata meeting most recommendations had lower predicted BMI and obesity risk, while those not meeting most had higher predicted BMI and obesity risk. For instance, males who never smoked and met all recommendations, had the lowest predicted mean BMI (26.7 kg/m2) and probability of obesity (33%). In comparison, the predicted BMI and obesity probability were 4.5 kg/m2 and 46%, respectively, higher in males who were previous smokers and met none of the lifestyle behaviour recommendations. These differences in the highest and lowest ranked strata underscore the importance of meeting all the lifestyle recommendations in reducing BMI and obesity risk. This aligns with previous research emphasizing benefits of multiple healthy lifestyle behaviours in reducing obesity [18, 47, 53].

It is important to align the core message with the primary study aim, which focuses on the combined effect of multiple lifestyle behaviours rather than the individual impact of any single behaviour. While certain behaviours, such as physical activity, sleep, and smoking appear to contribute more to BMI and obesity outcomes in our study, these must be interpreted in the context of the combination with other behaviours and not in isolation.

The study findings have implications for future research aimed at mitigating obesity risk. By using MAIHDA to explore obesity risk differences between lifestyle behaviour combinations, we have provided evidence of additive benefits of meeting recommendations. While this means that meeting each of these individual lifestyle behaviour recommendations is important, and some may be relatively more influential, at the population level, emphasis should be on meeting as many as possible to maximise the weight-related benefits of these recommendations. Likewise, at the individual level, adopting each additional behaviour confers additive benefits. However, our study also highlighted the greater influence of factors other than the lifestyle behaviour combinations, which could include aspects such as the built environment, neighbourhood, and access to public walkways. We therefore recommend that future research include such structural factors to better understand their combined impact on obesity risk and related conditions using primary data and other large-scale datasets. Furthermore, future work could also model lifestyle behaviours as continuous exposures or as part of an aggregate lifestyle score, enabling the assessment of potential dose-response and/or non-linear relationships alongside guideline-based thresholds.

Strengths and limitations

The study has several strengths. First, the use of MAIHDA enabled a more detailed exploration of the stratum-specific intersectional effects of the lifestyle behaviours, providing an advantage over traditional methods, which are often limited in the number of interactions they can investigate [36, 38, 43, 51, 52]. While other approaches such as LCA categorize individuals into distinct subgroups based on patterns of behaviour, they may fail to capture the complex relationships between behaviours and outcomes at the individual level. In contrast, MAIHDA allows for a more dynamic and flexible modelling approach, providing more precise estimates, especially when dealing with smaller groups. Another strength is the large sample size, which was crucial in enabling us to demonstrate the application of MAIHDA to explore variation in BMI and obesity risk according to different lifestyle behaviour combinations. Lastly, the use of nationally recommended guidelines for these lifestyle behaviours enhances the generalizability of our findings to the broader population.

However, there are several limitations to consider. First, while we focused on the key lifestyle behaviours influencing obesity risk, other behavioural factors that also could impact obesity were not included in the analysis. Second, certain covariates, such as educational attainment and income, were not adjusted for due to high levels of non-response (i.e. missing data), and their potential confounding effects on the relationship between lifestyle behaviours and obesity risk should be considered in future research. Third, since the analysis relied on self-reported data for the lifestyle behaviours, recall bias might have influenced the results. Fourth, although each of these lifestyle factors have been shown to causally impact body weight, there is also evidence that greater adiposity itself may lead to reductions in physical activity and increase in sedentary behaviour, indicating possible reverse causality [57, 58]. Given the study’s cross-sectional design, the analysis cannot establish temporality or address reverse causality. Finally, the UK Biobank sample is not fully representative of the broader UK population, which could introduce bias and limit the applicability of our findings to the general population.

Conclusions

Our findings highlight substantial differences in mean BMI and obesity risk between groups of adults who meet different combinations of lifestyle behaviour guidelines. However, most of this variation in the outcomes was attributed to differences between individuals within strata, as opposed to differences between strata. Overall, we found predominantly additive effects and minimal evidence of interactive effects of the lifestyle behaviours. This has important public health implications highlighting that there are additive benefits of meeting a greater number of public health lifestyle behaviour recommendations in the context of reducing BMI and obesity risk.