Introduction

Several key human body functions and biological processes are regulated over a 24 h cycle by an internal biological clock1. Regulation of this rhythm is progressively disrupted with ageing, as manifested by shifts in biological processes, and a range of behaviours such as the sleep-wake cycle2. There is also emerging evidence of a link between dysregulation in an organism’s innate timing device—the circadian clock3, and its bio-behavioural manifestation—circadian rhythm2, and a range of chronic diseases. Along with studies on the molecular organisation of the circadian clock3, there is increasing research interest in its bio-behavioural manifestation; recent advances in accelerometry allow scalable, objective measures of this behavioural circadian rhythm in large-scale studies4,5.

Multiple metrics can be derived from 24 h accelerometer data to reflect four key behavioural dimensions of circadian rhythm—‘rest-activity rhythm’ (RAR) reflecting circadian rhythmicity in a free-living setting, ‘daytime activity’ composed of physical activity (PA) and sedentary behaviour (SB) over the waking period, ‘sleep’ to measure the quality and quantity of sleep during the sleep period, and ‘chronotype’ to measure wakefulness and sleep timing. Whether dysregulation is uniform across these circadian rhythm dimensions in individuals remains unclear. Most studies on accelerometer-based measures of circadian rhythm focus on RAR and chronotype, either ignoring other dimensions6, or using only one marker of PA and/or sleep7,8,9,10,11,12. Few studies have examined all four dimensions (RAR, daytime activity, sleep, and chronotype)13,14,15 and have either considered these dimensions in separate models13, or in a mutually adjusted model14, or derived a large number of composite scores15, that do not allow individual patterns of dysregulations in the four dimensions to be elucidated.

The quantity of data generated by accelerometers, with multiple metrics for each dimension, and their correlated nature present analytic challenges. The aim of this study was to consider all four dimensions (RAR, daytime activity, sleep, and chronotype) and their metrics to identify real-life human clusters of circadian rhythm in older adults. To address concerns of the correlation between various metrics, we used a two-step procedure consisting of data reduction and then cluster analysis. To ensure construct validity of the method, analyses were undertaken in the Whitehall II (WII) and in the UK Biobank (UKB) cohort studies. A secondary objective was to determine the sociodemographic, behavioural, and health-related factors correlates of circadian rhythm clusters identified in our study.

Results

A total of 3,991 (mean age = 69.4 (standard deviation (SD) = 5.7, range = 60–83) years; 26% women) and 54,995 (mean age = 67.5 (SD = 4.2, range = 60–79) years; 54% women) participants from WII and UKB, respectively, were included in the clustering analyses (flowcharts in Figs. S1 and S2). The correlations (Fig. 1) between the 36 accelerometer-based metrics (described in Table S1) were mostly moderate (0.40 to 0.59) to high (≥0.60) for metrics within the same dimension of circadian rhythm—RAR, daytime activity, sleep, and chronotype. RAR metrics were also highly correlated with most daytime activity metrics (e.g. absolute range of correlations of cosinor mesor with 13 out of 15 daytime activity metrics: 0.53–0.86 in WII, 0.41–0.83 in UKB), moderately correlated with some sleep metrics (absolute range of correlations of relative amplitude with 6 out of 10 sleep metrics: 0.27–0.57 in WII, 0.17–0.76 in UKB), and had a weak correlation with chronotype metrics (absolute range of correlations for the 5 metrics: 0.11–0.29 in WII, 0.11–0.31 in UKB). The correlations between metrics of daytime activity, sleep, and chronotype were weaker (<0.40).

Fig. 1: Correlation matrix of 36 metrics reflecting circadian rhythm dimensions (rest-activity rhythm, daytime activity, sleep, and chronotype). a Whitehall II. b UK Biobank.
figure 1

IG intensity gradient, L5 least active 5-h period, LIPA light intensity physical activity, M10 most active 10-h period, MVPA moderate to vigorous physical activity, SB sedentary behaviour, TPar,d transition probability from activity to rest during the day, TPra,d transition probability from rest to activity during the day, TPsw,n transition probability from sleep to wake during the night, TPws,n transition probability from wake to sleep during the night, WASO wake after sleep onset. A day is defined as the period between waking time to start the day and sleep onset, and a night as the period between sleep onset and waking for the day. Source data are provided as a Source Data file.

To justify use of 36 rather than commonly used 10 circadian rhythm metrics (including interdaily stability, intradaily variability, relative amplitude, cosinor mesor, cosinor amplitude, cosinor acrotime, M10 and L5 timing and mean acceleration, excluding most daytime activity and sleep metrics) we compared the predictive performance for mortality of the 10 metrics and all 36 metrics. This was done by first using principal components analysis (PCA) in each set of metrics (10 and 36 metrics). A total of three principal components were retained in analysis on 10 metrics (Tables S2 and S3) that explained 80.4% and 77.6% of the variability in WII and UKB datasets, respectively. For 36 metrics, eight principal components were retained that explained 86.5% and 85.4% of the variability in WII and UKB datasets, respectively (Table S4). Over median 11.0 and 8.0 years of follow-up, a total of 633 and 3256 participants died in WII and UKB, respectively. The predictive performance of the model with 36 metrics (C-index = 0.675, 95% confidence interval (CI) = 0.653–0.696 in WII; and C-index = 0.652, 95% CI = 0.643–0.662 in UKB) was higher than the model with 10 metrics (C-index = 0.651, 95% CI = 0.629–0.672 in WII; and C-index=0.622, 95% CI = 0.612–0.632 in UKB), p for difference <0.001 for both cohorts. Subsequent analyses were therefore based on 36 metrics.

Identification of circadian rhythm clusters

The first step in the analysis (procedure shown in Fig. S3), principal component analyses (PCA) for data reduction of the 36 metrics, yielded the eight principal components (described above, Tables S4S6). K-means cluster analysis on these principal components found nine clusters to be the optimal representation of circadian rhythm metrics in participants in both cohort studies. This decision was based on examination of statistical criteria (Supplementary Figs. S4 and S5) and interpretability of clusters.

The standardised and raw mean and SD of the 36 metrics for participants in each cluster are shown in Fig. 2 and Table S7 (WII) and Fig. 3 and Table S8 (UKB). A plot of the acceleration signal for a medoid participant (most central individual) in each cluster is shown in Supplementary Figs. S6 (WII) and  S7 (UKB). Ranking the mean values by magnitude in each cluster (Tables S7 and S8) allowed their interpretation in relation to RAR, daytime activity, sleep, and chronotype.

Fig. 2: Standardised mean scores on 36 metrics as a function of nine circadian rhythm clusters in the Whitehall II accelerometer sub-study.
figure 2

IG intensity gradient, L5 least active 5-h period, LIPA light intensity physical activity, M10 most active 10-h period, MVPA moderate to vigorous physical activity, PA physical activity, RAR rest-activity rhythm, SB sedentary behaviour, TPar,d transition probability from activity to rest during the day, TPra,d transition probability from rest to activity during the day, TPsw,n transition probability from sleep to wake during the night, TPws,n transition probability from wake to sleep during the night, WASO wake after sleep onset. Results shown in the figure are mean standardised scores (z-scores) for each metric in each cluster. Source data are provided as a Source Data file.

Fig. 3: Standardised mean scores on 36 metrics as a function of nine circadian rhythm clusters in the UK Biobank accelerometer sub-study.
figure 3

IG intensity gradient, L5 least active 5-h period, LIPA light intensity physical activity, M10 most active 10-h period, MVPA moderate to vigorous physical activity, PA physical activity, RAR rest-activity rhythm, SB sedentary behaviour, TPar,d transition probability from activity to rest during the day, TPra,d transition probability from rest to activity during the day, TPsw,n transition probability from sleep to wake during the night, TPws,n transition probability from wake to sleep during the night, WASO wake after sleep onset. Results shown in the figure are mean standardised scores (z-scores) for each metric in each cluster. Source data are provided as a Source Data file.

Clusters were the same in both cohort studies, unless otherwise specified. Participants in cluster 1 (RAR++/PA++) had the most robust (++) RAR—characterised by highest amplitude, less fragmented and more stable rhythm —and active (++) pattern both in terms of light-intensity PA (LIPA) and moderate-to-vigorous PA (MVPA) while those in cluster 9 (RAR−−/PA−−) had the poorest (−−) RAR and least (−−) active pattern. In UKB, cluster 9 (RAR−−/PA−−/Chronotype−) was also characterised by delayed chronotype. Participants in clusters 2 and 3 had robust (+) RAR but opposite sleep profiles with cluster 2 characterised by shorter sleep duration (in UKB), more inneficient and fragmented sleep (Sleep−) and cluster 3 by longer sleep duration, more efficient, and less fragmented sleep (Sleep+). In addition, in WII, cluster 2 (RAR+/LIPA+/Sleep−) was also characterised by more LIPA and in UKB by both more LIPA and MVPA (RAR+/PA+/Sleep−). In both WII and UKB, cluster 3 (RAR+/LIPA+/Sleep+) participants had daytime activity characterised by light-intensity PA (LIPA+). Participants in cluster 4 (MVPA++) had a RAR pattern close to the mean, but with higher relative amplitude that was characterised by higher values in metrics related to high-intensity PA, including MVPA features and intensity gradient (IG) slope.

Participants in clusters 5, 6, 7, and 8 had poor (−) RAR pattern. Cluster 5 (RAR−/Chronotype−−) had the most delayed (−−) chronotype with later sleep period and later activity during the day. Participants in clusters 6 (RAR−/PA−/Sleep+) and 7 (RAR−/PA−/Sleep−−) had both low daytime activity but differed in sleep. Participants in cluster 8 (RAR−/PA+/Restless sleep) showed restless (agitated) sleep where mean acceleration during sleep was the highest (>1 SD and 2 SD from the sample mean in WII and UKB, respectively). In WII, participants in cluster 8 (RAR−/PA+/Restless sleep) also presented shorter and less efficient sleep. In this cluster, relative amplitude was low and cosinor mesor high, resulting from both more daytime PA and more nighttime movements (measured by mean acceleration) compared to participants in other clusters.

Clusters 3 (RAR+/LIPA+/Sleep+) and 6 (RAR−/PA−/Sleep+) in both cohort studies were the largest (15.7/17.1% (WII/UKB) and 17.7/16.1%, respectively) and clusters 8 (RAR−/PA+/Restless sleep) and 9 (RAR−−/PA−−/(Chronotype−, in UKB)) the smallest (5.9/3.4% and 6.5/6.9%, respectively). The size of clusters was similar in WII and UKB, with the largest difference observed for cluster 4 (MVPA++) (9.5 and 13.0%). Data on subjective chronotype preference in UKB (Table S9) showed 8.2% of participants to report themselves as being ‘Definitely an evening person’ and this was particularly the case for participants in clusters identified with delayed chronotype using accelerometer data—clusters 5 (RAR−/Chronotype−−) and 9 (RAR−−/PA−−/Chronotype−) (21.9% and 16.1%, respectively, compared to 6.0% in cluster 3 (RAR+/LIPA+/Sleep+), p < 0.001).

Factors associated with circadian rhythm clusters

Table 1 (WII) and Table 2 (UKB) show participants’ characteristics in the nine circadian rhythm clusters among participants with complete data (N = 3968 in WII and N = 51,507 in UKB). Participants in UKB who worked shifts (N/Total N with available data=4441/34,467) were more likely to be in cluster 1 (RAR++/PA++), 2 (RAR+/PA+/Sleep−), 7 (RAR−/PA−/Sleep−−) and 9 (RAR−−/PA−−/Chronotype−), all p ≤ 0.01 for comparison with cluster 3 (RAR+/LIPA+/Sleep+), Table S10. Associations of sociodemographic, behavioural, and health-related factors (independent variables) with circadian rhythm clusters (dependent variable) were examined using multinomial regressions. These covariates were measured concurrently to accelerometer measures except, in UKB, for marital status, employment status, Townsend deprivation index, behavioural factors, body mass index (BMI), hyperlipidaemia, and central nervous system (CNS) medication, that were measured in 2006–2010 (in average (SD) 5.7 (1.1) years before accelerometer measure).

Table 1 Characteristics of participants as a function of nine circadian rhythm clusters in the Whitehall II accelerometer sub-study
Table 2 Characteristics of participants as a function of nine circadian rhythm clusters in the UK Biobank accelerometer sub-study

In these analyses, cluster 3 (RAR+/LIPA+/Sleep+) was used as the reference due to its size (N = 619, 15.6%, in WII and N = 8811, 17.1%, in UKB), and a healthy profile with all mean values on all metrics being between –1 and +1 SD. These results (Table 3 for WII and Table 4 for UKB) show participants in clusters characterised by disrupted RAR and low PA, irrespective of sleep, to be older; these were clusters 6 (RAR−/PA−/Sleep + ), 7 (RAR−/PA−/Sleep−−), and 9 (RAR−−/PA−−(/Chronotype− in UKB)). The reference cluster was more likely to be composed of women; all p < 0.015. Non-white participants were more likely to be part of clusters characterised by poor sleep (clusters 2, 7, and 8), late chronotype (cluster 5), and in WII also poorer RAR/activity profile (cluster 9); all p < 0.01. Participants who were not married/cohabiting were more likely to be in the reference cluster 3 (RAR+/LIPA+/Sleep+) in both cohort studies; all p < 0.044; except for cluster 2 (RAR+/LIPA+/Sleep−) and 6 (RAR−/PA−/Sleep+) in WII where no difference were seen with cluster 3. Participants in employment were more likely to be in cluster 8 (RAR−/PA+/Restless sleep) and less likely to be in clusters 9 (RAR−−/PA−−(/Chronotype− in UKB)) and 5 (RAR−/Chronotype−−) in WII, and in cluster 4 (MVPA++) and 6 (RAR−/PA−/Sleep+) in UKB. WII participants living in more deprived areas were more likely to be in clusters characterised by disturbed sleep and late chronotype (clusters 2, 5, 7, and 8) and in UKB, they were more likely to be in the cluster with high-intensity PA (cluster 4) and all clusters characterised by poorer RAR (clusters 5 to 9).

Table 3 Association (Odds Ratio (95% CI)) of socio-demographic, seasonal, behavioural, and health-related factors with circadian rhythm clusters in the Whitehall II accelerometer sub-studya
Table 4 Association (Odds Ratio (95% CI)) of socio-demographic, seasonal, behavioural, and health-related factors with circadian rhythm clusters in the UK Biobank accelerometer sub-studya

Among behavioural factors, we found current smokers to be more likely to be in clusters 2 (RAR+/LIPA+/Sleep−), 5 (RAR−/Chronotype−−), 7 (RAR−/PA−/Sleep−−), and 9 (RAR−−/PA−−(/Chronotype− in UKB)); this was also the case for cluster 6 (RAR−/PA−/Sleep+) in WII and 8 (RAR−/PA+/Restless sleep) in UKB. Individuals who consumed less fruit and vegetables were more likely to be in clusters 7 (RAR−/PA−/Sleep−−) and 9 (RAR−−/PA−−(/Chronotype− in UKB)), and in UKB also in clusters 5 (RAR−/Chronotype−−) and 6 (RAR−/PA−/Sleep+). Exposure to light was higher in cluster 1 (RAR++/PA++) that had the most robust RAR and highest PA and lower in clusters characterised by poorer RAR and less PA (clusters 5, 6, 7, and 9); all p < 0.001.

Participants with higher BMI were less likely to be in clusters with higher PA (1 and 4), although the association was not statistically significant in WII (p = 0.29), and more likely to be in clusters with poorer RAR (clusters 5, 6, 7, 8, and 9), all p < 0.001 except for cluster 8 (RAR−/PA+/Restless sleep) in WII. Participants with diabetes were less likely to be in clusters characterised by MVPA (cluster 1 in UKB and cluster 4 in both studies), and more likely to be in the less active clusters in UKB (6, 7, and 9), while associations were in the same direction in WII although not statistically significant (p = 0.08, p = 0.31, p = 0.23, respectively). In UKB, participants with diabetes were also more likely to be in clusters 5 (RAR−/Chronotype−−) and 8 (RAR−/PA+/Restless sleep). No strong associations were found for hypertension and hyperlipidaemia, besides higher odds (1.26 (95% CI = 1.15–1.38)) of hypertension among participants in cluster 9 (RAR−−/PA−−) compared to cluster 3 (RAR+/LIPA+/Sleep+) in UKB. Participants with at least one chronic disease were more likely to be in cluster 5 (RAR−/Chronotype−−) and cluster 9 (RAR−−/PA−−(/Chronotype− in UKB)) in both studies, and in UKB also in clusters 6 (RAR−/PA−/Sleep+) and 7 (RAR−/PA−/Sleep−−). Use of CNS medication was less likely among participants in cluster 1 (RAR++/PA++) and more likely in clusters with poorer RAR (5 to 9).

Discussion

The main premise of this study is that rest-activity rhythm (RAR) and chronotype dimensions alone may not comprehensively capture the bio-behavioural expression of circadian rhythm patterns in real-world settings. We identified circadian rhythm clusters in two population-based cohort studies using a method that allows clusters to be identified using a combination of metrics that reflect dimensions of the behavioural expression of circadian rhythm (36 metrics to reflect RAR, daytime activity, sleep, and chronotype). This approach is in contrast to studies that either use metrics related to a few dimensions or use dimensions of circadian rhythm independently of each other. The stability of clusters in our analysis was confirmed in two independent large-scale population-based cohort studies. The characteristics of individuals in these clusters differed as a function of sociodemographic, behavioural, and health-related factors, highlighting the validity of these clusters. Age, sex, ethnicity, cohabitation status, deprivation index, smoking, fruit and vegetable consumption, light exposure, BMI, diabetes, prevalence of chronic diseases, and CNS drugs were the factors that differed the most across the nine circadian rhythm clusters.

Circadian rhythm is altered at older ages2 and in individuals with chronic diseases2,3. There is increasing interest in using accelerometer-based measures of RAR and/or chronotype to examine associations of circadian rhythm with adverse health outcomes8,12,14. These studies have generally not considered daytime activity or sleep to measure circadian rhythm, and cannot be seen as a comprehensive reflection of the sleep-wake cycle. Some studies6,9,16 have included M10 (mean acceleration during most active 10 h) and L5 (mean acceleration during 5 least active hours) as broad indicators of PA and sleep, respectively, but this approach does not fully capture differences in activity intensity (for example LIPA vs MVPA) or sleep disruptions such as fragmentation (for example number of sleep bouts). Previous studies have shown both sleep17,18 and PA19, two key dimensions of the sleep-wake cycle, to be important for health. Results from our preliminary analysis show use of all 36 metrics to have better predictive performance for mortality than the prediction model with 10 commonly used metrics (RAR, chronotype, M10 and L5). Furthermore, eight of the nine circadian rhythm profiles we identified were characterised by RAR but also presented differences in the other dimensions. These findings suggest that focusing on RAR and/or chronotype alone may introduce misclassification bias in studies of sleep-wake cycles and health.

A thorough search of the relevant literature did not identify previous accelerometer-based studies on identification of distinct circadian rhythm profiles using metrics of RAR, daytime activity, sleep, and chronotype. Some studies have identified profiles based either on hourly activity levels20,21,22, RAR metrics7, daytime activity23,24, or sleep25,26 metrics. Other studies combined daytime activity and sleep metrics and found between three27 and five28 clusters. Another study combined RAR, sleep, and chronotype metrics and identified three profiles that differed primarily on sleep characteristics29. One notable study used data reduction on 28 metrics of the four circadian rhythm dimensions to derive 13 component scores15 that essentially reflect the correlational structure of the circadian rhythm metrics. These results are comparable to the first step in our analysis—the PCA results where the first three components reflected RAR and daytime activity, sleep, and chronotype, while the others principal components more complex combinations of the metrics.

We added a second step by undertaking cluster analyses on the components from the PCA. The advantage of cluster analysis is that it allows individuals to be grouped in clusters that have high within-group similarity and between-group differences, while also accounting for the interrelated structure of the data. Our findings show how measures of RAR, daytime activity, sleep, and chronotype are distributed in the population. For example, clusters 5 (RAR−/Late chronotype), 6 (RAR−/PA−/Sleep+), and 7 (RAR−/PA−/Sleep−−) all have a similar pattern of poorer RAR but differed in the intensity and duration of daytime activity, chronotype, and sleep characteristics. Whether the longitudinal associations of these clusters with health outcomes differ needs to be examined in future studies.

Our two-step approach (PCA for data reduction followed by cluster analyses) captures the behavioural manifestation of the endogenous biological clock by capturing multiple aspects of the sleep-wake cycle using data from two population-based cohort studies. Accelerometers provide scalable, cost-effective measures of the sleep-wake cycle—one of the most visible manifestations of the circadian clock. It remains important to consider the influence of extrinsic factors, such as seasonal variations or activity routines, that may also influence individual behaviours. For example, we found employment status to affect the clusters identified in our study, as also demonstrated in another study where work environment and schedule impacted sleep30. Exposure to zeitgebers such as light plays an important role in the synchronisation of the internal clock, and misalignment of zeitgebers with endogenous biological clock can lead to chronic perturbations of circadian rhythm31. Further studies are needed to determine the association between behavioural manifestation of the biological clock and its molecular and genetic markers.

In our study the size of the nine circadian rhythm clusters was slightly different in WII and UKB, possibly due to difference in the distribution of age, sex, and education as well as differences in accelerometer protocol. However, in both studies participants’ sociodemographic, behavioural, and health-related factors differenced across the clusters. Previous studies have reported participants’ characteristics to vary across metrics such as interdaily stability and intradaily variability16,32,33, relative amplitude, and M10 (most active 10 h period) and L5 (least active 5 h period)16. Older age has been shown to be associated with a more stable32,33 but fragmented RAR5,16,32—as denoted by higher interdaily stability and intradaily variability—and less daytime activity16. Women and people married/cohabiting have been shown to have more stable and less fragmented rhythm32,33. Smoking has also been found to be associated with a less stable rhythm16,32,33 while higher BMI was associated with less stable but also more fragmented rhythm16,32.

Our results add to the existing evidence by showing heterogeneity in associations with sociodemographic, behavioural and health-related factors for a given level of RAR depending on the other dimensions of circadian rhythm—daytime activity, sleep, and chronotype. We found women and persons married/cohabiting to have a more stable and less fragmented rhythm, undertake light intensity physical activity, and have good sleep levels. Living in a more deprived area was more common in clusters characterised by sleep disturbances and delayed chronotype in WII, whereas in UKB it was more common in clusters with poorer RAR. Participants with greater outdoor light exposure had more robust RAR and were likely to be active, independently of the season, as previously shown in UKB using self-reported time spent outdoors34. As the number of blind participants was small ( < 0.01% in WII and UKB), this group is unlikely to influence findings on light exposure. Participants in clusters characterised by poor sleep and/or disturbed RAR were more likely to be on CNS medications. Poor health status indicated by prevalent diabetes or chronic diseases tended to be more frequent in participants with poorer RAR and less physical activity.

Strengths of our study include objective measurement of circadian rhythm, the large number of metrics covering four behavioural circadian rhythm dimensions, statistical analysis in two cohort studies, use of open-source software to derive metrics, and analyses on a large set of covariates to study their association with circadian rhythm clusters. This study also has limitations. First, both cohort studies included mainly white participants in the UK and whether circadian rhythm differs in other countries remains unknown. Second, a sleep diary was not used in UKB, potentially leading to some misclassification of sleep and waking time due to inaccuracies in estimation of sedentary behaviour, sleep, and chronotype metrics, particularly if participants were sedentary before sleep35. Three, in UKB some covariates were measured at inclusion rather than concurrently with the accelerometer measure but a previous study in UKB did not find major differences in associations for covariates drawn from baseline or at the same time as the accelerometer36. In addition, the pattern of results in our study was similar in WII and UKB, suggesting little impact of when covariates were measured. Fourth, in the absence of a consensus on measuring naps, we did not use data on naps in the identification of circadian rhythm clusters. We recognise that naps are likely to contribute to intradaily variability, transition probabilities from activity to rest, and sedentary behaviour.

Using data from two large, population-based cohort studies, our study demonstrates the clustering in behavioural circadian rhythm metrics, resulting in participants to be grouped in nine distinct clusters. Despite the RAR metrics being similar across several clusters, our approach shows considerable differences in the distribution of daytime activity, sleep, and chronotype across the clusters. Not considering these differences in studies that focus on RAR does not allow accurate measurement of circadian rhythm. Our analyses show a number of sociodemographic, behavioural and health-related factors to differ across these clusters. Given the dysregulation of circadian rhythm at older ages and in certain chronic diseases, the findings of the present study highlight the need for better characterisation of circadian rhythm.

Methods

Study populations

The WII study is an ongoing prospective cohort study established in 1985–1988 on 10,308 British civil servants, consisting of a clinical examination every 4 to 5 years. An accelerometer measure was added to the 2012–2013 wave of data collection for 4880 men and women (aged 60–83 years) seen at the London clinic and those living in the South-Eastern regions of England who underwent clinical examination at home23. Written informed consent for participation was obtained at each contact. Research ethics approval was obtained from the University College London ethics committee at each wave (latest reference number 85/0938).

The UKB is a prospective population-based cohort study on over 500,000 men and women aged between 40 and 69 years, registered with the UK National Health Service. Baseline measurements took place between 2006 and 2010. The accelerometer sub-study was undertaken on 236,488 individuals between 2013–201537. Participants aged ≥60 at accelerometer sub-study were retained in the present analysis as our focus was circadian rhythm clusters at older ages. Approval was received from the National Information Governance Board for Health and Social Care and the National Health Service North West Centre for Research Ethics Committee (reference number 11/NW/0382). All participants gave written consent for participation. Access to UKB data in our study was under application number 96856.

Accelerometer wear protocols

In WII, individuals wore a GENEActiv (Activinsights Ltd, Kimbolton, UK) triaxial accelerometer on their non-dominant wrist and completed a sleep diary over 24 h, for nine consecutive days. Accelerometer data were sampled at 85.7 Hz with the acceleration expressed relative to gravity (1 g = 9.81 m/s2), and processed using GGIR R package version 3.1–738. Raw acceleration was calculated with the metric Euclidean Norm Minus One (ENMO), negative values rounded to zero. Then these values were corrected for calibration error and non-wear time39. Data from days 2 to 9 (penultimate day) were retained in the analysis, comprising a total of seven consecutive full day windows (daytime waking period and following sleep period). Sleep onset and waking up times were detected using an algorithm, guided by a sleep diary39. The daytime waking periods were defined as the time between waking up to start the day and sleep onset at night, and sleep periods between sleep onset and waking up to start the day. Participants were included in the analyses if they had data from at least five valid full day windows (defined as wear times ≥2/3 of both waking and following sleep periods)40.

In UKB, participants wore an Axivity AX3 triaxial accelerometer on their dominant wrist over 24 h for seven consecutive days, starting at 10 h on day 137. Data were sampled at 100 Hz. As a sleep diary was not used, sleep periods were detected using an algorithm that has been described previously39. To maximise measures covering full day windows (sleep period and following waking period), data from the sleep onset of the first night until sleep onset on the last night were retained in the analyses, comprising a total of six consecutive full day windows. Participants were included in our analyses if they had data on at least five valid full day windows40.

Measures of behavioural circadian rhythm

A total of 36 accelerometer-assessed metrics (Supplementary Table S1) were derived to measure four dimensions of circadian rhythm—RAR, daytime activity, sleep, and chronotype.

RAR (6 metrics) was based on the day windows and included parametric measures based on the cosinor function fitted to the log of the acceleration signal41, that yielded three metrics: mesor (average activity according to the function), amplitude (the peak of the function minus the mesor), and R2 (goodness of fit)41. The three non-parametric metrics were: interdaily stability, intradaily variability5, and the relative amplitude based on the mean acceleration of M10 and the least active 5 h period (L5)42.

Daytime activity (15 metrics) was based on activity levels during waking periods and classified as SB, LIPA, and MVPA43. For each activity level, we calculated the daytime total duration, number and mean duration of bouts. We also considered: mean acceleration during waking period and during \({{{{\rm{M}}}}}_{10}\), transition probabilities (TPs) to switch from activity levels (LIPA or MVPA) to rest (SB), and from rest to activity, during the waking period (\({{{{\rm{TP}}}}}_{{{{\rm{ar}}}},{{{\rm{d}}}}}\) and \({{{{\rm{TP}}}}}_{{{{\rm{ra}}}},{{{\rm{d}}}}}\))5, and two parameters derived from the acceleration distribution (intensity gradient (IG) intercept, and slope)44.

Sleep (10 metrics) included: total sleep duration, sleep efficiency during the sleep period, mean acceleration during sleep and during L5, number and mean duration of sleep bouts, TPs from sleep to wake, or wake to sleep during the night (\({{{{\rm{TP}}}}}_{{{{\rm{sw}}}},{{{\rm{n}}}}}\) and \({{{{\rm{TP}}}}}_{{{{\rm{ws}}}},{{{\rm{n}}}}}\)), duration of wake after sleep onset (WASO), and mean duration of wake bouts.

Chronotype (5 metrics) was characterised by: time of sleep onset and waking, starting time of M10 and L5, and the cosinor acrotime (timing of the peak of the function).

All metrics were derived using GGIR R package version 3.1–7.

Participants’ characteristics

In WII, covariates were measured at the 2012–2013 wave either using questionnaire, clinical examination, or data from electronic health records, including the Hospital Episode Statistics (HES) and Mental Health Services Data Set. In UKB, covariates were either extracted from the baseline examination in 2006–2010 or extracted from electronic health records (HES).

Socio-demographic factors included age at accelerometer data collection, sex (defined based on the British civil servants record in WII, and from the UK National Health Service, updated in some cases by self-report of the participant, in UK Biobank), ethnicity (white or non-white), education (lower secondary school or less, secondary school, or higher than secondary school), cohabitation status (married/cohabiting or not married/cohabiting), whether in employment (yes or no), and area deprivation index (Index of multiple deprivation in WII45 and Townsend deprivation index in UKB46).

Behavioural factors included smoking status (never, ex-, or current-smoker), alcohol consumption (0, 1–14, or >14 units/week), fruit and vegetable consumption (less than or ≥ twice daily), and exposure to daylight, estimated as the proportion of waking time exposed to >1000 lux based on the light sensor from the GENEActiv accelerometer in WII47 and self-report of the time spent outdoors in UKB34. The light measures were standardised to account for season of accelerometer measurement. In additional analyses, we used self-reported chronotype preference in UKB, categorised as: Definitely a morning person, More a morning than evening person, More an evening than a morning person, and Definitely an evening person. Data on shift work were only available in UKB and were used in additional analyses, categorised as: No and Yes.

Health-related factors included BMI (in kg/m², calculated from measures of height and weight at the clinical examination), diabetes (measured using fasting glucose at the clinical examination ≥7.0 mmol/L, self-reported doctor-diagnosed diabetes, use of anti-diabetic medications, or record in HES), hypertension (defined as systolic/diastolic blood pressure ≥140/90 mmHg, use of antihypertensive drugs, or record in HES), hyperlipidaemia (defined as low-density lipoproteins >4.1 mmol/L or use of lipid-lowering drugs), chronic disease (yes, no out of coronary heart disease, stroke, Parkinson’s disease, chronic obstructive pulmonary disease, heart failure, depression, other mental disorders, cancer, liver disease, and arthritis, and extracted from multiple sources including HES records, mental health records, cancer registry, and data collected at the Whitehall clinical examinations in WII, and from HES, clinical measures, and medication data in UKB), use of CNS medication (among anti-depressant, antipsychotic, hypnotic, anxiolytic, or Parkinson medications), reported by participant in the section on medication in the questionnaire.

Mortality data were used in preliminary analysis. Mortality cases were ascertained using linkage to the National Health Service (NHS) mortality register in both cohort studies. Participants were followed from date of accelerometer wear to November 2023 in WII and November 2022 in UKB.

Statistical analysis

To justify the use of 36 rather than the 10 commonly used metrics (interdaily stability, intradaily variability, relative amplitude, cosinor mesor, cosinor amplitude, cosinor acrotime, M10 and L5 timing and mean acceleration6,8,48,49,50) to characterise circadian rhythm we compared the predictive performance of these two sets of metrics for risk of mortality. We first used principal component analysis (PCA) to extract components to reflect the metrics too strongly correlated to be used in the prediction analysis. The number of components for each set of metrics (10 and 36) was selected using the eigenvalue criterion (≥1) and the cumulated percentage of variance explained (at least 75%). Cox regression analysis was used to examine the association of these components with mortality, and their predictive performances were compared using the C-index.

As the predictive performance of components using 36 metrics was better, subsequent analyses were conducted using the set of 36 metrics in parallel in both cohort studies to identify how circadian rhythm metrics clustered in individuals, and then examine the association of the clusters with covariates. The correlation between the 36 metrics (reflecting RAR, daytime activity, sleep, and chronotype) was examined using Pearson’s correlation coefficients. Then, to examine how the 36 metrics were distributed across individuals we undertook cluster analyses; the workflow for steps in these analyses is shown in Supplementary Fig. S3. The procedure to identify clusters in both WII and UKB was made in three steps. First, standardisation of the 36 metrics using scale function to make them comparable. Second, we applied the data reduction method PCA on this standardised metrics using prcomp function. Using eigenvalue ≥ 1 and cumulative variance explained ≥75% of the variance of the 36 metrics, we selected N component. Finally, we used k-means clustering with the kmeans function on the retained components, testing for K = 4 to 12, with 5000 numbers of random starts, and 10,000 maximum number of iterations. To determine the number of cluster K retained, we examined statistical criteria—within cluster sum of squares, Silhouette coefficient, and Davies-Bouldin index—to assess the statistical differences between clustering solutions. As the differences were small, the interpretability of the clusters was further examined. For every K (number of clusters) solution, we looked at the visualisation of the solutions to interpret each cluster. The aim was to determine whether the addition of a new cluster provided new relevant information in terms of differences in the circadian rhythm profiles. We also considered cluster sizes, ensuring that each contained at least N ≥ 200 participants to be suitable for analysis.

In WII, the assumptions for post-hoc Tukey tests were not met and to allow interpretation of the clusters as a function of each metric we used the non-parametric simultaneous rank test procedure. In UKB, the large sample size did not allow non-parametric simultaneous rank test to be used, leading us to use the post-hoc Tukey test with ANOVA51.

After identification of circadian rhythm clusters, we used multinomial logistic regression to examine associations of participants’ characteristics and season of wear (independent variables) with circadian rhythm clusters (dependent variable). The reference category in the multinomial regression was chosen based on size (number of participants), the cluster where none of the metric means was above or below one SD from the population mean, and better health status.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.