Background

Chronic kidney disease (CKD) involves kidney damage or a reduction in function, typically defined by an eGFR less than 60 mL/min per 1·73 m² or markers of kidney damage like albuminuria1. CKD poses a significant and escalating global burden, with an estimated 10% of adults worldwide affected by this condition. CKD is independently associated with an elevated risk of all-cause mortality, cardiovascular mortality, and progression to end-stage renal disease2,3,4. It leads to 1.2 million deaths and 28.0 million years of life lost annually5,6.

CKD is a complex disorder with a wide range of causes, including systemic illnesses and conditions like hypertension, diabetes, autoimmune diseases, genetic predisposition, and congenital abnormalities7,8,9,10. Other factors, such as inflammation, exposure to toxins in the environment, and certain medications, can also contribute to the development of CKD11,12,13,14,15. The heterogeneity and complexity of CKD pose challenges to its control and management. The dilemma between health benefits from the intervention and economic consideration in terms of cost-effectiveness calls for re-classification of CKD to enable precise and effective intervention in those at the greatest risk of mortality. In the present study, we hypothesize that distinct subpopulations within individuals with CKD can be identified through the use of multidimensional phenotypic data by performing the consensus clustering analysis16,17. It is further hypothesized that these subpopulations will have varying risks of future death. We aimed to explore the heterogeneity among CKD patients, and further investigated the differences in mortality risks between clusters.

Results

Study population

Totally, 6,526 individuals with CKD were included in the final analysis. The mean age of study population was 60.3 years, 50.5% of the subjects were women and 58.0% were non-white. The mean eGFR was 95.1 ± 24.2 ml/min per 1.73 m2 and the median UACR was 72.5 mg/g (interquartile range, 42.0–181.0 mg/g). The total person-months of follow-up were 591,634 person-months.

Determination of cluster number

Figure 1 shows the matrix heatmaps of the pairwise consensus for each number of cluster analysis (Fig. 1. a-f). The cumulative distribution functions (CDFs) (Fig. 2a) and the proportion increase of the area under the CDFs (Fig. 2b) indicated that to category patients into 4 clusters could best display the profiles of participants with CKD. For 4 clusters, the mean consensus scores were larger than 0.96 for all clusters, with a larger number suggesting better stability of cluster analysis (Fig. 2c).

Fig. 1
figure 1

Consensus matrix heatmaps using behavioral risk factors, metabolic risk factors, diseases history, and social determinants.

The consensus matrix heat maps of K = 2 to K = 7 using 45 variables. The brightest blue color in the diagonals represents perfect consensus where two individuals always group together, the white color represents perfect consensus where two individuals always group separately, and the blue color scales in between represent ambiguous consensus where two individuals are grouped together in some runs but separately in others. (a) K = 2; (b) K = 3; (c) K = 4; (d) K = 5; (e) K = 6; (f) K = 7.

Fig. 2
figure 2

Consensus cumulative distribution function and cluster consensus score to determine at what number of clusters.

(a) The lines by colors indicating the cumulative distribution functions (CDF) of the consensus matrix for each number of clusters. The CDF reaches an approximate maximum, consensus and cluster confidence is at a maximum at this K. (b) The changes in area under the CDF curves comparing K and K − 1. For K = 2, there is no K − 1, so the total area under the curve rather than the relative increase is plotted. The relative increases in consensus are used to determine K at which there is appreciable increase. (c) The mean consensus score for different numbers of clusters (K ranges from 2 to 7). Cluster is indicated by color following the same color scheme as the cluster matrices and tracking plots. The bars are grouped by K which is marked on the horizontal axis. High values indicate a cluster has high stability and low values indicate a cluster has low stability. For K = 4, the mean consensus score was 0.97 for cluster 1, 0.99 for cluster 2, 0.96 for cluster 3, 0.97 for cluster 4.

Distributions of characteristics by clusters

Demographic, anthropometric, behavioral and clinical data for the four clusters are shown in Table 1. The distribution of the 45 baseline variables exhibited statistically significant differences between the four clusters, with the exception of the following variables: not being a citizen of the US, levels of fasting plasma glucose, aspartate transaminase, γ-glutamyl transferase, triglyceride, and uric acid; history of diagnosed diabetes, hypercholesterolemia, congestive heart failure, and treated diabetes (Table 1). The four clusters showed distinctive patterns displayed by standardized means of cluster variables (Fig. 3). Supplementary Table 2 showed the pairwise comparisons of the clustering variables between clusters, and most differences achieved Bonferroni adjusted statistical significance (p < 0.0083). Supplementary Table 3 shows the standard mean differences in age and metabolic risk factors between each two clusters. The distributions of the metabolic-related factors that with large differences (standard mean differences > 0.2) are shown in Fig. 4, and the values of the features were centered to a mean value of 0 and a standard deviation of 1. Cluster 1, including 508 (7.8%) participants, was marked by relatively higher levels of BMI, WC, eGFR, ALT, and DBP, but lower levels of 2 h PG and SBP, and younger age. Cluster 2 comprised 1,439 (22.1%) participants and cluster 3 constituted 2,550 (39.1%) participants. The characteristics of cluster 2 and 3 fell somewhere in between cluster 1 and 4. Cluster 4, included 2,029 (31.1%) participants, was marked by the worst metabolic traits (higher levels of SBP, 2 h PG) and kidney function markers (lowest levels of eGFR).

Table 1 Baseline characteristics of study participants by cluster.
Fig. 3
figure 3

Distribution of the cluster feature variables.

All the values of metabolic risk factors and sodium intake were centered to a mean value of 0 and a standard deviation of 1, the other variables were presented by proportion. (a) Cluster 1; (b) Cluster 2; (c) Cluster 3; (d) Cluster 4.

Fig. 4
figure 4

Distribution of the metabolic risk factors feature variables that with larger difference.

All the values of cluster feature were centered to a mean value of 0 and a standard deviation of 1. All the negative values were converted to positive value by added a fixed value to yield polygon areas related to adverse variable effects. The centre of the figure indicates 0. (a) Cluster 1; (b) Cluster 2; (c) Cluster 3; (d) Cluster 4.

2-h PG, 2-hour post-load glucose; WC, waist circumference; BMI, body mass index; ALT, alanine transaminase; SBP, systolic blood pressure, DBP, diastolic blood pressure.

Associations of CKD clusters with mortality

Totally, 2,327 participants with CKD at baseline died during the follow-up, including 850 CVD death, 368 cancer death, and 1109 death due to other causes. The Kaplan-Meier survival curves showed that there were significant different risks of all-cause mortality, CVD mortality, and other cause mortality by cluster (log-rank test all P < 0.001) (Fig. 5). However, there was no significant difference in cancer mortality between clusters (P = 0.310) (Fig. 5). The risks of all-cause mortality were significantly higher in cluster 2 (HR, 1.25; 95%CI, 1.02–1.54) and cluster 3 (HR, 1.25; 95%CI, 1.03–1.52), and in cluster 4 (HR, 1.48; 95%CI, 1.22–1.81) compared with cluster 1 (Table 2). Only cluster 4 presented significantly higher risk of CVD mortality (HR, 1.45; 95%CI, 1.05–2.02) and mortality due to other causes (HR, 1.60; 95%CI, 1.20–2.14) compared with cluster 1 (Table 2).

Fig. 5
figure 5

Kaplan-Meier survival analysis illustrated the survival probability in different clusters.

The log rank P values for comparisons were < 0.001 in (a, b, and d) and 0.310 in (c). (a) All-cause mortality; (b) cardiovascular disease mortality; (c) Cancer mortality 3; (d) Mortality due to other causes.

Table 2 Comparison of mortalities between clusters by cox proportional hazards model.

Sensitivity analysis

We further performed the sensitivity analysis among the 1,207 individuals with impaired baseline kidney function (eGFR < 45 ml/min per 1.73 m2 or UACR ≥ 300 mg/g), we also identified four clusters (Supplemental Fig. 2) with similar cluster consensus scores and proportion of clustered values (Supplemental Fig. 3) as the main analyses. The baseline characteristics of the four predefined clusters are presented in Supplementary Table 4. Similar distributions of characteristics were observed across the predefined clusters. Cluster 1, 2, 3, and 4 consisted of 129 (10.7%), 309 (25.7%), 427 (35.3%), and 342 (28.3%) participants, respectively. Cluster 1 was characterized by relatively higher levels of BMI, WC, eGFR, ALT, DBP, and sodium intake, as well as a younger age. Conversely, cluster 4 was marked by the highest levels of SBP and the lowest levels of eGFR, BMI, and WC. Moreover, participants in cluster 4 exhibited a higher prevalence of stroke, heart attack, and cancer or malignancy, as well as elder age, lower educational attainment, and the lowest sodium intake (Supplementary Table 4). The mortality risks were compared between each cluster. Even though the HRs for all-cause mortality presented increased risk trend by cluster, they did not reach statistical significance. Only cluster 4 had significantly higher risk of mortality due to other causes compared with cluster 1 (HR, 1.65; 95%CI, 1.03–1.81) (Supplementary Table 5).

Another sensitivity analysis was conducted by conducting cluster analysis after excluding highly correlated variables (Supplementary Figs. 4 and 5). Six variables were excluded and 39 variables remained, the results of cluster analysis showed almost the same with the main analysis (Supplementary Table 6).

Discussion

Using unsupervised consensus clustering, we identified four distinct clusters based on 45 baseline variables. These clusters exhibited significant differences in the risk of mortality. Cluster 4, characterized by older age, higher levels of SBP, 2 h PG, lower levels of eGFR, and higher prevalence of CVD, had the highest risk of future mortality. Stratification by eGFR and UACR confirmed patient heterogeneity within the overall population. An understanding of which CKD phenotypes relate to mortality will guide both mechanistic research and promote precision medicine for CKD management.

In this study, the participants exhibited a high prevalence of obesity, as indicated by a BMI of ≥ 30 kg/m² and a WC of ≥ 103 cm. Cluster 1 has the highest levels of BMI and WC, whereas Cluster 4 shows the opposite pattern in our study. Existing evidence suggests a positive association between general adiposity (represented by BMI) and central adiposity (represented by WC) with the risk of all-cause mortality in the general population18,19. It has been observed that both higher and lower BMI values are associated with an increased risk of mortality, with obesity (BMI ≥ 30 kg/m²) being responsible for the majority of the mortality burden18. However, interestingly, in our study, cluster 4, which had the lowest levels of BMI and WC, presented the higest with mortality risk. Previous studies have indicated that the association between BMI and mortality risk weakens substantially with increasing age20,21. Similar findings have been observed for waist circumference in participants aged 60 years and older19. This suggests that increased weight may actually confer a survival advantage in older individuals20. In our study, the participants were predominantly elderly, with a mean age of ≥ 59 years. This age composition may have attenuated the association between obesity and the risk of all-cause mortality. Overall, our findings highlight the complex relationship between obesity, age, and mortality risk, emphasizing the need for further investigation to better understand the impact of obesity on mortality in different age groups.

The patients in cluster 1 had much higher sodium intake than those in other clusters. The patients in cluster 1 are younger and more likely to be men. Generally, men and younger people have higher energy requirements than women and older people. A correlation between sodium and energy intake is evident. In addition, low sodium intake may suggest a lower nutritional status, which is consistent with our findings that patients in cluster 1 have the lowest mortality risk. Another study also reported a correlation between higher sodium intake and lower all-cause mortality risk in American individuals22.

As we are aware, hypertension is the most common comorbidity observed in patients with CKD23. In our present study, cluster 4 was characterized by a higher prevalence of hypertension and poorer kidney function. Uncontrolled hypertension in CKD patients can lead to detrimental clinical outcomes, such as myocardial infarction, acute coronary syndrome, ischemic stroke, heart failure, and even death24. Previous research has shown that SBP has a stronger association with adverse kidney outcomes than DBP in CKD patients25. In cluster 4, we observed higher levels of SBP and relatively lower levels of DBP, suggesting that SBP might be a more potent predictor of CKD progression and have a greater impact on future mortality in CKD patients compared to DBP. In conclusion, our findings emphasize the importance of effectively managing blood pressure in patients with CKD.

Our study focused on patients belonging to cluster 4, characterized by poorer kidney function (indicated by lower levels of eGFR) and a higher prevalence of CVD, including coronary heart disease and heart attacks. This particular group demonstrated an elevated risk of future mortality. It is widely acknowledged that eGFR and albuminuria are strongly and independently associated with various adverse outcomes, such as progression to kidney failure, cardiovascular events, and death1. Our study confirmed these previous findings by establishing a significant association between decreased eGFR and the risk of all-cause mortality. Prior research has consistently demonstrated that CVD is a leading cause of death among patients with CKD1. In line with these findings, our study revealed that higher-risk groups for cardiovascular events experienced higher overall mortality rates and an increased risk of CVD-related mortality. These results underscore the importance of reducing cardiovascular risk factors to slow down CKD progression and strive to prolong the lifespan of patients with CKD. By prioritizing strategies aimed at managing CVD risk factors in this population, we can potentially improve outcomes, reduce mortality rates, and enhance the overall quality of life for individuals with CKD.

Our study highlights the significant heterogeneity observed within the broad category of CKD, which encompasses various pathologies and etiologies. In order to explore this diversity, we employed consensus clustering using discrete data elements. The use of consensus clustering allowed us to identify distinct patient groups with specific clinical markers. Of particular interest is the stratification of patients based on their kidney function, distinguishing between those with relatively preserved or impaired kidney function. This stratification provides valuable insights for tailoring clinical interventions and treatments in a more precise manner. However, it is important to note that further clinical studies are required to validate the utility of this approach. These studies should investigate whether the identified patient groupings indeed lead to more targeted interventions and improved clinical outcomes, thus fulfilling the promise of precision medicine.

The study has several limitations that need to be considered. Firstly, it’s important to acknowledge that cluster analysis is an exploratory data analysis method. It relies on the input of data, such as the 45 baseline variables used in our analysis, to identify clusters of individuals with similar characteristics. Therefore, if different patient characteristics were used as input variables, it is likely that different CKD subgroups would be identified. It’s essential to interpret the results of cluster analysis within this context and understand that the identified subgroups are specific to the variables used in the analysis. Second, our clustering analysis relied mainly on well-known clinical traits associated with CKD, some other factors may also contribute to identifying sub-types of CKD. Future studies should explore the inclusion of novel biomarker data, such as inflammatory markers, genomics, metabolomics, and proteomics, to enhance our understanding of CKD heterogeneity. Finally, our study population was limited to the US population. Therefore, our findings may not be generalizable to other populations. It is essential to validate our results in a broader range of populations in the future.

By using the probably associated factors of CKD, our study demonstrates that substantial heterogeneity with sophisticated phenotyping and underlying disease pathologies exists within the broad category of “CKD”. Our findings also added new information to the 2012 KDIGO CKD categorization guideline. The underlying heterogeneity in CKD staging by eGFR and urine ACR is not adequately captured. Patients with CKD can be subtyped into four separate subgroups based on 45 baseline features by applying data-driven clustering to multidimensional patient data, including demographics, biomarkers from blood and urine, health status and behaviors, and medication use. These subgroups are associated with future mortality risks. Identification of clinically subgroups among CKD patients provides an important step toward patient classification and management. Further research is needed to validate and refine these clusters, explore their reproducibility, and investigate underlying mechanisms. These findings contribute to a deeper understanding of CKD and have implications for improved patient outcomes.

Methods

Study population

The US National Health and Nutrition Examination Survey (NHANES) is an ongoing cross-sectional, national, stratified, multistage probability surveys of the civilian, noninstitutionalized US population26. About 10 thousand individuals in each survey for every 2 years are investigated to complete a household interview and underwent a physical examination. In present study, we used data from NHANES III 1999–2000 to 2017–2018 which included 10 cycles of survey. All nonpregnant participants with 20 years or older and CKD were included in the analysis. A detailed description of the NHANES database is publicly available (http://www.cdc.gov/nchs/nhanes.htm).

We combined ten consecutive survey cycles which included 101,316 participants. Participants were excluded for aged younger than 20 (n = 46,235), being pregnant at examination or uncertain of the pregnancy status (n = 2,639), and having received dialysis treatment in the past 12 months (n = 162). Participants were also excluded due to missing data on mortality or outliers of cluster variables. Finally, a total of 6,526 eligible subjects with CKD were included in the analysis (Supplementary Fig. 1). The NHANES protocol was approved by the National Center for Health Statistics (NCHS) Institutional Review Board and written informed consent was obtained. Our study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline27.

Mortality data

NHANES III participant records were linked to mortality data from the National Death Index based on death certificate data (https://www.cdc.gov/nchs/data-linkage/mortality-public.htm). International Classification of Diseases, Tenth Revision codes were used to identify the cause of deaths. Cardiovascular death includes death due to diseases of heart (I00-I09, I11, I13, I20-I51) and cerebrovascular diseases (I60-I69). Cancer death was classified using codes C00-C97. Person-months were calculated in months from the date of interview to date of death or most recent vital status record.

Variables

Sociodemographic characteristics, behavioral risk factors and history of diseases were administered in the survey by trained interviewers using questionnaires. The physical examinations and laboratory tests in NHANES took place in a mobile examination center using standardized protocols and calibrated equipment, and details on the data collection are described on the website (https://wwwn.cdc.gov/nchs/nhanes/analyticguidelines.aspx). We selected 45 clinically available and novel factors from over all variables that were measured at NHANES study baseline. Variables were selected on the basis of literature review for those that are most clinically relevant to CKD28,29,30,31,32. We excluded variables with over 10% missing data or small variability (e.g., binary variable with < 5%). The 45 variables included variables of sociodemographic characteristics (n = 9), behavioral risk factors (n = 5), biomarkers of metabolic status (n = 19), and history of diseases (n = 12).

The sociodemographic characteristics included ethnicity, education level, family income, marriage status, citizenship status, housing, employment, health insurance, and regular health care access. The ethnicity was categorized into non-Hispanic white and non-white. Low education attainment was defined as attaining less than a high school education. The income-to-poverty ratio (annual family income divided by the poverty threshold adjusted for family size and inflation) was used as a measure of income. The low income-to-poverty ratio was defined as less than 100%. The marriage status was dichotomized as currently married and not married. The citizenship status includes two options, citizen by birth or naturalization and not a citizen of the US. For investigating housing status, the participants were asked “Is this home owned, being bought, rented, or occupied by some other arrangement by you or someone else in your family?” Employed status was dichotomized as unemployed and employed, student, or retired. The type of health insurance was also dichotomized as with and without health insurance. The participants were asked “Is there a place that you usually go when you are sick, or do you need advice about your health?” for investigating the health care access.

The behavioral risk factors included currently smoking status, currently drinking status, physical activity level, sleep duration and sodium intake. Current smoking was defined as having smoked at least 100 cigarettes in life and smoking at present. Current alcohol drinking was defined as taking at least 12 times drinks of any type of alcoholic beverage in the last 12 months. Physical activity was estimated using the form of the Global Physical Activity Questionnaire by asking questions about the intensity, duration, and frequency of physical activity. There were different types of physical activity assessment tools used in NHANES 1999–2000 to 2005–2006 and NHANES 2007–2008 to 2017–2018. In NHANES 1999–2000 to 2005–2006, the duration of the physical activity was not ascertained, each physical activity was assigned an intensity value (metabolic equivalent tasks) that represents the ratio of the energy expenditure of the activity to the basal metabolic rate. In NHANES 2007–2008 to 2017–2018, total metabolic equivalent minutes per week were calculated as the measurement of physical activity level for the subjects. A higher level of physical activity was defined as having a higher metabolic equivalent/week than the median levels of the metabolic equivalent/week by investigation cycles. The usual sleep duration at night was investigated and long sleep duration was considered as sleep longer than 8 h. The sodium intake was collected through dietary interview.

The physical examinations and laboratory tests of metabolic biomarkers were collected using standardized protocols and assays, including body mass index (BMI), waist circumference (WC), systolic blood pressure (SBP), diastolic blood pressure (DBP), HbA1c, fasting plasma glucose (FPG), 2 h postprandial glucose (2 h PG), alanine transaminase (ALT), aspartate transaminase (AST), γ-glutamyl transferase (GGT), triglyceride (TG), high-density lipoprotein cholesterol (HDL-cholesterol), total cholesterol, low-density lipoprotein cholesterol (LDL-cholesterol), C-reactive protein (CRP), serum albumin, uric acid (UA), Urinary albumin creatinine ratio (UACR), and eGFR.

The information on currently taking prescribed medicine for treating hypertension, diabetes, and hypercholesterolemia was investigated in the survey. The history of diseases, including congestive heart failure, coronary heart disease, heart attack, stroke, and cancer or malignancy was also collected.

Definition of CKD

The eGFR was calculated using the 2009 chronic kidney disease epidemiology collaboration (CKD-EPI) equation with considering the sex, serum creatinine level and race33. Albuminuria was calculated using urinary albumin divided by the urinary creatinine based on morning spot urine. CKD was defined as eGFR level < 60 ml/min/1.73 m2 or UACR ≥ 30 mg/g. Equations expressed for specified sex and serum creatinine level were showed in the Supplementary Table 1.

Statistical analysis

We employed multiple imputation with arbitrary missing patterns to correct for response bias under the assumption of missing at random, and to maximally utilize existing risk factor data. The continuous variables with skewed distributions were log-transformed to normal distributions in the imputation process. The linear regression method was used to impute missing values for continuous variables, and the logistic regression method for variables having binary or ordinal responses. For each variable with missing data, we used the other variables to impute.

We performed consensus clustering analysis on the participants with chronic kidney disease and the continuous values were centered to a mean value of 0 and a standard deviation of 1. The clustering algorithm is to maintain high cluster consensus while maximizing the number of clusters34. With the prespecified setting a number of clusters K = 2, 3, …, 7, the consensus clustering algorithm generated a random subset that contained 80% of the data records without replacement and repeated 100 times for each number of clusters. For each random subset, the K-means (Euclidean distance-based) algorithm was conducted while each individual was assigned to one of the clusters. The frequencies of any pair of two individuals were calculated after 100 iterations, which were grouped together under each scenario of K and constructed a matrix of participantsˈ pairwise consensus value34. In the consensus matrix, consensus values ranged from 0 (never clustered together) to 1 (always clustered together) were marked by white to bright blue. For each number of cluster analysis, the cluster memberships are marked by colored rectangles. The consensus matrix is ordered by the consensus clustering which is displayed as a dendrogram atop the heatmap.

The optimal number of clusters was ascertained by observing the consensus matrix heat map, the within-cluster consensus scores, and the cumulative distribution function (CDF) (range 0–1) plot34. The CDF plot showed the area under the CDFs for each K, and for a specific number of clusters, the CDF reached an approximate maximum, thus consensus and cluster confidence was at a maximum at this K. The relative change in area under the CDF curve comparing K and K − 1 was also used to determine the optimal number of clusters. The cluster consensus score, ranged between 0 and 1, was defined as the average consensus value for all pairs of individuals belonging to the same cluster. A value approached to 1 indicated better cluster stability34.

For continuous variables with normal distribution, we calculated mean and standard deviation; for continuous variables with skewed distributions, we calculated median and interquartile range; and for categorical variables, we presented count and percentage. To present the cluster profiles of the 45 variables, we graphically displayed the standardized means of continuous variables (metabolic risk factors and sodium intake) and proportions for categorical variables (the other variables) by cluster. The frequencies of endpoints related to death were calculated as the number of events divided by person­months of observation censored at the date of event occurrence, death, or follow-up visit, whichever came first. Adjusted Cox proportional hazard models were used and hazard ratios (HRs) with 95% confidence intervals (CIs) were calculated to estimate the risks for all-cause mortality, CVD mortality, cancer mortality and mortality due to other causes by cluster.

All the statistical analysis was conducted using the R version 4.2.3 (http://www.r-project.org). Consensus clustering analysis was done using the ConsensusClusterPlus function (minimum K = 2, maximum K = 7, replication = 100, proportion of random subset = 0.8, Euclidean distance-based K-means algorithm) in the ‘ConsensusClusterPlus’ package in R version 4.2.3 (http://www.r-project.org).