Introduction

Early alterations in body water distribution may occur in the initial stages of disease progression, and extracellular water (ECW) has been proposed as a potential preclinical disease marker. Advances in bioelectrical impedance analysis (BIA) have facilitated the quantification of body fluids, including total body water (TBW) and fat-free mass, making such assessments more feasible in both research and clinical settings1,2. As a result, the ECW/TBW has gained recognition as a clinically relevant indicator of fluid distribution3,4.

Numerous studies have explored the relationship between body water distribution and health outcomes across diverse patient populations. For instance, research has demonstrated that in patients with cancer—including gastrointestinal cancer, cancer cachexia, and cancer-related sarcopenia—a higher ECW/TBW ratio is associated with reduced survival rates2,5,6. Studies conducted among patients with chronic obstructive pulmonary disease7, cardiovascular diseases8, acute heart failure9, renal diseases10, and those undergoing peritoneal dialysis11 and hemodialysis12,13 have also identified a significant association between fluid imbalance and poor prognosis. A study on the appendicular extracellular-to-intracellular water ratio (AE/I) and all-cause mortality in middle-aged and elderly individuals in Japan found that AE/I was positively associated with all-cause mortality and could serve as a predictive marker for early death in this population14. Consistent with these findings, two studies on COVID-19 patients reported that an elevated ECW/TBW ratio was associated with increased mortality rates, establishing ECW/TBW as a significant predictor of mortality15,16. Collectively, these studies suggest that a higher ECW/TBW ratio is linked to greater disease severity and a significantly increased risk of all-cause mortality. However, the precise quantitative relationship between ECW/TBW and all-cause mortality risk in the general population remains unclear.

To address this knowledge gap, the National Health and Nutrition Examination Survey (NHANES), a large and nationally representative study of the U.S. population, provides a unique research opportunity. By integrating comprehensive demographic, clinical, biochemical, and BIA-derived data, NHANES facilitates robust statistical analyses and an in-depth examination of ECW/TBW in relation to key health indicators while accounting for important confounders17,18.

In this study, we demonstrate that a higher ECW/TBW ratio is significantly associated with an increased risk of all-cause mortality in the general population, with an inflection point of 42.4%. A deeper understanding of this relationship may provide valuable quantitative insights for early detection, individualized ECW/TBW interventions, and improved health management strategies.

Materials and methods

Data source

We utilized a cohort from the United States, the NHANES, by the U.S. National Center for Health Statistics, has been conducting 2-year cycles since 1999 to assess the health and nutritional status of the U.S. population19,20. NHANES uses a complex, stratified, multistage probability sampling method in each survey to select a nationally representative sample. The survey encompasses two components: household interviews and medical examinations conducted at a Mobile Examination Center (MEC)21. The data from NHANES collected between 1999 and 2004, conducted by the Centers for Disease Control and Prevention (CDC). Data collection included in-home interviews, standardized physical examinations, and laboratory tests performed at the MEC.

Standard protocol approval, registration, and patient consent

All methods were performed in accordance with the relevant guidelines and regulations. The NHANES study protocols adhere to the Declaration of Helsinki and have been ethically approved by the Ethics Review Committee of the National Center for Health Statistics (NCHS), with all participants providing written informed consent22. Our secondary analysis followed the STROBE guidelines for cohort studies, and did not require additional institutional review board approval23. Detailed information regarding NHANES’ methodology and ethics is available on the CDC and NCHS website (https://www.cdc.gov/nchs/nhanes/index.htm).

Study design and population

For this study, we included participants aged 18 years or older with complete and valid body composition measurements. Exclusion criteria included individuals under 18 years of age, pregnant women, participants with lacking essential body composition information, and those missing data on all-cause mortality. Figure 1 illustrates the detailed participant enrollment process and the final analytic sample.

Fig. 1
figure 1

The study’s flow diagram.

Ascertainment of all-cause mortality

Mortality data and causes of death were obtained by linking the cohort database to the CDC’s National Death Index as of December 31, 2019. All-cause mortality was defined as death from any cause24. The follow-up period for each participant was calculated from the date of MEC testing to the date of death or the end of the follow-up period (December 31, 2019).

Assessment of ECW/TBW

ECW and TBW data were collected during MEC testing using the HYDRA ECW/ICW Bio-Impedance Spectrum Analyzer (Model 4200) manufactured by Xitron Technologies, Inc. (San Diego, California). These assessments were conducted by trained health technicians following standardized protocols detailed in the Body Composition Procedures Manual available on the NHANES website25. BIA is a non-invasive method for assessing body composition with significant clinical potential. It accurately determines body composition by emitting electrical signals and measuring resistance across a broad spectrum of frequencies26,27,28,29,30. Among various BIA-derived parameters, ECW/TBW serves as a key indicator of volume status, expressed as ECW-to-TBW ratio3,4.

Definitions of other covariates

Covariates were selected based on established clinical relevance and previous research findings11,14,31,32. They included the following:

  • Sociodemographic factors: Age, sex, marital status (married, living with a partner, or living alone), race/ethnicity (non-Hispanic White, non-Hispanic Black, Mexican American, or other races), educational level (< 9 years, 9–12 years, or > 12 years), and family income. Family income was categorized as low (poverty income ratio [PIR] ≤ 1.3), medium (1.3 < PIR ≤ 3.5), or high (PIR > 3.5)21.

  • Lifestyle factors: Smoking Status: Never (fewer than 100 cigarettes smoked in a lifetime), former (quit smoking after smoking ≥ 100 cigarettes), and current21. Drinking Status: Defined by consuming ≥ 12 alcoholic drinks in any given year33. Physical Activity: Sedentary, moderate (≥ 10 min of light to moderate exercise in the past 30 days), or vigorous (≥ 10 min of heavy exercise in the past 30 days)21.

  • Clinical conditions: Hypertension, diabetes, cancer, and coronary heart disease, each determined by self-reported physician diagnosis.

  • Anthropometric measures: Body Mass Index (BMI): Computed as weight (kg) / height (m)2.

  • Laboratory measurements: Albumin and Creatinine: Analyzed using a Hitachi Model 917 multichannel analyzer (Roche Diagnostics, Indianapolis, IN)34. C-Reactive Protein (CRP): Measured by latex-enhanced nephelometry on a Dade Behring Nephelometer II Analyzer (BNII)35.

Statistical analysis

In accordance with the complex sampling design of NHANES, our analysis incorporated sample weights, clustering, and stratification. Following NHANES guidelines for survey sample weights, we applied MEC weights. The sampling weights were calculated as follows: weights for 1999–2002 were assigned as two-thirds of the 4-year MEC weight, while weights for 2003–2004 were one-third of the 2-year MEC weight.

Categorical variables were presented as unweighted counts (weighted percentages), whereas continuous variables were expressed as mean (standard deviation, SD) or median (interquartile range, IQR). Chi-square tests were used to compare categorical variables, while regression analyses were applied for continuous variables. Potential multicollinearity among covariates was assessed using the variance inflation factor (VIF), with VIF ≥ 5 indicating significant multicollinearity. Additionally, multiple imputation was applied for covariates with missing values.

The KM method was employed to estimate survival rates, and the log-rank test was used to assess differences between groups. Multivariable Cox proportional hazard models were utilized to estimate HRs and 95% CIs for the association between ECW/TBW and all-cause mortality. For continuous variables, the HR represents the relative risk increase per unit change in the variable compared to a reference value. For categorical variables, the HR was computed relative to the baseline category (Quartiles ECW/TBW < 44.6%). Three models were constructed:

  • Model 1: Adjusted for sociodemographic characteristics (age, sex, race/ethnicity, marital status, education level, and family income) and NHANES cycle.

  • Model 2: Further adjusted for lifestyle factors (smoking status, drinking status, and physical activity).

  • Model 3: Fully adjusted for all covariates, on the basis of model 2, further including hypertension, diabetes, cancers, coronary heart disease, BMI, creatinine, CRP, and albumin.

Furthermore, we employed restricted cubic spline models to construct HR curves, exploring potential non-linear relationships between ECW/TBW and all-cause mortality and analyzing the association threshold between ECW/TBW and mortality risk. The bootstrap resampling method was used to determine inflection points.

Additionally, fully adjusted, complex sampling-weighted subgroup analyses were conducted using Cox proportional hazards models to examine associations stratified by key variables, including age (< 35 or ≥ 35 years), gender (male vs. female), marital status (married or living with a partner vs. living alone), education level (< 9 vs. 9-12 vs. > 12 years), family income (low vs. medium vs. high), smoking status (never smoker vs. former smoker vs. current smoker), drinking status (drinker vs. non-drinker), physical activity (sedentary vs. moderate vs. vigorous), comorbid hypertension (yes vs. no), diabetes (yes vs. no), cancer (yes vs. no), BMI (< 25.0 vs. ≥ 25.0 kg/m2), creatinine levels (< 75.0 vs. ≥ 75.0 umol/L), CRP (< 0.15 vs. ≥ 0.15 mg/dL), and albumin (< 40 vs. ≥ 40 g/L). Likelihood ratio testing was conducted to assess interactions among subgroups, and results were visualized using a forest plot.

To ensure the robustness of our findings, we conducted a series of sensitivity analyses. First, to address potential reverse causality bias, we repeated the analyses after excluding deaths that occurred within the first 2 years of follow-up. Second, we performed the same analyses after excluding participants who had cancer at baseline. Third, an E-value was computed to estimate the potential effect of unmeasured confounders.

All analyses were carried out using R software (version 4.2.1; http://www.Rproject.org), the R survey package (version 4.1-1), and Free Statistics software (version 2.1; Beijing Free Clinical Medical Technology Co., Ltd.). A two-tailed p-value < 0.05 was considered statistically significant. Data were analyzed from November and December 2024.

Results

Study population

Of the 31,126 participants of the NHANES 1999–2004, 24,788 were excluded for the following reasons:

  • Under 18 years of age (n = 14,065);

  • Pregnancy (n = 927);

  • Missing ECW/TBW data (n = 9791);

  • Missing all-cause mortality data (n = 5).

Thus, 6338 participants were included in the analysis (Fig. 1).

Baseline characteristics

During the 20-year follow-up, 307 (4.64%) all-cause mortality occurred. Among these, malignant neoplasms (cancer mortality) 84 (1.32%), diseases of heart (cvd mortality) 55 (0.82%), and non-specific mortality including diabetes mellitus 14 (0.25%), cerebrovascular diseases 13 (0.20%), chronic lower respiratory diseases 7 (0.11%), nephrotic syndrome 4 (0.04%), influenza and pneumonia 3 (0.05%), Alzheimer’s disease 0 (0%), accidents (unintentional injuries) 41 (0.68%), and all other causes (residual) 86 (1.4%) (Table S1).

Table 1 presents the baseline characteristics of the 6338 study participants according to their ECW/TBW quartiles. Extrapolated to the national level, these results are representative of about 97,627,499 individuals in the U.S. The mean (SD) age of the study participants was 31.4 (10.1) years, and 3,279 (51.1%) individuals were male. The median ECW/TBW was 42.4 (40.4, 44.6)%. Individuals who with a high ECW/TBW often tended to be older (32.5 [10.6] y versus 29.6 [9.5] y, respectively); more often female (1366 [85.8%] versus 203 [11%], respectively); and more likely to be married or living with a partner (801 [62%] versus 706 [55.8%], respectively). They were also more frequently non-Hispanic white (776 [74.1%] versus 518 [58.9%], respectively). They tended to have a higher incidence of diabetes (60 [3.2%] versus 19 [1.2%], respectively) and cancers (53 [4.8%] versus 12 [1.1%], respectively). Higher ECW/TBW was associated with higher C-reactive protein (0.2 [0.0, 0.4] mg/dL versus 0.1 [0.0, 0.3] mg/dL, respectively), and lower BMI (25.9 [6.5] kg/m2 versus 27.8 [5.3] kg/m2, respectively), creatinine (61.7 [24.5] umol/L versus 81.8 [27.4] umol/L, respectively), and albumin levels (43.0 [3.4] g/L versus 45.3 [3.2] g/L, respectively). In addition, individuals who with a low ECW/TBW often tended to engage in vigorous physical activity (837 [53.3%] versus 566 [37.4%], respectively).

Table 1 Population characteristics by ECW/TBW.

Kaplan–Meier curves for all-cause mortality

KM survival curves for all-cause mortality showed that participants in the higher ECW/TBW group (Q4, Quartiles ECW/TBW ≥ 44.6%) had lower survival ( log -rank P < 0.038) (Fig. 2). KM survival curves for male/female all-cause mortality and KM survival curves for cancer mortality / non-specific mortality showed the same trend, as shown in Supplementary Figure S2a and S2b.

Fig. 2
figure 2

Kaplan–Meier survival rate for all-cause mortality. The survival rates for the two groups with a lower ECW/TBW (Quartiles ECW/TBW < 44.6%) and a higher ECW/TBW (Q4, Quartiles ECW/TBW ≥ 44.6%). X-axis: survival time (months). Y-axis: survival probability.

Relationship between ECW/TBW and mortality

The univariate Cox analysis showed that a higher ECW/TBW was significantly associated with elevated risks of all-cause death (HR of 1.05, 95%CI 1.00–1.10, P < 0.049). A higher ECW/TBW was independently associated with elevated risks of all-cause death even after adjustment for age, sex, marital status, race/ethnicity, education level, family income, smoking status, alcohol drinking status, physical activity, hypertension, diabetes, cancers, coronary heart disease, BMI and creatinine (adjusted HR of 1.05, 95% CI 1.00–1.10, P = 0.041). Compared to participants with lower ECW/TBW (< 44.6%, Quartiles), those with upper quartile of ECW/TBW (≥ 44.6%, Q4) had an adjusted HR of 1.60 (95% CI 1.15–2.23, P = 0.005) for all- cause mortality. The risk of all-cause death increased by 60% for those with ECW/TBW more than 44.6% (Table 2). Hazard ratios of cancer / cvd / non-specific mortality by ECW/TBW were shown in Supplementary Table S2.

Table 2 Hazard ratios of all-cause mortality by ECW/TBW.

Non-linear relationship between ECW/TBW and all-cause mortality

To investigate the potential non-linear relationship between ECW/TBW and all-cause mortality, we performed a complex sampling-weighted Restricted Cubic Spline (RCS) analysis using an ECW/TBW value of 42.4% as the reference point. The statistical analysis demonstrated a significant overall association (P < 0.001) and a notable non-linear trend (P = 0.009), indicating a strong non-linear relationship between ECW/TBW and all-cause mortality risk. The resulting curve exhibited a J-shape: the risk remained stable before the reference point, while the risk of all-cause mortality rose with increasing ECW/TBW after the reference point (Fig. 3). Association between ECW/TBW and cancer / cvd / non-specific mortality by RCS were shown in Supplementary Figure S3.

Fig. 3
figure 3

Association between ECW/TBW and all-cause mortality by restricted cubic spline (RCS). The association between ECW/TBW and all-cause mortality exhibited a J-Shaped curve (P = 0.009). The solid curved line represents the estimates for the association of ECW/TBW with mortality, and shading, the 95% CI. ECW, extracellular water; ECW/TBW, extracellular water/total body water ratio; TBW, total body water.

In a subsequent threshold (two-piecewise) analysis, the HR for all-cause mortality was 1.10 (95% CI 1.05–1.15, P < 0.001) for ECW/TBW values ≥ 42.4%, indicating that the risk of all-cause mortality increased by 10% for every one-unit increase in ECW/TBW. However, no significant association was observed between the extracellular water-to-total body water ratio and all-cause mortality in participants with ECW/TBW values < 42.4% (Table 3). Threshold effect analysis of the relationship of ECW/TBW with cancer / cvd / non-specific mortality was shown in Supplementary Table S3.

Table 3 Threshold effect analysis of the relationship of ECW/TBW with all-cause mortality.

Stratified analyses based on variables

Stratified analyses were conducted to evaluate potential effect modifiers in the relationship between ECW/TBW and all-cause mortality (Fig. 4, Supplementary Figure S1). Subgroup analyses were conducted on samples categorized by age, sex, BMI, smoking status, alcohol drinking status, hypertension, and diabetes. No significant interactions were detected across any subgroups (P > 0.05), indicating that the positive association between ECW/TBW and all-cause mortality was largely consistent across these factors. Additionally, when accounting for multiple testing, a p-value of less than 0.05 for the interaction with age may not represent statistical significance.

Fig. 4
figure 4

The relationship between ECW/TBW and all-cause mortality according to basic features. Except for the stratification component itself, each stratification factor was adjusted for all other variables, including age, sex, body mass index (BMI), smoking status, alcohol drinking status, hypertension, diabetes, respectively. HR, hazard ratio; CI, confidence interval.

Sensitivity analysis

Sensitivity analyses were performed to test the stability of association results. The association between ECW/TBW and all-cause mortality risk is shown in Tables 2 and 4. The significant association remained, on the one hand, after excluding deaths that occurred within 2 years of follow-up, on the other hand, after excluding participants who had cancer (P < 0.05) (Table 4). The E-value for this association was 2.58 (95% CI 1.57), indicating that a considerable unmeasured confounder would be required to negate these observed effect estimates. Thus, the observed associations appear robust to potential biases (Supplementary Figure S4).

Table 4 Sensitivity analyses, hazard ratios of all-cause mortality by ECW/TBW.

Discussion

The findings of this cohort study demonstrate that a higher ECW/TBW ratio is associated with an increased risk of all-cause mortality. Overall, an increase of one-unit in ECW/TBW was associated with a 5% increase in the risk of all-cause mortality. Subsequent analyses revealed a J-shaped relationship between ECW/TBW and all-cause mortality, with an inflection point at approximately 42.4%. Below this inflection point, the association between ECW/TBW and all-cause mortality was not significant; however, beyond the inflection point, the risk of mortality increased sharply with rising ECW/TBW values. Specifically, each one-unit increase in ECW/TBW beyond the inflection point was associated with a 10% higher risk of all-cause mortality. Moreover, individuals with an ECW/TBW greater than 44.6% had a 60% higher risk of all-cause mortality compared to those with an ECW/TBW less than or equal to 44.6%.

The association between ECW/TBW and mortality rates is both epidemiologically and clinically plausible. From an epidemiological perspective, elevated ECW/TBW has been strongly linked to an increased risk of cancer, hemodialysis, kidney diseases, and cardiovascular diseases7,8,12,36,37, which are major contributors to all-cause mortality. Interestingly, our study revealed that the association between ECW/TBW and non-specific mortality (ECW/TBW ≥ 42.4%, HR of 1.14, 95% CI 1.03–1.25, P = 0.010) was more pronounced than the association with deaths caused by malignant neoplasms or heart diseases. This discrepancy might be attributed to the limited sample size for these specific conditions.

From a clinical perspective, higher ECW/TBW levels are indicative of individual hydration status4, nutritional status13,38, and inflammation levels14,39. Nutritional deficiencies can compromise the integrity of cell membranes, leading to a reduction in intracellular water and a relative increase in extracellular water40. Additionally, decreased protein synthesis and increased protein breakdown further deplete intracellular water while drawing more water into the extracellular space13. Inflammation also plays a role by increasing vascular permeability, causing fluid to leak into the interstitial spaces and augmenting extracellular water41. Edema and fluid overload further elevate the ECW/TBW levels. Collectively, these mechanisms contribute to an increase in the ECW/TBW. These factors are known to play pivotal roles in the onset and progression of various diseases38,42, which might be the leading causes of all-cause mortality.

Water is the cornerstone of life on Earth and a major component of the human body, playing a critical role in maintaining cellular homeostasis43,44. A healthy human body typically consists of 50% to 70% water, which is essential for transporting nutrients to cells and expelling waste, highlighting its indispensable role in bodily functions28. Generally, approximately 33% of TBW resides outside the cells, while the remaining 67% is intracellular. Any increase in extracellular osmolarity draws fluid out of the cells, resulting in an expansion of ECW45. The measurement of intracellular and extracellular water can reflect the functional status of tissues and provide insights into the physiological state of human cells.

This study highlights that ECW/TBW serves as an important biomarker for assessing hydration status, nutritional status, and levels of inflammation-factors that play pivotal roles in the epidemiology of numerous diseases. BIA is widely recognized for its speed, non-invasiveness, reproducibility, ease of use, and safety in body composition assessment6,46. These advantages make BIA a practical and accessible tool for widespread use in clinical and research settings47. As an important clinical index to evaluate body composition, ECW/TBW has a normal range of 35.6–40.3%32. Monitoring ECW/TBW might facilitate a deeper understanding of disease progression, predicts patient outcomes, and offers opportunities for more personalized treatment and management strategies. Multiple strategies may help reduce ECW/TBW. (1) Sodium restriction: A high-salt diet elevates extracellular osmotic pressure, driving water from the intracellular to the extracellular compartment and increasing ECW/TBW. Reducing salt intake can lower this osmotic gradient, diminishing extracellular fluid volume48. (2) Weight management: Lowering body mass index (BMI) and waist circumference is typically accompanied by beneficial shifts in body composition, potentially reducing ECW/TBW49. (3) Exercise and physical activity: In individuals with abdominal obesity, increasing physical activity may improve metabolic parameters and redistribute fluid, ultimately decreasing extracellular water48. (4) Nutritional and inflammatory status: Nutritional support (e.g., protein supplementation, optimized caloric intake) combined with anti-inflammatory therapy (e.g., IL-6 inhibitors) has been shown in clinical trials to reduce ECW/TBW and improve survival rates50,51. Addressing these factors under medical supervision could help optimize fluid balance.

While this study has several strengths, including a comprehensive analysis of a nationally representative sample, long-term follow-up, and careful adjustment for various confounding factors, some limitations should be acknowledged.

First, although we employed multivariable Cox proportional hazards models and performed extensive subgroup and sensitivity analyses, residual confounding cannot be entirely ruled out. NHANES data capture a wide range of variables, but not all potential confounders (e.g., over-the-counter medication use or acute illness) may be accounted for. Second, ECW/TBW data were only collected in NHANES from 1999 to 2004, restricting our ability to use data from other NHANES cycles for further validation. Third, although BIA measurements were performed using standardized protocols, specific conditions such as fasting state, bladder fullness, time of day, and recent physical activity were not fully controlled or documented. Fourth, a one-time measurement of ECW/TBW was used in this study. Future research should consider serial measurements to strengthen the robustness and reliability of the analysis. Fifth, this study focused solely on all-cause mortality, and disease-specific mortality was not fully investigated due to the small sample sizes for specific conditions. Finally, further research is warranted to validate the application of ECW/TBW as a population-level screening tool. As this study was conducted in a U.S. population, investigating the association between ECW/TBW and all-cause mortality in other populations would enhance the generalizability of these findings.

Conclusion

In conclusion, this national cohort study identified a J-shaped relationship between the ECW/TBW and all-cause mortality risk among U.S. adults, with a critical inflection point at approximately 42.4%. These findings provide strong evidence that ECW/TBW may serve as a non-invasive and easily obtainable screening tool for estimating mortality risk and identifying high-risk individuals. This novel concept has the potential to be integrated into public health strategies for nutritional guidance and health management in the general population.