Introduction

Chronic kidney disease (CKD), affecting more than 10% of the people worldwide, becomes a growing public health issue1,2,3. With the progression of CKD, the risk of developing end-stage renal disease (ESRD) increases gradually and so do the cardiovascular complications and deaths4,5. Traditional risk factors of CKD progression include hypertension, diabetes mellitus (DM), and higher body mass index (BMI)6; however, these factors do not provide a very accurate prediction of the speed of glomerular filtration rate (GFR) decline. Some patients’ GFR show rapid decline, while others remain stable7,8.

Several investigations indicated that higher baseline GFR, such as renal hyperfiltration, may relate to subsequent rapid GFR decline. Higher baseline GFR have been found in early stages of DM if hyperglycemia was not well controlled9,10,11,12,13. The pathophysiology of hyperfiltration in DM comprised several possible mechanisms, such as ultrastructural changes caused by imbalanced release of cytokines and growth factors, and disturbance of vascular tone due to an imbalance of vasoactive humoral factors in response to hyperglycemia12,14,15. Moreover, most observational studies and meta-analyses have shown a significant relation between higher baseline GFR and subsequent rapid GFR decline in patients with DM16,17,18,19.

The phenomenon of higher baseline GFR was also found in people with intolerant fasting glucose, obesity, pregnancy, or high protein diet20,21,22. However, there is limited information about the relation between higher baseline GFR and subsequent GFR decline in patients without DM. A recent study showed that higher baseline GFR was significantly related to more rapid decline in GFR over time in patients without DM by using a linear mixed regression model with a random intercept and slope, i.e. the variations in the baseline GFR and in the changes in GFR respectively, to resolve the statistical issue of mathematical coupling17. Mathematical coupling, defined as one variable containing the whole or part of another, could yield a spurious correlations between two variables irrespective of any true association, thereby leading to questionable conclusions3,17,23,24,25. Although linear mixed model can tackle the issue of mathematical coupling, correct specification of the random effects is essential for yielding a meaningful interpretation of the correlation between the random intercept and slope26. However, previous studies did not always describe their model specifications in detail, and it is, therefore, unclear whether the issue of mathematical coupling has indeed been resolved.

To investigate the relationship between the baseline GFR and subsequent GFR change, we used an early stages of CKD cohort comprising patients with DM and without DM. We used linear mixed model to overcome the mathematical coupling in the analysis of the relation between the baseline GFR and its changes. We also compared results from correct and incorrect specifications of mixed models to assess the impact of mathematical coupling. We hypothesized that higher baseline GFR is a risk factor of a rapid GFR decline in patients with DM but not in those without DM.

Methods and materials

Study population

This study is a retrospective analysis of data from a single medical center, National Taiwan University Hospital. Patients, who agreed to participate the early CKD program between 2011 and 2018, were included, if their ages were between 18 and 80 years and were diagnosed with chronic kidney disease stage 1 to 3A, which was defined as a 45 ml/min/1.73 m2 < GFR < 60 ml/min/1.73 m2 or GFR ≥ 60 ml/min/1.73 m2 with a urinary protein-to-creatinine ratio (UPCR) ≥ 150 mg/g or significant findings of renal pathology for at least 3 months27,28. Patients, who had acute kidney injury within three months, Child Pugh class B to C liver cirrhosis, or terminal malignancies, were excluded. We then divided the cohort into DM (including type 1 and type 2 DM) and non-DM groups to proceed with statistical analyses. This study has been approved by the Research Ethics Committee of the National Taiwan University Hospital (202006020RINB). As this study was retrospective and observational, participants written informed consent was waived by the Research Ethics Committee of the National Taiwan University Hospital.

Early chronic kidney disease program

The early CKD program was initiated by the National Health Insurance Bureau, the Ministry of Health and Welfare in Taiwan, to care patients with CKD stages 1 to 3a in high-risk population since 201129. The high-risk population of CKD progression includes people of being older than 65 years old, family history of CKD, diabetes mellitus, and hypertension.

These patients were invited to join this program on their visits to outpatient clinics. A comprehensive educational program, consisting of basic knowledge of chronic kidney disease, risk factors, lifestyle modification and medical treatment, was provided. The participants returned to the clinic every 3–6 months, according to the clinical presentations of patients and judgments of primary care physicians. They attended classes of the educational programs and undertook routine laboratory tests at least every 6 months. Once patients’ GFR was lower than 45 ml/min/1.73 m2 or UPCR1000 mg/gm, they were transferred to the Pre-ESRD program for further management29.

Data collection

Basic personal information of age, sex and underlying comorbidity were recorded. Body height, weight, blood pressure and biochemical data, including serum creatinine, low-density lipoprotein cholesterol, fasting glucose and HbA1c (only for DM patients) and UPCR were recorded at baseline and every 3 to 6 months. The estimated GFR (eGFR) was calculated by the Taiwanese MDRD equation [1.309 × (186 × (serum creatinine)−1.154 × Age −0.203 × 0.742 (if female)) 0.912] , which was developed by using linear regression of the difference on the average of log-transformed inulin clearance and the MDRD, CKD-Epidemiology Collaboration (CKD-EPI) equations, and has been validated and shown to be more accurate and precise than MDRD-4 variables and CKD-EPI equations for Taiwanese adults28,30.

Statistical analyses

We used the mean ± standard deviation (SD) to summarize continuous variables, and relative frequency for categorical variables. The differences between the two groups were analyzed by using the t-test or the chi-squared test. Variables with a non-normal distribution were analyzed by the Mann–Whitney U test. A two-level linear mixed regression model with the random intercept and slope was used to analyze eGFR measurements and solving the statistical problems of mathematical coupling. The dependent variable was the absolute value of eGFR. The time variable was the duration of observation starting from the date of the first visit up to that of the last visit in nephrology outpatient department. To test the appropriate null hypothesis in multivariable linear mixed regression model, we undertook within-subject mean centering for the duration of observation to correct for the effects of mathematical coupling between random intercept and slope23,24,25,26. The basic linear mixed model is written as:

$$eGF{R}_{ij}={b}_{0j}+{b}_{1j}Tim{e}_{ij}+{e}_{ij}$$
$${b}_{0j}={\beta }_{0}+{u}_{0j}$$
$${b}_{1j}={\beta }_{1}+{u}_{1j}$$
$$\left(\genfrac{}{}{0pt}{}{{u}_{0j}}{{u}_{1j}}\right)\sim N\left(\begin{array}{cc}0,&{\varvec{\Sigma}}\end{array}\right),{\varvec{\Sigma}}=\left(\begin{array}{cc}{\sigma }_{u0}^{2}& {\sigma }_{u01}\\ {\sigma }_{u01}& {\sigma }_{u1}^{2}\end{array}\right)$$
$${e}_{ij}\sim N\left(\begin{array}{cc}0,& {\sigma }_{e}^{2}\end{array}\right)$$

where \(eGF{R}_{ij}\) is the observed eGFR for the jth patient on ith occasion; \({b}_{0j}\) is the intercept for the jth patient \({\beta }_{0}\) is the average intercept for the whole patient population, and \({u}_{0j}\) is the random intercept, i.e. the variations in the intercepts of the whole patient population, which is assumed to follow a normal distribution with the mean of zero and variance of \({\sigma }_{u0}^{2}\); Time is the centered time variable, i.e. the centered duration of observation in years for each patient; \({b}_{1j}\) is the slope for the jth patient \({\beta }_{1}\) is the average slope for the whole patient population, and \({u}_{1j}\) is the random slope, i.e. the variations in the slopes of the whole patient population, which is assumed to follow a normal distribution with the mean of zero and variance of \({\sigma }_{u1}^{2}\); and \({e}_{ij}\) is the residual error term for \(eGF{R}_{ij}\). \({u}_{0j}\) and \({u}_{1j}\) follow a bivariate normal distribution, and the parameter \({\sigma }_{u01}\) is the covariance between random intercept and slope and can be used to calculate the correlation r between the baseline eGFR and the subsequent changes in eGFR by using the formulae: \(r=\frac{{\sigma }_{u01}}{\sqrt{{\sigma }_{u0}^{2}*{\sigma }_{u1}^{2}}}\). The effect of an independent variable on GFR change rate in mL/min per year was evaluated by using 2-way interaction between the independent variables and the centered time variable. The relation between baseline eGFR (ml/min/1.73 m2) and succeeding eGFR changes (ml/min/1.73 m2 per year) was evaluated by the correlation between the random intercept and slopes. We also conducted the same analyses without centering the time variables to assess the impact of mathematical coupling on the estimation of the correlation between the baseline eGFR and the change. We performed covariates adjustment, including sex and age in model 1, then plus BMI and systolic blood pressure (SBP) in model 2, and added current smoking status, HbA1c (only in diabetes mellitus group) and urine protein-creatinine ratio in model 3. A P-value < 0.05 was considered statistically significance. The statistical software Stata version 14 (Stata Corp College Station, Texas, USA) (https://www.stata.com/) was used for data analysis. All methods carried out were in accordance with relevant guidelines and national legal regulations.

Results

Baseline characteristics of study population

The baseline characteristics of diabetes mellitus and non-diabetes mellitus groups is shown in Table 1. A total number of 1002 patients with early-stage CKD (717 in non-DM group and 285 in DM group, respectively) and 7621 nephrology clinic visits were identified between January 1, 2011, and December 31, 2018. More than 70 percent of patients in both DM and non-DM groups were male (79.6% and 72.9%, respectively). Patients in DM group were 4.4 years older than those in non-DM group (68.0 vs 63.6 years old, p < 0.001) and had a higher BMI (mean BMI 26.0 vs 24.8 kg/m2, p < 0.001, respectively). Mean baseline eGFR was 58.5 ± 18.1 (SD) ml/min/1.73 m2 in the non-DM group and 57.4 ± 16.3 (SD) ml/min/1.73 m2 in DM group by the Taiwanese MDRD equation. The eGFR declined gradually in both DM and non-DM groups (Fig. 1), but the lengths of follow-up varied greatly among participants.

Table 1 Baseline characteristics of study population.
Figure 1
Figure 1
Full size image

The graph of estimated glomerular filtration rate (eGFR) change without within-subject mean centering for the time variable.

Fixed effects of risk factors on the baseline eGFR and changes of eGFR

Our multivariable mixed model (model 3) showed that males and older age had a significantly negative association with baseline eGFR; the average baseline eGFR of men was 4.61 ml/min/1.73 m2 lower than that of women, and the average baseline eGFR decreased by 0.50 ml/min/1.73 m2 when a patient’s age increased by 1 year. UPCR had a significantly negative relationship to the changes of eGFR in the DM group; the average change in eGFR decreased by 0.10 ml/min/1.73 m2 as UPCR increased by 10 mg/g (Table 2). In the non-DM group, male and older age also had a negative association with baseline eGFR; the average baseline eGFR of men was 10.85 ml/min/1.73 m2 lower than that of women, and the average baseline eGFR decreased by 0.44 ml/min/1.73 m2 when a patient’s age increased by 1 year, and higher systolic blood pressure showed greater changes in eGFR; the average change in eGFR increased by 0.02 ml/min/1.73 m2 when systolic blood pressure increased by 1 mmHg (Table 3).

Table 2 Fixed effects of risk factors on baseline eGFR and changes of eGFR in patients with DM.
Table 3 Fixed effects of risk factors with baseline eGFR and changes of eGFR in patients without DM.

Association between baseline eGFR and subsequent eGFR change

In the DM group, a significantly consistent negative correlation between GFR at baseline (random intercept) and GFR change rate (random slope) was found after multivariable adjustment (r =  − 0.46 [95% CI, − 0.62 to − 0.27] in model 1, r =  − 0.51 [95% CI, − 0.68 to − 0.30] in model 2, and r =  − 0.65 [95% CI, − 0.81 to − 0.39] in model 3, respectively). After within-subject mean centering for the time variable, the correlations decreased substantially (r =  − 0.25 [95% CI, − 0.44 to − 0.04] in model 1, r =  − 0.30 [95% CI, − 0.51 to − 0.05] in model 2, and r =  − 0.40 [95% CI, − 0.67 to − 0.05] in model 3, respectively) (Table 4) (Fig. 2).

Table 4 Correlation between baseline eGFR and eGFR decreased rates (random effects) in patients with DM.
Figure 2
Figure 2
Full size image

The relation between baseline glomerular filtration rate (GFR) and GFR change rate. The figure shows the best linear unbiased predictors (BLUPs) of random slopes and intercepts, indicating the relation between baseline GFR (the random intercept) and GFR change rate (random slope) in a linear mixed model after using within-subject mean centering for the time variable. DM, diabetes mellitus; CI, confidence interval. Stata version 14 (Stata Corp College Station, Texas, USA) (https://www.stata.com/).

In the non-DM group, multivariable mixed linear regression model also showed a significantly negative correlation between GFR at baseline and GFR change rate (r =  − 0.22 [95% CI, − 0.36 to − 0.06] in model 1, r =  − 0.36 [95% CI, − 0.52 to − 0.17] in model 2, and r =  − 0.44 [95% CI, − 0.69 to − 0.09] in model 3, respectively). However, after within-subject mean centering for the time variable, the negative correlation between GFR at baseline and GFR change became much smaller and no longer statistically significant (r = 0.05 [95% CI, − 0.11 to 0.21] in model 1, r =  − 0.06 [95% CI, − 0.26 to 0.15] in model 2, and r =  − 0.09 [95% CI, − 0.41 to 0.25] in model 3, respectively) (Table 5) (Fig. 2).

Table 5 Correlation between baseline eGFR and eGFR decreased rates (random effects) in patients without DM.

Discussion

To our best knowledge, this study is the first to explore the relation between baseline eGFR and subsequent GFR decline in DM and non-DM patients with early-stage CKD in a long-term population-based cohort. After multivariable adjustment, we found that male and increased age had a significantly negative effect on the baseline eGFR, and only UPCR showed a significant relation to the decrease in eGFR in patients with CKD and DM. In patients with CKD and non-DM, males and older age also showed a significantly negative association with the baseline eGFR but males (in model 1 and 2) and higher systolic pressure were associated with a significantly smaller decrease in eGFR. Naïve analyses without centering of the time variable found a significantly negative correlation between GFR at baseline and GFR change in DM and non-DM patients with CKD. After within-subject mean centering, a significant but moderate correlation were observed only in DM patients with CKD. This indicates the impact of mathematical coupling on the estimation of the correlation between GFR at baseline and GFR change cannot be overlooked, and this is consistent with findings in the previous methodological studies23,25,31,32,33.

It is still controversial whether hyperfiltration with higher baseline GFR is related to subsequent GFR decline. Some studies revealed that hyperfiltration with higher eGFR was associated with more rapid eGFR decline in patients with type 1 or type 2 DM16,17,18,19,34,35,36. Moreover, Melsom et al. showed that this significant correlation existed not only in patients with DM but also in those without DM17. In contrast, no substantial correlation between baseline GFR and subsequent GFR decline was also found in other investigations37,38. This inconsistency may be attributed to considerable GFR variations over time, such as age-related GFR decline, GFR measurements, and inappropriate statistical methods for assessing the relation between GFR at baseline and its changes39,40. In our study, a negative relation between the baseline eGFR and subsequent decline was found only in patients with DM but not in patients without DM.

The accurate measurement of GFR should examine the clearance of materials which are only through renal filtration, such as iohexol, iothalamate, inulin, etc.41. However, the estimated GFR was more readily available and cost-effective in the clinical practice than GFR via direct measurement, which was difficult to replicate due to different physiological conditions and great variations over time. Two of the most common equations for estimating GFR were MDRD-4 variables and CKD-EPI worldwide. However, the MDRD-4 equation was created by using data from Caucasians and African Americans with CKD and is likely to underestimate GFR, when eGFR greater than 60 mL/min/ 1.73 m242,43. The KDIGO 2012 guidelines recommend using the CKD-EPI equation in adults, unless an alternative equation has been shown to be more accurate in the specific population44. In our study, we used a corrected MDRD equation (Taiwanese MDRD), because it has been shown to have better precision and a lower bias than the equations of the MDRD-4 and CKD-EPI in Taiwanese population30.

Analysis of the relation between the baseline and changes suffered from mathematical coupling, which gives rise to misleading results and invalid testing of the null hypothesis23,25. Some studies applied a linear mixed regression model with random intercept and slope to resolve this statistical issue and to attain correct results17. However, the correct null hypothesis for testing the correlation between the baseline value and subsequent change is not zero, because these two variables have an underlying mathematical relation by sharing a common component of the baseline value23,25. It has been shown that centering the time variable could overcome this underlying mathematical relation and then attain an accurate result of null hypothesis test23,24,25,45. In this study, we undertook within-subject mean centering for the time variable to investigate the relationship between the baseline eGFR and its changes. We observed that the negative correlations are always attenuated after centering when compared with those given by the conventional approach without centering, in DM and non-DM patients with CKD. Moreover, in non-DM patients, this negative correlations were no longer statistically significant, after correct methodologies were used. This implied that higher baseline eGFR or renal hyperfiltration may be only a subclinical indicator but not the major cause of renal damage in patients with DM and early-stage CKD. Furthermore, when patients’ GFR was lower than 45 ml/min/1.73 m2, they would be transferred to the Pre-ESRD program, and their eGFR values were no longer being included in our analysis. This may lead to the truncation of low eGFR values, resulting in the reduction in the variances of follow-up eGFR and the over-estimation of the negative relation between the baseline eGFR and its changes. Our study suggests that previous evidence on the relation between the baseline eGFR and the decline in eGFR should be interpreted with great cautions and may require reevaluation.

Limitations

Our study has some limitations. First, we included a relatively small number of participants, especially in patients with DM. Despite this problem, our study collected sufficient data of repeated measurements during the long-term follow-up to achieve robust inference. Second, our study included a higher proportion of men and elderly in DM and non-DM groups. According to previous national study, near two thirds of adults with CKD was more than 60 years old (63.3%), and men had a higher prevalence of early-stage CKD than did women (11.7% versus 9.9%)46. These data showed a similar distribution to our study. Therefore, it needs to be cautious to apply findings in our study to other populations with different sex proportions or age distributions. Third, the mean eGFR in our study was lower than the traditional definition of hyperfiltration with high GFR. The focus of our study is not the consequence of renal hyperfiltration but proper analyses of the relation between higher baseline GFR and subsequent GFR change. Moreover, the number of nephrons varied among individuals and usually decreased with age or renal injury39. Glomerular hyperfiltration or single-nephron hyperfiltration in people with fewer numbers of nephrons may show a normal or mildly low level of whole-kidney GFR, which is equal to single-nephron GFR multiplied by nephron numbers47. Therefore, results from our study still provides evidence on the relation between higher eGFR and subsequent eGFR changes. Finally, patients with GFR lower than 45 ml/min/1.73 m2 or UPCR1000 mg/gm were transferred to the Pre-ESRD program for further management. Therefore, informative censoring may be a concern, since the information from those with rapidly deteriorating kidney function were selectively missing. This may be regarded as a problem with the truncated data, since GFR lower than a threshold was unavailable. This is likely to lead to a decrease in the variance of GFR with the increase in follow-up of the cohort, yielding a spurious, negative correlation between the baseline GFR and changes in GFR25. Therefore, the negative correlation between GFR at baseline and GFR changes in patients with CKD and DM may be weaker than what has been observed.

Conclusion

In conclusions, a significantly negative correlation between GFR at baseline and GFR changes was found in patients with CKD and DM, but no such correlation was found in non-DM patients with CKD when correct statistical analyses were undertaken. Higher baseline eGFR or renal hyperfiltration may be only a subclinical indicator but not the major cause of renal damage in patients with DM and early stage CKD. Our findings suggest that higher baseline GFR was associated with a greater GFR decline in DM patients but not in non-DM patients. Investigations about baseline value to subsequent changes should describe model specifications in detail to assure resolving mathematical coupling and then prevent from a spurious conclusion.