Introduction

Diabetic kidney disease (DKD) is a leading cause of end-stage renal disease worldwide. Current diagnostic tools such as urinary albumin-to-creatinine ratio (UACR) and estimated glomerular filtration rate (eGFR) lack sufficient sensitivity and specificity for early detection. More sensitive, noninvasive markers are urgently needed to improve early diagnosis and management of DKD.

Type 2 diabetes mellitus (T2DM) is associated with multiple metabolic disturbances, including hyperglycemia, hypertension, and dyslipidemia, all of which contribute to DKD progression. Previous prediction models have included conventional risk factors but demonstrated limited specificity and clinical utility. Advanced glycation end products (AGEs) and serum cystatin C (CysC) have emerged as promising markers of renal function and damage. Lens AGE fluorescence provides a rapid, noninvasive measure of systemic AGE accumulation, while CysC is a more accurate surrogate for glomerular filtration than serum creatinine.

We aimed to establish and validate a nomogram combining routine clinical indicators with lens AGE fluorescence and CysC for early prediction of DKD risk in T2DM patients.

Study design and methods

Study participants

Participant grouping, inclusion and exclusion criteria, and sample collection

  1. (1)

    A schematic overview of the patient selection process and overall study design is depicted in Fig. 1. A total of 212 T2DM patients admitted to the Department of Endocrinology at Shanghai Fifth People’s Hospital, Fudan University, from November 2019 to February 2024 were selected to construct receiver operating characteristic (ROC) curves and determine cutoff points for noninvasive lens AGEs and CysC to predict worsening kidney function. According to the “Chinese Guidelines for the Prevention and Treatment of Diabetic Kidney Disease (2021 Edition)”1, patients were divided into a DKD group (31 males, 27 females) and a non-DKD group (99 males, 55 females). Patients were categorized into three groups according to the CKD prognostic risk stratification recommended by the KDIGO 2024 guidelines2. The low-risk category was defined as the mild group (n = 154), the moderately increased risk category as the moderate group (n = 36), and the high- plus and very-high-risk categories were combined and defined as the severe group (n = 22) (Fig. 2).

Fig. 1
figure 1

Study flow chart. Overview of patient selection and study design.

Fig. 2
figure 2

Prognostic risk stratification for CKD. Risk categories defined according to KDIGO 2024 guidelines.

  1. (2)

    Reproducibility Validation and Construction of the Nomogram Prediction Model: Patient data collection continued until February 2024, and the sample size was expanded to 2980. Patients who did not meet the study criteria or who demonstrated poor compliance were excluded. After completing relevant examinations and further excluding ineligible patients or those with incomplete clinical data, 868 T2DM patients (536 males, 332 females) were included. The grouping was consistent with the first phase: 644 patients were in the non-DKD group (403 males, 241 females), and 224 were in the DKD group (133 males, 91 females).

    A total of 127 candidate variables were initially collected and entered into univariate analyses for screening potential DKD-associated factors: (1) demographic characteristics, including age, sex, smoking and drinking history, and body mass index (BMI); (2) clinical information, such as the duration of T2DM and diabetic microvascular complications, including DKD, diabetic peripheral neuropathy (DPN), and diabetic retinopathy (DR); and (3) medical history, including hypertension, coronary heart disease (CHD), stroke, gout, hyperlipidemia, and fatty liver. Additionally, 48 laboratory indicators, such as blood biochemistry, complete blood counts, liver function tests, kidney function tests, and lipid profiles, were included. (4) Noninvasive lens AGE measurements were dichotomized at 0.306. Consistent with our previously published findings3, we applied the validated lens AGE cutoff of 0.306, which was originally derived through ROC analysis using Youden’s index to define the optimal threshold for identifying DKD.

    AGE fluorescence detection was performed by certified and trained medical personnel via a noninvasive lens AGEscan fluorescence detector (provided by Sinocare Inc.), which was used to obtain AGE accumulation levels.

  2. (3)

    Inclusion and Exclusion Criteria: According to the “Chinese Guidelines for the Prevention and Treatment of Diabetic Kidney Disease (2021 Edition),1” patients were divided into the DKD group (UACR ≥ 30 mg/g and/or eGFR < 60 mL/[min·1.7m2]) and the non-DKD group (UACR < 30 mg/g and eGFR ≥ 60 mL/[min·1.7m2]). Exclusion Criteria: Age < 20 or > 70 years; type 1 diabetes, gestational diabetes, or specific types of diabetes; acute or chronic infections, benign or malignant tumors, hematological diseases, immune disorders, urinary tract stones, non-DKD CKD, or DKD coexisting with non-DKD; fluorescein angiography within the past 6 months, photodynamic drug treatment within the past year, clinical diagnosis of lens opacity (e.g., cataracts), or ocular surface diseases (e.g., dry eye); white blood cell count (WBC) > 9.5 × 10^9/L or < 3.5 × 10^9/L; and poor patient compliance.

  3. (4)

    The study was conducted in accordance with the ethical guidelines of the Declaration of Helsinki.

    The procedures were approved by the Ethics Board at Shanghai Fifth People’s Hospital, Fudan University (approval no. (2021) (142)). Study participants with a clinical diagnosis of T2DM or DKD provided informed consent. The clinical characteristics of the study participants, including sex, age, and BMI etc. were recorded.

Data collection

Trained surveyors conducted face-to-face interviews via standardized paper-based questionnaires to retrospectively collect relevant data. All questionnaires were distributed, completed, and collected onsite. The data were entered independently by two individuals and cross-verified for accuracy.

Statistical analysis

Sample size considerations: this study used a retrospective design, all eligible patients were included and no a priori sample-size calculation was conducted. Adequacy for multivariable regression was ensured using the events-per-variable (EPV) principle4. A total of 224 DKD events provided > 30 EPV for the seven predictors in the final model, substantially exceeding the minimum requirement of 10 EPV.

The data were analyzed via SPSS 23.0 and R 4.0 software and are presented as the means ± standard deviations (SDs) or medians (interquartile ranges); comparisons between groups were performed via independent sample t tests or nonparametric tests. Categorical data are expressed as frequencies and percentages, and differences between groups were analyzed via the χ2 test. Univariate and multivariate logistic regression analyses were conducted to identify independent risk factors for DKD. The nomogram was constructed and validated via the rms package in R. The predictive performance of the model was evaluated via the area under the ROC curve (AUC). Calibration curves and the Hosmer–Lemeshow test were used to assess model calibration. Decision curve analysis (DCA) was applied to evaluate the clinical utility of the model.

Results

Construction of ROC curves to identify cutoff values of noninvasive lens AGEs and CysC for predicting worsening kidney function

Group analysis based on CKD prognostic risk stratification recommended by the KDIGO 2024 guidelines

Compared with the mild and moderate KDIGO groups, the severe KDIGO group presented significantly greater noninvasive lens AGE values and CysC levels and significantly lower 25-hydroxyvitamin D (25(OH)D) levels (P < 0.05). Compared with the mild KDIGO group, the severe KDIGO group presented significantly greater WBCs, neutrophil counts, platelet counts, and neutrophil-to-lymphocyte ratios (NLRs) (P < 0.05). Compared with the mild KDIGO group, the moderate and severe KDIGO groups had significantly greater platelet-to-lymphocyte ratios (PLRs) (P < 0.05) (Fig. 3). In the KDIGO 2024 and CDS guideline systems, inflammatory markers such as the NLR and PLR were positively correlated with the severity of kidney damage. We hypothesize that elevated AGE levels may directly or indirectly promote systemic chronic inflammation. This raises the question of whether these inflammation-related markers could also be used to assess DKD progression or aid in its diagnosis.

Fig. 3
figure 3

Comparison of clinical indicators across KDIGO groups. Noninvasive lens AGEs, cystatin C (CysC), 25-hydroxyvitamin D [25(OH)D], white blood cells (WBC), neutrophils, platelets, neutrophil-to-lymphocyte ratio (NLR), and platelet-to-lymphocyte ratio (PLR).

Group analysis based on the CDS guidelines for DKD diagnosis

In the DKD group, noninvasive lens AGE values, CysC levels, WBCs, Neutrophil counts, the NLR, and the PLR were significantly greater than those in the non-DKD group (P < 0.05) (Fig. 4).

Fig. 4
figure 4

Comparison of AGE, CysC, and 25(OH)D levels. Distribution between DKD and non-DKD groups.

Pearson correlation analysis of non-invasive lens AGEs and CysC values across domestic and international guidelines

In both the CDS and KDIGO stratification systems, noninvasive lens AGE values were positively correlated with kidney function indicators (r = 0.272 and 0.251, respectively; P < 0.05). CysC was also positively correlated with kidney function indicators in both systems (r values of 0.208 for CDS and 0.420 for KDIGO; P < 0.05).

Logistic regression analysis of CysC and noninvasive lens AGE values between groups

Logistic regression analysis based on KDIGO stratification revealed that patients in the moderate group were 2.97 times more likely (P < 0.001, 95% CI: 1.72–4.22) and 3.63 times more likely (P < 0.001, 95% CI: 2.46–4.80) to develop DKD than those in the mild group were. Patients in the severe group had a 4.21-fold (P < 0.001, 95% CI: 2.88–5.54) and a 12.49-fold (P < 0.001, 95% CI: 3.71–11.23) greater likelihood of developing DKD than did those in the mild group. Logistic regression analysis based on CDS dichotomization revealed that noninvasive lens AGE values and CysC levels in the DKD group were 2.75 times (P < 0.001, 95% CI: 1.50–4.00) and 3.13 times (P < 0.001, 95% CI: 1.91–4.34) higher, respectively, than those in the non-DKD group.

Construction of ROC curves based on noninvasive lens AGEs and CysC

Using the KDIGO 2024 guidelines for risk stratification, the ROC curve indicated that the cutoff value of CysC for predicting worsening kidney function was 1.185 mg/L (AUC = 97.0, sensitivity = 100%, specificity = 86.8%, P < 0.001) (Fig. 5A). When the CDS guidelines were used to categorize patients as diagnosed with DKD or not diagnosed with DKD, the ROC curve indicated that the cutoff value of CysC for predicting worsening kidney function was 1.005 mg/L (AUC = 61.8, sensitivity = 51.8%, specificity = 76.9%, P = 0.009). The cutoff value of noninvasive lens AGEs for predicting worsening kidney function was 2.893 (AUC = 64.6, sensitivity = 66.1%, specificity = 67.3%, P = 0.001) (Fig. 5B).

Fig. 5
figure 5

ROC curves for DKD prediction. (A) KDIGO-based stratification; (B) CDS-based stratification using CysC and AGEs.

Construction of a nomogram for DKD

In the first part of the study, the sample size was further expanded. After strict screening of the study population according to the inclusion and exclusion criteria, 1002 patients were initially included. After patients with incomplete clinical data were excluded, a total of 868 T2DM patients were included in the study, including 332 females and 536 males.

Group analysis based on different kidney damage evaluation indicators

Patients were divided into a DKD group and a non-DKD group according to the diagnostic criteria of the CDS guidelines. Among them, 224 patients (25.81%) were classified as having DKD, whereas 644 patients (74.19%) were classified as having non-DKD. The baseline characteristics of the two groups, including demographic characteristics, physical examination results, routine blood test parameters, and biochemical indicators, are shown in Table 1. Patients were divided into three groups according to the CKD prognostic risk stratification recommended by the KDIGO 2024 guidelines, as described in the previous section. The baseline characteristics are shown in Table 2.

Table 1 Comparison of baseline characteristics between the two groups.
Table 2 Comparison of baseline characteristics among the three groups.

Univariate and multivariate regression analyses

A total of 127 variables were included in the univariate analysis. Significant differences (P < 0.05) were observed between the DKD and non-DKD groups in terms of AGE risk stratification, age, 25(OH)D, glycated hemoglobin (HbA1c, %), systolic blood pressure (SBP), fasting blood glucose (FBG) levels, postprandial BG (PBG) levels, the PLR, the RBC, the WBC, SCr, CysC, and triglycerides (TGs). Variables with P < 0.05 in the univariate analysis were subsequently entered into a backward elimination stepwise logistic regression model for multivariable predictor selection. All eligible variables were included in the initial full model, and predictors were sequentially removed based on Akaike Information Criterion (AIC). The elimination process continued until no further improvement in model fit was achieved.

Seven variables remained in the final model and were identified as independent risk factors for DKD: SBP (OR = 1.05, 95% CI 1.03–1.09), HbA1c (OR = 1.24, 95% CI 1.07–1.43), TG (OR = 1.22, 95% CI 1.04–1.45), CysC (OR = 9.14, 95% CI 1.73–51.35), 25(OH)D (OR = 0.98, 95% CI 0.97–1.00), RBC (OR = 0.46, 95% CI 0.26–0.84), and AGEs (≥ 0.306 vs < 0.306, OR = 1.90, 95% CI 1.09–3.32) (Table 3).

Table 3 Univariate and multivariate regression analyses of DKD risk factors.

Construction of the nomogram

On the basis of the 7 independent risk factors identified through regression analysis—SBP, HbA1c (%), TGs (mmol/L), CysC (mg/L), 25(OH)D (nmol/L), the RBC (10^12/L), and AGEs (dichotomized at 0.306)—we constructed a nomogram for predicting DKD risk in T2DM patients (Fig. 6). Each risk factor is assigned a score proportional to its contribution to DKD risk. A vertical line is drawn from the patient’s value on the corresponding scale to the top points scale to determine the score for each risk factor. All the scores are added to calculate the total score. Using the bottom section of the total points scale, the probability of DKD occurrence can be predicted.

Fig. 6
figure 6

Nomogram for DKD risk prediction. Constructed from seven independent predictors.

Internal validation of the nomogram and model performance

The accuracy of the nomogram for predicting DKD risk was assessed via a calibration curve from the training set (Fig. 7A). The X-axis represents the predicted DKD risk, whereas the Y-axis represents the actual observed DKD risk. In this study, the ideal model with perfect predictive ability is represented by the diagonal dashed line. The “Apparent” dashed line represents internal validation, whereas the solid line (bias-corrected) represents the externally calibrated nomogram. The results of this study indicate that the constructed model aligns well with the actual observed outcomes. The ROC curve, with an AUC value of 0.809 (95% CI: 0.773–0.845; P < 0.05), is displayed in Fig. 7B. The sensitivity and specificity were 0.690 and 0.800, respectively.

Fig. 7
figure 7

Model performance in the training set. (A) Calibration curve; (B) ROC curve.

Internal validation of the nomogram and model performance

To validate the DKD model established in this study, 145 T2DM patients from the Department of Endocrinology, Shanghai Fifth People’s Hospital, Fudan University, were included for internal validation. The calibration curve for internal validation, which was in good agreement with the ideal calibration curve, is shown in Fig. 8A, whereas the ROC curve, with an AUC value of 0.806 (95% CI: 0.714–0.899; P < 0.05), is presented in Fig. 8B. The sensitivity and specificity were 0.632 and 0.944, respectively.

Fig. 8
figure 8

Model performance in the validation set. Receiver operating characteristic (ROC) curve.

Clinical utility of the nomogram

The Y-axis represents the net benefit. The diagonal gray line assumes that all patients have DKD, whereas the horizontal black line assumes that no patients have DKD. The red line represents the risk nomogram. The nomogram achieved maximum clinical benefit in both the training set (Fig. 9A) and validation set (Fig. 9B) when the threshold probability ranged from 0.101.00, indicating that the model has broad applicability and high clinical utility.

Fig. 9
figure 9

Decision curve analysis. Net clinical benefit of the DKD prediction model.

Finally, a total of 868 T2DM patients were included, with 224 (25.8%) diagnosed with DKD and 644 (74.2%) without DKD. Patients with DKD exhibited higher systolic blood pressure (SBP), glycated hemoglobin (HbA1c), triglycerides (TG), cystatin C (CysC), and AGE values, while 25-hydroxyvitamin D [25(OH)D] and red blood cell (RBC) counts were lower (all P < 0.05).

Seven independent predictors were identified: SBP, HbA1c, TG, CysC, 25(OH)D, RBC, and AGEs. A nomogram incorporating these variables achieved strong discrimination (AUC 0.809 in the training set and 0.806 in the validation set) and good calibration. Decision curve analysis confirmed clinical benefit across a broad range of thresholds.

Discussion

Comparison with previous models

DKD is the leading cause of end-stage renal disease (ESRD) worldwide and develops through the long-term interaction of multiple metabolic and hemodynamic factors. Conventional diagnostic markers such as serum creatinine (SCr), eGFR, and UACR are limited by delayed changes, which usually occur after irreversible renal damage has already developed5,6,7,8. In recent years, several nomograms have been proposed to predict DKD risk in T2DM patients9,10,11,12,13,14. These models typically included variables such as disease duration, BMI, blood pressure, HbA1c, triglycerides (TG), and blood urea nitrogen (BUN). Reported predictive performance was acceptable (C-index ~ 0.76–0.81), but most models relied solely on routine clinical parameters, and few underwent adequate internal or external validation. In contrast, our model is the first to incorporate noninvasive lens AGE values alongside CysC and conventional risk factors, showing robust discrimination (AUC 0.809 in the training set; 0.806 in the validation set) and particularly high specificity (0.944 in the validation set).

Role of cystatin C (CysC)

Cystatin C (CysC) is a critical marker of glomerular damage in diabetic kidney disease (DKD)15. As a low–molecular-weight protein produced by all nucleated cells and released into the bloodstream at a constant rate, CysC is freely filtered by the glomeruli and almost completely reabsorbed and degraded in the proximal tubules with minimal tubular secretion. These physiological characteristics make CysC a highly sensitive indicator of changes in the glomerular filtration rate (GFR)16,17,18,19. Unlike serum creatinine (SCr), CysC levels are not significantly affected by age, sex, muscle mass, diet, or inflammatory status, allowing it to more accurately reflect true renal function. Recently, several studies showed that serum Cysc is a better marker of declining GFR in DM patients than serum creatinine (Scr)20.Most of the evidence came from cross-sectional studies demonstrating that serum Cysc levels in DKD patients were significantly higher when compared to those in DM cases without DKD21,22,23.

Previous studies have shown that CysC demonstrates excellent sensitivity and specificity in the early diagnosis and monitoring of diabetic nephropathy, hypertensive nephropathy, and acute kidney injury24. Consistent with these findings, our study confirmed that CysC maintained high sensitivity and specificity within both the Chinese Diabetic Society (CDS) and KDIGO 2024evaluation frameworks. The incorporation of CysC into our model substantially enhanced its diagnostic accuracy and clinical interpretability. Collectively, these results provide a strong theoretical and empirical basis for the combined use of CysC as emerging diagnostic indicator for DKD risk prediction.

Role of advanced glycation end products (AGEs)

Autofluorescence (AF) of the lens, skin tissue, and serum reflects cumulative glycemic exposure in patients with diabetes to a certain extent25,26. Advanced glycation end products (AGEs) are irreversible adducts formed through nonenzymatic reactions between reducing sugars and free amino groups on proteins, nucleic acids, and lipids. Initiated by aldehyde or ketone groups of glucose and other reducing sugars, these reactions yield stable AGE structures that progressively accumulate in long-lived tissues27. Accelerated AGE generation and deposition promote oxidative stress and inflammatory activation, contributing to chronic tissue injury and metabolic dysfunction28,29.

In diabetes, AGE accumulation induces crosslinking of extracellular matrix (ECM) proteins, disrupting matrix architecture and promoting renal fibrosis. AGEs can also generate reactive oxygen species (ROS) through interactions with molecular oxygen and transition metals, amplifying oxidative stress and activating downstream inflammatory signaling. Through binding to the receptor for AGEs (RAGE), they trigger NF-κB–mediated transcription of proinflammatory and profibrotic cytokines, thereby exacerbating renal damage.

Skin autofluorescence (SAF) provides a widely used noninvasive measure of tissue AGE burden and correlates strongly with biochemical AGE quantification by high-performance liquid chromatography, supporting its analytical validity. However, SAF is influenced by melanin-dependent light absorption, sun exposure, skin pigmentation, epidermal thickness, topical products (such as moisturizers or sunscreens), and dermal vascularity, which may reduce measurement accuracy and limit its clinical applicability.

Lens crystallins are extremely long-lived proteins with minimal turnover, allowing AGEs to accumulate continuously over time and providing a stable reflection of lifetime glycemic and oxidative stress exposure. Lens fluorescence measurement is rapid (approximately 6 s), fully noninvasive, and does not require fasting or other patient preparation. It is applicable to most individuals except those who have undergone cataract extraction or lens replacement surgery.

As independent risk factors for the development and progression of diabetic kidney disease (DKD), AGEs not only predict glycemic deterioration but also the occurrence of diabetic microvascular complications. Renal fibrosis is a central driver of DKD progression toward end-stage renal disease (ESRD). Epithelial–mesenchymal transition (EMT)—characterized by loss of epithelial polarity and adhesion along with acquisition of mesenchymal features—is a key event in this fibrotic process. Experimental evidence suggests that AGEs induce metabolic reprogramming in renal tubular epithelial cells (TECs), shifting mitochondrial oxidative phosphorylation (OXPHOS) toward glycolysis and increasing renal lactate production, which contributes to metabolic maladaptation and fibrosis29.Our previous research further confirmed that lens AGEs levels were positively correlated with the occurrence of DKD, and the incidence of DKD in the high lens AGEs group was 2.739 times that in the low lens AGEs group.The receiver operating characteristic (ROC) curves showed that patients with T2DM with a lens AGEs value ≥ 0.306 were likely to have DKD3.

Other independent risk factors

In addition to CysC and AGEs, six conventional variables—SBP, HbA1c, TG, RBC, and 25(OH)D—were identified as independent predictors. Elevated SBP and HbA1c are well-established risk factors for DKD progression30,31,32, underscoring the importance of strict blood pressure and glycemic control. Hypertriglyceridemia has been implicated in renal injury via lipid-mediated inflammation, oxidative stress, and ectopic lipid deposition33,34,35. Reduced RBC counts, reflecting anemia due to impaired erythropoietin production, are frequently observed in DKD and have been linked to accelerated microvascular complications36,37,38. Finally, low 25(OH)D levels have been associated with impaired immune regulation, inflammation, and oxidative stress, all of which promote DKD progression39,40,41,42,43,44. Together, these findings emphasize that DKD results from a multifactorial interplay of metabolic, inflammatory, and hemodynamic factors.

Clinical implications

Our study provides a novel, easy-to-use, and clinically practical nomogram that integrates both traditional and emerging risk factors. The incorporation of noninvasive lens AGE values is of particular significance, offering a rapid, cost-effective, and patient-friendly method for DKD risk assessment. By achieving high specificity, this model reduces the likelihood of overdiagnosis and unnecessary interventions, thereby optimizing clinical resource allocation. In addition, the nomogram can assist clinicians in identifying high-risk individuals for early intervention, personalized treatment, and regular follow-up, ultimately improving patient outcomes and reducing healthcare burden.

Limitations

Several limitations should be acknowledged. First, This was a single-center study performed in a tertiary care hospital, which may limit the generalizability of our findings to milder or community-based diabetic populations and external multicenter validation with larger and more diverse populations is required to confirm the generalizability of our findings. Second, lifestyle and behavioral factors—including smoking status, diet, physical activity, and medication adherence—were not systematically collected, although these variables are known to influence DKD risk and may have introduced residual confounding.Third, the cross-sectional design precluded evaluation of causal relationships .Finally, while internal validation showed promising performance, further external validation and prospective studies are warranted.