Development and temporal validation of five year retinal arteriosclerosis risk prediction in high risk adults from an East China cohort

Zhu, Jing; Xu, Jiaqi; Gao, Xin; Wang, Yuying; Gu, Yiqing; Wang, Yamin; Tang, Zaixiang; Shen, Junjie

doi:10.1038/s41598-026-42686-7

Download PDF

Article
Open access
Published: 27 March 2026

Development and temporal validation of five year retinal arteriosclerosis risk prediction in high risk adults from an East China cohort

Jing Zhu¹^na1,
Jiaqi Xu^2,3^na1,
Xin Gao⁴,
Yuying Wang^2,3,
Yiqing Gu^2,3,
Yamin Wang¹,
Zaixiang Tang^2,3 &
…
Junjie Shen^2,3

Scientific Reports volume 16, Article number: 10835 (2026) Cite this article

371 Accesses
Metrics details

Subjects

Abstract

Structural variations in retinal vessels predict clinical conditions related to vascular health. This study aims to describe the incidence and distribution of retinal arteriosclerosis and to develop risk prediction models for high-risk populations. Data were obtained from a retrospective health examination cohort at Hua Dong Sanatorium, China. The primary outcome was retinal arteriosclerosis. Candidate risk predictors were selected based on biological plausibility and potential predictive ability. Risk prediction models were developed using the Cox proportional hazards model. Predictive performance was evaluated through discrimination and calibration accuracy.Internal validation was performed using bootstrap resampling (100 iterations) to estimate optimism and to report optimism-corrected discrimination (C-index) and calibration (slope). In the derivation cohort, 10,323 individuals were diagnosed with retinal arteriosclerosis over 288,525 person-years (incidence density: 35.8 per 1,000 person-years). Males over 40 and females over 45 were identified as relatively high-risk groups. Risk predictors included age, body mass index, waist circumference, hip circumference, systolic blood pressure, diastolic blood pressure, smoking, hypertension, diabetes, high-density lipoprotein, and serum creatinine. The C-index of the risk prediction models was approximately 0.8 in the validation set. These models are accessible to potential users via an online application. The risk prediction models may facilitate early intervention by modifying risk factors, thereby reducing the incidence of retinal arteriosclerosis in high-risk populations.

The performance of an integrated model including retinal information in predicting childhood hypertension

Article 04 May 2021

Prevalence and risk factors of hypertensive retinopathy in South Korea based on national health survey data

Article Open access 15 July 2025

Development and validation of a hypertension risk prediction model and construction of a risk score in a Canadian population

Article Open access 27 July 2022

Introduction

The continued growth of the global elderly population increases the burden of retinal diseases worldwide¹. Retinal arteriosclerosis (RA) measured noninvasively in the fundus is a progressive narrowing of small retinal arteries, impairing circulation². It advances from mild arteriolar narrowing to severe sclerosis with hemorrhages. Risk factors include aging, hypertension, diabetes, hyperlipidemia, and smoking³. To a certain extent, RA reflects the conditions of vascular systems in the whole body, which is one of the earliest signs of adverse structural and functional changes in vascular walls^4,5. Thus retinal vessels provide a window for detecting changes in the micro-vascular system associated with the progress of the cardiovascular disease^6,7.

Clinical risk prediction model has been widely used in medical practice to tell an individual’s risk of disease in the future based on multiple individual characteristics⁸, which helps for improving the quality of life of the target population and optimize the utilization of medical resources⁹. It is well known that aging, hypertension, high fat intake, and diabetes can accelerate the process of vascular and retinal disease^10,11,12, however, no large cohort studies have been conducted with retinal or fundus arteriosclerosis as an outcome up to now, and we still lack a comprehensive epidemiological understanding of the disease. Furthermore, there is also lack of risk prediction model for predicting the future risk of this disease in high risk populations.

This study aims to describe the recent incidence and distribution of RA, and further develop and validate risk prediction models for high risk populations, by using a retrospective large population-based cohort from East China. We hope that early intervention via changing modifiable risk factors from the risk prediction system to reduce the expectation of RA among high risk populations.

Materials and methods

Study population

This study analyzed data from Hua Dong Sanatorium, which has been reported previously^13,14. Specifically, the major subjects of this retrospective longitudinal dynamic health examination data were non-manual workers or retirees living in East China, most of which took a health examination every one to two years. Due to missing data in 2014, we divided this cohort into two parts: one cohort is from January 2005 to December 2013, and the other is from January 2015 to December 2021. We took the latter as the descriptive subject and as the derivation cohort, and took the former as the validation cohort.

The outcome we were interested in was the occurrence of RA in any grade. In this study, we defined RA-free as “survival”, and the “survival probability” referred to the probability of remaining RA-free. The diagnosis of RA was performed by a trained ophthalmologist. Fundus photography was completed following a standardized Protocol. Subjects were classified concerning their retinal photography based on the Keith-Wagener-Barker classification. Detailed instructions for other data collection see in^13,14.

The selection process of the eligible individuals in the derivation cohort is as follows. First, 112,612 individuals with at least two health examination records were retained. Second, individuals without ophthalmic examination and those diagnosed with RA at the first health examination were excluded. Third, we excluded individuals observed for less than 3 months and those with an age less than 18 at the first health examination. Lastly, 90,240 individuals with 324,966 health examination records were left. Figure S1 presents the above process.

This study was approved by the Ethical Committee and Institutional Review Board of Hua Dong Sanatorium, Wuxi. Informed consent was waived because the data were analyzed retrospectively and anonymously, and was supported by the Ethical Committee. All methods were performed in accordance with the Declaration of Helsinki.

Construction and estimation of the risk prediction model

We conducted and reported risk prediction models according to the guidance of TRIPOD (Table S1)⁹. The outcome definition of RA followed the Keith–Wagner–Barker classification (Table S2). Due to the inherited difference between male and female¹⁵, we built risk prediction models of RA disease risk for males and females separately. We calculated the incidence density of RA in the derivation cohort by quartile age groups (Table S3-S4). Based on incidence density, males over 40 and females over 45 were considered to be at high risk in this study.

Predictor variables

We first described the baseline characteristics of males over 40 and females over 45 in the derivation cohort (see Table S5-S6). Among these characteristics, candidate predictor variables were selected mainly by the availability, explainability in biology, and potential predictive ability judged by the Cox regression model.

The results of univariable and multivariable Cox regression analyses see Table S7-S8. Continuous laboratory data would be log2 transformed if they were skewed or had a wide range. When conducting univariable Cox analysis, the linear relationships between continuous variables and risk of incidence were examined (Figure S2). An obvious nonlinear relationship would be fitted using restricted cubic splines¹⁶. Restricted cubic splines were implemented using the rcs() function in the rms package in R, with 3 knots placed at the 10th, 50th, and 90th percentiles of the predictor distribution in the derivation cohort, as determined by the datadist object. The same spline specification (number and locations of knots) was retained when non-linear terms were included in the multivariable Cox models and when generating predictions in the temporal validation cohort. We selected variables with p-value < 0.05 for the next multi-variable analysis. When conducting multivariable Cox analysis, the col-linearity diagnosis was first performed between variables. Variables with variance inflation factor (VIF) greater than 5, i.e., cholesterol (Table S9-S10) or thought to be closely related in biology, i.e., glycated hemoglobin (vs. fasting plasma glucose) were excluded.

For the risk prediction model, we considered variables that could increase the concordance index of the model while trying to reduce the complexity of the model. Finally, age, body mass index (BMI), waist circumference (WC), hip circumference (HC), diastolic pressure (DP), smoking, hypertension, diabetes, and serum creatinine (Scr) were selected as key predictors for male; age, BMI, WC, HC, systolic pressure (SP), DP, hypertension, diabetes, high density lipoprotein (HDL) and Scr as key predictors for female.

Derivation and validation models

Given that BMI had a missing value of 40% in the original data, we evaluated the mechanism of the loss of BMI. Following the suggestion of Desai¹⁷, describing differences between those with and without data on key variables concerning population characteristics serves as an assessment of the missing completely at random (MCAR) assumption, we compared the incidence of RA between samples with BMI and without BMI (see Figure S3), found that there was little difference between the two in either male or female population. Thus, the missing of BMI could be considered as the MCAR (Actually, the missing of the records of height and weight was completely random, and so did WC and HC). It can be considered that the result using the complete-case (CC) of BMI was unbiased. Finally, data with complete BMI was used. In these data, the other variables were all missing by less than 1%. Thus, we chose to delete these samples with missing values directly. The last derivation set version was: males over 40, n = 19,889; females over 45, n = 8,792.

The prediction model using the semi-parameter Cox model consists of two parts: the linear part in the exponential position and the unshaped baseline hazard¹⁸. In the final version, risk prediction models for RA were fitted within the above risk predictors using Cox proportional hazards models. Linear predictor (LP) or called predicted risk score (PRS) was calculated by the sum of each predictor’s value multiplied by its coefficient. The submission to the proportional hazard assumption of each risk factor was assessed by checking Schoenfeld residual plots (Figure S4). Linear or nonlinear relationships between continuous predictors and risk of incidence were also examined. The relationship between predictors and risk of incidence in males was reasonably approximates a linear relationship (Figure S5A) while age, HDL, and Scr presented a nonlinear relationship to the risk of incidence in females (Figure S5B).

In this study, the health examination cohort from January 2005 to December 2013 was used as the temporal external validation cohort. We chose the validation set consistent with the conditions of the derivation set, and individuals that appeared in the derivation set were eliminated in the validation set. Baseline characteristics of the derivation and validation set are summarized in Tables S11-S12. PRS in the validation set was obtained by the coefficients from the derivation set. We fitted a Cox model to the validation set with no covariates other than PRS with the regression coefficient constrained to one and obtained the estimated baseline hazard of the validation set¹⁸.

Evaluation of model performance

The overall performance measures of the risk predictive model are mainly assessed by discrimination and calibration⁸. In addition, the Brier score is deemed as a comprehensive index of evaluation model performance, with a range from 0 (for a perfect model) to 0.25 (for a non-informative model with a 50% incidence of the outcome)⁸.

Discrimination is the ability to differentiate between high-risk individuals and low-risk individuals¹⁹. The most intuitive method is to divide the population into different risk groups by PRS and observe the risk difference between the groups by Kaplan-Meier (K-M) curves¹⁹. Discrimination is also often measured by Harrell’s overall concordance (C) statistic, with a range from 0.5 to 1.0. We calculated the original C-index and corrected C-index by a bootstrap procedure resampled 100 times. In addition, the area under a receiver operating characteristic curve (AUC) is calculated in different time horizons²⁰. Another approach to validation is to estimate the weight of PRS in the validation set¹⁸, known as the “calibration slope”. The discrimination in the validation set is about the same as the derivation set when the slope of PRS is approximately one¹⁸.

Calibration describes how accurately the estimates or predictions of risk from a model reflect the risk incidence in the observed data¹⁹. Usually, calibration plots are used to assess calibration. In this study, subjects were evenly divided into ten groups by PRS, then the mean predicted incidence probabilities of these groups (X-axis) were compared to the mean observed incidence probabilities (Y-axis) at 1, 3, and 5 years. Besides, we performed a more intuitive and sensitive assessment of the calibration. We divided individuals into three risk groups based on the tertiles of PRS calculated from the derivation set. We next plotted the observed K-M curves and the mean predicted probability K-M curves within each risk group to compare the bias.

We performed the above performance evaluation on the risk prediction models in the derivation set (internal) and validation set (external) respectively. If the estimated cumulative baseline hazard of the derivation set is very different from that of the validation set, the calibration of the risk predictive model would be very poor in the validation set. In such a situation, we would use the cumulative baseline hazard of the validation set to re-calibrate the models^21,22.

To estimate the robustness of predictive accuracy of the risk predictive models constructed using the baseline information of the health examination cohort, we fitted a time-dependent covariates Cox regression model using subsequently updated health examination data of derivation set and calculated the Brier score and C-index over a series of points in time.

Online risk predictive app

Our risk predictive models of predicting RA-free probability are available for the potential users at the online app for male: https://lnzi.shinyapps.io/rarisk_male/ and for female: https://lnzi.shinyapps.io/rarisk_female/.

Statistical analysis

We used the event per variable (EPV) criterion to assess the adequacy of data instead of formally calculating the sample size⁹. Follow-up time was determined as the time from the first health examination to RA diagnosis, or the last health examination time, whichever came first. The incidence of RA was presented by K-M curves. Median follow-up time was computed via the reverse K-M method⁹. P-value < 0.05 was considered to be statistical significance. Data cleaning and analyses were performed using R version 4.1.3.

Results

The baseline information of 90,240 individuals aged 18.1–96.8 years in the health examination cohort from January 2015 to December 2021 is summarized in Table 1. The median age of the cohort population was 43 (inter-quartile range, IQR: 35–51). The median follow-up time was 3.14 years (95% credible interval, CI: 3.12–3.16). During a sum of 288524.7 person-years of follow-up, a total of 10,323 individuals were diagnosed with RA (incidence density: 35.8 per 1000 person-years).

Table 1 Baseline characteristics of participants in the health examination cohort (2015–2021) by retinal arteriosclerosis (RA) status at the end of follow-up.

Full size table

Derivation and validation models

The baseline characteristics of the derivation set and validation set are summarized in Tables S10-S11. The median follow-up time of males in the derivation set was 3.61 years (95% CI: 3.47–3.70) and 6.62 years (95% CI: 6.60–6.65) in the validation set. The incidence density of RA in males was 71.2 per 1000 person-years in the derivation set and 46.4 per 1000 person-years in the validation set. The median follow-up time of females in the derivation set was 3.15 years (95% CI: 3.09–3.31) and 6.56 years (95% CI: 6.51–6.60) in the validation set. The incidence density of females was 47.8 per 1000 person-years in the derivation set and 31.2 per 1000 person-years in the validation set.

The coefficients of the male risk predictive model, as well as PRS equation structured from the derivation set, see Table 2. The coefficients of the female risk predictive model and PRS equation are structured from the derivation set (see Table 3).

Table 2 Predictors and coefficients of the risk predictive model of retinal arteriosclerosis for males over 40.

Full size table

Table 3 Predictors and coefficients of the risk predictive model of retinal arteriosclerosis for females over 45.

Full size table

The distribution of PRS in the derivation set and validation set were similar (see Figure S6). We modeled the relationship between PRS and hazard ratio (HR) in the derivation cohort, with a threshold specified to classify individuals into high- and low-risk categories. (Fig. 1). The baseline survival (disease-free) probabilities of derivation set and validation set in males and females see Table S14-S15, which can be used to calculate the disease-free probability for an individual in recent years. Note that there was an obvious difference in the cumulative hazard and baseline survival probability between the derivation set and validation set in both males and females (see also Figure S7).

Evaluation of model performance

In internal validation, the original overall C-statistic of risk predictive models of males and females was 0.7740 and 0.7969, respectively, and the corrected C-statistic was 0.7736 and 0.7955. In external validation, the original C-statistic of the models of male and female was 0.7859 and 0.8156, respectively, and the corrected C-statistic was 0.7860 and 0.8160. Brier score and C-index at different time points from 1-year to 5-year of derivation set and validation set in males and females are shown in Fig. 2. Consistent with Fig. 2, time-dependent ROC curves at 1, 3, and 5 years showed good discrimination in both cohorts (Figure S11A-B).The calibration slope was 0.972 in males and 1.013 in females which were both close to 1. The observed K-M survival curves of the three risk groups grouped by PRS of the derivation set (see Table S15) were widely separated in the derivation set and validation set (Fig. 3, solid lines). These results corroborated the high discriminative power of the models. Of note, the C-statistics in the temporal validation cohort were slightly higher than those in the derivation cohort (0.7859 vs. 0.7740 in men and 0.8156 vs. 0.7969 in women). This small difference most likely reflects differences in case-mix and follow-up patterns between cohorts, rather than a truly superior performance of the model in the validation data.

In terms of calibration, the observed K-M survival curves of the three risk groups were close to the mean predicted survival curves in the internal validation (Figs. 3A and D and 4A and D). However, calibration in the temporal validation cohort showed notable overestimation of risk (Fig. 3B and E). Calibration plots also indicated poor agreement between predicted and observed risk probability at 1, 3, and 5 years in external validation (Fig. 4B and E). The predicted risk was overestimated in the validation set. The reason is the baseline disease-free probability of the validation set was much higher than that of the derivation set (Table S13-S14 and Figure S7). After re-calibrated the cumulative baseline hazard using the validation set, calibration accuracy had been well improved in both male and female models (see Figs. 3C and F and 4C and F).

Brier score and C-index at different time points from 1-year to 5-year in males and females using subsequently updated health examination data of the derivation set as sensitivity analysis see Figure S8, which had not much difference to the baseline models. This may have something to do with the lack of movement in the indicators during the health examination period (see Figure S9 for example).

Discussion

A major issue in the older population is the high prevalence of sight-threatening conditions. The eye is thought of as a kind of “vascular” organ¹⁰. Ophthalmologists may play a key role in the early identification of older individuals with systemic vascular disease in the future²³. Recent evidence also supports associations between retinal arteriosclerotic signs and subclinical atherosclerosis (e.g., coronary heart disease and carotid atherosclerosis or new-onset carotid plaque), reinforcing the value of fundus findings as systemic vascular markers^24,25,26. However, before identifying changed micro-vascular features like RA, it is much preferable to make personalized risk stratification and early management feasible on a population basis in advance, which are important from the public healthcare perspective.

Although there have been many studies on the prevalence of arteriosclerosis or retinal atherosclerosis^13,27,28,29, fewer studies have reported on the incidence¹². Compared to the previous studies, our study reported the distribution of short-term incidence density of RA in different populations in terms of sex and age. In the study of¹², researchers reported the incidence of retinal atherosclerosis of 8.2% (26,168/319,501) between January 2005 and April 2020 with a median follow-up of 2.9 (IQR: 1.6–4.6) years, which was lower than the whole population in our study of 11.4% (10,323/90,240) during a median follow-up of 3.14 (95%CI: 3.12–3.16) years. In the study of¹², researchers summarized the distribution of characteristics of retinal atherosclerosis between the absent and the present, the results were similar to ours.

Aging, obesity, dyslipidemia, hypertension, and diabetes are well-known traditional risk factors for vascular and retinal diseases^10,30. Besides, there is evidence of the increased risk of retinal disease from smoking by adversely affecting blood flow and decreasing HDL levels^31,32. In this study, Scr was found to be a protective factor within its reference ranges (male: 53–106 µmol/L, female: 44–97 µmol/L, most people in the derivation set fell within the reference range, see Figure S10). Generally, Scr that is well outside the reference range predicts poor renal function. But for older adults with a normal level, higher Scr concentrations may indicate more robust musculature and improved nutritional status³³. We would like to emphasize that these risk prediction models may not be suitable for individuals with poor renal function.

Our risk prediction models for RA were constructed with more quantitative predictors, and potential nonlinear patterns were identified. These helped to improve the accuracy of prediction models. But the calibration was poor in external temporal validation because of the wide difference in baseline survival probability between the derivation and validation set. Although the models discriminated well, calibration differed by calendar period. We developed the models in 2015–2021 to reflect contemporary screening and data capture after the 2014 relocation, and used 2005–2013 as an out-of-time validation cohort to examine transportability. The lower baseline hazard in 2005–2013 led to systematic overestimation of absolute risk when applying the 2015–2021 baseline, illustrating cohort/period effects that primarily impact calibration. Updating the baseline cumulative hazard in the validation cohort markedly improved calibration; therefore, our online tool reports absolute risk using the contemporary (2015–2021) baseline, and recalibration of baseline risk is recommended when applying the model to other periods or settings³⁴. Therefore, considering that the derivation set reflected the more recent condition of RA, we used the baseline risk of the derivation set to calculate an individual’s disease-free survival probability in an online risk prediction app.

This study has some limitations. First, this population came from the health examination cohort in East China, which may have selection bias and may not be representative of other populations worldwide. In addition, the health examination data generated the lagging detection of the outcome, which would lead to the underestimation of the incidence density. Moreover, the true onset of RA is interval-censored between examinations; we therefore assigned the event time to the examination date when RA was first detected. This pragmatic choice may influence estimation of the baseline hazard and thus absolute risk and calibration, and may contribute to miscalibration in temporal validation if examination intervals or follow-up patterns differ across periods³⁵. Second, genetic factors were not considered in this study. Genetic factors are quite important in the development of vascular structure variation³⁶. This study also failed to consider personal diet, living habits, and medication history/use (e.g., antihypertensives, lipid-lowering, antidiabetic agents). That is why we included many quantitative variables in risk prediction models that may reflect a combination of these factors. Even when hypertension is included as a predictor, measured systolic and diastolic BP are prioritized because they distinguish treated/controlled from untreated/poorly controlled individuals and provide a continuous risk gradient; hypertension history is retained only when it adds incremental information without problematic collinearity.

In conclusion, this study described the recent short-term incidence density and distribution of RA among Chinese population according to the retrospective population-based health examination cohort. We hope the early intervention by changing the modifiable risk factors from our sketchy risk prediction system to reduce the expectation of RA for high risk populations. In the meantime, we appeal to more studies to improve our models by using other predictors such as genetic factors and medication history/use (e.g., antihypertensives, lipid-lowering, antidiabetic agents) and to accommodate more different populations.

Data availability

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Abbreviations

RA:: retinal arteriosclerosis
VIF:: variables with a variance inflation factor
BMI:: body mass index
WC:: waist circumference
HC:: hip circumference
DP:: diastolic pressure
Scr:: serum creatinine
SP:: systolic pressure
HDL:: high density lipoprotein
MCAR:: missing completely at random
CC:: complete-case
LP:: linear predictor
PRS:: predicted risk score
K-M:: Kaplan-Meier
C:: concordance
AUC:: area under the curve
CI:: credible interval
EPV:: event per variable
IQR:: inter-quartile range
HR:: hazard ratio

References

Ben-Arzi, A., Ehrlich, R. & Neumann, R. Retinal Diseases: The Next Frontier in Pharmacodelivery. Pharmaceutics 14(5). (2022).
Micera, A., Bruno, L., Cacciamani, A., Rongioletti, M. & Squitti, R. Alzheimer’s disease and retinal degeneration: A glimpse at essential trace metals in ocular fluids and tissues. Curr. Alzheimer Res. 16(12), 1073–1083 (2019).
Article CAS PubMed Google Scholar
Liew, G., Wang, J. J., Mitchell, P. & Wong, T. Y. Retinal vascular imaging: A new tool in microvascular disease research. Circ. Cardiovasc. Imaging 1(2), 156–161 (2008).
Article PubMed Google Scholar
Cavalcante, J. L., Lima, J. A., Redheuil, A. & Al-Mallah, M. H. Aortic stiffness: current understanding and future directions. J. Am. Coll. Cardiol. 57 (14), 1511–1522 (2011).
Article PubMed Google Scholar
Klein, R. et al. Are retinal arteriolar abnormalities related to atherosclerosis? The atherosclerosis risk in communities study. Arterioscl Throm Vas. 20 (6), 1644–1650 (2000).
Article CAS Google Scholar
Liew, G. & Wang, J. J. [Retinal vascular signs: a window to the heart?]. Rev. Esp. Cardiol. 64 (6), 515–521 (2011).
Article PubMed Google Scholar
Chang, J. et al. Association of Cardiovascular Mortality and Deep Learning-Funduscopic Atherosclerosis Score derived from Retinal Fundus Images. Am. J. Ophthalmol. 217, 121–130 (2020).
Article PubMed Google Scholar
Steyerberg, E. W. et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 21 (1), 128–138 (2010).
Article PubMed PubMed Central Google Scholar
Moons, K. G. et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann. Intern. Med. 162 (1), W1–73 (2015).
Article PubMed Google Scholar
Flammer, J. et al. The eye and the heart. Eur. Heart J. 34 (17), 1270–1278 (2013).
Article PubMed PubMed Central Google Scholar
Roizenblatt, M., Naranjit, N., Maia, M. & Gehlbach, P. L. The Question of a Role for Statins in Age-Related Macular Degeneration. Int J. Mol. Sci 19(11). (2018).
Matsuoka, S. et al. Association of retinal atherosclerosis assessed using Keith-Wagener-Barker system with incident heart failure and other atherosclerotic cardiovascular disease: Analysis of 319,501 individuals from the general population. Atherosclerosis 348, 68–74 (2022).
Article CAS PubMed Google Scholar
Liu, Q. et al. Gender-Specific Association between Serum Uric Acid and Incident Fundus Arteriosclerosis in Chinese Population: A Retrospective Cross-Sectional Study. Sci. Rep. 10 (1), 8595 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Liu, Q. Q. et al. Gender-Specific Association Between Serum Uric Acid and Incident High Intraocular Pressure in Chinese Population: A Cross-Sectional Study. Invest Ophth Vis. Sci 61(11). (2020).
Ji, H. et al. Sex Differences in Myocardial and Vascular Aging. Circ. Res. 130 (4), 566–577 (2022).
Article CAS PubMed PubMed Central Google Scholar
Ferrando-Vivas, P. et al. Team IC-: Prognostic Factors for 30-Day Mortality in Critically Ill Patients With Coronavirus Disease 2019: An Observational Cohort Study. Crit. Care Med. 49 (1), 102–111 (2021).
Article CAS PubMed PubMed Central Google Scholar
Desai, M., Kubo, J., Esserman, D. & Terry, M. B. The handling of missing data in molecular epidemiology studies. Cancer Epidemiol. Biomarkers Prev. 20 (8), 1571–1579 (2011).
Article PubMed PubMed Central Google Scholar
Royston, P. & Altman, D. G. External validation of a Cox prognostic model: principles and methods. BMC Med. Res. Methodol. 13, 33 (2013).
Article PubMed PubMed Central Google Scholar
Graf, E., Schmoor, C., Sauerbrei, W. & Schumacher, M. Assessment and comparison of prognostic classification schemes for survival data. Stat. Med. 18 (17–18), 2529–2545 (1999).
Article CAS PubMed Google Scholar
Heagerty, P. J. & Zheng, Y. Survival model predictive accuracy and ROC curves. Biometrics 61 (1), 92–105 (2005).
Article MathSciNet PubMed ADS Google Scholar
Janssen, K. J., Moons, K. G., Kalkman, C. J., Grobbee, D. E. & Vergouwe, Y. Updating methods improved the performance of a clinical prediction model in new patients. J. Clin. Epidemiol. 61 (1), 76–86 (2008).
Article CAS PubMed Google Scholar
Moons, K. G. M. et al. Risk prediction models: II. External validation, model updating, and impact assessment. Heart 98 (9), 691–698 (2012).
Article PubMed Google Scholar
Hafner, J., Zierfuss, B., H Schernthaner, G. & Schmidt-Erfurth, U. From the eye into the foot? Atherosclerosis 294, 41–43 (2020).
Article CAS PubMed Google Scholar
Zhang, W. et al. RETINAL MICROVASCULAR CHANGES AND RISK OF CORONARY HEART DISEASE: A Systematic Review and Meta-Analysis. Retina 44 (2), 333–344 (2024).
Article CAS PubMed Google Scholar
Jiang, Y. et al. Association of Retinal Arterial Narrowing With New-Onset Carotid Plaque: A Chinese Community-Based Nested Case-Control Cohort Study. J. Clin. Hypertens. (Greenwich). 27 (3), e14983 (2025).
Article CAS PubMed PubMed Central Google Scholar
Liu, C. et al. Sex-specific association between carotid atherosclerosis and fundus arteriosclerosis in a Chinese population: a retrospective cross-sectional study. Eur. J. Med. Res. 28 (1), 518 (2023).
Article CAS PubMed PubMed Central Google Scholar
Deal, J. A. et al. Retinal signs and risk of incident dementia in the Atherosclerosis Risk in Communities study. Alzheimers Dement. 15 (3), 477–486 (2019).
Article PubMed Google Scholar
Matsuoka, S. et al. Association Between Blood Pressure Classification Using the 2017 ACC/AHA Blood Pressure Guideline and Retinal Atherosclerosis. Am. J. Hypertens. 34 (10), 1049–1056 (2021).
Article PubMed Google Scholar
Meng, Y. et al. Association Between Folic Acid Supplementation and Retinal Atherosclerosis in Chinese Adults With Hypertension Complicated by Diabetes Mellitus. Front. Pharmacol. 9, 1159 (2018).
Article CAS PubMed PubMed Central Google Scholar
Soto, I., Krebs, M. P., Reagan, A. M. & Howell, G. R. Vascular Inflammation Risk Factors in Retinal Disease. Annu. Rev. Vis. Sci. 5, 99–122 (2019).
Article PubMed Google Scholar
Fleckenstein, M. et al. Age-related macular degeneration. Nat. Rev. Dis. Primers. 7 (1), 31 (2021).
Article PubMed PubMed Central Google Scholar
Yuen, V. L. et al. Effects of firsthand tobacco smoking on retinal vessel caliber: a systematic review and meta-analysis. Graefes Arch. Clin. Exp. Ophthalmol. 262 (5), 1397–1407 (2024).
Article PubMed Google Scholar
Hsu, J., Johansen, K. L., Hsu, C. Y., Kaysen, G. A. & Chertow, G. M. Higher serum creatinine concentrations in black patients with chronic kidney disease: beyond nutritional status and body composition. Clin. J. Am. Soc. Nephrol. 3 (4), 992–997 (2008).
Article CAS PubMed PubMed Central Google Scholar
Booth, S. et al. Using temporal recalibration to improve the calibration of risk prediction models in competing risk settings when there are trends in survival over time. Stat. Med. 42 (27), 5007–5024 (2023).
Article MathSciNet PubMed PubMed Central Google Scholar
Descallar, J., Ma, J., Zhu, H., Heritier, S. & Wolfe, R. Cause-specific hazard Cox models with partly interval censoring - Penalized likelihood estimation using Gaussian quadrature. Stat. Methods Med. Res. 33 (9), 1531–1545 (2024).
Article MathSciNet PubMed PubMed Central Google Scholar
Dhande, I. S. & Doris, P. A. Genomics and Inflammation in Cardiovascular Disease. Compr. Physiol. 11 (4), 2433–2454 (2021).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank Dr. Chunxing Liu and Hua Dong Sanatorium for providing and granting access to the de-identified health examination cohort data used in this study.

Funding

This study was supported by the National Natural Science Foundation of China(82373688, 81773541), the Project of Jiangsu Province Engineering Research Centerof Molecular Target Therapy and Companion Diagnostics in Oncology, (No.SGK1202407), the Priority Academic Program Development of Jiangsu Higher Education Institutions at Soochow University, the State Key Laboratory of Radiation Medicine and Protection (GZK1201919), Jiangsu Key Laboratory of Preventive and Translational Medicine for Geriatric Diseases (KJS2222), and Suzhou Key Laboratory of Neuro-oncology and Nano-bionics (SZZD003),Jiangsu Province Engineering Research Center of Molecular Target Therapy and Companion Diagnostics in Oncology(SGK1202407).The funding institutions did not play any role in the design or conduct of this research; collection, management, analysis, and interpretation of the data, preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Author information

Jing Zhu and Jiaqi Xu contributed equally to this work.

Authors and Affiliations

Department of General Education and Teaching, Changzhou Vocational Institute of Engineering, Changzhou, 213164, China
Jing Zhu & Yamin Wang
Department of Biostatistics, School of Public Health, Jiangsu Key Laboratory of Preventive and Translational Medicine for Geriatric Diseases, Suzhou, 215300, Jiangsu, P.R. China
Jiaqi Xu, Yuying Wang, Yiqing Gu, Zaixiang Tang & Junjie Shen
MOE Key Laboratory of Geriatric Diseases and Immunology, Suzhou Medical College of Soochow University, Suzhou, 215300, Jiangsu, P.R. China
Jiaqi Xu, Yuying Wang, Yiqing Gu, Zaixiang Tang & Junjie Shen
Centre for Psychological Health Education, Changzhou Vocational Institute of Engineering, Changzhou, 213164, China
Xin Gao

Authors

Jing Zhu
View author publications
Search author on:PubMed Google Scholar
Jiaqi Xu
View author publications
Search author on:PubMed Google Scholar
Xin Gao
View author publications
Search author on:PubMed Google Scholar
Yuying Wang
View author publications
Search author on:PubMed Google Scholar
Yiqing Gu
View author publications
Search author on:PubMed Google Scholar
Yamin Wang
View author publications
Search author on:PubMed Google Scholar
Zaixiang Tang
View author publications
Search author on:PubMed Google Scholar
Junjie Shen
View author publications
Search author on:PubMed Google Scholar

Contributions

Study conception and design: ZJ, XJQ, and TZXData collection and cleaning: ZJ, XJQ, WYY, GX, GYQ, WYMReal data analysis and interpretation: ZJ, GYQ, GX, WYM, and SJJDrafting of the manuscript: XJQ, SJJ, and TZXAll authors reviewed the manuscript.

Corresponding authors

Correspondence to Zaixiang Tang or Junjie Shen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval and consent to participate

The study was approved by the Ethics Committee of Hua Dong Sanatorium (No. 9, 2021), and all methods were carried out under relevant guidelines and regulations. The study was a retrospective cohort study. The informed consent was waived and the need for waiving the informed consent was also supported by the Ethical Committee of Hua Dong Sanatorium. The personal information of the study subjects was confidential.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download PDF )

Supplementary Material 2 (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhu, J., Xu, J., Gao, X. et al. Development and temporal validation of five year retinal arteriosclerosis risk prediction in high risk adults from an East China cohort. Sci Rep 16, 10835 (2026). https://doi.org/10.1038/s41598-026-42686-7

Download citation

Received: 02 January 2025
Accepted: 26 February 2026
Published: 27 March 2026
Version of record: 31 March 2026
DOI: https://doi.org/10.1038/s41598-026-42686-7

Subjects

Abstract

Similar content being viewed by others

The performance of an integrated model including retinal information in predicting childhood hypertension

Prevalence and risk factors of hypertensive retinopathy in South Korea based on national health survey data

Development and validation of a hypertension risk prediction model and construction of a risk score in a Canadian population

Introduction

Materials and methods

Study population

Construction and estimation of the risk prediction model

Predictor variables

Derivation and validation models

Evaluation of model performance

Online risk predictive app

Statistical analysis

Results

Derivation and validation models

Evaluation of model performance

Discussion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Ethics approval and consent to participate

Additional information

Publisher’s note

Supplementary Information

Supplementary Material 1 (download PDF )

Supplementary Material 2 (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links