AI-guided personalized predictions on myopia progression and interventions

Liu, Sian; Lu, Yuxing; Li, Xiaoman; Chen, Xiaoniao; Sun, Zhuo; Li, Gen; Wang, Kai; Wu, Wei; Xu, Hui; Li, Hongyi; Hu, Changxi; Zou, Zixing; Zhang, Miao; Zhang, Xuan; Lu, Wenyang; Yin, Yun; Qu, Jia; Zhang, Kang; Chen, Jie

doi:10.1038/s41746-025-02308-4

Download PDF

Article
Open access
Published: 12 January 2026

AI-guided personalized predictions on myopia progression and interventions

Sian Liu¹^na1,
Yuxing Lu^1,2^na1,
Xiaoman Li¹^na1,
Xiaoniao Chen^1,3^na1,
Zhuo Sun¹^na1,
Gen Li¹^na1,
Kai Wang^1,2,
Wei Wu²,
Hui Xu¹,
Hongyi Li¹,
Changxi Hu¹,
Zixing Zou^1,4,5,
Miao Zhang⁶,
Xuan Zhang¹,
Wenyang Lu^1,4,
Yun Yin⁷,
Jia Qu¹,
Kang Zhang^1,4,5 &
…
Jie Chen¹

npj Digital Medicine volume 9, Article number: 129 (2026) Cite this article

4839 Accesses
6 Altmetric
Metrics details

Subjects

Abstract

Myopia is a major global health concern. To enable precision myopia management, we developed a Transformer-based artificial intelligence (AI) model, the Myopia Progression Predictive Model (MPPM), comprising two modules: the Natural Progression Module (NPM) for predicting untreated myopia progression and the Intervention Progression Module (IPM) for forecasting progression under specific interventions. NPM was trained on 1,109,827 refractive records from 304,353 children and adolescents, achieving high predictive accuracy for future spherical equivalent (SE) and axial length (AL) over a 10-year period. In the internal test set, SE prediction reached R² = 0.94, MAE = 0.35D; for AL, R² = 0.91, MAE = 0.16 mm. Comparable performance was observed in external validation. IPM was trained on four intervention cohorts (0.01% atropine, orthokeratology, peripheral defocus spectacles, and repeated low-level red light [RLRL] therapy) using a Transformer-based causal machine learning framework, enabling individualized estimation of treatment effects. It accurately predicted myopia changes under each intervention (SE: R² > 0.88, MAE < 0.45D; AL: R² > 0.80, MAE < 0.31 mm). Among the interventions, RLRL slightly reversed myopia progression, whereas the others slowed myopia progression. MPPM demonstrates strong promise as an AI-driven platform for personalized prediction and optimization of pediatric myopia management.

Predicting onset of myopic refractive error in children using machine learning on routine pediatric eye examinations only

Article Open access 17 August 2025

Models of myopia: the effect of accommodation, lenses and atropine

Article 12 December 2023

Synergistic effect of defocus incorporated multiple segment glasses and repeated low level red light therapy against myopia progression

Article Open access 01 February 2025

Introduction

Myopia has emerged as a critical global public health issue, with particularly high prevalence rates in Asia^1,2. In China, an estimated 80% of high school graduates are affected by myopia, of whom 10–20% suffer from high myopia (defined as ≤−6.0 diopters)³. High myopia carries significant risks of vision-threatening complications including retinal detachment and macular degeneration, which can lead to irreversible vision loss and diminished quality of life^2,4. These concerns underscore the importance of early prediction and timely intervention for myopia progression.

Current clinical strategies for myopia control include 0.01% low-concentration atropine eye drops (Atropine), orthokeratology lenses (Ortho-K), peripheral defocus spectacles (PDS), and repeated low-level red light (RLRL) therapy. However, these interventions face several limitations including high costs, prolonged treatment durations, and potential adverse effects. Atropine may cause photophobia, transient near-vision impairment, or allergic reactions^2,5, while Ortho-K increase risks of corneal epithelial injury and infection^6,7. PDS involve substantial fitting costs that may limit accessibility⁸, and RLRL therapy raises concerns about potential retinal phototoxicity with long-term use^9,10. Artificial intelligence (AI) models offer a promising solution by enabling precise prediction of myopia progression and individualized treatment efficacy assessments, thereby facilitating optimized early interventions for high-risk pediatric patients. This approach could significantly improve healthcare resource allocation, reduce unnecessary costs and risks, and enhance clinical outcomes.

While previous studies have developed AI models to predict future spherical equivalent (SE) from historical refraction data^{11,12,13,14,15}, and randomized controlled trials (RCTs) have demonstrated the benefits of various myopia control interventions^{16,17,18,19,20,21,22}, two critical research gaps remain. First, there is a need for accurate annual progression predictions spanning the typical 10-year myopia progression period from ages 8 to 18, particularly accurate predictions for long-term axial length (AL) growth changes. Second, the field lacks individualized, quantitative predictions for treatment benefits.

Transformer architectures, with their ability to capture long-range temporal dependencies in sequential data²³, provide a powerful framework for modeling long-term myopia progression. To address the two limitations mentioned above, we developed a Transformer-based Myopia Progression Predictive Model (MPPM) with two modules: the Natural Progression Module (NPM), which predicts the untreated course of myopia progression, and the Intervention Progression Module (IPM), which forecasts myopia progression under specific interventions. The NPM was trained on a large-scale longitudinal cohort of children with myopia, with follow-up durations exceeding 10 years. Participants in this cohort had not received any myopia control interventions other than single-vision spectacles, allowing the model to capture the natural course of refractive development. The IPM was trained using data from four real-world myopia intervention cohorts, in which participants received Atropine, Ortho-K, PDS, or RLRL, respectively. The IPM was based on the NPM, but because numerous confounding factors may bias the estimation of treatment benefits^24,25, we incorporated a gradient reversal layer and adversarial training mechanism into the IPM, thereby establishing a Transformer-based causal machine learning framework^26,27,28. This design enables the IPM to generate accurate individualized treatment effect (ITE) predictions for different myopia control interventions²⁹.

Results

Overview of the study design

An overview of the study design is illustrated in Fig. 1. In this study, we included longitudinal data of children and adolescents with myopia from the Eye Hospital, Wenzhou Medical University (WMU). In the WMU dataset, participants were categorized into five cohorts based on whether they received myopia control interventions and, if so, the type of intervention: a non-intervention cohort, an Atropine cohort, a PDS cohort, an Ortho-K cohort, and an RLRL cohort. We also included longitudinal data of children and adolescents with myopia from Dazhou Central Hospital (DCH) and data from an Investigator Initiated Trial (IIT) of RLRL therapy (ChiCTR2200066365). The specific myopia correction and/or control interventions received by participants in different cohorts are detailed in the Datasets and subjects section.

For all participants, demographic information (sex and age at each visit) and ocular measurements of both eyes, including SE and AL, were collected. As subjective refraction was performed at every visit but AL was measured less frequently, the WMU dataset and DCH dataset contained more SE than AL records. Missing AL values were reconstructed using machine-learning-based imputation methods.

We developed the MPPM, which consists of two modules: the NPM, designed to predict myopia progression in the absence of interventions, and the IPM, designed to forecast progression under different myopia control strategies (Figs. 1 and 2). The NPM, based on a Transformer architecture, used sex, age, and prior SE and AL measurements to predict future SE and AL values over a 10-year horizon. It was trained and internally validated (8:2 split) using the WMU non-intervention cohort and externally validated on the DCH dataset and the RLRL IIT dataset’s control group. The IPM was derived from the NPM by introducing a gradient reversal layer and adversarial training mechanism to mitigate confounding and to estimate the causal effects of interventions on clinical outcomes. This allowed the model to predict individual myopia trajectories under different intervention strategies. The IPM was trained and internally validated (8:2 split) using the WMU intervention cohorts and externally validated with the RLRL IIT dataset’s intervention group. It is noteworthy that imputed AL values were used only for model training, whereas model validation were performed exclusively on observed AL values.

**Fig. 2: Architecture of the Myopia Progression Prediction Model (MPPM).**

Participant characteristics and follow-up details

Table 1 summarizes participant characteristics and follow-up details for both WMU dataset and DCH dataset. The WMU dataset included 304,353 individuals contributing 1,109,827 subjective refraction records, with 81,142 individuals providing 276,298 AL measurements. The DCH dataset contained 60,533 participants with 141,498 SE records, including 12,846 participants who contributed 30,134 AL measurements.

Table 1 Participant characteristics and follow-up details for the WMU and DCH datasets

Full size table

Notably, in the WMU dataset, 4848 participants had follow-up durations exceeding 10 years. No statistically significant differences were observed in the annual progression rates of SE and AL among the groups stratified by follow-up duration (≤3 years, 3–5 years, 5–10 years, and >10 years), as determined by one-way analyses of variance (for SE, p = 0.213; for AL, p = 0.339; see Supplementary Tables 1 and 2). These results suggest no substantial heterogeneity between participants with long- and short-term follow-up.

Table 2 summarizes the baseline characteristics and follow-up profiles of the non-intervention cohort and the four myopia intervention cohorts in the WMU dataset.

Table 2 Participant characteristics and follow-up details for the 5 cohorts in WMU datasets

Full size table

AL data imputation

Given axial elongation’s fundamental role in pediatric myopia progression³⁰, incorporating historical AL measurements into prediction models carries significant clinical importance. However, although subjective refraction data were available for all clinical visits, AL measurements were missing for a substantial proportion (~75%) of visits (Table 1). To maximize data utility, we implemented an XGBoost regression model with age, sex, and SE of both eyes as input features for AL imputation. The model was trained and validated using 304,353 concurrent subjective refraction and AL measurements from the WMU cohorts. Model evaluation showed strong concordance between predicted and measured AL values, with Pearson correlation coefficients (PCC) of 0.848 ± 0.003 (mean ± standard deviation) for right eyes and 0.849 ± 0.002 for left eyes, coefficients of determination (R²) of either eye were 0.719 ± 0.005 and 0.722 ± 0.004, respectively, and mean absolute errors (MAE) were 0.495 ± 0.002 mm and 0.494 ± 0.002 mm, respectively (Fig. 3). These results demonstrated high accuracy and reliability of the machine-learning-based AL imputation.

**Fig. 3: Machine learning–based imputation of axial length.**

The model was subsequently applied to impute missing AL values from measured subjective refraction data across the WMU dataset, thus establishing a comprehensive longitudinally matched dataset with complete SE-AL pairs (WMU paired dataset) for the MPPM contruction. It should be noted that both the observed and imputed AL data were used for model training, but the model validation were performed only on observed AL values to ensure methodological rigor.

Myopia Progression Predictive Model (MPPM) architecture

The architecture of MPPM is illustrated in Fig. 2. MPPM includes two modules: Natural Progression Module (NPM) and Intervention Progression Module (IPM).

The NPM was trained on the WMU non-intervention cohort. It was based on Transformer architecture and comprised five key components: input layer, feature embedding module, temporal sequence encoder, multi-task prediction head, and output layer. The model processes longitudinal visit records containing categorical (e.g., sex, intervention), continuous (age, SE and AL for both eyes), and temporal (inter-visit intervals) features. Notably, the feature embedding module separately processes categorical and continuous features into dense vector representations, incorporating positional encoding to maintain temporal ordering and multi-head attention to capture feature dependencies. These embedded vectors feed into the temporal sequence encoder, which uses masked multi-head attention with additional positional encoding to model temporal dependencies. The temporal sequence encoder output passes to the multi-task prediction head, which bifurcates into SE and AL prediction branches sharing common underlying features to enhance generalizability. The SE head predicts current time-step SE values for both eyes, while the AL head performs analogous AL predictions, with final outputs generated through the output layer (Fig. 2).

Numerous confounding factors may influence the evaluation of the relationship between myopia control interventions and clinical benefits. Many myopia control studies adopt broad inclusion criteria (e.g., enrolling all children with myopia) without stratifying participants by disease characteristics such as baseline myopia severity. This increases outcome heterogeneity and hampers precise quantification of treatment benefits. For instance, when assessing orthokeratology or low-dose atropine, failure to account for baseline refractive error may dilute the estimated treatment effect^24,25. Therefore, building upon the NPM, we incorporated a gradient reversal layer and adversarial training to construct a Transformer-based causal inference framework, referred to as the Intervention Progression Module (IPM). The IPM mitigates the influence of confounding factors and enables accurate estimation of the causal effects (individualized treatment effect, ITE) of myopia control interventions on changes in SE and AL progression. The formula for IPM to predict future SE and AL values is: The future value under natural growth predicted by NPM – The growth reduction due to control interventions predicted by IPM (i.e., ITE) = The future value under control interventions predicted by IPM.

The performance of the Natural Progression Module (NPM)

We evaluated the NPM through assessment of SE and AL prediction performance using R² and MAE. The evaluation specifically measured the proportion of predictions achieving clinically acceptable thresholds: absolute error <0.75 diopters for SE (P[AE < 0.75D]) and <0.25 mm for AL (P[AE < 0.25 mm])^13,14,31. As a result, the NPM demonstrated strong predictive accuracy for future SE and AL values over a 10-year period across both internal and external test dataset. In detail, in the internal test set, its prediction of SE showed an R² of 0.94, with an MAE of 0.35D, and P[AE < 0.75D] was 0.91. For AL, its R² was 0.91, with an MAE of 0.16 mm, and P[AE < 0.25 mm] was 0.84. In the external test set, NPM’s prediction of SE yielded an R² of 0.94, an MAE of 0.40D, and P[AE < 0.75D] = 0.86. The prediction for AL had an R² of 0.94, an MAE of 0.19 mm, and P[AE < 0.25 mm] = 0.72. (Table 3 and Fig. 4).

**Fig. 4: Performance of the Natural Progression Module (NPM) on the internal test set and external test set.**

Table 3 Overall performance of Natural Progression Module (NPM)

Full size table

To further evaluate the model’s predictive performance, Bland–Altman analyses and calibration curve assessments were conducted on the internal test set. In the Bland–Altman analyses, for SE prediction, the mean difference was 0.01 D with limits of agreement within ±0.80 D; for AL prediction, the mean difference was 0.03 mm with limits of agreement within ±0.50 mm (Supplementary Table 3, Supplementary Fig. 1). For the calibration curve analyses, continuous predictions were first divided into equally sized bins based on the predicted values (i.e., deciles). For each bin, the mean predicted value was plotted against the corresponding mean observed value in the test dataset. A locally weighted scatterplot smoothing (LOESS) curve was fitted to visualize the relationship, and the identity line (y = x) served as the reference representing perfect calibration. The resulting calibration curves demonstrated near-perfect agreement between predicted and observed values for both SE and AL in both eyes (Supplementary Fig. 2). Collectively, these findings indicate that the predicted SE and AL values closely correspond to the actual clinical measurements.

Besides, we conducted subgroup analyses on the internal test set. Based on baseline myopia severity, participants were categorized into three subgroups: mild myopia (SE ≥ −3.0D), moderate myopia (−6.0D ≤ SE < −3.0D), and high myopia (SE < −6.0D). According to baseline age, participants were divided into two subgroups: children (3 ≤ age ≤ 10) and adolescents (10 < age ≤ 18). Subgroup analyses were also performed by sex (male and female). The results showed that, across all subgroups, the prediction of SE achieved R² ≥ 0.86 (slightly lower in the mild myopia subgroup, R² = 0.78), MAE < 0.4D, and P[AE < 0.75D] > 0.90 (slightly lower in the high myopia subgroup, P = 0.87); across all subgroups, the prediction of SE achieved R² ≥ 0.86 (slightly lower in the mild myopia subgroup, R² = 0.78), MAE ≤ 0.2 mm, and P[AE < 0.25 mm] ≥ 0.73. (Supplementary Tables 4–6, Supplementary Fig. 3–8). These findings confirm the predictive accuracy of the NPM in forecasting future myopia progression across different demographic and clinical subgroups.

Furthemore, since the model’s time-series design leveraged all available prior visit records for each individual to predict future myopia progression, we evaluated the model performance of predicting (n + 1)^th visit values using n prior visits and observed improved predicting accuracy with increasing numbers of prior visits (Fig. 5), which indicated the importance of prior visits as input data in predicting accuracy. Moreover, we defined the prediction horizon as the duration (in years) between the n^th and (n + 1)^th visits. Analysis revealed an inverse relationship between prediction horizon length and accuracy (Fig. 6). Collectively, these results demonstrated that model accuracy would be increased with more prior visits but decreased moderately with longer prediction horizons (Fig. 7).

**Fig. 5: Impact of the number of prior follow-up visits on model prediction accuracy.**

**Fig. 6: Impact of prediction horizon (in years) on model prediction accuracy.**

**Fig. 7: Combined effect of the number of prior follow-up visits and prediction horizon (in years) on model prediction accuracy.**

To enhance model interpretability, we conducted feature-ablation tests in the internal test set to assess the importance of each variable. We systematically removed key input features, including SE history, AL history, Age and Sex from input features respectively and evaluated the resulting impact on model performance. The results are shown in Table 4. We observed that removing SE or AL history led to the largest decrease in performance, while removing age and sex resulted in a slight performance decrease. These ablation results demonstrate that baseline SE and AL are the most critical predictors, while age and sex contribute moderately to prediction accuracy.

Table 4 Feature ablation analysis of NPM performance

Full size table

The performance of Intervention Progression Module (IPM)

The IPM predicted future SE and AL under four myopia control strategies, and its performance was assessed using R² and MAE between predicted and observed outcomes. The results demonstrated that the IPM provided accurate predictions of myopia progression across all interventions. For Atropine, SE prediction achieved an R² of 0.97 with an MAE of 0.23D, and AL prediction achieved an R² of 0.96 with an MAE of 0.14 mm. For Ortho-K, SE prediction reached an R² of 0.88 with an MAE of 0.44D, and AL prediction reached an R² of 0.82 with an MAE of 0.31 mm. For PDS, SE prediction achieved an R² of 0.96 with an MAE of 0.27D, and AL prediction achieved an R² of 0.94 with an MAE of 0.16 mm. For RLRL, SE prediction achieved an R² of 0.90 with an MAE of 0.36D, and AL prediction achieved an R² of 0.90 with an MAE of 0.18 mm (Table 5, Fig. 8, Supplementary Fig. 9).

**Fig. 8: Evaluation of individualized treatment effect (measured by SE) from myopia control interventions using MPPM.**

Table 5 Performance evaluation of the IPM and myopia intervention benefit predicted by the AI model

Full size table

Individualized treatment effect (ITE) estimated by MPPM

By comparing the SE and AL changes predicted by the IPM under intervention conditions with those predicted by the NPM under non-intervention conditions, we found that on average Atropine reduced SE progression by ~55% and AL progression by ~75%; Ortho-K reduced SE and AL progression by ~45% and ~45%, respectively; PDS reduced SE and AL progression by ~50% and ~70%, respectively; RLRL therapy not only halted the progression of SE and AL, but also led to an approximate 10% reversal in SE (Table 5, Fig. 8, Supplementary Fig. 9). RLRL was the most effective intervention, while the others showed varying degrees of progression slowing. It should be noted that these estimates were based on different follow-up durations: within 1 year for RLRL and up to 30 months for the other three interventions.

Validation of the MPPM using an investigator initiated trial

Finally, we further validated the MPPM using data from an investigator initiated trial (IIT) of RLRL therapy. In this IIT trial, participants in the intervention group received RLRL treatment and wore single-vision spectacles, while those in the control group only wore single-vision spectacles. Follow-up examinations were conducted at 1, 3, 6, and 12 months after baseline. Baseline characteristics of participants are summarized in Table 6. We used each participant’s baseline features to predict SE and AL progression under natural conditions using NPM and compared these predictions with actual follow-up data from the control group. The model achieved an R² of 0.89 and an MAE of 0.35D for SE, and an R² of 0.85 and an MAE of 0.21 mm for AL. We then used IPM to predict changes in SE and AL under RLRL intervention and compared these with observed values in the treatment group. The model yielded an R² of 0.86 and an MAE of 0.37D for SE, and an R² of 0.83 and an MAE of 0.23 mm for AL (Table 7). These results further demonstrate the strong predictive performance of the MPPM model.

Table 6 Characteristics of the participants in the IIT of RLRL therapy

Full size table

Table 7 Validation of MPPM using data from the IIT of RLRL therapy

Full size table

Discussion

In this study, we developed and validated a Transformer-based time series AI model, MPPM, to predict long-term myopia progression in children and adolescents. The model has two modules: NPM, for predicting the natural progression of myopia; and IPM, for forecasting progression under specific interventions. The NPM demonstrated high accuracy in forecasting SE and AL over a 10-year period. Prediction performance improved with a greater number of prior follow-up visits and declined moderately as the prediction horizon increased. Building upon NPM, by incorporating a gradient reversal layer and adversarial training, we developed the IPM capable of predicting the causal effect between myopia control interventions and individualized patient benefits.

This study addressed two major gaps in the field of myopia prediction: (1) forecasting future AL in children and adolescents; (2) providing individualized, quantitative estimates of treatment benefits from myopia control interventions.

The use of AI for myopia prediction has recently become a research hotspot. Xu et al. developed an AI model that utilizes cycloplegic refraction and AL measurements from non-myopic children to predict the risk of future myopia occurrence³². Similarly, Qi et al. introduced an AI model that employs fundus photography and electronic medical records to assess the risk of myopia onset in children³³. Foo et al. used fundus photography to predict the risk of high myopia occurrence in children within five years³⁴. However, these studies did not predict the degree of myopia (SE and AL) in children on an annual basis, this is the key distinction of the present study. While some studies have attempted to predict future SE in children^{11,12,13,14,15}, none have focused on AL. We attribute this gap largely to the lack of large-scale, longitudinal AL datasets with extended follow-up. To overcome this limitation, we introduced a machine-learning-based imputation strategy to infer missing AL values from long-term patient follow-up records. This enabled the construction of a comprehensive dataset suitable for model training. Using this augmented dataset, we developed a Transformer-based model for predicting both SE and AL. The model was subsequently validated on real (non-imputed) data and demonstrated satisfactory predictive performance.

More importantly, we developed a tool that enables precise, individualized prediction of the clinical benefits of different myopia control interventions in children with myopia. Although RCTs and cohort studies have established the efficacy of various myopia control interventions^{16,17,18,19,35,36,37,38}, such evidence is typically population-based and reflects only average treatment effects. These traditional approaches do not provide individualized benefit estimates. To address this issue, we developed the IPM, a Transformer-based causal inference framework designed to overcome the influence of confounding factors and accurately estimate the causal effects of myopia control interventions on clinical outcomes. This approach enables individualized and quantitative prediction of intervention benefits, offering a more precise tool to support clinical decision-making in myopia control.

Using the MPPM model, we estimated the clinical benefits of four myopia control interventions (Atropine, PDS, Ortho-K, and RLRL) in children and adolescents, quantified as the reduction in SE and AL progression. Our results showed that over a follow-up period of up to 30 months, Atropine, PDS, and Ortho-K each slowed myopia progression to varying degrees, consistent with findings from previous RCTs and cohort studies^{16,18,19,35,36,37,38}. Notably, we observed that, on average, RLRL therapy not only halted myopia progression but also led to a modest reversal in SE, which is also in line with results reported in prior RCTs^{17,20,21,39,40}. However, it is important to note that although RLRL appears to be the most effective among the evaluated interventions, clinicians should remain cautious, as safety concerns regarding potential retinal damage have not yet been fully resolved^9,10.

In Supplementary Fig. 9a–d, for children receiving Atropine and PDS therapy, the predicted natural growth of AL after time 0 (green line) was substantially faster than the actual AL growth observed before time 0 (orange line). This occurred because, based on their age (Atropine: 10.6 ± 2.32 years; PDS: 10.94 ± 2.39 years) and pre-treatment AL growth trajectory, the model inferred that these children would be entering a phase of accelerated axial elongation. In contrast, such a pattern was not observed in children treated with Ortho-K (Supplementary Fig. 9e–f), where the predicted natural AL growth after time 0 did not markedly exceed the pre-treatment growth rate. This is likely because these children were older at baseline (14.23 ± 3.54 years), at an age when the rate of axial elongation typically slows^41,42. For children in these three treatment cohorts (Atropine, PDS, and Ortho-K), the actual AL growth after time 0 (blue line) was lower than the predicted natural growth (green line), indicating that the interventions effectively reduced axial elongation.

When comparing Supplementary Figs. 9a–d with 8a–d, we observed that for children treated with Atropine and PDS, although the model predicted a faster AL growth rate after time 0, the corresponding rate of SE progression did not accelerate. This finding is consistent with known ocular compensatory mechanisms in children and adolescents: During periods of axial elongation, thinning of the crystalline lens and deepening of the anterior chamber can partially compensate for refractive changes^43,44,45. As a result, SE progression tends to lag behind AL growth, with the rate of SE change being slower than the rate of axial elongation. This observation further supports the physiological plausibility of the model’s predictions.

This model is currently at the research validation stage and has not yet been approved as a medical device. Future translation into clinical practice should adhere to established regulatory pathways, which necessitate prospective clinical validation and compliance with Software-as-a-Medical-Device (SaMD) requirements. In this study, we observed that once patient data were entered, the model generated prediction results within 1–2 seconds. Given this high inference speed, we anticipate that MPPM could be deployed using a server–client architecture to provide real-time clinical decision support in outpatient settings. In a typical workflow, ocular biometric parameters (SE and AL) collected during routine examinations would be input into the system. The model’s predictions would then be presented to clinicians to inform their management decisions. This proposed integration aligns with existing myopia management pathways without necessitating additional clinical procedures.

This study has several limitations:

(1)
The study population was limited to children and adolescents with SE ranging from +1.0D to −10.0D and AL between 23 mm and 28 mm, covering individuals from pre-myopia to moderate and high myopia. Individuals with ultra-high myopia were excluded, as such cases are often associated with posterior staphyloma and marked irregularities in ocular morphology^46,47,48. These features substantially increase the complexity and reduce the reliability of refractive error predictions using the current model.
(2)
All participants were Chinese children and adolescents, with no representation from other nationalities. Given that the rate of myopia progression may vary substantially across populations with different ethnic, genetic, and environmental exposures^49,50, the generalizability of our model to populations of other ethnic or geographic backgrounds remains to be further validated. Future work will focus on external validation in multi-ethnic cohorts, including datasets from regions with different prevalence profiles and environmental risk factors (e.g., European, Southeast Asian, and African populations). We are currently collaborating with our partners to collect refractive data from non-Chinese children residing in China, with the aim of establishing a multi-ethnic dataset to further validate the generalizability of the model. We also plan to collaborate with international pediatric ophthalmology centers to evaluate model performance across diverse demographic and lifestyle contexts. This will allow us to assess the robustness, transferability, and potential need for recalibration of the model in non-Chinese populations.

Another limitation is the restricted range of input modalities. The model currently uses only basic demographic and clinical variables, including age, sex, and previous refraction and AL measurements. It does not incorporate additional potentially relevant factors such as genetics (e.g., family history or genotyping)^51,52, environmental influences (e.g., near work habits, screen time, or academic pressure)^53,54, or imaging data (e.g., fundus photographs)^33,34. Previous studies have demonstrated that both genetic and environmental factors play important roles in the onset and progression of myopia, and that retinal images may offer added predictive value.

However, we intentionally limited the model inputs for two key reasons:

(1)
Reducing the required input data enhances model accessibility and usability. Patients can receive personalized predictions based solely on prior clinical records, without undergoing additional tests such as fundus imaging. This simplicity facilitates large-scale deployment and maximizes the model’s reach and impact.
(2)
The longitudinal patterns captured in historical refractive data may already encode some effects of underlying genetic and environmental influences, which may explain the model’s strong performance despite its minimal input requirements.

Given the model’s robust predictive accuracy, we believe that increasing input complexity may not be necessary at this stage.

In summary, we introduced an AI-driven platform for personalized prediction and optimization of pediatric myopia management. Myopia poses a growing global public health challenge, underscoring the critical need for early prediction of its progression and personalized intervention in pediatric patients to mitigate disease burden. However, research gaps remain unaddressed in this domain: there are no tools for accurately predicting the degree of myopia over long term (including not only SE but also AL), nor are there tools for individualized prediction of the benefits of myopia control interventions. This study addresses these gaps by developing a Transformer-based time series AI model that enables accurate prediction of both SE and AL in children and adolescents over a 10-year period. Furthermore, it innovatively incorporates causal inference techniques to provide individualized predictions of the benefits of myopia control interventions. Our AI platform represents a transformative tool for guiding precision myopia management in pediatric populations, enabling clinicians to optimize intervention strategies based on individual risk profiles and predicted therapeutic responses.

Methods

Datasets and subjects

This study was conducted in accordance with the tenets of the Declaration of Helsinki, and the protocols were approved by the Clinical Research Ethics Committee of the Eye Hospital, Wenzhou Medical University (No. 2023-200-K-162). We retrospectively collected refractive examination data from children and adolescents at WMU and DCH, and additionally included data from an IIT of RLRL therapy.

The inclusion criteria were as follows:

(1)
age between 3 and 20 years (age at first visit ≤18 years, and age at the last recorded visit ≤20 years);
(2)
SE ranging from +1.0D to −10.0D, and AL between 23 mm and 28 mm;
(3)
at least two clinic visits with subjective refraction performed under cycloplegia and best-corrected visual acuity (BCVA) of ≥0.8 (Snellen);
(4)
absence of ocular diseases other than refractive errors, as confirmed by comprehensive ophthalmic examinations, including slit-lamp biomicroscopy, post-cycloplegic fundus examination, and strabismus assessment;
(5)
In the WMU non-intervention cohort and the DCH cohort, participants wore only single-vision spectacles and received no other myopia control interventions. In the WMU intervention cohorts, participants received single-vision spectacles and the assigned intervention, with no additional treatments. In the IIT study, participants had not undergone any interventions other than single-vision spectacles prior to enrollment. After enrollment, participants in the RLRL group received only single-vision spectacles and RLRL therapy, whereas those in the control group continued with single-vision spectacles alone.

The dataset included longitudinal visit records for each participant. Each record contained the following information: a de-identified unique patient ID, sex, date of birth, visit date and age at visit, and subjective refraction data (spherical power, cylindrical power, and cylinder axis). AL measurements were available for some visits, while others lacked this information. SE was calculated using the formula: SE = spherical power + 0.5 × cylindrical power.

Machine-learning-based imputation of AL data

To address missing AL data in some visit records, we employed a machine learning–based imputation strategy. Specifically, we developed an XGBoost regression model to predict AL, using age, sex, and the SE of both eyes as input features. The sex variable was numerically encoded (female = 0, male = 1) prior to model training.

Model training and validation were conducted on a subset of participants with complete AL records. Independent XGBoost models were trained for the right and left eyes. To robustly evaluate model performance, we employed five-fold cross-validation: 80% of the data were used for training and 20% for testing in each fold. Model performance was assessed using standard regression metrics, including mean squared error (MSE), MAE, R², and PCC between predicted and actual AL values. We reported the mean and standard deviation of these metrics across the five folds to reflect overall model performance.

The objective function was squared error. Hyperparameters were as follows: learning_rate = 0.05; max_depth = 6; n_estimators = 500; subsample = 0.8; colsample_bytree = 0.8; min_child_weight = 1; reg_alpha = 0.0; reg_lambda = 1.0; tree_method = “hist”; early_stopping_rounds = 50 using a fold-held validation split; random seed = 1.

After cross-validation, the final AL prediction models for the right and left eyes were retrained on the full subset of complete AL records using the optimal hyperparameters identified during validation. These trained models were then applied to the full dataset to impute missing AL values. Only missing AL fields were imputed; existing, non-missing AL records were left unchanged. The resulting AL-complete dataset was used to train the longitudinal myopia progression prediction model.

Model configuration

We used EHRFormer, a time-series transformer-based model. Our EHRFormer consists of an EHREmbedding encoder and a temporal GPT-2 encoder. The EHREmbedding uses a BERT backbone configured with 2 transformer layers and 12 attention heads (hidden size 768), GELU activations, and dropout 0.1. The temporal encoder is GPT‑2 initialized from the “gpt2” configuration (12 layers, 12 attention heads, hidden size 768). Visit order is encoded by explicit position_ids equal to the chronological index of each visit. A multi-task regression head maps the hidden state to SE and AL outputs; an auxiliary gradient-reversal head predicts medication type for representation invariance. For optimizer, we used AdamW with learning rate 5 × 10⁻⁵ and weight decay 1×10⁻³. Learning-rate schedule CosineAnnealingWarmupRestarts with first_cycle_steps = 50 epochs, warmup = 10% of epochs, min_lr = 1 × 10⁻⁸. We used a batch size of 100 and trained for a maximum of 50 epochs. No early stopping was applied; instead, the checkpoint with the lowest validation loss was selected. Mixed-precision (bf16) was used.

We use patient‑level 5‑fold mapping: train folds {1,2,3,4}, validation fold {0}, test fold {0}. No patient appears in more than one split. We verify empty intersections of patient IDs across train/validation/test before training. We set global random seed as 1 for Python, NumPy, and PyTorch. These settings ensure run‑to‑run stability on the same hardware and software stack.

Data processing and model training

To increase the sample size, longitudinal visit records for each participant were segmented into multiple training samples. The model was designed to use all available prior visit records of an individual to predict their future refractive status (including both SE and AL). For example, if an individual had four historical visit records (a, b, c, d), the records were transformed into the following training samples, each predicting the SE and AL at the next time point:

(1)
Record [a] used to predict SE and AL at record [b] (interval [a–b]);
(2)
Records [a, b] used to predict SE and AL at record [c] (interval [b–c]);
(3)
Records [a, b, c] used to predict SE and AL at record [d] (interval [c–d]).

Model performance evaluation

For a sample with n + 1 visit records, the model used the first n records to predict SE and AL at the (n + 1)-th visit. The difference between predicted and actual values at the (n + 1)-th visit was used to quantify prediction error. Model performance was evaluated using R² and MAE. For SE prediction, we additionally calculated the proportion of absolute errors within 0.75D (P[AE < 0.75D]), and for AL prediction, the proportion of absolute errors within 0.25 mm (P[AE < 0.25 mm]), representing the percentage of predictions falling within clinically acceptable error thresholds^13,14,31.

We further examined how model performance varied with the number of prior visits and the time interval between the last historical visit and the predicted visit (i.e., the prediction horizon, measured in years). To do this, we stratified the test data based on the number of prior visits (1, 2, 3, …) and prediction horizons (0–1 year, 1–2 years, …), and recalculated R², MAE, and P[AE < 0.75D] within each stratum. Additionally, we performed a two-dimensional analysis to visualize the combined effect of prior visit count and prediction horizon on model accuracy.

Causal machine learning

Causal objective

We target counterfactual natural progression and ITE. Let \({H}_{t}\) denote the longitudinal patient history up to time \(t\), \({A}_{t}\in A\) the intervention at \(t\), and \({Y}_{t+\Delta }(a)\) the potential outcome at horizon \(\Delta\) under intervention \(a\) been taken at \(t\). The observed outcome is \({Y}_{t+\Delta }\) (i.e. SE and AL).

Define \({\mu }_{a}\left({H}_{t}\right)={\mathbb{E}}\left[{Y}_{t+\Delta }({\rm{a}}),|,{H}_{t}\right]\) and let \({a}_{0}\) denote “no intervention” (natural progression)

Our estimands are the natural-progression outcome \({\mu }_{a0}\left({H}_{t}\right)\) and the ITE \(\tau \left({H}_{t}\right)={\mu }_{a}\left({H}_{t}\right)-{\mu }_{a0}\left({H}_{t}\right)\) for a prespecified active \(a\). Identification follows from standard sequential assumptions: consistancy (if \({A}_{t}=a\) then \({Y}_{t+\Delta }={Y}_{t+\Delta }(a)\), with no hidden treatment versions or interference), sequential exchangeability (\({Y}_{t+\Delta }(a)\perp {A}_{t}|{H}_{t}\) for all a \(\in A\)), and positivity (\(0 < P({A}_{t}={a|}{H}_{t}) < 1\) on the support).

Transformer + adversarial deconfounding

We encode \({H}_{t}\) with a Transformer to obtain a temporal representation \({z}_{t}=\Phi \left({H}_{t}\right)\) by averaging hidden states. To remove confounding from treatment assignment, we train an adversarial classifier \(g\) to predict \({A}_{t}\) from \({z}_{t}\), coupled with a gradient reversal layer (GRL). During forward propagation GRL is the identity; during backpropagation it multiplies the gradient by \(-\lambda\), driving \(\Phi\) to suppress treatment-predictive information. The adversary is optimized with cross-entropy loss \({L}_{{adv}}\).

Outcome and causal losses

The outcome head \(f\) predicts \({Y}_{t+\triangle }\). To align with causal identification, we weight the outcome loss with inverse propensity weights: \({L}_{{task}}{\mathbb{=}}{\mathbb{E}}\left[w\left({A}_{t},{H}_{t}\right)\cdot l\left(f\left({z}_{t},{A}_{t}\right),{Y}_{t+\Delta }\right)\right],w=\frac{1}{\pi \left({A}_{t},|,{H}_{t}\right)}\left({IPTW}\right)\), where \(\pi \left({A}_{t},|,{H}_{t}\right)\) is a learned propensity model. The full objective is: \(\mathop{\min }\limits_{\Phi ,f}\mathop{\max }\limits_{g}{L}_{{total}}={L}_{{task}}+\beta {L}_{{adv}}\), which induces balanced, treatment-invariant representations \({z}_{t}\) (i.e., reduced \(I({z}_{t};{A}_{t})\)), a sufficient condition for unbiased counterfactual prediction under the assumptions.

Counterfactual prediction

At inference, we obtain counterfactuals by clamping the intervention while holding \({z}_{t}\) fixed: \({\hat{Y}}_{t+\Delta }^{a}=f({z}_{t},a),\,{\hat{\mu }}_{0}({H}_{t})={\hat{Y}}_{t+\Delta }^{a},\,\hat{\tau }({H}_{t})={\hat{Y}}_{t+\Delta }^{1}-{\hat{Y}}_{t+\Delta }^{0}\).

Data availability

Python code for conducting the core analyses is available on GitHub and will be public after publication (https://anonymous.4open.science/r/Eyeformer-A07E). Restrictions apply to the availability of datasets, which were used with the permission of the participants for the current study.

References

Liang, J. et al. Global prevalence, trend and projection of myopia in children and adolescents from 1990 to 2050: a comprehensive systematic review and meta-analysis. Br. J. Ophthalmol. 109, 362–371 (2025).
Article PubMed Google Scholar
Jonas, J. B. et al. IMI prevention of myopia and its progression. Invest. Ophthalmol. Vis. Sci. 62, 6 (2021).
Article PubMed PubMed Central Google Scholar
Morgan, I. G. & Jan, C. L. China turns to school reform to control the myopia epidemic: a narrative review. Asia Pac. J. Ophthalmol. 11, 27–35 (2022).
Article Google Scholar
Haarman, A. E. G. et al. The complications of myopia: a review and meta-analysis. Invest. Ophthalmol. Vis. Sci. 61, 49 (2020).
Article PubMed PubMed Central Google Scholar
Zhang, X. J. et al. Five-year clinical trial of the low-concentration atropine for myopia progression (LAMP) study: phase 4 report. Ophthalmology 131, 1011–1020 (2024).
Article PubMed Google Scholar
Liu, C. & Ni, Y. Corneal wound associated with orthokeratology lenses. JAMA Ophthalmol. 140, e223044 (2022).
Article PubMed Google Scholar
Sartor, L., Hunter, D. S., Vo, M. L. & Samarawickrama, C. Benefits and risks of orthokeratology treatment: a systematic review and meta-analysis. Int. Ophthalmol. 44, 239 (2024).
Article PubMed PubMed Central Google Scholar
Burnett, A. et al. Parents’ willingness to pay for children’s spectacles in Cambodia. BMJ Open Ophthalmol. 6, e000654 (2021).
Article PubMed PubMed Central Google Scholar
Liu, H., Yang, Y., Guo, J., Peng, J. & Zhao, P. Retinal damage after repeated low-level red-light laser exposure. JAMA Ophthalmol. 141, 693–695 (2023).
Article PubMed Google Scholar
Liao, X. et al. Cone density changes after repeated low-level red light treatment in children with myopia. JAMA Ophthalmol. https://doi.org/10.1001/jamaophthalmol.2025.0835 (2025).
Huang, J., Ma, W., Li, R., Zhao, N. & Zhou, T. Myopia prediction for children and adolescents via time-aware deep learning. Sci. Rep. 13, 5430 (2023).
Article CAS PubMed PubMed Central Google Scholar
Li, J. et al. Accurate prediction of myopic progression and high myopia by machine learning. Precis. Clin. Med. 7, pbae005 (2024).
Article PubMed PubMed Central Google Scholar
Lin, H. et al. Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: a retrospective, multicentre machine learning study. PLoS Med. 15, e1002674 (2018).
Article PubMed PubMed Central Google Scholar
Varošanec, A. M., Marković, L. & Sonicki, Z. A novel time-aware deep learning model predicting myopia in children and adolescents. Ophthalmol. Sci. 4, 100563 (2024).
Article PubMed PubMed Central Google Scholar
Zhao, J. et al. Development and validation of predictive models for myopia onset and progression using extensive 15-year refractive data in children and adolescents. J. Transl. Med. 22, 289 (2024).
Article PubMed PubMed Central Google Scholar
Cho, P. & Cheung, S. W. Retardation of myopia in Orthokeratology (ROMIO) study: a 2-year randomized clinical trial. Invest. Ophthalmol. Vis. Sci. 53, 7077–7085 (2012).
Article PubMed Google Scholar
Jiang, Y. et al. Effect of repeated low-level red-light therapy for myopia control in children: a multicenter randomized controlled trial. Ophthalmology 129, 509–519 (2022).
Article PubMed Google Scholar
Su, B. et al. Novel Lenslet-ARray-integrated spectacle lenses for myopia control: a 1-year randomized, double-masked, controlled trial. Ophthalmology 131, 1389–1397 (2024).
Article PubMed Google Scholar
Yam, J. C. et al. Low-concentration atropine for myopia progression (LAMP) study: a randomized, double-blinded, placebo-controlled trial of 0.05%, 0.025%, and 0.01% atropine eye drops in myopia control. Ophthalmology 126, 113–124 (2019).
Article PubMed Google Scholar
Chen, H. et al. Low-intensity red-light therapy in slowing myopic progression and the rebound effect after its cessation in Chinese children: a randomized controlled trial. Graefe’s. Arch. Clin. Exp. Ophthalmol. 261, 575–584 (2023).
Article Google Scholar
Dong, J., Zhu, Z., Xu, H. & He, M. Myopia control effect of repeated low-level red-light therapy in chinese children: a randomized, double-blind, controlled clinical trial. Ophthalmology 130, 198–204 (2023).
Article PubMed Google Scholar
Fu, A. et al. Effect of low-dose atropine on myopia progression, pupil diameter and accommodative amplitude: low-dose atropine and myopia progression. Br. J. Ophthalmol. 104, 1535–1541 (2020).
PubMed Google Scholar
Wu, H., Xu, J., Wang, J. & Long, M. In Proceedings of the 35th International Conference on Neural Information Processing Systems Article 1717 (Curran Associates Inc., 2021).
Lawrenson, J. G. et al. Interventions for myopia control in children: a living systematic review and network meta-analysis. Cochrane Database Syst. Rev. 2, Cd014758 (2023).
PubMed PubMed Central Google Scholar
Walline, J. J. et al. Interventions to slow progression of myopia in children. Cochrane Database Syst. Rev. 1, Cd004916 (2020).
PubMed PubMed Central Google Scholar
Alonso, M. N. I. Transformers for Causality. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5045350 (2024).
Melnychuk, V., Frauen, D. & Feuerriegel, S. In International conference on machine learning. 15293–15329 (PMLR).
Nichani, E., Damian, A. & Lee, J. D. How transformers learn causal structure with gradient descent. (2024).
Feuerriegel, S. et al. Causal machine learning for predicting treatment outcomes. Nat. Med. 30, 958–968 (2024).
Article CAS PubMed Google Scholar
Chen, S. et al. Axial growth driven by physical development and myopia among children: a two year cohort study. J. Clin. Med. 11, https://doi.org/10.3390/jcm11133642 (2022).
Smith, G. Refraction and visual acuity measurements: what are their measurement uncertainties? Clin. Exp. Optom. 89, 66–72 (2006).
Article PubMed Google Scholar
Xu, S. et al. Establishment of myopia occurrence prediction model in children without myopia using cycloplegic refraction and prior axial length change. Ophthalmology 132, 1260–1272 (2025).
Article PubMed Google Scholar
Qi, Z. et al. A deep learning system for myopia onset prediction and intervention effectiveness evaluation in children. NPJ Digit. Med. 7, 206 (2024).
Article PubMed PubMed Central Google Scholar
Foo, L. L. et al. Deep learning system to predict the 5-year risk of high myopia using fundus imaging in children. NPJ Digit. Med. 6, 10 (2023).
Article PubMed PubMed Central Google Scholar
Pérez-Flores, I., Macías-Murelaga, B. & Barrio-Barrio, J. Age-related results over 2 years of the multicenter Spanish study of atropine 0.01% in childhood myopia progression. Sci. Rep. 13, 16310 (2023).
Article PubMed PubMed Central Google Scholar
Lee, S. S. et al. Low-concentration atropine eyedrops for myopia control in a multi-racial cohort of Australian children: a randomised clinical trial. Clin. Exp. Ophthalmol. 50, 1001–1012 (2022).
Article PubMed PubMed Central Google Scholar
Najji, R. et al. The real-world effectiveness of defocus incorporated multiple segments and highly aspherical lenslets on myopia control: a longitudinal study from the French myopia cohort. BMJ Open Ophthalmol. 10, https://doi.org/10.1136/bmjophth-2025-002142 (2025).
Walline, J. J., Jones, L. A. & Sinnott, L. T. Corneal reshaping and myopia progression. Br. J. Ophthalmol. 93, 1181–1185 (2009).
Article CAS PubMed Google Scholar
Tian, L. et al. Investigation of the efficacy and safety of 650 nm low-level red light for myopia control in children: a randomized controlled trial. Ophthalmol. Ther. 11, 2259–2270 (2022).
Article PubMed PubMed Central Google Scholar
Liu, G. et al. Axial shortening effects of repeated low-level red-light therapy in children with high myopia: a multicenter randomized controlled trial. Am. J. Ophthalmol. 270, 203–215 (2025).
Article PubMed Google Scholar
Wong, Y. L. et al. Variations in physiological and myopic eye growth among children from different populations. Am. J. Ophthalmol. 280, 20–27 (2025).
Article PubMed Google Scholar
Zhang, J. et al. Changes in lens thickness and power before and after myopia onset. Invest. Ophthalmol. Vis. Sci. 66, 36 (2025).
Article PubMed PubMed Central Google Scholar
Han, X. et al. Longitudinal changes in lens thickness and lens power among persistent non-myopic and myopic children. Invest. Ophthalmol. Vis. Sci. 63, 10 (2022).
Article PubMed PubMed Central Google Scholar
Li, S. M. et al. Corneal power, anterior segment length and lens power in 14-year-old chinese children: the Anyang childhood eye study. Sci. Rep. 6, 20243 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rozema, J., Dankert, S., Iribarren, R., Lanca, C. & Saw, S. M. Axial growth and lens power loss at myopia onset in Singaporean children. Invest. Ophthalmol. Vis. Sci. 60, 3091–3099 (2019).
Article PubMed Google Scholar
Mimura, R. et al. Ultra-widefield retinal imaging for analyzing the association between types of pathological myopia and posterior staphyloma. J. Clin. Med. 8, 1505 (2019).
Goldschmidt, E. & Fledelius, H. C. Clinical features in high myopia. A Danish cohort study of high myopia cases followed from age 14 to age 60. Acta Ophthalmol. 89, 97–98 (2011).
Article PubMed Google Scholar
Nakao, N. et al. Quantitative evaluations of posterior staphylomas in highly myopic eyes by ultra-widefield optical coherence tomography. Invest. Ophthalmol. Vis. Sci. 63, 20 (2022).
Article PubMed PubMed Central Google Scholar
Luong, T. Q. et al. Racial and ethnic differences in myopia progression in a large, diverse cohort of pediatric patients. Invest. Ophthalmol. Vis. Sci. 61, 20 (2020).
Article PubMed PubMed Central Google Scholar
Naduvilath, T. et al. Regional/ethnic differences in ocular axial elongation and refractive error progression in myopic and non-myopic children. Ophthalmic Physiol. Opt. 45, 135–151 (2025).
Article PubMed Google Scholar
Cai, X. B., Shen, S. R., Chen, D. F., Zhang, Q. & Jin, Z. B. An overview of myopia genetics. Exp. Eye Res. 188, 107778 (2019).
Article CAS PubMed Google Scholar
Li, J. & Zhang, Q. Insight into the molecular genetics of myopia. Mol. Vis. 23, 1048–1080 (2017).
CAS PubMed PubMed Central Google Scholar
Wong, C. W. et al. Digital screen time during the COVID-19 pandemic: risk for a further myopia boom? Am. J. Ophthalmol. 223, 333–337 (2021).
Article CAS PubMed Google Scholar
Zhang, C., Li, L., Jan, C., Li, X. & Qu, J. Association of school education with eyesight among children and adolescents. JAMA Netw. Open 5, e229545 (2022).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was supported by Beijing Natural Science Foundation (F251001), National Natural Science Foundation of China (32371035, W2431057), National Key R&D Program of China (2022YFA1105502), Beijing Nova Program (20230484246), Young Elite Scientists Sponsorship Program by CAST (2023QNRC001), Wenzhou Medical University Eye Health and Disease Advanced Institute, the Macau Science and Technology Development Fund, Macao (0007/2020/AFJ, 0070/2020/A2, and 0003/2021/AKP), Guangzhou National Laboratory (YW-SLJC0201). Acknowledgements to EHR and Image Reading and Evaluation Group on Systemic and Eye Diseases: Leaders and Senior Physicians: Kang Zhang, Jia Qu, Jie Chen; Members: Xiaoniao Chen, Hang Wong, Sian Liu, Hui Xu, Cheng Tang, Changxi Hu, Xu Xu, Xuan Zhang, Wenyang Lu, Tianyi Xu, Binrong Wu, Wenjia Cai, Jie Xu, Shuang Liu, Sai Pan, Chuyue Zhang, Yue Niu.

Author information

These authors contributed equally: Sian Liu, Yuxing Lu, Xiaoman Li, Xiaoniao Chen, Zhuo Sun, Gen Li.

Authors and Affiliations

State Key Laboratory of Eye Health, Institute for Advanced Study on Eye Health and Diseases, Institute for Clinical Data Science, Wenzhou Medical University, Wenzhou, China
Sian Liu, Yuxing Lu, Xiaoman Li, Xiaoniao Chen, Zhuo Sun, Gen Li, Kai Wang, Hui Xu, Hongyi Li, Changxi Hu, Zixing Zou, Xuan Zhang, Wenyang Lu, Jia Qu, Kang Zhang & Jie Chen
Department of Big Data and Biomedical AI, College of Future Technology, Peking University, Beijing, China
Yuxing Lu, Kai Wang & Wei Wu
Senior Department of Ophthalmology, Chinese PLA General Hospital, Beijing, China
Xiaoniao Chen
Institute for AI in Medicine, Faculty of Medicine, Macau University of Science and Technology, Macau, China
Zixing Zou, Wenyang Lu & Kang Zhang
Guangzhou National Laboratory, Guangzhou, China
Zixing Zou & Kang Zhang
Department of Optometry, Shenzhen Eye Hospital, Southern Medical University, Shenzhen, China
Miao Zhang
Faculty of Business and Wellness, City University of Macau, Macau, China
Yun Yin

Authors

Sian Liu
View author publications
Search author on:PubMed Google Scholar
Yuxing Lu
View author publications
Search author on:PubMed Google Scholar
Xiaoman Li
View author publications
Search author on:PubMed Google Scholar
Xiaoniao Chen
View author publications
Search author on:PubMed Google Scholar
Zhuo Sun
View author publications
Search author on:PubMed Google Scholar
Gen Li
View author publications
Search author on:PubMed Google Scholar
Kai Wang
View author publications
Search author on:PubMed Google Scholar
Wei Wu
View author publications
Search author on:PubMed Google Scholar
Hui Xu
View author publications
Search author on:PubMed Google Scholar
Hongyi Li
View author publications
Search author on:PubMed Google Scholar
Changxi Hu
View author publications
Search author on:PubMed Google Scholar
Zixing Zou
View author publications
Search author on:PubMed Google Scholar
Miao Zhang
View author publications
Search author on:PubMed Google Scholar
Xuan Zhang
View author publications
Search author on:PubMed Google Scholar
Wenyang Lu
View author publications
Search author on:PubMed Google Scholar
Yun Yin
View author publications
Search author on:PubMed Google Scholar
Jia Qu
View author publications
Search author on:PubMed Google Scholar
Kang Zhang
View author publications
Search author on:PubMed Google Scholar
Jie Chen
View author publications
Search author on:PubMed Google Scholar

Contributions

K.Z., Y.Y., J.C., J.Q., S.L., Y.L., X.L., Z.S., G.L., K.W., W.W., H.X., H.L., X.C., C.H., Z.Z., X.Z., W.L., and M.Z. collected and analyzed the data. K.Z., Y.Y., J.Q., X.C., and J.C. conceived, designed, and supervised the project. K.Z., G.L., S.L., Y.L., and J.C. wrote the manuscript. All authors discussed the results and reviewed the manuscript.

Corresponding authors

Correspondence to Xiaoniao Chen, Yun Yin, Jia Qu, Kang Zhang or Jie Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, S., Lu, Y., Li, X. et al. AI-guided personalized predictions on myopia progression and interventions. npj Digit. Med. 9, 129 (2026). https://doi.org/10.1038/s41746-025-02308-4

Download citation

Received: 02 September 2025
Accepted: 19 December 2025
Published: 12 January 2026
Version of record: 06 February 2026
DOI: https://doi.org/10.1038/s41746-025-02308-4

Subjects

Abstract

Similar content being viewed by others

Predicting onset of myopic refractive error in children using machine learning on routine pediatric eye examinations only

Models of myopia: the effect of accommodation, lenses and atropine

Synergistic effect of defocus incorporated multiple segment glasses and repeated low level red light therapy against myopia progression

Introduction

Results

Overview of the study design

Participant characteristics and follow-up details

AL data imputation

Myopia Progression Predictive Model (MPPM) architecture

The performance of the Natural Progression Module (NPM)

The performance of Intervention Progression Module (IPM)

Individualized treatment effect (ITE) estimated by MPPM

Validation of the MPPM using an investigator initiated trial

Discussion

Methods

Datasets and subjects

Machine-learning-based imputation of AL data

Model configuration

Data processing and model training

Model performance evaluation

Causal machine learning

Causal objective

Define \({\mu }_{a}\left({H}_{t}\right)={\mathbb{E}}\left[{Y}_{t+\Delta }({\rm{a}}),|,{H}_{t}\right]\) and let \({a}_{0}\) denote “no intervention” (natural progression)

Transformer + adversarial deconfounding

Outcome and causal losses

Counterfactual prediction

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links