Introduction

Stargardt disease type 1 (STGD1) is a genetic retinal disorder, with a typical onset in childhood or adolescence, which progressively leads to vision loss and results in legal blindness in almost all cases1,2. Despite ongoing clinical trials, there are currently no approved therapies3. The rarity of the disease and its slow progression pose significant challenges in measuring structural progression, which is critical for understanding the disease’s natural history and developing potential treatments. The ProgStar study, a multicenter longitudinal program conducted throughout the world, has collected extensive data on retinal structure and function, offering a unique opportunity to overcome these challenges4,5,6.

One of the promising endpoints for assessing the structural progression of Stargardt disease is Optical Coherence Tomography (OCT), a key examination method in the ProgStar study7. OCT is a non-invasive and patient-friendly procedure with a short acquisition time. It provides high-resolution cross-sectional images of the posterior pole of the retina at the micrometer scale, enabling the objective measurement of photoreceptor layer loss and retinal pigment epithelium (RPE) atrophy. Notably, OCT can reveal structural changes, such as hyper-reflectivity at the base of the foveal outer nuclear layer, even before the onset of symptoms8,9.

OCT imaging allows for the quantification of several parameters based on the segmentation of retinal layers, including total retinal thickness and the thickness of sublayers such as the inner retina, the outer nuclear layer, the photoreceptor inner/outer segments, and the RPE. A recent ProgStar report demonstrated a significant decline in the mean thickness and intact area (defined by a thickness value \(>0 \upmu m\)) of the outer retinal layers, captured through OCT scans of patients with STGD1 over a 24-month period9. This supports the potential of OCT imaging and retinal layer thickness as key endpoints for clinical trials aimed at managing disease progression. However, experience with manual and semi-automated segmentation of OCT images of patients with STGD1 has demonstrated that this task is complex10,11. Continuous segmentation of the photoreceptor inner and outer segment boundaries is particularly challenging due to layer disruptions and the accumulation of hyper-reflective debris caused by the disease pathology. Even correcting software errors in semi-automated segmentation is time-consuming, requiring significant resources in terms of time, funding and personnel12.

Automatic segmentation of retinal sublayers in OCT images has been widely studied in recent years, employing both classical methods and deep learning-based approaches. These methods usually focus on boundary delineation or pixel-wise classification to identify retinal sublayers and offer insights into various retinal disorders. However, few studies specifically address the complexities of segmenting OCT images in the context of Stargardt disease, where structural disruptions challenge the segmentation of retinal sublayers. Liu et al. employed an improved Canny operator for automatic segmentation of eleven retinal boundaries in OCT images of healthy subjects and age-related macular degeneration (AMD) patients13. Their method showed robustness in handling noise and artifacts, but its reliance on classical edge detection techniques may limit adaptability to the severe disruptions seen in Stargardt disease. Li et al., with DeepRetina, introduced a modified Xception65 architecture14 combined with atrous spatial pyramid pooling15 to delineate 10 retinal boundaries in OCT images of healthy subjects. Their approach was notable for its adaptability to high-resolution OCT images and its preservation of fine details. UNet architectures16 have been widely used for retinal segmentation tasks due to their ability to preserve spatial hierarchies and extract contextual information. For instance, Yojana and Thillai Rani developed a UNet-based model with a ResNet34 encoder-decoder for total retina segmentation in healthy eyes17. However, this approach lacks the granularity to address sublayer segmentation. He et al.18 introduced a residual UNet consisting of residual blocks with two output branches, combining pixel classification and boundary regression into a single feed-forward operation. Their model was promising in segmenting retinal sublayers in diabetic macular edema (DME) and multiple sclerosis; however, it requires adaptation for the unique disruptions in Stargardt disease. Sousa et al.19 integrated UNet and DexiNed20 in a two-step pipeline to segment retinal sublayers in OCT images of healthy subjects and AMD patients. Their method effectively captured fine boundary details, but its performance in severely disrupted regions, typical of Stargardt pathology, remains unexplored. Wang et al.21 proposed a 3D graph-assisted UNet, integrating a graph pyramid into a U-shaped network for segmentation of OCT volumes in DME and wet AMD patients. Despite its success in segmenting diseased retinas, its dependency on spatial coherence may not generalize well to Stargardt disease, where layer continuity is often lost. More recently, there has been growing interest in deploying attention-based models and vision transformers for OCT segmentation22. Cao et al.23 proposed a UNet-based model integrating self-attention mechanisms. A transformer block was added at the encoder's output to capture long-range dependencies, and spatial attention mechanisms were added in skip connections and upsampling to enhance essential features. Their model was validated for nine retinal layers on data from patients with DME, myopia, peripapillary atrophy, and cataract. Similarly, Zhang et al. proposed TranSegNet24, which combined a lightweight vision transformer with an improved UNet backbone, using multi-head convolutional attention for global feature extraction. The network was evaluated for the segmentation of retinal OCT images of healthy subjects and DME patients. These models, while effective in various retinal pathologies, remain largely unexplored in their utility for the disrupted retinal layers characteristic of Stargardt disease.

To our knowledge, very few studies have been conducted on automated OCT segmentation for Stargardt disease to date. Kugelman et al. investigated the use of automated deep learning techniques to segment retinal boundaries in Stargardt patients' OCT images25. However, they focused on segmenting the total retina as a single layer, rather than individual retinal sublayers. Mishra et al. introduced a UNet-based16 deep learning model to segment the retina into 11 sublayers in OCT images of Stargardt patients26. Another study independently re-segmented a subset of the same dataset that was initially graded for Mishra's study and noted reproducibility issues, particularly in grading the outer retinal layers in disrupted areas critical for assessing disease progression12. This was attributed to inclusion criteria based on atrophy, which rendered inner-outer segment layers absent in the central subfield. The variability in these measurements raises concerns about the reliability of such complex segmentation methods for consistent disease monitoring. Furthermore, the extensive manual segmentation performed by Mishra et al.26 is time-consuming and costly, making it impractical for clinical trials. These approaches face significant limitations: the former lacks granularity, while the latter introduces reproducibility issues, especially in areas where retinal layers are absent due to degeneration.

Addressing these challenges requires establishing reliable ground truth annotations and developing robust, automated segmentation techniques capable of managing the complexities of retinal degeneration while enabling accurate measurements of progressive structural changes of the retina. In this study, we propose a novel deep learning-based approach that incorporates a pathology-aware loss function, targeting relatively unaffected regions where sublayers can still be detected. Retinal sublayer segmentation is performed only in these regions—where disruption is minimal enough to allow for accurate boundary delineation. In regions with severe degeneration, identified by a box detection model, the total retina is segmented as a single layer. This approach prevents the model from attempting to segment layers that are no longer present by distinguishing severely affected regions from relatively unaffected ones, applying tailored segmentation models to each area, thereby reducing errors and enhancing the reliability of structural measurements.

Methods

Dataset

The data used for this study were obtained from the prospective arm of the ProgStar study4 and from a set of healthy eyes, as described below.

ProgStar data

The prospective arm of the ProgStar study provides longitudinal data from 259 patients with molecularly confirmed STGD1, gathered at nine international sites. Details of data acquisition and demographic information of patients have been reported previously4.

Almost all patients underwent five study visits every six months, with both eyes examined at each visit. Multiple investigations were performed during these visits, including Spectral Domain OCT (SD-OCT), fundus autofluorescence, and microperimetry. This study specifically focused on analyzing SD-OCT images, obtained with the Heidelberg Spectralis (Heidelberg Engineering, Heidelberg, Germany), which consisted of volumetric macular scans with 49 B-scans, each with a resolution of \({1024 \times 496}\) pixels, covering a field of view (FOV) of \({20}^{\circ }\) by \({20}^{\circ }\) (approximately 6 mm \(\times\) 6 mm with 2 mm depth).

Quality of SD-OCT images was assessed based on the following criteria: FOV size, image resolution, cut-off artifacts, noise levels, tilting effects, non-uniformity, hyper-reflectivity, reflection artifacts, and eccentric fixation.

Ideally, we would have had 2,590 volumes with 126,910 B-scans; however, several patients did not have both eyes examined or missed follow-up visits, and some B-scans were excluded due to poor quality. As a result, 105,537 B-scans from the ProgStar dataset were included in the analysis.

In some cases, difficulties with fixation led to the inclusion of the optic nerve in the FOV. These instances were corrected by manually masking the optic nerve. Additionally, due to the registration of data during successive visits, some B-scans were slightly rotated, resulting in oblique cuts through the retina layers; these sections were also masked out.

Healthy eyes data

The healthy eyes dataset comprised 100 SD-OCT volumes acquired by the Heidelberg Spectralis system from 100 volunteers at the Department of Ophthalmology, University of Bonn, Germany. The Institutional Review Board (IRB) of the University of Bonn approved the study (ethics approval ID: 191/16). The study adhered to the Declaration of Helsinki and local regulations. Written informed consent was obtained from all participants.

Each SD-OCT volume featured a macular scan with 121 B-scans, each with a resolution of \({768 \times 496}\) pixels, covering an FOV of \({30}^{\circ }\) by \({25}^{\circ }\) (approximately 9 mm \(\times\) 7.5 mm with 2 mm depth). To ensure compatibility with the ProgStar dataset, the healthy B-scans were resized using nearest neighbor interpolation to match the matrix size of ProgStar dataset. Additionally, due to the larger FOV in the healthy dataset, some B-scans included the optic nerve, which was masked out whenever it appeared.
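As a rough illustration of this preprocessing step, the snippet below shows nearest-neighbor resizing of a healthy B-scan to the ProgStar matrix size; the use of OpenCV is our assumption, since the study does not specify the implementation.

```python
import cv2
import numpy as np

# Target matrix size of the ProgStar B-scans (width x height in pixels).
TARGET_WIDTH, TARGET_HEIGHT = 1024, 496

def resize_healthy_bscan(bscan: np.ndarray) -> np.ndarray:
    """Resize a 768x496 healthy B-scan to the 1024x496 ProgStar matrix size
    using nearest-neighbor interpolation."""
    return cv2.resize(bscan, (TARGET_WIDTH, TARGET_HEIGHT),
                      interpolation=cv2.INTER_NEAREST)
```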

Data preparation

Data sampling and stratification

As detailed above, over \(10^5\) B-scans from the ProgStar dataset were available for analysis. Manually annotating such a large number of 2D images is highly time-consuming. Therefore, we implemented an iterative framework similar to the active learning method27 to minimize the amount of manual annotation required for training the models (see section Iterative Learning). An initial sample for training the deep learning models was stratified and selected as follows (see Figure 1):

  1. Site-based stratification: The ProgStar data were collected from multiple sites, each contributing a different number of patients. Patients from each site were randomly assigned to training, validation and test sets in an 8 : 1 : 1 ratio, ensuring that each site approximately follows this proportion, and no individual patient's data appears in more than one set. This stratification resulted in 203 patients in the training set and 28 patients in each of the validation and test sets.

  2. B-scan grouping: B-scans from each SD-OCT volume were divided into central and peripheral groups, with the middle 11 B-scans classified as central and the remaining 38 as peripheral. For each patient, all central and peripheral B-scans, across all visits and both eyes, were grouped and shuffled.

  3. B-scan sampling: A fixed number of B-scans were sampled from each patient to create the dataset. The sampling ratio was set at \(30\%\) central B-scans and \(70\%\) peripheral B-scans, ensuring denser sampling from the central region, where the most damage due to the disease typically occurs. The fixed number of B-scans per patient was set to 16 for the validation and test sets, which remain constant during training, and three for the training set, which increases as the training progresses.

This approach resulted in an initial training set of 609 B-scans, and 448 B-scans in each of the validation and test sets. A similar procedure was used to sample an equivalent number of images from the healthy dataset. The sampled data were annotated as described in the next section. During the training process, the size of the training set increases as part of the iterative learning loops. We refer to the initially sampled and annotated training set as the "initial training set", and the remaining unannotated images from the patients in the training data as the "unannotated training set".
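The following sketch illustrates one possible implementation of the sampling and stratification procedure above; the function names and data structures (e.g. patients_by_site) are hypothetical and not taken from the study code.

```python
import random

def split_patients_by_site(patients_by_site, seed=0):
    """Randomly split the patients of every site into train/val/test at 8:1:1,
    so that no patient appears in more than one set."""
    rng = random.Random(seed)
    splits = {"train": [], "val": [], "test": []}
    for site, patients in patients_by_site.items():
        patients = list(patients)
        rng.shuffle(patients)
        n = len(patients)
        n_val = max(1, round(0.1 * n))
        n_test = max(1, round(0.1 * n))
        splits["val"] += patients[:n_val]
        splits["test"] += patients[n_val:n_val + n_test]
        splits["train"] += patients[n_val + n_test:]
    return splits

def sample_bscans_for_patient(central, peripheral, n_total, seed=0):
    """Sample a fixed number of B-scans per patient: 30% from the central
    group (middle 11 B-scans) and 70% from the peripheral group."""
    rng = random.Random(seed)
    n_central = round(0.3 * n_total)
    n_peripheral = n_total - n_central
    return (rng.sample(central, min(n_central, len(central))) +
            rng.sample(peripheral, min(n_peripheral, len(peripheral))))
```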

Fig. 1

Data sampling and stratification process. Patients from different sites were randomly divided into training, validation, and test sets in an 8:1:1 ratio, ensuring no patient’s data was in more than one set. This resulted in 203 patients in the training set, and 28 patients in each of the validation and test sets. OCT B-scans were divided into central (middle 11 B-scans) and peripheral (remaining 38 B-scans) groups, shuffled across all visits and both eyes. 30% of B-scans were selected from the central group and 70% from the peripheral group.

Ground truth generation

Box annotation: We used box annotations to differentiate between severely damaged regions and those that were moderately affected or unaffected by the disease. A region was considered severely affected by Stargardt disease where at least one retinal layer was undetectable. This area was enclosed within an orthogonal box drawn on the B-scan. If multiple small regions with undetectable layers were present across the retina, they were grouped together within a single larger box. For consistency, the enclosed area will be referred to as the "severely affected region", while the area outside the box will be termed the "relatively unaffected region". It is important to note that the region outside the box may not be healthy; in fact, its "health" is the very question that thickness measurement through segmentation aims to answer.

Retina segmentation: Outside the box, where retinal sublayers were detectable, the retina was manually segmented into five sublayers using the “OCTExplorer” software developed by the University of Iowa28,29,30. The sublayers are defined as follows (see Figure 2):

  • Inner Retina (IR): from the Inner Limiting Membrane (ILM) to the inner boundary of the Outer Plexiform Layer (OPL).

  • Outer Nuclear Layer (ONL): from the OPL inner boundary to the External Limiting Membrane (ELM).

  • Photoreceptor Inner Segment (PR-IS): from the ELM to the posterior side of the Ellipsoid Zone (EZ).

  • Photoreceptor outer segment (PR-OS): from the posterior EZ to the inner boundary of Retinal Pigment Epithelium (RPE).

  • Retinal Pigment Epithelium (RPE): from the RPE inner boundary to the choroid inner boundary.

For the region inside the box, due to the complexity and degeneration of the retinal layers, only the total retinal thickness was segmented as a single layer.

Both annotation steps were performed by a trained individual and verified by a second trained reviewer. Any disagreements were resolved through discussion with an ophthalmologist.

Fig. 2

An example of an annotated B-scan with a box indicating the severely affected region, six boundaries outside the box, and two boundaries inside the box.

Image analysis

Our approach comprises three main steps (see Figure 3):

  1. Detection of the severely affected region—box detection: Identifying the regions severely affected by Stargardt disease in 2D B-scans using a box detection model, marking these areas with a rectangular box. The box delineates severe damage that does not allow segmentation beyond the total retina.

  2. Segmentation of the total retina: Using a deep learning model to segment the total retina across the entire B-scan, as the total retina remains visible in most Stargardt cases.

  3. Segmentation of retinal sublayers: Applying a second deep learning model to segment retinal sublayers by integrating the information obtained from the first step. The loss function accounts for the severely affected region by excluding it from the calculation, thus focusing only on the relatively unaffected regions outside the box for sublayer segmentation.

Each model is integrated into an iterative learning algorithm, reducing the need for comprehensive data annotation by acquiring manual annotations for only a sample of the data. The following sections describe the detailed implementation of the image analysis.
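For illustration, the pseudocode below sketches how the three steps described above could be composed at inference time for a single B-scan; the model wrappers (box_detector, total_retina_model, sublayer_model) are hypothetical callables, not the study's actual interfaces.

```python
def segment_bscan(bscan, box_detector, total_retina_model, sublayer_model):
    """Illustrative composition of the three steps on a single B-scan.
    The model wrappers are hypothetical callables returning numpy arrays."""
    # Step 1: detect the severely affected region (the list may be empty).
    boxes = box_detector(bscan)                    # list of (x0, y0, x1, y1)

    # Step 2: segment the total retina over the whole B-scan.
    total_retina_mask = total_retina_model(bscan)  # HxW binary mask

    # Step 3: segment the five sublayers; predictions inside the box are
    # discarded, since sublayers are not defined there.
    sublayer_map = sublayer_model(bscan)           # HxW label map, 0 = background
    for (x0, y0, x1, y1) in boxes:
        sublayer_map[:, x0:x1] = 0                 # box spans the full image height

    # Output: sublayers outside the box, total retina everywhere (incl. inside).
    return total_retina_mask, sublayer_map
```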

Fig. 3

Our pathology-aware segmentation approach consists of three steps: 1) Box detection: A box detection model identifies severely affected regions in 2D B-scans, marking them with a rectangular box. 2) Total retina segmentation: A second deep learning model segments the total retina across the entire B-scan. 3) Sublayer segmentation: A third deep learning model segments retinal sublayers outside the box, with a loss function that excludes the severely affected region.

Iterative learning

To reduce the time-consuming and costly effort of labeling, we employed an iterative learning approach inspired by the active learning method27,31. Active learning involves iteratively selecting the most informative data from an unlabeled pool to improve model performance. The process begins with a labeled sample of the dataset. At the end of each loop, the trained model identifies the most informative subset of the unlabeled data, often based on uncertainty32,33. These new data are labeled and added to the training set, and the model is retrained. The process repeats until the model's performance on the validation set stabilizes and no longer improves.

This approach was adopted by Block et al., who used a Monte Carlo Dropout method to identify the most uncertain images in an object detection task by calculating uncertainty based on the overlap between predicted bounding boxes34. However, this method assumes that every image contains at least one object. This assumption is unsuitable for our dataset, where some images may have no detectable objects (i.e. no Stargardt-affected region) but are still informative and valuable for training. To address this limitation, we modified this approach by employing a random sampling method to select data from the unlabeled pool, ensuring the inclusion of images with varying levels of degeneration. At the end of each learning loop, 200 images were randomly sampled from the unlabeled pool, labeled, and then added to the training set. The validation and test sets remained unchanged throughout the iterations.
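A minimal sketch of this iterative loop is shown below; the callback names (annotate_fn, train_fn, evaluate_fn) are hypothetical placeholders for manual annotation, model training, and validation, and the sets hold image identifiers (e.g. file paths).

```python
import random

LOOP_SAMPLE_SIZE = 200  # images labeled and added per loop

def iterative_training(model, labeled_set, unlabeled_pool, val_set,
                       n_loops, annotate_fn, train_fn, evaluate_fn):
    """Sketch of the iterative learning procedure: train, randomly sample
    200 unlabeled images, have them annotated, and repeat."""
    history = []
    for loop in range(n_loops):
        train_fn(model, labeled_set)
        history.append(evaluate_fn(model, val_set))

        # Random sampling keeps images with and without Stargardt-affected
        # regions among the newly labeled data.
        new_images = random.sample(unlabeled_pool,
                                   min(LOOP_SAMPLE_SIZE, len(unlabeled_pool)))
        labeled_set += annotate_fn(new_images)
        unlabeled_pool = [img for img in unlabeled_pool if img not in new_images]
    return model, history
```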

Detection of the severely affected region—box detection

A Faster R-CNN-based model was trained to detect severely affected regions in 2D B-scans35. Specifically, we utilized the Faster R-CNN (R50-FPN) model from the Detectron2 platform developed by Facebook AI Research36, which incorporates a ResNet5037 backbone and a Feature Pyramid Network (FPN)38. The ResNet50 backbone was initialized with weights pre-trained on the ImageNet dataset39. The model was configured with a learning rate of 0.0005 and 2000 iterations for the first loop of the iterative learning. As the training set expanded, the number of iterations was increased throughout the iterative learning loops, reaching 3000 after 10 loops. We stopped the iterative training after 10 loops as the model’s performance reached a plateau (See Supplementary Materials).
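A minimal Detectron2 configuration along these lines might look as follows; the dataset names are hypothetical placeholders, and initializing from the COCO model-zoo checkpoint (rather than ImageNet-only backbone weights, as in the study) is a simplifying assumption of this sketch.

```python
from detectron2 import model_zoo
from detectron2.config import get_cfg
from detectron2.engine import DefaultTrainer

cfg = get_cfg()
cfg.merge_from_file(model_zoo.get_config_file(
    "COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml"))
cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url(
    "COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml")  # pre-trained checkpoint
cfg.DATASETS.TRAIN = ("stgd_boxes_train",)          # hypothetical dataset names
cfg.DATASETS.TEST = ("stgd_boxes_val",)
cfg.MODEL.ROI_HEADS.NUM_CLASSES = 1                 # single class: severely affected region
cfg.SOLVER.BASE_LR = 0.0005
cfg.SOLVER.MAX_ITER = 2000                          # increased up to 3000 in later loops

trainer = DefaultTrainer(cfg)
trainer.resume_or_load(resume=False)
trainer.train()
```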

The training, validation, and test sets used for box detection included only ProgStar data; no healthy images were considered in this phase. The model was built and executed within a Singularity container40 based on Ubuntu 20.04, customized to include Python 3.8, PyTorch 1.11.0, and CUDA 11.3.

Segmentation of the total retina

For the segmentation of the total retina across the entire B-scan, we employed the DeepLabv3 model15 with a ResNet50 backbone pre-trained on the ImageNet dataset39. The images were segmented into two classes: the total retina and the background. Each loop of the iterative training consisted of 15 epochs with a batch size of 15. The learning rate was set to 0.0005 at the beginning of each training loop. The adaptive moment estimation (ADAM) optimizer was used41. To optimize the Dice score on the validation set, a learning rate scheduler with a decay rate of 0.5 and a patience of 15 epochs was employed. The loss function was a combination of the cross entropy loss \(loss_{ce}\) and the Dice loss \(loss_{dc}\), calculated as follows:

$$\begin{aligned} loss_{ce} &= -\frac{1}{N} \sum _{i=1}^{N} y_i \log (\hat{y}_i) \end{aligned}$$
(1)
$$\begin{aligned} loss_{dc} &= 1 - \frac{2 \sum _{i=1}^{N} y_i \hat{y}_i}{\sum _{i=1}^{N} y_i + \sum _{i=1}^{N} \hat{y}_i} \end{aligned}$$
(2)
$$\begin{aligned} loss &= loss_{ce} + loss_{dc} \end{aligned}$$
(3)

Here, \(y_i\) is the ground truth label for the i-th pixel, \(\hat{y}_i\) is the predicted probability for the i-th pixel, and N is the number of pixels in the entire batch of images.
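A straightforward PyTorch realization of Equations (1)-(3), computed jointly over all pixels and classes in a batch, could look as follows; the small smoothing constant eps is an implementation detail added here to avoid division by zero.

```python
import torch
import torch.nn.functional as F

def combined_loss(logits, target, eps=1e-6):
    """Cross entropy plus Dice loss (Eqs. 1-3), computed over the whole batch.
    logits: (B, C, H, W) raw network outputs; target: (B, H, W) integer labels."""
    ce = F.cross_entropy(logits, target)

    probs = torch.softmax(logits, dim=1)
    one_hot = F.one_hot(target, num_classes=logits.shape[1]) \
                .permute(0, 3, 1, 2).float()
    intersection = (probs * one_hot).sum()
    dice = 1.0 - (2.0 * intersection + eps) / (probs.sum() + one_hot.sum() + eps)

    return ce + dice
```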

Dice scores were evaluated on the validation set over five loops of training. At the end of each loop, 200 images were randomly sampled, labeled and added to the training set for the next loop. The training, validation and test sets included an equal share of ProgStar and healthy data.

The model was executed within a PyTorch NGC container (release 22.11) provided by NVIDIA, which included PyTorch version 1.13.0, CUDA 11.8, running on Ubuntu 20.04 with Python 3.8.
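The following sketch summarizes the training configuration described in this subsection, assuming torchvision's DeepLabv3 implementation, which may differ from the one used in the study.

```python
import torch
from torchvision.models import ResNet50_Weights
from torchvision.models.segmentation import deeplabv3_resnet50

# DeepLabv3 with an ImageNet-pretrained ResNet50 backbone and two output
# classes (total retina vs. background).
model = deeplabv3_resnet50(weights=None, num_classes=2,
                           weights_backbone=ResNet50_Weights.IMAGENET1K_V1)

optimizer = torch.optim.Adam(model.parameters(), lr=0.0005)
# Halve the learning rate when the validation Dice score stops improving.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode="max", factor=0.5, patience=15)

# After each validation pass: scheduler.step(val_dice)
```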

Segmentation of the retinal sublayers

For the regions outside the detected boxes, where retinal sublayers are visible, we used a procedure similar to that described for total retina segmentation, with the following modifications. Here, we segmented five retinal sublayers—IR, ONL, PR-IS, PR-OS, and RPE—resulting in six classes, including the background. Given the increased complexity of this task, the number of epochs was increased to 30. Along with the 2D B-scans, the box information for the severely affected region, as detected in the box detection step, was provided as input to the model. The loss calculation was adjusted to exclude the severely affected region by using the box information as follows:

$$\begin{aligned} loss_{pa,ce} &= -\frac{1}{N} \sum _{i=1}^{N} m_i y_i \log (m_i \hat{y}_i) \end{aligned}$$
(4)
$$\begin{aligned} loss_{pa,dc} &= 1 - \frac{2 \sum _{i=1}^{N} m_i y_i \hat{y}_i}{\sum _{i=1}^{N} m_i y_i + \sum _{i=1}^{N} m_i \hat{y}_i} \end{aligned}$$
(5)
$$\begin{aligned} loss_{pa} &= loss_{pa,ce} + loss_{pa,dc} \end{aligned}$$
(6)

Here, \(m_i\) is a mask with binary values, 0 for pixels inside the detected box and 1 for pixels outside the box. \(loss_{pa,ce}\) is the pathology-aware cross entropy loss, \(loss_{pa,dc}\) is the pathology-aware Dice loss, and \(loss_{pa}\) is the pathology-aware total loss. To simplify, the mask height was extended to span the entire height of the image, ensuring that only pixels to the left and right of the box were considered in the loss. Masking the severely affected region in this study is analogous to the approach used by Wang et al.21, where unannotated regions were excluded from loss calculation.
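A possible PyTorch implementation of the pathology-aware loss in Equations (4)-(6) is sketched below; normalizing the cross entropy over the unmasked pixels and adding a small smoothing constant are implementation choices of this sketch, not prescribed by the equations.

```python
import torch
import torch.nn.functional as F

def pathology_aware_loss(logits, target, box_mask, eps=1e-6):
    """Masked cross entropy plus Dice loss (Eqs. 4-6).
    logits: (B, C, H, W) raw outputs; target: (B, H, W) integer labels;
    box_mask: (B, H, W), 0 inside the detected box, 1 outside."""
    mask = box_mask.float()

    # Cross entropy restricted to pixels outside the box.
    per_pixel_ce = F.cross_entropy(logits, target, reduction="none")
    loss_ce = (per_pixel_ce * mask).sum() / mask.sum().clamp(min=1.0)

    # Dice loss with the box region removed from prediction and ground truth.
    probs = torch.softmax(logits, dim=1)
    one_hot = F.one_hot(target, num_classes=logits.shape[1]) \
                .permute(0, 3, 1, 2).float()
    m = mask.unsqueeze(1)
    intersection = (m * probs * one_hot).sum()
    loss_dc = 1.0 - (2.0 * intersection + eps) / ((m * probs).sum() + (m * one_hot).sum() + eps)

    return loss_ce + loss_dc
```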

We employed three models for sublayer segmentation: UNet16, DeepLabV3, and ReLayNet42. The UNet architecture was based on this Git repository43, with 16 filters. The ReLayNet architecture was borrowed from this Git repository44, using the same configuration. The DeepLabV3 architecture, as well as the learning rate and optimizer for all three models, were the same as those used for total retina segmentation. The models were trained over five iterative loops. At the end of each loop, 200 randomly sampled and labeled images were added to the training set for the subsequent loop.

Evaluation metrics

Precision for box detection: To evaluate the performance of the box detection model, the average precision (AP) was calculated as the area under the precision-recall (PR) curve. This curve was generated by plotting precision against recall, using a confidence threshold of 0.5 to determine whether a predicted bounding box was considered a valid detection. The precision and recall were defined as follows:

$$\begin{aligned} \text{Precision} &= \frac{\text{True Positives}}{\text{True Positives} + \text{False Positives}} \end{aligned}$$
(7)
$$\begin{aligned} \text{Recall} &= \frac{\text{True Positives}}{\text{True Positives} + \text{False Negatives}} \end{aligned}$$
(8)

The Intersection over Union (IoU), which measures the overlap between the predicted bounding box and the ground truth bounding box, was used to decide whether a prediction counts as a true positive. Two IoU thresholds of 50% (IoU = 0.5) and 75% (IoU = 0.75) were used to calculate the AP50 and AP75, respectively. The overall performance of the model was summarized using the mean Average Precision (mAP), which was the mean of the AP values across multiple IoU thresholds.
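For illustration, a minimal per-image computation of IoU-based matching and of Equations (7) and (8) is sketched below; full AP computation in detection frameworks additionally sweeps confidence thresholds and accumulates statistics over the whole test set.

```python
def iou(box_a, box_b):
    """Intersection over Union of two boxes given as (x0, y0, x1, y1)."""
    x0, y0 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x1, y1 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, x1 - x0) * max(0, y1 - y0)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter) if inter > 0 else 0.0

def precision_recall(pred_boxes, gt_boxes, iou_thresh=0.5):
    """Eqs. (7)-(8) for one image: a prediction is a true positive if it
    overlaps an unmatched ground truth box with IoU above the threshold.
    Returning 1.0 for empty inputs is a convention of this sketch."""
    matched, tp = set(), 0
    for p in pred_boxes:
        best = max(range(len(gt_boxes)),
                   key=lambda i: iou(p, gt_boxes[i]), default=None)
        if best is not None and best not in matched and iou(p, gt_boxes[best]) >= iou_thresh:
            matched.add(best)
            tp += 1
    fp = len(pred_boxes) - tp
    fn = len(gt_boxes) - tp
    precision = tp / (tp + fp) if pred_boxes else 1.0
    recall = tp / (tp + fn) if gt_boxes else 1.0
    return precision, recall
```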

Dice score: To evaluate the performance of segmentation models, the Dice score, DC, was used. The Dice score measured the overlap between the predicted segmentation and the ground truth segmentation and was defined as twice the area of overlap between the predicted segmentation and the ground truth, divided by the total number of pixels in both areas:

$$\begin{aligned} DC = \frac{2 \sum _{i=1}^{N} y_i \hat{y}_i}{\sum _{i=1}^{N} y_i + \sum _{i=1}^{N} \hat{y}_i} \end{aligned}$$
(9)

The Dice Score ranges from 0 to 1, where a score of 1 indicates perfect overlap between the predicted and actual segmentation, while a score of 0 indicates no overlap.
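A minimal NumPy version of Equation (9) for a single class is shown below; returning 1 when the class is absent from both label maps is a convention assumed here.

```python
import numpy as np

def dice_score(pred, truth, label):
    """Dice coefficient (Eq. 9) for one class in two integer label maps."""
    y_hat = (pred == label).astype(np.float64)
    y = (truth == label).astype(np.float64)
    denom = y_hat.sum() + y.sum()
    return 1.0 if denom == 0 else float(2.0 * (y_hat * y).sum() / denom)
```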

Results

Detection of the severely affected region—box detection

Figure 4 shows several B-scans from different cross-sections with varying levels of damage, along with the ground truth and predicted bounding boxes identifying the severely affected regions. In all test set images, the model successfully predicted the presence or absence of a bounding box. The precision of the predicted boxes was evaluated on the test set, yielding values of 65.31, 90.55 and 76.53 for mAP, AP50 and AP75, respectively. These results indicate that the model performed well in detecting the severely affected regions, being particularly effective at identifying regions with a reasonable overlap (IoU threshold of \(50\%\)). Additionally, the AP75 value suggests that the model maintained strong performance even under a stricter overlap requirement (IoU threshold of \(75\%\)). The mAP score of 65.31 reflects robust overall performance across multiple IoU thresholds, highlighting the model's generalization ability.

Fig. 4

Examples of (a) ground truth and (b) prediction for the detection of the severely affected region (box detection).

Segmentation of the total retina

The performance of the segmentation model for the total retina was assessed based on the Dice score. The average Dice score across the entire test set was \(99.1\%\). Specifically, the model achieved a Dice score of \(99.3\%\) for B-scans of healthy eyes and \(98.9\%\) for B-scans of patients with STGD1. These results indicate high accuracy in the segmentation of the total retina, with slightly higher performance observed for healthy eyes compared to eyes affected by Stargardt disease.

It is important to note that the pathology-aware model, assigned for sublayer segmentation, was employed specifically for regions outside the box. Its performance for total retina segmentation was evaluated by aggregating sublayers for the regions outside the box on ProgStar data. This approach yielded a lower Dice coefficient (\(91\%\)) compared to a dedicated single-layer total retina segmentation (\(99\%\)). Therefore, we opted for a distinct model for total retina segmentation, which directly segmented the retina as a single layer.

Segmentation of the retinal sublayers

For the segmentation of the retinal sublayers, we compared the performance of our proposed method, which uses a pathology-aware loss function (Equation 6), with a standard model that employs a conventional loss function (Equation 3), using three network architectures for both approaches: UNet, DeepLabV3 and ReLayNet. The standard model was trained exclusively on the healthy dataset, as it was not feasible to use the ProgStar dataset: sublayer annotations are not available within the severely affected regions.

The models were evaluated on two test sets: one containing healthy data, and one containing ProgStar data. The performance of the segmentation models was assessed using the average Dice score across all retinal sublayers, as well as the average Dice score for each individual retinal sublayer. A summary of the results is presented in Table 1, comparing the performance of the standard approach and the pathology-aware approach across three models and two test sets.

Table 1 Dice scores evaluating the performance of the standard and pathology-aware methods across different test sets (Healthy and ProgStar). The results are derived using three models (UNet, DeepLabv3, and ReLayNet) and are reported for the individual retinal layers (Inner Retina (IR), Outer Nuclear Layer (ONL), Photoreceptor Inner Segment (PR-IS), Photoreceptor Outer Segment (PR-OS), and Retinal Pigment Epithelium (RPE)) as well as an overall score. Each score includes mean and standard deviation values.

The plots depicted in Figure 5 visualize some of the Dice scores presented in Table 1, comparing the performance of the standard and pathology-aware methods across healthy and ProgStar datasets for the ReLayNet architecture. As the plots highlight, both methods showed similar performance on the healthy dataset, with an overall Dice coefficient of 0.95. For the ProgStar dataset, the pathology-aware model outperformed the standard model significantly (p < 0.001, paired t-test), achieving an overall Dice coefficient of 0.93 compared to 0.79 for the standard approach with the ReLayNet architecture.

The detailed breakdown of Dice coefficients for each retinal sublayer—IR, ONL, PR-IS, PR-OS, and RPE— shows that the pathology-aware method consistently outperformed the standard method for ProgStar data across all three network architectures, with improvements ranging from \(13\%\) to \(33\%\) for different sublayers. The largest improvement was observed in the PR-IS sublayer. Among the three architectures, ReLayNet showed the best performance, followed by UNet.

Fig. 5

Dice scores, DC, for predictions using (a) the standard model and (b) the pathology-aware model on the Healthy and Stargardt datasets, using the ReLayNet model, evaluated for individual retinal sublayers and the overall Dice score across all sublayers. The bars represent the mean of Dice scores and the whiskers represent their standard deviations. IR: Inner Retina, ONL: Outer Nuclear Layer, PR-IS: Photoreceptor Inner Segment, PR-OS: Photoreceptor Outer Segment, RPE: Retinal Pigment Epithelium.

Figure 6 presents example B-scans with segmented sublayers, comparing the ground truth with the predictions from both the standard method and the pathology-aware method.

Fig. 6

(a) B-scan image; (b) ground truth segmentation; (c) prediction using the standard method; (d) prediction using the pathology-aware method. (e-h) Corresponding zoomed-in views of (a-d), respectively. The retinal sublayers IR, ONL, PR-IS, PR-OS, and RPE are colored from top to bottom with yellow, dark blue, red, green, and cyan. The cyan box indicates the severely affected region where the segmentation is not performed. The white box shows the zoomed-in area. The arrows point to the areas where the standard method does not perform well.

Discussion

In this study, we presented a novel deep learning approach for the automated segmentation of retinal sublayers in SD-OCT images from patients with STGD1. The key contribution of our method lies in the introduction of a pathology-aware loss function, which enables the model to focus on relatively unaffected regions of the retina while adapting to the variability in retinal degeneration across different patients.

The use of box detection to delineate severely affected regions avoids the challenge of attempting to segment absent layers. In these areas, segmenting the retina as a single layer provides a more reliable and feasible approach, ensuring that the segmentation task remains manageable and clinically relevant. Attempting to segment individual layers in such regions would not only introduce inaccuracies but also complicate the interpretation of the results. By focusing on relatively intact regions, the method captures detailed information where possible, while maintaining robustness in areas with advanced degeneration.

Retinal degeneration can vary widely from person to person, with some regions of the retina being more affected than others. The method can recognize and handle these differences, focusing on the relatively unaffected regions of the retina in each individual case, even though the extent and nature of the degeneration may differ across patients and visits. Our results demonstrate that the proposed method achieved high performance in segmenting retinal sublayers in less affected regions and the total retina in severely affected regions. This dual approach enhances the model’s ability to generalize across a diverse dataset with varying degrees of retinal degeneration.

The iterative learning strategy employed in our study was particularly effective in reducing the annotation burden, which is a critical factor for the scalability of such techniques in larger clinical studies. By iteratively sampling new images, the models were able to improve their performance with a relatively small labeled dataset: \(\sim 1700\) images, equivalent to \(\sim 1.5\%\) of the total ProgStar dataset, for box detection, and \(\sim 1000\) images, equivalent to \(\sim 1\%\) of the total ProgStar dataset, for the segmentation tasks. This approach is highly relevant for clinical applications, where obtaining large, annotated datasets is often challenging. Monitoring the model performance on the validation set over consecutive loops ensured that we made the most of the data, losing little information despite not annotating the entire dataset.

Compared to previous approaches, our method offers several key improvements. Earlier studies either focused on segmenting the total retina as a single layer across the entire image11,25, or complicated the segmentation task by attempting to segment an excessive number of sublayers across the entire B-scan, including regions with severe disruption26. The former lacked the granularity needed to better understand the disease, while the latter made the segmentation task overly complex, with the risk of losing reproducibility in regions with significant degeneration. Our method achieves an optimal balance, extracting as much detailed information as possible by introducing a pathology-aware loss function that differentiates between severely affected and relatively unaffected regions. This makes it possible to investigate layers that are still visible, even though they may have already undergone some degeneration. Additionally, the method remains straightforward enough to be integrated into clinical practice.

The ability to accurately segment retinal sublayers in OCT images has major implications for the clinical management of Stargardt disease. Visual acuity, while clinically important, is not a good outcome measure, because the disease is characterized by a progressive enlargement of visual field loss while only the foveal center drives visual acuity. In addition, visual acuity is a subjective and inherently variable measure, which limits its reliability. Structural measures such as OCT imaging, equipped with precise quantification of retinal layer thicknesses and the extent of atrophy can provide valuable biomarkers for tracking disease progression and assessing the efficacy of potential therapies. The automated approach presented here could facilitate more consistent measurements, and enable large-scale clinical trials.

Moreover, our method’s capacity to handle data with varying degrees of retinal damage makes it particularly suitable for longitudinal studies where the extent of degeneration changes over time. By focusing on relatively unaffected regions for detailed analysis, while still capturing overall retinal thickness in affected areas, our approach can offer a more comprehensive assessment of disease progression.

It is important to note that OCT images do not provide an immediate reflection of the patient's vision, and there is ongoing debate about whether OCT imaging can serve as a reliable surrogate endpoint. Therefore, our study could be further expanded to explore the structure/function relationship in Stargardt disease. By integrating functional measures such as microperimetry with OCT-derived structural data, future studies could investigate how retinal degeneration correlates with functional vision loss. Furthermore, integrating other imaging modalities, such as fundus autofluorescence, could provide complementary information to enhance box detection and segmentation accuracy.

In addition to thickness analysis, the size of the detected boxes could be utilized in future studies to monitor the growth of the damaged area over time, offering a new dimension to the longitudinal analysis.

Further refinement of the iterative learning strategy, moving from a random selection to selecting more informative data at successive loops, possibly by incorporating uncertainty estimation methods, could improve the efficiency and effectiveness of the annotation process.

Our current approach effectively detects a single box enclosing the affected region. However, in some cases, there are still detectable boundaries inside the box that are not considered due to the single-box strategy, as we combined multiple small regions into one large box. These boundaries might hold critical information, and refining our approach to implement a multi-box detection mechanism could enhance the analysis by recovering details from such regions currently covered by the larger box.

Another limitation of the current study is the use of data from a single type of OCT device, specifically the Heidelberg Engineering system. To ensure the reliability of our approach across a broader range of clinical settings, further evaluation using datasets from other OCT scanners and expanding the test set with external datasets would enhance the applicability of the results across diverse clinical environments. Additionally, while the pathology-aware approach requires relatively less complex annotations, due to the exclusion of severely affected regions, the reproducibility of these annotations could be assessed through blind grading by multiple graders.

Our method is model-independent and can utilize any deep learning model for box detection and segmentation. Although the results show high prediction accuracy, other deep learning models, such as transformers45 or dual-head convolutional networks46 that combine boundary regression with layer segmentation, could be investigated for better segmentation performance.

Although our method is tailored for STGD1, its versatility allows it to be adapted for other diseases, such as geographic atrophy in age-related macular degeneration, or imaging modalities where segmentation of organs with damaged tissue structures is required.

Conclusion

Our study introduces a novel, pathology-aware deep learning approach for the automated segmentation of retinal layers in SD-OCT images of patients with STGD1. By focusing on relatively unaffected regions and adapting to the variability in retinal degeneration, our method provides a balanced and efficient means of extracting valuable structural information while maintaining simplicity for clinical implementation. The integration of an iterative learning strategy significantly reduces the annotation burden, making the approach scalable for larger studies. This work not only advances the field of automated OCT segmentation but also lays the foundation for future research aimed at enhancing the understanding and clinical management of Stargardt disease and related retinal disorders.