Diagnostic performance of artificial intelligence for detecting peritoneal and small bowel dissemination in epithelial ovarian cancer using preoperative contrast-enhanced CT imaging

Kim, Ria; Seki, Toshiyuki; Noda, Katsuhiko; Yoshida, Kaname; Yokosu, Kota; Hamada, Rintaro; Habuchi, Erika; Takahashi, Masahiro; Takano, Hirokuni; Okamoto, Aikou

doi:10.1038/s41598-026-41728-4

Download PDF

Article
Open access
Published: 23 March 2026

Diagnostic performance of artificial intelligence for detecting peritoneal and small bowel dissemination in epithelial ovarian cancer using preoperative contrast-enhanced CT imaging

Ria Kim¹,
Toshiyuki Seki¹,
Katsuhiko Noda²,
Kaname Yoshida²,
Kota Yokosu¹,
Rintaro Hamada¹,
Erika Habuchi¹,
Masahiro Takahashi³,
Hirokuni Takano¹ &
…
Aikou Okamoto¹

Scientific Reports volume 16, Article number: 8739 (2026) Cite this article

1373 Accesses
27 Altmetric
Metrics details

Subjects

Abstract

In epithelial ovarian cancer (EOC), accurate assessment of the lesions is crucial for determining the appropriate treatment strategy. We evaluated artificial intelligence (AI) for detecting peritoneal and small bowel dissemination in EOC using preoperative contrast-enhanced computed tomography (CT). We retrospectively analyzed 227 patients (254 CT scans) with histologically confirmed ovarian, fallopian tube, or peritoneal carcinoma who underwent surgery between October 2019 and December 2024 at two affiliated institutions. The mean patient age was 56.6 years. The FIGO stage was early in 118 cases and advanced in 136 cases. Histology included serous (n = 123) and non-serous (n = 131). The mean CA125 level and peritoneal cancer index (PCI) score were 2,243.9 U/mL and 5.8, respectively. Two AI models were developed: one for peritoneal dissemination (P-Model) and one for small bowel dissemination (SB-Model). Diagnostic performance was evaluated by sensitivity, specificity, and accuracy. For the P-Model, clinical factors were compared between high- and low-accuracy groups. The P-Model achieved 68.75% sensitivity, 85.84% specificity, and 77.30% accuracy; the SB-Model, 86.40%, 77.47%, and 81.93%, respectively. Low AI accuracy was linked to ascites volume, CA125 level, and PCI score. AI-based analysis of preoperative CT demonstrated favorable diagnostic performance for detecting dissemination in patients with EOC.

Introduction

Epithelial ovarian cancer (EOC) is the seventh most common cancer in females worldwide and the second most common gynecological malignancy¹. Approximately 60%–70% of patients present at an advanced stage with massive peritoneal and visceral dissemination in the abdominal cavity (International Federation of Gynecology and Obstetrics [FIGO] stage III or IV)^2,3. The optimal standard of care for patients with advanced ovarian cancer is either primary debulking surgery (PDS) or preoperative neoadjuvant platinum-based chemotherapy followed by cytoreductive surgery, which is called interval debulking surgery (IDS)⁴. Neoadjuvant chemotherapy (NAC) and IDS are considered in patients for whom complete resection at PDS is not feasible owing to tumor distribution or in those who cannot tolerate the extent of surgery required for complete cytoreduction. Strong evidence indicates that survival outcomes are significantly improved in patients who achieve no residual disease after debulking, regardless of whether PDS or IDS is performed⁵. However, advanced EOC often spreads extensively to all the peritoneal surfaces, including parietal and visceral surfaces, resulting in a high tumor burden, rendering complete resection impossible. Therefore, accurate preoperative prediction of tumor location and burden is critical for selecting patients in whom complete cytoreduction is achievable and for optimal surgical planning. Computed tomography (CT) is widely used to assess the location and extent of peritoneal dissemination. However, its diagnostic accuracy remains suboptimal, often necessitating direct visual confirmation of tumor location via diagnostic laparotomy or laparoscopy⁶.

In particular, dissemination on the surface of the small bowel and its mesentery is a common finding that often prevents complete tumor resection⁷. However, preoperative imaging for detecting such lesions remains challenging, with diagnostic accuracy rates as low as 20%–30%^5,8.

Thus, we conducted this study to optimize the diagnostic accuracy of detecting peritoneal and small bowel dissemination by integrating contrast-enhanced CT imaging studies with machine learning methods.

Results

P-Model for peritoneal dissemination diagnosis

Supplementary Table S1 summarizes the image-unit-based evaluation of the P-Model for diagnosing peritoneal dissemination, comparing single-model predictions and ensemble predictions on two types of images: entire CT images and 75% cropped images. The results showed that ensemble prediction using entire CT images achieved the highest diagnostic performance, with an accuracy of 69.49% (sensitivity = 62.26%, specificity = 76.73%).

Supplementary Fig S1 exhibits variability of accuracies of image-unit-based evaluation using P-Models. The horizontal line indicates the average accuracy of all models. The vertical line is a line plotted from the worst accuracy to the best accuracy. This chart demonstrates that the range of variability in accuracies of ensemble predictions is much smaller than that of single-model predictions.

Figure 1 and Supplementary Table S2 present the results of the case-unit-based evaluation of the P-Model for diagnosing peritoneal dissemination, comparing single-model and ensemble predictions across two types of images: the entire CT images and 75% cropped images. The rows of this table indicate the number of consecutive slice images used to calculate the positive prediction rate for each case.

The best performance was achieved with ensemble prediction on 21 consecutive slices of 75% cropped images, yielding an accuracy of 77.30% (95% confidence interval (CI) = 71.76 ~ 82.02%), sensitivity of 68.75% (95%CI = 59.86 ~ 76.43%), and specificity of 85.84% (95%CI = 79.02 ~ 90.7%).

Supplementary Fig S2 exhibits variability of accuracies of case-unit-based evaluation using P-Models. The horizontal line indicates the average accuracy of all models. The vertical line is a line plotted from the worst accuracy to the best accuracy. This chart demonstrates that the range of variability in accuracies of ensemble predictions is much smaller than that of single-model predictions.

Detailed sensitivities and specificities for all evaluations are provided in Supplementary Table S3.

SB-Model for small bowel dissemination diagnosis

Supplementary Table S4 summarizes the image-unit-based evaluation of the SB-Model for diagnosing small bowel dissemination, comparing single-model and ensemble predictions on entire CT images and 75% cropped images. Ensemble prediction using 75% cropped images achieved the best accuracy of 72.79% (sensitivity = 65.24%, specificity = 80.34%).

Supplementary Fig S3 exhibits variability of accuracies of image-unit-based evaluation using SB-Models. The horizontal line indicates the average accuracy of all models. The vertical line is a line plotted from the worst accuracy to the best accuracy. This chart demonstrates that the range of variability in accuracies of ensemble predictions is much smaller than that of single-model predictions.

Figure 2 and Supplementary Table S5 present the case-unit-based evaluation results. The rows of the table indicate the number of consecutive slice images used to calculate the positive prediction rate for each case. The best performance was achieved with ensemble prediction on nine consecutive slices of 75% cropped images, yielding an accuracy of 81.93% (95%CI = 76.73 ~ 86.17%), sensitivity of 86.40% (95%CI = 74.69 ~ 93.18%), and specificity of 77.47% (95%CI = 73.07 ~ 81.33%).

Supplementary Fig S4 exhibits variability of accuracies of case-unit-based evaluation using SB-Models. The horizontal line indicates the average accuracy of all models. The vertical line is a line plotted from the worst accuracy to the best accuracy. This chart demonstrates that the range of variability in accuracy of ensemble predictions is much smaller than that of single-model predictions.

Detailed sensitivities and specificities of all evaluations are shown in Supplementary Table S6.

Evaluation of cases with low diagnostic accuracy by the AI model

Table 1 shows whether cases with an AI accuracy of ≤ 25% for the P-Model were correctly interpreted preoperatively by radiologists. In these cases, the sensitivity and specificity of radiologists’ interpretations were 63.0% and 57.9%, respectively.

Table 1 Radiological interpretation of cases with ≤ 25% AI accuracy in detecting the presence or absence of peritoneal dissemination.

Full size table

Table 2 compares the clinical characteristics of patients with AI diagnostic accuracy ≤ 25% versus > 25% for the P-Model, stratified by the presence or absence of peritoneal dissemination based on intraoperative macroscopic findings.

Table 2 Correlation between AI accuracy and clinical parameters in the P-Model.

Full size table

Among patients without peritoneal dissemination, those with AI accuracy ≤ 25% had significantly larger volume of ascites on preoperative CT compared to those with accuracy > 25%. Ascites extending into the upper abdomen were more frequently observed in the low-accuracy group (p < 0.001). Additionally, preoperative CA125 levels were significantly higher in this group (p = 0.005). In contrast, among patients with peritoneal dissemination, those with AI accuracy ≤ 25% exhibited significantly lower ascites volumes than those with higher accuracy. Most cases in the low-accuracy group had ascites limited to the pelvic cavity on preoperative CT (p = 0.017), and intraoperative ascites volume was ≤ 300 mL in the majority of cases (p = 0.002). Furthermore, preoperative CA125 levels and PCI scores were significantly lower in the low-accuracy group compared to the high-accuracy group (p < 0.001 and p = 0.005). Regardless of the presence or absence of peritoneal dissemination, no differences in accuracy were observed by BMI or histological subtype. As shown in Supplementary Table S7, similar results were obtained when the accuracy cutoff was set at 50%. Among patients without peritoneal dissemination, the low-accuracy group had a significantly larger volume of ascites on preoperative CT than the high-accuracy group (P = 0.007). In contrast, among patients with peritoneal dissemination, the low-accuracy group had significantly smaller volumes of ascites on both preoperative CT and intraoperative assessment compared with the high-accuracy group (P = 0.002 and P < 0.001, respectively). Furthermore, preoperative CA125 levels and PCI scores were significantly lower in the low-accuracy group than in the high-accuracy group (P = 0.002 and P = 0.004, respectively).

Discussion

We found that AI-based contrast-enhanced CT imaging can predict both peritoneal and small bowel dissemination with optimal accuracy. Notably, this is the first study to apply an AI-driven machine learning approach to the preoperative diagnosis of small bowel dissemination in EOC, demonstrating an accuracy exceeding 80%. This performance surpasses the diagnostic accuracy of conventional CT image interpretation by radiologists reported in previous studies.

Debulking surgery for advanced EOC usually requires resection of multiple organs involved in tumor dissemination, including extensive peritoneal surfaces, diaphragm, small and large bowel, spleen, pancreas, and other organs, depending on the tumor location. Reliable prediction would allow clinicians to make timely treatment decisions, such as choosing between primary debulking surgery and neoadjuvant chemotherapy. Moreover, identifying organs likely to require resection before surgery would facilitate multidisciplinary planning, enabling early consultation with appropriate surgical specialists (e.g., hepatobiliary, pancreatic, colorectal, thoracic, or vascular surgeons) and preparation of the necessary equipment for complex procedures.

According to the ESMO–ESGO consensus conference⁹, patients should not be considered suitable candidates for primary debulking surgery if the disease demonstrates extensive dissemination patterns that preclude complete cytoreduction. Specifically, contraindications include diffuse deep infiltration of the root of the small bowel mesentery or widespread carcinomatosis of the small bowel that would necessitate extensive resection, potentially resulting in short bowel syndrome (defined as a remaining bowel length of < 1.5 m). In addition, several anatomic sites, such as the stomach, duodenum, pancreatic head, and pancreatic body, have traditionally been considered difficult areas for complete tumor resection¹⁰. It has been reported that, among patients with residual disease after debulking surgery for advanced ovarian cancer, the most frequent sites of residual tumor are the small bowel mesentery and serosa, with approximately 80% of patients affected¹¹. However, accurate preoperative prediction of peritoneal dissemination, especially small bowel involvement, remains challenging with current imaging modalities^{5,6,8,12,13,14,15}.

Therefore, clinicians still rely on diagnostic laparoscopy or laparotomy to clarify tumor location and determine the feasibility of complete resection through primary debulking surgery. However, these procedures are invasive, require general anesthesia, and may delay the initiation of treatment, especially for patients who would benefit from neoadjuvant chemotherapy. Such delays can negatively affect both the physical and emotional well-being of patients and may even worsen treatment outcomes¹⁶. If imaging studies could accurately predict disseminated tumor burden, it would streamline the treatment flow and improve clinical decision-making.

Some argue that residual tumor measuring < 1 cm is clinically acceptable, as it is preferable to residual disease > 1 cm^17,18. From this perspective, preoperative prediction of small bowel dissemination may be considered less meaningful, because individual nodules of small bowel involvement are typically < 1 cm in size. However, accurate prediction of small bowel dissemination remains critically important. To date, no phase III trials have established optimal treatment strategies for patients in whom residual tumor < 1 cm is anticipated. Notably, a recent phase III trial demonstrated that, in the primary treatment of advanced EOC, progression-free survival was markedly improved, with a median of 27.9 months, when complete cytoreduction was achieved by PDS¹⁹.

In this context, treatment selection between PDS and NAC followed by IDS in clinical practice is influenced by multiple practical and patient-related factors, including performance status, histologic subtype, institutional resources, and surgical scheduling, particularly when minimal residual disease (< 1 cm) is anticipated. Accordingly, accurate preoperative prediction of the sites and extent of peritoneal and small bowel dissemination is of critical importance.

Regarding imaging modalities, several reports have demonstrated relatively favorable outcomes with MRI, ultrasound (US), and PET-CT. Michielsen et al.¹² reported that whole-abdominal MRI achieved an average accuracy of 91% for detecting peritoneal dissemination and 81.0% for detecting small bowel dissemination. Similarly, Fischerova et al.⁵ reported accuracies of 88% and 70% for detecting peritoneal dissemination and small bowel dissemination, respectively. These outcomes appear superior to those of our study (see Table 3). However, whole-body MRI currently requires prolonged acquisition times of approximately 40–50 minutes²⁰. In the report by Michielsen et al., patients were also required to drink 1 L of pineapple juice before whole-body MRI to aid in the diagnosis of ovarian cancer dissemination. Such requirements are challenging for critically ill patients with massive carcinomatosis or bulky tumors, who may struggle to maintain the scanning position for extended periods or tolerate fluid intake. Moreover, the availability of whole-body MRI scanners remains limited, and the associated costs are high. Given these limitations, the routine clinical use of whole-body MRI in this setting remains impractical.

Table 3 Comparison between our results and previous reports of imaging diagnosis for dissemination in patients with epithelial ovarian cancer.

Full size table

With regard to US, achieving high accuracy in detecting disseminated regions requires the expertise of a well-trained technician or physician. Moreover, a thorough US examination can be time-consuming, similar to whole-body MRI, which can be challenging and burdensome for patients.

By comparison, CT scans offer advantages in terms of accessibility and patient comfort over both whole-body MRI and US. However, CT also has some drawbacks. As shown in Table 3, its diagnostic ability, especially for small bowel dissemination, remains suboptimal, even when interpreted by well-trained gynecologic radiologists^{6,8,12,13,14,15,21,22,23,24}.

In our study, AI-based machine learning analysis of contrast-enhanced CT scans demonstrated superior diagnostic performance compared to previous reports, particularly in terms of sensitivity for detecting small bowel dissemination. The SB-Model demonstrated superior performance, with a notably high sensitivity of 86.40%. Given the critical importance of preoperative detection of small bowel dissemination—where oversight may lead to inappropriate surgical planning, this result is of substantial clinical relevance. The high sensitivity of AI in this context represents a valuable advancement in preoperative evaluation.

However, there were cases in which the AI method could not achieve high diagnostic accuracy. In cases where the P-Model exhibited ≤ 25% accuracy, preoperative CT interpretations by radiologists demonstrated a sensitivity of 63.0%, a specificity of 57.9%, and an overall accuracy of 60.5%. These findings suggest that the poor performance of AI was not solely attributable to tumor size, location, or other factors typically challenging for human readers, but may also reflect limitations inherent to the model architecture, training dataset, or data augmentation strategy.

To further explore this hypothesis, we compared AI accuracy with clinical findings. Among patients without peritoneal dissemination, those with low AI accuracy more frequently showed ascites extending into the upper abdomen on preoperative CT and had significantly higher CA125 levels. In contrast, among patients with peritoneal dissemination, low-accuracy cases more often exhibited ascites limited to the pelvic cavity, with intraoperative ascites volumes ≤ 300 mL in most cases and significantly lower PCI scores, suggesting a smaller tumor burden.

These findings imply that the AI model may be disproportionately influenced by the features emphasized in its training data. Specifically, it may have learned to associate the presence of widespread ascites with peritoneal dissemination, potentially leading to misclassification in atypical or early-stage cases. Importantly, this limitation could likely be mitigated by expanding the training dataset to include a more diverse range of clinical presentations, thereby improving the model’s robustness and generalizability. However, we believe that AI-assisted diagnosis of CT scans can support radiologic interpretation and aid physicians in clinical decision-making.

This study has several limitations. First, our AI system in this study was designed to predict only the presence or absence of dissemination and did not assess the exact tumor location, tumor burden, or the presence of residual disease of debulking surgery. An AI system capable of providing this level of detail would be more useful for surgeons in assessing the feasibility of complete resection. To further improve our system, future efforts will focus on expanding the dataset, incorporating more detailed imaging features, and developing a more generalizable model to enhance diagnostic accuracy and clinical applicability. Second, this study did not include external validation. The performance of AI models may decrease when evaluated on external datasets^25,26. Therefore, external validation should be conducted before clinical implementation and will be addressed in future work. Finally, the results may be influenced by bias arising from individual cases, which cannot be completely eliminated. However, as indicated by the 95% confidence intervals of our results, increasing the number of cases used for training and evaluation—although a potential approach to minimizing such bias—may also lead to a substantial decline in accuracy.

In conclusion, we demonstrated that AI can detect peritoneal dissemination—particularly small bowel dissemination—with high accuracy on preoperative contrast-enhanced CT in patients with ovarian cancer. Further refinement and validation of the model are warranted to facilitate its integration into clinical practice.

Methods

Patients and clinical information

This was a retrospective, multi-institutional observational study approved by the Institutional Review Board of The Jikei University School of Medicine (approval no. 35–149[11777]). The requirement for patient consent was waived in accordance with the Ethical Guidelines for Research Involving Human Subjects in Japan²⁷. We included patients diagnosed with ovarian, fallopian tube, or peritoneal cancer who underwent surgery between October 2019 and December 2024 at two affiliated hospitals: The Jikei University School of Medicine and the Jikei University Kashiwa Hospital. A total of 227 patients were included, yielding 254 contrast-enhanced CT images for analysis. Clinical data were retrieved from medical records, including age, body mass index (BMI) at surgery, date of surgery, and preoperative CA125 levels. Intraoperative findings, such as the amount of ascites and the location of peritoneal dissemination, were assessed using the Peritoneal Cancer Index (PCI) score^28,29. Pathological data, including histological subtype and FIGO stage, were also collected.

Patient characteristics

A total of 227 patients with epithelial ovarian cancer were included in this study. For patients who underwent IDS, CT scans were obtained twice, at the time of staging surgery (laparotomy or laparoscopy) and at IDS, and both scans were used for AI training. Therefore, the number of CT cases available for AI training was 254.

Table 4 presents the characteristics of all cases. The mean age at surgery was 56.6 (range, 19.0–87.0) years, and the mean BMI was 22.0 (range, 14.6–35.2) kg/m². FIGO stage at diagnosis was stage I in 86 cases (33.9%), stage II in 32 cases (12.6%), stage III in 105 cases (41.7%), and stage IV in 30 cases (11.8%). Histological subtypes included high-grade serous carcinoma in 123 patients (48.4%), endometrioid carcinoma in 47 (18.5%), clear cell carcinoma in 63 (24.8%), mucinous carcinoma in 9 (3.5%), adenocarcinoma in 4 (1.6%) and other histologies in 8 (3.1%). The mean preoperative serum CA125 level was 2,243.9 (range, 2–142,522) U/mL, and the mean PCI score was 5.8 (range, 0–36). Ascites volume was ≤ 300 mL in 181 patients (71.3%) and > 300 mL in 73 patients (28.7%).

Table 4 Patient characteristics used for AI training (n = 254).

Full size table

CT imaging

We collected CT imaging data acquired before debulking or diagnostic surgery. All scans were contrast-enhanced with iodine and obtained in the portal venous phase, with a slice thickness of 5 mm. Imaging data were extracted as DICOM files from the two participating institutions and subsequently converted into JPEG format. Axial plane images covering the area from the liver and right diaphragm to the pubic bone were selected for machine learning analysis.

Number of CT images

Supplementary Table S8 shows the number of cases and the total number of CT images. Cases were classified into three groups: those with both small bowel and peritoneal dissemination (sb + & p +), those with peritoneal dissemination only (sb − & p +), and those with neither small bowel nor peritoneal dissemination (sb − & p −). The presence or absence of peritoneal and small bowel dissemination was determined based on intraoperative macroscopic findings.

Types of dissemination diagnostic models

In this study, we developed two deep neural network (DNN) models: one for diagnosing peritoneal dissemination (P-Model) and one for diagnosing small bowel dissemination (SB-Model). As shown in Fig. 3, the P-Model was trained on all CT images to predict whether an image demonstrated dissemination (peritoneal and/or small bowel; p +) or no dissemination (p −). The SB-Model was trained on all CT images to predict the presence (sb +) or absence (sb −) of small bowel dissemination. Images containing peritoneal dissemination without small bowel involvement were classified as sb − .

Shape of DNN model

In this study, we adopted the MobileNetV2 architecture³⁰, a relatively compact DNN consisting of 88 layers, with a fixed input image size of 224 × 244 pixels and 3,538,984 learning parameters. We selected MobileNetV2 because its compact architecture helps reduce overfitting under limited data conditions. In addition, this architecture has already been widely applied in medical image analysis, demonstrating versatility and robustness across clinical imaging tasks³¹.

Training and cross-validation of the DNN model

We adopted an eight-fold cross-validation approach by dividing the cases into eight groups, using seven groups for training (training dataset) and the remaining group for evaluation (evaluation dataset), maintaining a 7:1 ratio. As a result, eight DNN models were generated, each corresponding to one cross-validation set. To increase dataset diversity, images in each training dataset were augmented 960-fold. During training, One out of 480 augmented images were randomly selected for each epoch. A total of 20 epochs were performed for each cross-validation set. The augmentation was performed using general augmentation techniques including rotation, flipping, scaling, etc. and it was applied only to the training data but not for the evaluation.

Because DNN models exhibit variability in performance when trained on large volumes of augmented data derived from a small number of patients, 24 sets of cross-validation sets were created to verify fluctuations in model accuracy. Each set consisted of 8 models, resulting in a total of 192 models (8 datasets × 24) for both the P-Model and SB-Model.

Supplementary Table S9 shows the number of cases and images of each group used for cross-validation during P-Model training.

Supplementary Table S10 shows the number of cases and images in the training and evaluation sets for each cross-validation of the P-Model.

Supplementary Table S11 shows the number of cases and images in each group used for cross-validation during SB-Model training.

Supplementary Table S12 shows the number of cases and images in the training and evaluation sets for each cross-validation of the SB-Model.

Evaluation of DNN models

The performance of the DNN models was evaluated using two methods: image-unit-based evaluation and case-unit-based evaluation, applied to two types of images: the entire CT image and images cropped to the central 75% of the original field (equivalent to 133% zoom). Both the P-Model and SB-Model were assessed using these methods.

For image-unit-based evaluation, predictions were performed in two ways: single-model predictions and ensemble predictions using 23 models, all derived from 24 cross-validation sets.

For case-unit-based evaluation, the highest positive prediction rate was calculated across consecutive slice images for each case. The number of consecutive slices examined ranged from 5 to 30. Predictions were again performed using both single and ensemble approaches on the two types of images (entire and 75% cropped).

We adopted the Wilson method to calculate the 95% confidence interval of accuracy, sensitivity and specificity³².

Evaluation of CT images by radiologists

Furthermore, we evaluated whether radiologists could make a diagnosis in cases where the P-model showed less than 25% accuracy. The diagnostic performance of radiologists was assessed by retrospectively reviewing preoperative CT interpretation reports that had been generated by board-certified radiologists at our institutions. These reports were independently reviewed by gynecologic oncologists. Radiologists were consulted regarding the feasibility of diagnosing small bowel dissemination. They indicated that such a diagnosis is nearly impossible because individual nodules on the surface of the small bowel are very small. Therefore, radiologist-based evaluation of small bowel dissemination was not conducted. In addition, we examined the features of the images associated with low AI performance in the P-Model, defining low AI performance as below 25% and 50%. We analyzed the associations between diagnostic accuracy and preoperative CA125 levels, BMI, intraoperative PCI score, and intraoperative ascites volume.

Statistical analysis

Continuous variables were expressed as means with ranges and analyzed using either the Mann–Whitney U test or the Student’s t-test, as appropriate. Categorical variables were reported as absolute numbers and analyzed using Fisher’s exact test. A p-value of less than 0.05 was considered statistically significant.

Ethics

This study was approved by the Institutional Review Board of The Jikei University School of Medicine (approval no. 35–149[11777]). In patient application forms, the need for informed consent was waived by these above institutional review boards. It was clearly stated that patients were allowed to opt out of the study at any time. Information on how they could opt out was provided on our website, and arrangements were made for patients to opt out. All methods were performed in accordance with the relevant guidelines and regulations.

Data availability

All data generated and analyzed during this study are included in this published article and its supplementary files. Additional datasets are available from the corresponding author upon reasonable request. Requests for data should be addressed to Toshiyuki Seki, M.D., Ph.D. (e-mail: t_seki@jikei.ac.jp).

References

Ferlay, J. et al. (ed IARC) (Lyon, France, 2013).
Heintz, A. P. et al. Carcinoma of the ovary. Int. J. Gynaecol. Obstet. 83(Suppl 1), 135–166. https://doi.org/10.1016/s0020-7292(03)90118-4 (2003).
Article PubMed Google Scholar
Pecorelli, S., Favalli, G., Zigliani, L. & Odicino, F. Cancer in women. Int. J. Gynaecol. Obstet. 82, 369–379. https://doi.org/10.1016/s0020-7292(03)00225-x (2003).
Article CAS PubMed Google Scholar
Chang, S. J., Bristow, R. E., Chi, D. S. & Cliby, W. A. Role of aggressive surgical cytoreduction in advanced ovarian cancer. J. Gynecol. Oncol. 26, 336–342. https://doi.org/10.3802/jgo.2015.26.4.336 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fischerova, D. et al. Preoperative staging of ovarian cancer: Comparison between ultrasound, CT and whole-body diffusion-weighted MRI (ISAAC study). Ultrasound Obstet. Gynecol. 59, 248–262. https://doi.org/10.1002/uog.23654 (2022).
Article CAS PubMed Google Scholar
Takeda, T. et al. Evaluation of preoperative prediction of intestinal invasion in patients with ovarian cancer. Int. J. Gynaecol. Obstet. 153, 398–404. https://doi.org/10.1002/ijgo.13492 (2021).
Article PubMed Google Scholar
González-Martín, A. et al. Newly diagnosed and relapsed epithelial ovarian cancer: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up. Ann. Oncol. 34, 833–848. https://doi.org/10.1016/j.annonc.2023.07.011 (2023).
Article CAS PubMed Google Scholar
Hu, T. W. Y., Nie, D., Gou, J. H. & Li, Z. Y. Predictive significance of preoperative CT findings for suboptimal cytoreduction in advanced ovarian cancer: A meta-analysis. Cancer Manag. Res. 10, 2019–2030. https://doi.org/10.2147/cmar.s166658 (2018).
Article PubMed PubMed Central Google Scholar
Colombo, N. et al. ESMO-ESGO consensus conference recommendations on ovarian cancer: Pathology and molecular biology, early and advanced stages, borderline tumours and recurrent disease. Int. J. Gynecol. Cancer 29, 728–760. https://doi.org/10.1136/ijgc-2019-000308 (2019).
Article CAS PubMed Google Scholar
Querleu, D. et al. European Society of Gynaecological Oncology (ESGO) Guidelines for Ovarian Cancer Surgery. Int. J. Gynecol. Cancer 27, 1534–1542. https://doi.org/10.1097/igc.0000000000001041 (2017).
Article PubMed Google Scholar
Heitz, F. et al. Pattern of and reason for postoperative residual disease in patients with advanced ovarian cancer following upfront radical debulking surgery. Gynecol. Oncol. 141, 264–270. https://doi.org/10.1016/j.ygyno.2016.03.015 (2016).
Article PubMed Google Scholar
Michielsen, K. et al. Whole-body MRI with diffusion-weighted sequence for staging of patients with suspected ovarian cancer: A clinical feasibility study in comparison to CT and FDG-PET/CT. Eur. Radiol. 24, 889–901. https://doi.org/10.1007/s00330-013-3083-8 (2014).
Article PubMed Google Scholar
Di Donna, M. C. et al. Concordance of radiological, laparoscopic and laparotomic scoring to predict complete cytoreduction in women with advanced ovarian cancer. Cancers 15, 500. https://doi.org/10.3390/cancers15020500 (2023).
Article PubMed PubMed Central Google Scholar
Bagul, K., Vijaykumar, D. K., Rajanbabu, A., Antony, M. A. & Ranganathan, V. Advanced primary epithelial ovarian and peritoneal carcinoma—Does diagnostic accuracy of preoperative CT scan for detection of peritoneal metastatic sites reflect into prediction of suboptimal debulking? A prospective study. Indian J. Surg. Oncol. 8, 98–104. https://doi.org/10.1007/s13193-016-0601-6 (2017).
Article PubMed PubMed Central Google Scholar
Bristow, R. E. et al. A model for predicting surgical outcome in patients with advanced ovarian carcinoma using computed tomography. Cancer 89, 1532–1540. https://doi.org/10.1002/1097-0142(20001001)89:7%3c1532::aid-cncr17%3e3.0.co;2-a (2000).
Article CAS PubMed Google Scholar
Tewari, K. S., Java, J. J., Eskander, R. N., Monk, B. J. & Burger, R. A. Early initiation of chemotherapy following complete resection of advanced ovarian cancer associated with improved survival: NRG Oncology/Gynecologic Oncology Group study. Ann. Oncol. 27, 114–121. https://doi.org/10.1093/annonc/mdv500 (2016).
Article CAS PubMed Google Scholar
Fagotti, A. et al. Randomized trial of primary debulking surgery versus neoadjuvant chemotherapy for advanced epithelial ovarian cancer (SCORPION-NCT01461850). Int. J. Gynecol. Cancer 30, 1657–1664. https://doi.org/10.1136/ijgc-2020-001640 (2020).
Article PubMed Google Scholar
Onda, T. et al. Comparison of survival between primary debulking surgery and neoadjuvant chemotherapy for stage III/IV ovarian, tubal and peritoneal cancers in phase III randomised trial. Eur. J. Cancer 130, 114–125. https://doi.org/10.1016/j.ejca.2020.02.020 (2020).
Article CAS PubMed Google Scholar
Reuss, A. et al. TRUST: Trial of Radical Upfront Surgical Therapy in advanced ovarian cancer (ENGOT ov33/AGO-OVAR OP7). Int. J. Gynecol. Cancer 29, 1327–1331. https://doi.org/10.1136/ijgc-2019-000682 (2019).
Article PubMed Google Scholar
Tunariu, N. et al. What’s new for clinical whole-body MRI (WB-MRI) in the 21st century. Br. J. Radiol. 93, 20200562. https://doi.org/10.1259/bjr.20200562 (2020).
Article PubMed PubMed Central Google Scholar
Tempany, C. M. et al. Staging of advanced ovarian cancer: Comparison of imaging modalities--report from the Radiological Diagnostic Oncology Group. Radiology 215, 761–767. https://doi.org/10.1148/radiology.215.3.r00jn25761 (2000).
Article CAS PubMed Google Scholar
Schmidt, S., Meuli, R. A., Achtari, C. & Prior, J. O. Peritoneal carcinomatosis in primary ovarian cancer staging. Clin. Nucl. Med. 40, 371–377. https://doi.org/10.1097/rlu.0000000000000768 (2015).
Article PubMed Google Scholar
Alcázar, J. L. et al. Pre-operative assessment of intra-abdominal disease spread in epithelial ovarian cancer: A comparative study between ultrasound and computed tomography. Int. J. Gynecol. Cancer 29, 227–233. https://doi.org/10.1136/ijgc-2018-000066 (2019).
Article PubMed Google Scholar
Rizzo, S. et al. Pre-operative evaluation of epithelial ovarian cancer patients: Role of whole body diffusion weighted imaging MR and CT scans in the selection of patients suitable for primary debulking surgery. A single-centre study. Eur. J. Radiol. 123, 108786. https://doi.org/10.1016/j.ejrad.2019.108786 (2020).
Article PubMed Google Scholar
Yu, A. C., Mohajer, B. & Eng, J. External Validation of Deep Learning Algorithms for Radiologic Diagnosis: A Systematic Review. Radiol.: Artif. Intell. https://doi.org/10.1148/ryai.210064 (2022).
Article PubMed PubMed Central Google Scholar
Kim, D. W., Jang, H. Y., Kim, K. W., Shin, Y. & Park, S. H. Design characteristics of studies reporting the performance of artificial intelligence algorithms for diagnostic analysis of medical images: Results from recently published papers. Korean J. Radiol. 20, 405. https://doi.org/10.3348/kjr.2019.0025 (2019).
Article PubMed PubMed Central Google Scholar
Eba, J. & Nakamura, K. Overview of the ethical guidelines for medical and biological research involving human subjects in Japan. Jpn. J. Clin. Oncol. 52, 539–544. https://doi.org/10.1093/jjco/hyac034 (2022).
Article PubMed PubMed Central Google Scholar
Lampe, B., Kroll, N., Piso, P., Forner, D. M. & Mallmann, P. Prognostic significance of Sugarbaker’s peritoneal cancer index for the operability of ovarian carcinoma. Int. J. Gynecol. Cancer 25, 135–144. https://doi.org/10.1097/igc.0000000000000327 (2015).
Article PubMed Google Scholar
Climent, M. T., Serra, A., Gilabert-Estellés, J., Gilabert-Aguilar, J. & Llueca, A. Comparison of peritoneal carcinomatosis scoring methods in predicting resectability and prognosis in gynecologic malignancies. J. Clin. Med. 10, 2553. https://doi.org/10.3390/jcm10122553 (2021).
Article PubMed PubMed Central Google Scholar
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. & Chen, L.-C. in 2018 IEEE/CVF conference on computer vision and pattern recognition. 4510–4520 (IEEE).
Takahashi, Y. et al. Automated system for diagnosing endometrial cancer by adopting deep-learning technology in hysteroscopy. PLoS ONE 16, e0248526. https://doi.org/10.1371/journal.pone.0248526 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wilson., E. B. Probable Inference, the Law of Succession, and Statistical Inference. J. Am. Stat. Assoc. 22, 209 (1927).
Article Google Scholar

Download references

Acknowledgements

We thank all the team members at The Jikei University School of Medicine for their contributions to patient care and data acquisition, and SIOS Technology, Inc., for their cooperation. We are also grateful to the Otolaryngology-led AI Research Team at Jikei University, led by Dr. Masahiro Takahashi and consisting primarily of otolaryngologists, for their expertise in AI-based medical research and for facilitating collaboration with the Department of Obstetrics and Gynecology, which enabled this interdisciplinary study. We further thank Dr. Kazuhiko Morikawa, Dr. Youhei Ohki, and Dr. Akira Baba of the Department of Radiology, Jikei University, for their valuable assistance with the study design. Finally, we thank Oxford Science Editing (https://www.oxfordscience.org/) for English language editing. The graphical abstract was created using BioRender (Grant No. WZ28OBEZPR, YU28S6DBAR).

Funding

This work was supported by the Department of Obstetrics and Gynecology at the Jikei University School of Medicine, and by JSPS KAKENHI (Grant Number JP80569559). JSPS KAKENHI had no role in the study, the Department of Obstetrics and Gynecology at the Jikei University School of Medicine contributed by providing clinical data.

Author information

Authors and Affiliations

Department of Obstetrics and Gynecology, The Jikei University School of Medicine, 3-25-8 Nishi-Shinbashi, Minato-Ku, Tokyo, 105-8461, Japan
Ria Kim, Toshiyuki Seki, Kota Yokosu, Rintaro Hamada, Erika Habuchi, Hirokuni Takano & Aikou Okamoto
SIOS Technology, Inc., Tokyo, Japan
Katsuhiko Noda & Kaname Yoshida
Department of Otorhinolaryngology, The Jikei University School of Medicine, Tokyo, 105-8461, Japan
Masahiro Takahashi

Authors

Ria Kim
View author publications
Search author on:PubMed Google Scholar
Toshiyuki Seki
View author publications
Search author on:PubMed Google Scholar
Katsuhiko Noda
View author publications
Search author on:PubMed Google Scholar
Kaname Yoshida
View author publications
Search author on:PubMed Google Scholar
Kota Yokosu
View author publications
Search author on:PubMed Google Scholar
Rintaro Hamada
View author publications
Search author on:PubMed Google Scholar
Erika Habuchi
View author publications
Search author on:PubMed Google Scholar
Masahiro Takahashi
View author publications
Search author on:PubMed Google Scholar
Hirokuni Takano
View author publications
Search author on:PubMed Google Scholar
Aikou Okamoto
View author publications
Search author on:PubMed Google Scholar

Contributions

Conception and design: Toshiyuki Seki, Ria Kim. Acquisition of clinicopathological data: Toshiyuki Seki, Ria Kim, Kota Yokosu, Rintaro Hamada, Erika Habuchi. Analysis and interpretation of data: Katsuhiko Noda, Kaname Yoshida, Ria Kim, Toshiyuki Seki. Writing, review: Ria Kim, Toshiyuki Seki, Katsuhiko Noda, Kaname Yoshida Study supervision: Toshiyuki Seki, Katsuhiko Noda, Kaname Yoshida, Masahiro Takahashi, Hirokuni Takano, Aikou Okamoto.

Corresponding author

Correspondence to Toshiyuki Seki.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information. (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, R., Seki, T., Noda, K. et al. Diagnostic performance of artificial intelligence for detecting peritoneal and small bowel dissemination in epithelial ovarian cancer using preoperative contrast-enhanced CT imaging. Sci Rep 16, 8739 (2026). https://doi.org/10.1038/s41598-026-41728-4

Download citation

Received: 24 October 2025
Accepted: 23 February 2026
Published: 23 March 2026
Version of record: 23 March 2026
DOI: https://doi.org/10.1038/s41598-026-41728-4

Subjects

Abstract

Introduction

Results

P-Model for peritoneal dissemination diagnosis

SB-Model for small bowel dissemination diagnosis

Evaluation of cases with low diagnostic accuracy by the AI model

Discussion

Methods

Patients and clinical information

Patient characteristics

CT imaging

Number of CT images

Types of dissemination diagnostic models

Shape of DNN model

Training and cross-validation of the DNN model

Evaluation of DNN models

Evaluation of CT images by radiologists

Statistical analysis

Ethics

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Supplementary Information

Supplementary Information. (download DOCX )

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links