Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

npj Digital Medicine
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. npj digital medicine
  3. articles
  4. article
Large language models in systematic review and meta-analysis of surgical treatments for vaginal vault prolapse
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 19 February 2026

Large language models in systematic review and meta-analysis of surgical treatments for vaginal vault prolapse

  • Yunjeong Park1,2,
  • Hyun-Soo Zhang3 &
  • Sang Wook Bai  ORCID: orcid.org/0000-0001-7724-75521,2 

npj Digital Medicine , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Diseases
  • Health care
  • Medical research

Abstract

Systematic reviews provide the highest level of evidence but remain resource-intensive. We evaluated the performance of a large language model (LLM; ChatGPT, OpenAI) in a PRISMA-guided review of randomized controlled trials on vaginal vault prolapse surgery. Prompts were carefully designed to minimize errors, and outputs were verified. Each task was completed within minutes. For title/abstract screening, recall was 69.8% and precision 85.7% (κ = 0.77); full-text agreement 94.1–100% (κ = 0.82–1); data extraction accuracy 87.5–99.7%. From 18 RCTs (1668 women), sacrocolpopexy (SC) showed higher anatomic success than sacrospinous fixation (SSF) (OR 1.42, 95% CI 0.71–2.84). Transvaginal mesh improved 3-year objective success compared with SSF (OR 1.84, 95% CI 1.13–2.99) but had higher reoperation rates (5–16% vs 2–4%) than SC. We did not find conclusive evidence that any single technique is superior; most comparisons were underpowered, with wide confidence intervals and substantial heterogeneity. All LLM-derived statistical results were identical to those from conventional R analyses, confirming robustness. Validated LLM workflows can enable more efficient and scalable evidence synthesis.

Similar content being viewed by others

Effectiveness of large language models in preoperative and discharge education: a systematic review based on an evaluation framework

Article Open access 07 January 2026

Pre-treatment sarcopenic assessments as a prognostic factor for gynaecology cancer outcomes: systematic review and meta-analysis

Article 22 February 2022

Comparison of first versus second line sacrocolpopexies in terms of morbidity and mid-term efficacy

Article Open access 29 September 2022

Data availability

All data analyzed in this study were derived from published randomized controlled trials included in the systematic review and meta-analysis. No new raw patient-level data were generated. The datasets supporting screening decisions, extracted variables, and ChatGPT-assisted workflow outputs are available from the corresponding author upon reasonable request.

References

  1. Wang, B. et al. Global burden and trends of pelvic organ prolapse associated with aging women: An observational trend study from 1990 to 2019. Front. Pub. Health 10 - 2022, https://doi.org/10.3389/fpubh.2022.975829 (2022).

  2. Nüssler, E., Granåsen, G., Bixo, M. & Löfgren, M. Long-term outcome after routine surgery for pelvic organ prolapse—A national register-based cohort study. Int. Urogynecology J. 33, 1863–1873 (2022).

    Google Scholar 

  3. Løwenstein, E., Møller, L. A., Laigaard, J. & Gimbel, H. Reoperation for pelvic organ prolapse: a Danish cohort study with 15-20 years’ follow-up. Int Urogynecol J. 29, 119–124 (2018).

    Google Scholar 

  4. DeLancey, J. O. What’s new in the functional anatomy of pelvic organ prolapse?. Curr. Opin. Obstet. Gynecol. 28, 420–429 (2016).

    Google Scholar 

  5. Trutnovsky, G., Robledo, K. P., Shek, K. L. & Dietz, H. P. Definition of apical descent in women with and without previous hysterectomy: A retrospective analysis. PLoS One 14, e0213617 (2019).

    Google Scholar 

  6. Brunes, M. et al. Vaginal vault prolapse and recurrent surgery: A nationwide observational cohort study. Acta Obstet. Gynecol. Scand. 101, 542–549 (2022).

    Google Scholar 

  7. Woodruff, A. J., Roth, C. C. & Winters, J. C. Abdominal sacral colpopexy: Surgical pearls and outcomes. Curr. Urol. Rep. 8, 399–404 (2007).

    Google Scholar 

  8. Chaliha, C. & Khullar, V. Management of vaginal prolapse. Women’s. Health (Lond.) 2, 279–287 (2006).

    Google Scholar 

  9. Alshami, A., Elsayed, M., Ali, E., Eltoukhy, A. E. E. & Zayed, T. Harnessing the Power of ChatGPT for Automating Systematic Review Process: Methodology, Case Study, Limitations, and Future Directions. Systems 11, 351 (2023).

    Google Scholar 

  10. Sarkis-Onofre, R., Catalá-López, F., Aromataris, E. & Lockwood, C. How to properly use the PRISMA Statement. Syst. Rev. 10, 117 (2021).

    Google Scholar 

  11. Borah, R., Brown, A. W., Capers, P. L. & Kaiser, K. A. Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry. BMJ Open 7, e012545 (2017).

    Google Scholar 

  12. Luo, X. et al. Potential Roles of Large Language Models in the Production of Systematic Reviews and Meta-Analyses. J. Med. Internet Res. 26, e56780 (2024).

    Google Scholar 

  13. Maher, C. F. et al. Abdominal sacral colpopexy or vaginal sacrospinous colpopexy for vaginal vault prolapse: a prospective randomized study. Am. J. Obstet. Gynecol. 190, 20–26 (2004).

    Google Scholar 

  14. Culligan, P. J. et al. A randomized controlled trial comparing fascia lata and synthetic mesh for sacral colpopexy. Obstet. Gynecol. 106, 29–37 (2005).

    Google Scholar 

  15. Maher, C. F. et al. Laparoscopic sacral colpopexy versus total vaginal mesh for vaginal vault prolapse: a randomized trial. Am. J. Obstet. Gynecol. 204, 360.e361–367 (2011).

    Google Scholar 

  16. Paraiso, M. F. R., Jelovsek, J. E., Frick, A., Chen, C. C. G. & Barber, M. D. Laparoscopic compared with robotic sacrocolpopexy for vaginal prolapse: a randomized controlled trial. Obstet. Gynecol. 118, 1005–1013 (2011).

    Google Scholar 

  17. Tate, S. B., Blackwell, L., Lorenz, D. J., Steptoe, M. M. & Culligan, P. J. Randomized trial of fascia lata and polypropylene mesh for abdominal sacrocolpopexy: 5-year follow-up. Int Urogynecol J. 22, 137–143 (2011).

    Google Scholar 

  18. Halaska, M. et al. A multicenter, randomized, prospective, controlled study comparing sacrospinous fixation and transvaginal mesh in the treatment of posthysterectomy vaginal vault prolapse. Am. J. Obstet. Gynecol. 207, 301.e301–301.e307 (2012).

    Google Scholar 

  19. Freeman, R. M. et al. A randomised controlled trial of abdominal versus laparoscopic sacrocolpopexy for the treatment of post-hysterectomy vaginal vault prolapse: LAS study. Int. Urogynecology J. Pelvic Floor Dysfunct. 24, 377–384 (2013).

    Google Scholar 

  20. Svabik, K., Martan, A., Masata, J., El-Haddad, R. & Hubka, P. Comparison of vaginal mesh repair with sacrospinous vaginal colpopexy in the management of vaginal vault prolapse after hysterectomy in patients with levator ani avulsion: a randomized controlled trial. Ultrasound Obstet. Gynecol. : Off. J. Int. Soc. Ultrasound Obstet. Gynecol. 43, 365–371 (2014).

    Google Scholar 

  21. Coolen, A. W. M. et al. Laparoscopic sacrocolpopexy compared with open abdominal sacrocolpopexy for vault prolapse repair: a randomised controlled trial. Int Urogynecol J. 28, 1469–1479 (2017).

    Google Scholar 

  22. Ow, L. L. et al. RCT of vaginal extraperitoneal uterosacral ligament suspension (VEULS) with anterior mesh versus sacrocolpopexy: 4-year outcome. Int Urogynecol J. 29, 1607–1614 (2018).

    Google Scholar 

  23. Ferrando, C. A. & Paraiso, M. F. R. A Prospective Randomized Trial Comparing Restorelle Y Mesh and Flat Mesh for Laparoscopic and Robotic-Assisted Laparoscopic Sacrocolpopexy. Female Pelvic Med. Reconstructive Surg. 25, 83–87 (2019).

    Google Scholar 

  24. Galad, J., Papcun, P., Dudic, R. & Urdzik, P. Single‑incision mesh vs sacrospinous ligament fixation in posthysterectomy women at a three-year follow-up: a randomized trial. Bratisl. lekarske listy 121, 640–647 (2020).

    Google Scholar 

  25. Hemming, C. et al. Surgical interventions for uterine prolapse and for vault prolapse: The two VUE RCTs. Health Technol. Assess. 24, 1–219 (2020).

    Google Scholar 

  26. Ferrando, C. A. & Paraiso, M. F. R. A prospective randomized trial comparing Restorelle® Y mesh and flat mesh for laparoscopic and robotic-assisted laparoscopic sacrocolpopexy: 24-month outcomes. Int Urogynecol J. 32, 1565–1570 (2021).

    Google Scholar 

  27. van Oudheusden, A. M. J. et al. Laparoscopic sacrocolpopexy versus abdominal sacrocolpopexy for vaginal vault prolapse: long-term follow-up of a randomized controlled trial. Int Urogynecol J. 34, 93–104 (2023).

    Google Scholar 

  28. van Oudheusden, A. M. J. et al. Laparoscopic sacrocolpopexy versus vaginal sacrospinous fixation for vaginal vault prolapse: a randomised controlled trial and prospective cohort (SALTO-2 trial). Bjog 130, 1542–1551 (2023).

    Google Scholar 

  29. Menefee, S. A. et al. Apical Suspension Repair for Vaginal Vault Prolapse A Randomized Clinical Trial. JAMA Surg. 159, 845–855 (2024).

    Google Scholar 

  30. Andy, U. U. et al. Body Image and sexual function improve following prolapse repair. American J. Obstetrics Gynecology https://doi.org/10.1016/j.ajog.2025.01.042 (2025).

  31. Choong, M. K., Galgani, F., Dunn, A. G. & Tsafnat, G. Automatic evidence retrieval for systematic reviews. J. Med Internet Res 16, e223 (2014).

    Google Scholar 

  32. Kim, J. K., Chua, M. E., Li, T. G., Rickard, M. & Lorenzo, A. J. Novel AI applications in systematic review: GPT-4 assisted data extraction, analysis, review of bias. BMJ Evidence-Based Med., bmjebm-2024-113066, https://doi.org/10.1136/bmjebm-2024-113066 (2025).

  33. Reason, T. et al. Artificial Intelligence to Automate Network Meta-Analyses: Four Case Studies to Evaluate the Potential Application of Large Language Models. PharmacoEconomics - Open 8, 205–220 (2024).

    Google Scholar 

  34. Chelli, M. et al. Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis. J. Med. Internet Res. 26, e53164 (2024).

    Google Scholar 

  35. Myers, S. et al. Lessons learned on information retrieval in electronic health records: a comparison of embedding models and pooling strategies. J. Am. Med. Inf. Assoc. 32, 357–364 (2025).

    Google Scholar 

  36. Asgari, E. et al. A framework to assess clinical safety and hallucination rates of LLMs for medical text summarisation. NPJ Digit Med. 8, 274 (2025).

    Google Scholar 

  37. Minozzi, S., Cinquini, M., Gianola, S., Gonzalez-Lorenzo, M. & Banzi, R. The revised Cochrane risk of bias tool for randomized trials (RoB 2) showed low interrater reliability and challenges in its application. J. Clin. Epidemiol. 126, 37–44 (2020).

    Google Scholar 

  38. Lai, H. et al. Assessing the Risk of Bias in Randomized Clinical Trials With Large Language Models. JAMA Netw. Open 7, e2412687–e2412687 (2024).

    Google Scholar 

  39. Haltaufderheide, J. & Ranisch, R. The ethics of ChatGPT in medicine and healthcare: a systematic review on Large Language Models (LLMs). npj Digital Med. 7, 183 (2024).

    Google Scholar 

  40. Hu, D., Guo, Y., Zhou, Y., Flores, L. & Zheng, K. A systematic review of early evidence on generative AI for drafting responses to patient messages. npj Health Syst. 2, 27 (2025).

    Google Scholar 

  41. Mitchell, E., Are, E. B., Colijn, C. & Earn, D. J. D. Using artificial intelligence tools to automate data extraction for living evidence syntheses. PLOS ONE 20, e0320151 (2025).

    Google Scholar 

  42. Cosson, M. et al. A study of pelvic ligament strength. Eur. J. Obstet. Gynecol. Reprod. Biol. 109, 80–87 (2003).

    Google Scholar 

  43. Torosis, M. & Ackerman, A. L. Patient-reported outcomes in pelvic organ prolapse repair: the missing information needed to inform our understanding of what matters most to patients. Gynecology Pelvic Med. 7, 1–3 (2024).

    Google Scholar 

  44. Dyer, O. Transvaginal mesh: FDA orders remaining products off US market. Bmj 365, l1839 (2019).

    Google Scholar 

  45. Coolen, A.-L. W. M. et al. The treatment of post-hysterectomy vaginal vault prolapse: a systematic review and meta-analysis. Int. Urogynecology J. 28, 1767–1783 (2017).

    Google Scholar 

  46. Dindo, D., Demartines, N. & Clavien, P. A. Classification of surgical complications: a new proposal with evaluation in a cohort of 6336 patients and results of a survey. Ann. Surg. 240, 205–213 (2004).

    Google Scholar 

  47. Zeng, G. On the confusion matrix in credit scoring and its analytical properties. Commun. Stat. - Theory Methods 49, 2080–2093 (2020).

    Google Scholar 

  48. Li, M., Gao, Q. & Yu, T. Kappa statistic considerations in evaluating inter-rater reliability between two raters: which, when and context matters. BMC Cancer 23, 799 (2023).

    Google Scholar 

  49. Issaiy, M. et al. Methodological insights into ChatGPT’s screening performance in systematic reviews. BMC Med. Res. Methodol. 24, 78 (2024).

    Google Scholar 

  50. Wohlin, C. In Proceedings of the 18th International Conference on Evaluation and Assessment in Software Engineering Article 38 (Association for Computing Machinery, London, England, United Kingdom, 2014).

  51. Lai, H. et al. Language models for data extraction and risk of bias assessment in complementary medicine. npj Digital Med. 8, 74 (2025).

    Google Scholar 

  52. Sterne, J. A. C. et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. Bmj 366, l4898 (2019).

    Google Scholar 

Download references

Acknowledgements

The authors acknowledge the Yonsei University Medical Library for support with the literature search. ChatGPT (OpenAI, San Francisco, CA, USA) was used to assist with English language refinement, under the authors’ supervision.

Author information

Authors and Affiliations

  1. Department of Obstetrics and Gynecology, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea

    Yunjeong Park & Sang Wook Bai

  2. Institute of Women’s Life Medical Science, Yonsei University College of Medicine, Seoul, Republic of Korea

    Yunjeong Park & Sang Wook Bai

  3. Biostatistics Collaboration Unit, Department of Biomedical Informatics, College of Medicine, Yonsei University, Seoul, Republic of Korea

    Hyun-Soo Zhang

Authors
  1. Yunjeong Park
    View author publications

    Search author on:PubMed Google Scholar

  2. Hyun-Soo Zhang
    View author publications

    Search author on:PubMed Google Scholar

  3. Sang Wook Bai
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Y.P. - Conceptualization, literature search, study selection, data extraction, quality assessment, preparation of figures and tables, drafting of the manuscript H.Z. - Statistical analysis, data synthesis, preparation of figures and tables, methodological consultation S.W.B. - Literature search, data extraction, validation, interpretation of findings, Project administration.

Corresponding author

Correspondence to Sang Wook Bai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary data 1

Supplementary data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Park, Y., Zhang, HS. & Bai, S.W. Large language models in systematic review and meta-analysis of surgical treatments for vaginal vault prolapse. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02431-w

Download citation

  • Received: 21 August 2025

  • Accepted: 01 February 2026

  • Published: 19 February 2026

  • DOI: https://doi.org/10.1038/s41746-026-02431-w

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Associated content

Collection

Impact of Agentic AI on Care Delivery

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Content types
  • Journal Information
  • About the Editors
  • Contact
  • Editorial policies
  • Calls for Papers
  • Journal Metrics
  • About the Partner
  • Open Access
  • Early Career Researcher Editorial Fellowship
  • Editorial Team Vacancies
  • News and Views Student Editor
  • Communication Fellowship

Publish with us

  • For Authors and Referees
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

npj Digital Medicine (npj Digit. Med.)

ISSN 2398-6352 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing