Abstract
Accurate diagnosis and prognostic assessment of gastric cancer are critical for improving patient outcomes. The application of advanced machine learning methods, particularly the XGBoost algorithm, offers a promising approach for enhancing prognostic evaluations. In this study, data from 2,270 patients with gastric cancer were analysed to develop a predictive model for prognosis using the XGBoost algorithm and 20 key clinical features. Comprehensive data collection, preprocessing, and feature selection were conducted to ensure robust model construction and validation. The model demonstrated strong predictive performance in the test cohort, achieving an area under the curve (AUC) of 0.855, and it effectively differentiated patients at high-risk from those at low-risk. Feature importance analysis revealed that pTNM stage and CA125 level were the most influential prognostic factors. This study successfully implemented a machine learning-based model integrating the XGBoost algorithm and critical clinical indicators to predict the five-year survival rate of patients with gastric cancer. The findings highlight the potential of such approaches in supporting personalised treatment strategies and advancing cancer prognosis assessment methodologies.
Similar content being viewed by others
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhang, Y., Zhou, X., Li, P. et al. XGBoost-based model for predicting five-year survival in gastric cancer using clinical indicators. Sci Rep (2026). https://doi.org/10.1038/s41598-026-50043-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-50043-x


