Introduction

Gastric cancer (GC) remains a major global health burden, ranking sixth in incidence and third in mortality1. In 2020, 1.089 million new cases and 769,000 deaths were reported globally2 with China contributing 43.94% of global incidence (478,508 cases)3. GC prognosis is stage-dependent, with 5-year survival rates between 20% and 40% globally. For locally advanced gastric cancer (LAGC), this drops below 30%, necessitating surgery-centered comprehensive treatment4. Neoadjuvant chemotherapy (NAC) is now the global standard for LAGC, supported by trials like Germany’s FLOT4-AIO5 and Asia’s RESOLVE6. The Orient-16 trial7 established PD-1 inhibitor-based immunotherapy combined with chemotherapy as a first-line option for unresectable locally advanced, recurrent, or metastatic gastric/gastroesophageal junction (GEJ) adenocarcinoma.

However, standardized methods for evaluating neoadjuvant therapy efficacy remain limited. The international community primarily uses WHO criteria8 and RECIST 1.19. WHO criteria assess response via the product of longest tumor diameter and its perpendicular counterpart, while RECIST 1.1 defines target lesions and evaluates changes in total diameters from baseline10. Because the stomach is non-solid, primary gastric lesions are classified as non-measurable under RECIST, with lymph nodes ≥ 1.5 cm in short axis considered measurable11. This limitation hinders preoperative assessments, delaying precision medicine and optimal surgical timing.

To address this, researchers proposed modified criteria (e.g., irRECIST12 and imRECIST13 and functional imaging standards (e.g., EORTC14 and PERCIST 1.015). Techniques like contrast-enhanced ultrasound(US), dual-source Computed Tomography(CT) iodine uptake measurement, MRI-derived ADC values, and AI-integrated imaging with multi-omics strategies show high sensitivity and specificity. However, no universally accepted method exists due to applicability challenges, despite their accuracy.

In parallel with a prospective study on NAC combined with immunotherapy for LAGC, our team evaluated high-resolution CT and color Doppler ultrasound against postoperative TRG using the CAP system16. The goal was to assess models derived from routine methods in evaluating therapeutic outcomes after neoadjuvant treatment, providing a clinically relevant tool for efficacy evaluation.Materials and Methods.

Methods

Data collection

This study compiled multi-center data from the investigation of “Tislelizumab combined with Apatinib and chemotherapy as neoadjuvant treatment for Borrmann IV type, large Borrmann III type, and Bulky N LAGC. “Data were collected from five Fujian-based hospitals: Fujian Cancer Hospital, Sanming First Hospital, Sanming Second Hospital, Putian University Affiliated Hospital, and Xiapu County Hospital.

Inclusion criteria

  • ECOG performance status of 0–1;

  • Age 18–75 years;

  • Histologically confirmed AJCC 8th edition LAGC, with tumors > 5 cm in basal diameter categorized as Borrmann III, Borrmann IV, or Bulky N (enlarged lymph nodes > 3 cm around specific gastric regions or more than two adjacent lymph nodes > 1.5 cm);

  • Preoperative endoscopy confirming no peritoneal dissemination and negative cytology;

  • No prior surgery, radiotherapy, chemotherapy, or immunotherapy.

Exclusion criteria

  • History of other malignant tumors;

  • Acute oncological emergencies (bleeding, obstruction, perforation);

  • Known HER2 positivity;

  • Allergy to study drugs;

  • Conditions interfering with trial outcomes;

  • Mucinous adenocarcinoma and lymphoepithelioma-like carcinoma, excluded due to reduced TRG grading accuracy in pathology assessments.

  • The study was approved by the Ethics Committees of all participating institutions.

Treatment regimen(each 21-day cycle):

  • Tislelizumab (TSZ): 200 mg IV on Day 1.

  • Apatinib (APA): 250 mg/day orally (Days 1–21).

  • Oxaliplatin (OXA): 130 mg/m² IV on Day 1.

  • S-1: Oral dosing based on BSA (Days 1–14):

  • < 1.25 m²: 40 mg bid.

  • 1.25–1.50 m²: 50 mg bid.

  • ≥ 1.50 m²: 60 mg bid.

Four cycles were administered, with APA omitted in the final cycle, followed by gastrectomy with D2 lymphadenectomy and R0 resection. Imaging assessment was performed per modified RECIST 1.1.

Imaging protocols

Imaging modalities included CT scans with a slice thickness of ≤ 5 mm, using oral negative contrast for stomach expansion, followed by low-dose preparation and iodinated contrast enhancement. Color Doppler ultrasonography involved the patient consuming 400-500 ml of a gastrointestinal ultrasound contrast agent in a semi-solid state prepared with hot water 5 min before the exam. Scanning in various positions assessed lesion location, size, local gastric wall structure, depth of infiltration, gastric motility, and the presence of lymphadenopathy or metastasis to adjacent organs.

  • CT: ≤5 mm slice thickness, oral negative contrast for stomach expansion, low-dose prep, and iodinated contrast.

  • Color Doppler US: Patients ingested 400–500 mL of semi-solid ultrasound contrast (prepared with hot water 5 min pre-exam).

  • Assessment Parameters: Lesion location, size, gastric wall structure, infiltration depth, motility, lymphadenopathy, and organ metastasis.

  • Image Interpretation Measures.

  • Standardized SOPs: Unified reading guidelines and SOPs ensured consistent evaluation.

  • Blinded Review: Readers had no access to patient identifiers or clinical data.

  • CAD Systems: Reduced subjectivity via computer-aided diagnosis (CAD).

  • Training & Certification: Readers completed skill-specific training programs.

  • Inter-Rater Reliability: Kappa value testing monitored agreement rates.

  • Dual Review: CT: 2 radiologists reviewed independently; discrepancies resolved by consensus; US: 2 sonographers reviewed independently; discrepancies resolved by consensus.

  • Re-Reading: Critical/controversial cases underwent multi-reader/multi-examiner re-evaluation for CT/US.Pathological Assessment.

Pathological response assessment (TRG System)

TRG (American Society of Clinical Oncology): This study’s pathological review provided detailed definitions for the percentage of residual cancer cells in each grade, as follows:

  1. 1.

    TRG 0 = No viable cancer cells (complete response, 0% residual cells).

  2. 2.

    TRG 1 = Isolated or small clusters of residual cells (moderate response, < 10% residual cells).

  3. 3.

    TRG 2 = Residual tumor with fibrosis (minimal response, 10–90% residual cells).

  4. 4.

    TRG 3 = Minimal or no tumor regression, abundant residual cancer cells• (poor response, > 90% residual cells).

Efficacy evaluation criteria design

The first category of evaluation criteria adopted the classical Response Evaluation Criteria in Solid Tumors (RECIST, version 1.1)17 Models were established using ultrasound-measured gastric tumor diameters. To address diverse clinical scenarios and imaging limitations, five distinct evaluation methods were designed and validated against pathological tumor regression grades (TRG) via Receiver Operating Characteristic(ROC)curves, calibration curves, and Decision Curve Analysis (DCA) curves. Standard RECIST v1.118,19 defines measurable lymph nodes as those with a short-axis diameter > 1.5 cm, which remains the mainstream standard. However, diagnostic studies have proposed lower thresholds (0.5–0.7 cm), requiring integration of morphology, long-to-short axis ratio, enhancement pattern, and margin characteristics. Radiomics techniques, which extract thousands of features (e.g., textural, wavelet) for machine learning20,21,22,23,24.

Through prospective internal validation at our center using postoperative pathology confirmation, we demonstrated that a lymph node short-axis threshold of ≥ 0.7 cm achieved 62% sensitivity and 91% specificity. Therefore, this study adopted ≥ 0.7 cm as the threshold. Final lymph node positivity assessments required comprehensive evaluation combining morphology, enhancement patterns, and other features. We recommend that medical centers carefully consider local technology and validation data before adjusting the > 1.5 cm gold standard.

Abbreviations include Sum of short axes of lymph nodes measured by CT = CT Lymph Node Diameter Sum; Maximum short axis of lymph nodes measured by CT = CT Lymph Node Diameter; Maximum thickness of gastric lesion measured by CT = CT Gastric Lesion Thickness; Maximum basal width of gastric lesion measured by ultrasound = US Gastric Lesion Base Width; Maximum thickness of gastric lesion measured by ultrasound = US Gastric Lesion Thickness. Pre-tx/Post-tx (before/after treatment).

RECIST_Expansion group: This cohort evaluated the predictive accuracy of RECIST 1.1 in estimating TRG. Following standard RECIST 1.1 methodology, gastric primary lesions were classified as non-measurable targets, with treatment response assessed solely via CT-measured changes in the sum of lymph node short-axis diameters. However, clinically, lymph nodes > 1.5 cm are rarely observed. According to recent Literature and our prospective validation, we recommend a reduced threshold of 0.7 cm for the short-axis diameter of lymph nodes.

Evaluation formula: Shrinkage rate =

$$\:\frac{\:(\text{P}\text{r}\text{e}-\text{t}\text{x})\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}\:\text{S}\text{u}\text{m}-\:(\text{P}\text{o}\text{s}\text{t}-\text{t}\text{x})\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}\:\text{S}\text{u}\text{m}\:}{(\text{P}\text{r}\text{e}-\text{t}\text{x})\:\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}\:\text{S}\text{u}\text{m}\:}$$

UC_RECIST Group(Ultrasound-Modified RECIST Group): While RECIST 1.1 defines measurable lesions (> 10 mm longest diameter), gastric lesions are categorized as non-measurable due to their hollow viscus anatomy. This cohort explored reclassifying gastric lesions as measurable targets using ultrasound-measured basal width as the longest diameter. Treatment response was evaluated by combining this parameter with CT-assessed lymph node regression.

Evaluation formula: Shrinkage rate=

$$\:\frac{\text{Pre}-\text{tx}( \text{CT Lymph Node Diameter Sum} +\text{US Gastric Lesion Base Width} ) - \text{Post} - \text{tx} ( \text{CT Lymph Node Diameter Sum} + \text{US Gastric Lesion Base Width} ) } {\text{Pre} - \text{tx} ( \text{CT Lymph Node Diameter Sum} + \text{US Gastric Lesion Base Width} ) }$$

CT_RECIST Group(CT-Only RECIST Group): Not all centers can use ultrasound for gastric tumor assessment; CT is more widely accessible. This cohort established CT-exclusive criteria to validate their accuracy. Due to CT’s limitations in measuring gastric lesion basal width, treatment response was assessed via CT-measured gastric lesion thickness changes and lymph node regression.

Evaluation formula: Shrinkage rate=

$$\:\frac{\text{P}\text{r}\text{e}-\text{t}\text{x}( \text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}+\text{C}\text{T}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{T}\text{h}\text{i}\text{c}\text{k}\text{n}\text{e}\text{s}\text{s} - \text{P}\text{o}\text{s}\text{t} - \text{t}\text{x}(\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}+\text{C}\text{T}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{T}\text{h}\text{i}\text{c}\text{k}\text{n}\text{e}\text{s}\text{s})}{\text{P}\text{r}\text{e} - \text{t}\text{x}( \text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}+\text{C}\text{T}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{T}\text{h}\text{i}\text{c}\text{k}\text{n}\text{e}\text{s}\text{s})}$$

U_RECIST Group(Ultrasound-Only RECIST Group): As a radiation-free, cost-effective modality, ultrasonography enables more frequent monitoring than CT during treatment cycles. This facilitates earlier detection of disease progression, enhancing precision oncology strategies. This cohort evaluated ultrasound alone as a safer, clinically accurate alternative for treatment response assessment.

Evaluation formula: Shrinkage rate=

$$\:\frac{ ( \text{P}\text{r}\text{e} -\text{t}\text{x}\:\text{U}\text{S}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{B}\text{a}\text{s}\text{e}\:\text{W}\text{i}\text{d}\text{t}\text{h}/2)^{2} \times\:(\text{P}\text{r}\text{e}-\text{t}\text{x}\:\text{U}\text{S}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{T}\text{h}\text{i}\text{c}\text{k}\text{n}\text{e}\text{s}\text{s}) - (\text{P}\text{o}\text{s}\text{t} -\text{t}\text{x}\:\text{U}\text{S}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{B}\text{a}\text{s}\text{e}\:\text{W}\text{i}\text{d}\text{t}\text{h}/2)^{2}\:\times\:(\text{P}\text{o}\text{s}\text{t} -\text{t}\text{x}\:\text{U}\text{S}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{T}\text{h}\text{i}\text{c}\text{k}\text{n}\text{e}\text{s}\text{s})\:}{(\text{P}\text{r}\text{e} -\text{t}\text{x}\:\text{U}\text{S}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{B}\text{a}\text{s}\text{e}\:\text{W}\text{i}\text{d}\text{t}\text{h}/2)^{2}\times\:(\text{P}\text{r}\text{e} -\text{t}\text{x}\text{U}\text{S}\:\text{G}\text{a}\text{s}\text{t}\text{r}\text{i}\text{c}\:\text{L}\text{e}\text{s}\text{i}\text{o}\text{n}\:\text{T}\text{h}\text{i}\text{c}\text{k}\text{n}\text{e}\text{s}\text{s})}$$

V_RECIST Group(Volumetric RECIST Group): To combine strengths of CT and ultrasound, this cohort used an approximate volume formula, integrating: CT-measured maximum lymph node short-axis diameter, CT-measured gastric lesion thickness, and Ultrasound-measured gastric lesion basal width.Pre- and post-treatment changes in lymph node and gastric tumor volumes were modeled to optimize monitoring accuracy.

Evaluation formula: Shrinkage rate=

$$\:\frac{\text{t}\text{h}\text{e}\:\text{S}\text{h}\text{r}\text{i}\text{n}\text{k}\text{a}\text{g}\text{e}\:\text{r}\text{a}\text{t}\text{e}\:\text{o}\text{f}\:\:\text{U} - \text{R}\text{E}\text{C}\text{I}\text{S}\text{T}\:\text{G}\text{r}\text{o}\text{u}\text{p}}{2} + \frac{\text{P}\text{r}\text{e}-\text{t}\text{x}\:\:\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}^{2}\: - \text{P}\text{o}\text{s}\text{t} -\text{t}\text{x}\:\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}^{2}\:}{2\times\:\text{P}\text{r}\text{e} -\text{t}\text{x}\:\:\text{C}\text{T}\:\text{L}\text{y}\text{m}\text{p}\text{h}\:\text{N}\text{o}\text{d}\text{e}\:\text{D}\text{i}\text{a}\text{m}\text{e}\text{t}\text{e}\text{r}^{2}\:}$$

Category 2 efficacy evaluation

Patients were stratified into three groups based on TRG classification: MPR Group (TRG 0–1), Partial Response Group (TRG 2), and No Response Group (TRG 3). A regression model and nomogram were developed using ordinal logistic regression for univariate and multivariate analyses to identify key predictors,. Model performance was assessed via ROC curves, calibration curves, DCA curves, and ten-fold cross-validation.

Statistical analyses

Statistical analyses were carried out with SPSS 20.025, and categorical variables were summarized as counts (%).Agreement between TRG and imaging assessments was evaluated via ROC curves, determining optimal cut-off values, sensitivity, specificity, and P-values (P < 0.05 indicated significance). Calibration curves and DCA were generated using R 4.4.026, alongside ordinal logistic regression (univariate and multivariate) and nomogram construction. Model performance was validated through ROC curves, calibration curves, DCA curves, and ten-fold cross-validation.

Results

Clinical data statistics

This study utilized data from 75 patients enrolled in phase II and some phase III studies on “Tislelizumab combined with Apatinib and chemotherapy as neoadjuvant treatment for Borrmann IV type, large Borrmann III type, and Bulky N LAGC”. Among them, 56 (74.6%) were male, and 40 (53.3%) presented with poorly differentiated gastric tumors. By GC type, 47 (63.7%) had Large Borrmann III Type, and 26 (34.6%) achieved TRG grade 2. Detailed patient characteristics are summarized in Table 1.

Table 1 Participants’ clinical characteristics.

Establishment of model category 1 - Efficacy evaluation model based on tumor diameter according to RECIST (Version 1.1)

1) Model Development and Evaluation

Efficacy groups were defined by TRG grades: Grades 0–2 (effective), Grade 3 (ineffective). ROC analysis showed the V_RECIST group had the highest area under the receiver operating characteristic curve [AUC] (0.951) at the optimal cutoff, indicating superior predictive accuracy. The UC_RECIST, CT_RECIST, and U_RECIST groups achieved AUCs of 0.909, 0.933, and 0.906, respectively, all non-inferior to RECIST_Expansion (AUC = 0.915) (Table 2; Fig. 1a). All models demonstrated good sensitivity and specificity.Calibration curves confirmed strong agreement between predicted and observed outcomes (Mean Absolute Error (MAE), Mean Squared Error (MSE), and 0.9 Quantile of absolute error(Q90 (AE))(Fig. 1b; Table 3). DCA curves validated clinical utility, with V_RECIST providing net benefits above ALL/None thresholds. Other groups also showed satisfactory performance (Fig. 1c).

Table 2 ROC curve analysis of neoadjuvant therapy response modelsr: comparing TRG 0–2 vs. TRG 3 and TRG 0–1 vs. TRG 2–3.
Table 3 Calibration curve data of neoadjuvant therapy response modelsr: comparing TRG 0–2 vs. TRG 3 and TRG 0–1 vs. TRG 2–3.
Fig. 1
Fig. 1
Full size image

(a) ROC Curve Comparing the (TRG 0–2) and the (TRG 3). (b) Calibration Curves Comparing the (TRG 0–2) and the (TRG 3). (c) DCA Curves Comparing the (TRG 0–2) and the (TRG 3). (d) ROC Curve Comparing the (TRG 0–1) and the (TRG 2–3). (e) Calibration Curves Comparing the (TRG 0–1) and the (TRG 2–3). (f) DCA Curves Comparing the (TRG 0–1) and the (TRG 2–3).

2) Evaluation by MPR

The patients were reclassified into TRG 0–1 (MPR) and TRG 2–3 (Non-MPR) groups. ROC analysis showed V_RECIST achieved the highest AUC (0.868) at the optimal cutoff, outperforming UC_RECIST (0.804), CT_RECIST (0.842), U_RECIST (0.811), and RECIST_Expansion (0.778) (Table 2; Fig. 1d). All models demonstrated acceptable sensitivity and specificity.Calibration metrics (MAE, MSE, 0.9 Quantile of Absolute Error) confirmed strong alignment between predicted and actual outcomes (Table 3). DCA curves validated clinical utility, with V_RECIST consistently yielding net benefits above “ALL” and “None” thresholds. Other models also showed satisfactory performance for diverse scenarios (Fig. 1e, f).

Establishment of the second category prediction model: ordered logistic regression and nomogram model

To develop the second category prediction model, patients were stratified into three TRG groups: MPR (TRG 0–1), Partial Response (TRG 2), and No Response (TRG 3). Variables included imaging data and pre-treatment biomarkers: CPS, differentiation, neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), prognostic nutritional index (PNI), and systemic immune-inflammation index (SII), all previously linked to treatment efficacy. Ordered logistic regression analyses identified ultrasound-based basal width reduction rate (p = 0.001), CT-based lymph node short-axis reduction rate (p = 0.000), and CT-based tumor thickness reduction rate (p = 0.024) as key predictors (Table 4). These variables formed the regression model (Supplementary Table 1). Nomograms for the three indicators were constructed (Fig. 2) and validated via ROC curves: AUC = 0.952 for TRG_0–1 + TRG_2 vs. TRG_3, AUC = 0.870 for TRG_0–1 vs. TRG_2 + TRG_3, AUC = 0.900 for TRG_3 identification (Fig. 3a, b). Calibration curves confirmed strong alignment between predicted and actual outcomes. DCA curves demonstrated clinical utility (Fig. 3c, d). Given the small sample size, 10-fold cross-validation (70% training, 30% testing) yielded multinomial logistic regression accuracy = 0.73, Dxy = 0.725, and R² = 0.757 (Supplementary Table 2). Model applications are summarized in Table 5.

Table 4 Univariate and multivariate analysis of ordered logistic regression.
Fig. 2
Fig. 2
Full size image

Nomogram for predicting TRG.

Fig. 3
Fig. 3
Full size image

(a, b) ROC curves for the nomogram predicting TRG. (c) Calibration curve for the nomogram. (d) DCA curve for the nomogram.

Table 5 Application scenarios of different Models.

Discussion

In Western nations, perioperative chemotherapy is now the standard for LAGC, supported by trials like MAGIC27 FNCLCC/FFCD 970328, and the FLOT4-AIO study. The NCCN guidelines prioritize this approach, a recommendation shared by Asian scholars through studies like China’s RESOLVE and Korea’s PRODIGY29. PD-1 inhibitors, initially effective in melanoma, have expanded to GC. CheckMate-64930 demonstrated the first-line efficacy of PD-1 inhibitors + chemotherapy in LAGC, leading to NMPA approval of Opdivo for advanced/metastatic gastric, gastroesophageal junction (GEJ), or esophageal adenocarcinoma. In China, sintilimab (Orient-16, 2022) was approved for unresectable locally advanced/recurrent/metastatic GC or GEJ adenocarcinoma when combined with oxaliplatin + capecitabine. Neoadjuvant immunotherapy + chemotherapy has shown promise in GC, with studies like Jia et al.31 and Lin et al.18 reporting improved response rates and outcomes in locally advanced GEJ adenocarcinoma.

However, evaluating neoadjuvant treatment efficacy in LAGC remains highly Limited. The current RECIST 1.1 criteria classify the primary gastric lesion as a hollow organ, excluding it from measurable targets. Furthermore, metastatic lymph nodes must have a short-axis diameter ≥ 1.5 cm to be considered measurable, which significantly restricts the accuracy and applicability of response assessment. In our study, 69.3% of patients had lymph nodes < 1.5 cm before treatment, leaving many without measurable lesions and thus no basis for standardized evaluation. Additionally, this leads to the exclusion of advanced patients from trials, denying them access to potentially life-saving investigational therapies.

Emerging approaches have been explored for efficacy evaluation in GC, For instance: Yan et al.32 found contrast-enhanced ultrasound predicted chemotherapeutic responses with 81.8% sensitivity and 66.7% specificity. Wang et al.33 demonstrated CT-based texture models outperformed subjective imaging features (97.3% vs. 56.0% accuracy). Yuan et al.34 reported DCE-MRI and IVIM-DWI parameters achieved an AUC of 0.92, with 87.0% sensitivity and 85.0% specificity. Wang35 combined radiomics/deep learning with PD-L1 expression, improving AUC by 4–7%.Zurlo IV et al.36 highlighted histological subtype impacts, while Liu C. et al.37 developed a recurrence-based prognostic model. Alternative criteria Like irRECIST, imRECIST, EORTC, and PERCIST 1.0 have been proposed to better assess immunotherapy outcomes and prognosis. However, none have replaced RECIST 1.1 as the standard.

Gastric cancer’s primary tumor is often flat-cylindrical, We observed that treatment-sensitive features detectable by ultrasound at millimeter precision (e.g.basal width and lesion thickness changes). Based on this, we evaluated whether ultrasound-integrated assessments could enhance or replace RECIST 1.1. If validated, this approach could expand RECIST 1.1’s applicability, benefiting more patients. We designed two models using high-resolution CT and color Doppler ultrasound: Modified RECIST Framework: RECIST_Expansion: Lymph node assessment only. UC_RECIST: CT-ultrasound integration (gastric lesions as measurable targets). CT_RECIST: CT-only (for facilities without ultrasound). U_RECIST: Ultrasound-only (enables frequent monitoring). V_RECIST: Multimodal volume modeling (geometric approximation). Visualized Nomogram: Built via ordinal regression, offering graphical, individualized predictions of nonlinear outcomes.

The RECIST_Expansion group demonstrated that lowering the lymph node short-axis threshold to 0.7 cm (vs. RECIST 1.1’s ≥ 1.5 cm) enabled accurate differentiation between effective/ineffective chemotherapy and MPR/non-MPR groups (AUC: 0.915 and 0.778). Calibration and decision curve analyses confirmed this adjustment maintains high accuracy. However, due to imaging variability, centers should adapt thresholds based on technical capabilities and validation data. The UC_RECIST group explored gastric lesions as measurable targets by combining ultrasound-derived basal width (longest diameter) with CT-based lymph node short-axis changes. It achieved AUCs of 0.909 and 0.804, validating gastric lesions’ feasibility as targets. For CT_RECIST, designed for ultrasound-limited centers, CT-only metrics (e.g., maximum lesion thickness) yielded AUCs of 0.933 and 0.842, showing strong diagnostic potential despite CT’s cost and radiation exposure limitations.The V_RECIST group integrated ultrasound and CT data to develop an approximate volumetric calculation formula aiming to improve accuracy. It chieving the highest AUC (0.951 and 0.868) among tested models.The U_RECIST group used ultrasound-only metrics for gastric lesion assessment. Compared to CT’s high cost and radiation exposure, ultrasound offers a low-cost, radiation-free alternative suitable for frequent monitoring in precision medicine, enabling early detection and personalized treatment. It achieved AUCs of 0.906 and 0.811, slightly lower than V_RECIST but still highly accurate. U_RECIST focused exclusively on primary lesion changes (not lymph nodes). Given that almost all advanced gastric lesions can be measured by ultrasound, this approach has broad applicability, enabling a large number of lymph node-negative GC patients to receive efficacy evaluations and benefit accordingly.

The second model category includes ordinal logistic regression and nomograms, offering visual representation of nonlinear statistical predictions. Patients were classified into three TRG-based groups: MPR (TRG 0–1), partial response (TRG 2), and no response (TRG 3). Univariate analysis incorporated systemic inflammation and nutritional indicators: SII38 PNI39 pre-treatment PLR, and NLR40,41,42. Despite prior links to tumor progression, these metrics showed no correlation with pathological tumor reduction—a discrepancy likely due to tumor shrinkage’s non-exclusive role in prognosis43. While SII/PNI/PLR/NLR may influence prognosis, their impact on neoadjuvant therapy response remains unclear and requires large-scale validation. Combined Positive Score (CPS) quantifies PD-L1 expression via the ratio of PD-L1-positive cells (tumor, lymphocytes, macrophages) to viable tumor cells. Higher CPS is linked to better immunotherapy outcomes44 yet no strong CPS-TRG correlation was observed, possibly due to chemotherapy/targeted therapy combinations diluting immunotherapy’s individual effect. Three key predictors emerged: Ultrasound-measured basal width reduction (p = 0.001) CT lymph node short-axis reduction (p = 0.000) CT tumor thickness reduction (r = 0.024) A nomogram integrating these factors demonstrated superior predictive accuracy and visual clarity compared to RECIST’s linear percentage method. Nomograms, although involving more sophisticated statistical modeling and calculation procedures, demonstrate improved predictive accuracy and visualization. We anticipate that the accumulation of additional clinical data will enable further refinement of the model’s performance, thereby facilitating its broader adoption in clinical settings.

Conclusion

Our study comprises six models across two categories. Among these, five RECIST 1.1 modified models exhibit strong performance in evaluating neoadjuvant therapy responses in LAGC patients. The V_RECIST model demonstrates the highest predictive accuracy, while the ultrasound-only U_RECIST model offers cost-effectiveness, radiation-free imaging, and clinically acceptable accuracy, positioning it as a viable option for clinical adoption. The nomogram further improves predictive capacity through advanced statistical modeling and visual clarity. However, real-world validation and model refinement are essential for clinical implementation.