Abstract
To determine a direct method for diagnosing axillary lymph node (ALN) tumor burden preoperatively in cT1-2N0 breast cancer patients, we developed and validated a deep learning (DL) model based on ultrasound (US) images of sentinel lymph nodes (SLNs) detected by contrast-enhanced lymphatic ultrasound (CEUS). Women with cT1-T2N0 breast cancer who received CEUS were enrolled prospectively from Peking Union Medical College Hospital between April 2020 and July 2021 and from Sichuan Cancer Hospital between April 2022 and July 2022. Heavy ALN tumor burden was defined as > 2 metastatic lymph nodes according to the Z0011 criteria. We developed a DL model, the modality-adaptive network with clinicopathological information (MAN + C), using grayscale or color Doppler US images and radioclinicopathological information to predict heavy tumor burden. A total of 595 SLNs from 374 patients met the inclusion criteria. The areas under the receiver operating characteristic curve (AUCs) were calculated to evaluate the predictive performance of the model, yielding values of 0.91[95%CI(confidence interval):0.899–0.943], 0.98[95%CI:0.950-1], 0.89[95%CI:0.850–0.935], and 0.84[95%CI:0.811–0.869] in the training, validation, independent and external test datasets, respectively. Compared to current artificial intelligence (AI) models, this model extends usability by 30%, encompassing patients with multifocal lesions or those who have undergone primary breast cancer lesion therapy. Using this model, 88.9% of patients could safely decide whether they could waive unnecessary SLN biopsy. MAN + C provided a direct and efficient method for accurate preoperative assessment of ALN tumor burden in cT1-2N0 breast cancer patients.
Similar content being viewed by others
Introduction
Breast cancer remains the most frequently diagnosed malignancy among women worldwide1. Axillary lymph node (ALN) status is a critical determinant for tumor staging, prognosis, and treatment decision-making in patients with early breast cancer2. The sentinel lymph node (SLN), defined as the first node(s) within the ALN drainage basin, is the key target for axillary staging. SLN biopsy has replaced ALN dissection as the standard surgical procedure for early-stage patients without clinically evident axillary metastasis3. Recently, several large de-escalation trials — including SENOMAC4, AMAROS5, and the more recent INSEMA6 trials, have further shifted surgical axillary management toward minimal invasiveness. INSEMA demonstrated that omission of SLNB may be safe in selected cT1–2, clinically node-negative patients. However, these trials primarily focus on patients with low axillary tumor burden and address whether axillary surgery can be safely reduced or avoided.
In contrast, accurate preoperative stratification of ALN tumor burden remains essential, because a subset of clinically node-negative (cN0) patients still harbor high tumor burden and may not be appropriate candidates for omission of SLNB. Timely identification of this subgroup would improve the safety of selectively omitting SLNB in low-risk patients, streamline surgical decision-making, and enhance the overall patient experience.
Axillary ultrasound (US) is the primary imaging method used to preoperatively assess ALN status7. However, there is great variation in the diagnostic accuracy of axillary US for detecting ALN metastasis (sensitivity from 48.8% to 87.1%; specificity from 55.6% to 97.3%)8,9,10, especially for clinically negative LNs. The main reason may be the great overlap of morphology between LNs with and without metastasis. Small metastases in lymph nodes resulting in minor changes in US images can result in false negatives. Nonspecific lymph node enlargement and cortical thickening are irrelevant to metastasis and can result in false positives10,11,12. Moreover, conventional axillary US has limited value for identifying the SLN(s) and predicting the lymph node tumor burden13.
Deep learning (DL) has been widely used in the medical field and can effectively improve the ability of imaging diagnosis14. Previous studies have proven that the DL of US of primary breast cancer lesions can predict ALN metastasis status, for which AUCs varies from 72% to 90%15,16,17,18,19,20,21,22,23. Nevertheless, such approaches are indirect, relying on features of the primary tumor rather than on nodal characteristics themselves. They cannot be applied to patients with multifocal lesions or those who have received prior local treatment15,17,18. Importantly, multifocal or treated cases are not rare, with reported proportions ranging from 6% to 60%24. Therefore, developing DL models that rely solely on ALN or SLN imaging features is essential for robust and generalizable preoperative axillary evaluation. To date, only a few DL models based on SLN images have been developed specifically for tumor burden stratification in cT1-2N0 breast cancer patients. CEUS can accurately identify SLN(s) by tracing lymphatic vessels, offering the opportunity for direct assessment of the SLN(s) in this study11,25.
Our study aimed to develop and validate a DL model based on SLN US images to evaluate the preoperative tumor burden of axillary lymph nodes in patients with cT1-2N0 breast cancer, with the goal of identifying those at high risk of extensive nodal involvement who may directly benefit from ALND.
Patients and methods
Development cohort and clinical test cohorts
This study was a prospective multicenter study and was approved by the Ethics Committee of Peking Union Medical College Hospital (approval number: ZA-2291). This study was registered at www.chictr.org.cn with the clinical trial registration number ChiCTR2000031231 on March 25, 2020. All the participants signed informed consent forms. We enrolled consecutive patients from the breast surgery department of Peking Union Medical College Hospital between April 2020 and July 2021 and Sichuan Cancer Hospital between April 2022 and July 2022. The inclusion criteria were as follows: (a) histopathologically confirmed clinically staged T1-T2 invasive breast cancer; (b) clinically negative ALNs; (c) successful detection of the SLN with CEUS; and (d) SLN biopsy for axillary lymph node staging. The exclusion criteria were as follows: (a) received axillary therapy (biopsy, neoadjuvant chemotherapy); (b) lacked full-screen conventional (grayscale or color Doppler mode) US images of SLNs identified by CEUS; and (c) incomplete clinicopathological information.
Acquisition of grayscale SLN images and pathological outcomes of ALNs
All sonographic examinations were performed preoperatively by experienced radiologists specializing in breast US (Q.Z., M.L., more than 20 years; J.L., 9 years; Z.N., 7 years) using high-frequency linear array transducers on the iU22 and EPIQ 7 (Philips-Advanced Technology Laboratories, Bothell, WA, USA, L12-5MHZ) machines. The SLN(s) were identified as the first visualized LN(s) traced by enhanced lymphatic channels on CEUS imaging. The dual display mode showed both CEUS and grayscale images simultaneously. Then, full-screen, grayscale images of as many sections of SLNs as possible were acquired. One or two days later, the surgeons identified the SLN(s) by methylene blue dye and indocyanine green. Both micro- and macromatastases were defined as metastases. The details of the percutaneous CEUS procedure and surgical management of ALNs are provided in Supplementary material 1.
Pathological assessment
Estrogen receptor (ER), progesterone receptor (PgR), and human epidermal growth factor receptor 2 (HER2) status were evaluated by immunohistochemistry (IHC) on surgical or biopsy specimens. ER and PgR positivity were defined as ≥ 1% of tumor cell nuclei showing immunoreactivity, according to current ASCO/CAP guidelines. HER2 status was determined following the ASCO/CAP recommendations: IHC 3 + was considered positive, whereas IHC 2 + cases underwent in situ hybridization (ISH), and HER2 amplification by ISH was classified as HER2-positive. Molecular subtypes were assigned based on ER, PR, HER2 status, and Ki67 index according to the St. Gallen consensus. Histological grade was determined using the Nottingham Grading System, incorporating tubule formation, nuclear pleomorphism, and mitotic count. Ki67 was assessed by IHC, and a 20% cutoff was used to distinguish low from high proliferative activity, consistent with current clinical practice.
Input preparation (images and clinical characteristics)
The input images (manual cropping) were conducted by experienced radiologists who meticulously tailored each image, ensuring that the focal sentinel lymph node (SLN) occupied the predominant portion of the final image. It was also carried out to prevent shape deformation or imbalance between the height and width of the image before feeding images into deep neural networks. These coordinates were subsequently employed for image cropping. Additionally, all images were resized to a standardized size of 512 × 512, accompanied by supplementary image augmentations.
In addition to the US scanning images, we also collected patient characteristics using a patient characteristics data selection procedure. The final set included age, longitudinal and short SLN length, the longitudinal-to-short diameter SLN ratio, cortical thickness, the estrogen value, the progesterone receptor value, the human epidermal growth factor receptor 2 value, and the Ki-67 index of primary breast cancer. For multifocal cases, we consistently used the largest lesion to determine tumor size, molecular subtype, and other primary tumor characteristics. For patients who had received preoperative treatment, we incorporated the pre-treatment biopsy results and, whenever available, tumor size information documented in outside-hospital reports. All the above data serve as their respective inputs into our proposed model. Further preprocessing details are provided in Supplementary material 2.
Modality-adaptive network development
The training set was used for optimizing the parameters of our proposed modality-adaptive network with clinicopathological information (MAN + C). The MAN + C pipeline is shown in Fig. 1. IBN-ResNet26 was adopted as the backbone of our DL model. The optimization was performed on top of the IBN-ResNet model that was pretrained on the ImageNet dataset. The bulk of the model was maintained except that the output linear layer was replaced by a squeeze excitation27 module followed by a custom classification block tailored for predicting the ALN tumor burden. Moreover, we analyzed 9 different clinical characteristics, as mentioned in the last section. The 15-element input, comprising 6 single-valued data points and 3 × 3 categorical encodings representing null, 1 and 0, is processed through a dedicated patient data network (PDN) with three distinct fully connected layers. After obtaining US image features and clinical characteristic features, we adopted a feature fusion module for merging them together, in which both outputs of IBN-ResNet and PDN were batch normalized and concatenated. Finally, a feed-forward-layer classifier was added to produce a scalar for each data sample, which was the final prediction of the model. Further details are provided in Supplementary material 3.
The pipeline of the proposed modality-adaptive network for ALN tumor burden prediction.
The location of the SLN(s) can be traced by CEUS. The dual display mode showed both CEUS and grayscale images simultaneously. Then, full-screen, conventional US images of the SLNs were acquired. Patients’ clinical data were processed into PDN information. Any grayscale or color modalities of US images use an IBN-ResNet50-a for feature extraction and a custom classifier for final prediction. The output of the PDN (dimension of 8) linearly concatenates with the output of the IBN-ResNet50-a model (dimension of 128) and passes through a final linear layer for binary prediction.
In this work, we proposed a multipathway DL model (Fig. 1). By conducting comprehensive experiments across training, validation, independent test and external test datasets, we compared 4 different model architectures: IBN-ResNet-a26, EfficientNet-b428, VGG1629, and ResNet5030.
Statistical analysis
Continuous variables are expressed as medians with the first and third quartiles, and categorical variables are expressed as numbers and percentages (%). Kolmogorov‒Smirnov tests were used for normally distributed data. Continuous variables were compared using one-way analysis of variance or the Kruskal–Wallis test, as appropriate. Categorical variables were compared using the chi-square test or Fisher’s exact test. A two-sided P value < 0.05 was considered statistically significant. In our proposed models, the AUC and F1 score were used as the main parameters for the prediction metrics. The F1 score was included because the dataset exhibited class imbalance between patients with low and heavy ALN tumor burden. As the harmonic mean of precision and recall, the F1 score provides a balanced metric that better reflects the model’s ability to correctly identify patients with heavy tumor burden compared with accuracy alone. Low tumor burden was defined as isolated tumor cells or micrometastasis (≤ 2 mm) and ≤ 2 positive lymph nodes. Heavy tumor burden was defined as the presence of at least one macrometastasis (> 2 mm) and/or ≥ 3 positive axillary lymph nodes. This threshold was used as the binary classification endpoint for all model evaluations. The strategy used to find the optimal threshold was to optimize the geometric mean (G-mean):
which is a balance between sensitivity and specificity. Choosing the threshold that yielded the largest G-mean value will certify the threshold as the optimal threshold. All the statistical tests were two-sided, and P < 0.05 was considered to indicate statistical significance. All the statistical analyses were performed with SPSS (version 26.0 IBM), Python 3.8 and the Numpy library, and the statistical graphs were generated with Matplotlib.
Results
We prospectively included 222 patients from the inpatient department of breast surgery at Peking Union Medical College Hospital between April 2020 and March 2021 as the development dataset; these patients were randomly divided into training and validation sets at a ratio of 8:2. After one month, we included 53 patients between May 2021 and July 2021 as an independent test dataset. We further enrolled 99 patients from Sichuan Cancer Hospital between April 2022 and July 2022 as an external test dataset. The detailed baseline data of the patients are listed in Table 1. Figure 2 shows the patient recruitment workflow.
Overview of the patient enrollment workflows.
Base model selection
The statistical results of base models are shown in Supplementary Table (1) The training, validation, and independent test ROC curve comparisons are illustrated in Fig. 3 (abc). The IBN-ResNet-a outperformed the other models on the test set. All models were then evaluated for performance with patient characteristics concatenated after the last custom classification block. The statistical results are shown in Supplementary Table (2) The training, validation, and independent test ROC comparisons are illustrated in Fig. 3(def). In this category, IBN-ResNet ranked first as well. The integration of patient characteristic data also improved overall performance by approximately 20%.
ROC curves for different base models with or without clinicopathological characteristics in the training, validation and independent test cohorts. The figures display ROC curves comparing various base models for predicting ALN tumor burden in the training (a), validation (b), and independent test (c) sets. The figures present the performance evaluation of each model with clinicopathological characteristics in predicting ALN tumor burden in the training (d), validation €, and test (f) sets.
Performance of the models
Aided by the patient characteristics data, MAN + C achieved AUCs of 0.91(95% CI: 0.899–0.943), 0.98(95% CI: 0.950-1), 0.89 (95% CI: 0.850–0.935) and 0.84 (95% CI: 0.811–0.869) on the training, validation, independent and external test datasets, respectively, as shown by the ROC curves in Fig. 4. The model showed consistent improvement across the two test groups, which indicated its robustness to the factors of time and hospital type.
Threshold-dependent performance was further evaluated to determine clinically meaningful operating points for SLNB avoidance (Supplementary Table 3). At the conservative threshold of 0.1, sensitivity and NPV reached 100% but at the cost of an excessively high false-positive rate (specificity 43.5%). A threshold of 0.2 provided a more balanced clinical profile, maintaining high sensitivity (92.9%) and very high NPV (98.6%) while substantially reducing false positives (specificity 83.5%). Therefore, 0.2 was selected as the primary decision threshold for SLNB-avoidance analysis.
ROC curves for the ability of MAN+C to predict ALN tumor burden. The blue, orange, green and red curves represent the performance of MAN+C on the training, validation, independent and external test sets, respectively.
To explore the effect of clinicopathological characteristics, we removed radioclinicopathological information step by step according to clinical accessibility. Using the full set of patient characteristic data yielded the most promising results, especially the addition of C2 and C3 (C2 represents SLN min/max length and ratio, C3 represents cortical thickness), with AUCs ranging from 0.58 to 0.84 (Table 2; Fig. 5). The characteristics of C2 and C3 pertain to the US images of SLNs. In summary, as each group of patient characteristics was added, a steady improvement in performance was observed. When we used only clinicopathological information to predict ALN tumor burden, the area under the curve (AUC) reached 0.82(95% CI: 0.762–0.884), which indicates that the invisible information in US images could improve the AUC by 8.5% (Supplementary Fig. 1).
ROC curves for MAN+C with stepwise removal of patient characteristic groups. The notation C after each type of model indicates the addition of patient characteristic data. C1 represents age, C2 represents SLN min/max length and ratio, C3 represents cortical thickness, and C4 represents ER, PR, Her-2 and Ki-67.
Benefit and interpretability of MAN
As shown in Fig. 6(a), the MAN + C could be used to directly determine the ALN tumor burden independent of the morphology of the primary breast lesion, as more than 30% of patients with multifocal lesions or who received any primary lesion treatment could benefit from our model. In addition, even in the external dataset, which showed the worst performance, 88.9% of patients with cT1-2N0 breast cancer could receive accurate ALN tumor burden assessments, enabling the subsequent development of individualized treatment management by using this model.
We applied gradient-weighted class activation mapping (Grad-CAM)31 to generate a heatmap of the input image based on the prediction result of MAN + C. The heatmap shows the importance of the region in which the model paid the most attention. Figure 6(b) shows a visual representation from the output of Grad-CAM. The MAN generally matched the ROI. Depending on the geometry of the LN, the model highlighted the entry and exit drainage of the LN, which showed the model’s advanced understanding of the LN features.
The overall benefit of MAN+C and its visualization map. (a) The pie chart above shows the proportions of different lesions in cT1-2N0 patients who underwent axillary lymph node CEUS in the four datasets. The additional patients who were unable to benefit from earlier models relying solely on primary breast lesion data could now resort to our model, constituting 31% of the total patient population. The pie chart illustrates the proportion of patients diagnosed using MAN+C in the external validation dataset, with 88.9% of them accurately identified for lymph node tumor burden. (b) Grad-CAM visualization of four patient examples. The red region represents the larger attention given by the deep neural network. Note that since both manual cropping images and original images are square, not rectangular, the image is not scaled down equally.
Automatic detection reduces the impact of operator-dependence
Additionally, operator-dependence has always been a difficulty of DL model in US. While in our study, we incorporated saliency detection cropping into our prediction pipeline to fully automate the end-to-end prediction process, from which the computer could automatically identify where the lymph nodes were located. Experiments showed that our proposed model was still able to yield promising results (AUC of 0.82 on the independent test set) (Supplementary Fig. 2) with the new saliency detection US image cropping and suffered a minor performance drop compared to manual cropping. This opened exciting possibilities for a fully automated end-to-end prediction pipeline without any human intervention, which avoided the error and non-repeatability of manual operations and further improved the degree of automations.
Discussion
In this study, we applied CEUS for tracing SLN(s) to establish a one-to-one correlation between US images of SLN(s) and pathological results. Then, we meticulously designed the MAN + C model using these images with radioclinicopathological information to predict ALN tumor burden. The model demonstrated competitive performance compared to state-of-the-art models based on primary breast cancer lesions. Integrating radiologists’ prior knowledge with high-throughput imaging information from US images significantly enhances diagnostic capabilities. The wealth of quantitative imaging data extracted from these images not only complements the interpretative expertise of radiologists but also has the potential to substantially improve diagnostic accuracy when integrated with existing radiological readings and clinical pathological information.
Axillary ultrasonography is the standard initial tool for evaluating ALN status, but its sensitivity for detecting nodal tumor burden is limited. Although DL- and radiomics-based approaches have emerged, most high-quality models still rely on primary tumor features15,17,18, resulting in exclusion of patients with multifocal disease or prior treatment17,18,32. Consequently, there remains a clinical need for a direct assessment method independent of primary lesion status. Since sentinel lymph node (SLN) pathology is strongly associated with ALN involvement, nodal tumor burden, and survival33, our MAN approach addresses this unmet need because it is not limited by primary tumor characteristics, including size, multifocality, or pretreatment.
Recent DL studies have attempted to classify ALN metastasis directly from US images, yet they suffer from methodological constraints. Ozaki J. et al.34 compared only normal versus heavily metastatic nodes; David C. et al.35 excluded US-negative metastases, inflating positive rates; and Shawn S. et al.36 achieved an AUC of only 0.72. A key obstacle is the lack of one-to-one correspondence between preoperative ALN images and pathological references, which diverges from guideline-recommended SLN-based N staging. By employing CEUS-identified SLNs as the pathological standard, our study aligns with current practice and addresses the challenge of imaging-negative nodes, achieving comparatively high performance in real-world data.
Our study fully explored the capability of the DL methodology. The design involved the integration of both US images and selected patient characteristics into a unified network comprising two independent networks, the IBN Resnet 50 and the PDN. The new design took advantage of domain knowledge, mainly the incorporation of direct measurements of lymph node parameters from these images, which are crucial for enhancing the predictive power of our model. The relevance of all clinicopathological information included in our model to ALN metastasis has been validated in previous studies37,38,39,40,41. Because these clinical variables were jointly optimized within the PDN rather than modeled separately, their combined predictive contributions were inherently captured. Additionally, studies and experiments were conducted on patient data selection to retain only useful patient characteristic data while maintaining a high prediction performance comparable to that of models from prior studies. The robust performance on independent tests reflects the model’s resilience over time and across different hospital settings.
Compared with the previous application of multimodal imaging data such as elasticity and CEUS of primary breast cancer lesions18, MAN requires only grayscale or color Doppler images acquired by conventional axillary US. The implication of this is that we do not have stringent requirements for the quality of other modality images, which simplifies the image acquisition process and makes it more feasible to implement. This approach reduces the complexity and enhances the practicality of the procedure, allowing for broader application in various clinical settings without the need for specialized or advanced imaging equipment. A potential source of bias in our study is the manual cropping of ultrasound images during preprocessing. Although the regions of interest were selected by experienced radiologists using a standardized protocol, this step remains operator-dependent and may introduce subtle variability. Manual cropping is also time-consuming and limits scalability. Future incorporation of automated lesion localization or segmentation methods could improve reproducibility and reduce human-related variation.
However, this study has some limitations. Firstly, since CEUS of ALNs is not yet popular enough, large and multicenter external validation is warranted to verify the generalizability of this model for accurate diagnosis of ALN metastasis. Secondly, all the US images used to develop our model were of SLNs, and further work is needed to explore whether MAN combined with fully automated detection technology still performs well for whole axillary scanning. Thirdly, this study was designed as an observational diagnostic modeling study, and the model outputs did not influence clinical management. Therefore, no treatment-related or recurrence-related differences exist between model-defined groups, and time-to-event analyses such as Kaplan–Meier curves cannot be meaningfully performed. Future prospective studies, including potential randomized trials using the model to guide axillary management, are needed to determine whether risk stratification translates into differences in recurrence outcomes. Lastly, in this exploratory diagnostic modeling study, the DL model provides preoperative risk stratification rather than guiding axillary treatment decisions. Its potential clinical impact—such as identifying patients who might safely avoid ALND —requires prospective validation in future trials and is beyond the scope of the present study. Therefore, we were unable to compare model performance with radiologists because estimating the exact number of metastatic lymph nodes preoperatively in cN0 participants is not feasible for human readers, and no clinical benchmark exists for this task.
MAN + C based on SLN US with clinicopathological information provides a direct and efficient method for accurate preoperative assessment of ALN tumor burden in cT1-2N0 breast cancer patients. These findings will provide new possibilities for determining appropriate axillary treatment options for breast cancer patients. Prospective multicenter validation is expected to provide high-level evidence for clinical application in follow-up studies.
Data availability
The datasets generated and/or analyzed during the current study are not publicly available owing to concerns related to patient privacy, but are available from the corresponding author upon reasonable request. The code is available at (https://github.com/SophieGao6/BC\_SLN\_2dprediction\_tumorburden).
Abbreviations
- ALN:
-
Axillary lymph node
- SLN:
-
Sentinel lymph node
- CEUS:
-
Contrast-enhanced lymphatic ultrasound
- MAN:
-
Modality-adaptive network
- MAN + C:
-
Modality-adaptive network with clinicopathological information
- AI:
-
Artificial intelligence
- AUC:
-
Area under the receiver operating characteristic curve
- ROC:
-
Receiver operating characteristic curve
- US:
-
Ultrasound
- CI:
-
Confidential interval
References
Humphrey, K. L., Saksena, M. A., Freer, P. E., Smith, B. L. & Rafferty, E. A. To do or not to do: axillary nodal evaluation after ACOSOG Z0011 trial. Radiographics: Rev. Publication Radiological Soc. North. Am. Inc. 34 (7), 1807–1816. https://doi.org/10.1148/rg.347130141 (2014).
Liang, Y., Zhang, H., Song, X. & Yang, Q. Metastatic heterogeneity of breast cancer: molecular mechanism and potential therapeutic targets. Semin Cancer Biol. 60, 14–27. https://doi.org/10.1016/j.semcancer.2019.08.012 (2020).
Qiu, S. Q. et al. Evolution in sentinel lymph node biopsy in breast cancer. Crit. Rev. Oncol. Hematol. 123, 83–94. https://doi.org/10.1016/j.critrevonc.2017.09.010 (2018).
de Boniface, J. et al. Omitting axillary dissection in breast cancer with sentinel-node metastases. N Engl. J. Med. 390 (13), 1163–1175. https://doi.org/10.1056/NEJMoa2313487 (2024).
Bartels, S. A. L. et al. Radiotherapy or surgery of the axilla after a positive Sentinel node in breast cancer: 10-year results of the randomized controlled EORTC 10981–22023 AMAROS trial. J. Clin. Oncol.: Official J. Am. Soc. Clin. Oncol. 41 (12), 2159–2165. https://doi.org/10.1200/jco.22.01565 (2023).
Reimer, T. et al. Axillary surgery in breast cancer - primary results of the INSEMA trial. N Engl. J. Med. 392 (11), 1051–1064. https://doi.org/10.1056/NEJMoa2412063 (2025).
Marino, M. A., Avendano, D., Zapata, P., Riedl, C. C. & Pinker, K. Lymph node imaging in patients with primary breast cancer: concurrent diagnostic tools. Oncologist 25 (2), e231–e. https://doi.org/10.1634/theOncologist.2019-0427 (2020).
Valente, S. A. et al. Accuracy of predicting axillary lymph node positivity by physical examination, mammography, ultrasonography, and magnetic resonance imaging. Ann. Surg. Oncol. 19 (6), 1825–1830. https://doi.org/10.1245/s10434-011-2200-7 (2012).
Chen, X. et al. Feasibility of using negative ultrasonography results of axillary lymph nodes to predict Sentinel lymph node metastasis in breast cancer patients. Cancer Med. 7 (7), 3066–3072. https://doi.org/10.1002/cam4.1606 (2018).
Zhu, Y., Zhou, W., Jia, X. H., Huang, O. & Zhan, W. W. Preoperative axillary ultrasound in the selection of patients with a heavy axillary tumor burden in early-stage breast cancer: what leads to false-positive results? J. Ultrasound Med. Official J. Am. Inst. Ultrasound Med. 37 (6), 1357–1365. https://doi.org/10.1002/jum.14545 (2018).
Sever, A. R., Mills, P., Jones, S. E., Mali, W. & Jones, P. A. Sentinel node identification using microbubbles and contrast-enhanced ultrasonography. Clin. Radiol. 67 (7), 687–694. https://doi.org/10.1016/j.crad.2011.11.009 (2012).
Bedi, D. G. et al. Cortical morphologic features of axillary lymph nodes as a predictor of metastasis in breast cancer: in vitro sonographic study. AJR Am. J. Roentgenol. 191 (3), 646–652. https://doi.org/10.2214/ajr.07.2460 (2008).
Ahmed, M. et al. Meta-analysis of tumour burden in pre-operative axillary ultrasound positive and negative breast cancer patients. Breast Cancer Res. Treat. 166 (2), 329–336. https://doi.org/10.1007/s10549-017-4405-3 (2017).
Avanzo, M. et al. Machine and deep learning methods for radiomics. Med. Phys. 47 (5), e185–e202 (2020).
Gao, Y. et al. Nomogram based on radiomics analysis of primary breast cancer ultrasound images: prediction of axillary lymph node tumor burden in patients. Eur. Radiol. 31 (2), 928–937. https://doi.org/10.1007/s00330-020-07181-1 (2021).
Zhou, L. Q. et al. Lymph node metastasis prediction from primary breast cancer US images using deep learning. Radiology 294 (1), 19–28. https://doi.org/10.1148/radiol.2019190372 (2020).
Zheng, X. et al. Deep learning radiomics can predict axillary lymph node status in early-stage breast cancer. Nat. Commun. 11 (1), 1236. https://doi.org/10.1038/s41467-020-15027-z (2020).
Jiang, M. et al. Radiomics model based on shear-wave elastography in the assessment of axillary lymph node status in early-stage breast cancer. Eur. Radiol. 32 (4), 2313–2325. https://doi.org/10.1007/s00330-021-08330-w (2022).
Wu, J., Ge, L., Guo, Y., Xu, D. & Wang, Z. Utilizing multiclassifier radiomics analysis of ultrasound to predict high axillary lymph node tumour burden in node-positive breast cancer patients: a multicentre study. Ann. Med. 56 (1), 2395061. https://doi.org/10.1080/07853890.2024.2395061 (2024).
Liu, H. et al. Deep learning radiomics based prediction of axillary lymph node metastasis in breast cancer. NPJ Breast Cancer. 10 (1), 22. https://doi.org/10.1038/s41523-024-00628-4 (2024).
Gong, C. et al. Computer-assisted diagnosis for axillary lymph node metastasis of early breast cancer based on transformer with dual-modal adaptive mid-term fusion using ultrasound elastography. Comput. Med. Imaging Graph. 119, 102472. https://doi.org/10.1016/j.compmedimag.2024.102472 (2025).
Sun, C. et al. A nomogram based on conventional and contrast-enhanced ultrasound radiomics for the noninvasively prediction of axillary lymph node metastasis in breast cancer patients. Front. Oncol. 14, 1400872. https://doi.org/10.3389/fonc.2024.1400872 (2024).
Yan, M. et al. Comparative analysis of nomogram and machine learning models for predicting axillary lymph node metastasis in early-stage breast cancer: a study on clinically and ultrasound-negative axillary cases across two centers. Ultrasound Med. Biol. 51 (3), 463–474. https://doi.org/10.1016/j.ultrasmedbio.2024.11.003 (2025).
Ahn, S. et al. Genomic profiling of multiple breast cancer reveals inter-lesional heterogeneity. Br. J. Cancer. 122 (5), 697–704. https://doi.org/10.1038/s41416-019-0713-1 (2020).
Cox, K. et al. Validation of a technique using microbubbles and contrast enhanced ultrasound (CEUS) to biopsy Sentinel lymph nodes (SLN) in pre-operative breast cancer patients with a normal grey-scale axillary ultrasound. Eur. J. Surg. Oncology: J. Eur. Soc. Surg. Oncol. Br. Association Surg. Oncol. 39 (7), 760–765. https://doi.org/10.1016/j.ejso.2013.03.026 (2013).
Pan, X., Luo, P., Shi, J. & Tang, X. Two at once: enhancing learning and generalization capacities via ibn-net. In Proceedings of the European Conference on Computer Vision (ECCV) 464–479 (2018).
Hu, J., Shen, L. & Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 7132–7141 (2018).
Tan, M., Le, Q. & Efficientnet Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning: PMLR 6105–6114 (2019).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 770–778 (2016).
Selvaraju, R. R. et al. Grad-cam: visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision 618–626 (2017).
Zhou, W. J., Zhang, Y. D., Kong, W. T., Zhang, C. X. & Zhang, B. Preoperative prediction of axillary lymph node metastasis in patients with breast cancer based on radiomics of gray-scale ultrasonography. Gland Surg. 10 (6), 1989–2001. https://doi.org/10.21037/gs-21-315 (2021).
Lai, B. S., Tsang, J. Y., Li, J. J., Poon, I. K. & Tse, G. M. Anatomical site and size of Sentinel lymph node metastasis predicted additional axillary tumour burden and breast cancer survival. Histopathology 82 (6), 899–911. https://doi.org/10.1111/his.14875 (2023).
Ozaki, J. et al. Deep learning method with a convolutional neural network for image classification of normal and metastatic axillary lymph nodes on breast ultrasonography. Japanese J. Radiol. 40 (8), 814–822. https://doi.org/10.1007/s11604-022-01261-6 (2022).
Coronado-Gutiérrez, D. et al. Quantitative ultrasound image analysis of axillary lymph nodes to diagnose metastatic involvement in breast cancer. Ultrasound. Med. Biol. 45 (11), 2932–2941. https://doi.org/10.1016/j.ultrasmedbio.2019.07.413 (2019).
Sun, S. et al. Deep learning prediction of axillary lymph node status using ultrasound images. Comput. Biol. Med. 143, 105250. https://doi.org/10.1016/j.compbiomed.2022.105250 (2022).
Cui, Q. et al. Contrast-enhanced ultrasound-guided Sentinel lymph node biopsy in early-stage breast cancer: a prospective cohort study. World J. Surg. Oncol. 21 (1), 143. https://doi.org/10.1186/s12957-023-03024-7 (2023).
Li, L., Zhao, J., Zhang, Y., Pan, Z. & Zhang, J. Nomogram based on multiparametric analysis of early-stage breast cancer: prediction of high burden metastatic axillary lymph nodes. Thorac. Cancer. 14 (35), 3465–3474. https://doi.org/10.1111/1759-7714.15139 (2023).
Chen, Y. et al. Automated breast ultrasound (ABUS)-based radiomics nomogram: an individualized tool for predicting axillary lymph node tumor burden in patients with early breast cancer. BMC Cancer. 23 (1), 340. https://doi.org/10.1186/s12885-023-10743-3 (2023).
Song, S. E. et al. Value of breast MRI and nomogram after negative axillary ultrasound for predicting axillary lymph node metastasis in patients with clinically T1-2 N0 breast cancer. J. Korean Med. Sci. 38 (34), e251 (2023).
Yao, J. et al. Machine learning-based breast tumor ultrasound radiomics for pre-operative prediction of axillary Sentinel lymph node metastasis burden in early-stage invasive breast cancer. Ultrasound. Med. Biol. 50 (2), 229–236. https://doi.org/10.1016/j.ultrasmedbio.2023.10.004 (2024).
Acknowledgements
We thank all participants, their caregivers, and their families for participation in this study. Also, we thank Hongda Chen for his valuable assistance on the statistical advice. This study has received funding by CAMS Clinical and translational medicine research specialty (2024-I2M-C&T-B-035), Peking Union Medical College Hospital Talent Cultivation Program (Category C) No.UBJ10852, National High Level Hospital Clinical Research Funding (Grant No. 2025-PUMCH-A-015). We sincerely thank all the patients and their families who participated in this study. Their trust, cooperation, and willingness to contribute made this research possible.
Funding
This study has received funding by CAMS Clinical and translational medicine research specialty (2024-I2M-C&T-B-035), Peking Union Medical College Hospital Talent Cultivation Program (Category C) No.UBJ10852 and National High Level Hospital Clinical Research Funding (Grant No. 2025-PUMCH-A-015).
Author information
Authors and Affiliations
Contributions
YG, DG and QZ was responsible for conceptualization, methodology design, data curation, and drafting of the manuscript. RL, YL and ZN conducted formal analysis and contributed to data curation. JL, ZN and MX provided resources, acquired funding and contributed to manuscript review and editing. YZ, FM and MZ contributed to critical revision of the manuscript. YJ supervised the study, provided resources, and contributed to manuscript review and editing. HL and ML contributed to conceptualization, supervision, and provision of resources. QZ contributed to conceptualization, supervision, provision of resources and manuscript review and editing. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This study was a prospective multicenter study and was approved by the Ethics Committee of Peking Union Medical College Hospital (approval number: ZA-2291). This study was conducted in accordance with the 1964 Helsinki Declaration and its later amendments. This study was registered at www.chictr.org.cn with the clinical trial registration number ChiCTR2000031231 on March 25, 2020. Written informed consent was obtained from all individual participants included in the study.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Gao, Y., Gu, D., Li, J. et al. Preoperative assessment of axillary lymph node tumor burden in cT1-2N0 breast cancer patients with a modality-adaptive network based on sentinel lymph node ultrasound images. Sci Rep 16, 4228 (2026). https://doi.org/10.1038/s41598-025-34371-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-34371-y








