Introduction

Breast cancer is a common malignant tumor that poses a significant threat to human health1. As the first station of breast lymphatic drainage, the axillary lymph node (ALN) plays a crucial role in breast cancer. Precise assessment and management of ALNs are of utmost importance for regional disease control, accurate staging of breast cancer, individualized treatment decision-making, and reliable prognosis evaluation1,2. Currently, sentinel lymph node biopsy (SLNB) and axillary lymph node dissection (ALND) are clinically recognized as the gold standard procedures for assessing the pathological status of ALNs (pNx) in early-stage breast cancer2,3. However, both of these methods are invasive, and complications associated with axillary surgery, such as upper limb lymphedema, shoulder and back pain, arm numbness, and reduced arm strength, are inevitable, significantly impacting patients’ postoperative quality of life4,5. In recent years, the results of several rigorously designed prospective clinical trials, including the SOUND (NCT02167490), BOOG-2013-08 (NCT02271828), INSEMA (NCT02466737), and SOAPET (NCT04072653) trials, have been published. These trials investigated the feasibility of de-escalation surgical approaches for managing ALNs. Additionally, the KEYNOTE-522 study explored the efficacy and safety of novel neoadjuvant treatment regimens for patients with early-stage triple-negative breast cancer, with all reporting positive efficacy data6,7,8,9,10. In this context, current clinical practice demands a more rigorous and scientific approach for the preoperative assessment of ALN status in breast cancer patients, to enable the development of more personalized and precise treatment plans.

Currently, the most commonly used non-invasive methods for clinically assessing the status of of ALN include physical examination and ultrasonography. Compared to traditional physical examination, ultrasonography, with its high-resolution imaging technology, offers a more detailed visualization of the morphological characteristics of ALNs, such as size, shape, boundary clarity, and internal echo properties, thereby enhancing diagnostic accuracy. However, ultrasonography is limited to providing information on lymph node morphology and is subject to operator dependence and subjectivity, which can compromise its accuracy11,12. Although the application of ultrasound-guided fine-needle aspiration cytology (FNAC) greatly makes up for the deficiencies of ultrasonographic diagnosis, the limited number of specimens obtained by fine-needle aspiration (FNA) may not fully reflect the extent of the disease, and the diagnosis is heavily reliant on the experience of the operator and pathologist, leading to a high degree of uncertainty13,14. To overcome the limitations of ultrasound diagnosis, multiple medical centers have developed a series of prediction models, such as the Memorial Sloan Kettering Cancer Center (MSKCC) model and SCH model based on breast tumor pathology features, as well as radiomics models based on PET-CT or breast MRI11,15,16,17. Although these predictive models largely mitigate the issues of operator dependence and subjectivity, they are characterized by incomplete coverage of predictive variables, high costs, procedural complexity, and limited popularizability, which restrict their clinical application12,18.

In practical clinical work, clinicians often need to comprehensively assess a variety of risk factors associated with lymph node metastasis when evaluating the status of ALNs in breast cancer patients. These include physical examination results, imaging features of the breast and ALNs (tumor size, tumor laterality, tumor location, diffuse distribution, multifocality, cluster calcification, etc.), pathological features of breast tumors (lymphovascular invasion (LVI), histologic grade, estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (Her-2) status, Ki-67 expression, etc.), and nonspecific systemic inflammatory markers (neutrophil to lymphocyte ratio (NLR) and platelet lymphocyte ratio (PLR), among others)15,19,20,21,22. Unfortunately, at present, we still lack a pragmatic and convenient model to comprehensively represent the overall significant features of ALNs. In view of this, this study intends to develop and verify such a web-based prediction model, thereby achieving hierarchical management of ALNs in breast cancer, fulfilling the purpose of precision medical treatment, and improving the patient’s quality of life.

Materials and methods

This retrospective study was carried out in line with the moral standards stipulated in the Helsinki Declaration and the International Ethics Guide for Human Biomedical Research issued by the International Committee of Medical Scientific organizations. It was approved by the Medical Ethics Committee of Weifang Hospital of Traditional Chinese Medicine. The data collected in this study were all tested by Weifang Hospital of Traditional Chinese Medicine.

Study population

Clinical data (sex, age, menstrual state, findings on breast physical examination, ultrasound grades of ALNs, tumor imaging size, tumor laterality, tumor location, diffuse distribution, multifocality, cluster calcification, histologic grade, ER, PR, Her-2 state, Ki-67, LVI, neutrophil count, lymphocyte count and platelet count) from 1022 patients with breast cancer treated in our hospital from January 2019 to June 2023 were collected. After excluding patients who did not meet the criteria for admission, 710 patients were ultimately enrolled.

The inclusion criteria were as follows: 1) Invasive breast cancer confirmed pathologically; 2) ALND or SLNB surgery having been performed; 3) Availability of histopathology and immunohistochemistry results for the surgical specimen; 4) Assessment of ALN status and breast masses by ultrasound and mammography within one week prior to biopsy; and 5) Completion of routine blood tests at our hospital with results available before biopsy.

The exclusion criteria were as follows: 1) Incomplete medical records; 2) patients who had undergone neoadjuvant therapy; 3) patients with bilateral or recurrent breast cancer; 4) patients undergoing palliative surgery; 5) patients with other types of malignant tumors; 6) patients with acute or chronic infections and hematological diseases; 7) patients with a history of axillary surgery; and 8) patients who were taking antibiotics, immunosuppressants, or hormone drugs before routine blood tests.

Breast physical exam

All patients underwent a breast physical examination before the relevant auxiliary examination, and the positive breast signs included pathological nipple discharge, asymmetric thickening or nodularity, skin changes (peau d’orange, dimple sign, erythema, nipple excoriation, scaling, eczema, rupture), and secondary inverted nipples3.

Calculation of inflammatory indices

Peripheral venous blood was taken from the upper limb on an empty stomach before biopsy and sent to the laboratory department of our hospital for determination by an automatic hematology analyzer. The neutrophil (N), lymphocyte (L) and platelet (P) counts in the blood results were recorded. NLR = N/L; PLR = P/L23,24.

Ultrasonic evaluation

Two radiologists (with 15 and 10 years of experience in breast ultrasonography, respectively) performed ultrasound evaluation of the ALNs status and breast masses before biopsy. The image features of the ALNs status and masses were observed, measured and recorded. Tumor size was expressed as the longest diameter (cm) of the mass. Tumor laterality was recorded as left breast or right breast. Tumor location was recorded as upper outer quadrant, UOQ; lower outer quadrant, LOQ; upper inner quadrant, UIQ; lower inner quadrant, LIQ; central quadrant, CQ or multiple quadrant, MQ (the mass crosses two or more quadrants). The diffuse distribution of the mass was recorded as Yes or No. The multifocality of the mass was recorded as Yes or No. Multifocality was defined as foci of carcinoma separate from the primary tumor or a mass with diffuse distribution; no distinction was made between multifocality or multicentricity15.

The ultrasound grade of ALNs was divided into a negative group (including Types 1, 2, 3, and 4) and a positive group (including Types 5 and 6) based on cortical morphologic features. The cortical morphologic features of each type of lymph node were defined as follows: Type 1, hyperechoic with no visible cortex; Type 2, thin (< 3 mm) hypoechoic cortex; Type 3, hypoechoic cortex thicker than 3 mm; Type 4, generalized lobulated hypoechoic cortex; Type 5, focal hypoechoic cortical lobulation; and Type 6, totally hypoechoic node with no hilum25. When multiple types of lymph nodes were present in the same patient, the one with the highest grade was selected for inclusion.

Mammography evaluation

The patients underwent mammography before breast biopsy, and each breast was routinely photographed in 2 positions, namely, the craniocaudal position and mediolateral oblique position. A double-blind analysis was conducted by two radiologists (with 5 and 10 years of experience in mammography, respectively) to analyze the mammography features. They reached consensus through consultation when different opinions were expressed. In this study, the image features of abnormal calcification in the breast were defined as 1) amorphous calcification of segment, line and cluster distribution; 2) rough heterogeneous calcification of single cluster distribution; and 3) fine pleomorphic calcification26.

Surgery procedure for ALNs

The surgical methods for assessing ALNs include SLNB and ALND. For patients with clinically node-negative (cN0) status, ALNs were pathologically assessed using either SLNB alone or a combination of SLNB and ALND. For patients with clinically node-positive (cN1 +) status, ALNs were assessed using ALND. The sentinel lymph nodes (SLNs) were identified using blue dye and 99 m-Technetium sulfur colloid. In the protocol, an SLN was defined as any blue-stained node, any node with a blue-stained lymphatic channel leading directly to it, any node with a radioactive count at least 10% of the most radioactive node, or any pathologically palpable node. For patients with no more than two positive lymph nodes identified by SLNB, the decision to perform axillary dissection or not depended on the type of operation (breast-conserving therapy or mastectomy) and individual pathological characteristics 27.

Criteria for evaluating ALNs status

The final status (metastatic or nonmetastatic) of ALNs was postoperatively examined using serial sections with hematoxylin & eosin (H&E) staining. Macrometastases (> 2 mm cancer foci) or micrometastases (0.2–2 mm cancer foci) of ALNs were considered metastatic. It should be noted that the isolated tumor cells (ITCs, < 0.2 mm) were considered negative in the final statistical analysis11,17.

Histopathological analysis

Breast specimens were analysed by 2 pathologists with 20 and 15 years of experience in breast disease, respectively. The pathological features included LVI status, histologic grade, ER status, PR status, Her-2 status and Ki-67 expression. The cutoff value of high expression of ER was based on the positive nuclear staining of over 10% of the tumor cells26. The cutoff value of low expression of ER was based on the proportion of positive nuclear staining of tumor cells ≥ 1% and ≤ 10%26. The cutoff value of high expression of PR was based on the positive nuclear staining of over 20% of the tumor cells26. The cutoff value of low expression of PR was based on the proportion of positive nuclear staining of tumor cells ≥ 1% and ≤ 20%26. The Her-2 status was considered positive if the IHC score was 3 + 17. For Her-2 2 + expression, Her-2 status was determined with gene amplification by fluorescence in situ hybridization (FISH)11,17. Her-2 copy number > 6.0 or Her-2/CEP17 (chromosome enumeration probe-17) ratio > 2.0 was defined as Her-2 positive17,28. A Ki-67 proliferative index of > 20% was considered high29.

Construction of prediction model

Of the 710 cases, there was only one male patient with breast cancer. This 1 case not only had no statistical significance in the total sample size of more than 700 cases, but also led to a decrease in the generalizability of the statistical results, so they were removed, and finally only 709 cases were included in the statistical data. Firstly, a random variable function was established for 709 cases of data included in the statistics. According to the quartile method, data less than 25% of the random number was defined as the validation set, and the remaining 75% of the data was defined as the development set. Then the stepwise regression method was used to screen out the variables with significant prediction efficiency from the development set, and the logical regression equation was obtained. The β value of each variable in the equation was the score of each variable, while the score of a variable that was not included in the equation was zero. The total score of each sample was calculated by summation. Next, the ROC curve was drawn using the verification set data to evaluate the prediction model. Finally, the prediction score of each sample was calculated by using the logical regression equation and arranged in the order from low to high. According to the order, the scores of all samples were equally divided into 5 segments, and each segment accounted for about 20%. The average metastatic probability of the sample in this segment was taken as the metastatic probability of the samples with this grade30.

Data and statistical analysis

All statistical analyses were carried out using SAS9.4 international standard statistical programming software and the value of p < 0.05 was considered to be statistically significant. The enumeration data were described by frequency and percentage, the chi-square test was used for inter-group comparison; the median was used to describe the measurement data that did not meet the normal distribution, the t-test was used to determine whether there was any difference between the prediction results and the reference standard, and the nonparametric rank sum test was used for inter-group comparison. Risk factors were analyzed by binary logical regression analysis30.

Nine variables were entered into the MSKCC website (www.mskcc.org/nomograms): tumor type, LVI, tumor size, tumor location, age, multifocality, and ER and PR status. Subsequently, the probability of ALN metastasis was calculated for each patient. A ROC curve was drawn, and the area under the curve (AUC) was calculated. An AUC value between 0.5 and 0.7 indicates that the model has a certain accuracy, between 0.7 and 0.9 indicates that it has a good accuracy, and when it is above 0.9 that it has a very high accuracy31.

Results

General characteristics of the study population

Our study comprised 709 patients, 531 in the development set and 178 in the validation set. The descriptive characteristics of the development and validation sets are provided in Table 1. There was no significant difference between the development and validation sets. The rate of ALN metastasis in breast cancer patients was 51.22% (272/531) and 52.25% (93/178) within the development and validation sets, respectively. Based on the results of the univariate analysis, breast physical exam, tumor size on imaging, tumor location, abnormal calcification, ultrasound grades of ALNs, PR, Her-2 status, Ki-67 proliferative index, histological grade, LVI, NLR and PLR (p < 0.05) were significantly associated with the incidence of ALN metastasis. Multifocality was of borderline significance for predicting ALN metastasis (p = 0.0579).

Table 1 Descriptive characteristics of the development and validation sets.

Validation of the MSKCC model

A comparison between the verification populations from the MSKCC and our patient cohort is shown in Table 2. The descriptive characteristics of these two populations differ with respect to surgery. The patients in our validation population included SLNB and ALND patients. Therefore, we verified the predictive power of MSKCC for ALN but not SLN metastasis. We could not use the MSKCC model to predict the likelihood that patients diagnosed with breast cancer with skin changes or diagnosed with ductal carcinoma in situ with microinvasion (< 0.1 cm) have positive ALNs. Therefore, we could not compare the subtypes between the two populations. Additionally, we excluded those patients when validating the usefulness of the MSKCC model. We input each variable data online to obtain the probability of MSKCC for each patient’s ALN metastasis and drew the ROC curve according to the real value and predicted value to assess the accuracy of the MSKCC model for predicting ALN metastasis in our cohort. As shown in Fig. 1a, the AUC value for the MSKCC model in our verification population was 0.817, and the 95% confidence interval (95% CI) was 0.784 to 0.851, which was significant (p < 0.001).

Table 2 Comparison of descriptive characteristics of the two validation groups of the MSKCC model for predicting ALN metastasis, New York and Wei Fang.
Fig. 1
figure 1

The receiver operating characteristic (ROC) curve of the research. (a) Performance of the MSKCC model applied to the verification group (n = 613). The area under the ROC curve was 0.817, and the 95% confidence interval (95% CI) was 0.784–0.851, p < 0.001. (b) Performance of the new prediction model based on the development set (n = 531). The area under the ROC curve was 0.895, and the 95% confidence interval (95% CI) was 0.867–0.923, p < 0.001. (c) Performance of the novel scoring system based on the risk score (n = 709). The area under the ROC curve was 0.887, and the 95% confidence interval (95% CI) was 0.862–0.912, p < 0.001. (d) Performance of the validation set (n = 178). The area under the ROC curve was 0.865, and the 95% confidence interval (95% CI) was 0.810–0.920, p < 0.001.

Establishment of the new prediction model

We developed a new prediction model to overcome the limitations of the preoperative application of the MSKCC model. The multiple logistic regression analysis revealed that physical exam (p = 0.0034), ultrasound grades of ALNs (p < 0.0001), Her-2 status (p = 0.0176) and LVI (p < 0.0001) were related to ALN metastasis in breast cancer and showed significant differences between groups. Associations between clinical data and ALN metastasis are listed in Table 3. The following regression model for predicting metastatic and nonmetastatic ALNs was obtained: Risk of score = -2.2509 + 0.984 × physical exam (Negative: 0; Positive: 1) + 3.1703 × ultrasound grades of ALNs (Negative: 0; Positive: 1) + 0.6622 × Her-2 status (Negative: 0; Positive: 1) + 1.5287 × LVI status (Negative: 0; Positive: 1). The AUC value of the prediction model was 0.895, and the 95% CI was 0.867 to 0.923, which was statistically significant (p < 0.001) (Fig. 1b).

Table 3 Clinical data characteristics of multifactor analysis and estimation of the metastasis risk of the development dataset.

Establishment of the risk scoring system

A 5-grade risk-scoring system was developed. Table 4 indicates the risk of ALN metastasis according to the clinical data characteristics of breast cancer. In this scoring system, the probability of metastasis was 11.25% in ALNs without significantly suspicious metastatic clinical data (grades 1). In other words, the false-negative rate of the scoring system is 11.25%. Additionally, the Spearman rank correlation analysis of the grading and metastasis risk of the scoring system showed that the rank correlation coefficient was 1, p < 0.0001. This showed that, on the whole, the metastasis probability increased with increasing risk grade, and there was a positive correlation between them. The reference standard was taken as the state variable, the total score was taken as the test variable, the ROC curve was drawn to verify the regression equation based on the ROC curve, and finally, the critical value of -0.06 was obtained (the risk of metastasis was considered to be high if it was greater than -0.06 points). The AUC value of the risk scoring system was 0.887, and the 95% CI was 0.862 to 0.912 (p < 0.001), which was statistically significant (Fig. 1c). This finding shows that the transition from the prediction model to the scoring system was stable.

Table 4 Metastatic risk of ALNs according to the novel scoring system, development set, and validation set.

Prospective use of the risk scoring system

We randomly selected one-fourth of the total data as test data to internally validate the prediction model. As shown in Fig. 1d, the results showed that the scoring system performed well in the internal verification test (AUC: 0.865; [95% CI: 0.810 to 0.920]; p < 0.001). This finding shows that the model is suitable for use in routine clinical practice. Considering that the false-negative rate of SLNB averaged 8.4%, ranging from 0 to 29% across all trials. We divided the management recommendations on breast cancer patients’ ALNs into three levels according to the false-negative rate and the critical value of the risk scoring system: consider exempting SLNB (Grade 1, metastasis risk approximately11.25%), consider SLNB (Grade 2, metastasis risk approximately 34.4%), and consider ALND (Grades 3,4 and 5, metastasis risk approximately 74.17- 93.46%).

Establishment of the network-based prediction model

An online automated scoring system was developed (https://pay-test.coalapay.cn/Coala/thyroid/BCD_EN.html; Fig. 2) using the clinical data characteristics significantly associated with ALN metastasis for the simplified calculation and summation of the scoring system for metastasis risk stratification. Through this online automated scoring system, applicants can easily obtain the risk of grade, probability of metastasis and related management recommendations for ALNs.

Fig. 2
figure 2

Online risk calculation system. An online resource is available at https://pay-test.coalapay.cn/Coala/thyroid/BCD_EN.html for the easy calculation and summation of the metastasis risk stratification of ALNs using the clinical data characteristics of breast cancer.

Discussion

The results of the present study can be summarized as follows. First, a simple and easily accessible, web-based, diagnostic scoring system was developed using clinical features, imaging, and pathological indices of breast tumors to stratify the metastatic risk and management of ALNs. Second, this scoring system showed excellent predictive accuracy with an AUC of approximately 0.9 in the development set. In routine clinical practice, this web-based scoring system can automatically calculate and estimate the risk of ALN metastasis and provide corresponding management recommendations.

In clinical practice, accurate preoperative evaluation and management of ALNs is the key to providing accurate individualized diagnosis and treatment for breast cancer patients32,33,34. To achieve this goal, scholars from numerous medical centers have studied ALN metastasis in breast cancer and developed relevant prediction models based on either tumor pathological features or radiomic characteristics of the tumor11,15,16,17,33. However, these models have the characteristics of incomplete coverage of predictive variables, high costs, procedural complexity, and inability to popularize, which limits their clinical application12,18.

To overcome these drawbacks, we referred to the establishment method of the MSKCC model and developed a network based prediction model that combines physical exam, ultrasound grades of ALNs, Her-2 status and LVI (Tables 1 and 3). Generally, a model that performs with an AUC value of 0.7–0.8 is considered good, whereas values of 0.81–0.90 are considered excellent31. As shown in Fig. 1b, by analyzing the ROC curve of the prediction model, we obtained an AUC value of 0.895. This indicated that the prediction model we constructed in this study showed excellent predictability. Furthermore, when we selected the clinical data of another 178 patients to validate the prediction model internally, the results showed excellent prediction accuracy, and the AUC value of the internal validation set was 0.865 (Fig. 1d). The AUC values in these two populations were both greater than 0.754 of the original MSKCC model, indicating that our constructed model was more accurate15. Meanwhile, in comparison to prediction models developed by other research centers, such as the MSKCC model, the SCH model, etc., our proposed model was simpler to use since only four variables were included versus the nine variables in the MSKCC model and the five variables in the SCH model15,16,35. Although the prediction model in this study only includes four variables, these four variables come from three different prediction directions, making the prediction content more comprehensive. In addition, when we applied the MSKCC model to the retrospective data from our center to predict the probability of ALN metastasis, the AUC value was 0.817 (Fig. 1a and Table 2). Although this value is larger than the original MSKCC result of 0.754, it is still lower than the result of our prediction model of 0.895. We suspect that the reason for this result may be potentially related to skipping metastasis of ALNs. The analysis results not only provide indirect validation of the MSKCC model’s accuracy using external data, but also affirm the superiority of our prediction model.

According to an investigation report from the American Society of Clinical Oncology (ASCO), the overall false-negative rate for SLNB was 8.4%, with a range of 0–29%36. Therefore, taking into account the potential risks and benefits of surgery, if surgeons are willing to accept a false-negative rate ranging from 0 to 29%, patients in the low-risk metastasis group may be considered for exemption from ALN surgery. Furthermore, to enhance the clinical practicability of the predictive model, we developed a risk scoring system with five grades. Each suspicious clinical characteristic related to ALNs metastasis was assigned a different risk score, and the risk of metastasis was determined by calculating the total score. As shown in Fig. 1c, the ROC curve analysis of the risk scoring system yielded an AUC value of 0.887, indicating that the transition from the prediction model to the scoring system was stable and reliable. After calculating the metastasis probabilities for different grades in the scoring system, we found that the ALN metastasis probability for grade 1 was 11.25%, which fell within the false-negative rate range of SLNB (Table 4). Therefore, in clinical practice, we suggest that exemption from ALN surgery can be considered for patients in the grade 1 category. Previously, several studies indicated that SLNB could be avoided when the MSKCC predictive value was ≤ 10%; however, the application of this criterion was limited due to the low proportion of patients who benefit from it within the overall SLNB population (7.7%)16,31. In this study, patients with low metastatic risk (grade 1 segments) accounted for 33.85% of all patients in our proposed model. In other words, more people could benefit from our proposed model. In addition, the results of Spearman’s correlation coefficient for ranked data between the risk grade and the metastasis probability showed that, on the whole, the metastasis probability increased with increasing risk grade, and there was a positive correlation between them. By analyzing the ROC curve of the scoring system, we obtained a critical value of -0.06 points. Statistically, if the risk score of a case was greater than -0.06 points (the corresponding metastasis probability was approximately 34%), it was considered ALN metastasis37. Thus, combined with the metastasis probability of each risk grade shown in Table 4, we further categorized grades 2 as belonging to a medium-risk group (where SLNB was recommended), and grades 3 and above as falling into a high-risk group (where ALND was recommended). This approach provides more intuitive help for clinical workers making clinical decisions.

To facilitate clinical application, we have developed an easily accessible online risk calculator (https://pay-test.coalapay.cn/Coala/thyroid/BCD_EN.html). This tool automates the scoring system’s calculation, thereby enhancing its usability and intuitiveness in clinical settings. All users are free to utilize this risk calculator online and can further validate our scoring system with external datasets. This accurate and practical model predicts the likelihood of ALN metastasis, guiding clinical decision-making effectively.

Our study has some notable strengths and limitations. The strengths of this study can be summarized as follows. Comprehensiveness: The parameters of the prediction model encompass clinical examination, imaging, and histopathological findings, providing a comprehensive reflection of ALN status in breast cancer patients. Convenience: The model is freely accessible on the website, and clinical practitioners can easily input relevant prediction parameters to obtain automatic calculation of the prediction results. Intuitiveness: The risk stratification, probability of metastasis, and management recommendations for ALN metastasis are presented in a concise report format, making them clear and easily understandable. However, our research also has some limitations. First, this study is a single-center experimental study. The development set data and the validation set data were randomly selected from the total samples using the method of random function variables, ensuring no selection bias between them. To ensure the accuracy of the prediction model, the sample size of the development set was larger, specifically three times that of the validation set. Additionally, both the development set and validation set data originated from the same research center, meaning the validation set only served as internal validation. Therefore, we cannot ascertain whether these patients are representative of the general population. Fortunately, we included a larger sample size to mitigate any potential selection bias, and the risk scoring system we developed is freely available on our website, allowing researchers from other centers to use their data for external validation of our scoring system.Second, the potential for interobserver variability in the interpretation of pathological findings between the two pathologists was not assessed. However, given that the pathologists involved in this study had over 10 years of clinical experience in diagnosing breast cancer, the results may be more reproducible than if they had been evaluated by pathologists with less experience.

In conclusion, this study presents an accurate, comprehensive, and user-friendly multivariate prediction model along with an online risk calculator. This model will enable a true representation of the risk of ALN metastasis in breast cancer patients before surgery, thereby facilitating the selection of appropriate surgical protocols tailored to individual patients. It is worth noting that the universal applicability of this prediction model requires confirmation in future multicenter experimental studies with large cohorts that represent the general population. Until the prediction model is further evaluated, this predictive tool should be used with caution when deciding not to perform an ALN surgery in patients with a low risk of ALN metastasis.