Introduction

Pancreatic ductal adenocarcinoma (PDAC) is recognized as a profoundly malignant neoplasm, and is the sixth leading cause of oncologic mortality in China1,2. Statistical analysis of data from 2015 to 2019 revealed an annual increase in the incidence of pancreatic cancer of 0.6% to 1%. Projections for 2024 suggest that 611,720 individuals in the United States will succumb to cancer, translating to approximately 1680 deaths per day, with a significant proportion of these deaths attributed to pancreatic cancer3. Globally, there has been a surge in the incidence and mortality rates of pancreatic cancer, accompanied by a disheartening 5-year overall survival rate ranging from 9 to 13%4. The insidious nature of pancreatic cancer typically results in the late emergence of nonspecific symptoms such as abdominal pain, jaundice, vomiting, and hiccups, which are often apparent only advanced stages. Regrettably, only approximately 25% of these cases are amenable to surgical intervention5. Consequently, proactive monitoring of individuals at risk for pancreatic cancer is imperative.

In its precancerous stages, PDAC primarily arises from pancreatic intraepithelial neoplasia (PanIN) and mucinous cystic neoplasms (MCNs), with the latter including intraductal papillary mucinous neoplasms (IPMNs) and mucinous cystic tumours. IPMNs are of clinical importance because of their high prevalence and potential for malignant transformation, which significantly impacts patient outcomes. According to the latest World Health Organization (WHO) classification of gastrointestinal tumours, IPMNs are stratified into three categories on the basis of the degree of dysplasia: low-grade, high-grade (also termed in situ carcinoma), and IPMN-related invasive carcinoma6. Clinical evidence suggests that low-grade IPMNs with dysplasia are benign and associated with a lower risk of malignant progression, and a 5-year disease-specific survival rate of 97%, whileas high-grade and invasive IPMNs are associated with 5-year specific survival rates of 84% and 39%, respectively7. Notably, longitudinal study findings indicate that a mere 2.8% of patients with IPMNs develop malignant tumours over time8. Additionally, surgical treatment carries the inherent risk of postoperative complications, subsequently increasing mortality rates9. Therefore, a nuanced approach that optimizes diagnostic accuracy and surgical decision-making is repuired for effective management of IPMNs to miniminze unnecessary surgeries while preventing maligancy progression. Thus, increasing diagnostic precision is critical for advancing therapeutic strategies and improving outcomes for patients with pancreatic IPMNs.10

Traditional imaging techniques have exhibited inherent limitations in effectively delineating the risks associated with IPMNs. In an effort to overcome these limitations, the concept of “radiomics” was coined by the American scholar Gillies11.In radiomics, critical features are extracted from radiographic images to create high-dimensional datasets, thereby increasing the accuracy of identifying the risk factors associated with IPMNs. The radiomic workflow involves four essential steps: data acquisition, preprocessing, feature extraction, and feature selection12.Within this framework, the processes of data handling and analysis are instrumental in refining model development, providing a foundation for predictive accuracy.

As radiomic technology has progressed, radiomics has burgeoned into a vital tool in multiple applications, including tumour diagnosis, automated structured reporting, and the evaluation of treatment efficacy. The widespread use of radiomics in clinical applications has proven particularly advantageous in the diagnosis of conditions such as pulmonary nodules, breast cancer, prostate cancer, and rectal cancer. The incorporation of radiomics has significantly augmented diagnostic precision, ultimately facilitating better patient outcomes.

The aims of this study were to evaluate the preoperative predictive accuracy of a model constructed using radiomic features extracted from pancreatic contrast-enhanced CT images in distinguishing between benign and malignant IPMNs; to explore the optimal imaging phase for contrast-enhanced CT that maximizes predictive performance; and to investigate whether the integration of both qualitative and quantitative clinical features can refine the accuracy of pathological stratification diagnoses. Ultimately, the goal is to deliver more precise treatment recommendations, thereby increasing both the immediate and long-term survival prospects of patients diagnosed with IPMNs.

Method

Study population and inclusion criteria

A retrospective analysis of patients with IPMNs who underwent surgical treatment at the First Hospital of Jilin University from January 2013 to January 2023 was conducted. Patients were categorized into benign and malignant groups on the basis of postoperative pathological results. The data collected included demographic, clinical, and radiological parameters. The exclusion criteria were not receiving neoadjuvant therapy prior to surgery and not undergoing contrast-enhanced CT scanning within our facility. A total of 121 patients were deemed suitable for inclusion in the study.

We confirm that all methods employed in this study strictly adhered to relevant guidelines and regulations. The research protocols had been previously reviewed and approved by the Ethics Committee of the First Hospital of Jilin University, ensuring compliance with established ethical standards. Furthermore, informed consent was obtained from all individual participants and/or their legal guardians prior to their involvement in the study.

Image acquisition

In this study, preoperative pancreatic contrast-enhanced CT examinations were conducted with three different models of spiral CT machines: PHILIPS Ict 64/256, SIEMENS Cardiac 64, and GE Revolution 64 (the choice of machine was likely based on the availability and performance characteristics of these machines to ensure the acquisition of the most accurate images). The specific details regarding image acquisition are as follows: Scanning range extended from the xiphoid process level to the umbilical level, covering the entire abdomen, particularly the region containing the pancreas. This comprehensive coverage ensured a complete dataset for diagnostic purposes. Scan parameter settings: Tube voltage (influencing image contrast), 120 kVp; tube current, adjusted automatically to accommodate variations in patient size; pitch (determining the distance the CT machine moves per complete rotation), 5; matrix (defining image resolution), 521 × 521; and reconstructed slice thickness, between 1 and 1.5 mm, with thinner slices providing higher image resolution. All images were processed to create 1-mm-thick, thin-slice CT images to increase image clarity. The images were archived in DICOM format, the standard for medical images in diagnosis and research. These detailed parameters and processing steps are crucial to ensuring the accuracy and repeatability of the study, especially in analysing patients with IPMNs who are undergoing hepato-pancreato-biliary surgical treatment. Accurate image data are fundamental for diagnosis and research in this context.

Collection of radiological features in CT images

To ensure the accuracy and reliability of the radiological data, each CT image underwent meticulous independent review by two radiologists, each with over a decade of experience in clinical image analysis. Discrepancies between their initial assessments prompted a structured discussion, complemented by an additional review from a senior physician, to achieve a consensus on the evaluations. This rigorous procedure was imperative for the precise identification, classification, and localization of IPMN lesions, as well as for evaluating critical indicators such as dilation of the main pancreatic duct (MPD). The incorporation of such detailed and expert-driven analysis guarantees the robustness and reliability of the radiological features collected for subsequent analyses.

Radiomic analysis of CT images

All patient images in DICOM format were imported into RIAS software and meticulously processed by two radiologists with two and ten years of clinical reading experience. These radiologists carefully delineated the complete and surrounding boundaries of the pancreas and the lesion, layer by layer, to generate three-dimensional volumes of interest (VOIs). This manual process ensured a precise depiction of the pancreas and lesions while avoiding major blood vessels and dilated bile ducts adjacent to the lesion. In cases of evaluative discrepancies, the radiologists engaged in discussions to reach a consensus. After completing the delineation, the image regions of interest (ROIs) were saved in an independent “nii” file format and categorized on the basis of their benign or malignant characteristics for further analysis.

RIAS software uses standardization technology to extract radiological features within the region of interest.These feature typically encompass four categories: (1) shape-based characteristics; (2) first-order intensity attributes; (3) second-order texture properties; and (4) high-order texture features. Shape-based characteristics primarily include tumour volume, maximum diameter, surface area, tumour density, and sphericity. The first-order intensity feature is derived from the histogram, with evaluation of the ROI on the basis of the mean, median, maximum, minimum, kurtosis, entropy, and skewness values of the voxel intensity. Texture features are computed via various grey matrices, such as the grey co-occurrence matrix (GLCM), and grey dependence matrix (GLDM). Higher-order features are typically generated via wavelet or Laplacian Gaussian filtering. After extraction, a standardization process was applied to ensure consistent scaling across all the features. The LASSO algorithm was employed for dimensionality reduction, selecting the most significant parameters pertinent to pancreatic lesions.

The comprehensive dataset was divided into a cross-validation set and a test set at a 7:3 ratio. Selected radiomic features were utilized to construct predictive models via a random forest classifier (RFC), support vector machine (SVM), and logistic regression (LR) on the RIAS platform, ensuring efficient data classification and prediction. The SVM model, which demonstrated superior performance, was selected for validation. A fivefold cross-validation method involving receiver operating characteristic (ROC) curve plotting and evaluation of metrics such as the area under the curve (AUC), accuracy, sensitivity, specificity, and other relevant indicators was applied to the cross-validation set toassess model performance.

Separate radiomic models for the arterial and venous phases were developed and compared against various evaluation indicators to determine the optimal predictive model. The model exhibiting the best prediction performance was subsequently integrated with selected clinical features to increase diagnostic accuracy and efficacy.

Statistical analysis

In this study, statistical analysis was performed on data collected from 121 patients using IBM SPSS 26.0 software. Continuous variables, including patient attributes such as height, weight, and lesion size, were evaluated. Differences in categorical variables, such as sex, lesion type, pancreatic duct dilation, and the solid-cystic nature of the lesion, were also evaluated. For normally distributed metric data (e.g., patient age, lesion diameter, and volume), mean values and standard deviations are reported. Nonnormally distributed data, are reported as median values along with upper and lower quartiles. Independent sample t tests were used to compare two sets of data for differences, whereas Mann-Whitney U tests were used for nonnormally distributed data. ROC curve analysis and the Delong test were employed to compare the diagnostic performance of radiomic models from different phases and models combining selected clinical features. The chi-square test and corrected chi-square test were used to validate the results, with a p-value less than 0.05 indicating significant differences between groups.

Results

Analysis of clinical characteristics

A total of 121 patients diagnosed with IPMNs were included in the study, ranging in age from 36 to 77 years, with a mean age of 60.2 years. The cohort consisted of 61 males (50.41%) and 60 females (49.59%). Among them, 61 patients were determined to have benign IPMNs, including 30 with mild dysplasia, 21 with mild to moderate dysplasia, and 10 with moderate dysplasia. The malignant IPMN group comprised 60 patients, including 10 with moderate to severe dysplasia, 9 with severe dysplasia, and 41 with IPMNs associated with invasive carcinoma. Both qualitative and quantitative data from the 121 patients were statistically analysed, and the results are detailed in Table 1.

Table 1 Clinical data and statistical analysis results of 121 patients.
Figure 1
figure 1

The segmentation of pancreatic lesion. (a) The arterial-phase enhance CT of pancreas (b) Blue part is the lesion region. Purple part is the remaining pancreatic region. The RIAS software extracts radiomic features by labeling the purple and blue areas.

Selection and extraction of radiomic features

Using RIAS software, we extracted 1427 dimensional features from the ROIs in the cross-validation set (Fig. 1). Following intragroup validation, 954 arterial-phase features and 843 venous-phase features were rerained. Through dimensionality reduction, 8 and 9 dimensional arterial-phase features and venous-phase features were retained respectively. Figure 2 and Table 2 illustrate the visualized radiomic features after dimensionality reduction in arterial-phases; Fig. 3 and Table 3 illustrate the visualized radiomic features after dimensionality reduction in venous-phases.

Table 2 The remaining arterial-phase radiomic features after LASSO algorithm.
Figure 2
figure 2

Parameter selection and image omics feature screenin of LASSO model. (a) Parameter optimization of the arterial-phase LASSO regression model. (b) Selected feature parameters after fivefold cross-validation of the arterial-phase LASSO model. (c) Parameter optimization of the venous-phase LASSO regression model. (d) Selected feature parameters after fivefold cross-validation of the venous-phase LASSO model.

Table 3 The remaining venous-phase radiomic features after LASSO algorithm.
Figure 3
figure 3

Feature visualization after dimensionality reduction (a,b) is the feature visualization after dimensionality reduction in arterial-phase, (c,d) is the venous-phase.

Establishment of an arterial phase radiomics feature model and evaluation of predictive performance

We divided the entire sample dataset into training and testing sets at a ratio of 7:3. Employing dimensionality reduction techniques, we extracted meaningful feature parameters from radiomic features and constructed an SVM model for training and testing purposes. The model underwent fivefold cross-validation for validation. The final arterial-phase SVM model performed on the test set demonstrated the following metrics: sensitivity 0.55; specificity 0.37; precision 0.45; recall 0.47; F1 score 0.37; and accuracy 0.41; Fig. 4 shows the results of the ROC curves analyses for the arterial-phase training set, validation set, cross-validation set, and test set.

Figure 4
figure 4

ROC curve of arterial-phase imaging model. (a) Train set, (b) validation set, (c) cross-validation set, (d) test set.

Establishment of a venous phase radiomics feature model and evaluation of predictive performance

We divided the entire sample dataset into training and testing sets at a 7:3 ratio. Employing dimensionality reduction techniques, we extracted valuable features from radiomic characteristics and built an SVM model for both training and testing. Fivefold cross-validation was implemented for validation purposes. The venous-phase SVM model performed on the test set exhibited notable performance metrics: sensitivity, 0.8; specificity, 0.74; precision, 0.72; recall, 0.74; F1 score, 0.74; and accuracy, 0.74. The results of the ROC curve analyses for the arterial-phase training set, validation set, cross-validation set, and test set are shown in Fig. 5.

Figure 5
figure 5

ROC curve of venous-phase imaging model. (a) Train set, (b) validation set, (c) cross-validation set, (d) test set.

Establishment and evaluation of a combined model with partial qualitative and quantitative clinical data and radiomic features

We integrated clinical features with p < 0.05 along with radiomic features from the venous-phase ROIs to construct a novel SVM model. Following fivefold cross-validation, the model demonstrated superior predictive performance on the test set, with a sensitivity of 0.904, a specificity of 0.74, a precision of 0.76, a recall of 0.88, an F1 score of 0.74, and an accuracy of 0.8 (Fig. 6). Compared with the radiomics feature model alone, the model in which partial qualitative and quantitative clinical data were included resulted in a significant improvement in diagnostic performance (DeLong test, P value > 0.05).

Figure 6
figure 6

Clinical features combined with radiomics model. (a) Train set, (b) validation set, (c) cross-validation set, (d) test set.

Discussion

We established a cohort of 121 patients with IPMNs and utilized radiomic features to distinguish between benign and malignant IPMNs. To address this challenging issue within current clinical and radiological practice, we aimed to develop a model with clinical practice value, that could lead to improved recommendations for patients and their families and more effective management of their care. Once a patient is diagnosed with an IPMN, risk stratification can be conducted, there by aiding in improving clinical decision-making. In patients for whom the risk of developing malignant tumours is low, surgery can be avoided, opting instead for close patient monitoring. For high-risk patients, tumour resection significantly improves patient outcomes.

In this retrospective study, the patient cohort was divided into benign and malignant groups, and a multivariate model was established, using logistic regression to assess the ability of the model to predict benign outcomes. This study diverges from traditional methods that rely solely on either venous- or arterial-phase CT scans by employing multiphase enhanced CT coupled with radiomic analysis. Interestingly, we observed that each comparison period radiomic signature contained distinctive dimensional features. Specifically, the univariate features from multiple venous phases yielded superior prediction results. Moreover, the multivariate analysis revealed that the arterial-phase model included more dimensional features that may not be discernible through visual inspection and experience alone. Our findings indicate that, compared with a pure venous-phase imaging assessment, more influential features can be exreacted with multiphase radiomic assessment of IPMNs.

Among the various types of IPMNs, main-duct IPMNs (MD-IPMNs) represent the most common form of malignant transformation, with up to 65% exhibiting malignancy13. Diabetes is associated with an increased risk of primary involvement and high-grade dysplasia or invasive cancer14.Mixed-type IPMNs (MT-IPMNs) share a similar pathophysiology to MD-IPMNs and are therefore considered to have a higher potential for malignancy compared to branch-duct IPMNs (BD-IPMNs)15. A summary of over 1500 postoperative IPMN patients revealed that jaundice and solid tumour components were significant risk factors for malignancy16. CA19-9 levels > 37 U/ml, tumour diameter > 3 cm, and elevated alkaline phosphatase are concerning indicators17. Studies have indicated that combining CA19-9 with CA125 provides a more accurate prediction of IPMN malignancy18.Furthermore, proteomic analysis of IPMN liquid tissue lysate demonstrated overexpression of tissue transglutaminase in patients with malignant disease19. Combining radiomics features with the nine selected independent risk factors significantly increased the accuracy of the model diagnosis. The AUC of our model for predicting benign and malignant IPMNs reached 0.904, surpassing the diagnostic accuracy of the International Consensus Fukuoka Guidelines.

IPMNs represent a predominant type of pancreatic cystic neoplasm (PCN), with an incidence rate reaching 70%20.The increasing incidence reflects advancements in imaging technologies and improved understanding and detection of pancreatic lesions. Compared with research on PDAC grading, research on IPMN grading is considerably less extensive; furthermore,IPMN grading methods often lack validation and have insufficient diagnostic performance for practical application. Serum CA19-9 levels are associated with malignant IPMNs but demonstrate limited diagnostic capability21, with reported AUC values ranging from 0.62 to 0.78. Serum CEA and CA19-9 concentrations combined with anti-p53 antibody positivity can distinguish high-grade dysplasia and malignant IPMNs from low-grade IPMNs, albeit with a sensitivity of only 38.4% and specificity of 81.6%22 .Recent studies suggest that combining DNA methylation biomarkers can further increase diagnostic accuracy, whereas blood protein and microRNA biomarkers can facilitate the precise diagnosis and risk stratification of IPMN patients23,24.

In a previous imaging study, Hanania et al.25 identified grayscale co-occurrence matrix biomarkers on arterial phase 2D images, achieving 82% diagnostic accuracy in predicting malignant or high-grade diseases. Chakraborty et al.26 utilized 2D radiomic texture features from venous-phase images, and reported a diagnostic accuracy of 74% in identifying high-risk IPMNs. Flammia et al.27 identified guideline-relevant risk features and worrisome features via diffusion-weighted imaging (DWI)/apparent diffusion coefficient (ADC) maps of contrast-enhanced MR images in 50 IPMN patients. Additionally, Cui and others, through in-depth analysis, constructed a joint nomogram model that combines radiomic features of MR images with clinical indicators (CA19-9 levels and MPD size) to more accurately identify branching types of tumours in IPMN patients28.

Compared with these previous studies, our research boasts as a relatively large sample size, contributing to a more stable model. While prior efforts focused that primarily on 2D analysis at the level of the largest tumour cross-section, our approach incorporates a 3D total lesion analysis. This methodology allows the extraction of more comprehensive dimensional features, significantly augmenting the depth and breadth of the analysis, and thus provides a more nuanced understanding of the disease. However, it is essential to note that the data in our study were sourced from a single centre, and studies on data from multiple centres are needed to further validate the results.