Background

Pancreatic cancer is the 4th leading cause of cancer-related deaths worldwide and is projected to become the second most lethal tumor by 2030 [1]. One of the cancers with the poorest prognosis, pancreatic cancer has a 5-year survival rate of 12% [2]. At the time of diagnosis, <20% of patients present with localized [3] and potentially resectable and curative disease, 30–35% have locally advanced tumors, and the remaining 50–55% of patients present with metastatic disease [4]. Five-year survival rates of patients with stage 0 (Union for International Cancer Control staging), IA, and IB pancreatic cancer were 85.8%, 68.7%, and 59.7%, respectively [5]. If tumor cells invade the surrounding tissues and spread to distant organs, the 5-year survival rate decreases to 15% and 3%, respectively [2]. Therefore, early detection of pancreatic cancer without local invasion or distant metastasis is critical for improving the survival of patients with pancreatic cancer. However, early diagnosis of pancreatic cancer is difficult owing to nonspecific symptoms, with stage IA and IB cases accounting for only 1.8% and 2.3% of all cases, respectively [6]. Patients with pancreatic cancer who present with symptoms generally have advanced-stage disease, and 75% of patients with stage 0 and stage I pancreatic cancer are asymptomatic [7]. Therefore, it is important to detect pancreatic cancer in asymptomatic populations to improve the prognosis. However, to date, no blood biomarkers have been used to identify patients with pancreatic cancer at an early stage.

The US Preventive Services Task Force does not recommend pancreatic cancer screening in asymptomatic adults [8]. The low prevalence of pancreatic cancer makes the screening of asymptomatic adult populations unfeasible using existing diagnostic methods. Moreover, the existing diagnostic methods have unacceptably high rates of false positive findings [9]. However, high-risk populations with certain inherited genetic syndromes, a history of familial pancreatic cancer, or intraductal papillary neoplasms of the pancreas (IPMN) are recommended for pancreatic cancer screening [10, 11]. However, in the framework of an imaging-based pancreatic cancer surveillance research program in high-risk individuals, almost half of the individuals developed neoplastic lesions without prior signs on imaging, and by the time of detection or surgical treatment, most had already progressed beyond an early stage (T1 N0 M0) [12]. According to the Japanese nationwide calculation of cancer screening detection rates using abdominal ultrasound (AUS), only 152 pancreatic cancer cases were detected among 3,005,393 examinees (0.005%) [13]. Thus, more effective, and less invasive diagnostic tools for early-stage pancreatic cancer are urgently required.

MicroRNAs (miRNAs) are small non-coding RNA composed of 18–24 base pairs of single-chain molecules [14, 15]. MiRNAs modulate gene expression by decreasing target mRNA stability or repressing translational efficiency. They can stably exist in severe conditions, including in serum or pancreatic juice because some miRNAs are at least partly packaged into extracellular vesicles or included in an RNA-induced simple complex (RISC) with the Agonature2 protein to protect against the elimination of RNase [16,17,18]. Therefore, circulating miRNAs are potential novel targets for liquid biopsies. Some retrospective studies have shown that the expression of specific miRNAs in plasma or serum can distinguish patients with pancreatic cancer from healthy participants or patients with benign pancreatic diseases [19,20,21,22,23,24,25,26,27,28,29,30]. However, in most of these studies, the data were not validated using independent case-control cohorts. In addition, previous studies included only a small number of patients with early-stage (stage 0-I) pancreatic cancer. Furthermore, most of these studies analyzed only a few varieties of miRNAs, and not whole miRNA profiles. To the best of our knowledge, there have been few analyses of whole miRNA profiles for all samples in pancreatic cancer patients; however, these studies discriminated pancreatic cancer from other cancers but not from healthy controls [28], or the validation cohort included less than five samples from early-stage (stage 0-I) pancreatic cancer patients [31]. Importantly, its performance has not been validated in patients with asymptomatic early-stage (stage 0-I) pancreatic cancer.

In this study, to assess the ability of miRNA profiles and CA19-9 levels to distinguish subjects with pancreatic cancer at each stage from subjects without pancreatic cancer, we analyzed 425 serum samples, including 213 pancreatic cancer samples collected from 14 centers and 212 non-cancer healthy control samples collected from three centers. We comprehensively analyzed all miRNA profiles by next-generation sequencing (NGS) using an automated machine learning (AutoML) method to create discrimination models using highly expressed 100 miRNAs and CA19-9. The models were validated in an independent cohort study. Our data showed that the 100 highly expressed miRNAs and a combination of those with CA19-9 could be biomarkers for the specific and early detection of pancreatic cancer, and even for asymptomatic early-stage pancreatic cancer patients.

Methods

Patients and sample preparation

Serum samples were obtained from pancreatic cancer patients (n = 212) who were admitted or referred to Kyoto University Hospital (KUHP) (n = 56), Kindai University Hospital (KDUH) (n = 67), Kyoto Prefectural University of Medicine (KPUM) (n = 25), Hyogo Prefectural Amagasaki General Medical Center (AGMC) (n = 5), Japanese Red Cross Osaka Hospital (JRCOS) (n = 11), Japanese Red Cross Otsu Hospital (JRCOT) (n = 12), Japanese Red Cross Takatsuki Hospital (JRCT) (n = 3), Japanese Red Cross Wakayama Medical Center (JRCW) (n = 13), Kyoto Katsura Hospital (KKTR) (n = 4), Kobe City Nishi-Kobe Medical Center (KNMC) (n = 2), Kitano Hospital (KTNH) (n = 5), Shiga General Hospital (SGH) (n = 3), Shinko Hospital (SKHP) (n = 3), and Tenri Hospital (TNRH) (n = 11), all of which are secondary or tertiary care hospitals, between 2020 and 2023 and stored at āˆ’80 °C. The inclusion criteria were age over 20 years, histologically confirmed pancreatic cancer in a surgically resected specimen, or a computed tomography (CT) scan showing a solid mass in the pancreas or dilatation of pancreatic duct in patients not undergoing surgery with histology or cytology from the pancreas that confirmed the diagnosis of adenocarcinoma. The clinical stage of pancreatic cancer was determined using a CT scan according to the UICC 7th criteria. Patients with pancreatic cancer were randomly divided into a training cohort and a validation cohort under the following restrictions: 30 cases in stage 0-I, stage II, III, and IV were assigned to the validation cohort, and the remaining 92 pancreatic cancer cases were assigned to the training cohort.

A total of 213 serum samples from healthy individuals without cancer were collected from three independent cohorts. The three cohorts included volunteers aged over 40 years who were recruited from the OCROM clinic (OCROMC) (n = 71), Osaka Pharmacology Clinical Research Hospital (OPHACH) (n = 71), and ToCROM clinic (TOCROMC) (n = 71), all of which are primary care hospitals, in 2021 and stored at āˆ’80 °C. The inclusion criterion for healthy control participants was no history of malignant tumors according to self-reported medical history at the time of blood sampling and one year later. Healthy control participants were randomly divided into training and validation cohorts under the following restrictions: 120 cases were assigned to the validation cohort, and the remaining 93 healthy control participants were assigned to the training cohort.

Blood sample collection and miRNA extraction from serum

Blood samples were collected into serum-separating tubes. After blood collection, the serum was separated by centrifugation and aliquoted into cryotubes. These sera were frozen at āˆ’80 °C until miRNA extraction. The time interval between blood sampling and serum freezing at āˆ’80 °C was observed strictly within the same day, and serum samples that showed hemolysis were excluded. RNA samples containing miRNA were extracted from the serum using the MaxwellĀ® RSC miRNA Plasma and Serum kit (Promega, AS1680). QIAseq miRNA Library QC Spike-ins (Qiagen, 331535) was spiked into each serum sample to monitor RNA extraction. Concentrations of miRNAs were quantified using Qubitā„¢ microRNA Assay Kits (Invitrogen, Q32881). These miRNA samples were stored at āˆ’80 °C until NGS library preparation.

NGS library preparation and NGS measurement

miRNA libraries were prepared using the QIAseq miRNA Library Kit (Qiagen, 331509) and the QIAseq miRNA NGS 96 Index IL (96) (Qiagen, 331565) using Agilent Bravo NGS (Agilent Technologies, RRID:SCR_019473). The library size distribution was determined using a TapeStation HS D1000 system (Agilent Technologies, 5067-5584 and 5067-5585). Library samples were pooled and the NextSeq PhiX Control Kit (Illumina, FC-110-3002) was spiked into the sample mixture according to the manufacturer’s recommendations. The pooled samples were sequenced in four lanes of the NextSeq 500/550 High Output Kit v2.5 (75 cycles) (Illumina, 20024906) using the NextSeq 550 platform (Illumina, RRID:SCR_016381). The sequenced reads were annotated using the QIAseq miRNA primary analysis pipeline provided by the GeneGlobe Data Analysis Center (https://geneglobe.qiagen.com/jp/analyze/, RRID:SCR_021211). Sequencing outputs were mapped to miRBase v21 using the QIAseq miRNA Primary Analysis Pipeline.

miRNA selection and normalization

We performed miRNA filtration using the read count data of 2588 miRNAs and 425 samples. Among the 2588 miRNAs, we excluded miRNAs if they did not meet the following criteria: the read count was one or more in all the samples. The remaining 230 miRNAs were normalized by ComBat (RRID:SCR_010974) in the R (version 4.2.3, RRID:SCR_001905) ā€œsvaā€ package and log2-transformed. These miRNAs were used for PCA and hierarchical unsupervised clustering analysis. The miRNAs are listed in Supplementary TableĀ 1.

For the development of classification models, miRNAs were excluded if they did not meet the following criteria: a read count of 50 or more in all the samples. The ComBat-normalized and log2-transformed values of the remaining 100 miRNAs were used to construct discrimination models. These miRNAs are listed in Supplementary TableĀ 2.

Construction of pancreatic cancer discrimination models

To construct pancreatic cancer discrimination models, we utilized the AutoML platform DataRobot (version 8.0.12, Boston, Massachusetts) [32, 33]. DataRobot can automatically create more than 60 discrimination models and their combinations, called ensemble models. Using the expression data of 100 miRNAs, a 5-fold cross-validation (CV) AUC was calculated against 64 algorithms. This calculation was repeated 100 times and the division patterns of the training cohort were randomly changed for cross-validation. Eight algorithms with high average 5-CV AUC values were selected for the final miRNA model. For the miRNA model, the selected 8 algorithms were as follows: elastic net [34], light gradient boosting [35] (RRID:SCR_021697), nystroem kernel support vector machine [36], extremely randomized trees [37], Keras wide residual neural network [38], regularized logistic regression [39], stochastic gradient descent [40], and extreme gradient boosting [41] (RRID:SCR_021361). The average value of the eight prediction score outputs from each algorithm was used as the final miRNA index. For the miRNA+CA19-9 model, 5-CV AUC calculations and model selection were performed in the same way as described above, using 100 miRNA expression data points in combination with serum CA19-9 concentration data. For the miRNA+CA19-9 model, the selected 8 algorithms were as follows: RuleFit [42], gradient-boosted trees [43], gradient-boosted greedy trees [44], random forest [45], Keras wide residual neural network, elastic net, stochastic gradient descent, and extreme gradient boosting. The average value of the eight prediction score outputs from each algorithm was used as the final miRNA+CA19-9-index.

Construction of pancreatic cancer discrimination models using data from Ion GeneStudio S5 system

For miRNA quantification using the Ion GeneStudio S5 Prime system (Thermo Fisher Scientific, RRID: SCR_017984), miRNA libraries were prepared using the QIAseq miRNA Library Kit and the QIAseq miRNA 48 Index TF (96) (Qiagen, 331585) using Agilent Bravo NGS. The pooled libraries were loaded into the Ion 540 Chip (Thermo Fisher Scientific, A27766) with the Ion Chef system (Thermo Fisher Scientific, 4484177) and sequenced with the Ion GeneStudio S5 Prime system. The sequenced reads were processed as for NextSeq 550. miRNA reads were normalized by reference-batch ComBat in the R (version 4.2.3) ā€œsvaā€ package. Pancreatic cancer discrimination models using data from the Ion GeneStudio S5 system were constructed using the same 100 miRNAs used in the models built on NextSeq 550 data. The same eight algorithms as the NextSeq 550-based models were used and the average of these prediction scores was used as the final index.

Statistical analysis

PCA, t-SNE, UMAP, hierarchical unsupervised clustering analysis with heatmaps, box plots, scatter plots, confusion matrices, ROC curves, and AUC calculations were conducted using the statistical analysis software R (version 4.2.3). PCA, t-SNE, and UMAP analyses were conducted by prcomp (RRID:SCR_014676), Rtsne (perplexity = 30, as the default value, RRID:SCR_016342), and umap in the R ā€œstats,ā€ ā€œRtsne,ā€ and ā€œumapā€ packages, respectively. In each PCA, t-SNE, and UMAP mapping, 95% confidence ellipses were drawn by stat_ellipse in the R ā€œggplot2ā€ package. Heatmap and dendrograms were drawn by Heatmap in the R ā€œComplexHeatmapā€ package. Student’s t-test for continuous variables and Pearson’s χ2 test for categorical variables were used to analyze patient characteristics of the training cohort and validation cohort. To evaluate model performance of sensitivity and specificity, 95% CIs were calculated using the bootstrap method by the R ā€œpROCā€ package. To evaluate the influence of patient backgrounds of age, sex, drinking habits, history of smoking, blood collection centers, diabetes mellitus, and focal pancreatic parenchymal atrophy (FPPA) on constructed models, p-values of the Kolmogorov–Smirnov test were calculated by ks.test in the R ā€œstatsā€ package, and corrected by the Bonferroni method. To evaluate the correlation between miRNA index and CA19-9 level, and between miRNA index and tumor size, Spearman’s correlation coefficient was calculated by cor.test in the R ā€œstatsā€ packages. To evaluate AUC values of ROC curves with early-stage or asymptomatic pancreatic cancers of constructed models, p-values were calculated by roc.test in the R ā€œpROCā€ package, and corrected by the Bonferroni method. To evaluate sensitivities of early-stage or asymptomatic pancreatic cancers of constructed models, p-values of the McNemar test were calculated by mcnemar.test in the R ā€œstatsā€ package, and corrected by the Bonferroni method.

Results

Study design

A total of 425 serum samples, including those of 213 pancreatic cancer patients from 14 hospitals and of 212 healthy controls without cancer from three clinics, were analyzed by miRNA sequencing to generate global miRNA expression profiles. According to the UICC 7th edition criteria, we subdivided pancreatic cancers based on the clinical stage: stage 0, stage I, stage II, stage III, and stage IV. Serum samples were collected prior to treatment. To validate the diagnostic performance in patients with pancreatic cancer in each stage, we randomly selected 30 pancreatic cancer cases in stage 0-I, stage II, III, and IV, and 120 healthy control cases as a validation cohort. The remaining 92 patients with pancreatic cancer and 93 non-cancer healthy controls were assigned to the training cohort (Fig.Ā 1).

Fig. 1: The workflow of pancreatic cancer patients and healthy control participants for developing and evaluating prediction models.
figure 1

a All the 425 serum samples, including 213 pancreatic cancer patients from 14 hospitals and 212 healthy controls without cancer from three clinics, were randomly divided into the training and validation cohorts.

Participants’ characteristics are presented in TableĀ 1. Owing to restrictions in the sampling of healthy participants, there was a significant difference in age, drinking habits, and diabetes mellitus status in the training cohort and in sex, age, history of smoking, and diabetes mellitus in the validation cohort (P < 0.05) between patients with pancreatic cancer and healthy participants. Owing to the random selection of 30 pancreatic cancer cases at each stage, there was a significant difference in stage (P < 0.05) (Supplementary TableĀ 3) between the validation and training cohorts in pancreatic cancer patients.

Table 1 The characteristics of pancreatic cancer patients and healthy controls in the training and validation cohorts and the p-value of pancreatic cancers vs. healthy controls in each cohort.

The comprehensive analysis of the miRNA expression profiles identified differences in patients with pancreatic cancer and healthy participants

We examined the expression levels of 2588 miRNAs comprehensively for miRNA sequencing with NextSeq 550 (Illumina). First, we focused on the 230 miRNAs commonly detected in all 425 samples. For these miRNAs, we performed hierarchical unsupervised clustering analysis with a heatmap (Fig.Ā 2a) and principal component analysis (PCA) mapping (Fig.Ā 2b) to visualize the expression patterns of miRNAs in all samples in the training and validation cohorts. Another clustering result, without the separation of healthy controls and patients with pancreatic cancer, also showed that each sample type formed clusters (Supplementary Fig.Ā 1A). These classification analyses indicated that the miRNA expression profiles in non-cancer controls were relatively distinct from those in patients with pancreatic cancer. Furthermore, the PCA revealed no significant differences in patients with pancreatic cancer among hospitals, in healthy controls among clinics, and in all the samples among the NGS measurements (Fig.Ā 2c–e). Similar analyses were conducted using the t-distributed stochastic neighbor embedding (t-SNE), and the uniform manifold approximation and projection (UMAP) methods. These results also supported differences in miRNA expression profiles between healthy controls and patients with pancreatic cancer, but no significant differences among the blood collection centers and the NGS measurements (Supplementary Fig.Ā 2A–H).

Fig. 2: The serum miRNA expression profiles in healthy controls differed from those in pancreatic cancer patients.
figure 2

a Heatmap representing hierarchical unsupervised clustering analysis of the miRNA sequencing of healthy controls (blue) and pancreatic cancer patients (orange). b Principal component analysis (PCA) of the miRNA sequencing of healthy controls (blue) and pancreatic cancer patients (orange). c–e The PCA grouped by hospitals, clinics, and the NGS measurements. HC healthy control, PC pancreatic cancer.

We created the best discrimination models using AutoML in the training cohort and validated the performance in the independent validation cohort

Next, using the training cohort, diagnostic models were constructed to discriminate patients with pancreatic cancer from healthy controls. To this end, we focused on the highly and robustly expressed miRNAs. One hundred highly expressed miRNAs were used to create a diagnostic model in the training cohort using the AutoML platform DataRobot. We used eight algorithms to calculate the prediction scores and took the average of the prediction scores produced by the eight algorithms to create the best discrimination model (Fig.Ā 3a), designated as the miRNA model, which exhibited the best discrimination performance in the training cohort. We also created a diagnostic model using the 100 highly and robustly expressed miRNAs in combination with serum CA19-9 in the same way as creating the miRNA model, designated as the miRNA+CA19-9 model, which exhibited the best discrimination performance in the training cohort.

Fig. 3: Creation of the best discrimination models using AutoML in the training cohort and validation in the independent validation cohort.
figure 3

a Schematic procedure of the index calculation in miRNA model and miRNA+CA19-9 model. b ROC curve for the performance of serum CA19-9 alone (red), miRNA model (green), and miRNA+CA19-9 model (blue) in the validation cohort. c–e Box plots of CA19-9, indices of miRNA model, and miRNA+CA19-9 model of healthy participants and pancreatic cancer patients in each stage (0, I, II, III, and IV) in the validation cohort. f, g Confusion matrices of miRNA model and miRNA+CA19-9 model at 98% specificity in each stage of the validation cohort. TP True Positive, TN True Negative, FP False Positive, FN False Negative, SEN Sensitivity, SPC Specificity, FPR False Positive Rate, FNR False Negative Rate. The color gradient indicated the rate of each metric. h, i Box plots of CEA and DUPAN-2 of healthy participants and pancreatic cancer patients in each stage (0, I, II, III, and IV) in the validation cohort.

Next, we evaluated the diagnostic performance of serum CA19-9, the miRNA model, and the miRNA+CA19-9 model in an independent validation cohort. Receiver operating characteristic (ROC) curves for serum CA19-9, the miRNA model, and the miRNA+CA19-9 model in the validation cohort revealed that both the miRNA model and the miRNA+CA19-9 model showed better discrimination performance than conventional serum CA19-9 in the validation cohort (Fig.Ā 3b). The miRNA and miRNA+CA19-9 models exhibited high indices for patients with pancreatic cancer, even in the early stages. Although serum CA19-9 levels gradually increased as the patient’s disease stages progressed, positive CA19-9 levels in the validation cohort were observed only in 29% of patients in stage I, while none of the patients in stage 0 had a positive level (Fig.Ā 3c and TableĀ 2). In contrast, the miRNA and miRNA+CA19-9 models successfully discriminated healthy controls from patients with pancreatic cancer in stages II-IV as well as in stage 0-I (Fig.Ā 3d–g and TableĀ 2).

Table 2 The performance of serum CA19-9, miRNA model, and miRNA+CA19-9 model to discriminate pancreatic cancer patients from healthy participants.

In the independent validation cohort, the AUC was 0.88 (95% confidence interval (CI), 0.84–0.93) for serum CA19-9, 0.94 (95% CI, 0.91–0.97) for the miRNA model, and 0.99 (95% CI, 0.98–1.00) for the miRNA+CA19-9 model when patients with pancreatic cancer were tested against healthy participants (TableĀ 2).

We also evaluated the diagnostic performance of serum CEA and DUPAN-2, other pancreatic cancer markers. Serum CEA and DUPAN-2 levels, similar to CA19-9, gradually increased as the patients’ disease status progressed however, only 20% of patients in stages 0 and I were positive for each tumor marker in the validation cohort (Fig.Ā 3h, i). We investigated the influence of patient background, including age, sex, drinking habits, smoking history, blood collection center, diabetes mellitus status, FPPA detection, tumor size, and CA19-9 level. No association was found between discrimination scores and age, sex, drinking habits, history of smoking, blood collection center, or diabetes mellitus status, FPPA detection, tumor size, or CA19-9 level. (Supplementary Figs.Ā 3A–J and 4A–G).

To validate the reproducibility of diagnostic performance between miRNA measurement platforms, we measured miRNA profiles with Ion GeneStudio S5 and predicted pancreatic cancer or healthy control in the validation cohort on models constructed by miRNA profiles from NextSeq 550. Validation results using data from the Ion GeneStudio S5 system were comparable to those of NextSeq 550, confirming the robustness of the models (Supplementary Fig.Ā 6, Supplementary TableĀ 6).

The miRNA+CA19-9 model showed the highest performance in discriminating patients with pancreatic cancer in stage 0-I or stage 0-II from healthy controls

We also evaluated the performance of each prediction model in discriminating patients with pancreatic cancer in stage 0-I or stage 0-II from healthy controls in the validation cohort. Among the three models, the miRNA+CA19-9 model showed the highest AUC in the three models. The diagnostic performance of the miRNA model was higher than that of CA19-9 in patients with stage 0-I (Fig.Ā 4a) and stage 0-II (Fig.Ā 4b) pancreatic cancers compared to healthy participants. The AUC was 0.81 (95% CI, 0.71–0.92) and 0.84 (95% CI, 0.76–0.91) for serum CA19-9, 0.92 (95% CI, 0.86–0.98) and 0.92 (95% CI, 0.87–0.96) for the miRNA model, and 0.98 (95% CI, 0.96–1.00) and 0.98 (95% CI, 0.97–1.00) for the miRNA+CA19-9 model when patients with pancreatic cancer in stage 0-I or stage 0-II were tested against healthy participants, respectively (Supplementary TableĀ 4).

Fig. 4: The 100 highly expressed miRNAs and CA19-9 in blood serum successfully discriminated healthy controls from patients with pancreatic cancer, not only in the advanced stage but also in the early stage (stage 0-I) with high sensitivity and specificity.
figure 4

a, b ROC curves for the performance of serum CA19-9 alone (red), miRNA model (green), and miRNA+CA19-9 model (blue) in discriminating patients with pancreatic cancer in stage 0-I (a) and stage 0-II (b) from healthy controls. c ROC curves for the performance of serum CA19-9 alone (red), miRNA model (green), and miRNA+CA19-9 model (blue) in discriminating asymptomatic patients with pancreatic cancer in stage 0-I in the validation cohort. d–l Asymptomatic patients with pancreatic cancer in stage I: case 1 (d–g) and case 2 (h–l). d MRI revealed main pancreatic duct dilatation (arrow). Scale bar, 3.0 cm (e) MRI revealed a low-signal area in the main pancreatic duct on T2-weighted images (arrow). Scale bar, 3.0 cm (f) Contrast-enhanced CT revealed a slightly enhanced tumor in the main pancreatic duct (arrow). Scale bar, 3.0 cm (g) EUS revealed a mass, 5 mm in diameter, in the main pancreatic duct (arrow). Scale bar, 0.50 cm (h) MRI revealed main pancreatic duct dilatation (arrow). Scale bar, 3.0 cm (i) Contrast-enhanced CT revealed main pancreatic dilatation (arrow). A tumor could not be detected by CT. Scale bar, 3.0 cm (j) A tumor could not be detected in PET-CT (arrow). Scale bar, 3.0 cm (k) A tumor, 6 mm in diameter, was detected by EUS (arrow). Scale bar, 1.0 cm (l) Hematoxylin, and eosin staining of the surgical specimen revealed pancreatic cancer in stage IA. The inset shows invasive pancreatic cancer. Scale bar, 200 µm

The AUC of the miRNA+CA19-9 model was significantly higher than that of CA19-9 in patients with pancreatic cancer in stage 0-I (Fig.Ā 4a) and stage 0-II (Fig.Ā 4b) in the validation cohort (P < 0.05). The sensitivities of the miRNA and miRNA+CA19-9 models at 98% specificity, the same as that of CA19-9, were significantly higher than that of CA19-9 in patients with pancreatic cancer in stage 0-I tested against healthy participants in the validation cohort (P < 0.05).

These results indicated that the 100 highly expressed miRNAs and CA19-9 in blood serum successfully discriminated healthy controls from patients with pancreatic cancer, not only in the advanced stage but also in the early stage (stage 0-I) with high sensitivity and specificity.

To further determine whether these discrimination models are useful for asymptomatic patients with pancreatic cancer in the early stage (stage 0-I), we investigated the performance of each model in 21 cases of asymptomatic patients with pancreatic cancer in stage 0-I. The ROC curves for serum CA19-9, the miRNA model, and the miRNA+CA19-9 model using the validation cohort revealed that both the miRNA and miRNA+CA19-9 models showed better discrimination performance than conventional serum CA19-9 in asymptomatic patients with pancreatic cancer in stage 0-I (Fig.Ā 4c). The AUC was 0.81 (95% CI, 0.70–0.93) for serum CA19-9, 0.89 (95% CI, 0.81–0.97) for the miRNA model, and 0.97 (95% CI, 0.95–.00) for the miRNA+CA19-9 model when asymptomatic patients with pancreatic cancer of stage 0-I were tested against healthy participants in the validation cohort. The AUC and sensitivity of the miRNA+CA19-9 model at 98% specificity were significantly higher than those of CA19-9 (P < 0.05) (Supplementary TableĀ 5).

These results indicate that the 100 highly expressed miRNAs and their combination with CA19-9 could be used as biomarkers for screening asymptomatic patients with pancreatic cancer in stage 0-I.

Finally, we present representative cases in which the miRNA and miRNA+CA19-9 models could discriminate asymptomatic patients with early-stage pancreatic cancer from healthy controls.

Case 1: An asymptomatic male in his 70 s visited Kyoto University Hospital for a routine check-up for dilatation of the main pancreatic duct and a pancreatic cyst. Magnetic resonance imaging (MRI) revealed worsening of the dilatation of the main pancreatic duct, cyst enlargement, and a low-signal area in the main pancreatic duct on T2-weighted images (Fig.Ā 4d, e). The patient underwent contrast-enhanced CT, contrast-enhanced endoscopic ultrasound (EUS), serum tumor marker assessments, and serum miRNA sequencing. A tumor, 5 mm in diameter, in the main pancreatic duct was detected on contrast-enhanced CT and EUS (Fig.Ā 4f, g). CA19-9 of 12.1, CEA of 2.3, the miRNA index of 0.92, and the miRNA+CA19-9 index of 0.66 indicated negative, negative, positive, and positive, respectively. Pancreatic juice cytology revealed adenocarcinoma, and the patient was diagnosed with pancreatic cancer at clinical stage I. The patient underwent surgery and was pathologically diagnosed with stage IB pancreatic cancer.

Case 2: An asymptomatic male in his 70 s visited Kyoto University Hospital because of main pancreatic duct dilatation detected on MRI screening (Fig.Ā 4h). The patient underwent AUS, contrast-enhanced CT, positron emission tomography-CT (PET-CT), EUS (Fig.Ā 4i–k), serum tumor marker measurements, and serum miRNA sequencing. Dilatation of the main pancreatic duct was detected in CT, MRI, and EUS. A tumor could not be detected by AUS, CT, MRI, or PET-CT but could only be detected by EUS. CA19-9 of 20.5, CEA of 4.2, the miRNA index of 0.45, and the miRNA+CA19-9 index of 0.68 indicated negative, negative, negative, and positive, respectively. Histological examination of EUS-FNA samples from the tumor revealed adenocarcinoma, and the patient was diagnosed with clinical stage I pancreatic cancer. The patient underwent surgery and was pathologically diagnosed with stage IA pancreatic cancer (Fig.Ā 4l).

Discussion

Early diagnosis of pancreatic cancer is difficult because of nonspecific symptoms, with stage I cases accounting for only 4.1% [7] of all pancreatic cancer cases. Sensitive and specific biomarkers for identifying patients with early-stage pancreatic cancer are urgently required. In this study, we comprehensively analyzed serum miRNA profiles using NGS of all serum samples from patients with pancreatic cancer and healthy controls in a large sample set (N = 425) collected from 17 centers, including 46 patients with pancreatic cancer in stage 0-I. The discrimination models generated using the training cohort were validated using an independent validation cohort (N = 240). High diagnostic accuracy (AUC:0.98) was validated even in an independent early-stage (stage 0-I) pancreatic cancer cohort (N = 30), which included 21 patients with asymptomatic pancreatic cancer. Using an ensemble model that combines machine learning algorithms, our model has good discriminative ability and high robustness.

Liu et al. [27] and Nakamura et al. [30] recently reported that a combination of miRNAs in plasma with serum CA19-9 can discriminate patients with pancreatic cancer from healthy controls with high sensitivity and specificity (AUC of 0.98 and 0.99, respectively). However, these prediction models have not been validated using an independent validation cohort [27, 30]. In this study, we validated our pancreatic cancer prediction models, the miRNA model and the miRNA+CA19-9 model, with an independent validation cohort to differentiate patients with pancreatic cancer from healthy controls.

One of the greatest strengths of this study was its ability to discriminate patients with pancreatic cancer in the early stage (stage 0-I) from healthy controls with high sensitivity and specificity. By comprehensively analyzing all the miRNA profiles and using AutoML, our miRNA+CA19-9 model showed the highest performance in terms of AUC for discriminating patients with pancreatic cancer from healthy controls compared with all previously reported miRNA studies in which the performance was validated in independent cohorts containing over 10 patients with stage 0-I pancreatic cancer [30]. Furthermore, the high performance of our discrimination models was validated in 21 asymptomatic patients with pancreatic cancer in stage 0-I. These results suggest that the 100 highly expressed miRNAs and their combination with CA19-9 could be biomarkers for detecting asymptomatic patients with pancreatic cancer in stage 0-I.

Most of the 100 miRNAs used in our discrimination models were dismissed in previous studies. This is possibly because previous studies focused only on miRNAs that were differentially expressed between patients with pancreatic cancer and controls, and constructed discrimination models using them. We extracted highly and robustly expressed miRNAs and used them for model construction, regardless of differences in expression levels.

Another strength of this study is that we used large-scale miRNA sequencing data from 425 serum samples using an ensemble model that combined machine learning algorithms to identify the best combination of miRNAs for pancreatic cancer screening. Compared to conventional cancer discriminations with miRNA, which sets a threshold on the expression levels of a small number of miRNAs added together [29], combining a large number of miRNAs through machine learning improves the discrimination ability of pancreatic cancer. A possible disadvantage of machine learning is that the time cost of creating models is high; however, AutoML can build models faster.

This study had a few limitations. First, the outcome was confounded by age because healthy volunteer blood donors were younger than the patients with pancreatic cancer. However, separate analyses of cases and controls revealed no correlation between age, drinking habits, smoking history, miRNA index, and the miRNA+CA19-9 index. These findings suggest that the models may be practical, independent of age, drinking habits, and smoking history, even under more realistic clinical conditions. Second, we only assessed patients with pancreatic cancer and healthy participants retrospectively. This limitation raises the question of whether these discrimination models are useful for other populations, especially high-risk populations with certain inherited genetic syndromes, a history of familial pancreatic cancer, or IPMN, which are precancerous lesions of pancreatic cancer. Therefore, future prospective screening studies involving high-risk populations and asymptomatic adults are required to successfully translate these findings to clinical settings. Third, it is unknown whether this miRNA model is sensitive to patients with other cancers, such as biliary cancer and colon cancer. In the future, it would be ideal to create diagnostic miRNA models of other cancers and to discriminate cancer patients from healthy controls using diagnostic discrimination models after comprehensive miRNA sequencing. Fourth, it is unclear whether the serum 100 miRNAs are produced from pancreatic cancer cells and how the 100 miRNAs function in pancreatic cancer. A previous report showed that miR-155, miR-181a, miR-181b, miR-21, miR-221, and miR-222 were upregulated in pancreatic cancer tumor, as confirmed by microarray and qPCR [46]. We examined if the reported miRNAs upregulated in the tumor were also upregulated in the serum. Some miRNAs (hsa-miR-181b-5p, hsa-miR-21-5p, and hsa-miR-222-3p) in the serum were slightly upregulated, while others were not in patients with pancreatic cancer compared with healthy controls. One possible reason is that the serum miRNA did not necessarily originate from the tumor tissue (Supplementary Fig.Ā 5A–F). Fifth, the miRNA profile is affected by quantification methods such as library preparation protocol and measurement platform [47,48,49]. We investigated whether the high discrimination performance can be delivered using miRNA profiles measured by different platforms. Notably, our data suggested that the miRNA and miRNA+CA19-9 models constructed in this study robustly discriminated even when miRNA profiles were measured on Ion GeneStudio S5, a platform other than the NextSeq 550 used to create the models (Supplementary Fig.Ā 6, Supplementary TableĀ 6). There are three possible reasons for the robustness. First, both NextSeq 550 and Ion GeneStudio S5 accurately measured miRNA. Second, library preparation was highly automated. Third, we selected high-performance models from numerous blueprints of models and ensembled them with autoML. Furthermore, to validate the generality of the machine learning diagnostic model of pancreatic cancer with comprehensive miRNA expression profiles, we also created a model using miRNA profiles measured with the Ion GeneStudio S5 system. We found the model has a high performance comparable to that of the Illumina sequencing platform (Supplementary Fig.Ā 7, Supplementary TableĀ 7).

In conclusion, by comprehensively analyzing all miRNA profiles and using AutoML, the 100 highly expressed miRNAs and a combination of those with CA19-9 could discriminate patients with pancreatic cancer, even in the early stages (stage 0-I), from healthy controls. High diagnostic accuracy (AUC:0.98) was validated in an independent early-stage (stage 0-I) pancreatic cancer cohort, which included 21 asymptomatic patients. Our data demonstrate that the 100 highly expressed miRNAs and their combination with CA19-9 could be used as biomarkers for the specific and early detection of pancreatic cancer. We plan to prospectively study the utility of these discriminatory models for high-risk populations of pancreatic cancer.