Effective reduction of unnecessary biopsies through a deep-learning-assisted aggressive prostate cancer detector

Rodrigues, Nuno M.; de Almeida, José Guilherme; Verde, Ana Sofia Castro; Gaivão, Ana Mascarenhas; Bireiro, Carlos; Santiago, Inês; Ip, Joana; Belião, Sara; Matos, Celso; Vanneschi, Leonardo; Tsiknakis, Manolis; Marias, Kostas; Regge, Daniele; Silva, Sara; Papanikolaou, Nickolas

doi:10.1038/s41598-025-99795-y

Article
Open access
Published: 30 April 2025

Nuno M. Rodrigues^1,2^na1,
José Guilherme de Almeida¹^na1,
Ana Sofia Castro Verde¹,
Ana Mascarenhas Gaivão³,
Carlos Bireiro³,
Inês Santiago³,
Joana Ip³,
Sara Belião³,
Celso Matos¹,
Leonardo Vanneschi⁴,
Manolis Tsiknakis^5,6,
Kostas Marias^6,7,
Daniele Regge^8,9,
Sara Silva²^na2,
The ProCAncer-I Consortium &
…
Nickolas Papanikolaou^1,10^na2

Scientific Reports volume 15, Article number: 15211 (2025)

Abstract

Despite being one of the most prevalent cancers, prostate cancer (PCa) shows a significantly high survival rate, provided there is timely detection and treatment. Currently, several screening and diagnostic tests are required to be carried out in order to detect PCa. These tests are often invasive, requiring either a biopsy (Gleason score and ISUP) or blood tests (PSA). Computational methods have been shown to help this process, using multiparametric MRI (mpMRI) data to detect PCa, effectively providing value during the diagnosis and monitoring stages. While delineating lesions requires a high degree of experience and expertise from the radiologists, being subject to a high degree of inter-observer variability, often leading to inconsistent readings, these computational models can leverage the information from mpMRI to locate the lesions with a high degree of certainty. By considering as positive samples only those that have an ISUP\(\ge\)2 we can train aggressive index lesion detection models. The main advantage of this approach is that, by focusing only on aggressive disease, the output of such a model can also be seen as an indication for biopsy, effectively reducing unnecessary biopsy screenings. In this work, we utilize both the highly heterogeneous ProstateNet dataset, and the PI-CAI dataset, to develop accurate aggressive disease detection models.

Introduction

Prostate cancer (PCa) is the most prevalent cancer in men and the second most prevalent across genders¹. However, PCa is also characterized by a low mortality rate provided there is early detection, a key factor in ensuring positive treatment outcomes. While biopsies constitute an essential step in diagnosing and stratifying prostate cancer, false positives or incorrect risk assessments can lead to over-treatment. Together with treatment side effects, this may result in a loss of quality of life for the patients, making it imperative to carefully consider treatment choices². The development of computer-aided diagnosis (CAD) models capable of providing “virtual biopsies” assisted by biparametric MRI (bpMRI) has the potential to reduce unnecessary biopsies and improve the risk assessment process. Indeed, the typical process for the recommendation of a biopsy consists of the analysis by an expert radiologist who will recommend a biopsy based on a positive (>2) or negative (<3) Prostate Imaging-Reporting and Data System (PI-RADS) score³, a process with a high rate of false positives⁴.

While the performance of automated systems is seldom as good as that of expert radiologists⁵, the latter commonly suffer from inter- and intra-expert variability^6,7, which can be a limiting factor in deciding between performing or not performing a biopsy or even in choosing an appropriate treatment. Computational models have the benefit of producing consistent results provided the input data is identical, with the caveat that performance degradation is common when transferring models between scanner manufacturers⁸ or, in the case of prostate bpMRI, scanner manufacturers and the use of endorectal coil. However, some works have explored the benefits of using large multi-centric heterogeneous datasets to improve the robustness and performance of the models, effectively reducing the effects of domain-shift^9,10,11.

Recent CAD models have shown potential in several clinical applications for PCa, from disease aggressiveness classification^12,13,14 to lesion segmentation and detection^{9,15,16,17,18,19,20,21,22,23,24,25}. However, these works seldom focus on unnecessary biopsy reduction, a clinical endpoint which has direct implications for patient care. Additionally, they tend to make use of single-centric datasets and rarely include a prospective validation of the developed models. Here, we make use of the publicly available PI-CAI^25,26, as well as ProstateNet (https://prostatenet.eu), a large-scale multi-centric dataset of multiparametric prostate MRI to train aggressive lesion segmentation models. We show that using heterogeneous datasets leads to improved segmentation and lesion detection performance, and validate it using a hold-out test set. Through a simulated clinical feasibility analysis, we show how the combination of medical recommendations with our fully automatic models can lead to an effective reduction in the number of unnecessary biopsies with no significant reduction in Recall, effectively reducing the number of false positives. Finally, we validate all aspects of this approach using prospective data.

Methods

Data

In this study, two different datasets were used: PI-CAI²⁶ and ProstateNet (also referred to as PNet). Each dataset is composed of a retrospective cohort, with ProstateNet also having a prospective cohort. The following are the descriptions of the datasets:

PI-CAI is a collection of biparametric MRI volumes that include T2W, DWI and ADC sequences. These samples were acquired by three Dutch clinical centers (Radboud University Medical Center (RUMC), Ziekenhuis Groep Twente (ZGT), University Medical Center Groningen (UMCG)), and one Norwegian center (Norwegian University of Science and Technology (NTNU)), plus the additional inclusion of 329 cases from the ProstateX dataset²⁷. These clinical centers used only Siemens Healthineers or Philips Medical Systems-based 1.5T or 3T MRI scanners with surface coils to acquire the images, following the biparametric prostate MRI protocol²⁸. As stated in the official document of the dataset²⁶, ISUP values of 0 represent confirmed negatives or cases without the required 3-year follow-up. In total, 1009 biparametric sequences were used.
ProstateNet (PNet) is a collection of biparametric MRI volumes that include T2W, DWI and ADC sequences. These samples were acquired by 12 clinical partners of the ProCAncer-I project. These partners used Siemens (Aera, Skyra, Sola, Avanto, VIDA, Tim, Prisma, Veri, Symphony, Osirix), Philips (Ingenia, Achieva, Multiva) and GE scanners (Optima, Signa, DISCOVERY). Given that each centre has specific acquisition protocols, no single protocol was used across all mpMRI studies. All labels were acquired manually, and for each sample, the label consists of the index lesion (mandatory) and additional lesions that the patient has (optional). ISUP values of 0 represent cases confirmed negative after 1 year of follow-up or non-confirmed cases. In total, 1484 biparametric sequences were used.

To maximize data variability, both datasets were combined into a global one, dubbed PNetCAI. Table 1 shows the composition of the different retrospective datasets regarding scanner manufacturers and ISUP grades, while Table 2 does the same for the prospective cohort. The prospective cases were downloaded from the ProstateNet platform on February 26th 2024. From these numbers, \(15\%\) of the samples were used as a hold-out test set, and the remaining were used for training, following a 5-fold cross-validation (CV) strategy.
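
The splitting strategy described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the random seed, and the round-robin fold assignment are assumptions, and the source does not state whether the split was stratified (for example by ISUP grade or scanner manufacturer).

```python
import random


def split_holdout_and_folds(case_ids, holdout_fraction=0.15, n_folds=5, seed=0):
    """Shuffle case IDs, reserve a hold-out test set, and split the rest into folds.

    Hypothetical helper: a plain random split, with no stratification.
    """
    ids = list(case_ids)
    random.Random(seed).shuffle(ids)
    n_test = round(len(ids) * holdout_fraction)
    test_ids, train_ids = ids[:n_test], ids[n_test:]
    # Round-robin assignment keeps fold sizes within one case of each other.
    folds = [train_ids[i::n_folds] for i in range(n_folds)]
    return test_ids, folds


def cv_iterations(folds):
    """Yield (training_ids, validation_ids) for each cross-validation round."""
    for k, validation_ids in enumerate(folds):
        training_ids = [c for j, fold in enumerate(folds) if j != k for c in fold]
        yield training_ids, validation_ids
```

In each round, four folds are used for training and the remaining fold for validation; the hold-out test set is never seen during cross-validation.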

Table 1 Stratification of samples of the retrospective data cohort. On the left, number of samples by scanner manufacturer and by ISUP score for the retrospective cohorts. On the right, number and proportion of samples on the training and test sets.

A connected component analysis was conducted on the training labels of both datasets (Fig. 1), revealing that 16 samples from the PI-CAI dataset that were labelled as aggressive (ISUP \(\ge\) 2) had empty lesion masks. This was cross-checked with the files present in the PI-CAI repository. A comparison between the size of the lesions on both datasets and their effect on the Dice score is presented in the “Results” section.

Table 2 Stratification of samples by scanner manufacturer and ISUP score for the prospective cohort of ProstateNet.

Biparametric data processing

In order to use all mpMRI sequences as a single volume, both DWI and ADC sequences were resampled to the same space and size of the T2W sequences. Both T2W and DWI images were normalized using Z-scoring normalization, while ADC images were normalized by clipping the intensity values to the 0.5 and 99.5 percentiles, followed by subtracting the mean and dividing by the standard deviation.
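
A minimal sketch of the two intensity normalization schemes described above, operating on a flat list of voxel intensities. The function names and the linearly interpolated percentile are assumptions made for illustration; the source does not specify how the percentiles were computed, and resampling to the T2W grid is not shown.

```python
import statistics


def z_score(values):
    """Subtract the mean and divide by the (population) standard deviation.

    Assumes the intensities are not all identical (non-zero standard deviation).
    """
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]


def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in [0, 100]) of pre-sorted values."""
    pos = (len(sorted_values) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] * (1 - frac) + sorted_values[hi] * frac


def normalize_adc(values, low_q=0.5, high_q=99.5):
    """Clip to the [0.5, 99.5] percentile range, then apply z-scoring."""
    ordered = sorted(values)
    lo, hi = percentile(ordered, low_q), percentile(ordered, high_q)
    clipped = [min(max(v, lo), hi) for v in values]
    return z_score(clipped)
```

Clipping before z-scoring limits the influence of extreme ADC values on the mean and standard deviation, so a handful of outlier voxels does not compress the rest of the intensity range.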

Deep learning model specification

All 3D deep-learning (DL) detection models that were trained were full-resolution nnUNet models²⁹ that use deep supervision³⁰. The networks are implemented in PyTorch³¹ and were trained for 1000 epochs (250 mini-batches per epoch). To train the nnUNet models, we used the provided 3D full resolution architecture. This framework uses stochastic gradient descent with Nesterov momentum \((\mu =0.99)\), a maximum initial learning rate of 0.001, and a polynomial³² learning rate policy, which scales the initial learning rate by a factor of \((1 - \textrm{epoch}/\textrm{epoch}_{\max})^{0.9}\) at each epoch. Initial tests showed that the default learning rate of the nnUNet (0.01) was too high, resulting in underfitting on some of the folds, which is why we decided to use a lower, more common value. The loss function was a simple average of Dice and cross-entropy losses and the batch size was 2 sequences per iteration. The nnUNet applies automatic preprocessing based on the dataset fingerprint, and therefore the models for each dataset worked on data with slightly different spatial structures:

ProstateNet: spacing = \(0.5\times 0.5\times 3.0\)mm; crop size = \(256\times 256\times 30\) voxels
PI-CAI: spacing = \(0.4\times 0.4\times 3.0\)mm; crop size = \(384\times 384\times 21\) voxels
PNetCAI: spacing = \(0.5\times 0.5\times 3.0\)mm; crop size = \(384\times 384\times 23\) voxels
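
The polynomial learning rate policy described above can be written in a few lines. This is a sketch using the values reported in the text (initial learning rate 0.001, 1000 epochs, exponent 0.9); the function name is hypothetical, and the sketch assumes the factor is applied to the initial learning rate rather than compounded from epoch to epoch.

```python
def poly_learning_rate(epoch, max_epochs=1000, initial_lr=0.001, exponent=0.9):
    """Learning rate at a given epoch under polynomial decay."""
    return initial_lr * (1 - epoch / max_epochs) ** exponent


# Learning rate at the start, at the midpoint, and at the end of training.
schedule = [poly_learning_rate(e) for e in (0, 500, 1000)]
```

The schedule starts at the initial learning rate and decays smoothly to zero at the final epoch.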

Based on recent work^11,33, no transformer-based models (ViT) were evaluated, as they were shown to perform significantly worse than nnUNet models. This is further justified by the original ViT paper, which states the need for very large datasets (over 1 million images) to train a ViT model from scratch³⁴.

Network calibration

Previous work³⁵ and prior experiments conducted by us for whole gland segmentation have shown that calibrating segmentation models significantly improves their performance. Given this, we decided to use the findings from Murugesan et al.³⁵ and change the nnUNet loss function to include both label-smoothing³⁶ and margin loss. We applied an \(\alpha\) smoothing factor of 0.2 and a margin of 10 to the loss function.
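
To make the two calibration terms concrete, the sketch below computes them for the logits of a single voxel. It is an illustration under stated assumptions, not the authors' loss: the way the two terms are combined, the `penalty_weight` value, and all function names are hypothetical, and the Dice component of the nnUNet loss is omitted. The margin term follows the common formulation that penalizes logits lying more than a margin below the largest logit.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def smoothed_cross_entropy(logits, target, alpha=0.2):
    """Cross-entropy against a label-smoothed target distribution."""
    n = len(logits)
    probs = softmax(logits)
    # Smoothed target: (1 - alpha) on the true class plus alpha spread uniformly.
    soft = [(1 - alpha) * (1.0 if k == target else 0.0) + alpha / n for k in range(n)]
    return -sum(t * math.log(p) for t, p in zip(soft, probs))


def margin_penalty(logits, margin=10.0):
    """Sum of logit distances to the largest logit that exceed the margin."""
    top = max(logits)
    return sum(max(0.0, top - l - margin) for l in logits)


def calibrated_voxel_loss(logits, target, alpha=0.2, margin=10.0, penalty_weight=0.1):
    """Hypothetical combination: smoothed cross-entropy plus a weighted margin penalty."""
    return smoothed_cross_entropy(logits, target, alpha) + penalty_weight * margin_penalty(logits, margin)
```

Both terms discourage overconfident predictions: label smoothing softens the target distribution, while the margin penalty only activates once the gap between logits grows beyond the chosen margin.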

Technical specifications

To train the models for this project, we used a machine with the following specifications: 2\(\times\) NVIDIA RTX A6000 GPUs, AMD Ryzen Threadripper 3990X 64-Core Processor, and 64GB DDR4 RAM with 2200MHz clock speed. Each fold of each model took approximately 13h to finish.

Model evaluation

During the 5-fold CV, each model was evaluated based on its Dice Score (DS) and Recall when comparing the predicted output mask to that of the ground truth. When evaluating the performance on both the retrospective hold-out test and the prospective cohort, the same metrics were not computed on the vanilla output of the model, but on the candidate lesions obtained by following the subsequent methodology:

1. Taking the probability maps that the model outputs, a threshold of 10% was defined, clipping all voxels with a probability lower than 10%, generating a soft blob;
2. Taking those soft blobs, we employed the heuristics proposed by Bosma³⁷ and assigned all lesion candidates to their respective ground truth through a linear sum assignment algorithm;
3. All candidates that had a confidence above 10% (the confidence is the maximum probability within the candidate) were kept and turned into hard blobs (binary segmentation masks). All other candidates (i.e. candidates with a confidence below 10%) were excluded and not analyzed any further. This threshold was selected as it reflects what has been used previously in the literature for prostate lesion candidate selection³⁷;
4. Lastly, all hard blobs that had an intersection with the prostate gland of less than 10% (meaning they should be almost entirely outside the prostate, while still accounting for extracapsular extension) were classified as negative. The segmentations for the prostate gland were obtained using the whole gland segmentation model dubbed ProstateAll from Rodrigues et al.¹¹;
5. In order to perform a more rigorous assessment, only hard blobs with at least 10% intersection with the original lesion masks were considered positive, regardless of having located any other lesion present in the same sample. This assessment, despite lowering some of the scores as opposed to simply locating any lesion, provides a more realistic clinical application scenario.
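
The filtering steps above can be illustrated with a simplified sketch. It is not the pipeline used in the study: plain 6-connected component grouping stands in for the candidate extraction heuristics of Bosma³⁷, the linear sum assignment step is omitted, and both overlap fractions are measured relative to the candidate's own voxel count, which is an assumption. Probability maps are represented as dictionaries from `(z, y, x)` coordinates to probabilities, and masks as sets of coordinates; all names are hypothetical.

```python
from collections import deque

NEIGHBOURS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def extract_candidates(prob_map, threshold=0.10):
    """Group voxels with probability >= threshold into 6-connected candidates."""
    kept = {v for v, p in prob_map.items() if p >= threshold}
    candidates, seen = [], set()
    for start in sorted(kept):
        if start in seen:
            continue
        component, queue = set(), deque([start])
        seen.add(start)
        while queue:  # breadth-first flood fill
            z, y, x = queue.popleft()
            component.add((z, y, x))
            for dz, dy, dx in NEIGHBOURS:
                nb = (z + dz, y + dy, x + dx)
                if nb in kept and nb not in seen:
                    seen.add(nb)
                    queue.append(nb)
        candidates.append(component)
    return candidates


def filter_candidates(candidates, prob_map, gland_mask,
                      min_confidence=0.10, min_gland_overlap=0.10):
    """Keep candidates that are confident enough and overlap the prostate gland."""
    kept = []
    for component in candidates:
        confidence = max(prob_map[v] for v in component)  # maximum probability
        gland_fraction = len(component & gland_mask) / len(component)
        if confidence > min_confidence and gland_fraction >= min_gland_overlap:
            kept.append(component)
    return kept


def is_true_positive(candidate, lesion_mask, min_overlap=0.10):
    """A candidate counts as positive if enough of it lies inside the lesion mask."""
    return len(candidate & lesion_mask) / len(candidate) >= min_overlap
```

With this representation, a candidate lying entirely outside the gland mask is discarded even if its confidence is high, which mirrors the role of step 4.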

Each model was tested in all available retrospective hold-out sets and on the prospective cohort. The training/testing setup is summarized in Figure 2.

Additionally, we also calculated the Hausdorff Distance (HD), Average Symmetric Surface distance (ASSD), and Relative Absolute Volume Difference (RAVD) during quality assessment of the model, as these metrics provide a quantitative measure of the spatial accuracy by considering the shape and volume of the segmented regions³⁸ (both distance metrics were calculated using MedPy³⁹). The evaluations and details of each metric are available in the Supplementary Methods (A.1).
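
For reference, the overlap and volume metrics can be sketched on masks represented as sets of voxel coordinates. These are the standard textbook definitions and may differ in detail from those given in the Supplementary Methods; the convention of returning 1.0 when both masks are empty is an assumption, and the function names are hypothetical. The distance metrics (HD and ASSD) are not shown.

```python
def dice_score(pred, truth):
    """Dice = 2 |P ∩ T| / (|P| + |T|), with 1.0 when both masks are empty."""
    if not pred and not truth:
        return 1.0
    return 2 * len(pred & truth) / (len(pred) + len(truth))


def relative_absolute_volume_difference(pred, truth):
    """RAVD = | |P| - |T| | / |T|, defined only for a non-empty ground truth."""
    return abs(len(pred) - len(truth)) / len(truth)
```

Note that RAVD compares volumes only: two masks of equal size that do not overlap at all have an RAVD of 0, which is why it is reported alongside overlap and distance metrics.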

Results

Model performance is affected by train-test similarity

As previously mentioned in “Model evaluation”, we follow a two-step process in order to select the most appropriate lesion candidates: lesion candidates are selected similarly to what has been described in³⁷, followed by a lesion filtering process that keeps only lesions with a 10% overlap with the whole prostate gland. Table 3 presents the cross-validation results of all developed models. Given that the models were trained as regular index lesion segmentation models, the resulting low Dice scores are a likely consequence of the heterogeneous nature of lesion annotation for the datasets used during training. We also note that bpMRI models outperform T2W models; this is expected, as both DWI and ADC sequences provide information in the form of hyper- and hypo-intense areas, which is much more relevant for lesion localization when compared to T2W sequences. The Recall also shows that bpMRI models, in particular the PI-CAI and PNetCAI models, can detect almost all lesions, achieving a maximum Recall score of 0.9 (90%), while their respective T2W counterparts can only locate approximately 65% of the lesions.

Table 3 CV results. For each dataset, the average Dice, Hausdorff, RAVD, ASSD and Recall performance, along with their respective standard deviations, are presented. The highest recall value per sequence combination is highlighted in bold for easier comparison. p-values for the T-test significance comparing the Dice score between bpMRI PNetCAI results and each other model are also shown, with significant differences (p-value \(< 0.01\)) marked as green or red if the bpMRI PNetCAI results are better or worse, respectively.

The similarity between training and testing data (i.e., training and testing models on training and hold-out datasets constructed from the same dataset) can also be an important factor affecting performance. While T2W models trained on PNet data perform well only on data from PNet (\(\textrm{Dice} = 0.34\) and \(\textrm{Dice} = 0.13\) for T2W PNet models tested on PNet and PI-CAI, respectively), PI-CAI models are more consistent (\(\textrm{Dice} = 0.34\) and \(\textrm{Dice} = 0.30\) for T2W PI-CAI models tested on PNet and PI-CAI, respectively; Tables 4, 5), an effect which is also consistent for Recall. However, using bpMRI leads to considerably worse performance in terms of both Dice and Recall for PI-CAI models tested on PNet data (Tables 4, 5); indeed, for bpMRI models, which outperform T2W models, performance is only consistently good for PNetCAI models. In other words, models perform consistently better only when there is some similarity between training and testing data.

This can be further observed in Table 6, where the bpMRI PNetCAI model outperforms the bpMRI PNet model on its hold-out test set, while differing by only 2 lesions from the bpMRI PI-CAI model on its test set. Furthermore, after a manual analysis of these missed cases, we discovered that both were out-of-distribution samples with very large fields of view.

Table 4 Hold-out test set results. For each pairwise evaluation, the average Dice, Recall and Precision performances are presented. The best Recall result for each dataset per sequence combination is highlighted in bold for easier comparison.

Table 5 T-test p-values for the pairwise comparison of the Dice scores presented in Table 4. Significant differences (p-value \(< 0.01\)) are marked as green.

Table 6 Hold-out test set results stratified by the ISUP grade of the lesions. For each pairwise evaluation, the number of predicted lesions is compared to the total number of lesions. The best-performing model (most successful detections) for each dataset per sequence combination is highlighted in bold.

Trade-off between avoiding biopsies and dangerous underestimates

To understand whether the best performing model—trained on bpMRI PNetCAI data—could be used as a CAD system for the effective reduction of biopsies (i.e. correctly predicting when an individual has no aggressive lesions), we first determined how many lesions were present in each case and calculated the number of detected lesions for all models. We then performed a simple experiment assigning lesions to one of six categories:

Correct + avoided biopsy: if no lesions were present and the model correctly estimated this (i.e. recommended avoiding an unnecessary biopsy);
Correct: if one or more lesions were present and the model correctly estimated the number of lesions;
Overestimate: if one or more lesions were present and the model overestimated the number of lesions;
Overestimate + unnecessary biopsy: if no lesions were present and the model overestimated the number of lesions (i.e. recommended an unnecessary biopsy);
Underestimate: if two or more lesions were present and the model detected at least one lesion but fewer than the correct number;
Dangerous underestimate: if two or more lesions were present and the model detected no lesions (i.e. recommended avoiding a necessary biopsy).
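
The six categories above map directly onto a comparison of two counts per case. The sketch below is a hypothetical helper, not the authors' code; it additionally assumes that a case with exactly one lesion and no detections is also a dangerous underestimate, which the list does not state explicitly.

```python
def categorize(n_true, n_detected):
    """Assign a case to one of the six categories from its lesion counts."""
    if n_true == 0:
        if n_detected == 0:
            return "Correct + avoided biopsy"
        return "Overestimate + unnecessary biopsy"
    if n_detected == n_true:
        return "Correct"
    if n_detected > n_true:
        return "Overestimate"
    if n_detected == 0:
        # Assumption: applies to any case with at least one true lesion.
        return "Dangerous underestimate"
    return "Underestimate"
```

For example, `categorize(0, 2)` returns "Overestimate + unnecessary biopsy", while `categorize(3, 1)` returns "Underestimate".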

This categorization system leads to a consistent trade-off between overestimating the number of lesions while recommending an unnecessary biopsy and avoiding unnecessary biopsies (Fig. 3); in other words, these systems could have the potential of reducing the number of biopsies, but this setup has to be carefully considered, as it could also result in avoiding biopsies for patients who would require them. A concerning aspect of this analysis is that only in one instance (PNetCAI models tested on PNet data) does it fulfill the task of reducing the number of biopsies without missing any relevant predictions (Table 7).

Additionally, there is a consistently large number of recommended unnecessary biopsies—indeed, for bpMRI PNetCAI models tested on PNet data, 54.05% of cases (n = 120) would have an unnecessary biopsy recommended, while only 17.76% of cases (n = 27) would avoid an unnecessary biopsy. This can have a negative impact on the well-being of individuals who have to undergo these unnecessary biopsies.

Table 7 Absolute and relative frequency of bpMRI AI system recommendations, stratified by training and testing dataset. Counts are displayed between brackets after percentages.

Prospective validation of a simulated clinical decision system

As noted above, an automated system based solely on our models would either lead to dangerous underestimates (i.e. no lesion detected when a lesion was present) or an excess of unnecessary biopsies. To curtail these negative aspects, we devised a clinical decision protocol requiring the interaction of two different decisions, one made by a radiologist (i.e. determine that an individual should have a follow-up biopsy) and the other made by our CAD system: (i) if a radiologist does not recommend a follow-up biopsy, none is performed; (ii) if a radiologist recommends a follow-up biopsy and our model recommends no follow-up biopsy, this is not performed; and (iii) if a radiologist and our model recommend a follow-up biopsy, a biopsy is performed. In effect, this is the ideal case scenario for a model which is highly sensitive but whose specificity is relatively low (i.e. the model produces an excess of false positives).
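
The three rules of this protocol reduce to a logical AND of the two recommendations. The sketch below illustrates this and shows how avoided and missed biopsies could be tallied for a cohort; the function names and the case tuple layout are hypothetical, and the example cases are made up for illustration rather than taken from the study.

```python
def biopsy_performed(radiologist_recommends, model_recommends):
    """A biopsy is performed only if both the radiologist and the model recommend it."""
    return radiologist_recommends and model_recommends


def summarize(cases):
    """Count biopsies the model vetoes among those the radiologist recommended.

    cases: iterable of (radiologist_recommends, model_recommends, has_aggressive_lesion).
    """
    avoided_unnecessary = missed_necessary = 0
    for radiologist, model, aggressive in cases:
        if radiologist and not biopsy_performed(radiologist, model):
            if aggressive:
                missed_necessary += 1
            else:
                avoided_unnecessary += 1
    return {"avoided_unnecessary": avoided_unnecessary,
            "missed_necessary": missed_necessary}


# Made-up example cohort of four cases.
toy_cases = [(True, True, True), (True, False, False),
             (False, True, False), (True, False, True)]
toy_summary = summarize(toy_cases)
```

Because the model can only veto a biopsy that the radiologist has already recommended, the combined system can never produce more biopsies than the radiologist alone; the risk it introduces is a vetoed biopsy in a patient with aggressive disease, counted here as `missed_necessary`.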

To avoid the self-fulfilling prophecy of developing models and testing them on the same data, we used a ProstateNet prospective cohort of 73 cases (21 aggressive PCa) to determine whether such a strategy could be beneficial. In terms of prospective segmentation and detection performance, these models perform similarly to those trained and tested with retrospective data (Table 8). Lastly, and most importantly, our results show that using a combined CAD system as described above would indeed lead to a reduction of unnecessary biopsies (21.9% of cases [n=16]; Fig. 4) without increasing the dangerous underestimates.

Finally, we assess whether these models are capable of performing reasonably well across different confidence thresholds and whether they can be reliably used at the lesion level. As highlighted in Fig. A.3, these models perform better when confidence thresholds are lower (AUROC is consistently higher when such is the case). Additionally, there is limited applicability for these models as lesion segmentation tools due to their relatively high number of false positives.

Table 8 Prospective cohort results. For each model, per sequence, the average Dice, Recall and Precision performances are presented. The best Recall scores are highlighted in bold for easier comparison.

Determinants of performance

To better understand performance (Dice scores), we analysed two distinct factors: lesion size and whether annotations were derived by an AI or by a radiologist. ProstateNet and PI-CAI have different distributions of lesion size (Fig. 5B), with ProstateNet presenting lesions larger than those in PI-CAI. Indeed, at a significance threshold of 0.05, there is a significant Dice difference between below and above median lesions for both datasets (Fig. 5A). While more evident in the ProstateNet dataset, both sets of data exhibit a size bias where larger lesions are easier to segment. Given that some lesion annotations in PI-CAI are generated by an AI model²⁶, we compared the Dice scores between lesions annotated by AI and by radiologists, showing that the former lead to higher Dice scores than the latter (\(p=7.6\times 10^{-5}\); Fig. 5C). In Fig. 5D, we present a more comprehensive view of these results.

Finally, to acquire a qualitative understanding of prediction quality, we analyzed a subset of true positive and false negative detections at the lesion level for our best-performing model—trained on bpMRI PNetCAI data. Figure 6 offers a concise overview of our analysis, while Figs. A.1 and A.2 present a comprehensive depiction. As highlighted in Fig. A.1, true positives typically encompass all or nearly all of the lesions as annotated by expert radiologists. This is what is expected of such CAD systems, providing information regarding the general area where it thinks the lesion is located to guide the radiologist. When considering negative examples (Fig. A.2), there is a trend—while the lesion annotated by expert radiologists may be missed, the models identify another likely lesion somewhere else in the prostate. In summary, the conclusions derived from our qualitative analysis are as follows:

In each case, our model detected additional existing lesions and/or cysts. Although these were marked as missed cases due to insufficient overlap with the ground truth mask, they nonetheless correctly identified other lesions as aggressive, demonstrating significant clinical value for a CAD system.
In some instances (Fig. A.2), with the fourth example being the only visible one in this set of slices, our model correctly identified the area of interest despite low confidence and probability scores. This demonstrates the utility of our model in guiding radiologists to significant areas regardless of the displayed probability.

Discussion

In this work, we posit a hybrid computer-aided diagnosis (CAD) system combining radiologists and an automatic lesion detection model, which can reduce the number of unnecessary biopsies in the diagnosis of aggressive prostate cancer (ISUP>1) in the general population of patients undergoing biparametric MRI for prostate cancer diagnosis. Through a simulated clinical feasibility scenario, a reduction of approximately 20% of unnecessary biopsies was achieved, with a prospective validation showing that this does not lead to a reduction in the number of detected prostate cancer cases. Ultimately, we highlight how deep-learning methods can assist in the reduction of unnecessary biopsies without leading to decreased sensitivity. This has the potential to reduce patient discomfort and complications following biopsies.

Largely, most CAD systems of the sort seek to solve a similar, albeit separate, problem: that of detecting undiagnosed prostate cancer cases with the objective of increasing sensitivity by reducing the number of false negatives. Our approach considers a different problem: that of reducing the number of unnecessary biopsies (i.e. reducing false positives). Indeed, this is also a considerable problem, as a 2019 meta-review showed that the pooled sensitivity for PI-RADS 2.1 was approximately 91% (95% CI=83%-95%)⁴. Works seeking to automate or partially automate prostate cancer diagnosis contemplate strategies focusing either on the detection of lesions with a sufficiently high PI-RADS score (i.e. 3 or 4)⁴⁰ or on the detection of lesions with a confirmed aggressive histological grade (ISUP>1)^25,41,42. The former has the obvious advantage of requiring no biopsy for training, but hinders the clinical applicability evidenced by the latter. Some of these strategies also incorporate a human-in-the-loop setup, which is more similar to the study design we introduce here⁴³. The relevant performance metric which we can compare between our work and previous works is the Recall: we observed a Recall of 82% for models trained/tested on PNetCAI, slightly lower than what has been previously reported (87.2%⁴³, 89.4%²⁵, 93%⁴²). However, we note that these studies are trained/tested on a relatively small number of clinical centers (4 or fewer)^25,42,43 (which greatly reduces the variability of the data), do not provide confirmation of prospective validation, and do not study the impact of using diverse training datasets on performance. Given the previously reported drop in performance when transferring models between different datasets^10,11,44 and the fact that models (clinical and otherwise) tend to suffer from temporal degradation^45,46,47, such assessments are of paramount importance.
Finally, and to the best of our knowledge, our work offers a unique analysis of performance differences when considering lesion size and annotation types, thus better contextualizing results.

This work has some caveats. The simulated clinical scenario does not allow us to estimate the effect of real-world agents (i.e. medical doctors) interacting with such a CAD system. This may lead to optimistic results, as automation bias (when users excessively trust the output of automatic CAD systems⁴⁸) can lead to unforeseen outcomes when radiologists place excessive trust in wrong predictions made by CAD systems⁴⁹. It should also be highlighted that, while the best performing model detects all important cases in ProstateNet both retrospectively and prospectively (Figs. 3, 4), not all index lesions are detected, which can cause confusion when results are interpreted in a clinical setting; this is in large part associated with how these datasets are annotated: radiologists are tasked with segmenting at least the index lesion, leading to a fair degree of heterogeneity in the annotations. Additionally, performance is relatively poor when we consider the specificity of these models; while this can be improved through the assistance of a radiologist, additional measures for false positive reduction should be considered, such as an auxiliary classification of lesion candidates⁵⁰ or zone-specific PSA density⁵¹. Furthermore, our approach does not focus on lesion location (in particular, we perform predictions at the patient, rather than at the lesion, level), so further studies on this are necessary. Finally, it should be noted that there is no guarantee that nnUNet is the best performing model (“No Free Lunch” theorem): earlier works have suggested that other models may perform better than nnUNet for prostate lesion segmentation⁵⁰, so a more comprehensive assessment with other models could be important.

Data availability

The datasets generated and/or analysed during the current study are available in the PI-CAI repository, https://zenodo.org/records/6624726. The datasets generated and/or analysed during the current study are not publicly available due to data privacy laws but are available from the corresponding author on reasonable request.

References

Siegel, R. L., Miller, K. D., Fuchs, H. E. & Jemal, A. Cancer statistics, 2022. CA Cancer J. Clin. 72, 7–33 (2022).
Resnick, M. J. et al. Long-term functional outcomes after treatment for localized prostate cancer. N. Engl. J. Med. 368, 436–445. https://doi.org/10.1056/NEJMoa1209978 (2013).
Scott, R., Misser, S. K., Cioni, D. & Neri, E. PI-RADS v2.1: What has changed and how to report. SA J. Radiol. 25, 2062 (2021).
Drost, F.-J. H. et al. Prostate MRI, with or without MRI-targeted biopsy, and systematic biopsy for detecting prostate cancer. Cochrane Database Syst. Rev. 4, CD012663 (2019).
Cao, R. et al. Performance of deep learning and genitourinary radiologists in detection of prostate cancer using 3-T multiparametric magnetic resonance imaging. J. Magn. Reson. Imaging 54, 474–483 (2021).
Steenbergen, P. et al. Prostate tumor delineation using multiparametric magnetic resonance imaging: Inter-observer variability and pathology validation. Radiother. Oncol. 115, 186–190 (2015).
Chen, M. Y., Woodruff, M. A., Dasgupta, P. & Rukin, N. J. Variability in accuracy of prostate cancer segmentation among radiologists, urologists, and scientists. Cancer Med. 9, 7172–7182. https://doi.org/10.1002/cam4.3386 (2020).
Kushol, R., Parnianpour, P., Wilman, A. H., Kalra, S. & Yang, Y.-H. Effects of MRI scanner manufacturers in classification tasks with deep learning models. Sci. Rep. 13, 16791 (2023).
Netzer, N. et al. Fully automatic deep learning in bi-institutional prostate magnetic resonance imaging: Effects of cohort size and heterogeneity. Invest. Radiol. 56, 799–808 (2021).
Meglič, J., Sunoqrot, M. R. S., Bathen, T. F. & Elschot, M. Label-set impact on deep learning-based prostate segmentation on MRI. Insights Imaging 14. https://doi.org/10.1186/s13244-023-01502-w (2023).
Rodrigues, N. M. et al. Analysis of domain shift in whole prostate gland, zonal and lesions segmentation and detection, using multicentric retrospective data. Comput. Biol. Med. 171, 108216. https://doi.org/10.1016/j.compbiomed.2024.108216 (2024).
Rodrigues, A. et al. Value of handcrafted and deep radiomic features towards training robust machine learning classifiers for prediction of prostate cancer disease aggressiveness. Sci. Rep. 13. https://doi.org/10.1038/s41598-023-33339-0 (2023).
Pachetti, E. & Colantonio, S. 3d-vision-transformer stacking ensemble for assessing prostate cancer aggressiveness from t2w images. Bioengineering 10. https://doi.org/10.3390/bioengineering10091015 (2023).
Bernatz, S. et al. Comparison of machine learning algorithms to predict clinically significant prostate cancer of the peripheral zone with multiparametric mri using clinical assessment categories and radiomic features. Eur. Radiol. 30, 6757–6769 (2020).
Pellicer-Valero, O. J. et al. Deep learning for fully automatic detection, segmentation, and Gleason grade estimation of prostate cancer in multiparametric magnetic resonance images. arXiv:2103.12650 (2022).
Dai, Z. et al. Segmentation of the prostatic gland and the intraprostatic lesions on multiparametic magnetic resonance imaging using mask region-based convolutional neural networks. Adv. Radiat. Oncol. 5, 473–481 (2020).
Cao, R. et al. Prostate cancer detection and segmentation in multi-parametric MRI via cnn and conditional random field. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019). 1900–1904. https://doi.org/10.1109/ISBI.2019.8759584 (2019).
Hambarde, P. et al. Prostate lesion segmentation in MR images using radiomics based deeply supervised u-net. Biocybern. Biomed. Eng. 40, 1421–1435. https://doi.org/10.1016/j.bbe.2020.07.011 (2020).
Cao, R. et al. Joint prostate cancer detection and Gleason score prediction in MP-MRI via focalnet. IEEE Trans. Med. Imaging 38, 2496–2506. https://doi.org/10.1109/TMI.2019.2901928 (2019).
Hosseinzadeh, M. et al. Deep learning-assisted prostate cancer detection on bi-parametric MRI: Minimum training data size requirements and effect of prior knowledge. Eur. Radiol. 32, 2224–2234 (2022).
Seetharaman, A. et al. Automated detection of aggressive and indolent prostate cancer on magnetic resonance imaging. Med. Phys. 48, 2960–2972 (2021).
Khan, Z., Yahya, N., Alsaih, K., Ali, S. S. A. & Meriaudeau, F. Evaluation of deep neural networks for semantic segmentation of prostate in T2W MRI. Sensors 20, 3183. https://doi.org/10.3390/s20113183 (2020).
Khan, Z., Yahya, N., Alsaih, K., Al-Hiyali, M. I. & Meriaudeau, F. Recent automatic segmentation algorithms of MRI prostate regions: A review. IEEE Access 9, 97878–97905. https://doi.org/10.1109/access.2021.3090825 (2021).
Bashkanov, O. et al. Automatic detection of prostate cancer grades and chronic prostatitis in biparametric MRI. Comput. Methods Programs Biomed. 239, 107624 (2023).
Saha, A. et al. Artificial intelligence and radiologists in prostate cancer detection on MRI (PI-CAI): An international, paired, non-inferiority, confirmatory study. Lancet Oncol. (2024).
Saha, A. et al. The PI-CAI Challenge: Public Training and Development Dataset. https://doi.org/10.5281/zenodo.6517398 (2022).
Armato, S. G. et al. PROSTATEx challenges for computerized classification of prostate lesions from multiparametric magnetic resonance images. J. Med. Imaging (Bellingham) 5, 044501 (2018).
Engels, R. R., Israël, B., Padhani, A. R. & Barentsz, J. O. Multiparametric magnetic resonance imaging for the detection of clinically significant prostate cancer: What urologists need to know. Part 1: Acquisition. Eur. Urol. 77, 457–468. https://doi.org/10.1016/j.eururo.2019.09.021 (2020).
Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. nnU-net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18, 203–211. https://doi.org/10.1038/s41592-020-01008-z (2020).
Zhu, Q., Du, B., Turkbey, B. I., Choyke, P. L. & Yan, P. Deeply-supervised cnn for prostate segmentation. In 2017 International Joint Conference on Neural Networks (IJCNN). 178–184 (2017).
Paszke, A. et al. Pytorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems. Vol. 32. 8024–8035 (Curran Associates, Inc., 2019).
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K. & Yuille, A. L. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFS. arXiv:1606.00915 (2017).
Rodrigues, N. M., Silva, S., Vanneschi, L. & Papanikolaou, N. A comparative study of automated deep learning segmentation models for prostate MRI. Cancers 15. https://doi.org/10.3390/cancers15051467 (2023).
Dosovitskiy, A. et al. An image is worth 16 x 16 words: Transformers for image recognition at scale. https://doi.org/10.48550/ARXIV.2010.11929 (2020).
Murugesan, B., Liu, B., Galdran, A., Ayed, I. B. & Dolz, J. Calibrating segmentation networks with margin-based label smoothing. Med. Image Anal. 87, 102826. https://doi.org/10.1016/j.media.2023.102826 (2023).
Müller, R., Kornblith, S. & Hinton, G. When Does Label Smoothing Help? (Curran Associates Inc., 2019).
Bosma, J. S. et al. Semisupervised learning with report-guided pseudo labels for deep learning-based prostate cancer detection using biparametric MRI. Radiol. Artif. Intell. 5, e230031 (2023).
Yeghiazaryan, V. & Voiculescu, I. Family of boundary overlap metrics for the evaluation of medical image segmentation. J. Med. Imaging (Bellingham) 5, 015006 (2018).
Maier, O. et al. loli/medpy: Medpy 0.4.0. https://doi.org/10.5281/zenodo.2565940 (2019).
Hosseinzadeh, M. et al. Deep learning-assisted prostate cancer detection on bi-parametric MRI: Minimum training data size requirements and effect of prior knowledge. Eur. Radiol. 32, 2224–2234 (2022).
Zhao, L. et al. Predicting clinically significant prostate cancer with a deep learning approach: A multicentre retrospective study. Eur. J. Nucl. Med. Mol. Imaging 50, 727–741 (2023).
Hamm, C. A. et al. Interactive explainable deep learning model informs prostate cancer diagnosis at MRI. Radiology 307, e222276 (2023).
Yu, R. et al. PI-RADSAI: Introducing a new human-in-the-loop AI model for prostate cancer diagnosis based on MRI. Br. J. Cancer 128, 1019–1029 (2023).
Yu, A. C., Mohajer, B. & Eng, J. External validation of deep learning algorithms for radiologic diagnosis: A systematic review. Radiol. Artif. Intell. 4, e210064 (2022).
Bedoya, A. D. et al. Machine learning for early detection of sepsis: An internal and temporal validation study. JAMIA Open 3, 252–260 (2020).
Foote, H. P. et al. Development and temporal validation of a machine learning model to predict clinical deterioration. Hosp. Pediatr. 14, 11–20 (2024).
Vela, D. et al. Temporal quality degradation in AI models. Sci. Rep. 12, 11654 (2022).
Kostick-Quenet, K. M. & Gerke, S. AI in the hands of imperfect users. NPJ Digit. Med. 5, 197 (2022).
Dratsch, T. et al. Automation bias in mammography: The impact of artificial intelligence BI-RADS suggestions on reader performance. Radiology 307, e222176 (2023).
Saha, A., Hosseinzadeh, M. & Huisman, H. End-to-end prostate cancer detection in bpMRI via 3D CNNs: Effects of attention mechanisms, clinical priori and decoupled false positive reduction. Med. Image Anal. 73, 102155 (2021).
Hamm, C. A. et al. Reduction of false positives using zone-specific prostate-specific antigen density for prostate MRI-based biopsy decision strategies. Eur. Radiol. https://doi.org/10.1007/s00330-024-10700-z (2024).

Acknowledgements

This work was supported by FCT through the LASIGE Research Unit, ref. UID/000408/2025, and Nuno Rodrigues PhD Grant 10.54499/2021.05322.BD (https://doi.org/10.54499/2021.05322.BD). Ana Sofia and José Guilherme de Almeida were supported by the European Union H2020: ProCAncer-I project (EU grant 952159). This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia), under the project UIDB/04152/2020 (https://doi.org/10.54499/UIDB/04152/2020) - Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS.

Author information

Nuno M. Rodrigues and José Guilherme de Almeida contributed equally to this work.
These authors jointly supervised this work: Sara Silva and Nickolas Papanikolaou.
A list of authors and their affiliations appears at the end of the paper.

Authors and Affiliations

Computational Clinical Imaging Group, Champalimaud Foundation, Lisbon, Portugal
Nuno M. Rodrigues, José Guilherme de Almeida, Ana Sofia Castro Verde, Celso Matos, Nikolaos Papanikolaou, José Guilherme de Almeida, Ana Castro Verde, Ana Carolina Rodrigues, Nuno Rodrigues, Miguel Chambel & Nickolas Papanikolaou
LASIGE, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal
Nuno M. Rodrigues & Sara Silva
Radiology Department, Champalimaud Clinical Center, Champalimaud Foundation, Lisbon, Portugal
Ana Mascarenhas Gaivão, Carlos Bireiro, Inês Santiago, Joana Ip & Sara Belião
NOVA Information Management School (NOVA IMS), Universidade Nova de Lisboa, Campus de Campolide, 1070-312, Lisboa, Portugal
Leonardo Vanneschi
Institute of Computer Science, Foundation for Research and Technology Hellas (FORTH), 700 13, Heraklion, Greece
Manolis Tsiknakis
Department of Electrical and Computer Engineering, Hellenic Mediterranean University, 710 04, Heraklion, Greece
Manolis Tsiknakis & Kostas Marias
Computational BioMedicine Laboratory (CBML), Institute of Computer Science, Foundation for Research and Technology-Hellas (FORTH), Heraklion, Greece
Kostas Marias, Manolis Tsiknakis, Stelios Sfakianakis, Varvara Kalokyri, Eleftherios Trivizakis, Grigorios Kalliatakis, Avtantil Dimitriadis, Dimitris Fotiadis, Nikolaos Tachos, Eugenia Mylona, Dimitris Zaridis & Charalampos Kalantzopoulos
Department of Radiology, Candiolo Cancer Institute, FPO-IRCCS, Strada Provinciale 142 Km 3.95, 10060, Candiolo, Italy
Daniele Regge, Valentina Giannini, Simone Mazzetti, Giovanni Cappello, Giovanni Maimone & Valentina Napolitano
Department of Surgical Sciences, University of Turin, 10124, Turin, Italy
Daniele Regge
Department of Radiology, Royal Marsden Hospital, Sutton, UK
Nickolas Papanikolaou
Radboud, Nijmegen, Netherlands
Henkjan Huisman, Maarten de Rooij, Anindo Saha, Jasper J. Twilt & Jurgen Futterer
HULAFE-Biomedical Imaging Research Group, Medical Imaging Department, Instituto de Investigación Sanitaria La Fe, Hospital Universitari i Politecnic La Fe, Valencia, Spain
Luis Martí-Bonmatí, Leonor Cerdá-Alberich, Gloria Ribas, Silvia Navarro & Manuel Marfil
Academic Radiology, Department of Translational Research, University of Pisa, Pisa, Italy
Emanuele Neri, Giacomo Aringhieri, Lorenzo Tumminello & Vincenzo Mendola
Department of Radiology, Hacettepe, Ankara, Turkey
Deniz Akata, Mustafa Özmen, Ali Devrim Karaosmanoglu, Firat Atak & Musturay Karcaaltincaba
Department of Radiology (IDI), Institute of Biomedical Research of Girona Dr. Josep Trueta (IDIBGI), Girona, Spain
Joan C. Vilanova
National Cancer Institute, Vilnius, Lithuania
Jurgita Usinskiene, Ruta Briediene, Audrius Untanas & Kristina Slidevska
General Anti-Cancer and Oncological Hospital of Athens, Athens, Greece
Katsaros Vasilis & Georgiou Georgios
Radiology & AI Research Hub, Division of Radiotherapy and Imaging, The Institute of Cancer Research, The Royal Marsden NHS Foundation Trust, London, UK
Dow-Mu Koh, Robby Emsley, Sharon Vit, Ana Ribeiro, Simon Doran & Tiaan Jacobs
Quirónsalud Hospital/CIBERSAM, Valencia, Spain
Gracián García-Martí
Institute of Information Science and Technologies of the National Research Council of Italy, Pisa, Italy
Sara Colantonio, Maria Antonietta Pascali, Eva Pachetti, Giulio del Corso, Danila Germanese, Andrea Berti & Gianluca Carloni
Mass General Hospital, Boston, USA
Jayashree Kalpathy-Cramer & Christopher Bridge
B3D, Birmingham, UK
Joao Correia & Walter Hernandez
Advantis, Athens, Greece
Zoi Giavri, Christos Pollalis & Dimitrios Agraniotis
Quibim, S.L., Valencia, Spain
Ana Jiménez Pastor & Jose Munuera Mora
Univie, Vienna, Austria
Clara Saillant, Theresa Henne & Rodessa Marquez

Authors

Nuno M. Rodrigues
José Guilherme de Almeida
Ana Sofia Castro Verde
Ana Mascarenhas Gaivão
Carlos Bireiro
Inês Santiago
Joana Ip
Sara Belião
Celso Matos
Leonardo Vanneschi
Manolis Tsiknakis
Kostas Marias
Daniele Regge
Sara Silva
Nickolas Papanikolaou

Consortia

The ProCAncer-I Consortium

Manolis Tsiknakis, Kostas Marias, Stelios Sfakianakis, Varvara Kalokyri, Eleftherios Trivizakis, Grigorios Kalliatakis, Avtantil Dimitriadis, Dimitris Fotiadis, Nikolaos Tachos, Eugenia Mylona, Dimitris Zaridis, Charalampos Kalantzopoulos, Nikolaos Papanikolaou, José Guilherme de Almeida, Ana Castro Verde, Ana Carolina Rodrigues, Nuno Rodrigues, Miguel Chambel, Henkjan Huisman, Maarten de Rooij, Anindo Saha, Jasper J. Twilt, Jurgen Futterer, Luis Martí-Bonmatí, Leonor Cerdá-Alberich, Gloria Ribas, Silvia Navarro, Manuel Marfil, Emanuele Neri, Giacomo Aringhieri, Lorenzo Tumminello, Vincenzo Mendola, Deniz Akata, Mustafa Özmen, Ali Devrim Karaosmanoglu, Firat Atak, Musturay Karcaaltincaba, Joan C. Vilanova, Jurgita Usinskiene, Ruta Briediene, Audrius Untanas, Kristina Slidevska, Katsaros Vasilis, Georgiou Georgios, Dow-Mu Koh, Robby Emsley, Sharon Vit, Ana Ribeiro, Simon Doran, Tiaan Jacobs, Gracián García-Martí, Daniele Regge, Valentina Giannini, Simone Mazzetti, Giovanni Cappello, Giovanni Maimone, Valentina Napolitano, Sara Colantonio, Maria Antonietta Pascali, Eva Pachetti, Giulio del Corso, Danila Germanese, Andrea Berti, Gianluca Carloni, Jayashree Kalpathy-Cramer, Christopher Bridge, Joao Correia, Walter Hernandez, Zoi Giavri, Christos Pollalis, Dimitrios Agraniotis, Ana Jiménez Pastor, Jose Munuera Mora, Clara Saillant, Theresa Henne & Rodessa Marquez

Contributions

Conceptualization, N.M.R., J.G.A.; methodology, N.M.R., J.G.A.; software, N.M.R., J.G.A.; validation, N.M.R.; formal analysis, N.M.R., J.G.A.; investigation, N.M.R., J.G.A.; resources, N.M.R., J.G.A.; data curation, N.M.R., J.G.A., A.C.V., A.G., C.B., I.S., J.I., S.B., P.C.; writing-original draft preparation, N.M.R., J.G.A.; writing-review and editing, S.S., L.V., N.P., P.C.; visualization, N.M.R., J.G.A.; supervision, S.S., L.V., N.P.; project administration, N.P., C.M., M.T., K.M., D.R.; funding acquisition, N.P., M.T., K.M., D.R. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Nuno M. Rodrigues.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Rodrigues, N.M., de Almeida, J.G., Verde, A.S.C. et al. Effective reduction of unnecessary biopsies through a deep-learning-assisted aggressive prostate cancer detector. Sci Rep 15, 15211 (2025). https://doi.org/10.1038/s41598-025-99795-y

Received: 18 July 2024
Accepted: 23 April 2025
Published: 30 April 2025
Version of record: 30 April 2025
DOI: https://doi.org/10.1038/s41598-025-99795-y
