Introduction

Prostate cancer mortality is highly dependent upon stage of disease, and assessment of metastatic disease at prostate cancer diagnosis is critical for adequate treatment planning and selection between potentially morbid treatment options. Bone scan staging remains the most widely available tool for quantifying metastatic burden and the tool best supported for basing treatment decisions upon [1, 2]. Yet, recommendations on which patients to scan vary between guidelines and the literature. Guideline recommendations are reported as weak [1] or based only upon expert opinion and grade 2A–C evidence [2, 3]. Recommendations from the primary literature were developed in small selective cohorts [4,5,6], are often based upon insensitive performance measures (such as negative predictive value), are infrequently externally validated and, where validated, were validated in small selective cohorts [7,8,9,10,11,12,13,14,15,16,17,18,19]. Head-to-head comparisons of strategies are also limited [7, 13,14,15,16].

Decision curve analysis offers a novel approach to evaluate these strategies and compare them at various levels of conservatism (preference to avoid missing a positive scan) and tolerance (preference to limit the number of people scanned). This approach compares strategies on net-benefit, which considers both the positive scans detected by a particular strategy and the number of people scanned under it, weighting these two quantities according to the degree of conservatism and tolerance preferred.

We use decision curve analysis to review and validate strategies for bone scan staging in patients with newly diagnosed prostate cancer, comparing them against major clinical guidelines. The aim is to identify optimal strategies for bone scan staging in newly diagnosed prostate cancer patients.

Subjects and methods

Identifying models used for selective bone scan staging

Models were chosen from the published literature and guidelines and validated in the South Australian Prostate Cancer Clinical Outcomes Collaborative (SA-PCCOC) database. A model was defined as any allocation of bone scan positivity risk to a group of newly diagnosed prostate cancer patients based on one or more predictors. The MEDLINE and EMBASE databases were searched for models using the keywords Prostate Cancer, Metastases, Prediction, Staging, Screening and Imaging (with related terms) and an English-only limit. Titles and abstracts were screened for relevance. Abstract-only records and reviews were manually excluded. Articles containing models predicting bone scan positivity from common clinical predictors were further assessed. Those using tests not routinely available (circulating tumour cells, cell-free DNA and similar) were excluded. Common predictors included serum prostate-specific antigen, tumour stage and Gleason score (GS) at diagnosis. We used the Prediction model Risk Of Bias ASsessment Tool (PROBAST) [20] for quality assessment.

Validation cohort

The cohort comprised all patients diagnosed between 1 January 2005 and 26 May 2019 in the SA-PCCOC registry. This registry captures more than 90% of prostate cancer patients diagnosed in South Australia, collecting data on disease characteristics at diagnosis, initial treatment type, cause of death, time to biochemical recurrence and more. Patients are retained unless they opt out of data collection. Survival data are obtained from the births, deaths and marriages registry and are available for all patients. Only patients diagnosed before 2005 or without a diagnosis date were excluded.

Model outcome

Bone scans performed within 20 weeks of histological diagnosis were considered staging scans [21]. Indeterminate scans were reclassified as positive or negative using subsequent imaging and clinical information. Where further classification was unachievable, results were imputed.

Model predictors

Most models used serum prostate-specific antigen (PSA), tumour (T) stage and/or GS as predictors. For validation, the PSA level measured prior to treatment and closest to diagnosis was used as “PSA at diagnosis”. If all PSA levels on record were post-treatment, PSA was set as unknown and imputed. T-stage was assessed by physical examination at diagnosis. GS was based on diagnostic biopsies.

Ethics

The SA-PCCOC research committee approved the use of de-identified data, under authority delegated by the Southern Australian Clinical Human Research Ethics Committee. This study was performed in accordance with the Declaration of Helsinki 2013.

Statistical methods

Calibration

Calibration slope and calibration-in-the-large (calibration intercept) were assessed to gauge the accuracy of each model's predicted risk of bone scan positivity. These were calculated by fitting logistic regressions of observed bone scan positivity against predicted risk [22]. Calibration-in-the-large was calculated similarly, with the slope fixed at one [22]. These analyses were performed in each imputed dataset and pooled using Rubin’s rules [23]. The ideal calibration slope is one and the ideal calibration-in-the-large is zero [22]. Where predicted risk was not specified for a model risk group, the rate of bone scan positivity in the model’s development study was taken as the predicted risk (Supplementary Table 6). Calibration was not calculated for guideline models, which did not report numeric predicted risks.
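For illustration, these calibration measures can be obtained in R as in the minimal sketch below, assuming a vector y of observed bone scan results (1 = positive, 0 = negative) and a vector p of model-predicted risks (both hypothetical names); this is not the authors’ exact code.

lp <- qlogis(p)                                      # logit of predicted risk (linear predictor)
slope_fit <- glm(y ~ lp, family = binomial)          # coefficient of lp = calibration slope
citl_fit <- glm(y ~ offset(lp), family = binomial)   # slope fixed at one; intercept = calibration-in-the-large
coef(slope_fit)[["lp"]]
coef(citl_fit)[["(Intercept)"]]

In practice, these regressions would be refitted in each imputed dataset and the estimates pooled using Rubin’s rules, as described above.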

Discrimination

The area under the receiver operating characteristic curve (AUC) was used to summarize each model’s ability to discriminate between patients with a positive and a negative bone scan. AUC was interpreted in accordance with Hosmer et al. [24].
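As a sketch (again assuming hypothetical vectors y and p as above), the AUC can be computed with the pROC package; pooling across imputed datasets is omitted here.

library(pROC)
roc_obj <- roc(response = y, predictor = p)   # receiver operating characteristic curve
auc(roc_obj)                                  # area under the curve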

Decision curve analysis

Decision curve analysis was used to compare the net-benefit of models at different scanning thresholds (staging strategies) over varying degrees of conservatism (preference to avoid missing disease) and tolerance (preference to scan fewer people) [25]. Traditionally, varying degrees of conservatism and tolerance (“preference”) are reflected on the x-axis of decision curves as a probability threshold (pt), the point at which the user believes intervention is appropriate. To avoid confusion between model thresholds and pt, we used the alternative measures of preference ratio [25] and number-willing-to-test (NWT). A preference ratio of 1:99, in this context, represents a belief that scanning one hundred people to capture one positive bone scan is reasonable [25], and corresponds to a pt of 0.01 and an NWT of 100. A preference ratio of 1:9 was the upper limit of preference assessed, as it represents a willingness to scan at least ten patients to capture one positive bone scan, a number we felt was universally acceptable.
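The relationships between preference ratio, pt and NWT, and the resulting net-benefit calculation, are illustrated in the R sketch below; the function names are hypothetical and y and p are as above.

pref_to_pt <- function(a, b) a / (a + b)       # preference ratio a:b -> probability threshold
pt_to_nwt <- function(pt) 1 / pt               # e.g. pt = 0.01 -> NWT = 100
net_benefit <- function(y, p, pt) {
  scanned <- p >= pt                           # scan patients whose predicted risk reaches pt
  tp <- mean(y == 1 & scanned)                 # true positives per patient in the cohort
  fp <- mean(y == 0 & scanned)                 # false positives per patient in the cohort
  tp - fp * pt / (1 - pt)                      # false positives weighted by the odds of pt
}
net_benefit(y, p, pref_to_pt(1, 99))           # net-benefit at a preference ratio of 1:99 (pt = 0.01, NWT = 100)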

Continuous and categorical models were presented differently. As categorical models provide qualitative rather than quantitative predictions, they had fewer potential decision thresholds. They were presented as fixed strategies, akin to the presentation of a “test” in Vickers et al. [25], with each potential threshold from a categorical model displayed as a straight line across the range of preference ratios assessed (equation in Supplementary 2). Continuous models were presented both as decision-analysis curves (demonstrating the potential outcomes of using any threshold in that model) and as straight lines for the fixed strategies their source articles recommended. Strategies with higher net-benefit were considered higher performing, with the magnitude of the difference being irrelevant [25].
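For a fixed (categorical) strategy, net-benefit at each preference ratio can be written in terms of the strategy’s sensitivity, specificity and the cohort prevalence of positive scans. The sketch below uses the standard binary-test form of net-benefit, which we assume corresponds to the equation in Supplementary 2; the sensitivity, specificity and prevalence values are purely illustrative.

# Net-benefit of a fixed scanning strategy against the preference ratio expressed as odds
# (pt / (1 - pt)); in this parameterization the decision curve is a straight line.
nb_fixed <- function(sens, spec, prev, odds) {
  sens * prev - (1 - spec) * (1 - prev) * odds
}
odds <- seq(1 / 99, 1 / 9, length.out = 200)   # preference ratios from 1:99 to 1:9
plot(odds, nb_fixed(sens = 0.95, spec = 0.40, prev = 0.09, odds), type = "l",
     xlab = "Preference ratio (odds)", ylab = "Net-benefit")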

Missing data

Missing data were multiply imputed using chained equations (mice package [26]). Based upon the analyses in Supplementary 1, the reasons for missingness were considered to be well explained by, and correlated with, prostate cancer-specific overall survival, initial treatment, treatment in a public or private setting, biopsy type and disease factors, supporting the missing-at-random assumption. We imputed one hundred datasets, each with one hundred iterations, and pooled results using Rubin’s rules [23]. Kaplan–Meier curves were used to compare survival in patients with imputed positive bone scans to those with observed positive scans, and likewise for imputed negative bone scans (Supplementary 1).
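A minimal sketch of this imputation step is shown below, assuming a data frame dat of registry variables; the variable names and the analysis model are hypothetical and chosen purely for illustration.

library(mice)
imp <- mice(dat, m = 100, maxit = 100, seed = 1)   # one hundred imputed datasets, one hundred iterations each
fits <- with(imp, glm(scan_positive ~ psa + t_stage + gleason, family = binomial))
summary(pool(fits))                                # pool estimates across imputations using Rubin's rules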

All statistical analyses were performed using R version 3.4.2 [27].

Results

Validation cohort

The cohort comprised 10,721 consecutive men newly diagnosed with prostate cancer (Fig. 1), 4079 of whom had a staging bone scan, of which 354 (8.7%) were positive (Table 1). As expected, patients with positive scans had poorer survival and higher GS, PSA at diagnosis, clinical T-stage and percent positive cores on biopsy than those with negative scans. There were 150 indeterminate bone scans (3.6%, 150/4079), the majority of which (n = 135) were subsequently reclassified as negative based on follow-up imaging and data. The remaining fifteen were imputed. A total of 6642 patients had no staging bone scan result in our database. These patients had lower GS and T-stage than patients with staging bone scans on record, were more often treated in the private setting (Supplementary Table 1) and had better survival (Supplementary Fig. 2). This points towards two main mechanisms of missing data: selective use of bone scan staging (in patients thought to be at “higher risk” as per previous clinical guidelines) and restricted access to data in privately treated patients. As the difference in survival between patients with and without bone scans diminishes with stratification by risk group (Supplementary Fig. 3 and Supplementary Table 2), there is strong support for these mechanisms of missingness and thus our choice of imputation model. Supplementary 1 confirms the reliability of the imputations. Survival was almost identical in patients imputed with a positive bone scan compared to those with a known positive scan, and likewise for patients imputed with negative scans (Supplementary Fig. 1). Post-imputation cohort characteristics (Supplementary Table 3) show that the distribution of disease stage and incidence of metastatic disease in our cohort were similar to the SEER database [28].

Fig. 1: Selection of validation cohort.
figure 1

Flow diagram demonstrating the cohort selection process, exclusion criteria and cohort breakdown for selective staging strategy validation.

Table 1 Characteristics of the validation cohort prior to multiple imputation.

Model identification

Thirteen distinct models were identified from the guidelines and literature search (Supplementary Fig. 4): EAU 2020 risk strata [29], AUA 2018 risk strata [3], NCCN 2019 risk strata [30], Ho [31], Wang [32], Chybowski [33], Briganti [4], O’Sullivan [34], Lai [5], ISUP [7], Gnanapragasam [7], Wang 2 [35] and Lorente [36]. Two could not be validated (Wang 2 [35] and Lorente [36]) as they used serum alkaline phosphatase (not recorded in the database). Three provided continuous estimates of risk based on logistic regression (Ho [31], Wang [32] and Chybowski [33]), while the others categorized patients as low, intermediate, high risk or similar based upon common clinical thresholds [3, 5, 7, 29, 30, 34] or classification-and-regression-tree analysis [4]. The thresholds recommended by these models were used to select patients for bone scanning (Table 2).

Table 2 Model details and characteristics of sources.

A high risk of bias was identified in all literature-derived models due to small sample sizes, limited internal and external validation and, in some cases, biased recruitment processes (Supplementary Tables 4 and 5 and Supplementary Fig. 5). The rationale behind threshold selection for staging strategies was sometimes missing [30] or poor. Three main approaches were used to select thresholds: percent bone scan positivity (inadequate in small studies, where observed risk may not generalize) [7], negative predictive value (insensitive for rare events) and the highest point on the ROC curve (which balances sensitivity and specificity equally, although sensitivity must be prioritized in this context).

Model validation

No model had the ideal calibration-in-the-large of zero (Table 3). Most models had a positive calibration-in-the-large, indicating that they under-estimated risk on average. Lai deviated least from zero (calibration-in-the-large −0.28 [95% confidence interval, CI: −0.37, −0.19]) and Ho deviated most (−1.88 [95% CI: −1.96, −1.80]), over-estimating risk on average.

Table 3 Calibration and discrimination of models.

The calibration slope was also rarely one, the ideal (Table 3). The Wang model was closest, with a slope of 0.94 [95% CI: 0.88, 1.00], but most others deviated significantly. Those with slopes less than one (Chybowski, ISUP and Lai) over-predicted risk in high-risk groups and under-predicted it in low-risk groups (Supplementary Fig. 6), a pattern classic of overfitting. The Ho and Gnanapragasam models had slopes far greater than one. Their calibration plots suggest this was likely due to under-prediction of risk in high-risk groups for Gnanapragasam and over-prediction in low-risk groups for Ho (Supplementary Fig. 6).

AUCs ranged from 0.68 to 0.80 across models, considered “fair” by Hosmer et al. [24]. The highest AUCs were seen with Ho, Wang and Gnanapragasam (Table 3).

Strategy validation

Figure 2 summarizes the net-benefit comparisons. Part A presents the decision-analysis curves for the guideline recommendations and the two novel selective staging strategies that outperformed all other approaches: scanning EAU high-risk patients only and scanning Gnanapragasam Group 5 patients only. Part B highlights the strategy performing best at each assessed preference ratio. The scan-all strategy had higher net-benefit than all other strategies at preference ratios of 1:99–1:39 (representing a number-willing-to-test, NWT, of 100–40 to capture one positive scan). The EAU guideline recommendation was best for preference ratios of 1:39–3:97 (NWT 40–33), scanning EAU high-risk patients for preference ratios of 3:97–7:93 (NWT 32–14) and scanning Gnanapragasam Group 5 patients for preference ratios of 7:93–1:9 (NWT 13–10). Supplementary Fig. 7 contains decision-analysis curves for all strategies.

Fig. 2: Model performance by decision curve analysis.
figure 2

A Decision analysis curves for guideline recommendations and top-performing alternative staging strategies. B Stepwise plot demonstrating optimal staging strategy for each potential preference ratio.

Supplementary Table 7 presents the net-benefit of each model’s recommended staging strategy (the strategy advised by the model’s source) at different preference ratios, alongside the net-benefit of the best performing strategy within that model for that preference ratio. There was often a discrepancy, highlighting the value of using net-benefit to identify optimal staging strategies. The table also shows that fixed strategies from continuous models often had higher net-benefit than the continuous model itself at the same preference ratio. This may be due to mis-calibration.

Discussion

Bone scans are the most widely available tool for prostate cancer staging and remain the most evidence-based for guiding treatment selection [2]. Bone scan results can significantly alter the optimal treatment plan for newly diagnosed prostate cancer patients. A finding of oligometastases may shift a patient from radical curative treatment to combined radiotherapy and systemic therapy, or from systemic therapy alone to combined radiotherapy and systemic therapy. However, recommendations for bone scan staging vary and are based upon consensus opinion or models developed in small cohorts, often with selective recruitment and limited rigorous external validation. Ours is the first study to validate such a broad range of bone scan staging strategies head-to-head in a large independent cohort using net-benefit.

We found that (i) none of the commonly used models or strategies were universally superior across preference ratios, and (ii) the optimal staging strategy varied with preference ratio. The selective staging strategies that performed best were the EAU 2020 guideline recommendation (scanning patients with intermediate-risk GS 4 + 3 disease or high-risk disease), scanning EAU high-risk patients only and scanning patients in Group 5 of the novel Gnanapragasam model. The choice between them depends upon the preference ratio of conservatism and tolerance appropriate to the local health system and a given patient’s case. As bone scan results can radically alter treatment, some clinicians and patients may prefer more conservative approaches such as the EAU guideline recommendation. In other scenarios, with different patients or health systems, or in health crises, such changes in treatment or such generous scanning may not be feasible, necessitating more “tolerant” strategies, such as scanning only EAU high-risk patients or Gnanapragasam Group 5 patients.

Interestingly, at high levels of conservatism, scanning everyone had greater net-benefit than currently available selective staging strategies. This may be a result of true misses with selective staging strategies. In our pre-imputation cohort, approximately 3% (35/1086) of patients with GS 6 disease on biopsy had positive staging bone scans. These patients are often excluded from selective staging strategies, as GS 6 disease is generally thought not to metastasize. However, upgrading of Gleason 6 prostate cancer is common at radical prostatectomy [37,38,39], and these patients may have a risk of metastatic disease higher than appreciated by current selective staging strategies. Additional predictors of final grade, such as PIRADS score, may improve the accuracy of selective staging strategies at conservative preference ratios [40]. A scan-all approach may also have appeared superior to selective staging approaches because of false positives. The present literature suggests a specificity of 79% for bone scan staging [41], but patients with low-risk disease were often excluded from these studies. Our own data suggest a higher rate of false-positive scans in patients with low-risk disease, as indeterminate scans in patients with low-risk disease were classified as negative more often than in patients with high-risk disease. False positives have the potential to inappropriately alter treatment plans and lead to sub-optimal care, and thus such inclusive strategies should be used with care. Improved imaging technologies should bring fewer false positives, and conditioning future models on true positive scan results rather than all positives could also circumvent this issue.

Our analysis confirmed inaccuracies in the prediction of bone scan positivity risk by current models. Chybowski, Lai and ISUP were overfitted (calibration slope less than one), and Ho and Gnanapragasam were underfitted (calibration slope greater than one). Both are consequences of small sample sizes, with fewer than ten events (positive bone scans) per predictor variable (EPV) at model development or few events at model validation and repurposing (ISUP and Gnanapragasam) [42]. Calibration issues are likely responsible for the differences in net-benefit between continuous models and the “fixed strategies” recommended from them. Recalibration may make these models more useful. This analysis confirms the widespread problems of model development noted by Moon et al. [42], but also shows that, despite mis-calibration, the Gnanapragasam model provided a highly effective selective staging strategy, underscoring the importance of practical measures of model performance such as net-benefit.

Another key strength of our study is that it is one of the few in this field to meet the sample size requirements for reliable external validation [42, 43]. Our cohort was also derived from an opt-out, population-based registry with minimal exclusion criteria, limiting selection bias. Although missing data are a key limitation, this is a common issue in the field [42], and our study is the first to report on it in such detail and the first in the field to use multiple imputation to handle it. Additionally, we have strong evidence to support the reliability of our imputations, with post-imputation distributions of prostate cancer disease characteristics matching those expected in a prostate cancer population. Finally, while PSMA-PET use is extending to primary prostate cancer staging [44, 45], radionuclide bone scans have the most evidence in guiding treatment strategies and have FDA approval [2]. Thus, this work is of critical relevance and use now and, in future, may help evaluate PSMA-PET staging.

This study found that no single model performed best for selective bone scan staging; rather, different strategies from different models performed better over different degrees of conservatism and tolerance. Of the selective staging strategies assessed, three performed best: scanning patients as per the 2020 EAU guideline, scanning EAU high-risk patients only and scanning Gnanapragasam Group 5 patients only. Scanning only EAU high-risk patients provided the greatest net-benefit over the widest range of preference ratios (NWT 14–32), but other approaches may be preferred in settings with different degrees of conservatism and tolerance. This study provides a robust analysis that can improve bone scan use and decision making in primary prostate cancer staging now, and serves as a template for the assessment of future technologies such as PSMA-PET/CT.