Main

The Gleason grading system is an architectural based grading system, and has become one of the most accepted and important predictors of the prognosis and behavior of prostate adenocarcinoma.1, 2, 3, 4, 5, 6, 7, 8 In general, the Gleason score on 18-gauge needle biopsy specimens has been found to correlate well with the Gleason score on radical prostatectomy specimen and to have good interobserver reproducibility; however, the correlation is not perfect and under grading is a particular problem, in up to 58% of cases.1, 2, 3, 6, 9, 10, 11, 12, 13, 14 Some have advocated not assigning biopsy scores of 2–4 to eliminate errors and potential harm to patients.10, 11

Sextant biopsies have been routinely used by many urologists, and prostate cancer is often found in multiple biopsy sites from the same patient. Despite extensive studies on the predictive values of prostate biopsy Gleason scores for final Gleason scores at radical prostatectomy, the implications of different Gleason scores from more than one site are uncertain despite the recommendation of assigning each biopsy core a separate Gleason score.12 In the setting of more than two positive biopsy sites, pathologists usually grade the tumor at each site separately, and the Gleason score may differ from each positive site. This study seeks to determine if the highest Gleason score in all biopsy sites, or the Gleason score in the site with the highest tumor volume on the needle biopsy is the best predictor of final primary and secondary Gleason pattern grades in the radical prostatectomy specimens.

Materials and methods

Biopsy Specimens

This study analyzed 151 patients who underwent radical prostatectomy for prostate cancer between 1999 and 2003. The patients' ages ranged from age 41 to 77 years (median, 60 years). All had clinically localized cancer. None had hormonal or radiotherapy prior to surgery. Three patients had lymph node metastasis. Biopsies were preformed using an 18-gauge needle. All patients had positive needle biopsies in at least two biopsy sites. The biopsies were graded using the Gleason grading system.5 Various preoperative biopsy findings were recorded, including the primary and secondary Gleason pattern grade from each positive biopsy site, the highest Gleason score among all biopsy sites, and the Gleason score from the site with the highest tumor volume on the needle biopsy. Tumor volume in a positive biopsy site was defined as the percentage of areas of biopsy cores involved by cancer and was estimated visually, using 5% increments.

This research was approved by the Indiana University Institutional Review Board.

Radical Prostatectomy Specimens

The radical prostatectomy specimens were examined in the whole-mount method as previously described.15, 16, 17, 18, 19, 20, 21, 22, 23 Prostates were weighed, measured, inked, and fixed in 10% neutral formalin. Following fixation, the apex and base were amputated and serially sectioned at 3–5 mm intervals in the vertical, parasaggital plane. The seminal vesicles were sectioned parallel to their junction with the prostate and entirely submitted for examination. The remaining prostate was serially sectioned perpendicular to the long axis from the apex of the prostate to the base, and whole-mount sections were prepared.

The prostatectomy specimens were graded and staged by a single urologic pathologist (LC). Gleason score was obtained by the summation of primary Gleason pattern grade and secondary Gleason pattern grade based on the assessment of the entire specimen. The 1997 TNM (tumor, nodes, metastasis) system was used for pathologic staging.24 Surgical margins were considered positive when carcinoma cells were in contact with the inked margin.15 The volume of carcinoma in the entire prostate was determined by the grid method15, 16, 25, 26, 27, 28, 29 and was the sum of the volumes of individual foci of tumor. Patient and specimen characteristics are shown in Table 1.

Table 1 Characteristics of 151 patients who underwent radical prostatectomy for prostate cancer

Statistical Analysis

Spearman correlation coefficients were calculated between the Gleason grades from the biopsy and the Gleason grades in the radical prostatectomy specimen. To determine whether the Gleason score from the biopsy site with the highest Gleason score or the Gleason score from the biopsy site with the largest tumor volume on the needle biopsy was more correlated with the Gleason score and grades from the radical prostatectomy, the correlations were compared. Since the correlations are both calculated from the same sample, the method described in Cohen and Cohen,30 which takes their dependence into account was used to compare the correlation coefficients. The Wilcoxon Rank Sum test was used to test for associations between Gleason pattern grades and categorical variables.

Results

The distribution of biopsies are shown in Table 2. The number of biopsies ranged from 2 to 12, with the majority of patients (42%) having two biopsies. All of these patients had more than one positive biopsy site, and the number of positive biopsy sites ranged from 2 to 8, with the majority (68%) having two positive biopsy sites (Table 2). In total, 88 (58%) patients had identical primary and secondary Gleason pattern grade from all positive sites. A total of 126 (83%) patients had identical primary Gleason pattern grade from all positive sites. In all, 98 (65%) patients had identical secondary Gleason pattern grade from all positive sites.

Table 2 Distribution of biopsies in 151 prostate cancer patients who underwent radical prostatectomy

Table 3 contains the distribution of Gleason scores in needle biopsy and radical prostatectomy specimens. The primary Gleason pattern grade from the biopsy site with the highest Gleason score correlated exactly with primary Gleason pattern grade in the radical prostatectomy specimen in 75.5% of cases while it undergraded the primary Gleason pattern grade in 13.9% of cases and overgraded it in 10.6% of cases (Figure 1). The Gleason score from the biopsy site with the highest tumor volume on the needle biopsy also correlated exactly with primary Gleason pattern grade in the radical prostatectomy specimen in 75.5% of cases while it undergraded the primary grade in 12.6% of cases and overgraded it in 11.9% of cases (Figure 1). Overall, the primary Gleason pattern grade from both the biopsy site with the highest Gleason score and the biopsy site with the highest tumor volume on the needle biopsy correlated significantly (P<0.001 for both) and equally well with final primary Gleason pattern grade at radical prostatectomy (Spearman correlation coefficient =0.53 for both, P=0.90).

Table 3 Distribution of Gleason scores in needle biopsy and radical prostatectomy specimens (N=151)
Figure 1
figure 1

Correlation of primary Gleason pattern grade from the core with the highest tumor volume on the needle biopsy and the core with the highest Gleason score with primary Gleason pattern grade in radical prostatectomy specimens.

The secondary Gleason pattern grade from the biopsy site with the highest Gleason score correlated exactly with secondary Gleason pattern grade in the radical prostatectomy specimen in 43.7% of cases, while it undergraded the secondary pattern grade in 30.5% of cases and overgraded it in 25.8% of cases (Figure 2). The secondary Gleason pattern grade from the biopsy site with the highest tumor volume on the needle biopsy correlated exactly with secondary Gleason pattern grade in the radical prostatectomy specimen in 47.7% of cases, while it undergraded the secondary Gleason pattern grade in 23.8% of cases and overgraded it in 28.5% of cases (Figure 2). The secondary Gleason pattern grade from both the biopsy site with the highest Gleason score and the biopsy site with the highest tumor volume on the needle biopsy correlated significantly (P<0.001 for both) with final secondary Gleason pattern grade at radical prostatectomy, although secondary Gleason pattern grade from the biopsy site with the highest tumor volume on the needle biopsy correlated with secondary Gleason pattern grade at radical prostatectomy marginally better than those from the biopsy site with the highest Gleason score (Spearman correlation coefficient, 0.32 vs 0.24; P=0.058).

Figure 2
figure 2

Correlation of secondary Gleason pattern grade from the core with the highest tumor volume on the needle biopsy and the core with the highest Gleason score with secondary Gleason pattern grade in radical prostatectomy specimens.

The Gleason score from both the biopsy site with the highest Gleason score and the biopsy site with the highest tumor volume on the needle biopsy correlated significantly (P<0.001 for both) and equally well with the final Gleason score at radical prostatectomy (Spearman correlation coefficient =0.54 and 0.55, respectively; P=0.95) (Figure 3). In all, 10 patients who had a Gleason score of 7 in the biopsy site with the highest Gleason score had a Gleason score of 6 in the radical prostatectomy, while a similar number of patients (9) who had Gleason score of 7 in the biopsy site had a Gleason score of 6 in the radical prostatectomy specimen.

Figure 3
figure 3

Correlation of overall Gleason score from the core with the highest tumor volume on the needle biopsy and the core with the highest Gleason score with final Gleason score in radical prostatectomy specimens.

Both maximum Gleason score from all biopsy sites and Gleason score from the site with the largest tumor volume on the needle biopsy were associated with final pathologic stage and extraprostatic extension (P<0.001). Both maximum Gleason score from all biopsy sites and Gleason score from the site with the largest tumor volume on the needle biopsy were associated equally well with final pathologic stage and extraprostatic extension (P<0.0001). Minimal correlation was found between biopsy Gleason score and patient's age, preoperative PSA levels, prostate weight, and tumor volume (all Spearman correlation coefficients <0.23). No significant association was found between biopsy Gleason score and extraprostatic extension, seminal vesicle invasion, surgical margin status, high-grade prostatic intraepithelial neoplasia, perineural invasion, or multifocality of cancer (all P-value >0.05).

Discussion

In this study, we found that the highest Gleason score from all biopsy sites and the Gleason score from the site with the highest tumor volume on the needle biopsy are equally predictive of final Gleason score and primary Gleason pattern grade on radical prostatectomy. Both methods are predictive of secondary Gleason pattern grade on radical prostatectomy. We recommend that the highest Gleason score from all positive biopsy sites should be used when assigning an initial score using needle biopsies for improved accuracy and prognostic value.

The Gleason grading system is widely used in urologic pathology, has been adopted as a standard grading system by the 2004 World Health Organization (WHO) classification.31 The Gleason grading system is a powerful prognostic indicator for prostatic adenocarcinoma, validated by numerous studies. The prognostic power of the Gleason grading system, particularly the percentage of Gleason pattern grade 4/5, in predicting PSA recurrence has been demonstrated by Stamey et al32 who showed this and intraprostatic vascular invasion to be the only variables predictive of PSA recurrence. The Gleason grading system does, however, have limitations, like any histologic grading system.33 Inter- and intraobserver reproducibility varies among pathologists. For example, Allsbrook et al1 have demonstrated only moderate reproducibility of grading among general pathologists and De Las Morenas et al34 have, in a comparison of the Gleason, Mostofi, and Böcking grading systems, found the Gleason system to be the least reproducible. These concerns about reproducibility are somewhat offset by other studies such as an additional study by Allsbrook et al1 that reveals good reproducibility of Gleason grading among urologic pathologists and a study by Renshaw et al35 that displays good concordance of Gleason grading between reference and general pathologists as well as demonstrating recent improvement in the reproducibility of grading prostatic adenocarcinomas among general pathologists. Another limitation of the Gleason grading system is well-documented tendency towards undergrading, particularly in needle biopsy.1, 2, 3, 6, 9, 14 In addition, the difficulty remains in quantifying the percentages of Gleason pattern grades 3 and 4 when admixed, which could render the clinically significant distinction between a Gleason score of 3+4 and 4+3 difficult.36

The ability of biopsy Gleason score to predict final Gleason score in the radical prostatectomy specimen has been studied previously. Bostwick2 studied 316 radical prostatectomies and found that the primary Gleason pattern grade on biopsy showed exact correlation with radical prostatectomy score in 54% of cases. This study showed 40% underestimation of prostatectomy Gleason score by biopsies and 25% overestimation, with the grading errors being more frequent in the lowest grade biopsies where there was a biopsy grading error of 91% in well differentiated tumors.2 The study by Cookson et al9 of 226 consecutive cases produced similar results with exact correlation of biopsy and prostatectomy scores in 31% of cases, overestimation of score in 15%, and underestimation of final score in 54% of cases. Spires et al6 studied 67 patients and found exact correlation between biopsy and prostatectomy Gleason scores in 58% of cases. Carlson et al3 studied 106 patients, and documented similar results, with 68% exact correlation, 25% underestimation of prostatectomy score, and 8% overestimation of prostatectomy score. Steinberg et al14 studied 499 patients and found exact correlation in 58% of biopsy and radical prostatectomy specimens (when graded by pathologists at John's Hopkins Hospital). Steinberg et al14 also found that a biopsy Gleason score of less than 7 predicted a radical prostatectomy Gleason score of less than 7, 64% of cases while a biopsy Gleason score of greater than (or equal to) 7 was predictive of a prostatectomy score greater than 7, 88% of the time; again, demonstrating the problem of underestimation of grade.

Clinicians routinely use sextant biopsies to detect cancer. In many cases, there are multiple positive biopsy sites, and current recommendations are to assign each biopsy a separate score.12 If there are more than two positive sites, each will be assigned a separate score, which may differ from site to site. This study was specifically designed to answer the question of, in the setting of multiple positive biopsy sites, which is the best predictor of final Gleason score. We found that the ability of both the Gleason score from the site with the highest tumor volume on the needle biopsy and the highest Gleason score from all sites to predict radical prostatectomy Gleason scores were comparable to those of techniques used in other studies with only a single positive score. Primary Gleason grade was predicted exactly in 75.5% of cases using either method in this study. Our data also show trends that both techniques of Gleason grade prediction more commonly underestimate Gleason score than they overestimate the score, similar trends were reported in other studies.1, 2, 3, 4, 6, 9, 14, 37 The origin of this type of error is likely due to sampling issues and tumor heterogeneity, although pathologic grading errors, borderline cases, and reverse sampling errors may also contribute to this discrepancy.14

We also found that secondary Gleason pattern grade from the biopsy site with the highest tumor volume on the needle biopsy correlated with secondary Gleason pattern grade at radical prostatectomy marginally better than those from the biopsy site with the highest Gleason score (Spearman correlation coefficient, 0.32 vs 0.24, P=0.058). This may be related to the so-called reverse sampling error.1, 14 In this form of error, a minor component of tumor is sampled in needle biopsy and included in calculating the biopsy-derived Gleason score, but comprises <5% of the entire tumor in the radical prostatectomy specimen, so it is not included in the final grading since Gleason grading is based on the two most prevalent patterns observed. Reverse sampling error is used by Yang et al37 as a possible explanation of why small (less than 1.5 mm) foci of Gleason score 7 carcinoma on needle biopsy are not an adverse finding. The elimination or reduced frequency of this form of error may improve accuracy of predicting final Gleason scores and Gleason pattern grades.

The significance of any high-grade cancer on biopsies should not be underestimated. Kunz and Epstein found that any Gleason 4+4=8 (high-grade) tumor in biopsies is significant in terms of predicting both stage and radical prostatectomy grade, even if it is present in only one core, and greater numbers of cores have lower-grade tumor.12 Pan et al7 has shown that small tertiary foci of high-grade cancer in prostatectomy specimens are predictive of tumor aggressiveness, again suggesting that the presence of any high-grade tumor is significant. Studies by Bostwick, Carlson et al, Spires et al, and Steinberg et al, all have found that the volume of tumor present in biopsies does not correlate with a change in the ability to predict radical prostatectomy Gleason score, emphasizing the significance of small-volume, high-grade elements.2, 3, 6, 14 Our data also indicate that both the highest Gleason score from all sites and the Gleason score from the site with the highest tumor volume on the needle biopsy are equally and significantly predictive of final Gleason score on radical prostatectomy. The reason that an increased volume of tumor on the needle biopsy is not more accurate in predicting final Gleason score may be attributed to the extremely small size of a biopsy as compared to a prostate (estimated to be approximately 1/10 000th the size of a prostate)1 and the sampling error inherent to biopsies.

In conclusion, in the setting of multiple positive prostate biopsy sites, both the Gleason score from the biopsy site with the highest tumor volume on the needle biopsy and from the site with the highest Gleason score of all sites are predictive of final scores and grades on radical prostatectomy, and both could be used in assigning an initial score using needle biopsies. We recommend that the highest Gleason score from all positive biopsy sites should be used for practical purpose.