Main

Wide spectrums of morphologic patterns and clinical behavior in breast carcinomas have led to many attempts to create meaningful classifications. Greenough1 introduced breast carcinoma grading in 1925. He used seven histologic characteristics to classify breast carcinomas into three grades of malignancy. Patey and Scarff2 in 1928 selected tubule formation, variation in nuclear size, and nuclear hyperchromatism as principal variables that were modified by evidence of secretion (good) and mitotic frequency (bad). In 1957, Bloom and Richardson3 proposed a simplified system which utilized only three of Greenough's variables: gland-formation (tubularity), degree of variation in nuclear size and shape (pleomorphism), and ‘hyperchromatic figures’ as an estimate of proliferation. Despite lack of specific criteria for quantifying variables measured, the grading system was effective for prognosis. Modifications to enhance reproducibility of scores resulted in the Nottingham–Bloom–Richardson system,4, 5 which has been endorsed by the College of American Pathologists and the World Health Organization.6 The basic principle has remained summation of scores for the three variables, each of which is assigned from one to three points according to degree of departure from normal breast epithelium. A total score of 5 or less defines grade 1, 6–7 points grade 2, and 8–9 points grade 3. Rationale for an unweighted combined scoring of these three correlated variables has never been put forth, but a relationship between grade of invasive ductal carcinoma (IDC) of the breast and outcome has been recognized consistently.3, 4, 7, 8, 9 Strength of the prognostic relationship has fallen short of levels necessary for critical therapeutic decisions, and reproducibility of grading has remained a matter of concern10, 11, 12, 13, 14, 15, 16, 17, 18, 19 (Table 1). The Cooperative Breast Cancer Tissue Resource (CBCTR) has collected and categorized 9000 breast cancer specimens available as formalin-fixed blocks in paraffin.20 Histologic malignancy grading could facilitate exploitation of the resource by investigators. Thus, we set out to evaluate reproducibility of breast carcinoma grading of CBCTR specimens. Efficacy of grading vs proliferation index (PI) for prediction of outcome was evaluated in a second set of patients.

Table 1 Breast carcinoma grading: levels of agreement reported in prior studies

Materials and methods

Populations Studied

Grading reproducibility

We used specimens selected from a previously ungraded CBCTR set of 9000 breast carcinoma specimens from years 1982 to 1990 to assess reproducibility of grading. CBCTR specimens consist of paraffin-embedded, formalin-fixed tissue blocks and microsections stored in participating institutions where the patients were diagnosed and treated.

Assessment of relationships between grade, proliferation index, and outcome

A separate set of 631 node-negative infiltrating breast carcinomas was used to evaluate prognostic efficacy of grade and other variables. At least five axillary lymph nodes were examined microscopically in each patient. The specimens were accessioned serially at St Luke's Hospital in the years 1985–1998. Patients with negative axillary lymph nodes were selected because assessment of prognosis may influence decisions to recommend surgical adjuvant therapy in this group.

These patients were selected consecutively from a larger set that had been evaluated prospectively for proliferation index by DNA precursor uptake by fresh tissue slices. All had unifocal invasive carcinomas of known size. Breast-conserving surgery followed by radiation to the residual breast was performed in 56.6%, and total mastectomy in 43.4% of patients. The number of axillary nodes examined varied from 5 to 37, mean 12.8, median 12. Therapy was individualized according to characteristics of the patient and tumor, particularly tumor size, grade, hormonal receptor assay results, and labeling index which, along with other variables, were available at time of initial treatment. High labeling index was associated with increased probability of adjuvant chemotherapy, and expression of estrogen and progesterone receptors with administration of tamoxifen.

The median period of observation after initial therapy was 6.4 years.

Mitotic Figure Recognition

Nuclear figures from 10 microsections used for Nottingham–Bloom–Richardson grading were photographed through a × 40 dry objective lens with negative color film. Final magnification of prints was 500 diameters. A total of 50 nuclear figures were included and encircled with a wax pencil in the 36 prints provided to each participating pathologist. The participant was asked to classify each object as a mitotic figure (yes/no). If a mitotic figure, then phase of mitosis, and if not a mitotic figure, then resting cell, apoptosis, pyknosis, necrosis, or other was to be specified. The identities assigned by the photographer are listed in Table 2.

Table 2 Identities of objects photographed

Tumor Grading

Cooperative Breast Cancer Tissue Resource specimens

All grading reproducibility exercises were carried out using a separate set of glass slides representing new invasive carcinomas for each exercise. Seven pathologists participated, five of them in all five trials. Mean age of pathologists is 58.6 years, median 62, range 47–74. Their mean number of years in practice is 25.7 years, median 24, range 13–44. Six pathologists had special interests in breast pathology. They graded the tumors individually without intercommunication according to the Nottingham–Bloom–Richardson system.5 Cellular, well-fixed fields from peripheral tumor regions were utilized. The first of 10 consecutive high-power fields in which mitotic figures were counted was selected without constraint that a mitotic figure was or was not present. Five trials were run in which a single microsection from each breast carcinoma was evaluated by each participant for tumor grade. No special instructions were given prior to trial 1. After the first trial, the group met to discuss sources of disagreement, review the Nottingham–Bloom–Richardson system, and assure that mitotic counts were adjusted according to standard microscope field area. After the second trial, another meeting was held to discuss discrepancies. Six of the original seven pathologists participated in two more trials using invasive carcinomas. Before the third trial, pathologists reviewed criteria for mitotic figures and recognition of prophase mitosis using a microscope for multiple viewers.

St Luke's Hospital specimens

Tumors were graded prospectively by the pathologist responsible for diagnosis. In all, 10 pathologists were active during the period the specimens were accessioned. Tubularity was scored as 1 if <5% of tumor cells participated in formation of lumens, 2 if 5–50% participated, and 3 if >50% participated. Nuclear grade was assigned as 1 if nuclei were predominantly less than 10 μm in greatest diameter, 3 if predominantly >13 μm in diameter, and 2 if intermediate. Calibrated ocular grids were used as size references. Mitosis was defined as prophase (uniformly condensed chromosomes), metaphase, anaphase, or telophase. Hyperchromatic nuclei and apoptotic figures were excluded. Mitotic indices were recorded prospectively per 10 × 40 high-power fields, field area approximately 0.18 mm2, and recorded in four categories: 0–2, 3–5, 6–10, and >10. These cutoffs were used to accommodate the known lognormal distribution of proliferation index,21 but do not correspond exactly to Nottingham–Bloom–Richardson cutoffs.5 The exact numerical score per 10 high-power fields was also recorded for 404 tumors. Grades were computed from the individual scores using the cutoffs stated above for tubularity and nuclear size. Intervals of 0–2, 3–10, and >10 were used for mitotic index because they most closely divided the population into three groups of equal size.

Assessment of Proliferation Index by DNA Precursor Uptake

Proliferation index was measured as DNA labeling index by in vitro tritiated thymidine (1985–1987), or 5-bromo-2′-deoxyuridine (bromodeoxyuridine) uptake (1987–1998) as previously described.22 Proliferation index was measured on approximately 95% of primary breast carcinoma specimens accessioned during the time of the study. Omissions occurred because of very small tumors or on rare occasion technical error. One of the 631 carcinomas with recorded proliferation index was excluded from the analysis because only in situ carcinoma was available in the specimen on which the assay was performed.

Statistical Analysis and Ethical Review

Levels of agreement among pathologists were assessed by concordance proportion (C=number of cases on which two specified pathologists agreed/number of cases) and by the kappa (κ) statistic wherein <0 is interpreted as poor, 0–0.2 as slight, 0.2–0.4 fair, 0.4–0.6 moderate, 0.6–0.8 good, and >0.8 almost perfect.23

Survival intervals were calculated from date of definitive surgery. Relapse was defined as clinically evident recurrent tumor in any site other than in ipsilateral or contralateral breast. Survival analysis was performed by the Kaplan–Meier product limit method24 with evaluation by log-rank test. Patients without relapse were censored at time of last information if alive or at death if tumor-free. Variables with univariate prognostic significance at P<0.05 in two-tailed tests and patient's age were compared by Cox's proportional hazards multivariate analysis using the statistical package JMP version 5.0 (SAS Institute, Inc., Cary, NC, USA).

The study was approved by the Review Board for Human Subjects Research of St Luke's Hospital.

Results

Nuclear Figure Classification

Seven pathologists examined 43 nuclear figure photographs. Yes/no scoring for mitosis showed a fair level of agreement with Cavg=0.69, κavg=0.38. Agreement for type of mitotic figure was moderate (Cavg=0.73, κavg=0.54), while agreement for type of nonmitotic hyperchromatic figure was also only fair (Cavg=0.38, κavg=0.23). Unanimous agreement was reached for 14 objects of which four were classified as mitotic figures and 10 as not. Six of seven pathologists agreed on another 10 objects of which two were nearly unanimous for and eight against mitosis. Concordance proportions for pairs of pathologists varied from 0.54 to 0.90. On scanning the individual responses, agreement appeared to be good for metaphase and anaphase. Two pathologists never designated a prophase. One pathologist recognized four of four prophases as classified under the microscope by the photographer, but only two others recognized any of these photographs as prophases. An en face metaphase, as classified by direct microscopy, was designated prophase by one pathologist. One telophase, as recognized by the photographer, was shown. Two pathologists designated it as telophase and four designated not a mitotic figure.

Tumor Grading Reproducibility

Results were similar in agreement among participants for the five separate trials (Table 3). Kappa statistics indicate moderate agreement on tubularity, fair to moderate agreement on nuclear pleomorphism, moderate agreement on mitotic grade, and moderate agreement on overall tumor grade. Although unanimous agreement was achieved less often than not, differences in scoring of more than one point were infrequent (Table 4). In the fourth and fifth trials, proportions of prophases were recorded. They varied from 0.1 to 3.4%, mean 1.4% in trial 4. Average pairwise concordance on proportion of mitotic figures that were in prophase was 0.37, and average pairwise κ was 0.13 (slight agreement). In the fifth trial, proportions of prophases varied from 0.2 to 3.2%, mean 0.9%. Average pairwise concordance was 0.41, average pairwise κ was 0.09 (slight agreement).

Table 3 Results of tumor grading by five to seven pathologists
Table 4 Detailed breakdown of agreement among seven pathologists, trial 2

Outcome Assessment

Cause-specific relapse-free survival analyses are shown in Figures 1, 2 and 3. Cutoffs for covariates in survival plots are listed in Table 5. Univariate analysis showed a number of univariate predictors for relapse-free tumor-specific survival (Figures 1, 2 and 3, Table 5). Cutoffs of 0–2, 3–10, and >10 mitotic figures per 10 high-power fields (Figure 2) were more effective in determining prognostic groups (P=0.0093) than cutoffs of 0–5, 6–10, and >10, which corresponded more closely to those of the Nottingham–Bloom–Richardson system and divided the tumors into groups with a log-rank significance level of P=0.021 (survival plot not shown).

Figure 1
figure 1

Cause-specific relapse-free survival plot of 631 node-negative breast carcinomas according to age, size of invasive tumor in centimeter, lymphatic or blood vessel invasion, or thymidine/bromodeoxyuridine labeling index.

Figure 2
figure 2

Cause-specific relapse-free survival plot of 631 node-negative breast carcinomas according to tubularity, nuclear size, nucleolar size, and mitotic index.

Figure 3
figure 3

Tumor-specific relapse-free survival plot of 631 node-negative breast carcinomas according to the modified Bloom–Richardson grade and with modified Bloom–Richardson grade mitotic figures per 10 high-power fields replaced by thymidine/bromodeoxyuridine labeling index. Cutoffs for labeling index were 3 and 8%.

Table 5 Cutoffs for variables in node-negative outcome study

Variables with independent predictive power for cause-specific relapse-free survival were tumor size, lymph-angioinvasion, architectural grade, and labeling index (S-phase fraction by thymidine or bromodeoxyuridine uptake) in multivariate analysis (Table 6). Independently predictive variables for cause-specific overall survival were tumor size, lymph-angioinvasion, tubularity, labeling index, and age (Table 7).

Table 6 Final proportional hazards model for cause-specific relapse-free survival
Table 7 Final proportional hazards model for cause-specific survival

When only Nottingham–Bloom–Richardson grading components were entered into the model, of those components only tubularity was independently predictive (P=0.0095). When overall tumor grade was entered into the model, it became the only significant predictor (P<0.05), displacing tubularity. When tumor size or labeling index by tritiated thymidine or bromodeoxyuridine uptake were added, either one displaced tumor grade and was the only significant predictor.

Discussion

Recognition and Classification of Mitotic Figures

Agreement was complete on a substantial number of images, showing that participants basically agreed on criteria for recognition of mitotic figures, but reproducibility in classification of mitotic and other nuclear hyperchromatic figures from photographs was in the category of ‘fair’ by concordance and kappa statistics. Although quality of the photographic prints was judged to be good, lack of concordance may in part be a result of slight loss of resolution in the photographic process and inability to evaluate nuclear figures by through-focusing. Through-focus digital imaging should be considered for future studies.

Prophases represented a negligible proportion of total mitotic figures recognized. Therefore, miscounting of prophases can account for only a small portion of variability in mitotic counts from one observer to another. Variations in the number of prophases recognized by different observers could be ascribed to differences in opinions as to the degree of chromatin condensation necessary to define a prophase. Occasionally figures thought to be en face metaphase plates by one observer were interpreted by another observer as prophase. Prophase is the longest phase of mitosis, but high resolution is required to recognize 0.2–0.3 μm chromosomal filaments present in early prophase.25 We recommend that prophases be excluded from mitotic counts performed for cancer grading.

Bloom–Richardson Grading

Reproducibility

Our results resemble those of most published studies wherein modest to good levels of agreement among independent observers for the Nottingham–Bloom–Richardson grading system and mostly fair to moderate agreement for its three components have been demonstrated (Table 3). This and other studies show that while disagreements of one step (grade 1 vs grade 2, grade 2 vs grade 3) have been common, discrepancies of more than one grade seldom occur.7, 26 Agreement did not improve in successive trials after review of criteria and group meetings. This outcome suggests that the level of agreement achievable is limited by the subjectivity of grading criteria. Prior studies and ours have shown that agreement is particularly poor for the nuclear component. Nuclear size and nucleolar size and, with more difficulty, nuclear shape are susceptible to quantitative or semiquantitative measurement. No precise standards have been set forth, and measurements are not formally made, in estimating the ‘nuclear pleomorphism’ of the Bloom–Richardson system. The situation is made complex by inclusion of more than one variable. A nucleus can be small or large, regularly or irregularly contoured, and have a large or small nucleolus. The grader must make a decision, without clear guidance, on how to classify the nucleus with mixed indicators. There is little wonder for poor reproducibility. The mitotic count also has its pitfalls. The common dictum is to count mitotic figures where they are most numerous, but without sampling the whole specimen one does not know where to begin. Nonrandom sampling introduces bias. We recommend selection of areas with maximal cellularity and best nuclear fixation. Finally, mitotic figures can be difficult to distinguish from apoptoses, pyknoses, and irregularities of nuclear staining.

Relative importance of components of Bloom–Richardson grade

We reviewed 20 studies that compared Bloom–Richardson grading components and overall grade for prognostic efficacy. Six studies were conducted on node-negative carcinomas,26, 27, 28, 29, 30, 31 13 on mixed node-negative/positive carcinomas,8, 16, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42 and two on node-positive carcinoma.42, 43 Architecture (tubularity) was the primary independent predictor in three,27, 29, 43 nuclear pleomorphism by subjective assessment in two,33, 35 morphometric measurements of nuclear area in one,36 and mitotic count in eight studies.28, 30, 31, 32, 38, 39, 40, 41 Mitotic count and overall grade predicted equally in one study.42 Mitotic index contributed to prediction along with another grading component in another four studies.8, 27, 29, 36 Overall grade was more predictive than any of its components in three studies.8, 16, 37 These published results favor mitotic index as the principal prognostic component of Bloom–Richardson grading systems. Our results also favor mitotic index along with a contribution from tubularity. Furthermore, we found that proliferation index by DNA precursor uptake was superior to mitotic index.

Can the Nottingham–Bloom–Richardson system beimproved?

The Nottingham–Bloom–Richardson system placed a disproportionate fraction of our node-negative carcinomas in the low mitotic index category. In 405 tumors for which the exact number of mitotic figures per 10 high-power fields was recorded, 67.7% were in the low interval, 11.4% mid, and 20.0% high. The counts were log-normally distributed with median=4, mean=8.6, mode=0, range 0–113 per 10 high-power fields of 0.18 mm2. Nottingham–Bloom–Richardson system scores 0–6 mitotic figures per 10 high-power fields as 1, 7–12 as 2, and >12 as 3 when corrected for field size of 0.183 mm2. Clayton reported a median of 2.5 mitotic figures/10 high-power fields with field size adjusted to 0.183 mm.28 His mean was 6.5, and 30% of tumors had 6.9 or more.25 Genestie et al39 reported that only 2% of the tumors they analyzed, all less than 3 cm diameter, surpassed the mid to high third Nottingham–Bloom–Richardson cutoff. Medri et al44 recorded 58% in the low group, 18% mid, 24% high. Proliferation index is correlated with tumor size,16 but relatively small tumor sizes in these and our studies would not likely account for so uneven a distribution by tertiles when published Nottingham–Bloom–Richardson cutoffs were used. Breast carcinomas with proliferation index in the lower third of the distribution have distinctly better survival results than those in the mid or high thirds.45, 46 Our cutoffs for labeling index, which divided a large set of invasive breast carcinomas of various stages into equal thirds,47 yielded a nearly proportional distribution of 40.2% low, 27.5% mid, and 32.4% high in the node-negative set. Furthermore, we found that a low cutoff of 2 mitotic figures per 10 high-power fields defined a group with very good prognosis (Figure 2) in comparison with the result using standard Nottingham–Bloom–Richardson cutoffs.44 Our lower cutoff for labeling index tertiles of 3% was equivalent to a mitotic index of approximately 3.3% by correlation analysis of our 404 patients for whom the exact number of mitotic figures per 10 high-power fields was recorded (r=0.63, P<0.0001). For the entire population, a cutoff of 2 mitotic figures per 10 high-power fields with the mid-interval defined by 3–10 mitotic figures per 10 high-power fields gave better separation of relapse-free survival and overall survival plots (P=0.0093 and 0.0015, respectively) than a cutoff of 5, mid-interval 6–10.

More than one multiple of 10 high-power fields must be counted to classify specimens with low mitotic counts reproducibly. For example, given population-frequency one mitotic figure per 10 high-power fields, the binomial theorem predicts a 7% probability for more than 2 mitotic figures in a single count of 10 high-power fields. If 20 high-power fields are counted and reduced to frequencies per 10 high-power fields with rounding to the lower integer, probabilities would be 1.1% for a result of 3 or more. A count of 30 high-power fields would reduce the risk of an error at the critical 2 per 10 high-power fields cutoff to 0.2%. More than 10 high-power fields need not be counted for accurate classification at the 2 per 10 high-power field threshold when the initial result is >10 per high-power field.

Theissig et al47 showed superior results when subjective nuclear pleomorphism was replaced by nuclear image cytometry, but this measurement introduces a more complex technology. We reported independent prognostic efficacy of nuclear grading by semiquantitative nuclear measurements through use of an optical grid.48 This method bypasses considerations of variation in nuclear size, nuclear features, and chromatin distribution that can complicate reproducibility of ‘nuclear pleomorphism’ assessment. While nuclear grade is usually the weakest of the Nottingham–Bloom–Richardson components, it seems to be stronger if measured by quantitative methods with potential for reproducibility.

Proliferation Index vs Grade for Prognosis

DNA precursor uptake

We used tritiated thymidine and autoradiography for labeling index measurement on the first 20% of tumors before switching to bromodeoxyuridine immunohistochemistry. The two methods utilize similar biochemical pathways and give equivalent results.21 Multivariate analysis demonstrated that the labeling index was an independent predictor of outcome, displacing Bloom–Richardson grade and its components nuclear grade and mitotic index from the multivariate model. Nonetheless, grade was a highly significant univariate predictor. Given that the covariates of outcome were extensively correlated, and interactions with therapy may have existed, ranking of covariates for predictive power might vary from one similar study to another. Our results, together with supporting findings from studies with other proliferation markers (see below), indicate that accurate immunohistochemical measurements of proliferation index will be at least equivalent to tumor grade for prognosis.

Reproducibility of DNA labeling has been demonstrated by replicate counting,49, 50 replicate sampling,49, 51 and comparison between primary breast cancer and lymph nodal metastasis.50 Approximately 25 min is required to count 2000 cells visually. A coefficient of variation of 4% can be achieved by replicate visual counts of 2000 cells when the labeling index is high. Although absolute differences decline with decreasing labeling index, interobserver coefficients of variation increase because of fewer labeled cells counted.49 Field selection also contributes to interobserver variability. In all, 61% of 65 primary breast cancers were regionally heterogeneous for labeling index by criterion of significantly more intersample variance than intrasample variance.51 Correlation coefficient for labeling index of breast carcinoma in biopsy vs mastectomy specimen was r=0.89, and for primary breast carcinoma vs axillary lymph node metastasis r=0.90.50 Counting larger numbers of cells and from multiple sites within tumors is feasible by semiautomation through image analysis and can be expected to reduce errors, particularly for tumors with low labeling index. Automation is more difficult to adapt to traditional grading. Despite errors introduced into labeling index measurement by visual counting of limited numbers of cells and tumor heterogeneity, labeling index has proven to be sufficiently reliable to demonstrate efficacy in prognosis and prediction of therapeutic response (reviewed by Medri et al52).

Choice of method for proliferation index measurement

The most practical marker for proliferation index is unsettled. Voices for retention of simple mitotic counting can still be heard.31, 53 We favor further evaluation of immunohistochemical staining procedures because of potential for automation. Uptake of DNA precursors tritiated thymidine or bromodeoxyuridine is attractive because of easy quantitation and excellent biologic rationale. This technique detects cells in the process of DNA synthesis, which is a prerequisite to cell division. It is advantageous over mitotic figure counting because the number of S-phase cells is much larger than that of mitotic cells, and difficulties in distinguishing mitotic cells from apoptoses and other nuclear changes are avoided. Its disadvantage is requirement for incubation of viable tissue, which makes it essentially impractical for routine use. Ki-67 index by MIB-1 antibody measures cycling cells (growth fraction54), which include G1, S, G2 and M cells rather than S phase alone. It is readily quantified, roughly proportional to the bromodeoxyuridine labeling index,55 has prognostic power at least comparable to that of tumor grade, and resembles DNA precursor uptake in its correlation with histologic and biochemical features of breast carcinoma.56, 57, 58, 59, 60 The International Breast Cancer Study Group found the MIB-1 Ki-67 index measured by image analysis to be superior to mitotic index.57 This appears to be the method of choice as present. The proliferation markers topoisomerase IIα45 and KiS246 also proved to be better predictors of breast carcinoma outcome than Bloom–Richardson grade. KiS2 is particularly promising because of specificity for the latter portion of the cell cycle, S phase, G2, and M. Cells in S and G2 phases are strongly committed to entering mitosis.

Immunohistochemical measurement of proliferation index is amenable to computer-processed image analysis. Image analysis has been shown to agree nearly perfectly with visual counts for estimation of Ki-67 index.61 When evaluated by computerized image analysis, the MIB-1 Ki-67 index was an independent prognostic indicator for operable breast carcinoma in general,56 and it predicted survival in node-negative patients.57 Proliferation index is a marker that now can be measured efficiently. Along with tumor size, axillary lymph nodal status, angiolymphatic invasion, architectural grade (tubular–glandular differentiation), and hormonal receptor and HER2 status it can produce a meaningful phenotypic profile of breast carcinoma. Bloom–Richardson grade does not add information predictive for outcome to this profile. The profile is readily obtainable in modern pathology laboratories, and with moderate cost.

Other Independent Markers for Node-Negative Breast Carcinoma Prognosis

We identified tumor size, lymphatic invasion, tubularity and age along with proliferation index as independent predictors of outcome for node-negative breast carcinoma. Tumor grade or mitotic index became independent predictors only when labeling index was omitted from the multivariate model. Tumor size is currently the most powerful single outcome predictor for node-negative cancer. Lymph-angioinvasion is a measure of metastatic potential that becomes more ominous when the tumor is large or the rate of cell proliferation is high. It has related strongly to relapse in node-negative cancer57 and notably in small (T1a, T1b) breast carcinomas,29, 58 in which efficacy of established markers including proliferation index is difficult to demonstrate. Its impact is limited because it is found in only a minority of breast carcinomas; it was detected in 13.4% of our node-negative tumors. Tubularity also contributed significantly in our patients. The effect of age, which was significant only for overall survival, was minimal.

Other established prognostic markers include proteolytic enzymes, HER2/neu expression, bone marrow assay for epithelial cells, microvascular counts, and gene expression analysis. Estrogen and progesterone receptors and HER2/neu have established roles in prediction of response to therapy. All of these methods can be characterized as phenotypic. Gene transcription assays have shown promising early results.62 Which approach, phenotypic or genomic, will combine the best in efficacy and economy is yet to be determined. A recent report describes retrospective use of gene product screening in 668 tamoxifen-treated patients with invasive breast carcinoma and negative axillary lymph nodes to stratify them into three risk categories.63 Resultant distant recurrence-free survival plots were substantially similar to relapse-free survival plots of our patients phenotypically and prospectively stratified as low, mid and high DNA labeling index (Figure 1).