Introduction

The immunohistochemical surrogate of molecular subclasses of breast cancers proposed by the Saint Gallen Consensus Meetings [13] is largely used to classify patients in different risk categories. However, the main drawback of this classification is the low consistency in classifying Luminal A and Luminal B/HER2-negative (HER2−) carcinomas. Actually, both of them are estrogen receptor positive (ER+) and HER2−, but Luminal B category should identify patients with worse prognosis, endorsed for chemotherapy in addition to hormone blockage. Therefore, to differentiate Luminal A from Luminal B/HER2− breast cancers carries important therapeutic implications. For this aim, the Saint Gallen Guidelines [1] recommend to assess the proliferation index by Ki67, a nuclear marker of cell proliferation that is expressed in all cell cycle phases, except G0 [4]. A Luminal B breast cancer should show a higher proliferation index than Luminal A; however, the Ki67 cut-off for differentiating these two categories changed over time. The 2011 Saint Gallen Consensus Meeting defined as “low proliferation” tumors with a Ki67 index <14 % [3], a cut-off established by comparison with PAM50 intrinsic multigene molecular test classification of Luminal cancers [5]. Otherwise, during the 2013 Saint Gallen Conference, the majority of panelists voted that a threshold of ≥20 % was indicative of “high” Ki67 status. At the same time, several works reported low reproducibility of Ki67 results, mainly in the subset of cancers with intermediate proliferation activity (between 15 and 30 %) [610]. Thus, in March 2015, during the last Saint Gallen Conference [1], the use of the median Ki67 value of local laboratory was proposed as the cut-off, and accepted by the panel of experts.

In our Institution, the median Ki67 value (tested on 547 consecutive luminal invasive breast cancers, operated between January and December 2014) is 14 %. Thus, we analyzed a large retrospective cohort of Luminal ER+/HER2− breast cancers, with long follow-up, with the following aims: (1) to establish which is the optimal Ki67 cut-off (14 vs 20 %) to stratify cases according to disease-free interval (DFI) and disease-specific survival (DSS) at 5 and 10 years, and (2) to assess the prognostic significance of intermediate Ki67 values. In addition, considering that patient prognosis depends on tumor and patient characteristics, we investigated the role of proliferation activity within a comprehensive prognostic index, which uses independent variables to stratify patients having “good” or “poor” prognosis.

Patients and methods

Study design and population

We investigated 1688 female patients with ER+/HER2− primary breast cancers who underwent surgery from June 1994 to December 2012 at the Breast Unit of the Città della Salute e della Scienza of Torino, University Hospital of Torino, Italy. Ethical approval for this study was obtained from the Ethical Committee of our Institution.

The following clinico-pathological data were obtained from clinical charts: age at diagnosis, type of surgery (conservative surgery vs radical mastectomy), therapy, type, and site of recurrence. In addition, data regarding size (<15 vs ≥15 mm), histological type and grade of tumor, nodal involvement, vascular invasion, Progesterone Receptor (PR), ER, HER2, and Ki-67 was obtained from pathological reports. In particular, for what concerns Ki67, we set cut-points at 14 % (low) and 20 % (high), and of PR at 20 % as recommended by St. Gallen experts [2, 3]. Complete clinical and histopathological data were available for 1577 out 688 patients.

Statistical analyses

Pearson’s Chi-square test and Student’s t test were preliminary performed to compare categorical and continuous variables, respectively, and to evaluate potential differences in the variable distribution among groups. The disease-free interval (DFI) was calculated from the date of surgical excision of the primary tumor to the date of the first relapse or last check-up. Disease-specific survival (DSS) was calculated from the surgical excision date of the primary tumor to the date of breast cancer death or last check-up. Survival distribution curves were plotted using the Kaplan–Meyer method, and the statistical comparisons were performed using the log-rank test. Cox regression analyses were carried out on DFI and DSS to calculate crude and adjusted HRs and 95 % CIs for the different study group. Cases lost to follow-up and cases with a non-breast cancer related deaths were censored at the last follow-up control. A model was created for evaluating the prognostic role of different variables. The proportional hazard assumption was assessed with the Schoenfeld residuals. This did not give reasons to suspect violation of this assumption. The nature of variables (continue/categorical) included in the model was evaluated considering literature reports and the results of the log-likelihood ratio test. Akaike information criterion (AIC) was used for model selection. All statistical tests were two sided. p values <0.05 were considered significant. Statistical analyses were performed using Stata/SE12.0 Statistical Software (STATA, College Station, TX).

Results

The characteristics of the 1688 patients with ER+/HER2− breast cancer stratified according to Ki67 cut-offs are reported in Tables 1 and 2. DFI and DSS were evaluated in patients with low (<14 %), intermediate (14–19 %) and high (≥20 %) Ki67 levels. Patients affected by cancers with low and intermediate Ki67 values had similar DFI and DSS, whereas patients with high Ki67 proliferation index had lower DFI and DSS than patients in the two former groups (Log-rank test DFI p < 0.001; DSS p < 0.001), Fig. 1.

Table 1 The characteristics of the 1688 patients with ER+/HER2− breast cancer stratified according to Ki67 cut-offs
Table 2 Classification on the basis of Ki67 and PgR values
Fig. 1
figure 1

DFI and DSS in low, intermediate, and high Ki67 patients

Univariate Cox analyses estimates reported in Table 2 showed that tumor size ≥15 mm, nodal involvement, high grade, and vascular invasion negatively impact on DFI and DSS, together with a high-Ki67 index. As expected, cancers having such features were treated with more aggressive surgery and radiotherapy (Table 1). PR >20 % failed to play a favorable prognostic role on DSS (Table 3).

Table 3 Univariate analyses on DFI and DSS

Multivariable analyses (Table 4) were performed on 1577 patients with complete clinical and histopathological data. The proportionality assumption was satisfied both on DFI (p = 0.6469) and DSS (p = 0.8008). Tumor size ≥15 mm, nodal involvement, 20 % Ki67 cut-point maintained an unfavorable role on DFI and DSS.

Table 4 Multivariate analyses on DFI and DSS

To create a comprehensive index associated to “good” and “poor” prognosis and based on multivariate analysis, a score was attributed to each significant variable according to its HR. Thus, a score value of 1 was given to tumors >15 mm and tumors with Ki67 ≥ 20 %, whereas a score 0 was given to tumors ≤15 mm and tumors with Ki67 <20 %. Four score values were used for lymph nodes (0: lymph nodes free of metastases; 1: from 1 to 3 metastatic lymph nodes, 2: from 4 to 9 metastatic lymph nodes; 3: more than 9 metastatic lymph nodes). The comprehensive prognostic index was created using the following formula: (tumor size score value) + (number of metastatic lymph nodes score value) + (Ki67 score value), determining a range of 0–5. Kaplan Maier analysis was then carried out for each value. We found that 575 had a final index of 0; 481 of 1; 303 of 2, 147 of 3, 40 of 4, and 21 of 5. Following the performance curves, we set the index cut-off at 3, indicating <3 a good prognosis and ≥3 a poor prognosis. Patients with an index ≥3 had a significant increased risk of relapse (DFI: HR = 4.74, CI 3.46–6.51, p < 0.001) and of death (DSS: HR = 5.03, CI 3.19–7.94, p < 0.001) Fig. 2.

Fig. 2
figure 2

DFI and DSS in patients stratified on the basis of good (<3) or poor (≥3) score

We then analyzed the outcome of patients according to treatment (1040 hormonal therapy alone; 451 chemotherapy followed by hormone therapy). Eighty-six patients were not treated (comorbidities, older age, therapy refused) and were thus excluded from further analyses. Treatment type did not change DFI and DSS of patients with a score <3 (good prognosis) Fig. 3. On the contrary, patients with poor prognosis (score ≥ 3) treated with hormonal therapy alone had a DSS disadvantage, compared to patients treated with adjuvant chemotherapy before hormonal treatment Fig. 4.

Fig. 3
figure 3

DFI and DSS in patients with good score (<3) stratified on the basis of therapy

Fig. 4
figure 4

DFI and DSS in patients with poor score (≥3) stratified on the basis of therapy

Discussion

The first result of our study is that Luminal breast cancer patients with low (<14 %) and intermediate (14–19 %) Ki67 have the same DFI and DSS and a better prognosis than patients with high (≥20 %) Ki67. This result confirms that, within ER+/HER2− breast cancers, the 20 % Ki67 cut-off can reliably (1) discriminate patients at low or high risk of recurrence and death, and (2) stratify patients at higher risk, eligible to adjuvant chemotherapy before hormone therapy.

The usefulness of Ki67 for decision on breast cancer management is a matter of discussion. A number of oncologists advise not to rely on Ki67 in the clinical practice, because of its low reproducibility [211]. Additionally, the American National Comprehensive Cancer Network (NCCN) guidelines do not provide any information regarding Ki67 immunohistochemical assessment and its role in breast cancer prognosis [12].

However, in Europe, the Saint Gallen Consensus Meeting had suggested to use Ki67 for classifying Luminal cancers since 2009. Still, uncertainty regarding the definition of highly proliferating tumors remains. In 2011, a cut-off of 14 % [3] was proposed, on the basis of the study by Cheang et al. [5], but 2 years later, it was upgraded to 20 % [2]. In the same year, Denkert et al. [13] proposed to considered Ki67 a continuum variable and suggested that the cut-points are context dependent and may change according to the prognostic or predictive role given to Ki67 itself. Thus, they suggested that the scientific community could define Ki67 cut-points depending on the study purpose [14]. To solve the literature ambiguities, we evaluated Ki67 in a subset of patients with ER+ (Luminal) breast cancer, with the aim to verify its impact on prognosis. First of all, we calculated the median value of Ki67 in our case series, as suggested by the last Saint Gallen Consensus Meeting [1] and showed that it corresponded to the value of 14 % identified by Cheang [5] as able to discriminate Luminal A from Luminal B molecular subtypes. Patients with a low (<14 %) and intermediate (14–19 %) Ki67 value showed a better DFI and DSS than those with higher Ki67 (≥20 %). As a consequence, when the purpose is to recognize high-risk patients, the 20 % cut-off is more reliable and reproducible than the 14 % cut-off. Moreover, Cserni et al. [15] showed that, in the routine practice, the proportion of Ki67 positive cells “tends to cluster around values ending with 5 or 0,” and accordingly, they proposed to select cut-off values ending with one of these two numbers.

Although we confirm that Ki67 is a reliable marker for identifying patients at low and high risk of recurrences/death, we firmly believe that Ki67 percentage has to be assessed following International Guidelines [16] and that the laboratory should be strictly subjected to local and external quality control assurance [17].

Moreover, in line with several studies suggesting that one marker is not sufficient to stratify prognosis of breast cancer patients, we designed a comprehensive prognostic index combining Ki67 with tumor size and number of metastatic lymph nodes. This index was highly predictive of DSS and DFI, at both uni- and multivariate analyses, confirming that tumor burden and proliferative index remain the most important parameters in ER+ breast cancer prognosis, as suggested by several other studies [1822]. Actually, both tumor size and the number of positive axillary lymph nodes are traditionally accepted as important prognostic factors in breast cancer patients [23, 24], and molecular tools such as Endopredict® and PAM50® include these parameters, in association with specific gene expression, to create a recurrence score called EPclin® and Prosigna® ROR score, respectively. Since a number of genes in these assays are related to proliferative index and to ER pathway, we can reasonably suggest that the use of our prognostic index in Luminal cancer could “surrogate” the multigene prognostic test results.

In addition, we found that, within the subset of patients with a poor prognostic index (≥3), those treated with hormonal therapy alone showed a poorer DSS than those receiving also adjuvant chemotherapy. This result suggests that our prognostic index may be useful to identify patients that need a more aggressive treatment.

Although our study is based on a retrospective and a monocentric dataset of patients, which could represent possible biases, in our Institute, two different pathologists, skilled in breast pathology, assess Ki67 expression, and treatment is decided during multidisciplinary meetings, allowing for discussion among different clinicians.

In conclusion, we confirm the reliability of Ki67 as a prognostic marker in Luminal breast cancers, using a cut-off value of 20 %, and we stress the important role of Ki67 in the clinical management of patients. In addition, waiting for molecular test accessibility through the healthcare system, Ki67 together with tumor size and lymph nodal status may be useful to identify ER+ breast cancer patients with adverse prognostic outcome that need combined chemotherapy and hormonal therapy.