1,162
Views
14
CrossRef citations to date
0
Altmetric
Editorial

Normal cell phenotypes of breast epithelial cells provide the foundation of a breast cancer taxonomy

&

Abstract

The current classification system for breast cancer is based on expression of empirical prognostic and predictive biomarkers. As an alternative, we propose a hypothesis-based ontological breast cancer classification modeled after the taxonomy of species in evolutionary biology. This approach uses normal breast epithelial cell types and differentiation lineages as the gold standard to classify tumors. We show that there are at least eleven previously undefined normal cell types in human breast epithelium and that each breast carcinoma is related to one of these normal cell types. We find that triple negative breast cancers do not have a ‘basal-like’ phenotype. Normal breast epithelial cells conform to four novel hormonal differentiation states and almost all human breast tumors duplicate one of these hormonal differentiation states which have significant survival differences. This ontological classification scheme provides actionable treatment strategies and provides an alternative approach for understanding tumor biology with wide-ranging implications for tumor taxonomy.

Traditionally, the development of a taxonomy of a disease entity has been based upon an understanding of the underlying pathogenesis of a particular disease. Once a disease is defined as a single and pathophysiologically uniform entity, various clinical and molecular prognostic features are then used to define the severity of the disease.

This paradigm has been difficult to follow for classification of cancer due to our lack of understanding of the underlying mechanisms. In the case of breast cancer, an empirical system has been developed over the past three decades without a clear underlying organizing principle. The widely accepted paradigm for the classification of human breast cancers has been to group tumors into three categories based on the presence of estrogen receptor (ER+), progesterone receptor (PR+) and human EGF receptor 2 (HER2+), or by their absence in triple-negative breast cancers (ER/PR/HER2-,TNBC). These categories are based on the expression of molecular targets that predict response to different types of treatment such as with the ER-antagonist tamoxifen, the selective estrogen receptor down-regulator fulvestrant and the anti-HER2 monoclonal antibody herceptin. Though pragmatic for dictating clinical treatment, such an ad-hoc classification scheme does not provide insights about the true phylogeny of breast cancer.

In recent years, purely prognostic molecular classification schemes have been proposed to replace the above-described empirical classification system for breast cancer. Several high-throughput molecular tools and associated statistical methods such as mRNA expression profiles have been used to define several prognostic subgroups of breast cancer: luminal A, luminal B, basal-like, claudin-low and Her2-like Citation[1,2]. Likewise, DNA methylation patterns have been used to identify five distinct groups Citation[3], and 10 different breast cancer subtypes have been identified based on a DNA copy number based genetic classification system Citation[4,5].

However, while prognostic categories subdivide diagnostic categories into distinct outcome groups, they cannot be the sole basis of a comprehensive classification approach. The principle reason for this is that in a purely prognostic approach the only criterion that distinguishes two entities is their difference in clinical outcome. Hence, two different entities with the similar outcome but with different underlying mechanisms of pathogenesis cannot be distinguished with this approach; such as heart attacks versus strokes. This is not a trivial issue since differences in pathophysiology may reasonably require very different treatment approaches. In addition, a purely prognostic approach may end up categorizing two different stages of a single disease as different entities; such as three vessel coronary artery disease vs. one vessel disease.

Consequently, purely molecular prognostic approaches have not yet led to a comprehensive classification system. Furthermore, there has been little overlap among the mRNA expression, DNA copy number and methylation-based prognostic groups, because they are not based on a common pathophysiology Citation[6]. As a result, a breast cancer task force recently concluded that, at the moment, molecular tools do not provide sufficiently robust information beyond histological type, grade and ER, PR and HER2 status Citation[7], and these molecular tests are therefore not routinely performed for diagnostic purposes at most institutions Citation[8].

We set out to provide a pathophysiological framework that could provide a biological setting in which prognostic categories could be discovered Citation[9]. Notably, the phylogeny of normal cell types has been successfully used as a reference point to classify lymphomas and leukemias Citation[10]. The discovery of morphologic and molecular similarities between the various subtypes of leukemias and lymphomas with normal hematopoietic cell types was very important in this process and has been an important factor in the successful classification and treatment of many hematopoietic malignancies.

In solid tissues, an in-depth characterization of the normal cell subtypes has been very difficult. Until recently, only two cell types – luminal and myoepithelial cells – had been described in the human breast Citation[11]. This limited understanding of the cell types that comprise normal breast tissue has precluded a normal cell type-based classification system for breast cancer. Inspired by the classification of hematopoietic malignancies, we hypothesized that a more detailed description of normal cell types in the human breast may be important for the effective classification of human breast tumors.

With this goal in mind, we recently analyzed more than 15,000 normal breast cells and described the normal phylogeny of cell subtypes in the luminal layer of human breast Citation[9]. We identified molecules that have bimodal patterns of expression (i.e., ‘on’ or ‘off’) in the luminal and myoepithelial layers of the breast. We first started with intermediate filament markers such as cytokeratins, which we found to be particularly useful, especially CKs 5, 7, 8, 14, 17, 18 and 19. This characterization showed that CKs 7 and 18 and Claudin-4 are expressed in all luminal layer cells but that they are not expressed in the myoepithelial layer. Conversely, CD10, smooth muscle action (SMA) and p63 are expressed in all of the myoepithelial cells but not in the cells comprising the luminal layer of normal breast. Of note, this analysis revealed important insights into the expression of cytokeratins such as CKs 5 and 14 that had previously been considered as ‘basal’ keratins. CKs 5 and 14 were presumed to have expression restricted to the normal myopepithelial cells, and this misconception was the basis for defining CK5- and CK14-positive breast cancers as ‘basal-like’ (a subset of triple-negative breast cancers). Our observations and those of others Citation[12–14] support that CKs 5 and 14 have been mistakenly referred to as ‘basal keratins’ and that they are clearly expressed in luminal cells in the lobules of normal human breast. Moreover, our analysis of tumors shows that the name ‘basal-like’ is not an appropriate description of the differentiation state or the cell-of-origin of CKs 5 and 14 expressing breast cancers as this differentiation state is similar to the subset of normal luminal cells of the breast that express CKs 5 and 14 – a distinct normal luminal cell population that does not express ER or PR.

While characterizing additional protein expression patterns in the set of over 15,000 normal breast cells, we noted the bimodal expression of the ER, the androgen receptor (AR) and the vitamin D receptor (VDR) in normal luminal cells. A comprehensive assessment of these cells using double and triple immunofluorescence analyses and a novel multiplexed immunostaining technology platform Citation[15] showed that the luminal cells conform to four hormone receptor differentiation groups based on ER, AR and VDR expression in normal human breast cells – HR0 cells expressing none of these receptors, HR1 cells expressing only one of these three receptors, HR2 cells expressing any two of these receptors and HR3 cells expressing all three of the receptors. In summary, our results indicate that the composition of normal breast epithelium is much more complex than previously appreciated – our breast taxonomy comprises at least 11 cellular differentiation states in normal human breast lobules, which can be divided into four hormone receptor groups (HR0, 1, 2 and 3; ).

Figure 1. Putative differentiation lineage hierarchy of normal human breast. Stem cells are mostly quiescent and they rarely proliferate, but they give rise to a rapidly proliferating finite lifespan progenitor cells with multi-lineage differentiation potential. The more differentiated cell types increasingly become mitotically less active, and finally the terminally differentiated cells become post-mitotic. Among the normal breast cell types, only K18[+] cells were highly proliferative, which makes them the best candidate for the transit amplifying cells. We attempted to organize the rest of the breast cell types in a way where each differentiation step involves gain or loss of a single marker. Based on this constraint, we postulate that transit amplifying cell first loses its proliferative capacity which coincides with VDR upregulation giving rise to an oligo-potential progenitor (L6). This cell either maintains K18 expression and gains ER and AR expression, giving rise to luminal HR+ cell types (L4-5 and L9-11), or it down-regulates K18 and up-regulates K5 (L7). When this cell down-regulates VDR (L3) and up-regulates SMA and p63, it generates the typical K5[+], HR– and K18– myoeptihelial cell type (M2). As this cell down-regulates K5, it generates the second subpopulation of K5– myoepithelial cells (M1).The above model depicts one possible scenario among many. Describing the interrelatedness of the breast cell types we identified and understanding their differentiation lineage hierarchy will require in vivo and functional ex vivo experiments that are not currently possible for technical reasons.

Figure 1. Putative differentiation lineage hierarchy of normal human breast. Stem cells are mostly quiescent and they rarely proliferate, but they give rise to a rapidly proliferating finite lifespan progenitor cells with multi-lineage differentiation potential. The more differentiated cell types increasingly become mitotically less active, and finally the terminally differentiated cells become post-mitotic. Among the normal breast cell types, only K18[+] cells were highly proliferative, which makes them the best candidate for the transit amplifying cells. We attempted to organize the rest of the breast cell types in a way where each differentiation step involves gain or loss of a single marker. Based on this constraint, we postulate that transit amplifying cell first loses its proliferative capacity which coincides with VDR upregulation giving rise to an oligo-potential progenitor (L6). This cell either maintains K18 expression and gains ER and AR expression, giving rise to luminal HR+ cell types (L4-5 and L9-11), or it down-regulates K18 and up-regulates K5 (L7). When this cell down-regulates VDR (L3) and up-regulates SMA and p63, it generates the typical K5[+], HR– and K18– myoeptihelial cell type (M2). As this cell down-regulates K5, it generates the second subpopulation of K5– myoepithelial cells (M1).The above model depicts one possible scenario among many. Describing the interrelatedness of the breast cell types we identified and understanding their differentiation lineage hierarchy will require in vivo and functional ex vivo experiments that are not currently possible for technical reasons.

Table 1. Normal human breast cell types.

The striking heterogeneity in the molecular characteristics of individual cells in normal breast epithelium paralleled the distinct profiles of normal hematopoietic cell populations, so we next assessed whether breast carcinomas resemble hematological malignancies with tumor cells maintaining cell type/differentiation specific patterns of protein expression that reflects the patterns observed in their non-neoplastic counterparts. Remarkably, when we compared the 11 normal breast cell types that we had identified with more than 3000 human breast tumors, we found that the vast majority (>95%) of patient tumors could be placed precisely in this normal cell type phylogeny as could most of over 60 cell lines that are commonly used for studying breast cancer. In addition, almost none of the breast cancers exhibit a pure basal-like phenotype as defined by the expression of true myoepithelial markers and absence of any luminal markers. Strikingly, when we classified the breast tumors from over 1800 patients from the Nurses’ Health Study according to the HR categories we had defined in normal breast luminal cells (HR0–3), we found a very strong association between the number of receptors expressed in a breast carcinoma and the 5-year survival of the patient – with patients with HR3+ tumors having the best survival and patients with HR0 tumors having the worst survival. We noted similar results analyzing survival based on the mRNA expression patterns of these hormone receptors from a different breast cancer cohort Citation[16] and demonstrated effects on growth by modulating the activity of AR and VDR. In all, this data suggests that evaluating the HR status of a breast cancer could provide diagnostic, prognostic as well as predictive value.

Hence, our efforts offer an alternative approach for tumor classification that differs from efforts that are focused on developing a comprehensive molecular analysis based exclusively on tumor genomic information. While such genomic efforts are clearly revealing new targetable lesions for treating some cancers, these efforts may ultimately not provide a rational classification system – particularly in tumors which have very complex molecular genetic aberrations, where each individual has a tumor with a nearly unique set of genetic changes. Likely, these ‘-omics’ approaches have not yielded the anticipated results because they have low morphologic resolution, lack objective points of reference and, most importantly, they are not hypothesis driven. These shortcomings can result in a loss of tumor lineage information, can lead to redundant classification schemes and can split tumors into smaller and smaller arbitrary groups. Instead, we propose a very different approach by using normal cell types as a gold standard to classify tumors and offer an approach for assessing risk on information garnered from analysis of cells (i.e., cell-based risk assessment rather than on expression based on analyses of homogenized cells). In normal tissues, each cell subtype is designed to perform a specific function. Since these functions are defined and finite, the maximum number of biologically important normal cell types is limited, unchanging and can be precisely defined. Tumors are similarly restricted. Therefore, our method objectively constrains the arbitrary splitting of tumors into endless subclasses and provides a durable context within which molecular data may be accurately interpreted.

What we propose here is a stepwise classification system that places tumors into lineage-based diagnostic categories based on their distinct tissue of origin, cell-of-origin and differentiation lineage. Upon defining uniform lineage-based classes, we propose to use molecular and genetic classifiers to distinguish prognostic subsets within each lineage.

Financial & competing interests disclosure

The authors acknowledge funding support from Breast Cancer Research Foundation, Play for P.I.N.K., NCI grant R01-CA146445-01, NIH Roadmap Epigenomics Project (to TA Ince) and NIH grant K08NS064168 (to S Santagata). TA Ince discloses related patent filing under review. T Ince was a scientific advisor to 30M Inc. (2007–2012). S Santagata was cofounder of and scientific advisor to Bayesian Diagnostics. The authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.

No writing assistance was utilized in the production of this manuscript.

Notes

References

  • Prat A, Parker JS, Karginova O, et al. Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer. Breast Cancer Res 2010;12:R68
  • Prat A, Perou CM. Deconstructing the molecular portraits of breast cancer. Mol Oncol 2010;5:5-23
  • TCGA. Comprehensive molecular portraits of human breast tumours. Nature 2012;490:61-70
  • Curtis C, Shah SP, Chin SF, et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 2012;486:346-52
  • Dawson SJ, Rueda OM, Aparicio S, Caldas C. A new genome-driven integrated classification of breast cancer and its implications. EMBO J 2013;32:617-28
  • Yaffe MB. The scientific drunk and the lamppost: massive sequencing efforts in cancer discovery and treatment. Sci Signal 2013;6:pe13
  • Guiu S, Michiels S, Andre F, et al. Molecular subclasses of breast cancer: how do we define them? The IMPAKT 2012 Working Group Statement. Ann Oncol 2012;23:2997-3006
  • Schnitt SJ. Classification and prognosis of invasive breast cancer: from morphology to molecular taxonomy. Mod Pathol 2010;23(Suppl 2):S60-4
  • Santagata S, Thakkar A, Ergonul A, et al. Taxonomy of breast cancer based on normal cell phenotype predicts outcome. J Clin Invest 2014;124:859-70
  • Swerdlow S, Campo E, Harris NL, et al. WHO classification of tumours of haematopoietic and lymphoid tissues. International Agency for Research on Cancer; Lyon, France: 2008
  • Jones C, Mackay A, Grigoriadis A, et al. Expression profiling of purified normal human luminal and myoepithelial breast cells identification of novel prognostic markers for breast cancer. Cancer Res 2004;64:3037-45
  • Molyneux G, Geyer FC, Magnay FA, et al. BRCA1 basal-like breast cancers originate from luminal epithelial progenitors and not from basal stem cells. Cell Stem Cell 2010;7:403-17
  • Gusterson B. Do ’basal-like’ breast cancers really exist? Nat Rev Cancer 2009;9:128-34
  • Gusterson BA, Ross DT, Heath VJ, Stein T. Basal cytokeratins and their relationship to the cellular origin and functional classification of breast cancer. Breast Cancer Res 2005;7:143-8
  • Gerdes MJ, Sevinsky CJ, Sood A, et al. Highly multiplexed single-cell analysis of formalin-fixed, paraffin-embedded cancer tissue. Proc Natl Acad Sci USA 2013;110:11982-7
  • Harrell JC, Prat A, Parker JS, et al. Genomic analysis identifies unique signatures predictive of brain, lung, and liver relapse. Breast Cancer Res Treat 2012;132:523-35

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.