Portraits of breast cancer progression
- PMID: 17683614
- PMCID: PMC1978212
- DOI: 10.1186/1471-2105-8-291
Portraits of breast cancer progression
Abstract
Background: Clustering analysis of microarray data is often criticized for giving ambiguous results because of sensitivity to data perturbation or clustering techniques used. In this paper, we describe a new method based on principal component analysis and ensemble consensus clustering that avoids these problems.
Results: We illustrate the method on a public microarray dataset from 36 breast cancer patients of whom 31 were diagnosed with at least two of three pathological stages of disease (atypical ductal hyperplasia (ADH), ductal carcinoma in situ (DCIS) and invasive ductal carcinoma (IDC). Our method identifies an optimum set of genes and divides the samples into stable clusters which correlate with clinical classification into Luminal, Basal-like and Her2+ subtypes. Our analysis reveals a hierarchical portrait of breast cancer progression and identifies genes and pathways for each stage, grade and subtype. An intriguing observation is that the disease phenotype is distinguishable in ADH and progresses along distinct pathways for each subtype. The genetic signature for disease heterogeneity across subtypes is greater than the heterogeneity of progression from DCIS to IDC within a subtype, suggesting that the disease subtypes have distinct progression pathways. Our method identifies six disease subtype and one normal clusters. The first split separates the normal samples from the cancer samples. Next, the cancer cluster splits into low grade (pathological grades 1 and 2) and high grade (pathological grades 2 and 3) while the normal cluster is unchanged. Further, the low grade cluster splits into two subclusters and the high grade cluster into four. The final six disease clusters are mapped into one Luminal A, three Luminal B, one Basal-like and one Her2+.
Conclusion: We confirm that the cancer phenotype can be identified in early stage because the genes altered in this stage progressively alter further as the disease progresses through DCIS into IDC. We identify six subtypes of disease which have distinct genetic signatures and remain separated in the clustering hierarchy. Our findings suggest that the heterogeneity of disease across subtypes is higher than the heterogeneity of the disease progression within a subtype, indicating that the subtypes are in fact distinct diseases.
Figures







Similar articles
-
Analysis of breast cancer progression using principal component analysis and clustering.J Biosci. 2007 Aug;32(5):1027-39. doi: 10.1007/s12038-007-0102-4. J Biosci. 2007. PMID: 17914245
-
Breast cancer stratification from analysis of micro-array data of micro-dissected specimens.Genome Inform. 2007;18:130-40. Genome Inform. 2007. PMID: 18546481
-
Exploratory consensus of hierarchical clusterings for melanoma and breast cancer.IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):138-52. doi: 10.1109/TCBB.2008.33. IEEE/ACM Trans Comput Biol Bioinform. 2010. PMID: 20150676
-
HER2 as a prognostic factor in breast cancer.Oncology. 2001;61 Suppl 2:67-72. doi: 10.1159/000055404. Oncology. 2001. PMID: 11694790 Review.
-
How many etiological subtypes of breast cancer: two, three, four, or more?J Natl Cancer Inst. 2014 Aug 12;106(8):dju165. doi: 10.1093/jnci/dju165. Print 2014 Aug. J Natl Cancer Inst. 2014. PMID: 25118203 Free PMC article. Review.
Cited by
-
Changes in serum and exudate creatine phosphokinase concentrations as an indicator of deep tissue injury: a pilot study.Int Wound J. 2008 Dec;5(5):674-80. doi: 10.1111/j.1742-481X.2008.00543.x. Int Wound J. 2008. PMID: 19134069 Free PMC article.
-
Identification of the YES1 Kinase as a Therapeutic Target in Basal-Like Breast Cancers.Genes Cancer. 2010 Oct;1(10):1063-73. doi: 10.1177/1947601910395583. Genes Cancer. 2010. PMID: 21779430 Free PMC article.
-
Simultaneous class discovery and classification of microarray data using spectral analysis.J Comput Biol. 2009 Jul;16(7):935-44. doi: 10.1089/cmb.2008.0227. J Comput Biol. 2009. PMID: 19580522 Free PMC article.
-
A Bayesian approach for inducing sparsity in generalized linear models with multi-category response.BMC Bioinformatics. 2015;16 Suppl 13(Suppl 13):S13. doi: 10.1186/1471-2105-16-S13-S13. Epub 2015 Sep 25. BMC Bioinformatics. 2015. PMID: 26423345 Free PMC article.
-
Modelling gene expression profiles related to prostate tumor progression using binary states.Theor Biol Med Model. 2013 May 31;10:37. doi: 10.1186/1742-4682-10-37. Theor Biol Med Model. 2013. PMID: 23721350 Free PMC article.
References
-
- Gruvberger S, Ringner M, Chen Y, Panavally S, Saal LH, Borg A, Ferno M, Peterson C, Meltzer PS. Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns. Cancer Res. 2001;61:5979–5984. - PubMed
-
- Mauriac L. Aromatase inhibitors: Effective endocrine therapy in the early adjuvant setting for postmenopausal women with hormone-responsive breast cancer. Best Pract Res Clin Endocrinol Metab. 2006;20:S15–29.
-
- Morris SR, Carey LA. Molecular profiling in breast cancer. Rev Endocr Metab Disord. 2007 - PubMed
-
- Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, et al. Molecular portraits of human breast tumours. Nature. 2000;406:747–752. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials
Miscellaneous