Robust unmixing of tumor states in array comparative genomic hybridization data
- PMID: 20529894
- PMCID: PMC2881397
- DOI: 10.1093/bioinformatics/btq213
Robust unmixing of tumor states in array comparative genomic hybridization data
Abstract
Motivation: Tumorigenesis is an evolutionary process by which tumor cells acquire sequences of mutations leading to increased growth, invasiveness and eventually metastasis. It is hoped that by identifying the common patterns of mutations underlying major cancer sub-types, we can better understand the molecular basis of tumor development and identify new diagnostics and therapeutic targets. This goal has motivated several attempts to apply evolutionary tree reconstruction methods to assays of tumor state. Inference of tumor evolution is in principle aided by the fact that tumors are heterogeneous, retaining remnant populations of different stages along their development along with contaminating healthy cell populations. In practice, though, this heterogeneity complicates interpretation of tumor data because distinct cell types are conflated by common methods for assaying the tumor state. We previously proposed a method to computationally infer cell populations from measures of tumor-wide gene expression through a geometric interpretation of mixture type separation, but this approach deals poorly with noisy and outlier data.
Results: In the present work, we propose a new method to perform tumor mixture separation efficiently and robustly to an experimental error. The method builds on the prior geometric approach but uses a novel objective function allowing for robust fits that greatly reduces the sensitivity to noise and outliers. We further develop an efficient gradient optimization method to optimize this 'soft geometric unmixing' objective for measurements of tumor DNA copy numbers assessed by array comparative genomic hybridization (aCGH) data. We show, on a combination of semi-synthetic and real data, that the method yields fast and accurate separation of tumor states.
Conclusions: We have shown a novel objective function and optimization method for the robust separation of tumor sub-types from aCGH data and have shown that the method provides fast, accurate reconstruction of tumor states from mixed samples. Better solutions to this problem can be expected to improve our ability to accurately identify genetic abnormalities in primary tumor samples and to infer patterns of tumor evolution.
Supplementary information: Supplementary data are available at Bioinformatics online.
Figures









Similar articles
-
Applying unmixing to gene expression data for tumor phylogeny inference.BMC Bioinformatics. 2010 Jan 20;11:42. doi: 10.1186/1471-2105-11-42. BMC Bioinformatics. 2010. PMID: 20089185 Free PMC article.
-
Inference of tumor phylogenies from genomic assays on heterogeneous samples.J Biomed Biotechnol. 2012;2012:797812. doi: 10.1155/2012/797812. Epub 2012 May 13. J Biomed Biotechnol. 2012. PMID: 22654484 Free PMC article.
-
Smoothing waves in array CGH tumor profiles.Bioinformatics. 2009 May 1;25(9):1099-104. doi: 10.1093/bioinformatics/btp132. Epub 2009 Mar 10. Bioinformatics. 2009. PMID: 19276148
-
Analysis of Copy-Number Alterations in Single Cells Using Microarray-Based Comparative Genomic Hybridization (aCGH).Curr Protoc Cell Biol. 2014 Dec 1;65:22.19.1-23. doi: 10.1002/0471143030.cb2219s65. Curr Protoc Cell Biol. 2014. PMID: 25447076 Review.
-
Zoom-in array comparative genomic hybridization (aCGH) to detect germline rearrangements in cancer susceptibility genes.Methods Mol Biol. 2010;653:221-35. doi: 10.1007/978-1-60761-759-4_13. Methods Mol Biol. 2010. PMID: 20721746 Review.
Cited by
-
A simplicial complex-based approach to unmixing tumor progression data.BMC Bioinformatics. 2015 Aug 12;16:254. doi: 10.1186/s12859-015-0694-x. BMC Bioinformatics. 2015. PMID: 26264682 Free PMC article.
-
ISOpureR: an R implementation of a computational purification algorithm of mixed tumour profiles.BMC Bioinformatics. 2015 May 14;16:156. doi: 10.1186/s12859-015-0597-x. BMC Bioinformatics. 2015. PMID: 25972088 Free PMC article.
-
Deconvolution and phylogeny inference of structural variations in tumor genomic samples.Bioinformatics. 2018 Jul 1;34(13):i357-i365. doi: 10.1093/bioinformatics/bty270. Bioinformatics. 2018. PMID: 29950001 Free PMC article.
-
Novel multisample scheme for inferring phylogenetic markers from whole genome tumor profiles.IEEE/ACM Trans Comput Biol Bioinform. 2013 Nov-Dec;10(6):1422-31. doi: 10.1109/TCBB.2013.33. IEEE/ACM Trans Comput Biol Bioinform. 2013. PMID: 24407301 Free PMC article.
-
Medoidshift clustering applied to genomic bulk tumor data.BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):6. doi: 10.1186/s12864-015-2302-x. BMC Genomics. 2016. PMID: 26817708 Free PMC article.
References
-
- Atkins JH, Gershell LJ. From the analyst's couch: selective anticancer drugs. Nat. Rev. Cancer. 2002;2:645–646. - PubMed
-
- Beerenwinkel N, et al. Mtreemix: a software package for learning and using mixture models of mutagenetic trees. Bioinformatics. 2005;21:2106–2107. - PubMed
-
- Bild AH, et al. Opinion: linking oncogenic pathways with therapeutic opportunities. Nat. Rev. Cancer. 2006;6:735–741. - PubMed
-
- Boyd S, Vandenberghe L. Convex Optimization. New York, NY: Cambridge University Press; 2004.
-
- Chan T, et al. A convex analysis based minimum-volume enclosing simplex algorithm for hyperspectral unmixing. IEEE Trans. Signal Proc. 2009;57:4418–4432.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources