Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jan 1;31(1):137-9.
doi: 10.1093/bioinformatics/btu607. Epub 2014 Sep 10.

UNDO: a Bioconductor R package for unsupervised deconvolution of mixed gene expressions in tumor samples

Affiliations

UNDO: a Bioconductor R package for unsupervised deconvolution of mixed gene expressions in tumor samples

Niya Wang et al. Bioinformatics. .

Abstract

Summary: We develop a novel unsupervised deconvolution method, within a well-grounded mathematical framework, to dissect mixed gene expressions in heterogeneous tumor samples. We implement an R package, UNsupervised DecOnvolution (UNDO), that can be used to automatically detect cell-specific marker genes (MGs) located on the scatter radii of mixed gene expressions, estimate cellular proportions in each sample and deconvolute mixed expressions into cell-specific expression profiles. We demonstrate the performance of UNDO over a wide range of tumor-stroma mixing proportions, validate UNDO on various biologically mixed benchmark gene expression datasets and further estimate tumor purity in TCGA/CPTAC datasets. The highly accurate deconvolution results obtained suggest not only the existence of cell-specific MGs but also UNDO's ability to detect them blindly and correctly. Although the principal application here involves microarray gene expressions, our methodology can be readily applied to other types of quantitative molecular profiling data.

Availability and implementation: UNDO is available at http://bioconductor.org/packages.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
(a) The scatterplot of mixing gene expression profiles (MCF7 and HS27 cell lines), which forms a scatter sector (a sector-shaped distribution). (b) The scatterplot of recovered pure cell gene expression profiles (MCF7 and HS27 cell lines). (c) The flowchart of UNDO algorithm
Fig. 2.
Fig. 2.
(a) The scatterplot of the estimated versus true gene expression profile of MCF7 cell line. (b) The scatterplot of the estimated versus true gene expression profile of HS27 cell line

References

    1. Ahn J, et al. DeMix: deconvolution for mixed cancer transcriptomes using raw measured data. Bioinformatics. 2013;29:1865–1871. - PMC - PubMed
    1. Carter SL, et al. Absolute quantification of somatic DNA alterations in human cancer. Nat. Biotechnol. 2012;30:413–421. - PMC - PubMed
    1. Chen L, et al. Tissue-specific compartmental analysis for dynamic contrast-enhanced MR imaging of complex tumors. IEEE Trans. Med. Imaging. 2011;30:2044–2058. - PMC - PubMed
    1. Clarke J, et al. Statistical expression deconvolution from mixed tissue samples. Bioinformatics. 2010;26:1043–1049. - PMC - PubMed
    1. Clarke R, et al. The properties of high-dimensional data spaces: implications for exploring gene and protein expression data. Nat. Rev. Cancer. 2008;8:37–49. - PMC - PubMed

Publication types