Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 19;14(11):e0224693.
doi: 10.1371/journal.pone.0224693. eCollection 2019.

ADAPTS: Automated deconvolution augmentation of profiles for tissue specific cells

Affiliations

ADAPTS: Automated deconvolution augmentation of profiles for tissue specific cells

Samuel A Danziger et al. PLoS One. .

Abstract

Immune cell infiltration of tumors and the tumor microenvironment can be an important component for determining patient outcomes. For example, immune and stromal cell presence inferred by deconvolving patient gene expression data may help identify high risk patients or suggest a course of treatment. One particularly powerful family of deconvolution techniques uses signature matrices of genes that uniquely identify each cell type as determined from single cell type purified gene expression data. Many methods from this family have been recently published, often including new signature matrices appropriate for a single purpose, such as investigating a specific type of tumor. The package ADAPTS helps users make the most of this expanding knowledge base by introducing a framework for cell type deconvolution. ADAPTS implements modular tools for customizing signature matrices for new tissue types by adding custom cell types or building new matrices de novo, including from single cell RNAseq data. It includes a common interface to several popular deconvolution algorithms that use a signature matrix to estimate the proportion of cell types present in heterogenous samples. ADAPTS also implements a novel method for clustering cell types into groups that are difficult to distinguish by deconvolution and then re-splitting those clusters using hierarchical deconvolution. We demonstrate that the techniques implemented in ADAPTS improve the ability to reconstruct the cell types present in a single cell RNAseq data set in a blind predictive analysis. ADAPTS is currently available for use in R on CRAN and GitHub.

PubMed Disclaimer

Conflict of interest statement

Our commercial affiliation with the Institute for Systems Biology and Celgene Corporation has no other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products. No author has any competing interest that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to this journal. This (commercial affiliation) does not alter our adherence to PLOS ONE policies on sharing data and materials.

Figures

Fig 1
Fig 1. Overview of ADAPTS modules.
New gene expression data from cell types (e.g. from a tumor microenvironment) will be used to construct a new signature matrix de novo or by augmenting an existing signature matrix. First ADAPTS will rank marker genes for the cell types using the function rankByT described in Eq 4. Then ADAPTS adds marker genes in rank order using the function AugmentSigMatrix as described in Algorithm 1 resulting in a new signature matrix. This matrix may be tested for spillover between cell types using the function spillToConvergence described in Algorithm 2. Finally, ADAPTS separates cell types with heavy spillover using the hierarchical deconvolution function hierarchicalSplit described in Algorithm 3 to estimate the percentage of cell types present in bulk gene expression data.
Fig 2
Fig 2. MGSM27 construction.
Curve showing the selection of an optimal condition number for MGSM27.
Fig 3
Fig 3. LM22 spillover matrix.
Spillover matrix showing mean misclassification of purified samples for LM22. Rows show purified cell types and columns show what those samples deconvolve as. Cells are colored by percentage, such that each row adds up to 100%. For example, if the row is ‘B.cell.memory’, the column is ‘Plasma.cells’, then the color is light blue indicating that purified ‘B.cell.memory’ samples deconvolve as containing (on average) 18% ‘Plasma.cells’.
Fig 4
Fig 4. LM22 converged spillover matrix.
Iterative deconvolution shows how easily confused cell types conspicuously form clusters. Rows show purified cell types and columns show what those samples deconvolve as. Cells are colored by percentage, such that each row adds up to 100%.
Fig 5
Fig 5. scRNAseq signature matrix construction.
Curve showing the selection of an optimal condition number for the single cell RNAseq augmented signature matrix data.
Fig 6
Fig 6. Clustering of Top 100 gene signature matrix.
The cell type clusters identified using the signature matrix constructed from the 100 genes with the highest variance across cell types in the single cell data drawn from a normal pancreas sample. Rows show purified cell types and columns show what those samples deconvolve as. Cells are colored by percentage, such that each row adds up to 100%.
Fig 7
Fig 7. Clustering of augmented gene signature matrix.
The cell type clusters identified using the augmented signature matrix that was seeded with the 100 genes exhibiting the highest variance in the normal pancreas sample. Rows show purified cell types and columns show what those samples deconvolve as. Cells are colored by percentage, such that each row adds up to 100%.

Similar articles

Cited by

References

    1. Thorsson V, Gibbs DL, Brown SD, Wolf D, Bortone DS, Yang THO, et al. The Immune Landscape of Cancer. Immunity. 2018;48(4):812–830.e14. 10.1016/j.immuni.2018.03.023 - DOI - PMC - PubMed
    1. Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, Scherer F, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nature Biotechnology. 2019; p. 1. - PMC - PubMed
    1. Erkkilä T, Lehmusvaara S, Ruusuvuori P, Visakorpi T, Shmulevich I, Lähdesmäki H. Probabilistic analysis of gene expression measurements from heterogeneous tissues. Bioinformatics. 2010;26(20):2571–2577. 10.1093/bioinformatics/btq406 - DOI - PMC - PubMed
    1. Lähdesmäki H, Shmulevich L, Dunmire V, Yli-Harja O, Zhang W. In silico microdissection of microarray data from heterogeneous cell populations. BMC Bioinformatics. 2005;6(1):54 10.1186/1471-2105-6-54 - DOI - PMC - PubMed
    1. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nature Methods. 2015;12(5):453–457. 10.1038/nmeth.3337 - DOI - PMC - PubMed

Publication types