Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 29;20(4):e1012084.
doi: 10.1371/journal.pcbi.1012084. eCollection 2024 Apr.

Reconstruction Set Test (RESET): A computationally efficient method for single sample gene set testing based on randomized reduced rank reconstruction error

Affiliations

Reconstruction Set Test (RESET): A computationally efficient method for single sample gene set testing based on randomized reduced rank reconstruction error

H Robert Frost. PLoS Comput Biol. .

Abstract

We have developed a new, and analytically novel, single sample gene set testing method called Reconstruction Set Test (RESET). RESET quantifies gene set importance based on the ability of set genes to reconstruct values for all measured genes. RESET is realized using a computationally efficient randomized reduced rank reconstruction algorithm (available via the RESET R package on CRAN) that can effectively detect patterns of differential abundance and differential correlation for self-contained and competitive scenarios. As demonstrated using real and simulated scRNA-seq data, RESET provides superior performance at a lower computational cost relative to other single sample approaches.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Workflow representation of the randomized version of RESET as defined by Algorithm 3.
Fig 2
Fig 2. Classification performance of RESET.det, RESET.ran, VAM, GSVA, ssGSEA, and PLAGE on scRNA-seq data simulated according to the block design for a single gene set as detailed in the Methods section in S1 Text.
Each panel illustrates the relationship between the area under the receiver operating characteristic curve (AUC) and one of the simulation parameters. The vertical dotted lines mark the default parameter value used in the other panels. Error bars represent the standard error of the mean.
Fig 3
Fig 3. Classification performance of RESET.det, RESET.ran, VAM, GSVA, ssGSEA, and PLAGE on scRNA-seq data simulated according to the complex design for a single gene set as detailed in the Methods section of S1 Text.
Each panel illustrates the relationship between the area under the receiver operating characteristic curve (AUC) and one of the simulation parameters. The vertical dotted lines mark the default parameter value used in the other panels. Error bars represent the standard error of the mean.
Fig 4
Fig 4. Classification performance of RESET.det, RESET.ran, VAM, GSVA, ssGSEA, and PLAGE on scRNA-seq data simulated according to the pure self-contained design for a single gene set as detailed in the Methods section in S1 Text.
Each panel illustrates the relationship between the area under the receiver operating characteristic curve (AUC) and one of the simulation parameters. The vertical dotted lines mark the default parameter value used in the other panels. Error bars represent the standard error of the mean.
Fig 5
Fig 5. Classification performance of RESET.det, RESET.ran, RESET.pervar, VAM, GSVA, ssGSEA, and PLAGE on scRNA-seq data simulated according to the pure competitive design for disjoint and equally sized genes as detailed in the Methods section in S1 Text.
Note that RESET.det and RESET.pervar have identical performance in this case since the gene sets all have the same size. Each panel illustrates the relationship between the area under the receiver operating characteristic curve (AUC) and one of the simulation parameters. The vertical dotted lines mark the default parameter value used in the other panels. Error bars represent the standard error of the mean.
Fig 6
Fig 6. Overall classification performance of RESET.det, RESET.ran and RESET.pervar on scRNA-seq data simulated according to the block design with disjoint and equal sized gene sets as detailed in the Methods section in S1 Text.
Each panel illustrates the relationship between the area under the receiver operating characteristic curve (AUC) and one of the simulation parameters. The vertical dotted lines mark the default parameter value used in the other panels. Error bars represent the standard error of the mean.
Fig 7
Fig 7. Visualization of cell type BioCarta pathway enrichment as computed using either VAM or RESET scores on the PBMC scRNA-seq data.

Update of

Similar articles

Cited by

References

    1. Allison DB, Cui X, Page GP, Sabripour M. Microarray data analysis: from disarray to consolidation and consensus. Nature Reviews Genetics. 2006;7(1):55–65. doi: 10.1038/nrg1749 - DOI - PubMed
    1. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al.. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–15550. doi: 10.1073/pnas.0506580102 - DOI - PMC - PubMed
    1. Khatri P, Sirota M, Butte AJ. Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Computational Biology. 2012;8(2):e1002375. doi: 10.1371/journal.pcbi.1002375 - DOI - PMC - PubMed
    1. Hung JH, Yang TH, Hu Z, Weng Z, Delisi C. Gene set enrichment analysis: performance evaluation and usage guidelines. Brief Bioinform. 2012;13(3):281–91. doi: 10.1093/bib/bbr049 - DOI - PMC - PubMed
    1. Maciejewski H. Gene set analysis methods: statistical models and methodological differences. Brief Bioinform. 2014;15(4):504–18. doi: 10.1093/bib/bbt002 - DOI - PMC - PubMed

Publication types