Snowball: resampling combined with distance-based regression to discover transcriptional consequences of a driver mutation
- PMID: 25192743
- PMCID: PMC4271146
- DOI: 10.1093/bioinformatics/btu603
Snowball: resampling combined with distance-based regression to discover transcriptional consequences of a driver mutation
Abstract
Motivation: Large-scale cancer genomic studies, such as The Cancer Genome Atlas (TCGA), have profiled multidimensional genomic data, including mutation and expression profiles on a variety of cancer cell types, to uncover the molecular mechanism of cancerogenesis. More than a hundred driver mutations have been characterized that confer the advantage of cell growth. However, how driver mutations regulate the transcriptome to affect cellular functions remains largely unexplored. Differential analysis of gene expression relative to a driver mutation on patient samples could provide us with new insights in understanding driver mutation dysregulation in tumor genome and developing personalized treatment strategies.
Results: Here, we introduce the Snowball approach as a highly sensitive statistical analysis method to identify transcriptional signatures that are affected by a recurrent driver mutation. Snowball utilizes a resampling-based approach and combines a distance-based regression framework to assign a robust ranking index of genes based on their aggregated association with the presence of the mutation, and further selects the top significant genes for downstream data analyses or experiments. In our application of the Snowball approach to both synthesized and TCGA data, we demonstrated that it outperforms the standard methods and provides more accurate inferences to the functional effects and transcriptional dysregulation of driver mutations.
Availability and implementation: R package and source code are available from CRAN at http://cran.r-project.org/web/packages/DESnowball, and also available at http://bioinfo.mc.vanderbilt.edu/DESnowball/.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Figures





Similar articles
-
Efficient methods for identifying mutated driver pathways in cancer.Bioinformatics. 2012 Nov 15;28(22):2940-7. doi: 10.1093/bioinformatics/bts564. Epub 2012 Sep 14. Bioinformatics. 2012. PMID: 22982574
-
Analysis of 7,815 cancer exomes reveals associations between mutational processes and somatic driver mutations.PLoS Genet. 2018 Nov 9;14(11):e1007779. doi: 10.1371/journal.pgen.1007779. eCollection 2018 Nov. PLoS Genet. 2018. PMID: 30412573 Free PMC article.
-
Driver gene mutations based clustering of tumors: methods and applications.Bioinformatics. 2018 Jul 1;34(13):i404-i411. doi: 10.1093/bioinformatics/bty232. Bioinformatics. 2018. PMID: 29950003 Free PMC article.
-
Genomic Characterization of Differentiated Thyroid Carcinoma.Endocrinol Metab (Seoul). 2019 Mar;34(1):1-10. doi: 10.3803/EnM.2019.34.1.1. Endocrinol Metab (Seoul). 2019. PMID: 30912334 Free PMC article. Review.
-
Design and analysis issues in genome-wide somatic mutation studies of cancer.Genomics. 2009 Jan;93(1):17-21. doi: 10.1016/j.ygeno.2008.07.005. Epub 2008 Aug 23. Genomics. 2009. PMID: 18692126 Free PMC article. Review.
Cited by
-
An integrative genomics approach for identifying novel functional consequences of PBRM1 truncated mutations in clear cell renal cell carcinoma (ccRCC).BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):515. doi: 10.1186/s12864-016-2906-9. BMC Genomics. 2016. PMID: 27556922 Free PMC article.
-
In-depth genomic data analyses revealed complex transcriptional and epigenetic dysregulations of BRAFV600E in melanoma.Mol Cancer. 2015 Mar 14;14:60. doi: 10.1186/s12943-015-0328-y. Mol Cancer. 2015. PMID: 25890285 Free PMC article.
References
-
- Altucci L, et al. RAR and RXR modulation in cancer and metabolic disease. Nat. Rev. Drug Discov. 2007;6:793–810. - PubMed
-
- Borlak J, Jenke HS. Cross-talk between aryl hydrocarbon receptor and mitogen-activated protein kinase signaling pathway in liver cancer through C-RAF transcriptional regulation. Mol. Cancer Res. 2008;6:1326–1336. - PubMed
-
- Breiman L. Random forests. Mach. Learn. 2001;45:5–32.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials