Driver gene mutations based clustering of tumors: methods and applications
- PMID: 29950003
- PMCID: PMC6022677
- DOI: 10.1093/bioinformatics/bty232
Driver gene mutations based clustering of tumors: methods and applications
Abstract
Motivation: Somatic mutations in proto-oncogenes and tumor suppressor genes constitute a major category of causal genetic abnormalities in tumor cells. The mutation spectra of thousands of tumors have been generated by The Cancer Genome Atlas (TCGA) and other whole genome (exome) sequencing projects. A promising approach to utilizing these resources for precision medicine is to identify genetic similarity-based sub-types within a cancer type and relate the pinpointed sub-types to the clinical outcomes and pathologic characteristics of patients.
Results: We propose two novel methods, ccpwModel and xGeneModel, for mutation-based clustering of tumors. In the former, binary variables indicating the status of cancer driver genes in tumors and the genes' involvement in the core cancer pathways are treated as the features in the clustering process. In the latter, the functional similarities of putative cancer driver genes and their confidence scores as the 'true' driver genes are integrated with the mutation spectra to calculate the genetic distances between tumors. We apply both methods to the TCGA data of 16 cancer types. Promising results are obtained when these methods are compared to state-of-the-art approaches as to the associations between the determined tumor clusters and patient race (or survival time). We further extend the analysis to detect mutation-characterized transcriptomic prognostic signatures, which are directly relevant to the etiology of carcinogenesis.
Availability and implementation: R codes and example data for ccpwModel and xGeneModel can be obtained from http://webusers.xula.edu/kzhang/ISMB2018/ccpw_xGene_software.zip.
Supplementary information: Supplementary data are available at Bioinformatics online.
Figures





Similar articles
-
Identifying cancer driver genes in tumor genome sequencing studies.Bioinformatics. 2011 Jan 15;27(2):175-81. doi: 10.1093/bioinformatics/btq630. Epub 2010 Dec 17. Bioinformatics. 2011. PMID: 21169372 Free PMC article.
-
Cancer driver gene discovery through an integrative genomics approach in a non-parametric Bayesian framework.Bioinformatics. 2017 Feb 15;33(4):483-490. doi: 10.1093/bioinformatics/btw662. Bioinformatics. 2017. PMID: 27797769 Free PMC article.
-
Snowball: resampling combined with distance-based regression to discover transcriptional consequences of a driver mutation.Bioinformatics. 2015 Jan 1;31(1):84-93. doi: 10.1093/bioinformatics/btu603. Epub 2014 Sep 5. Bioinformatics. 2015. PMID: 25192743 Free PMC article.
-
Whole-exome sequencing reveals recurrent somatic mutation networks in cancer.Cancer Lett. 2013 Nov 1;340(2):270-6. doi: 10.1016/j.canlet.2012.11.002. Epub 2012 Nov 12. Cancer Lett. 2013. PMID: 23153794 Review.
-
[Current topics in mutations in the cancer genome].Nihon Geka Gakkai Zasshi. 2012 Mar;113(2):185-90. Nihon Geka Gakkai Zasshi. 2012. PMID: 22582578 Review. Japanese.
Cited by
-
A transcriptomic signature for prostate cancer relapse prediction identified from the differentially expressed genes between TP53 mutant and wild-type tumors.Sci Rep. 2022 Jun 22;12(1):10561. doi: 10.1038/s41598-022-14436-y. Sci Rep. 2022. PMID: 35732666 Free PMC article.
-
Different Genomic Clusters Impact on Responses in Advanced Biliary Tract Cancer Treated with Cisplatin Plus Gemcitabine Plus Durvalumab.Target Oncol. 2024 Mar;19(2):223-235. doi: 10.1007/s11523-024-01032-5. Epub 2024 Feb 12. Target Oncol. 2024. PMID: 38345693
-
Identification of Cancer Driver Genes by Integrating Multiomics Data with Graph Neural Networks.Metabolites. 2023 Feb 24;13(3):339. doi: 10.3390/metabo13030339. Metabolites. 2023. PMID: 36984779 Free PMC article.
-
Uncovering the Understanding of the Concept of Patient Similarity in Cancer Research and Treatment: Scoping Review.J Med Internet Res. 2025 Aug 18;27:e71906. doi: 10.2196/71906. J Med Internet Res. 2025. PMID: 40824687 Free PMC article.
-
Classifying Breast Cancer Molecular Subtypes by Using Deep Clustering Approach.Front Genet. 2020 Nov 25;11:553587. doi: 10.3389/fgene.2020.553587. eCollection 2020. Front Genet. 2020. PMID: 33324444 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources