Network-based cancer heterogeneity analysis incorporating multi-view of prior information
- PMID: 35561185
- PMCID: PMC9113254
- DOI: 10.1093/bioinformatics/btac183
Network-based cancer heterogeneity analysis incorporating multi-view of prior information
Abstract
Motivation: Cancer genetic heterogeneity analysis has critical implications for tumour classification, response to therapy and choice of biomarkers to guide personalized cancer medicine. However, existing heterogeneity analysis based solely on molecular profiling data usually suffers from a lack of information and has limited effectiveness. Many biomedical and life sciences databases have accumulated a substantial volume of meaningful biological information. They can provide additional information beyond molecular profiling data, yet pose challenges arising from potential noise and uncertainty.
Results: In this study, we aim to develop a more effective heterogeneity analysis method with the help of prior information. A network-based penalization technique is proposed to innovatively incorporate a multi-view of prior information from multiple databases, which accommodates heterogeneity attributed to both differential genes and gene relationships. To account for the fact that the prior information might not be fully credible, we propose a weighted strategy, where the weight is determined dependent on the data and can ensure that the present model is not excessively disturbed by incorrect information. Simulation and analysis of The Cancer Genome Atlas glioblastoma multiforme data demonstrate the practical applicability of the proposed method.
Availability and implementation: R code implementing the proposed method is available at https://github.com/mengyunwu2020/PECM. The data that support the findings in this paper are openly available in TCGA (The Cancer Genome Atlas) at https://portal.gdc.cancer.gov/.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Figures
References
-
- Arias-Castro E., Pu X. (2017) A simple approach to sparse clustering. Comput. Stat. Data Anal., 105, 217–228.
-
- Bouveyron C., Brunet-Saumard C. (2014) Model-based clustering of high-dimensional data: a review. Comput. Stat. Data Anal., 71, 52–78.
-
- Chang X. et al. (2018) Sparse k-means with penalty for high-dimensional data clustering. Stat. Sin., 28, 1265–1284.
-
- Cowen L. et al. (2017) Network propagation: a universal amplifier of genetic associations. Nat. Rev. Genet., 18, 551–562. - PubMed
