Supervised graph contrastive learning for cancer subtype identification through multi-omics data integration
- PMID: 38404715
- PMCID: PMC10891026
- DOI: 10.1007/s13755-024-00274-x
Supervised graph contrastive learning for cancer subtype identification through multi-omics data integration
Abstract
Cancer is one of the most deadly diseases in the world. Accurate cancer subtype classification is critical for patient diagnosis, treatment, and prognosis. Ever-increasing multi-omics data describes the characteristics of the patients from different views and serves as complementary information to promote cancer subtype identification. However, omics data generally have different distributions and high dimensions. How to effectively integrate multiple omics data to classify cancer subtypes accurately is a challenge for researchers. This work proposes a method integrating multi-omics data based on supervised graph contrast learning (MCRGCN) to classify cancer subtypes. The method considers the unique feature distribution of each omics data and the interaction of different omics data features to improve the accuracy of cancer subtype classification. To achieve this, MCRGCN first constructs different sample networks based on the multi-omics data of the samples. Then, it puts the omics data and adjacency matrix of the sample into different residual graph convolution models to get multi-omics features of the samples, which are trained with a supervised comparison loss to maintain that the sample features of each omics should be as consistent as possible. Finally, we input the sample features combining multi-omics features into a classifier to obtain the cancer subtypes. We applied MCRGCN to the invasive breast carcinoma (BRCA) and glioblastoma multiforme (GBM) datasets, integrating gene expression, miRNA expression, and DNA methylation data. The results demonstrate that our model is superior to other methods in integrating multi-omics data. Moreover, the results of survival analysis experiments demonstrate that the cancer subtypes identified by our model have significant clinical features. Furthermore, our model can help to identify potential biomarkers and pathways associated with cancer subtypes.
Keywords: Cancer-subtype classification; Graph contrastive learning; Multi-omics integration.
© The Author(s), under exclusive licence to Springer Nature Switzerland AG 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Conflict of interest statement
Conflict of interestThe authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Similar articles
-
Molecular feature-based classification of retroperitoneal liposarcoma: a prospective cohort study.Elife. 2025 May 23;14:RP100887. doi: 10.7554/eLife.100887. Elife. 2025. PMID: 40407808 Free PMC article.
-
MGDMCL: A multi-omics integration framework based on masked graph dynamic learning and multi-granularity feature contrastive learning for biomedical classification.Comput Methods Programs Biomed. 2025 Aug 13;271:109024. doi: 10.1016/j.cmpb.2025.109024. Online ahead of print. Comput Methods Programs Biomed. 2025. PMID: 40834555
-
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21. Clin Orthop Relat Res. 2025. PMID: 38905450
-
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec. Autism Adulthood. 2024. PMID: 40018061 Review.
-
Factors that influence parents' and informal caregivers' views and practices regarding routine childhood vaccination: a qualitative evidence synthesis.Cochrane Database Syst Rev. 2021 Oct 27;10(10):CD013265. doi: 10.1002/14651858.CD013265.pub2. Cochrane Database Syst Rev. 2021. PMID: 34706066 Free PMC article.
Cited by
-
Using parenclitic networks on phaeochromocytoma and paraganglioma tumours provides novel insights on global DNA methylation.Sci Rep. 2024 Dec 2;14(1):29958. doi: 10.1038/s41598-024-81486-9. Sci Rep. 2024. PMID: 39622952 Free PMC article.
References
-
- Peng W, Chen T, Liu H, Dai W, Yu N, Lan W. Improving drug response prediction based on two-space graph convolution. Comput Biol Med. 2023;158:106859. - PubMed
-
- Song J, Peng W, Wang F. An entropy-based method for identifying mutual exclusive driver genes in cancer. IEEE/ACM Trans Comput Biol Bioinform. 2019;17:758–68. - PubMed
LinkOut - more resources
Full Text Sources