Dynamic Meta-data Network Sparse PCA for Cancer Subtype Biomarker Screening
- PMID: 35711917
- PMCID: PMC9197542
- DOI: 10.3389/fgene.2022.869906
Abstract
Previous research shows that each type of cancer can be divided into multiple subtypes, which is one of the key reasons cancer is difficult to cure. Finding new target genes for cancer subtypes is therefore of great significance for developing new anti-cancer drugs and personalized treatments. Because cancer gene expression data sets are usually high-dimensional, noisy, and contain information on multiple potential subtypes, many sparse principal component analysis (sparse PCA) methods have been used to identify cancer subtype biomarkers and subtype clusters. However, existing sparse PCA methods do not use known cancer subtype information as prior knowledge, and their results are strongly affected by sample quality. We therefore propose the Dynamic Metadata Edge-group Sparse PCA (DM-ESPCA) model, which draws on the idea of meta-learning to address the sample quality problem and uses known cancer subtype information as prior knowledge to capture gene modules with better biological interpretations. Experimental results on three biological data sets show that the DM-ESPCA model can find potential target gene probes that carry richer biological information about the cancer subtypes. Moreover, clustering and machine learning classification models based on the target genes screened by the DM-ESPCA model improve accuracy by up to 22-23% compared with existing sparse PCA methods. We also show that the DM-ESPCA model outperforms four classic supervised machine learning models in the task of classifying cancer subtypes.
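To make the general idea of sparse PCA for gene screening concrete, the following is a minimal sketch, not the DM-ESPCA model itself. It uses a generic truncated power iteration (a swapped-in, simpler technique) on synthetic data, with no gene network, no subtype prior, and no sample weighting. The function name `sparse_pc`, the toy data dimensions, and the planted genes 0-4 are all assumptions made for illustration.

```python
import numpy as np


def sparse_pc(X, k, n_iter=100):
    """Leading sparse principal component with at most k non-zero loadings.

    Generic truncated power iteration; NOT the DM-ESPCA algorithm.
    X is a (samples x genes) expression matrix.
    """
    Xc = X - X.mean(axis=0)                  # center each gene
    cov = Xc.T @ Xc / (X.shape[0] - 1)       # gene-by-gene covariance
    # Initialize from the ordinary (dense) leading eigenvector.
    _, vecs = np.linalg.eigh(cov)
    v = vecs[:, -1]
    for _ in range(n_iter):
        w = cov @ v                          # power step
        keep = np.argsort(np.abs(w))[-k:]    # indices of the k largest loadings
        v = np.zeros_like(w)
        v[keep] = w[keep]                    # hard-threshold the rest to zero
        v /= np.linalg.norm(v)               # renormalize to unit length
    return v


# Hypothetical toy data: 60 samples, 50 genes, two subtypes of 30 samples each.
rng = np.random.default_rng(0)
X = rng.standard_normal((60, 50))
labels = np.repeat([0, 1], 30)
X[labels == 1, :5] += 4.0                    # genes 0-4 are the planted "biomarkers"

v = sparse_pc(X, k=5)
selected = np.flatnonzero(v)                 # genes with non-zero loading
scores = (X - X.mean(axis=0)) @ v            # one score per sample
predicted = (scores > 0).astype(int)         # split samples by sign of the score
# The sign of a principal component is arbitrary, so accept either labeling.
accuracy = max(np.mean(predicted == labels), np.mean(predicted != labels))
print("selected genes:", selected)
print("subtype split accuracy:", accuracy)
```

In this sketch, the non-zero loadings play the role of screened genes, and the sample scores give a crude two-cluster split. The model described in the abstract additionally uses known subtype information and addresses sample quality, neither of which is represented here.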
Keywords: Cancer subtype; DM-ESPCA model; biomarkers; dynamic network; meta-data; sparse PCA.
Copyright © 2022 Miao, Dong, Liu, Lo, Mei, Dang, Cai, Li, Yang, Xie and Liang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.