Joint Lp-Norm and L2,1-Norm Constrained Graph Laplacian PCA for Robust Tumor Sample Clustering and Gene Network Module Discovery
- PMID: 33708239
- PMCID: PMC7940841
- DOI: 10.3389/fgene.2021.621317
Joint Lp-Norm and L2,1-Norm Constrained Graph Laplacian PCA for Robust Tumor Sample Clustering and Gene Network Module Discovery
Abstract
The dimensionality reduction method accompanied by different norm constraints plays an important role in mining useful information from large-scale gene expression data. In this article, a novel method named Lp-norm and L2,1-norm constrained graph Laplacian principal component analysis (PL21GPCA) based on traditional principal component analysis (PCA) is proposed for robust tumor sample clustering and gene network module discovery. Three aspects are highlighted in the PL21GPCA method. First, to degrade the high sensitivity to outliers and noise, the non-convex proximal Lp-norm (0 < p < 1)constraint is applied on the loss function. Second, to enhance the sparsity of gene expression in cancer samples, the L2,1-norm constraint is used on one of the regularization terms. Third, to retain the geometric structure of the data, we introduce the graph Laplacian regularization item to the PL21GPCA optimization model. Extensive experiments on five gene expression datasets, including one benchmark dataset, two single-cancer datasets from The Cancer Genome Atlas (TCGA), and two integrated datasets of multiple cancers from TCGA, are performed to validate the effectiveness of our method. The experimental results demonstrate that the PL21GPCA method performs better than many other methods in terms of tumor sample clustering. Additionally, this method is used to discover the gene network modules for the purpose of finding key genes that may be associated with some cancers.
Keywords: L2, 1-norm; Lp-norm; gene network modules; graph regularization; principal component analysis; sparse constraint; tumor clustering.
Copyright © 2021 Kong, Song, Liu, Zheng, Yuan, Wang and Dai.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures





Similar articles
-
Principal Component Analysis Based on Graph Laplacian and Double Sparse Constraints for Feature Selection and Sample Clustering on Multi-View Data.Hum Hered. 2019;84(1):47-58. doi: 10.1159/000501653. Epub 2019 Aug 29. Hum Hered. 2019. PMID: 31466072
-
PCA Based on Graph Laplacian Regularization and P-Norm for Gene Selection and Clustering.IEEE Trans Nanobioscience. 2017 Jun;16(4):257-265. doi: 10.1109/TNB.2017.2690365. Epub 2017 Mar 31. IEEE Trans Nanobioscience. 2017. PMID: 28371780
-
Joint L2,p-norm and random walk graph constrained PCA for single-cell RNA-seq data.Comput Methods Biomech Biomed Engin. 2024 Jan-Mar;27(4):498-511. doi: 10.1080/10255842.2023.2188106. Epub 2023 Mar 13. Comput Methods Biomech Biomed Engin. 2024. PMID: 36912759
-
Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis.Front Genet. 2019 Nov 20;10:1054. doi: 10.3389/fgene.2019.01054. eCollection 2019. Front Genet. 2019. PMID: 31824556 Free PMC article.
-
PCA via joint graph Laplacian and sparse constraint: Identification of differentially expressed genes and sample clustering on gene expression data.BMC Bioinformatics. 2019 Dec 30;20(Suppl 22):716. doi: 10.1186/s12859-019-3229-z. BMC Bioinformatics. 2019. PMID: 31888433 Free PMC article.
Cited by
-
Gene regulatory network inference using mixed-norms regularized multivariate model with covariance selection.PLoS Comput Biol. 2023 Jul 31;19(7):e1010832. doi: 10.1371/journal.pcbi.1010832. eCollection 2023 Jul. PLoS Comput Biol. 2023. PMID: 37523414 Free PMC article.
References
-
- Belkin M., Niyogi P. (2002). Laplacian eigenmaps and spectral techniques for embedding and clustering. Adv. Neural Inf. Process. Syst. 14 585–591.
-
- Belkin M., Niyogi P. (2003). Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15 1373–1396. 10.1162/089976603321780317 - DOI
-
- Bertsekas D. P. (1982). Constrained Optimization and Lagrange Multiplier Methods. Cambridge, MA: Academic Press.
LinkOut - more resources
Full Text Sources
Other Literature Sources