An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks
- PMID: 28403906
- PMCID: PMC5389000
- DOI: 10.1186/s12918-017-0420-6
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks
Abstract
Background: Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ).
Results: We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices.
Conclusions: The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.
Keywords: Assessment of better gene clusters on bulk tissue; Gene co-expression networks on brain; K-means applied to WGCNA.
Figures







Similar articles
-
K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks.Genes (Basel). 2021 Jan 12;12(1):87. doi: 10.3390/genes12010087. Genes (Basel). 2021. PMID: 33445666 Free PMC article.
-
WGCNA: an R package for weighted correlation network analysis.BMC Bioinformatics. 2008 Dec 29;9:559. doi: 10.1186/1471-2105-9-559. BMC Bioinformatics. 2008. PMID: 19114008 Free PMC article.
-
SGCP: a spectral self-learning method for clustering genes in co-expression networks.BMC Bioinformatics. 2024 Jul 2;25(1):230. doi: 10.1186/s12859-024-05848-w. BMC Bioinformatics. 2024. PMID: 38956463 Free PMC article.
-
Assessment of complementarity of WGCNA and NERI results for identification of modules associated to schizophrenia spectrum disorders.PLoS One. 2019 Jan 15;14(1):e0210431. doi: 10.1371/journal.pone.0210431. eCollection 2019. PLoS One. 2019. PMID: 30645614 Free PMC article.
-
[Weighted gene co-expression network analysis in biomedicine research].Sheng Wu Gong Cheng Xue Bao. 2017 Nov 25;33(11):1791-1801. doi: 10.13345/j.cjb.170006. Sheng Wu Gong Cheng Xue Bao. 2017. PMID: 29202516 Review. Chinese.
Cited by
-
Identification of hub genes associated with diabetic cardiomyopathy using integrated bioinformatics analysis.Sci Rep. 2024 Jul 3;14(1):15324. doi: 10.1038/s41598-024-65773-z. Sci Rep. 2024. PMID: 38961143 Free PMC article.
-
In Search of Biomarkers for Pathogenesis and Control of Leishmaniasis by Global Analyses of Leishmania-Infected Macrophages.Front Cell Infect Microbiol. 2018 Sep 19;8:326. doi: 10.3389/fcimb.2018.00326. eCollection 2018. Front Cell Infect Microbiol. 2018. PMID: 30283744 Free PMC article. Review.
-
Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study.JMIR Form Res. 2022 Dec 19;6(12):e42249. doi: 10.2196/42249. JMIR Form Res. 2022. PMID: 36534456 Free PMC article.
-
ChromGene: gene-based modeling of epigenomic data.Genome Biol. 2023 Sep 7;24(1):203. doi: 10.1186/s13059-023-03041-5. Genome Biol. 2023. PMID: 37679846 Free PMC article.
-
Developments in toxicogenomics: understanding and predicting compound-induced toxicity from gene expression data.Mol Omics. 2018 Aug 6;14(4):218-236. doi: 10.1039/c8mo00042e. Mol Omics. 2018. PMID: 29917034 Free PMC article. Review.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources