SILGGM: An extensive R package for efficient statistical inference in large-scale gene networks
- PMID: 30102702
- PMCID: PMC6107288
- DOI: 10.1371/journal.pcbi.1006369
SILGGM: An extensive R package for efficient statistical inference in large-scale gene networks
Abstract
Gene co-expression network analysis is extremely useful in interpreting a complex biological process. The recent droplet-based single-cell technology is able to generate much larger gene expression data routinely with thousands of samples and tens of thousands of genes. To analyze such a large-scale gene-gene network, remarkable progress has been made in rigorous statistical inference of high-dimensional Gaussian graphical model (GGM). These approaches provide a formal confidence interval or a p-value rather than only a single point estimator for conditional dependence of a gene pair and are more desirable for identifying reliable gene networks. To promote their widespread use, we herein introduce an extensive and efficient R package named SILGGM (Statistical Inference of Large-scale Gaussian Graphical Model) that includes four main approaches in statistical inference of high-dimensional GGM. Unlike the existing tools, SILGGM provides statistically efficient inference on both individual gene pair and whole-scale gene pairs. It has a novel and consistent false discovery rate (FDR) procedure in all four methodologies. Based on the user-friendly design, it provides outputs compatible with multiple platforms for interactive network visualization. Furthermore, comparisons in simulation illustrate that SILGGM can accelerate the existing MATLAB implementation to several orders of magnitudes and further improve the speed of the already very efficient R package FastGGM. Testing results from the simulated data confirm the validity of all the approaches in SILGGM even in a very large-scale setting with the number of variables or genes to a ten thousand level. We have also applied our package to a novel single-cell RNA-seq data set with pan T cells. The results show that the approaches in SILGGM significantly outperform the conventional ones in a biological sense. The package is freely available via CRAN at https://cran.r-project.org/package=SILGGM.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures




Similar articles
-
FastGGM: An Efficient Algorithm for the Inference of Gaussian Graphical Model in Biological Networks.PLoS Comput Biol. 2016 Feb 12;12(2):e1004755. doi: 10.1371/journal.pcbi.1004755. eCollection 2016 Feb. PLoS Comput Biol. 2016. PMID: 26872036 Free PMC article.
-
A Statistical Test for Differential Network Analysis Based on Inference of Gaussian Graphical Model.Sci Rep. 2019 Jul 26;9(1):10863. doi: 10.1038/s41598-019-47362-7. Sci Rep. 2019. PMID: 31350445 Free PMC article.
-
Nonlinear Network Reconstruction from Gene Expression Data Using Marginal Dependencies Measured by DCOL.PLoS One. 2016 Jul 5;11(7):e0158247. doi: 10.1371/journal.pone.0158247. eCollection 2016. PLoS One. 2016. PMID: 27380516 Free PMC article.
-
Biological Network Inference and analysis using SEBINI and CABIN.Methods Mol Biol. 2009;541:551-76. doi: 10.1007/978-1-59745-243-4_24. Methods Mol Biol. 2009. PMID: 19381531 Review.
-
Integrated inference and analysis of regulatory networks from multi-level measurements.Methods Cell Biol. 2012;110:19-56. doi: 10.1016/B978-0-12-388403-9.00002-3. Methods Cell Biol. 2012. PMID: 22482944 Free PMC article. Review.
Cited by
-
Gaussian graphical models with applications to omics analyses.Stat Med. 2022 Nov 10;41(25):5150-5187. doi: 10.1002/sim.9546. Epub 2022 Sep 26. Stat Med. 2022. PMID: 36161666 Free PMC article.
-
Network Analysis of Gene Transcriptions of Arabidopsis thaliana in Spaceflight Microgravity.Genes (Basel). 2021 Feb 25;12(3):337. doi: 10.3390/genes12030337. Genes (Basel). 2021. PMID: 33668919 Free PMC article.
-
Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data.G3 (Bethesda). 2023 Mar 9;13(3):jkad004. doi: 10.1093/g3journal/jkad004. G3 (Bethesda). 2023. PMID: 36626328 Free PMC article.
-
NetCoMi: network construction and comparison for microbiome data in R.Brief Bioinform. 2021 Jul 20;22(4):bbaa290. doi: 10.1093/bib/bbaa290. Brief Bioinform. 2021. PMID: 33264391 Free PMC article.
-
Information enhanced model selection for Gaussian graphical model with application to metabolomic data.Biostatistics. 2022 Jul 18;23(3):926-948. doi: 10.1093/biostatistics/kxab006. Biostatistics. 2022. PMID: 33720330 Free PMC article.
References
-
- Weirauch MT. Gene coexpression networks for the analysis of DNA microarray data In: Dehmer M, Emmert-Streib F, Graber A, Salvador A, editors. Applied Statistics for Network Biology: Methods in Systems Biology. Wiley-VCH Verlag GmbH & Co. KGaA; 2011. pp. 215–250.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources