Human gene coexpression landscape: confident network derived from tissue transcriptomic profiles
- PMID: 19081792
- PMCID: PMC2597745
- DOI: 10.1371/journal.pone.0003911
Human gene coexpression landscape: confident network derived from tissue transcriptomic profiles
Abstract
Background: Analysis of gene expression data using genome-wide microarrays is a technique often used in genomic studies to find coexpression patterns and locate groups of co-transcribed genes. However, most studies done at global "omic" scale are not focused on human samples and when they correspond to human very often include heterogeneous datasets, mixing normal with disease-altered samples. Moreover, the technical noise present in genome-wide expression microarrays is another well reported problem that many times is not addressed with robust statistical methods, and the estimation of errors in the data is not provided.
Methodology/principal findings: Human genome-wide expression data from a controlled set of normal-healthy tissues is used to build a confident human gene coexpression network avoiding both pathological and technical noise. To achieve this we describe a new method that combines several statistical and computational strategies: robust normalization and expression signal calculation; correlation coefficients obtained by parametric and non-parametric methods; random cross-validations; and estimation of the statistical accuracy and coverage of the data. All these methods provide a series of coexpression datasets where the level of error is measured and can be tuned. To define the errors, the rates of true positives are calculated by assignment to biological pathways. The results provide a confident human gene coexpression network that includes 3327 gene-nodes and 15841 coexpression-links and a comparative analysis shows good improvement over previously published datasets. Further functional analysis of a subset core network, validated by two independent methods, shows coherent biological modules that share common transcription factors. The network reveals a map of coexpression clusters organized in well defined functional constellations. Two major regions in this network correspond to genes involved in nuclear and mitochondrial metabolism and investigations on their functional assignment indicate that more than 60% are house-keeping and essential genes. The network displays new non-described gene associations and it allows the placement in a functional context of some unknown non-assigned genes based on their interactions with known gene families.
Conclusions/significance: The identification of stable and reliable human gene to gene coexpression networks is essential to unravel the interactions and functional correlations between human genes at an omic scale. This work contributes to this aim, and we are making available for the scientific community the validated human gene coexpression networks obtained, to allow further analyses on the network or on some specific gene associations. The data are available free online at http://bioinfow.dep.usal.es/coexpression/.
Conflict of interest statement
Figures







Similar articles
-
Evolutionary hallmarks of the human proteome: chasing the age and coregulation of protein-coding genes.BMC Genomics. 2016 Oct 25;17(Suppl 8):725. doi: 10.1186/s12864-016-3062-y. BMC Genomics. 2016. PMID: 27801289 Free PMC article.
-
Exploring tomato gene functions based on coexpression modules using graph clustering and differential coexpression approaches.Plant Physiol. 2012 Apr;158(4):1487-502. doi: 10.1104/pp.111.188367. Epub 2012 Feb 3. Plant Physiol. 2012. PMID: 22307966 Free PMC article.
-
Subspace differential coexpression analysis: problem definition and a general approach.Pac Symp Biocomput. 2010:145-56. Pac Symp Biocomput. 2010. PMID: 19908367
-
Retrospective analysis: reproducibility of interblastomere differences of mRNA expression in 2-cell stage mouse embryos is remarkably poor due to combinatorial mechanisms of blastomere diversification.Mol Hum Reprod. 2018 Jul 1;24(7):388-400. doi: 10.1093/molehr/gay021. Mol Hum Reprod. 2018. PMID: 29746690
-
Beyond modules and hubs: the potential of gene coexpression networks for investigating molecular mechanisms of complex brain disorders.Genes Brain Behav. 2014 Jan;13(1):13-24. doi: 10.1111/gbb.12106. Epub 2013 Dec 10. Genes Brain Behav. 2014. PMID: 24320616 Free PMC article. Review.
Cited by
-
Transcriptional Regulation of Aerobic Metabolism in Pichia pastoris Fermentation.PLoS One. 2016 Aug 18;11(8):e0161502. doi: 10.1371/journal.pone.0161502. eCollection 2016. PLoS One. 2016. PMID: 27537181 Free PMC article.
-
Genome-Wide Transcriptome Analysis of CD36 Overexpression in HepG2.2.15 Cells to Explore Its Regulatory Role in Metabolism and the Hepatitis B Virus Life Cycle.PLoS One. 2016 Oct 17;11(10):e0164787. doi: 10.1371/journal.pone.0164787. eCollection 2016. PLoS One. 2016. PMID: 27749922 Free PMC article.
-
An in silico analysis of dynamic changes in microRNA expression profiles in stepwise development of nasopharyngeal carcinoma.BMC Med Genomics. 2012 Jan 19;5:3. doi: 10.1186/1755-8794-5-3. BMC Med Genomics. 2012. PMID: 22260379 Free PMC article.
-
A novel network control model for identifying personalized driver genes in cancer.PLoS Comput Biol. 2019 Nov 25;15(11):e1007520. doi: 10.1371/journal.pcbi.1007520. eCollection 2019 Nov. PLoS Comput Biol. 2019. PMID: 31765387 Free PMC article.
-
Construction of a ceRNA coregulatory network and screening of hub biomarkers for salt-sensitive hypertension.J Cell Mol Med. 2020 Jul;24(13):7254-7265. doi: 10.1111/jcmm.15285. Epub 2020 May 15. J Cell Mol Med. 2020. PMID: 32410228 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources