Discovery of meaningful associations in genomic data using partial correlation coefficients
- PMID: 15284096
- DOI: 10.1093/bioinformatics/bth445
Discovery of meaningful associations in genomic data using partial correlation coefficients
Abstract
Motivation: A major challenge of systems biology is to infer biochemical interactions from large-scale observations, such as transcriptomics, proteomics and metabolomics. We propose to use a partial correlation analysis to construct approximate Undirected Dependency Graphs from such large-scale biochemical data. This approach enables a distinction between direct and indirect interactions of biochemical compounds, thereby inferring the underlying network topology.
Results: The method is first thoroughly evaluated with a large set of simulated data. Results indicate that the approach has good statistical power and a low False Discovery Rate even in the presence of noise in the data. We then applied the method to an existing data set of yeast gene expression. Several small gene networks were inferred and found to contain genes known to be collectively involved in particular biochemical processes. In some of these networks there are also uncharacterized ORFs present, which lead to hypotheses about their functions.
Availability: Programs running in MS-Windows and Linux for applying zeroth, first, second and third order partial correlation analysis can be downloaded at: http://mendes.vbi.vt.edu/tiki-index.php?page=Software.
Supplementary information: Supplementary information can be found at: URL to be decided.
Similar articles
-
Building and analysing genome-wide gene disruption networks.Bioinformatics. 2002;18 Suppl 2:S202-10. doi: 10.1093/bioinformatics/18.suppl_2.s202. Bioinformatics. 2002. PMID: 12386004
-
Modularized learning of genetic interaction networks from biological annotations and mRNA expression data.Bioinformatics. 2005 Jun 1;21(11):2739-47. doi: 10.1093/bioinformatics/bti406. Epub 2005 Mar 29. Bioinformatics. 2005. PMID: 15797909
-
Inferring genetic regulatory logic from expression data.Bioinformatics. 2005 Jun 1;21(11):2706-13. doi: 10.1093/bioinformatics/bti388. Epub 2005 Mar 22. Bioinformatics. 2005. PMID: 15784747
-
Exploring genetic interactions and networks with yeast.Nat Rev Genet. 2007 Jun;8(6):437-49. doi: 10.1038/nrg2085. Nat Rev Genet. 2007. PMID: 17510664 Review.
-
Inferring network interactions within a cell.Brief Bioinform. 2005 Dec;6(4):380-9. doi: 10.1093/bib/6.4.380. Brief Bioinform. 2005. PMID: 16420736 Review.
Cited by
-
Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks.Nucleic Acids Res. 2015 Mar 11;43(5):e31. doi: 10.1093/nar/gku1315. Epub 2014 Dec 24. Nucleic Acids Res. 2015. PMID: 25539927 Free PMC article.
-
Foundational Principles for Large-Scale Inference: Illustrations Through Correlation Mining.Proc IEEE Inst Electr Electron Eng. 2016 Jan;104(1):93-110. doi: 10.1109/JPROC.2015.2494178. Epub 2015 Dec 21. Proc IEEE Inst Electr Electron Eng. 2016. PMID: 27087700 Free PMC article.
-
Robust network inference using response logic.Bioinformatics. 2019 Jul 15;35(14):i634-i642. doi: 10.1093/bioinformatics/btz326. Bioinformatics. 2019. PMID: 31510692 Free PMC article.
-
Statistical methods for the analysis of high-throughput metabolomics data.Comput Struct Biotechnol J. 2013 Mar 22;4:e201301009. doi: 10.5936/csbj.201301009. eCollection 2013. Comput Struct Biotechnol J. 2013. PMID: 24688690 Free PMC article. Review.
-
From genome-scale data to models of infectious disease: A Bayesian network-based strategy to drive model development.Math Biosci. 2015 Dec;270(Pt B):156-68. doi: 10.1016/j.mbs.2015.06.006. Epub 2015 Jun 17. Math Biosci. 2015. PMID: 26093035 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases