Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion
- PMID: 25658751
- PMCID: PMC4449711
- DOI: 10.1089/cmb.2014.0158
Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion
Abstract
Epistatic miniarray profile (E-MAP) is a popular large-scale genetic interaction discovery platform. E-MAPs benefit from quantitative output, which makes it possible to detect subtle interactions with greater precision. However, due to the limits of biotechnology, E-MAP studies fail to measure genetic interactions for up to 40% of gene pairs in an assay. Missing measurements can be recovered by computational techniques for data imputation, in this way completing the interaction profiles and enabling downstream analysis algorithms that could otherwise be sensitive to missing data values. We introduce a new interaction data imputation method called network-guided matrix completion (NG-MC). The core part of NG-MC is low-rank probabilistic matrix completion that incorporates prior knowledge presented as a collection of gene networks. NG-MC assumes that interactions are transitive, such that latent gene interaction profiles inferred by NG-MC depend on the profiles of their direct neighbors in gene networks. As the NG-MC inference algorithm progresses, it propagates latent interaction profiles through each of the networks and updates gene network weights toward improved prediction. In a study with four different E-MAP data assays and considered protein-protein interaction and gene ontology similarity networks, NG-MC significantly surpassed existing alternative techniques. Inclusion of information from gene networks also allowed NG-MC to predict interactions for genes that were not included in original E-MAP assays, a task that could not be considered by current imputation approaches.
Keywords: data integration; epistatic miniarray profile; gene network; genetic interaction; matrix completion; missing value imputation.
Figures





Similar articles
-
Imputing missing values for genetic interaction data.Methods. 2014 Jun 1;67(3):269-77. doi: 10.1016/j.ymeth.2014.03.032. Epub 2014 Apr 6. Methods. 2014. PMID: 24718098
-
Missing value imputation for epistatic MAPs.BMC Bioinformatics. 2010 Apr 20;11:197. doi: 10.1186/1471-2105-11-197. BMC Bioinformatics. 2010. PMID: 20406472 Free PMC article.
-
Imputing and predicting quantitative genetic interactions in epistatic MAPs.Methods Mol Biol. 2011;781:353-61. doi: 10.1007/978-1-61779-276-2_17. Methods Mol Biol. 2011. PMID: 21877290 Free PMC article.
-
Biological Network Inference and analysis using SEBINI and CABIN.Methods Mol Biol. 2009;541:551-76. doi: 10.1007/978-1-59745-243-4_24. Methods Mol Biol. 2009. PMID: 19381531 Review.
-
Computational reconstruction of protein-protein interaction networks: algorithms and issues.Methods Mol Biol. 2009;541:89-100. doi: 10.1007/978-1-59745-243-4_5. Methods Mol Biol. 2009. PMID: 19381528 Review.
Cited by
-
Exploring genetic interaction manifolds constructed from rich single-cell phenotypes.Science. 2019 Aug 23;365(6455):786-793. doi: 10.1126/science.aax4438. Epub 2019 Aug 8. Science. 2019. PMID: 31395745 Free PMC article.
-
Application of network link prediction in drug discovery.BMC Bioinformatics. 2021 Apr 12;22(1):187. doi: 10.1186/s12859-021-04082-y. BMC Bioinformatics. 2021. PMID: 33845763 Free PMC article.
-
Prediction of Genetic Interactions Using Machine Learning and Network Properties.Front Bioeng Biotechnol. 2015 Oct 26;3:172. doi: 10.3389/fbioe.2015.00172. eCollection 2015. Front Bioeng Biotechnol. 2015. PMID: 26579514 Free PMC article. Review.
-
Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.Inf Fusion. 2019 Oct;50:71-91. doi: 10.1016/j.inffus.2018.09.012. Epub 2018 Sep 21. Inf Fusion. 2019. PMID: 30467459 Free PMC article.
References
-
- Cai J.-F., Candès E.J., and Shen Z. 2010. A singular value thresholding algorithm for matrix completion. SIAM J. Optimiz. 20, 1956–1982
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials