Extracting reaction networks from databases-opening Pandora's box
- PMID: 23946492
- PMCID: PMC4239801
- DOI: 10.1093/bib/bbt058
Extracting reaction networks from databases-opening Pandora's box
Abstract
Large quantities of information describing the mechanisms of biological pathways continue to be collected in publicly available databases. At the same time, experiments have increased in scale, and biologists increasingly use pathways defined in online databases to interpret the results of experiments and generate hypotheses. Emerging computational techniques that exploit the rich biological information captured in reaction systems require formal standardized descriptions of pathways to extract these reaction networks and avoid the alternative: time-consuming and largely manual literature-based network reconstruction. Here, we systematically evaluate the effects of commonly used knowledge representations on the seemingly simple task of extracting a reaction network describing signal transduction from a pathway database. We show that this process is in fact surprisingly difficult, and the pathway representations adopted by various knowledge bases have dramatic consequences for reaction network extraction, connectivity, capture of pathway crosstalk and in the modelling of cell-cell interactions. Researchers constructing computational models built from automatically extracted reaction networks must therefore consider the issues we outline in this review to maximize the value of existing pathway knowledge.
Keywords: databases; modelling; reaction networks; signal transduction.
© The Author 2013. Published by Oxford University Press.
Figures



Similar articles
-
Creating and analyzing pathway and protein interaction compendia for modelling signal transduction networks.BMC Syst Biol. 2012 May 1;6:29. doi: 10.1186/1752-0509-6-29. BMC Syst Biol. 2012. PMID: 22548703 Free PMC article.
-
Bioinformatics analyses for signal transduction networks.Sci China C Life Sci. 2008 Nov;51(11):994-1002. doi: 10.1007/s11427-008-0134-5. Epub 2008 Nov 7. Sci China C Life Sci. 2008. PMID: 18989642
-
Pathway databases and tools for their exploitation: benefits, current limitations and challenges.Mol Syst Biol. 2009;5:290. doi: 10.1038/msb.2009.47. Epub 2009 Jul 28. Mol Syst Biol. 2009. PMID: 19638971 Free PMC article.
-
Text-mining solutions for biomedical research: enabling integrative biology.Nat Rev Genet. 2012 Dec;13(12):829-39. doi: 10.1038/nrg3337. Epub 2012 Nov 14. Nat Rev Genet. 2012. PMID: 23150036 Review.
-
Network modeling of signal transduction: establishing the global view.Bioessays. 2008 Nov;30(11-12):1110-25. doi: 10.1002/bies.20834. Bioessays. 2008. PMID: 18937364 Review.
Cited by
-
A Computational Protocol for the Knowledge-Based Assessment and Capture of Pathologies.Methods Mol Biol. 2025;2868:265-284. doi: 10.1007/978-1-0716-4200-9_14. Methods Mol Biol. 2025. PMID: 39546235
-
Fixing molecular complexes in BioPAX standards to enrich interactions and detect redundancies using semantic web technologies.Bioinformatics. 2023 May 4;39(5):btad257. doi: 10.1093/bioinformatics/btad257. Bioinformatics. 2023. PMID: 37097895 Free PMC article.
-
Signaling hypergraphs.Trends Biotechnol. 2014 Jul;32(7):356-62. doi: 10.1016/j.tibtech.2014.04.007. Epub 2014 May 22. Trends Biotechnol. 2014. PMID: 24857424 Free PMC article.
-
The use of gene interaction networks to improve the identification of cancer driver genes.PeerJ. 2017 Jan 26;5:e2568. doi: 10.7717/peerj.2568. eCollection 2017. PeerJ. 2017. PMID: 28149674 Free PMC article.
-
Traceability, reproducibility and wiki-exploration for "à-la-carte" reconstructions of genome-scale metabolic models.PLoS Comput Biol. 2018 May 23;14(5):e1006146. doi: 10.1371/journal.pcbi.1006146. eCollection 2018 May. PLoS Comput Biol. 2018. PMID: 29791443 Free PMC article.
References
-
- Fraser CM, Gocayne JD, White O, et al. The minimal gene complement of Mycoplasma genitalium. Science. 1995;270(5235):397–403. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources