Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
- PMID: 30864315
- PMCID: PMC6417822
Abstract
The growing volume of scientific literature in biological and biomedical research makes continuous, reliable curation of newly discovered knowledge a challenge, and automatic biomedical text-mining has been one of the answers to this challenge. In this paper, we aim to further improve the reliability of biomedical text-mining by training the system to directly simulate human behaviors such as querying PubMed, selecting articles from the query results, and reading the selected articles for knowledge. We combine the efficiency of biomedical text-mining, the flexibility of deep reinforcement learning, and the massive amount of knowledge collected in UMLS into an integrative artificially intelligent reader that can automatically identify authentic articles and effectively acquire the knowledge they convey. We construct a system whose current primary task is to build a genetic association database between genes and complex human traits. Our contributions in this paper are three-fold: 1) We propose to improve the reliability of text-mining by building a system that can directly simulate the behavior of a researcher, and we develop corresponding methods, such as a Bi-directional LSTM for text mining and a Deep Q-Network for organizing behaviors. 2) We demonstrate the effectiveness of our system with an example of constructing a genetic association database. 3) We release our implementation as a generic framework that researchers in the community can use to conveniently construct other databases.
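To make the idea of "organizing behaviors" with reinforcement learning concrete, the following is a minimal sketch, not the authors' implementation. It swaps the Deep Q-Network for plain tabular Q-learning and replaces PubMed with a hypothetical four-state toy workflow. The state names, reward values, and the helpers `step`, `train`, and `greedy_policy` are all assumptions made for illustration; only the three behaviors (query, select, read) come from the abstract.

```python
import random

# The three behaviors named in the abstract.
ACTIONS = ["query", "select", "read"]
STATES = ["start", "queried", "selected", "done"]

# Hypothetical toy workflow (an assumption, not the paper's environment):
# an action only advances the state when taken at the right moment.
NEXT_STATE = {
    ("start", "query"): "queried",
    ("queried", "select"): "selected",
    ("selected", "read"): "done",
}

def step(state, action):
    """Return (next_state, reward); a wrong action leaves the state unchanged."""
    nxt = NEXT_STATE.get((state, action), state)
    reward = 1.0 if nxt == "done" else -0.1  # small cost per non-final step
    return nxt, reward

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (stand-in for a DQN)."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
    for _ in range(episodes):
        state = "start"
        for _ in range(20):  # cap the episode length
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            nxt, reward = step(state, action)
            # Terminal states contribute no future value.
            best_next = 0.0 if nxt == "done" else max(q[(nxt, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = nxt
            if state == "done":
                break
    return q

def greedy_policy(q):
    """Pick the highest-valued action in each non-terminal state."""
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in STATES if s != "done"}

if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this toy setting the learned greedy policy orders the behaviors as query, then select, then read. A Deep Q-Network would replace the lookup table `q` with a neural network that estimates action values from a richer state representation.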