Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017;1(1-2):19-25.
doi: 10.3233/DS-170001. Epub 2017 Dec 8.

Knowledge-based biomedical Data Science

Affiliations

Knowledge-based biomedical Data Science

Lawrence E Hunter. EPJ Data Sci. 2017.

Abstract

Computational manipulation of knowledge is an important, and often under-appreciated, aspect of biomedical Data Science. The first Data Science initiative from the US National Institutes of Health was entitled "Big Data to Knowledge (BD2K)." The main emphasis of the more than $200M allocated to that program has been on "Big Data;" the "Knowledge" component has largely been the implicit assumption that the work will lead to new biomedical knowledge. However, there is long-standing and highly productive work in computational knowledge representation and reasoning, and computational processing of knowledge has a role in the world of Data Science. Knowledge-based biomedical Data Science involves the design and implementation of computer systems that act as if they knew about biomedicine. There are many ways in which a computational approach might act as if it knew something: for example, it might be able to answer a natural language question about a biomedical topic, or pass an exam; it might be able to use existing biomedical knowledge to rank or evaluate hypotheses; it might explain or interpret data in light of prior knowledge, either in a Bayesian or other sort of framework. These are all examples of automated reasoning that act on computational representations of knowledge. After a brief survey of existing approaches to knowledge-based data science, this position paper argues that such research is ripe for expansion, and expanded application.

Keywords: Ontology; explanation; inference; knowledge representation; machine learning; reasoning; text mining.

PubMed Disclaimer

References

    1. Athenikos S, Han H. Biomedical question answering: A survey. Comput Methods Programs Biomed. 2010;99:1–24. doi: 10.1016/j.cmpb.2009.10.003. - DOI - PubMed
    1. Bandrowski A, Brinkman R, Brochhausen M, Brush M, Bug B, Chibucos M, Clancy K, Courtot M, Derom D, Dumontier M, Fan L, Fostel J, Fragoso G, Gibson F, Gonzalez-Beltran A, Haendel M, He Y, Heiskanen M, Hernandez-Boussard T, Jensen M, Lin Y, Lister A, Lord P, Malone J, Manduchi E, McGee M, Morrison N, Overton J, Parkinson H, Peters B, Rocca-Serra P, Ruttenberg A, Sansone S, Scheuermann R, Schober D, Smith B, Soldatova L, Stoeckert CJ, Taylor C, Torniai C, Turner J, Vita R, Whetzel P, Zheng J. The ontology for biomedical investigations. Plos One. 2016;11:0154556. doi: 10.1371/journal.pone.0154556. - DOI - PMC - PubMed
    1. Barros M, Couto F. Knowledge representation and management: A linked data perspective. Yearb Med Inform. 2016;10:178–183. doi: 10.15265/IY-2016-022. - DOI - PMC - PubMed
    1. Bauer M, Berleant D. Usability survey of biomedical question answering systems. Hum Genomics. 2012;6:17. doi: 10.1186/1479-7364-6-17. - DOI - PMC - PubMed
    1. Baumgartner WJ, Cohen K, Fox L, Acquaah-Mensah G, Hunter L. Manual curation is not sufficient for annotation of genomic databases. Bioinformatics. 2007;23:41–48. doi: 10.1093/bioinformatics/btm229. - DOI - PMC - PubMed

LinkOut - more resources