Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul:3:23-41.
doi: 10.1146/annurev-biodatasci-010820-091627. Epub 2020 Apr 7.

Knowledge-Based Biomedical Data Science

Affiliations

Knowledge-Based Biomedical Data Science

Tiffany J Callahan et al. Annu Rev Biomed Data Sci. 2020 Jul.

Abstract

Knowledge-based biomedical data science involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey recent progress in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as progress on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing to construct knowledge graphs, and the expansion of novel knowledge-based approaches to clinical and biological domains.

Keywords: Semantic Web; knowledge discovery; knowledge graph; knowledge graph embeddings; natural language processing; ontology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An example of a knowledge representation for building a biomedical knowledge graph. Boxes represent different types of data, which are drawn from ontologies and other sources of linked open data. Boxes are connected by directed edges and represent semantically and biologically meaningful relationships.
Figure 2
Figure 2
Paper selection process outline. Combining the results from PubMed and Google Scholar queries, we narrowed down the list of papers using a two-step process. First, we performed a quick review to reduce the initial number of papers. Then, we closely inspected each paper, which helped us to arrive at the final set of 83 papers.

References

    1. Hunter LE. 2017. Knowledge-based biomedical data science. Data Sci. 1(1-2):19–25 - PMC - PubMed
    1. Davis R, Shrobe H, Szolovits P. 1993. What is a knowledge representation? AIMag. 14(1):17–33
    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, et al. 2000. Gene Ontology: tool for the unifi-cation of biology. Nat. Genet 25(1):25–29 - PMC - PubMed
    1. Berners-Lee T, Fielding RT, Masinter L. 2005. Uniform resource identifier (URI): generic syntax. Unpub-lished Memo., Internet Eng. Task Force, Fremont, CA. https://tools.ietf.org/html/rfc3986
    1. SPARQL (SPARQL Protoc. RDF Query Lang.) Work. Group. 2013. SPARQL 1.1 protocol. Web Resour., World Wide Web Consort. https://www.w3.org/TR/sparql11-protocol/

LinkOut - more resources