Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec;128(12):125001.
doi: 10.1289/EHP6994. Epub 2020 Dec 24.

Knowledge Organization Systems for Systematic Chemical Assessments

Affiliations

Knowledge Organization Systems for Systematic Chemical Assessments

Paul Whaley et al. Environ Health Perspect. 2020 Dec.

Abstract

Background: Although the implementation of systematic review and evidence mapping methods stands to improve the transparency and accuracy of chemical assessments, they also accentuate the challenges that assessors face in ensuring they have located and included all the evidence that is relevant to evaluating the potential health effects an exposure might be causing. This challenge of information retrieval can be characterized in terms of "semantic" and "conceptual" factors that render chemical assessments vulnerable to the streetlight effect.

Objectives: This commentary presents how controlled vocabularies, thesauruses, and ontologies contribute to overcoming the streetlight effect in information retrieval, making up the key components of Knowledge Organization Systems (KOSs) that enable more systematic access to assessment-relevant information than is currently achievable. The concept of Adverse Outcome Pathways is used to illustrate what a general KOS for use in chemical assessment could look like.

Discussion: Ontologies are an underexploited element of effective knowledge organization in the environmental health sciences. Agreeing on and implementing ontologies in chemical assessment is a complex but tractable process with four fundamental steps. Successful implementation of ontologies would not only make currently fragmented information about health risks from chemical exposures vastly more accessible, it could ultimately enable computational methods for chemical assessment that can take advantage of the full richness of data described in natural language in primary studies. https://doi.org/10.1289/EHP6994.

PubMed Disclaimer

Figures

Figure 1 is a set of two flow charts titled evidence scoping and mapping and systematic review. Evidence scoping and mapping has seven steps. Step 1: Broad P E C O statement. Step 2: Define eligibility criteria. Step 3: Search for literature. Step 4: Screen search results for relevance. Step 5: Extraction of basic study characteristics. Step 6: Creation of literature inventory. Step 7: Optional: Study appraisal. Systematic review has eight steps. Step 1: Problem formulation. Step 2: Focused P E C O statement. Step 3: Define eligibility criteria. Step 4: Search for literature. Step 5: Screen search results for relevance. Step 6: Data extraction. Step 7: Study appraisal. Step 8: Evidence synthesis and integration. Creation of literature inventory and Optional: Study appraisal leads to Problem formulation.
Figure 1.
The relationship between the processes involved in systematically mapping and systematically reviewing evidence. The elements where what we call the “information retrieval challenge” comes into play are highlighted in bold and yellow. Comprehensive evidence maps, if they represent complete inventories of the literature, should ultimately obviate the need for additional literature searches in systematic reviews conducted in response to the findings of a systematic evidence mapping exercise.
Figure 2 is a flow chart having five steps. Step 1: Binding to DNA-topo 2 cleavage complex. Step 2: Binding to DNA-topo 2 cleavage complex with stalls leads to replication forks. Step 3: Oxidative D N A damage with obstructs lead to D N A repair. Step 4: D N A repair prevents Chromosome aberrations. Step 5: Chromosome aberrations increases Cellular proliferation. Step 6: Cellular proliferation causes Cancer (lung, leukemia).
Figure 2.
Illustration of how lack of knowledge of relations between concepts relevant to a research topic can result in evidence of potential importance to a given question being overlooked. In this example, awareness that DNA repair is obstructed by oxidative DNA damage allows lung cancer and leukemia to be connected to stressors that cause oxidative DNA damage to be incorporated into a cancer assessment. However, lack of awareness that replication forks regulate DNA repair may result in studies of stressors that stall replication forks by binding to cleavage complexes being excluded from cancer assessments.
Figure 3 is a chart displaying the following information: Controlled vocabulary terms for “polycyclic aromatic hydrocarbons” in the Medical Subject Headings (MeSH) index (N C B I 2020). Polycyclic Aromatic Hydrocarbons. Aromatic hydrocarbons that contain extended fused-ring structures. Year introduced: 2017(1996). Tree Number(s): D 02 dot 455 dot 426 dot 559 dot 847, D 04 dot 615 MeSH Unique I D: D 011084. Entry Terms: Aromatic Hydrocarbons, Polycyclic; Hydrocarbons, Polycyclic Aromatic; Polynuclear Aromatic Hydrocarbons; Aromatic Hydrocarbons, Polynuclear; Hydrocarbons, Polynuclear Aromatic; Polycyclic Hydrocarbons, Aromatic; Aromatic Polycyclic Hydrocarbons; and Hydrocarbons, Aromatic Polycyclic.
Figure 3.
The MeSH CV entry for “polycyclic aromatic hydrocarbons,” 21 July 2020.
Figure 4 is a chart displaying the following information: Thesaurus entries for “polycyclic aromatic hydrocarbons” in the Medical Subject Headings (MeSH) index (N C B I 2020). All MeSH categories includes chemicals and drugs category in organic chemicals. Hydrocarbons includes hydrocarbons, cyclic in hydrocarbons, aromatic. Polycyclic Aromatic Hydrocarbons includes Anthracenes, Azulenes, Benz(a)Anthracenes, Benzocycloheptenes, Fluorenes, Indenes, Naphthacenes, Naphthalenes, Phenalenes, Phenanthrenes, and Pyrenes.
Figure 4.
The MeSH thesaurus entries for “polycyclic aromatic hydrocarbons,” 21 July 2020. For brevity, only first-level entries are shown.
Figure 5 is a flow chart having five steps. Step 1: Exposure causes Molecular initiating event. Step 2: Molecular initiating event causes Key event 1 which was investigated by Study A and Study B. Step 3: Key event 1 causes Key event 2 which was investigated by Study C and Study D. Step 4: Key event 2 causes Key event 3 which was investigated by Study D. Step 5: Key event 3 causes Adverse Outcome.
Figure 5.
The elements of an Adverse Outcome Pathway, whereby an exposure causes a Molecular Initiating Event, initiating a biological sequence of causally related Key Events that result in a final Adverse Outcome being manifest. Experimental research can target how a challenge might affect a Key Event (Studies A, B, and C) or how one Key Event might cause another Key Event in a Key Event Relationship (Study D). Arranging biological events, exposures, and the evidence around them in these sorts of AOP chains can be very valuable for integrating mechanistic evidence into chemical assessments but requires knowledge organization systems that reflect the complexity and heterogeneity of the relationships and event types.
Figure 6 is a matrix plotting chemical entities of biological interest, protein ontology, gene ontology, cell ontology, uber anatomy ontology, mammalian phenotype ontology, mondo disease ontology, population and community ontology, environment exposure ontology, bioassay ontology, experimental factor ontology, Systematized Nomenclature of Medicine clinical terms, and children’s health exposure analysis resource (y-axis) across adverse outcome pathway level, including exposure, molecular initiating event, cellular event, tissue event, organ event, individual adverse outcome, and population adverse outcome; biological information ontologies; and measurement information ontologies (x-axis).
Figure 6.
Existing biological ontologies can be used to define key events in computable terms and thereby make AOP information more interoperable with other toxicological data sources. The same can be done when describing the assays and biomarkers used to measure the key events. Note: BAO, BioAssay Ontology; CHEAR, Children’s Health Exposure Analysis Resource; CHEBI, Chemical Entities of Biological Interest; CL: Cell Ontology; ECTO, Environment Exposure Ontology; EFO, Experimental Factor Ontology; GO, Gene Ontology; MonDO, Mondo Disease Ontology; MP, Mammalian Phenotype Ontology; PCO, Population and Community Ontology; PRO, Protein Ontology; SNOMED CT, SNOMED Clinical Terms; UBERON, Uber Anatomy Ontology.
Figure 7A is a flow chart having four steps. Step 1: Input: Slight increases in thyroid stimulating hormone with an icon of paper. Step 2: Parsed text: Document: Feng and others, 2017, Section: abstract, paragraph: 1, sentence: 10, terms: “slight,” “increases,” “in,” “thyroid,” “stimulating,” and “hormone.” Step 3: Matched class: Class: thyrotropin measurement odd ratios thyroid stimulating hormone, Ontology: National Cancer Institute Thesaurus, Type: Direct, Context: Document: Feng and other, 2017, Section: abstract, Paragraph: 1, Sentence: 10, and Term: “thyroid stimulating hormone”. Step 4: Documents semantically indexed. An arrow pointing towards right at Documents semantically indexed with a text that reads, Annotator. Figure 7B is a flow chart having two steps. Step 1: National Cancer Institute Thesaurus Ontology: activity, clinical or research activity, intervation or procedure, laboratory procedure, chemistry test, endocrine text, hormone measurement, and thyrotropin measurement; Gene Ontology: Gene product, protein or riboprotein complex, and thyroid stimulating hormone; and Universal Medical Language System: Concept: thyroid stimulating hormone measurement, C U I: C0200230, Semantic type: laboratory procedure, and T U I: T059. Step 2: Step 1 with terminology mapping leads to Documents semantically indexed.
Figure 7.
The workflow for matching natural language strings in research reports to a hierarchy of concepts in an ontology. Natural language information is extracted from included studies (e.g., phrases such as “increase in thyroid stimulating hormone”) into an evidence inventory (A). The terms “increase,” “thyroid,” “stimulating,” and “hormone” are cleaned and mapped to ontological classes in preparation for integration with other data sets. The inventory can then be connected to other data models by mapping terminology between CVs (B). Done enough times, a large data inventory begins to accumulate.

References

    1. Arp R, Smith B, Spear AD. 2015. Building Ontologies with Basic Formal Ontology. Cambridge, MA: MIT Press; https://play.google.com/store/books/details?id=AUxQCgAAQBAJ [accessed 19 August 2020].
    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al.. 2000. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet 25(1):p 25–29, PMID: 10802651, 10.1038/75556. - DOI - PMC - PubMed
    1. Battaglia M, Atkinson MA. 2015. The streetlight effect in type 1 diabetes. Diabetes 64(4):p 1081–1090, PMID: 25805758, 10.2337/db14-1208. - DOI - PMC - PubMed
    1. Braga A, Papachristos A, Hureau D. 2012. Hot spots policing effects on crime. Campbell Systematic Rev 8(1):1–96, 10.4073/csr.2012.8. - DOI - PMC - PubMed
    1. Burgoon LD. 2017. The AOPOntology: a semantic artificial intelligence tool for predictive toxicology. Appl in Vitro Toxicol 3(3):278–281, 10.1089/aivt.2017.0012. - DOI

MeSH terms

Substances