Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 15;16(12):e0260712.
doi: 10.1371/journal.pone.0260712. eCollection 2021.

Using semantics to scale up evidence-based chemical risk-assessments

Affiliations

Using semantics to scale up evidence-based chemical risk-assessments

Catherine Blake et al. PLoS One. .

Abstract

Background: The manual processes used for risk assessments are not scaling to the amount of data available. Although automated approaches appear promising, they must be transparent in a public policy setting.

Objective: Our goal is to create an automated approach that moves beyond retrieval to the extraction step of the information synthesis process, where evidence is characterized as supporting, refuting, or neutral with respect to a given outcome.

Methods: We combine knowledge resources and natural language processing to resolve coordinated ellipses and thus avoid surface level differences between concepts in an ontology and outcomes in an abstract. As with a systematic review, the search criterion, and inclusion and exclusion criterion are explicit.

Results: The system scales to 482K abstracts on 27 chemicals. Results for three endpoints that are critical for cancer risk assessments show that refuting evidence (where the outcome decreased) was higher for cell proliferation (45.9%), and general cell changes (37.7%) than for cell death (25.0%). Moreover, cell death was the only end point where supporting claims were the majority (61.3%). If the number of abstracts that measure an outcome was used as a proxy for association there would be a stronger association with cell proliferation than cell death (20/27 chemicals). However, if the amount of supporting evidence was used (where the outcome increased) the conclusion would change for 21/27 chemicals (20 from proliferation to death and 1 from death to proliferation).

Conclusions: We provide decision makers with a visual representation of supporting, neutral, and refuting evidence whilst maintaining the reproducibility and transparency needed for public policy. Our findings show that results from the retrieval step where the number of abstracts that measure an outcome are reported can be misleading if not accompanied with results from the extraction step where the directionality of the outcome is established.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Search process used to identify MEDLINE abstracts for each of the 27 chemicals (see S1 Appendix for the complete search strings).
Fig 2
Fig 2. Cell proliferation outcome [41].
Fig 3
Fig 3. Partial view of the unified medical language system showing the Medical Subject Heading (MeSH) showing that apoptosis is a type of Regulated Cell Death, which is in turn a type of Cell Death.
Image courtesy of the U.S. National Library of Medicine.
Fig 4
Fig 4. Unified Medical Language System (UMLS) showing context for the semantic type cell.
Image courtesy of the U.S. National Library of Medicine.
Fig 5
Fig 5. Claims concerning cell proliferation and cell death reported in [4] showing the entire collection (All), abstracts that were marked as relevant (Ann) and abstracts that were not manually annotated as relevant (NotAnn) from [4], where 5A shows abstract frequencies and 5B shows claim frequencies.
Fig 6
Fig 6
Cell proliferation claims from results or conclusions sentences showing [A] total abstracts and [B] total claims.
Fig 7
Fig 7
Cell death claims from results or conclusions sentences showing [A] total abstracts and [B] total claims.
Fig 8
Fig 8
General cell claims from results or conclusions sentences showing [A] total abstracts and [B] total claims.
Fig 9
Fig 9
The number of genistein abstracts that report cell proliferation is greater than the number that report cell death (A); however, more claims refute that cell proliferation increases, whereas more claims support an increase in cell death.

References

    1. Blake C, Pratt W. Collaborative Information Synthesis I: A Model of Information Behaviors of Scientists in Medicine and Public Health. Journal of the American Society for Information Science. 2006;57(13):1740–9.
    1. Alderson P, Green S, Higgins JPT, editors. Cochrane reviewers’ handbook 4.2.2 [Updated March 2004]. Chichester, UK: John Wiley & Sons, Ltd; 2004.
    1. National Research Council. Review of the Environmental Protection Agency’s Draft IRIS Assessment of Formaldehyde. Washington (DC) 2011.
    1. Korhonen A, Sun L, Silins I, Stenius U. The First Step in the Development of Text Mining Technology for Cancer Risk Assessment: Identifying and Organizing Scientific Evidence in Risk Assessment Literature. BMC Bioinformatics. 2009;10:303. doi: 10.1186/1471-2105-10-303 - DOI - PMC - PubMed
    1. Korhonen A, Seaghdha DO, Silins I, Sun L, Hogberg J, Stenius U. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS One. 2012;7(4):e33427. doi: 10.1371/journal.pone.0033427 ; PubMed Central PMCID: PMC3325219. - DOI - PMC - PubMed