Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec;128(12):125002.
doi: 10.1289/EHP7215. Epub 2020 Dec 28.

Community Approaches for Integrating Environmental Exposures into Human Models of Disease

Affiliations

Community Approaches for Integrating Environmental Exposures into Human Models of Disease

Anne E Thessen et al. Environ Health Perspect. 2020 Dec.

Abstract

Background: A critical challenge in genomic medicine is identifying the genetic and environmental risk factors for disease. Currently, the available data links a majority of known coding human genes to phenotypes, but the environmental component of human disease is extremely underrepresented in these linked data sets. Without environmental exposure information, our ability to realize precision health is limited, even with the promise of modern genomics. Achieving integration of gene, phenotype, and environment will require extensive translation of data into a standard, computable form and the extension of the existing gene/phenotype data model. The data standards and models needed to achieve this integration do not currently exist.

Objectives: Our objective is to foster development of community-driven data-reporting standards and a computational model that will facilitate the inclusion of exposure data in computational analysis of human disease. To this end, we present a preliminary semantic data model and use cases and competency questions for further community-driven model development and refinement.

Discussion: There is a real desire by the exposure science, epidemiology, and toxicology communities to use informatics approaches to improve their research workflow, gain new insights, and increase data reuse. Critical to success is the development of a community-driven data model for describing environmental exposures and linking them to existing models of human disease. https://doi.org/10.1289/EHP7215.

PubMed Disclaimer

Figures

Figure 1 is a tabular representation having six columns, namely, Example competency question, Input, Output, Priority concepts, Ontologies, and Model data type, and two rows. Row 1: What fish species have reduced fecundity following sublethal pyrethroid pesticide exposure; receptor category, outcome, and stressor; receptor species; and in the column Priority concepts: Chemical, Taxon, and Phenotype; in the column Ontologies: Chemicals of Biological Interest or Environmental Conditions, Treatments, and Exposures Ontology; National Center for Biotechnology Information; and Unified Phenotype Ontology; in the column Model data type: Stressor, Receptor, and Phenotype/outcome. Row 2: Is iron safe for someone diagnosed with hemochromatosis (heterozygous or homozygous)?; Stressor genotype; Outcome; and in the column Priority concepts: Chemical and Disease; in the column Ontologies: Chemicals of Biological Interest or Environmental Conditions, Treatments, and Exposures Ontology and Mondo Disease Ontology; and in the column Model data type: Stressor and Disease/outcome.
Figure 1.
Example competency questions. Competency questions were developed by workshop participants (see the section “Competency Questions” in the Supplemental Material) to help guide the data model and expose deficiencies in ontological coverage. The input indicates the type of information provided by the hypothetical user in the example question. The output indicates the type of information the hypothetical user is requesting. The semantic types of the inputs and outputs are listed as priority concepts that can be represented by the listed ontologies, such as chemical (dashed line), taxon (white), phenotype or disease (hashed lines), and genotype (gray). These semantic types and are represented in the semantic model (Figure 2) as the model data type. Note: CheBI, Chemicals of Biological Interest; ECTO, Environmental Conditions, Treatments, and Exposures Ontology; GENO, Genotype Ontology; GO, Gene Ontology; Mondo, Mondo Disease Ontology; NCBI, National Center for Biotechnology Information; uPheno, Unified Phenotype Ontology.
Figure 2 is a flow diagram having three steps. Step 1: Environmental Conditions, Treatments, and Exposures Ontology, including stressor, medium, and route; disease; gene; and phenotype are interconnected with each other. Step 2: Environmental Conditions, Treatments, and Exposures Ontology, including stressor, medium, and route is connected to receptor. Step 3: Receptor is connected to gene.
Figure 2.
Semantic model for representing environmental exposures. Exposure events (open box, represented by terms in the ECTO) can be incorporated into the Monarch knowledge graph via direct links to the diseases and phenotypes resulting from the exposure, the genes being affected, and the organism(s) or organism parts that are being exposed. Note: ECTO, Environmental Conditions, Treatments, and Exposures Ontology.
Figure 3A is a flow chart with one step. Stressor, including medium and route; Exposure event, including duration, frequency, concentration, attribution or source or evidence; Receptor, including life stage and age lead to outcome, including severity and frequency. Figure 3B is a flowchart having three steps. Step 1: Axioms, including stressor, medium, and route. Step 2: Exposure event. Step 3: Annotations, including duration, frequency, concentration, and attribution or source or evidence.
Figure 3.
ExO and ECTO exposure data model. (A) The ExO (Mattingly et al. 2012) model combines stressors and receptors in the representation of the exposure event, which is linked to an outcome. Each element (dark gray) has associated metadata (light gray). (B) Exposure events in ECTO are precomposed with the stressor, medium, and route (when known) contained in the axiomatic definition: “exposure event” and (“has exposure stimulus” some “stressor”) and (“has exposure medium” some “medium”) and (“has exposure route” some “route”). Additional metadata about the exposure event (dark gray) are added as annotations on the event (light gray). Information about the receptor and the outcome are linked to the exposure event in the larger knowledge graph as shown in Figure 2. Note: ECTO, Environmental Conditions, Treatments, and Exposures Ontology; ExO, Exposure Ontology.

References

    1. Aaseth J, Wallace DR, Vejrup K, Alexander J. 2020. Methylmercury and developmental neurotoxicity: a global concern. Curr Opin Toxicol 19:80–87, 10.1016/j.cotox.2020.01.005. - DOI
    1. Abt E, Rodricks JV, Levy JI, Zeise L, Burke TA. 2010. Science and decisions: advancing risk assessment. Risk Anal 30(7):1028–1036, PMID: 20497395, 10.1111/j.1539-6924.2010.01426.x. - DOI - PubMed
    1. Amberger JS, Bocchini CA, Schiettecatte F, Scott AF, Hamosh A. 2015. OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res 43(database issue):D789–D798, PMID: 25428349, 10.1093/nar/gku1205. - DOI - PMC - PubMed
    1. Bakal G, Talari P, Kakani EV, Kavuluru R. 2018. Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations. J Biomed Inform 82:189–199, PMID: 29763706, 10.1016/j.jbi.2018.05.003. - DOI - PMC - PubMed
    1. Boyles RR, Thessen AE, Waldrop A, Haendel MA. 2019. Ontology-based data integration for advancing toxicological knowledge. Curr Opin Toxicol 16:67–74, 10.1016/j.cotox.2019.05.005. - DOI

Publication types

Substances

LinkOut - more resources