Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 15;35(8):1370-1382.
doi: 10.1021/acs.chemrestox.2c00074. Epub 2022 Jul 12.

Automating Predictive Toxicology Using ComptoxAI

Affiliations

Automating Predictive Toxicology Using ComptoxAI

Joseph D Romano et al. Chem Res Toxicol. .

Abstract

ComptoxAI is a new data infrastructure for computational and artificial intelligence research in predictive toxicology. Here, we describe and showcase ComptoxAI's graph-structured knowledge base in the context of three real-world use-cases, demonstrating that it can rapidly answer complex questions about toxicology that are infeasible using previous technologies and data resources. These use-cases each demonstrate a tool for information retrieval from the knowledge base being used to solve a specific task: The "shortest path" module is used to identify mechanistic links between perfluorooctanoic acid (PFOA) exposure and nonalcoholic fatty liver disease; the "expand network" module identifies communities that are linked to dioxin toxicity; and the quantitative structure-activity relationship (QSAR) dataset generator predicts pregnane X receptor agonism in a set of 4,021 pesticide ingredients. The contents of ComptoxAI's source data are rigorously aggregated from a diverse array of public third-party databases, and ComptoxAI is designed as a free, public, and open-source toolkit to enable diverse classes of users including biomedical researchers, public health and regulatory officials, and the general public to predict toxicology of unknowns and modes of action.

PubMed Disclaimer

Conflict of interest statement

The authors declare the following competing financial interest(s): The authors declare no competing financial interests except T.M.P. who is a member of the Expert Panel, Research Institute for Fragrance Materials.

Figures

Figure 1.
Figure 1.
ComptoxAI homepage, showing navbar links to major features (data browsing tools, installation guide, documentation, etc.), a hierarchy diagram of the major entity types included in the knowledge base, and additional introductory text. The website is available at https://comptox.ai.
Figure 2.
Figure 2.
Interactive data portal for ComptoxAI, demonstrating how individual nodes (entities) in the knowledge base can be queried. The gene shown is found by searching for the gene symbol CYP2E1, and details about the gene are represented in the “Node details” panel. The identified gene can be sent to other data browsing utilities using the buttons at the bottom of the panel. Out of view are components for displaying relationships and network paths.
Figure 3.
Figure 3.
ComptoxAI’s Neo4j graph database browser. Users can manually expand and manipulate subgraphs and can also use the powerful Cypher query language to submit database queries or call specific subroutines. In this interface, the left panel allows users to query a random subset of specific node labels (entity types) or relationship types, and the right portion of the network view panel lists the properties annotated to the currently selected node in the network.
Figure 4.
Figure 4.
Network diagram of shortest paths joining perfluorooctanoic acid (PFOA) with nonalcoholic fatty liver disease (NAFLD) in ComptoxAI. The network indicates 14 genes associated with NAFLD, 7 of which are downregulated and 8 of which are upregulated (1 is both) by PFOA. Gene nodes are arranged (via interactive drag-and-drop) to group downregulated (top) and upregulated (bottom) genes. Full-resolution image available on FigShare.
Figure 5.
Figure 5.
Adverse outcome pathway network generated by expanding a network around the node representing dioxin (2,3,7,8-tetrachlorodibenzo-p-dioxin), with network communities labeled in red. Communities roughly correspond to the following adverse outcomes: (A.) liver steatosis, (B.) impaired fertility and bile acid regulation, (C.) long-term risk of liver cancer, and (D.) other liver cancer, especially hepatocellular carcinoma. Dioxin (2,3,7,8-tetrachlorodibenzo-p-dioxin) plays a central role as a “hub” in the network. Node types are chemicals (red), AOPs (green), molecular initiating events (orange), and key events (purple). Full-resolution image available on Figshare.
Figure 6.
Figure 6.
Screenshot image of the “Build QSAR Dataset” module from the ComptoxAI website. Users specify an assay endpoint, (optional) list of related chemicals to filter by, whether to include a “discovery” dataset of chemicals with no known assay measurements for making novel predictions, and format of the dataset. The dataset is generated dynamically and returned within several seconds of submitting the query.

References

    1. Thomas RS; Bahadori T; Buckley TJ; Cowden J; Deisenroth C; Dionisio KL; Frithsen JB; Grulke CM; Gwinn MR; Harrill JA; Higuchi M; Houck KA; Hughes MF; Hunter ES; Isaacs KK; Judson RS; Knudsen TB; Lambert JC; Linnenbrink M; Martin TM; Newton SR; Padilla S; Patlewicz G; Paul-Friedman K; Phillips KA; Richard AM; Sams R; Shafer TJ; Setzer RW; Shah I; Simmons JE; Simmons SO; Singh A; Sobus JR; Strynar M; Swank A; Tornero-Valez R; Ulrich EM; Villeneuve DL; Wambaugh JF; Wetmore BA; Williams AJ The Next Generation Blueprint of Computational Toxicology at the U.S. Environmental Protection Agency. Toxicol. Sci 2019, 169, 317–332. - PMC - PubMed
    1. Luechtefeld T; Rowlands C; Hartung T Big-Data and Machine Learning to Revamp Computational Toxicology and Its Use in Risk Assessment. Toxicol. Res 2018, 7, 732–744. - PMC - PubMed
    1. Atzeni P; Antonellis VD Relational Database Theory; Benjamin/Cummings Publishing Company, 1993.
    1. Mortensen HM; Senn J; Levey T; Langley P; Williams AJ The 2021 Update of the EPA’s Adverse Outcome Pathway Database. Sci. Data 2021, 8, No. 169. - PMC - PubMed
    1. Richard AM; Judson RS; Houck KA; Grulke CM; Volarath P; Thillainadarajah I; Yang C; Rathman J; Martin MT; Wambaugh JF; Knudsen TB; Kancherla J; Mansouri K; Patlewicz G; Williams AJ; Little SB; Crofton KM; Thomas RS ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology. Chem. Res. Toxicol 2016, 29, 1225–1251. - PubMed

Publication types