Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb:68:477-489.
doi: 10.1016/j.jare.2024.06.002. Epub 2024 Jun 4.

Multi-task aquatic toxicity prediction model based on multi-level features fusion

Affiliations

Multi-task aquatic toxicity prediction model based on multi-level features fusion

Xin Yang et al. J Adv Res. 2025 Feb.

Abstract

Introduction: With the escalating menace of organic compounds in environmental pollution imperiling the survival of aquatic organisms, the investigation of organic compound toxicity across diverse aquatic species assumes paramount significance for environmental protection. Understanding how different species respond to these compounds helps assess the potential ecological impact of pollution on aquatic ecosystems as a whole. Compared with traditional experimental methods, deep learning methods have higher accuracy in predicting aquatic toxicity, faster data processing speed and better generalization ability.

Objectives: This article presents ATFPGT-multi, an advanced multi-task deep neural network prediction model for organic toxicity.

Methods: The model integrates molecular fingerprints and molecule graphs to characterize molecules, enabling the simultaneous prediction of acute toxicity for the same organic compound across four distinct fish species. Furthermore, to validate the advantages of multi-task learning, we independently construct prediction models, named ATFPGT-single, for each fish species. We employ cross-validation in our experiments to assess the performance and generalization ability of ATFPGT-multi.

Results: The experimental results indicate, first, that ATFPGT-multi outperforms ATFPGT-single on four fish datasets with AUC improvements of 9.8%, 4%, 4.8%, and 8.2%, respectively, demonstrating the superiority of multi-task learning over single-task learning. Furthermore, in comparison with previous algorithms, ATFPGT-multi outperforms comparative methods, emphasizing that our approach exhibits higher accuracy and reliability in predicting aquatic toxicity. Moreover, ATFPGT-multi utilizes attention scores to identify molecular fragments associated with fish toxicity in organic molecules, as demonstrated by two organic molecule examples in the main text, demonstrating the interpretability of ATFPGT-multi.

Conclusion: In summary, ATFPGT-multi provides important support and reference for the further development of aquatic toxicity assessment. All of codes and datasets are freely available online at https://github.com/zhaoqi106/ATFPGT-multi.

Keywords: Acute toxicity; Deep learning; Molecular fingerprints; Molecular graph features; Multi-task model.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

None
Graphical abstract
Fig. 1
Fig. 1
(A) The amount of data for each fish species and its proportion to the total population. (B) The number of non-toxic and toxic compounds on each fish species.
Fig. 2
Fig. 2
(A) The overview of ATFPGT-multi. (B) The architecture of the molecular graph representation module, which presents the graph-based representation approach. (C) Shows the transformer layer details based on the multi-head attention mechanism. (D) Describes the process of focused dot product attention layer.
Fig. 3
Fig. 3
Comparative analysis of ATFPGT-multi and ATFPGT-single for four fish species datasets.
Fig. 4
Fig. 4
The performance of ATFPGT-multi and the comparative algorithms for each fish species datasets.
Fig. 5
Fig. 5
Comparison analysis between ATFPGT-multi and its six variant models on four fish datasets.
Fig. 6
Fig. 6
Visualization of atomic attention weights. (A) The exemplar molecule selected from BS dataset is presented. (B) The exemplar molecule chosen from RT dataset is displayed. (C) The exemplar molecule selected from FHM dataset is presented. (D) The exemplar molecule selected from SHM dataset is presented.

Similar articles

Cited by

References

    1. Casalegno M., Benfenati E., Sello G. An automated group contribution method in predicting aquatic toxicity: the diatomic fragment approach. Chem Res Toxicol. 2005;18:740–746. - PubMed
    1. Cunha D.L., Mendes M.P., Marques M. Environmental risk assessment of psychoactive drugs in the aquatic environment. Environ Sci Pollut Res. 2018;26:78–90. - PubMed
    1. Melvin S.D., Lanctôt C.M., Doriean N.J.C., Bennett W.W., Carroll A.R. NMR-based lipidomics of fish from a metal(loid) contaminated wetland show differences consistent with effects on cellular membranes and energy storage. Sci Total Environ. 2019;654:284–291. - PubMed
    1. Zenker A., Cicero M.R., Prestinaci F., Bottoni P., Carere M. Bioaccumulation and biomagnification potential of pharmaceuticals with a focus to the aquatic environment. J Environ Manage. 2014;133:378–387. - PubMed
    1. Grabicova K., Grabic R., Fedorova G., Fick J., Cerveny D., Kolarova J., et al. Bioaccumulation of psychoactive pharmaceuticals in fish in an effluent dominated stream. Water Res. 2017;124:654–662. - PubMed

Substances

LinkOut - more resources