Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 10;97(22):11563-11571.
doi: 10.1021/acs.analchem.5c00539. Epub 2025 May 28.

Integrative Analysis of Nontargeted LC-HRMS and High-Throughput Metabarcoding Data for Aquatic Environmental Studies Using Combined Multivariate Statistical Approaches

Affiliations

Integrative Analysis of Nontargeted LC-HRMS and High-Throughput Metabarcoding Data for Aquatic Environmental Studies Using Combined Multivariate Statistical Approaches

Maryam Vosough et al. Anal Chem. .

Abstract

Significant progress in high-throughput analytical techniques has paved the way for novel approaches to integrating data sets from different compartments. This study leverages nontarget screening (NTS) via liquid chromatography-high-resolution mass spectrometry (LC-HRMS), a crucial technique for analyzing organic micropollutants and their transformation products, in combination with biological indicators. We propose a combined multivariate data processing framework that integrates LC-HRMS-based NTS data with other high-throughput data sets, exemplified here by 18S V9 rRNA and full-length 16S rRNA gene metabarcoding data sets. The power of data fusion is demonstrated by systematically evaluating the impact of treated wastewater (TWW) over time on an aquatic ecosystem through a controlled mesocosm experiment. Highly compressed NTS data were compiled through the implementation of the region of interest-multivariate curve resolution-alternating least-squares (MCR-ALS) method, known as ROIMCR. By integrating ANOVA-simultaneous component analysis with structural learning and integrative decomposition (SLIDE), the innovative SLIDE-ASCA approach enables the decomposition of global and partial common, as well as distinct variation sources arising from experimental factors and their possible interactions. SLIDE-ASCA results indicate that temporal variability explains a much larger portion of the variance (74.6%) than the treatment effect, with both contributing to global shared space variation (41%). Design structure benefits include enhanced interpretability, improved detection of key features, and a more accurate representation of complex interactions between chemical and biological data. This approach offers a greater understanding of the natural and wastewater-influenced temporal patterns for each data source, as well as reveals associations between chemical and biological markers in an exemplified perturbed aquatic ecosystem.

PubMed Disclaimer

Figures

1
1
ASCA score plots for time effect (factor β) for NTS (A), 16S rRNA (B), 18S V9 rRNA (C), and global common space of three data blocks using SLIDE-ASCA (D), respectively.
2
2
ASCA score plots for PC1 of treatment effect for individual data blocks and global common space of three data blocks using SLIDE-ASCA.
3
3
Heatmap analysis of key bio/chemical features associated with the time effect (β submodel) through SLIDE-ASCA modeling of NTS, 16S rRNA, and 18S V9 rRNA. The color scale represents time-associated changes, with red (up to +3) and blue (down to −3) for strong positive and negative associations, respectively.
4
4
Correlation circle plot for integrated modeling of NTS, 16S rRNA, and 18S V9 rRNA with DIABLO, for a 4-class classification problem.

References

    1. Aceña J., Stampachiacchiere S., Pérez S., Barceló D.. Advances in liquid chromatography–high-resolution mass spectrometry for quantitative and qualitative environmental analysis. Anal. Bioanal. Chem. 2015;407(21):6289–6299. doi: 10.1007/s00216-015-8852-6. - DOI - PubMed
    1. Vosough M., Schmidt T. C., Renner G.. Non-target screening in water analysis: Recent trends of data evaluation, quality assurance, and their future perspectives. Anal. Bioanal. Chem. 2024;416(9):2125–2136. doi: 10.1007/s00216-024-05153-8. - DOI - PMC - PubMed
    1. Shi C., Mahadwar G., Dávila-Santiago E., Bambakidis T., Crump B. C., Jones G. D.. Nontarget Chemical Composition of Surface Waters May Reflect Ecosystem Processes More than Discrete Source Contributions. Environ. Sci. Technol. 2023;57(46):18296–18305. doi: 10.1021/acs.est.2c08540. - DOI - PubMed
    1. Sieber G., Drees F., Shah M., Stach T. L., Hohrenk-Danzouma L., Bock C., Vosough M., Schumann M., Sures B., Probst A. J.. et al. Exploring the efficacy of metabarcoding and non-target screening for detecting treated wastewater. Sci. Total Environ. 2023;903:167457. doi: 10.1016/j.scitotenv.2023.167457. - DOI - PubMed
    1. Santiago-Rodriguez T. M., Hollister E. B.. Multi ‘omic data integration: A review of concepts, considerations, and approaches. Semin. Perinatol. 2021;45(6):151456. doi: 10.1016/j.semperi.2021.151456. - DOI - PubMed

MeSH terms

LinkOut - more resources