Deconvoluting and Interpreting Nontargeted Chemical Data: A Data-Driven Forensic Workflow for Identifying the Most Prominent Chemical Sources in Receiving Waters
- PMID: 40908883
- DOI: 10.1021/acs.est.5c07541
Deconvoluting and Interpreting Nontargeted Chemical Data: A Data-Driven Forensic Workflow for Identifying the Most Prominent Chemical Sources in Receiving Waters
Abstract
Chemical forensics aims to identify major contamination sources, but existing workflows often rely on predefined targets and known sources, introducing bias. Here, we present a data-driven workflow that reduces this bias by applying an unsupervised machine learning technique. We applied both nonmetric multidimensional scaling (NMDS) and non-negative matrix factorization (NMF) on the same nontargeted chemical data set to compare their different interpretations of environmental sources. Weekly nontargeted data was collected from the Fall Creek Monitoring Station (Ithaca, NY), where daily samples were previously analyzed using source-defined models. NMF was first used to decompose the full nontargeted chemical data set into a small set of chemical factors representing distinct composition profiles. Each factor was then interpreted through (1) Spearman correlations with watershed characteristics (e.g., temperature, flow) and (2) suspect screening of high-weighted nontargeted features. In addition to confirming known anthropogenic inputs, our analysis revealed potential novel sources associated with snowmelt, groundwater seepage, and seasonal hydrological dynamics. We also detected an annual shift in the chemical composition, highlighting the evolving influence of these sources. This workflow enables watershed managers to move beyond predefined sources, detect both known and emerging chemical contributors, and apply adaptive, evidence-based strategies to protect water quality under changing conditions.
Keywords: chemical forensics; non-negative matrix factorization; nontargeted analysis; unsupervised learning; watershed health.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
