Examining the effects of analytical replication on data quality in a non-targeted analysis experiment
- PMID: 40515841
- DOI: 10.1007/s00216-025-05940-x
Examining the effects of analytical replication on data quality in a non-targeted analysis experiment
Erratum in
-
Correction to: Examining the effects of analytical replication on data quality in a non‑targeted analysis experiment.Anal Bioanal Chem. 2025 Nov;417(26):6029. doi: 10.1007/s00216-025-06101-w. Anal Bioanal Chem. 2025. PMID: 40954338 No abstract available.
Abstract
Non-targeted analysis (NTA) methods are integral to environmental monitoring given their ability to expand measurable chemical space beyond that of traditional targeted methods. Such vast quantities of NTA data are generated that exhaustive manual review is generally unfeasible. Computational tools facilitate automated data processing, but cannot always distinguish real signals (i.e., originating from a chemical in a sample) from artifacts. Replicate analysis is recommended to aid data review, but as NTA studies become larger, the cost of analytical replication becomes untenable. A need therefore exists for examination of information penalties associated with reduced replication. To investigate this issue, using an existing NTA dataset, we performed over 70,000 simulations of variable replication designs and calculated false discovery rates (FDRs) and false negative rates (FNRs) for NTA features and occurrences. We used regression models to explore associations between replication percentage and FDR/FNR, and to test whether rates were affected by NTA feature attributes. Inverse relationships were generally observed between replication percentage and FDR/FNR, such that lower replication yielded higher information penalties. Significant increases in FDR/FNR were observed for suspected per- and polyfluoroalkyl substances (PFAS) compared to non-PFAS, highlighting the potential for differences in information penalties across feature groups. Specific quantitative information penalties are expected to be unique for each NTA study based on sample type and workflow. The methods presented here can support future pilot-scale investigations that will inform the required level of replication in full-scale studies.
Keywords: False discovery rate; False negative rate; Quality assurance; Quality control; Study design.
© 2025. The Author(s), under exclusive licence to Springer-Verlag GmbH, DE part of Springer Nature.
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests. Disclaimer: The views expressed in this paper are those of the author(s) and do not necessarily represent the views or the policies of the USEPA. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the US Government. This document has been reviewed in accordance with USEPA policy and is approved for publication.
References
-
- Sobus JR, DeWoskin RS, Tan YM, Pleil JD, Phillips MB, George BJ, et al. Uses of NHANES biomarker data for chemical risk assessment: trends, challenges, and opportunities. Environ Health Persp. 2015;123(10):919–27. - DOI
LinkOut - more resources
Full Text Sources
