Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 28;62(22):5425-5434.
doi: 10.1021/acs.jcim.2c00847. Epub 2022 Oct 24.

Prediction of Retention Time and Collision Cross Section (CCSH+, CCSH-, and CCSNa+) of Emerging Contaminants Using Multiple Adaptive Regression Splines

Affiliations

Prediction of Retention Time and Collision Cross Section (CCSH+, CCSH-, and CCSNa+) of Emerging Contaminants Using Multiple Adaptive Regression Splines

Alberto Celma et al. J Chem Inf Model. .

Abstract

Ultra-high performance liquid chromatography coupled to ion mobility separation and high-resolution mass spectrometry instruments have proven very valuable for screening of emerging contaminants in the aquatic environment. However, when applying suspect or nontarget approaches (i.e., when no reference standards are available), there is no information on retention time (RT) and collision cross-section (CCS) values to facilitate identification. In silico prediction tools of RT and CCS can therefore be of great utility to decrease the number of candidates to investigate. In this work, Multiple Adaptive Regression Splines (MARS) were evaluated for the prediction of both RT and CCS. MARS prediction models were developed and validated using a database of 477 protonated molecules, 169 deprotonated molecules, and 249 sodium adducts. Multivariate and univariate models were evaluated showing a better fit for univariate models to the experimental data. The RT model (R2 = 0.855) showed a deviation between predicted and experimental data of ±2.32 min (95% confidence intervals). The deviation observed for CCS data of protonated molecules using the CCSH model (R2 = 0.966) was ±4.05% with 95% confidence intervals. The CCSH model was also tested for the prediction of deprotonated molecules, resulting in deviations below ±5.86% for the 95% of the cases. Finally, a third model was developed for sodium adducts (CCSNa, R2 = 0.954) with deviation below ±5.25% for 95% of the cases. The developed models have been incorporated in an open-access and user-friendly online platform which represents a great advantage for third-party research laboratories for predicting both RT and CCS data.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Top: Comparison of experimental and predicted RT data using the RT model (A), CCS for protonated molecules using CCSH model (B), CCS for deprotonated molecules using the CCSH model (C), and CCS for sodium adducts using the CCSNa model (D). (Red line indicates region where Experimental CCS = Predicted CCS) Bottom: Histogram distribution of deviations between experimental and predicted data for RT data using the RT model (A), CCS for protonated molecules using the CCSH model (B), CCS for deprotonated molecules using the CCSH model (C), and CCS for sodium adducts using the CCSNa model (D). (Red vertical lines indicate 0% deviation).
Figure 2
Figure 2
95% prediction intervals (blue area) for the univariate MARS analysis on the square root of RT. Blue lines are placed at the predicted values ±2 min. Approximately, only 8% of observed retention times were more than 2 min away from their predicted value.
Figure 3
Figure 3
95% prediction intervals (blue area) for the univariate MARS analysis on (a) CCSH and (b) CCSNa models. blue lines are placed at 2% error bands and the purple at 5%. It is clear that the model still predicts well at higher values where there are less data but the prediction intervals are much larger to accommodate the uncertainty due to lack of data.
Figure 4
Figure 4
Online platform for the prediction of RT and CCS data using univariate models. (A) Selection of response to predict, that is, RT, CCS for (de)protonated molecules or CCS for sodium adducts; (B) introduction of molecular descriptors for the molecule of interest; (C) output of the predictor model together with the prediction intervals. Example illustrated by omeprazole.

Similar articles

Cited by

References

    1. Hernández F.; Bakker J.; Bijlsma L.; de Boer J.; Botero-Coy A. M.; Bruinen de Bruin Y.; Fischer S.; Hollender J.; Kasprzyk-Hordern B.; Lamoree M.; López F. J.; ter Laak T. L.; van Leerdam J. A.; Sancho J. V.; Schymanski E. L.; de Voogt P.; Hogendoorn E. A. The Role of Analytical Chemistry in Exposure Science: Focus on the Aquatic Environment. Chemosphere 2019, 222, 564–583. 10.1016/j.chemosphere.2019.01.118. - DOI - PubMed
    1. Hollender J.; van Bavel B.; Dulio V.; Farmen E.; Furtmann K.; Koschorreck J.; Kunkel U.; Krauss M.; Munthe J.; Schlabach M.; Slobodnik J.; Stroomberg G.; Ternes T.; Thomaidis N. S.; Togola A.; Tornero V. High Resolution Mass Spectrometry-Based Non-Target Screening Can Support Regulatory Environmental Monitoring and Chemicals Management. Environ. Sci. Eur. 2019, 31, 42.10.1186/s12302-019-0225-x. - DOI
    1. Schymanski E. L.; Singer H. P.; Slobodnik J.; Ipolyi I. M.; Oswald P.; Krauss M.; Schulze T.; Haglund P.; Letzel T.; Grosse S.; Thomaidis N. S.; Bletsou A.; Zwiener C.; Ibáñez M.; Portolés T.; De Boer R.; Reid M. J.; Onghena M.; Kunkel U.; Schulz W.; Guillon A.; Noyon N.; Leroy G.; Bados P.; Bogialli S.; Stipaničev D.; Rostkowski P.; Hollender J. Non-Target Screening with High-Resolution Mass Spectrometry: Critical Review Using a Collaborative Trial on Water Analysis. Anal. Bioanal. Chem. 2015, 407, 6237–6255. 10.1007/s00216-015-8681-7. - DOI - PubMed
    1. Samanipour S.; Langford K.; Reid M. J.; Thomas K. V. A Two Stage Algorithm for Target and Suspect Analysis of Produced Water via Gas Chromatography Coupled with High Resolution Time of Flight Mass Spectrometry. J. Chromatogr. A 2016, 1463, 153–161. 10.1016/j.chroma.2016.07.076. - DOI - PubMed
    1. Alygizakis N. A.; Oswald P.; Thomaidis N. S.; Schymanski E. L.; Aalizadeh R.; Schulze T.; Oswaldova M.; Slobodnik J. NORMAN Digital Sample Freezing Platform: A European Virtual Platform to Exchange Liquid Chromatography High Resolution-Mass Spectrometry Data and Screen Suspects in “Digitally Frozen” Environmental Samples. TrAC, Trends Anal. Chem. 2019, 115, 129–137. 10.1016/j.trac.2019.04.008. - DOI

Publication types