Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jul 12:1298:132-8.
doi: 10.1016/j.chroma.2013.05.021. Epub 2013 May 13.

Comparative analysis of mass spectral matching-based compound identification in gas chromatography-mass spectrometry

Affiliations

Comparative analysis of mass spectral matching-based compound identification in gas chromatography-mass spectrometry

Imhoi Koo et al. J Chromatogr A. .

Abstract

Compound identification in gas chromatography-mass spectrometry (GC-MS) is usually achieved by matching query spectra to spectra present in a reference library. Although several spectral similarity measures have been developed and compared using a small reference library, it still remains unknown how the relationship between the spectral similarity measure and the size of reference library affects on the identification accuracy as well as the optimal weight factor. We used three reference libraries to investigate the dependency of the optimal weight factor, spectral similarity measure and the size of reference library. Our study demonstrated that the optimal weight factor depends on not only spectral similarity measure but also the size of reference library. The mixture semi-partial correlation measure outperforms all existing spectral similarity measures in all tested reference libraries, in spite of the computational expense. Furthermore, the accuracy of compound identification using a larger reference library in future is estimated by varying the size of reference library. Simulation study indicates that the mixture semi-partial correlation measure will have the best performance with the increase of reference library in future.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Contour plot of compound identification accuracy calculated using the WC similarity measure and NIST11 as reference library
The four points in the plot show the minimum and the maximum of weight factors for intensity and m/z, respectively, for identification accuracy of 79.9%.
Figure 2
Figure 2. Accuracy of compound identification by random sampling
The solid lines stand for results of 100 time random sampling. The dash lines represent the predicted identification accuracy of each spectral similarity measure using large reference library to be developed in future.
Figure 3
Figure 3. Accuracy of compound identification
An identification result is considered as correct if the correct reference spectrum is one of the multiple top ranked reference spectra.
Figure 4
Figure 4
Computational time of each spectral similarity measure.

References

    1. Stein SE, Scott DR. Journal of the American Society for Mass Spectrometry. 1994;5:859. - PubMed
    1. Hertz HS, Hites RA, Biemann K. Analytical Chemistry. 1971;43:681.
    1. Atwater BL, Stauffer DB, Mclafferty FW, Peterson DW. Analytical Chemistry. 1985;57:899.
    1. Julian RK, Higgs RE, Gygi JD, Hilton MD. Analytical Chemistry. 1998;70:3249. - PubMed
    1. Rasmussen GT, Isenhour TL. Journal of Chemical Information and Computer Sciences. 1979;19:179.

Publication types

MeSH terms