Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1994 Sep;5(9):859-66.
doi: 10.1016/1044-0305(94)87009-8.

Optimization and testing of mass spectral library search algorithms for compound identification

Affiliations

Optimization and testing of mass spectral library search algorithms for compound identification

S E Stein et al. J Am Soc Mass Spectrom. 1994 Sep.

Abstract

Five algorithms proposed in the literature for library search identification of unknown compounds from their low resolution mass spectra were optimized and tested by matching test spectra against reference spectra in the NIST-EPA-NIH Mass Spectral Database. The algorithms were probability-based matching (PBM), dot-product, Hertz et al. similarity index, Euclidean distance, and absolute value distance. The test set consisted of 12,592 alternate spectra of about 8000 compounds represented in the database. Most algorithms were optimized by varying their mass weighting and intensity scaling factors. Rank in the list of candidatc compounds was used as the criterion for accuracy. The best performing algorithm (75% accuracy for rank 1) was the dot-product function that measures the cosine of the angle between spectra represented as vectors. Other methods in order of performance were the Euclidean distance (72%), absolute value distance (68%) PBM (65%), and Hertz et al. (64%). Intensity scaling and mass weighting were important in the optimized algorithms with the square root of the intensity scale nearly optimal and the square or cube the best mass weighting power. Several more complex schemes also were tested, but had little effect on the results. A modest improvement in the performance of the dot-product algorithm was made by adding a term that gave additional weight to relative peak intensities for spectra with many peaks in common.

PubMed Disclaimer

References

    1. J Am Soc Mass Spectrom. 1994 Apr;5(4):316-23 - PubMed
    1. J Am Soc Mass Spectrom. 1991 Sep;2(5):432-7 - PubMed
    1. J Am Soc Mass Spectrom. 1991 Sep;2(5):438-40 - PubMed

LinkOut - more resources