Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Apr 24;5(4):700-708.
doi: 10.1021/acscentsci.9b00085. Epub 2019 Mar 19.

Rapid Prediction of Electron-Ionization Mass Spectrometry Using Neural Networks

Affiliations

Rapid Prediction of Electron-Ionization Mass Spectrometry Using Neural Networks

Jennifer N Wei et al. ACS Cent Sci. .

Abstract

When confronted with a substance of unknown identity, researchers often perform mass spectrometry on the sample and compare the observed spectrum to a library of previously collected spectra to identify the molecule. While popular, this approach will fail to identify molecules that are not in the existing library. In response, we propose to improve the library's coverage by augmenting it with synthetic spectra that are predicted from candidate molecules using machine learning. We contribute a lightweight neural network model that quickly predicts mass spectra for small molecules, averaging 5 ms per molecule with a recall-at-10 accuracy of 91.8%. Achieving high-accuracy predictions requires a novel neural network architecture that is designed to capture typical fragmentation patterns from electron ionization. We analyze the effects of our modeling innovations on library matching performance and compare our models to prior machine-learning-based work on spectrum prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Library matching task. (a) Depiction of how query spectra are matched to a collection of reference spectra as performed by mass spectrometry software. (b) Query spectra are compared against a library comprising spectra from the NIST 2017 main library and spectra predicted by our model (outlined in blue). Spectral images adapted from NIST Webbook.
Figure 2
Figure 2
Spectral prediction with MLP forward model (a) and MLP bidirectional model (b). For both spectra plots, the true spectrum is shown in blue on top, while the predicted spectrum is shown inverted in red. Note that the spectrum predicted by the bidirectional model shows fewer stray peaks than the forward model, particularly for larger m/z values. These peaks are much easier to predict with the reverse prediction mode.
Figure 3
Figure 3
Molecular representations are passed into a multilayer perceptron to generate an initial output. This output is used to make a forward prediction starting at formula image and formula image and in reverse starting from formula image and ending at formula image. A sigmoid gating is applied to the inputs as shown in eq 5.
Figure 4
Figure 4
Performance of different model architectures. (a) Comparison of the recall@10 accuracy of the linear regression model and the multilayer perceptron model using the forward, reverse, and bidirectional architecture. (b) Performance of NEIMS at different recall levels, and its comparison against the performance of using the NIST main library itself.
Figure 5
Figure 5
Comparing the similarity between the predicted spectrum and the ground truth spectrum to the overall similarity between spectra for the same molecule.

References

    1. Hsieh Y.; Korfmacher W. A. Increasing speed and throughput when using HPLC-MS/MS systems for drug metabolism and pharmacokinetic screening. Curr. Drug Metab. 2006, 7, 479–489. 10.2174/138920006777697963. - DOI - PubMed
    1. Zhou Z.; Zare R. N. Personal information from latent fingerprints using desorption electrospray ionization mass spectrometry and machine learning. Anal. Chem. 2017, 89, 1369–1372. PMID: 28194988.10.1021/acs.analchem.6b04498. - DOI - PubMed
    1. Petrie S.; Bohme D. K. Ions in space. Mass Spectrom. Rev. 2007, 26, 258–280. 10.1002/mas.20114. - DOI - PubMed
    1. Stein S. E.Mass Spectral Database; National Institute of Standards and Technology (NIST), 2017.
    1. Mclafferty F. W.Wiley Registry of Mass Spectral Data, 11th ed.; John Wiley and Sons, 2016.

LinkOut - more resources