Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 28:7:44.
doi: 10.1186/s13321-015-0087-1. eCollection 2015.

MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics

Affiliations

MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics

James G Jeffryes et al. J Cheminform. .

Abstract

Background: In spite of its great promise, metabolomics has proven difficult to execute in an untargeted and generalizable manner. Liquid chromatography-mass spectrometry (LC-MS) has made it possible to gather data on thousands of cellular metabolites. However, matching metabolites to their spectral features continues to be a bottleneck, meaning that much of the collected information remains uninterpreted and that new metabolites are seldom discovered in untargeted studies. These challenges require new approaches that consider compounds beyond those available in curated biochemistry databases.

Description: Here we present Metabolic In silico Network Expansions (MINEs), an extension of known metabolite databases to include molecules that have not been observed, but are likely to occur based on known metabolites and common biochemical reactions. We utilize an algorithm called the Biochemical Network Integrated Computational Explorer (BNICE) and expert-curated reaction rules based on the Enzyme Commission classification system to propose the novel chemical structures and reactions that comprise MINE databases. Starting from the Kyoto Encyclopedia of Genes and Genomes (KEGG) COMPOUND database, the MINE contains over 571,000 compounds, of which 93% are not present in the PubChem database. However, these MINE compounds have on average higher structural similarity to natural products than compounds from KEGG or PubChem. MINE databases were able to propose annotations for 98.6% of a set of 667 MassBank spectra, 14% more than KEGG alone and equivalent to PubChem while returning far fewer candidates per spectra than PubChem (46 vs. 1715 median candidates). Application of MINEs to LC-MS accurate mass data enabled the identity of an unknown peak to be confidently predicted.

Conclusions: MINE databases are freely accessible for non-commercial use via user-friendly web-tools at http://minedatabase.mcs.anl.gov and developer-friendly APIs. MINEs improve metabolomics peak identification as compared to general chemical databases whose results include irrelevant synthetic compounds. Furthermore, MINEs complement and expand on previous in silico generated compound databases that focus on human metabolism. We are actively developing the database; future versions of this resource will incorporate transformation rules for spontaneous chemical reactions and more advanced filtering and prioritization of candidate structures. Graphical abstractMINE database construction and access methods. The process of constructing a MINE database from the curated source databases is depicted on the left. The methods for accessing the database are shown on the right.

Keywords: Enzyme promiscuity; Liquid chromatography–mass spectrometry; Metabolite identification; Untargeted metabolomics.

PubMed Disclaimer

Figures

Graphical abstract
Graphical abstract
MINE database construction and access methods. The process of constructing a MINE database from the curated source databases is depicted on the left. The methods for accessing the database are shown on the right.
Fig. 1
Fig. 1
MINE database construction and access methods. The process of constructing a MINE database from the curated source databases is depicted on the left. The methods for accessing the database are shown on the right.
Fig. 2
Fig. 2
Generalizing a BNICE reaction rule from known biochemical reactions. The common motif of the hydrolysis of the 1,3-diketone is shaded for emphasis.
Fig. 3
Fig. 3
Histogram of Natural Product Likeness. This plot shows the distribution of Natural Product Likeness Scores for the KEGG Database (mean score 0.77), the KEGG MINE (mean score 0.98) and a random sample of 500,000 PubChem compounds (mean score −0.52). A more positive score indicates more natural atomic features.
Fig. 4
Fig. 4
Screenshot of Metabolomics search results. This screenshot displays features of the metabolomics results including filtering by attributes and highlighting (blue) of compound present in a specified KEGG genome reconstruction.
Fig. 5
Fig. 5
Positive MS spectrum (a), positive MS/MS spectrum (b) and negative MS/MS spectrum (c). The positive MS spectrum provides the mass of the precursor ion [M+H]+ = 690.5099 Da and its isotopic abundance pattern. The prominent ion in the positive MS/MS spectrum corresponds to the neutral loss of the phosphoethanolamine head group. The negative MS/MS spectrum shows the molecular ion [M−H] as well as a pair of ions corresponding to the (16:0) and (16:1) side chains.

Similar articles

Cited by

References

    1. Patti GJ, Yanes O, Siuzdak G. Innovation: metabolomics: the apogee of the omics trilogy. Nat Rev Mol Cell Biol. 2012;13:263–269. doi: 10.1038/nrm3314. - DOI - PMC - PubMed
    1. Dromms R, Styczynski M. Systematic applications of metabolomics in metabolic engineering. Metabolites. 2012;2:1090–1122. doi: 10.3390/metabo2041090. - DOI - PMC - PubMed
    1. Roux A, Lison D, Junot C, Heilier J-F. Applications of liquid chromatography coupled to mass spectrometry-based metabolomics in clinical chemistry and toxicology: a review. Clin Biochem. 2011;44:119–135. doi: 10.1016/j.clinbiochem.2010.08.016. - DOI - PubMed
    1. Guertin KA, Moore SC, Sampson JN, Huang W-Y, Xiao Q, Stolzenberg-Solomon RZ (2014) Metabolomics in nutritional epidemiology: identifying metabolites associated with diet and quantifying their potential to uncover diet-disease relations in populations. Am J Clin Nutr ajcn.113.078758 - PMC - PubMed
    1. Scalbert A, Brennan L, Fiehn O, Hankemeier T, Kristal BS, van Ommen B, et al. Mass-spectrometry-based metabolomics: limitations and recommendations for future progress with particular focus on nutrition research. Metabolomics. 2009;5:435–458. doi: 10.1007/s11306-009-0168-0. - DOI - PMC - PubMed