Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 27;5(11):1824-1833.
doi: 10.1021/acscentsci.9b00806. Epub 2019 Nov 14.

The Natural Products Atlas: An Open Access Knowledge Base for Microbial Natural Products Discovery

Affiliations

The Natural Products Atlas: An Open Access Knowledge Base for Microbial Natural Products Discovery

Jeffrey A van Santen et al. ACS Cent Sci. .

Abstract

Despite rapid evolution in the area of microbial natural products chemistry, there is currently no open access database containing all microbially produced natural product structures. Lack of availability of these data is preventing the implementation of new technologies in natural products science. Specifically, development of new computational strategies for compound characterization and identification are being hampered by the lack of a comprehensive database of known compounds against which to compare experimental data. The creation of an open access, community-maintained database of microbial natural product structures would enable the development of new technologies in natural products discovery and improve the interoperability of existing natural products data resources. However, these data are spread unevenly throughout the historical scientific literature, including both journal articles and international patents. These documents have no standard format, are often not digitized as machine readable text, and are not publicly available. Further, none of these documents have associated structure files (e.g., MOL, InChI, or SMILES), instead containing images of structures. This makes extraction and formatting of relevant natural products data a formidable challenge. Using a combination of manual curation and automated data mining approaches we have created a database of microbial natural products (The Natural Products Atlas, www.npatlas.org) that includes 24 594 compounds and contains referenced data for structure, compound names, source organisms, isolation references, total syntheses, and instances of structural reassignment. This database is accompanied by an interactive web portal that permits searching by structure, substructure, and physical properties. The Web site also provides mechanisms for visualizing natural products chemical space and dashboards for displaying author and discovery timeline data. These interactive tools offer a powerful knowledge base for natural products discovery with a central interface for structure and property-based searching and presents new viewpoints on structural diversity in natural products. The Natural Products Atlas has been developed under FAIR principles (Findable, Accessible, Interoperable, and Reusable) and is integrated with other emerging natural product databases, including the Minimum Information About a Biosynthetic Gene Cluster (MIBiG) repository, and the Global Natural Products Social Molecular Networking (GNPS) platform. It is designed as a community-supported resource to provide a central repository for known natural product structures from microorganisms and is the first comprehensive, open access resource of this type. It is expected that the Natural Products Atlas will enable the development of new natural products discovery modalities and accelerate the process of structural characterization for complex natural products libraries.

PubMed Disclaimer

Conflict of interest statement

The authors declare the following competing financial interest(s): RGL and MW are consultants for Sirenas. PCD is on the scientific advisory board for Sirenas. MHM is a member of the Scientific Advisory Board of Hexagon Bio and co-founder of Design Pharmaceuticals. MW is a founder of Ometa Labs.

Figures

Figure 1
Figure 1
Workflow for creation and curation of the Natural Products Atlas.
Figure 2
Figure 2
(A) Search interface (basic search). (B) Explore view (compound), (C) Discover view (author).
Figure 3
Figure 3
Four views in the Explore section. (A) Compound view, providing data for individual compounds. (B) Cluster view, illustrating compounds with close structural similarity. (C) Node view, illustrating clusters of compounds that are more distantly related. (D) Global view, presenting distribution of all chemical space in the Natural Products Atlas.
Figure 4
Figure 4
Global view, illustrating positions of all natural products containing a pyridine functional group as a substructure motif.
Figure 5
Figure 5
(A) Cluster view for guanacastepene E (NPAID 2040, red circle) and related compounds. (B) Node view for guanacastepene E, illustrating distribution and connectivities of related clusters (purple hexagons) and example structures from each cluster.

References

    1. Wolfender J.-L.; Litaudon M.; Touboul D.; Queiroz E. F. Innovative Omics-Based Approaches for Prioritisation and Targeted Isolation of Natural Products - New Strategies for Drug Discovery. Nat. Prod. Rep. 2019, 36 (6), 855–868. 10.1039/C9NP00004F. - DOI - PubMed
    1. Caesar L. K.; Nogo S.; Naphen C. N.; Cech N. B. Simplify: A Mass Spectrometry Metabolomics Approach to Identify Additives and Synergists from Complex Mixtures. Anal. Chem. 2019, 91 (17), 11297–11305. 10.1021/acs.analchem.9b02377. - DOI - PMC - PubMed
    1. McMillan E. A.; Kwon G.; Clemenceau J. R.; Fisher K. W.; Vaden R. M.; Shaikh A. F.; Neilsen B. K.; Kelly D.; Potts M. B.; Sung Y.-J.; et al. A Genome-Wide Functional Signature Ontology Map and Applications to Natural Product Mechanism of Action Discovery. Cell Chem. Biol. 2019, 26, 1380.10.1016/j.chembiol.2019.07.008. - DOI - PMC - PubMed
    1. Meleshko D.; Mohimani H.; Tracanna V.; Hajirasouliha I.; Medema M. H.; Korobeynikov A.; Pevzner P. A. Biosynthetic SPAdes: Reconstructing Biosynthetic Gene Clusters from Assembly Graphs. Genome Res. 2019, 29 (8), 1352–1362. 10.1101/gr.243477.118. - DOI - PMC - PubMed
    1. Alanjary M.; Steinke K.; Ziemert N. AutoMLST: An Automated Web Server for Generating Multi-Locus Species Trees Highlighting Natural Product Potential. Nucleic Acids Res. 2019, 47 (W1), W276–W282. 10.1093/nar/gkz282. - DOI - PMC - PubMed