Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 11:9:1028334.
doi: 10.3389/fmolb.2022.1028334. eCollection 2022.

Inventa: A computational tool to discover structural novelty in natural extracts libraries

Affiliations

Inventa: A computational tool to discover structural novelty in natural extracts libraries

Luis-Manuel Quiros-Guerrero et al. Front Mol Biosci. .

Abstract

Collections of natural extracts hold potential for the discovery of novel natural products with original modes of action. The prioritization of extracts from collections remains challenging due to the lack of a workflow that combines multiple-source information to facilitate the data interpretation. Results from different analytical techniques and literature reports need to be organized, processed, and interpreted to enable optimal decision-making for extracts prioritization. Here, we introduce Inventa, a computational tool that highlights the structural novelty potential within extracts, considering untargeted mass spectrometry data, spectral annotation, and literature reports. Based on this information, Inventa calculates multiple scores that inform their structural potential. Thus, Inventa has the potential to accelerate new natural products discovery. Inventa was applied to a set of plants from the Celastraceae family as a proof of concept. The Pristimera indica (Willd.) A.C.Sm roots extract was highlighted as a promising source of potentially novel compounds. Its phytochemical investigation resulted in the isolation and de novo characterization of thirteen new dihydro-β-agarofuran sesquiterpenes, five of them presenting a new 9-oxodihydro-β-agarofuran base scaffold.

Keywords: bioinformatic tools; computational metabolomics; mass spectrometry; natural products; prioritization; structural novelty discovery.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
A conceptual overview of Inventa’s priority score and its components. (A) Feature Component (FC): is a ratio of the number of specific and unannotated features over the total number of features by extract. (B) Literature Component (LC): is a score based on the number of compounds reported in the literature for the taxon. It is independent of the spectral data. (C) Class Component (CC): indicates if an unreported chemical class is detected in each extract compared to those reported in the species and the genus. (D) Similarity Component (SC): Compares extracts based on their general MS2 spectral information though their MEMO vectors and automatic outlier detectors. This score is independent from any retention-time based alignment procedure and complementary to FC. (E) The Priority Score (PS) is the addition of the four components. A modulating factor (w n ) gives each component a relative weight according to the user’s preferences. The higher the value, the higher the rank of the extract. (F) Results Table is a resume of individual calculation components and results.
FIGURE 2
FIGURE 2
(A) UHPLC-HRMS chromatogram (BPI positive ion mode) showing the region where the dihydro-β-agarofuran sesquiterpenoids derivatives are suspected and displaying the only two compounds annotated for P. indica roots (plant with the highest PS). (B) Ion identity networking-based interactive ion map showing the combined results of the FC and CC for the IIN. In such display all features of a single neutral molecule are grouped under a single spot. The IIN are displayed according to their status (specific unannotated (blue), specific annotated (green), and non-specific unannotated -not interesting- (yellow)). Complementary information (adducts, row id, chemical class, etc.) are displayed interactively for each IIN if available, as shown in the zoom sections for the ion identity network 1734. The intensities in both cases (bar’s height and bubble’s size) are proportional to the original quantification table (before any filtering step). The scatter plot shows the m/z ratio of each feature (or ion network identity) on the y-axis. The feature-based ion map can be found in Supplementary Figure S82.
FIGURE 3
FIGURE 3
Original dihydro-β-agarofuran derivatives isolated from the Pristimera indica roots extract.
FIGURE 4
FIGURE 4
The relative position of the isolated compound (113) in the chromatogram for the ethyl acetate extract of Pristimera indica roots. The upper chromatographic trace corresponds to the ESI in positive ionization mode, while the lower trace corresponds to the Charged Aerosol Detector (CAD), a semi-quantitative trace. Compounds highlighted in green hold a new 9-oxodihydro-β-agarofuran base scaffold.
FIGURE 5
FIGURE 5
(A) HMBC Key and ROESY correlations for the compounds isolated from P. indica roots extract. (B) Experimental and B3LYP/def2svp//B3LYP/6-31G(d,p) calculated spectra in acetonitrile for compound 3.

References

    1. Allard P. M., Péresse T., Bisson J., Gindro K., Marcourt L., Pham V. C., et al. (2016). Integration of molecular networking and in-silico MS/MS fragmentation for natural products dereplication. Anal. Chem. 88, 3317–3323. 10.1021/acs.analchem.5b04804 - DOI - PubMed
    1. Allard P. M., Genta-Jouve G., Wolfender J. L. (2017). Deep metabolome annotation in natural products research: Towards a virtuous cycle in metabolite identification. Curr. Opin. Chem. Biol. 36, 40–49. 10.1016/j.cbpa.2016.12.022 - DOI - PubMed
    1. Alvarenga N., Ferro E. A. (2006). “Bioactive triterpenes and related compounds from celastraceae,” in Studies in natural products chemistry. Editor Atta-ur-Rahman (Elsevier; ), 239–307. 10.1016/S1572-5995(06)80029-3 - DOI
    1. Amara A., Frainay C., Jourdan F., Naake T., Neumann S., Novoa-Del-Toro E. M., et al. (2022). Networks and graphs discovery in metabolomics data analysis and interpretation. Front. Mol. Biosci. 9, 841373. 10.3389/fmolb.2022.841373 - DOI - PMC - PubMed
    1. Brejnrod A., Ernst M., Dworzynski P., Rasmussen L. B., Dorrestein P. C., van der Hooft J. J. J., et al. (2019). Implementations of the chemical structural and compositional similarity metric in R and Python. bioRxiv 1, 546150. 10.1101/546150 - DOI

LinkOut - more resources