Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Dec 9;16(1):10600.
doi: 10.1038/s41467-025-65993-5.

A resource to empirically establish drug exposure records directly from untargeted metabolomics data

Haoqi Nina Zhao #  1   2 Kine Eide Kvitne #  2   3 Corinna Brungs #  4   5 Siddharth Mohan  2 Vincent Charron-Lamoureux  1   2 Wout Bittremieux  1   2   6 Runbang Tang  2 Robin Schmid  1   2   4 Santosh Lamichhane  2   7 Shipei Xing  1   2 Yasin El Abiead  1   2 Mohammadsobhan S Andalibi  8   9   10 Helena Mannochio-Russo  1   2 Madison Ambre  11 Nicole E Avalon  12 MacKenzie Bryant  11 Lindsey A Burnett  13 Andrés Mauricio Caraballo-Rodríguez  1   2 Martin Casas Maya  11 Loryn Chin  14 Lluís Corominas  15 Ronald J Ellis  8   9 Donald Franklin  9 Sagan Girod  16 Paulo Wender P Gomes  1   2   17 Lauren Hansen  11 Robert K Heaton  9 Jennifer E Iudicello  9 Alan K Jarmusch  1   2   18 Lora Khatib  8 Scott Letendre  10   19 Sarolt Magyari  2   20 Daniel McDonald  11 Ipsita Mohanty  1   2 Andrés Cumsille  2   21 David J Moore  9   10 Prajit Rajkumar  2 Dylan H Ross  22   23 Harshada Sapre  2 Mohammad Reza Zare Shahneh  24 Ruben Gil-Solsona  25 Sydney P Thomas  1   2 Caitlin Tribelhorn  11 Helena M Tubb  11 Corinn Walker  11 Crystal X Wang  9   10 Jasmine Zemlin  1   2   26 Simone Zuffa  1   2 David S Wishart  16   27 Pablo Gago-Ferrero  25 Rima Kaddurah-Daouk  28   29   30 Mingxun Wang  24 Manuela Raffatellu  11   26   31 Karsten Zengler  11   14   26   32 Tomáš Pluskal  4 Libin Xu  22 Rob Knight  11   26   33   34   35 Shirley M Tsunoda  2 Pieter C Dorrestein  36   37   38
Affiliations

A resource to empirically establish drug exposure records directly from untargeted metabolomics data

Haoqi Nina Zhao et al. Nat Commun. .

Abstract

Despite extensive efforts, extracting medication exposure information from clinical records remains challenging. To complement this approach, here we show the Global Natural Product Social Molecular Networking (GNPS) Drug Library, a tandem mass spectrometry (MS/MS) based resource designed for drug screening with untargeted metabolomics. This resource integrates MS/MS references of drugs and their metabolites/analogs with standardized vocabularies on their exposure sources, pharmacologic classes, therapeutic indications, and mechanisms of action. It enables direct analysis of drug exposure and metabolism from untargeted metabolomics data, supporting flexible summarization at multiple ontology levels to align with different research goals. We demonstrate its application by stratifying participants in a human immunodeficiency virus (HIV) cohort based on detected drug exposures. We uncover drug-associated alterations in microbiota-derived N-acyl lipids that are not captured when stratifying by self-reported medication use. Overall, GNPS Drug Library provides a scalable resource for empirical drug screening in clinical, nutritional, environmental, and other research disciplines, facilitating insights into the ecological and health consequences of drug exposures. While not intended for immediate clinical decision-making, it supports data-driven exploration of drug exposures where traditional records are limited or unreliable.

PubMed Disclaimer

Conflict of interest statement

Competing interests: R.S. is a co-founder of mzio GmbH. D.M.: D.M. is a consultant for BiomeSense, Inc., has equity, and receives income. The terms of these arrangements have been reviewed and approved by the University of California, San Diego in accordance with its conflict of interest policies. R.K.-D.: R.K.-D. is an inventor on a series of patents on use of metabolomics for the diagnosis and treatment of CNS diseases and holds equity in Metabolon Inc., Chymia LLC, and PsyProtix. M.W. is a co-founder of Ometa Labs LLC. T.P. is a co-founder of mzio GmbH. R.K. is a scientific advisory board member, and consultant for BiomeSense, Inc., has equity, and receives income. He is a scientific advisory board member and has equity in GenCirq. He has equity in and acts as a consultant for Cybele. He is a co-founder of Biota, Inc., and has equity. He is a cofounder of Micronoma and has equity and is a scientific advisory board member. He is a board member of Microbiota Vault, Inc. He is a board member of N = 1 IBS advisory board and receives income. He is a Senior Visiting Fellow of HKUST Jockey Club Institute for Advanced Study. The terms of these arrangements have been reviewed and approved by the University of California, San Diego, in accordance with its conflict of interest policies. S.M.T.: S.M.T. receives research funding from Veloxis Pharmaceuticals. P.C.D.: P.C.D. is a scientific advisor and holds equity in Cybele, and bileOmix, and is a Scientific Co-founder, advisor, holds equity and/or received income from Ometa, Arome, and Enveda, with prior approval by UC-San Diego. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. The GNPS Drug Library and connected pharmacologic metadata.
a The GNPS Drug Library comprises four key resources: Drug MS/MS reference spectra, drug metabolite MS/MS reference spectra, propagated drug analogs derived from public metabolomics datasets, and pharmacologic metadata connected to each reference spectrum. b FastMASST analog search of drug spectra against public metabolomics studies yielded propagated drug-analogous MS/MS spectra, which were filtered by removing analogs for drugs with endogenous and food sources (source filter), removing mass offsets unexplained by common metabolic pathways (mass offset filter), removing analogs with GNPS library matches (library match filter), and removing analogs with unrealistic drug exposure indications (dataset testing). c Illustration of each filter employed in curating FastMASST analog match results. d Frequency of mass offsets in the propagated drug analog library. The mass offsets were grouped by unit mass and stacked based on the number of analog spectra. The most frequently observed mass offsets are colored while the rests are black. The numbers of in-source fragments, isotopes, and adducts in the drug analog library are estimated based on peak shape correlation and fragment matching strategies (see Fig. S1 for details). e An example of structural modification sites predicted by ModiFinder. Purple color highlights modified spectra and substructures, while the green color highlights unmodified ones. f Overview of the ontology-based drug metadata, highlighting common pharmaceutical classes and specific drugs in the neurology/psychiatry category. Width of the bars and lines reflects the number of unique drug structures in each class. g Barplot showing the top 20 most detected pharmacologic classes in fecal samples from the American Gut Project, a cohort of the general population from the United States (US), Europe, and Australia (1993 individuals). Bars are colored based on the pharmacologic classes. h Detected therapeutic drug class patterns by age and sex (1845 individuals with age and sex information; age 46 ± 18 years [range 3–93], with 53% being female). Bars and lines are colored based on sex. NSAID non-steroidal anti-inflammatory drugs, ACE angiotensin converting enzyme, SSRI selective serotonin reuptake inhibitor, PPI proton pump inhibitor, HSV herpes simplex virus, DHFR dihydrofolate reductase, SNRI serotonin and norepinephrine reuptake inhibitor. Source data are provided as a Source Data file.
Fig. 2
Fig. 2. Drug exposures in the HIV Neurobehavioral Research Center (HNRC) cohort with connections to microbial and endogenous metabolites.
From the HNRC cohort, 322 fecal samples were analyzed, with 222 samples from people with HIV and 100 samples from people without HIV. a Peak area visualization of drugs detected with at least two metabolites or analogs and in >10% samples. Each column represents one sample, and each row represents one drug annotation. Drug annotations were grouped based on the parent drugs and separated by gap spaces. Drug annotations were denoted based on their annotation types (as drug, drug metabolites, or drug analogs) and the pharmacologic classes of the parent drugs. All annotated ion/adduct forms of the parent drugs were visualized, leading to multiple rows of parent annotations for some drugs. Asterisks on the drug name mark parent drug annotations confirmed with commercial standards based on retention time and MS/MS spectral matches. Raw peak areas were log-transformed. b Retention time and MS/MS spectra mirror matches for drug analogs observed in both the fecal samples and the drug microbial incubations. Purple traces represent the fecal samples, while red traces represent the drug microbial incubation. Blue traces represent mixtures of the fecal samples and the microbial incubations at 1:1 volume ratio. The atomic changes of the drug analogs were based on [M + H]+ ion of the parent drug. c Hierarchical clustering of the samples from people with HIV (n = 222) based on detected antiretroviral drugs (ARV). Each row represents one detected ARV, with peak areas summed for the drug, metabolite, and analog detections followed by log-transformation (visualized with the same color scale as panel a). ARVs detected in <10% of samples are not shown. Each column represents one sample, clustered into four groups by hierarchical clustering with Ward’s linkage and Euclidean distance. d Sample-to-sample peak areas of N-acyl lipids in people with HIV, separated by the clusters derived from ARV detections shown in (c) (Group 1, n = 47; Group 2, n = 64; Group 3, n = 52; Group 4, n = 59). For each compound, the peak area in each sample was standardized to the maximum value observed across all samples. A non-parametric Kruskal-Wallis test followed by pairwise Wilcoxon test and Benjamini-Hochberg correction for multiple comparisons was performed. P-values < 0.05 were noted in the figure. Boxplots showcase the median value, first (lower) and third (upper) quartiles, and whiskers indicate the error range as 1.5 times the interquartile range. Source data are provided as a Source Data file.

Update of

  • Empirically establishing drug exposure records directly from untargeted metabolomics data.
    Zhao HN, Kvitne KE, Brungs C, Mohan S, Charron-Lamoureux V, Bittremieux W, Tang R, Schmid R, Lamichhane S, El Abiead Y, Andalibi MS, Mannochio-Russo H, Ambre M, Avalon NE, Bryant M, Caraballo-Rodríguez AM, Maya MC, Chin L, Ellis RJ, Franklin D, Girod S, Gomes PWP, Hansen L, Heaton R, Iudicello JE, Jarmusch AK, Khatib L, Letendre S, Magyari S, McDonald D, Mohanty I, Cumsille A, Moore DJ, Rajkumar P, Ross DH, Sapre H, Shahneh MRZ, Thomas SP, Tribelhorn C, Tubb HM, Walker C, Wang CX, Xing S, Zemlin J, Zuffa S, Wishart DS, Kaddurah-Daouk R, Wang M, Raffatellu M, Zengler K, Pluskal T, Xu L, Knight R, Tsunoda SM, Dorrestein PC. Zhao HN, et al. bioRxiv [Preprint]. 2024 Oct 26:2024.10.07.617109. doi: 10.1101/2024.10.07.617109. bioRxiv. 2024. Update in: Nat Commun. 2025 Dec 9;16(1):10600. doi: 10.1038/s41467-025-65993-5. PMID: 39416075 Free PMC article. Updated. Preprint.

References

    1. Vermeulen, R., Schymanski, E. L., Barabási, A.-L. & Miller, G. W. The exposome and health: Where chemistry meets biology. Science367, 392–396 (2020). - DOI - PMC - PubMed
    1. National Center for Health Statistics. Health, United States, 2019. Hyattsville, MD (2021). - PubMed
    1. Rappaport, S. M., Barupal, D. K., Wishart, D., Vineis, P. & Scalbert, A. The blood exposome and its role in discovering causes of disease. Environ. Health Perspect.122, 769–774 (2014). - DOI - PMC - PubMed
    1. de la Cuesta-Zuluaga, J., Boldt, L. & Maier, L. Response, resistance, and recovery of gut bacteria to human-targeted drug exposure. Cell Host Microbe32, 786–793 (2024). - DOI - PubMed
    1. Verdegaal, A. A. & Goodman, A. L. Integrating the gut microbiome and pharmacology. Sci. Transl. Med.16, eadg8357 (2024). - DOI - PMC - PubMed