Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep;17(9):901-904.
doi: 10.1038/s41592-020-0916-7. Epub 2020 Aug 17.

ReDU: a framework to find and reanalyze public mass spectrometry data

Affiliations

ReDU: a framework to find and reanalyze public mass spectrometry data

Alan K Jarmusch et al. Nat Methods. 2020 Sep.

Abstract

We present ReDU ( https://redu.ucsd.edu/ ), a system for metadata capture of public mass spectrometry-based metabolomics data, with validated controlled vocabularies. Systematic capture of knowledge enables the reanalysis of public data and/or co-analysis of one's own data. ReDU enables multiple types of analyses, including finding chemicals and associated metadata, comparing the shared and different chemicals between groups of samples, and metadata-filtered, repository-scale molecular networking.

PubMed Disclaimer

Figures

Extended Data Fig. 1
Extended Data Fig. 1. Repository-scale molecular cartography enabled by ReDU.
ReDU samples with latitude and longitude information, n = 34,003, were grouped by latitude and longitude (n = 2068 different locations) and plotted colored by number of annotations per file (log10 scaled).
Extended Data Fig. 2
Extended Data Fig. 2. Comparison of bacterial cultures using Group Comparator in ReDU.
Bacterial cultures of 1280|Staphylococcus aureus (n = 49), 1423|Bacillus subtilis (n = 89), and 1883|Streptomyces (n = 7) were compared and chemical differences are illustrated by pyroGlu-Ile, staurosporine, and surfactin-C14.
Extended Data Fig 3
Extended Data Fig 3. Repository-scale molecular networking of human blood (n = 711), fecal (n = 5,097), and urine (n = 307) supplemented by MolNetEnhancer.
(a) MolNetEnhancer enhanced molecular network in which components are colored based on Classyfire chemical class prediction. (b) Number of nodes per Classyfire chemical class prediction. Nodes without a match in Classyfire are not displayed.
Extended Data Fig 4
Extended Data Fig 4. Molecular cartography of the distribution of drugs on the human body visualized using ili.
Descaladinose azithromycin, a drug metabolite of azithromycin, distribution in human (n = 17,117; normalized by the number of files per sample).
Fig 1.
Fig 1.. ReDU Framework and Illustrative Public ReDU Data Analyses.
(a) ReDU provides users the tools to find public data in the GNPS/MassIVE knowledgebase, explore public data analyses in ReDU, and enable repository-scale co- and re-analyses in GNPS. Contributors are provided a template for sample information and a drag-and-drop validator. (b) 2D Emperor plot displaying the projection of human plasma samples (orange) from rheumatoid arthritis patients (not included in ReDU) onto all files (points) in ReDU colored by UBERON ontology (NCBI taxonomy-based opacity used: projected data, 1.0; 9606|Homo sapiens, 0.7; and all other data, 0.25). (c) Illustrative results from Chemical Explorer for 12-ketodeoxycholic acid, cholic acid, and rosuvastatin annotated in human fecal (n = 5,097) files over different life stages. (d) Group comparator performed on human blood (n = 711), fecal (n = 5,097), and urine (n = 307) samples resulted in different chemical compositions as illustrated by bilirubin, urobilin, and stercobilin.
Fig 2.
Fig 2.. Repository-scale Molecular Networking of Public Data in ReDU.
(a) Molecular network of human blood (n = 711), fecal (n = 5,097), and urine (n = 307) in ReDU with nodes colored by annotation status, annotated (red) or unannotated (grey). (b) Summary of MS/MS library matching results (level 2 or 3) are displayed for the nodes in the network and all MS/MS spectra considered in the molecular network. (c) A portion of the repository-scale molecular networking containing clindamycin. Nodes are colored by the sample type. Node size reflects the number of MassIVE datasets. Node shape annotation status (annotated, diamond or unannotated, circle). Putatively annotated clindamycin analogs (2–9), based on MS/MS interpretation, are indicated using dashed arrows and numbers, blue, corresponding to the proposed structures.

References

    1. Wang M et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol 34, 828–837 (2016). - PMC - PubMed
    1. Haug K et al. MetaboLights: a resource evolving in response to the needs of its scientific community. Nucleic Acids Res. 48, D440–D444 (2020). - PMC - PubMed
    1. Sud M et al. Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools. Nucleic Acids Res. 44, D463–D470 (2016). - PMC - PubMed
    1. Federhen S The NCBI Taxonomy database. Nucleic Acids Res. 40, D136–D143 (2012). - PMC - PubMed
    1. Mungall CJ, Torniai C, Gkoutos GV, Lewis SE & Haendel MA Uberon, an integrative multi-species anatomy ontology. Genome Biol. 13, R5 (2012). - PMC - PubMed

Publication types