Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun:164:107240.
doi: 10.1016/j.envint.2022.107240. Epub 2022 Apr 18.

CCDB: A database for exploring inter-chemical correlations in metabolomics and exposomics datasets

Affiliations

CCDB: A database for exploring inter-chemical correlations in metabolomics and exposomics datasets

Dinesh Kumar Barupal et al. Environ Int. 2022 Jun.

Abstract

Inter-chemical correlations in metabolomics and exposomics datasets provide valuable information for studying relationships among chemicals reported for human specimens. With an increase in the number of compounds for these datasets, a network graph analysis and visualization of the correlation structure is difficult to interpret. We have developed the Chemical Correlation Database (CCDB), as a systematic catalogue of inter-chemical correlation in publicly available metabolomics and exposomics studies. The database has been provided via an online interface to create single compound-centric views. We have demonstrated various applications of the database to explore: 1) the chemicals from a chemical class such as Per- and Polyfluoroalkyl Substances (PFAS), polycyclic aromatic hydrocarbons (PAHs), polychlorinated biphenyls (PCBs), phthalates and tobacco smoke related metabolites; 2) xenobiotic metabolites such as caffeine and acetaminophen; 3) endogenous metabolites (acyl-carnitines); and 4) unannotated peaks for PFAS. The database has a rich collection of 35 human studies, including the National Health and Nutrition Examination Survey (NHANES) and high-quality untargeted metabolomics datasets. CCDB is supported by a simple, interactive and user-friendly web-interface to retrieve and visualize the inter-chemical correlation data. The CCDB has the potential to be a key computational resource in metabolomics and exposomics facilitating the expansion of our understanding about biological and chemical relationships among metabolites and chemical exposures in the human body. The database is available at www.ccdb.idsl.me site.

Keywords: Biomonitoring; Database; Exposomics; Inter-chemical correlation; Metabolic pathways; Metabolomics; NHANES; Software.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Probable interpretations of correlation in targeted and untargeted GC/LC-HRMS datasets.
Fig. 2.
Fig. 2.
Prevalence of strong inter-chemical correlations across 35 studies in the CCDB. These are unique correlations. See the Table 1 for the description of each study and number of compounds. Table S3 shows the chemical detection rate across the indexed studies.
Fig. 3.
Fig. 3.
Correlations among chemicals within a class or having same source origin in the NHANES dataset. The correlation cutoff was 0.3 for PCB, PFC and Tobacco compounds, and 0.4 for PAHs. Online network can be accessed at the site - https://chemcor.idsl.site/originaldata/biomonitoring/#?studyid=NHANES. Edge thickness shows the correlation strength, by only the minimum and maximum correlation values are labelled on the edges for clarity. Thickness of edges are not comparable in two network figures. Abbreviations: Perfluorodecanoic acid (PFDeA), Perfluorohexane sulfonic acid (PFHxS), Perfluorononanoic acid (PFNA), Perfluoroundecanoic acid (PFUA), n-perfluorooctanoic acid (n-PFOA), n-perfluorooctane sulfonic acid (n-PFOS), Perfluoromethylheptane sulfonic acid isomers (SmPFOS), Polychlorinated Biphenyls (PCB); polyaromatic hydrocarbons (PAH), Perfluorinated compounds (PFC).
Fig. 4.
Fig. 4.
Compounds correlation with acylcarnitine 16:0 in the study ST002089. Edge thickness shows the correlation strength, by only the minimum and maximum correlation values are labelled on the edges for clarity. Thickness of edges are not comparable in two network figures. Abbreviations: acyl-carnitines (AC). Fatty acid (FA), glycerophosphoethanolamine (GPE), glycerophosphocholine (GPC).
Fig. 5.
Fig. 5.
Caffeine and phthalate metabolites in the NHANES survey data. Variable id URXMBP_PHTHTE_D (year 2005–2006) was used for mono-n-butyl phthalate (MnBP). Variable id URXMX7_CAFE_H (year 2013–2014) was used for caffeine. Label on the edges show the Pearson coefficient. Edge thickness shows the correlation strength, by only the minimum and maximum correlation values are labelled on the edges for clarity. Thickness of edges are not comparable in two network figures. Abbreviations: acetylamino-6-formylamino-3-methyluracil(AAMU).
Fig. 6.
Fig. 6.
Inter-chemical correlation among PFCs in the untargeted metabolomics datasets. Correlation threshold for ST001430 was 0.3 and for 0.6 for ST001231. White color node mean it was detected in by the reverse phase ESI (−) mode and a grey node means it was detected by a reverse phrase ESI (+) mode. Edge thickness shows the correlation strength, by only the minimum and maximum correlation values are labelled on the edges for clarity. Thickness of edges are not comparable in two network figures.
Fig. 7.
Fig. 7.
Chemical similarity enrichment analysis of PFOA and its correlation with other metabolites in the IDSLCCDB00001 study.

References

    1. Abdrabou W, Dieng MM, Diawara A, Sermé SS, Almojil D, Sombié S, Henry NB, Kargougou D, Manikandan V, Soulama I, Idaghdour Y, 2021. Metabolome modulation of the host adaptive immunity in human malaria. Nat. Metab. 3 (7), 1001–1016. - PubMed
    1. Barupal DK, Fiehn O, 2017. Chemical similarity enrichment analysis (chemrich) as alternative to biochemical pathway mapping for metabolomic datasets. Sci. Rep. 7 (1), 14567. 10.1038/s41598-017-15231-w. - DOI - PMC - PubMed
    1. Barupal DK, Fan S, Fiehn O, 2018. Integrating bioinformatics approaches for a comprehensive interpretation of metabolomics datasets. Curr. Opin. Biotechnol. 54, 1–9. 10.1016/j.copbio.2018.01.010. - DOI - PMC - PubMed
    1. Barupal DK, Baillie R, Fan S, Saykin AJ, Meikle PJ, Arnold M, Nho K, Fiehn O, Kaddurah-Daouk R, Zetterberg H, 2019a. Sets of coregulated serum lipids are associated with alzheimer’s disease pathophysiology. Alzheimers Dement (Amst) 11 (1), 619–627. - PMC - PubMed
    1. Barupal DK, Fiehn O, 2019. Generating the blood exposome database using a comprehensive text mining and database fusion approach. Environ. Health Perspect 127 (9), 97008. 10.1289/EHP4713. - DOI - PMC - PubMed

Publication types

LinkOut - more resources