Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Oct 21;26(1):259.
doi: 10.1186/s12859-025-06265-3.

MultiModalGraphics: an R package for graphical integration of multi-omics datasets

Affiliations

MultiModalGraphics: an R package for graphical integration of multi-omics datasets

Foziya Ahmed Mohammed et al. BMC Bioinformatics. .

Abstract

Multimodal visualizations are essential for identifying and interpreting complex relationships in diverse, high-dimensional biological datasets. However, existing visualization tools often lack native capabilities for embedding explicit statistical and computational annotations, hindering effective quantitative interpretation. We introduce MultiModalGraphics, an R package designed specifically for creating annotated scatterplots and heatmaps of multi-omics and high-dimensional biological data. The package allows seamless embedding of statistical summaries such as fold-changes, p-values, q-values, and standard deviations, facilitating direct quantitative comparisons. MultiModalGraphics interoperates with Bioconductor packages including MultiAssayExperiment, limma, voom, and iClusterPlus, streamlining workflows from data preprocessing and differential expression analysis to visualization. Case studies on three distinct real-world multimodal datasets illustrate its practical utility. Source code, documentation, and example datasets are available via GitHub ( https://github.com/famanalytics0/MultiModalGraphics ) and under review for inclusion into Bioconductor.

Keywords: Annotated heatmap; Composite feature heatmap; Multimodal visualization; Thresholded scatterplot.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not Applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Example output from the AnnotatedHeatmap class, displaying statistical overlays (stars/dots) on heatmap tiles to indicate significant or trending molecular features. This enables immediate assessment of statistical relevance beyond color intensity alone. Data are drawn from integrated multimodal pan-cancer datasets (TCGA, ICGC, and cBioPortal), covering six cancer types: CESC (cervical squamous cell carcinoma and endocervical adenocarcinoma), OV (ovarian serous cystadenocarcinoma), PRAD (prostate adenocarcinoma), TGCT (testicular germ cell tumors), UCEC (uterine corpus endometrial carcinoma), and UCS (uterine carcinosarcoma). The corresponding R code configuration used to generate this plot is provided in Supplementary Code Examples
Fig. 2
Fig. 2
Example output from the CompositeFeatureHeatmap class, summarizing pathway-level activity across multiple brain regions and timepoints. Tile colors encode z-scores, overlaid dots encode significance, and dot size reflects feature counts, allowing multidimensional comparisons across conditions. The visualization summarizes results from 20 gene expression datasets (each ~ 45 K features across 10 mice) collected from five mouse brain regions at four timepoints (GEO accession GSE45035). The regions include AY (amygdala), HC (hippocampus), MPFC (medial prefrontal cortex), SE (septal region), ST (corpus striatum), and VS (ventral striatum). The heatmap highlights inhibition of neurogenesis and synaptic plasticity pathways associated with differentially expressed genes across time. The corresponding R code configuration used to generate this plot is provided in Supplementary Code Examples
Fig. 3
Fig. 3
Example output from the ThresholdedScatterplot class (volcano-style), illustrating log2 fold changes against − log10 p-values with thresholds for significance explicitly marked. Numbers of up- and down-regulated features are annotated in each panel, enabling transparent and quantitative interpretation of differential results. Data are from multi-tissue, multi-timepoint studies of differentially expressed genes across spleen, heart, blood, and seven brain regions at five timepoints (T5R1, T5R10, T10R1, T10R28, and T10R42; GEO accession GSE68077). Brain region abbreviations: AY (amygdala), HC (hippocampus), MPFC (medial prefrontal cortex), SE (septal region), ST (corpus striatum), and VS (ventral striatum). T indicates days of trauma exposure, and R indicates post-trauma tissue collection days. These datasets were downloaded from Gene Expression Omnibus GSE68077. The corresponding R code configuration used to generate this plot is provided in Supplementary Code

References

    1. Ligges U, Mächler M. Scatterplot3d-an r package for visualizing multivariate data. In. Technical report; 2002.
    1. Culhane AC, Thioulouse J, Perrière G, Higgins DG. MADE4: an R package for multivariate analysis of gene expression data. Bioinformatics. 2005;21(11):2789–90. - DOI - PubMed
    1. Graumann J, Cotton R. Multipanelfigure: simple assembly of multiple plots and images into a compound figure. J Stat Softw. 2018;84:1–10. - DOI
    1. Wickham H. ggplot2. Wiley Interdiscip Rev Comput Stat. 2011;3(2):180–5. - DOI
    1. Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32(18):2847–9. - DOI - PubMed

LinkOut - more resources