Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Nov 8:10:995.
doi: 10.3389/fgene.2019.00995. eCollection 2019.

Microbiome Multi-Omics Network Analysis: Statistical Considerations, Limitations, and Opportunities

Affiliations
Review

Microbiome Multi-Omics Network Analysis: Statistical Considerations, Limitations, and Opportunities

Duo Jiang et al. Front Genet. .

Abstract

The advent of large-scale microbiome studies affords newfound analytical opportunities to understand how these communities of microbes operate and relate to their environment. However, the analytical methodology needed to model microbiome data and integrate them with other data constructs remains nascent. This emergent analytical toolset frequently ports over techniques developed in other multi-omics investigations, especially the growing array of statistical and computational techniques for integrating and representing data through networks. While network analysis has emerged as a powerful approach to modeling microbiome data, oftentimes by integrating these data with other types of omics data to discern their functional linkages, it is not always evident if the statistical details of the approach being applied are consistent with the assumptions of microbiome data or how they impact data interpretation. In this review, we overview some of the most important network methods for integrative analysis, with an emphasis on methods that have been applied or have great potential to be applied to the analysis of multi-omics integration of microbiome data. We compare advantages and disadvantages of various statistical tools, assess their applicability to microbiome data, and discuss their biological interpretability. We also highlight on-going statistical challenges and opportunities for integrative network analysis of microbiome data.

Keywords: compositionality; heterogeneity; microbiome networks; multi-omics data integration; network analysis; normalization; sparsity.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Visualizing the unique challenges of microbiome data. A mock set of bacterial samples from two populations where each colored shape is a bacterial taxon. (A) Compositionality. The taxon abundance table depicts the count of each observed taxon in each sample. When sequencing microbiome samples, the resulting counts of taxa are not representative of the actual taxa counts in the sample due to constraints of sequencing. Due to this, relative abundances are generally used in analysis of microbiome data. The bar plots illustrate the difference in community representation between raw counts (top) and relative abundances (bottom). (B) Normalization. Due to the constraints of sequencing, the overall sequencing depth of a sample can impact the results. For example, shallow sequencing may miss rare taxa such as the green taxon V in the example sample A that is present in low abundance in the community. (C) Sparsity. Microbiome data are often very sparse, where most observations are zero. This is illustrated by the histogram of taxa counts for each sample where most counts are zero and there are few taxa with high counts. This can also be seen in the table for part A, where many entries are zero. (D) Heterogeneity. The table summarizes the taxonomic heterogeneity in the mock dataset between the two populations. Each sample has a unique taxonomic composition, but there are also population specific signatures. The samples in each population are dominated by a few taxa, and these dominant taxa are different for the two populations. Additionally, there are taxa that are highly abundant in one sample and absent from the rest, such as the purple taxon Y in sample A.

Similar articles

Cited by

References

    1. Akavia U. D., Litvin O., Kim J., Sanchez-Garcia F., Kotliar D., Causton H. C., et al. (2010). An integrated approach to uncover drivers of cancer. Cell. 143, 1005–1017. org/ 10,1016/j.cell. 2010, 11.013 - PMC - PubMed
    1. Albayrak L., Khanipov K., Golovko G., Fofanov Y. (2018). Detection of multi-dimensional co-exclusion patterns in microbial communities. Bioinformatics (Oxford, England). 34, 3695–3701. org/ 10.1093/bioinformatics/bty414 - PubMed
    1. Alivisatos A. P., Blaser M. J., Brodie E. L., Chun M., Dangl J. L., Donohue T. J., et al. (2015). A unified initiative to harness Earth’s microbiomes. Science. 350, 507–508. org/ 10.1126/science.aac8480 - PubMed
    1. Amano S. I., Ogawa K. I., Miyake Y. (2018). Node property of weighted networks considering connectability to nodes within two degrees of separation. Sci. Rep. 8, 8464. org/10.1038/s41598-018-26781-y - DOI - PMC - PubMed
    1. Aylward F. O., Eppley J. M., Smith J. M., Chavez F. P., Scholin C. A., DeLong E. F. (2015). Microbial community transcriptional networks are conserved in three domains at ocean basin scales. Proc. Natl Acad. Sci. 112, 5443–5448. org/ 10.1073/pnas.1502883112 - PMC - PubMed

LinkOut - more resources