Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Oct;22(10):930-9.
doi: 10.1089/cmb.2015.0021. Epub 2015 May 14.

A Method for Determining Taxonomical Contributions to Group Differences in Microbiomic Investigations

Affiliations

A Method for Determining Taxonomical Contributions to Group Differences in Microbiomic Investigations

Alexa Pragman et al. J Comput Biol. 2015 Oct.

Abstract

Here we show how one can decompose the contribution of different levels of taxonomic classification in terms of their impact on differences in the microbiota when comparing two groups. First we demonstrate a difficulty in trying to estimate taxonomic effects at multiple levels simultaneously and demonstrate an approach to determining which taxa have differences in means that are identified. We then develop a model based on an approach that is popular in the RNA-Seq analysis literature and apply it to the problem of determining which taxa differ between two patient groups. This model provides a more powerful method than simpler alternatives. A Bayesian computational strategy is used to obtain exact inference. Simulation studies indicate that the procedure works as intended, and an application to the study of COPD demonstrates the method's practical utility. Software is provided for implementing the method.

Keywords: Bayesian modeling; COPD; microbiota; overdispersed counts.

PubMed Disclaimer

Conflict of interest statement

No competing financial interests exist.

Figures

<b>FIG. 1.</b>
FIG. 1.
Posterior medians of the ratio of the mean in the healthy group to the mean in the COPD group for the phylum Actinobacteria (lighter values indicate higher levels in the healthy group). Some family names have been truncated. COPD, chronic obstructive pulmonary disease.
<b>FIG. 2.</b>
FIG. 2.
Posterior medians of the ratio of the mean in the healthy group to the mean in the COPD group for the phylum Bacteroidetes (lighter values indicate higher levels in the healthy group). Some family names have been truncated.
<b>FIG. 3.</b>
FIG. 3.
Posterior medians of the ratio of the mean in the healthy group to the mean in the COPD group for four phyla with only a single genus (lighter values indicate higher levels in the healthy group).
<b>FIG. 4.</b>
FIG. 4.
Posterior medians of the ratio of the mean in the healthy group to the mean in the COPD group for the phylum Firmicutes (lighter values indicate higher levels in the healthy group).
<b>FIG. 5.</b>
FIG. 5.
Posterior medians of the ratio of the mean in the healthy group to the mean in the COPD group for the phylum Fusobacteria (lighter values indicate higher levels in the healthy group).
<b>FIG. 6.</b>
FIG. 6.
Posterior medians of the ratio of the mean in the healthy group to the mean in the COPD group for the phylum Proteobacteria (lighter values indicate higher levels in the healthy group). The class names have been truncated (removing “proteobacteria”).
<b>FIG. 7.</b>
FIG. 7.
Simulation results for six scenarios. Each plot shows the frequency with which the posterior probability of a difference between two groups exceeds 95% (the horizontal line is at 0.05). Taxa are ordered from left to right by taxonomical courseness (i.e., phyla are on the left and genera on the right).

References

    1. Anders S., and Huber W. 2010. Differential expression analysis for sequence count data. Genome Biol. 11, R106. - PMC - PubMed
    1. Bourgon R., Gentleman R., and Huber W. 2010. Independent filtering increases detection power for high-throughput experiments. Proc. Natl. Acad. Sci. USA 107, 9546–9551 - PMC - PubMed
    1. Gelman A. 2006. Prior distributions for variance parameters in hierarchical models. Bayesian Anal. 1, 515–533
    1. Gelman A., and Rubin D. 1992. Inference from iterative simulation using multiple sequences (with discussion). Stat. Sci. 7, 457–511
    1. La Rosa P.S., Brooks J.P., Deych E., et al. 2012. Hypothesis testing and power calculations for taxonomic-based human microbiome data. PLoS One 7, e52078. - PMC - PubMed

LinkOut - more resources