Two-group Poisson-Dirichlet mixtures for multiple testing
- PMID: 32535900
- PMCID: PMC8638421
- DOI: 10.1111/biom.13314
Two-group Poisson-Dirichlet mixtures for multiple testing
Abstract
The simultaneous testing of multiple hypotheses is common to the analysis of high-dimensional data sets. The two-group model, first proposed by Efron, identifies significant comparisons by allocating observations to a mixture of an empirical null and an alternative distribution. In the Bayesian nonparametrics literature, many approaches have suggested using mixtures of Dirichlet Processes in the two-group model framework. Here, we investigate employing mixtures of two-parameter Poisson-Dirichlet Processes instead, and show how they provide a more flexible and effective tool for large-scale hypothesis testing. Our model further employs nonlocal prior densities to allow separation between the two mixture components. We obtain a closed-form expression for the exchangeable partition probability function of the two-group model, which leads to a straightforward Markov Chain Monte Carlo implementation. We compare the performance of our method for large-scale inference in a simulation study and illustrate its use on both a prostate cancer data set and a case-control microbiome study of the gastrointestinal tracts in children from underdeveloped countries who have been recently diagnosed with moderate-to-severe diarrhea.
Keywords: Bayesian nonparametrics; Poisson-Dirichlet process; microbiome analysis; multiple testing; two-group model.
© 2020 The International Biometric Society.
Figures


References
-
- Ahdesmaki M, Zuber V, Gibb S and Strimmer K (2015) sda: Shrinkage Discriminant Analysis and CAT Score Variable Selection. R Package Version 1.3.7.
-
- Benjamini Y and Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B: Statistical Methodology, 57, 289–300.
-
- Canale A, Corradin R and Nipoti B (2019) Importance conditional sampling for Pitman-Yor mixtures. arXiv.
-
- Dahl DB (2005) Sequentially-allocated merge-split sampler for conjugate and nonconjugate Dirichlet process mixture models. Technical Report, Department of Statistics, University of Winsconsin, Madison.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources