Estimating the proportion of true null hypotheses when the statistics are discrete

Isaac Dialsingh¹, Stefanie R Austin², Naomi S Altman²

Affiliations

¹ Department of Mathematics and Statistics, The University of the West Indies, St. Augustine Campus, Trinidad and Tobago.
² Department of Statistics, The Pennsylvania State University, State College, PA 16802-2111, USA.

PMID: 25735771
PMCID: PMC4495288
DOI: 10.1093/bioinformatics/btv104

Isaac Dialsingh et al. Bioinformatics. 2015 Jul 15;31(14):2303-9. doi: 10.1093/bioinformatics/btv104. Epub 2015 Mar 2.

Abstract

Motivation: In high-dimensional testing problems, π0, the proportion of null hypotheses that are true, is an important parameter. For discrete test statistics, the P values come from a discrete distribution with finite support, and the null distribution may depend on an ancillary statistic, such as a table margin, that varies among the test statistics. Methods for estimating π0 developed for continuous test statistics, which depend on a uniform or identical null distribution of P values, may not perform well when applied to discrete testing problems.
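To make the point about finite support concrete, the following is a minimal sketch, not taken from the article, that enumerates the null distribution of a two-sided Fisher exact P value for a 2×2 table with fixed margins. The function name `fisher_pvalue_support` and the choice of Fisher's exact test are illustrative assumptions; the abstract only says that the null distribution may depend on an ancillary statistic such as a table margin.

```python
from math import comb


def fisher_pvalue_support(row1, row2, col1):
    """Hypothetical helper: null distribution of the two-sided Fisher exact
    P value for a 2x2 table with row totals (row1, row2) and first column
    total col1. Returns {achievable P value: null probability}."""
    n = row1 + row2
    lo = max(0, col1 - row2)   # smallest possible top-left cell count
    hi = min(row1, col1)       # largest possible top-left cell count
    total = comb(n, col1)

    # Hypergeometric null probability of each possible top-left cell count.
    probs = {k: comb(row1, k) * comb(row2, col1 - k) / total
             for k in range(lo, hi + 1)}

    # Two-sided P value: total probability of tables that are no more
    # likely than the observed one (small tolerance for float ties).
    eps = 1e-12
    pvals = {k: sum(q for q in probs.values() if q <= p * (1 + eps))
             for k, p in probs.items()}

    # Collapse tables sharing a P value into one support point.
    dist = {}
    for k, pv in pvals.items():
        key = round(min(pv, 1.0), 12)
        dist[key] = dist.get(key, 0.0) + probs[k]
    return dict(sorted(dist.items()))


if __name__ == "__main__":
    # Same total sample size, different margins, different supports.
    for margins in [(5, 5, 5), (3, 7, 5)]:
        print(margins, fisher_pvalue_support(*margins))
```

With margins (5, 5, 5) only three P values are achievable, and with margins (3, 7, 5) only two, the smaller of which is above 0.05. Tests with different margins therefore have different, non-uniform null distributions, which is the situation the abstract describes.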

Results: This article introduces a number of π0 estimators, the regression and 'T' methods, that perform well with discrete test statistics, and also assesses how well methods developed for or adapted from continuous tests perform with discrete tests. We demonstrate the usefulness of these estimators in the analysis of high-throughput biological RNA-seq and single-nucleotide polymorphism data.
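For orientation, here is a sketch of one widely used estimator designed for continuous tests, the threshold estimator usually attributed to Storey. It is not the regression or 'T' method introduced in the article, and the abstract does not say which continuous-case methods were assessed. The function name `storey_pi0` and the default `lam=0.5` are assumptions made for illustration.

```python
def storey_pi0(pvalues, lam=0.5):
    """Hypothetical helper: threshold estimate of pi0.

    Assumes null P values are Uniform(0, 1), so a fraction (1 - lam) of
    them is expected above lam. This is the assumption that can fail for
    discrete tests.
    """
    m = len(pvalues)
    if m == 0:
        raise ValueError("need at least one P value")
    if not 0.0 <= lam < 1.0:
        raise ValueError("lam must lie in [0, 1)")
    # Fraction of P values above the threshold, rescaled by (1 - lam).
    frac_above = sum(1 for p in pvalues if p > lam) / m
    # Cap at 1 because pi0 is a proportion.
    return min(1.0, frac_above / (1.0 - lam))


if __name__ == "__main__":
    example = [0.01, 0.02, 0.03, 0.04, 0.2, 0.3, 0.6, 0.9]
    print(storey_pi0(example))
```

Because the rescaling by (1 - lam) relies on uniform null P values, the estimate can be distorted when many discrete P values pile up at a few support points such as 1.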

Availability and implementation: Implemented in R.

Figures

**Fig. 1.**
P values from discrete and continuous tests with $π_{0} = 0.80$. (a) P values for continuous tests. (b) P values for discrete tests

**Fig. 2.**
P values from real data. **(a)** Raw P-values from the primate liver RNAseq study with biological replication. **(b)** Raw P-values from the bovine iron SNP study

**Fig. 3.**
Plots of ${\hat{ϕ}}_{j t}$ versus $ϕ_{0 j t}$ for one random sample of RNA-2 data, m = 10 000, for two different π₀ values. (a) π₀ = 0.3. (b) π₀ = 0.8

**Fig. 4.**
The 25th percentile of ${\hat{π}}_{0}$ for different estimators for the RNA-seq simulations. (a) π₀ estimates, RNA-1, m = 10 000. (b) π₀ estimates, RNA-2, m = 10 000

**Fig. 5.**
The 25th percentile of ${\hat{π}}_{0}$ for different estimators for the SNP simulations. (a) π₀ estimates for SNP study with 50 controls and 50 treated, m = 10 000. (b) π₀ estimates for SNP study with 80 controls and 20 treated, m = 10 000
