Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 15;31(14):2303-9.
doi: 10.1093/bioinformatics/btv104. Epub 2015 Mar 2.

Estimating the proportion of true null hypotheses when the statistics are discrete

Affiliations

Estimating the proportion of true null hypotheses when the statistics are discrete

Isaac Dialsingh et al. Bioinformatics. .

Abstract

Motivation: In high-dimensional testing problems π0, the proportion of null hypotheses that are true is an important parameter. For discrete test statistics, the P values come from a discrete distribution with finite support and the null distribution may depend on an ancillary statistic such as a table margin that varies among the test statistics. Methods for estimating π0 developed for continuous test statistics, which depend on a uniform or identical null distribution of P values, may not perform well when applied to discrete testing problems.

Results: This article introduces a number of π0 estimators, the regression and 'T' methods that perform well with discrete test statistics and also assesses how well methods developed for or adapted from continuous tests perform with discrete tests. We demonstrate the usefulness of these estimators in the analysis of high-throughput biological RNA-seq and single-nucleotide polymorphism data.

Availability and implementation: implemented in R.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
P values from discrete and continuous tests with π0=0.80. a) P-values for continuous tests b) P-values for discrete tests
Fig. 2.
Fig. 2.
P values from real data. (a) Raw P-values from the primate liver RNAseq study with biological replication. (b) Raw P-values from the bovine iron SNP study
Fig. 3.
Fig. 3.
Plots of ϕ^jt versus ϕ0jt for one random sample of RNA-2, m = 10 000, data for two different π0 values. (a) π0 = 0.3. (b) π0 = 0.8
Fig. 4.
Fig. 4.
The 25th percentile of π^0 for different estimators for the RNA-seq simulations. (a) π0 estimates, RNA-1, m = 10 000. (b) π0 estimates, RNA-2, m = 10 000
Fig. 5.
Fig. 5.
The 25th percentile of π^0 for different estimators for the SNP simulations. (a) π0 estimates for SNP study with 50 controls 50 treated, m = 10 000. (b) π0 estimates for SNP study with 80 controls 20 treated, m = 10 000

References

    1. Bancroft T., et al. (2013) Estimation of false discovery rate using sequential permutation p-values. Biometrics , 69, 1–7. - PubMed
    1. Benjamini Y., Hochberg Y. (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B , 57, 289–300.
    1. Benjamini Y., Hochberg Y. (2000) On the adaptive control of the false discovery rate in multiple testing with independent statistics. J. Behav. Educ. Stat. , 25, 60–83.
    1. Black M.A. (2004) A note on the adaptive control of false discovery rates. J. R. Stat. Soc. B , 66, 297–304.
    1. Blekhman R., et al. (2010) Sex-specific and lineage-specific alternative splicing in primates. Genome Res. , 20, 180–189. - PMC - PubMed

Publication types