Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014;101(4):964-970.
doi: 10.1093/biomet/asu033.

Analytic P-value calculation for the higher criticism test in finite d problems

Affiliations

Analytic P-value calculation for the higher criticism test in finite d problems

Ian J Barnett et al. Biometrika. 2014.

Abstract

The higher criticism is effective for testing a joint null hypothesis against a sparse alternative, e.g., for testing the effect of a gene or a genetic pathway that consists of d genetic markers. Accurate p-value calculations for the higher criticism based on the asymptotic distribution require a very large d, which is not the case for the number of genetic variants in a gene or a pathway. In this paper we propose an analytic method that accurately computes the p-value of the higher criticism test for finite d problems. Unlike previous treatments, this method does not rely on asymptotics in d or simulation, and is exact for arbitrary d when the test statistics are normally distributed. The method is particularly computationally advantageous when d is not large. We illustrate the proposed method with a case-control genome-wide association study of lung cancer and compare its power to competing methods through simulations.

Keywords: Empirical process; Genome-wide association studies; Higher criticism; Multiple hypothesis testing; Signal detection.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
An example c(t | h) = h[2d Φ̄ (t){1 − 2 Φ̄ (t)}]1/2 + 2dΦ̄ (t) is plotted with d = 6 and h = 2.4. The partition given by Lemma 1 is labeled on the t-axis.
Fig. 2
Fig. 2
The power for various methods are compared. The higher criticism applied to the decorrelated test statistics is represented by the solid line. The likelihood ratio test is represented by the dotted line. The sequence kernel association test is represented by the dashed line. Power is simulated for each ρ that is a nonnegative multiple of 0·05 and the smoothed results are displayed. Signal sparsity is 5% with β = 0.11 in the left plot with signal sparsity 10% with β = 0.08 in the right plot.

References

    1. Ahdesmki M, Strimmer K. Feature selection in omics prediction problems using cat scores and false non-discovery rate control. The Annals of Applied Statistics. 2009;4:503–519.
    1. Arias-Castro E, Candès EJ, Plan Y. Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism. The Annals of Statistics. 2011;39:2533–2556.
    1. Donoho D, Jin J. Higher criticism for detecting sparse heterogeneous mixtures. The Annals of Statistics. 2004;34:962–994.
    1. Donoho D, Jin J. Higher criticism thresholding: optimal feature selection when useful features are rare and weak. Proceedings of the National Academy of Sciences. 2008;105(39):14790–14795. - PMC - PubMed
    1. Gaenssler P. In: Empirical Processes. Gupta SS, editor. Vol. 3. Institute of Mathematical Statistics; 1983. pp. 1–11. Lecture Notes-Monograph Series.

LinkOut - more resources