Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Apr 1;18(2):275-294.
doi: 10.1093/biostatistics/kxw041.

False discovery rates: a new deal

False discovery rates: a new deal

Matthew Stephens. Biostatistics. .

Abstract

We introduce a new Empirical Bayes approach for large-scale hypothesis testing, including estimating false discovery rates (FDRs), and effect sizes. This approach has two key differences from existing approaches to FDR analysis. First, it assumes that the distribution of the actual (unobserved) effects is unimodal, with a mode at 0. This "unimodal assumption" (UA), although natural in many contexts, is not usually incorporated into standard FDR analysis, and we demonstrate how incorporating it brings many benefits. Specifically, the UA facilitates efficient and robust computation-estimating the unimodal distribution involves solving a simple convex optimization problem-and enables more accurate inferences provided that it holds. Second, the method takes as its input two numbers for each test (an effect size estimate and corresponding standard error), rather than the one number usually used ($p$ value or $z$ score). When available, using two numbers instead of one helps account for variation in measurement precision across tests. It also facilitates estimation of effects, and unlike standard FDR methods, our approach provides interval estimates (credible regions) for each effect in addition to measures of significance. To provide a bridge between interval estimates and significance measures, we introduce the term "local false sign rate" to refer to the probability of getting the sign of an effect wrong and argue that it is a superior measure of significance than the local FDR because it is both more generally applicable and can be more robustly estimated. Our methods are implemented in an R package ashr available from http://github.com/stephens999/ashr.

Keywords: Empirical Bayes; False discovery rates; Multiple testing; Shrinkage; Unimodal.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Illustration that the UA in ash can produce very different results from existing methods. The figure shows, for a single simulated dataset, the way different methods decompose formula image values (left) and formula image scores (right) into a null component (dark blue) and an alternative component (cyan). In the formula image score space the alternative distribution is placed on the bottom to highlight the differences in its shape among methods. The three existing methods (qvalue, locfdr, mixfdr) all produce a “hole” in the alternative formula image score distribution around 0. In contrast ash makes the UA—that the effect sizes, and thus the formula image scores, have a unimodal distribution about 0—which yields a very different decomposition. (In this case the ash decomposition is closer to the truth: the data were simulated under a model where all of the effects are non-zero, so the “true” decomposition would make everything cyan.)
Fig. 2.
Fig. 2.
Results of simulation studies (constant precision formula image). (a) Densities of non-zero effects, formula image, used in simulations. (b) Comparison of true and estimated values of formula image. When the UA holds all methods typically yield conservative (over-)estimates for formula image, with ash being least conservative, and hence most accurate. qvalue is sometimes anti-conservative when formula image. When the UA does not hold (“bimodal” scenario) the ash estimates are slightly anti-conservative. (c) Comparison of true and estimated formula image from ash (ash.n). Black line is formula image and red line is formula image. Estimates of formula image are conservative when UA holds, due to conservative estimates of formula image. (d) As in (c), but for formula image instead of formula image. Estimates of formula image are consistently less conservative than formula image when UA holds, and also less anti-conservative in bimodal scenario.
Fig. 3.
Fig. 3.
Comparison of estimated cdfs from ash and the NPMLE. Different ash methods perform similarly, so only ash.hu is shown for clarity. Each panel shows results for a single example data set, one for each scenario in Figure 2(a). The results illustrate how the UA made by ash regularizes the estimated cdfs compared with the NPMLE.
Fig. 4.
Fig. 4.
Simulations showing how, with existing methods, but not ash, poor-precision observations can contaminate signal from good-precision observations. (a) Density histograms of formula image values for good-precision, poor-precision, and combined observations. The combined data show less signal than the good-precision data, due to the contamination effect of the poor-precision measurements. (b) Results of different methods applied to good-precision observations only (formula image-axis) and combined data (formula image-axis). Each point shows the “significance” (formula image values from qvalue; formula image for locfdr; formula image for ash) of a good-precision observation under the two different analyses. For existing methods including the poor-precision observations reduces significance of good-precision observations, whereas for ash the poor-precision observations have little effect (because they have a very flat likelihood). (c) The relationship between formula image and formula image-value is different for good-precision (formula image) and low-precision (formula image) measurements: ash assigns the low-precision measurements a higher formula image, effectively downweighting them. (d) Trade-off between true positives (formula image) vs false positives (formula image) as the significance threshold (formula image or formula image value) is varied. By downweighting the low-precision observations ash re-orders the significance of observations, producing more true positives at a given number of false positives. It is important to note that this behaviour of ash depends on choice of formula image. See Section 3.2.1 for discussion.

Similar articles

Cited by

References

    1. Benjamini Y. and Hochberg Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological) 57, 289–300.
    1. Benjamini Y. and Yekutieli D. (2005). False discovery rate--adjusted multiple confidence intervals for selected parameters. Journal of the American Statistical Association 100, 71–81.
    1. Boyd S. and Vandenberghe L. (2004). Convex Optimization. Cambridge, UK: Cambridge University Press.
    1. Carvalho C. M. Polson N. G. and Scott J. G. (2010). The horseshoe estimator for sparse signals. Biometrika 97, asq017.
    1. Cordy C. B. and Thomas D. R. (1997). Deconvolution of a distribution function. Journal of the American Statistical Association 92, 1459–1465.

Publication types