Biostatistics. 2017 Apr 1;18(2):275-294.

doi: 10.1093/biostatistics/kxw041.

False discovery rates: a new deal

Matthew Stephens

PMID: 27756721
PMCID: PMC5379932
DOI: 10.1093/biostatistics/kxw041

Matthew Stephens. Biostatistics. 2017.

Abstract

We introduce a new Empirical Bayes approach for large-scale hypothesis testing, including estimating false discovery rates (FDRs) and effect sizes. This approach has two key differences from existing approaches to FDR analysis. First, it assumes that the distribution of the actual (unobserved) effects is unimodal, with a mode at 0. This "unimodal assumption" (UA), although natural in many contexts, is not usually incorporated into standard FDR analysis, and we demonstrate how incorporating it brings many benefits. Specifically, the UA facilitates efficient and robust computation (estimating the unimodal distribution involves solving a simple convex optimization problem) and enables more accurate inferences provided that it holds. Second, the method takes as its input two numbers for each test (an effect size estimate and corresponding standard error), rather than the one number usually used ($p$ value or $z$ score). When available, using two numbers instead of one helps account for variation in measurement precision across tests. It also facilitates estimation of effects, and unlike standard FDR methods, our approach provides interval estimates (credible regions) for each effect in addition to measures of significance. To provide a bridge between interval estimates and significance measures, we introduce the term "local false sign rate" to refer to the probability of getting the sign of an effect wrong and argue that it is a better measure of significance than the local FDR because it is both more generally applicable and can be more robustly estimated. Our methods are implemented in an R package ashr available from http://github.com/stephens999/ashr.
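To make the "simple convex optimization problem" concrete, the following is a minimal sketch and not the ashr implementation. It assumes the unimodal prior is approximated by a mixture of zero-centred normals on a fixed grid of standard deviations (with 0 as a point mass at zero), so that only the mixture weights need estimating. The log-likelihood is concave in those weights, and a plain EM iteration is one way to maximize it. The function names, the grid, and the simulated data are all hypothetical choices made for illustration; the package itself may differ in grid construction, penalty terms, and optimizer.

```python
import math
import random


def normal_pdf(x, sd):
    """Density of N(0, sd^2) evaluated at x (sd must be positive)."""
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def component_likelihoods(betahat, se, sds):
    """lik[j][k] = density of betahat[j] under component k.

    Assumed model: beta_j ~ N(0, sds[k]^2) and betahat_j | beta_j ~ N(beta_j, se_j^2),
    so marginally betahat_j ~ N(0, sds[k]^2 + se_j^2).
    """
    return [[normal_pdf(b, math.sqrt(sd * sd + s * s)) for sd in sds]
            for b, s in zip(betahat, se)]


def log_likelihood(weights, lik):
    """Log-likelihood of the mixture weights; concave in the weights."""
    return sum(math.log(sum(w * l for w, l in zip(weights, row))) for row in lik)


def fit_mixture_weights(lik, n_iter=200):
    """Plain EM updates for the mixture weights, starting from uniform weights."""
    n, k = len(lik), len(lik[0])
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        totals = [0.0] * k
        for row in lik:
            denom = sum(w * l for w, l in zip(weights, row))
            for i in range(k):
                # Posterior probability that observation came from component i.
                totals[i] += weights[i] * row[i] / denom
        weights = [t / n for t in totals]
    return weights


# Hypothetical simulated data: 70% exact nulls, the rest drawn from N(0, 2^2).
rng = random.Random(0)
true_beta = [0.0 if rng.random() < 0.7 else rng.gauss(0.0, 2.0) for _ in range(500)]
se = [1.0] * len(true_beta)
betahat = [b + rng.gauss(0.0, s) for b, s in zip(true_beta, se)]

sds = [0.0, 0.5, 1.0, 2.0, 4.0]  # sds[0] == 0 is the point mass at zero
lik = component_likelihoods(betahat, se, sds)
weights = fit_mixture_weights(lik)
print("estimated weights:", [round(w, 3) for w in weights])
```

Each EM step cannot decrease the log-likelihood, so comparing `log_likelihood(weights, lik)` against its value at the uniform starting weights is a quick sanity check on the fit.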

Keywords: Empirical Bayes; False discovery rates; Multiple testing; Shrinkage; Unimodal.

Figures

**Fig. 1.**
Illustration that the UA in ash can produce very different results from existing methods. The figure shows, for a single simulated dataset, the way different methods decompose $p$ values (left) and $z$ scores (right) into a null component (dark blue) and an alternative component (cyan). In the $z$ score space the alternative distribution is placed on the bottom to highlight the differences in its shape among methods. The three existing methods (qvalue, locfdr, mixfdr) all produce a “hole” in the alternative $z$ score distribution around 0. In contrast, ash makes the UA (that the effect sizes, and thus the $z$ scores, have a unimodal distribution about 0), which yields a very different decomposition. (In this case the ash decomposition is closer to the truth: the data were simulated under a model where all of the effects are non-zero, so the “true” decomposition would make everything cyan.)

**Fig. 2.**
Results of simulation studies (constant precision). (a) Densities of non-zero effects used in simulations. (b) Comparison of true and estimated values of $\pi_0$. When the UA holds, all methods typically yield conservative (over-)estimates for $\pi_0$, with ash being least conservative, and hence most accurate. qvalue is sometimes anti-conservative when $\pi_0 = 1$. When the UA does not hold (“bimodal” scenario) the ash estimates are slightly anti-conservative. (c) Comparison of true and estimated lfdr from ash (ash.n). Black line is $y = x$ and red line is $y = 2x$. Estimates of lfdr are conservative when the UA holds, due to conservative estimates of $\pi_0$. (d) As in (c), but for lfsr instead of lfdr. Estimates of lfsr are consistently less conservative than lfdr when the UA holds, and also less anti-conservative in the bimodal scenario.

**Fig. 3.**
Comparison of estimated cdfs from ash and the NPMLE. Different ash methods perform similarly, so only ash.hu is shown for clarity. Each panel shows results for a single example data set, one for each scenario in Figure 2(a). The results illustrate how the UA made by ash regularizes the estimated cdfs compared with the NPMLE.

**Fig. 4.**
Simulations showing how, with existing methods, but not ash, poor-precision observations can contaminate signal from good-precision observations. (a) Density histograms of $p$ values for good-precision, poor-precision, and combined observations. The combined data show less signal than the good-precision data, due to the contamination effect of the poor-precision measurements. (b) Results of different methods applied to good-precision observations only ($x$-axis) and combined data ($y$-axis). Each point shows the “significance” ($q$ values from qvalue; lfdr for locfdr; lfsr for ash) of a good-precision observation under the two different analyses. For existing methods, including the poor-precision observations reduces significance of good-precision observations, whereas for ash the poor-precision observations have little effect (because they have a very flat likelihood). (c) The relationship between lfsr and $p$ value is different for good-precision and low-precision measurements: ash assigns the low-precision measurements a higher lfsr, effectively downweighting them. (d) Trade-off between true positives and false positives as the significance threshold (lfsr or $q$ value) is varied. By downweighting the low-precision observations, ash re-orders the significance of observations, producing more true positives at a given number of false positives. It is important to note that this behaviour of ash depends on the choice of $\alpha$. See Section 3.2.1 for discussion.
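The downweighting described in panel (c) can be illustrated with a minimal sketch, which is not the ashr implementation. It assumes a fixed two-component prior (a point mass at zero plus one zero-centred normal) that does not depend on the standard error, and computes the local FDR and local false sign rate for two observations with the same $z$ score but different precision. The function name `lfdr_lfsr`, the prior weights, and the example values are hypothetical; in practice the prior would be estimated from the data rather than fixed.

```python
import math


def normal_pdf(x, sd):
    """Density of N(0, sd^2) evaluated at x (sd must be positive)."""
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def lfdr_lfsr(betahat, se, weights, sds):
    """Return (lfdr, lfsr) for a single observation.

    Assumed prior: beta ~ sum_k weights[k] * N(0, sds[k]^2), where sd == 0
    denotes a point mass at zero. Likelihood: betahat | beta ~ N(beta, se^2).
    """
    # Weighted marginal likelihood of betahat under each prior component.
    lik = [w * normal_pdf(betahat, math.sqrt(sd * sd + se * se))
           for w, sd in zip(weights, sds)]
    total = sum(lik)
    post_weights = [value / total for value in lik]

    p_zero = 0.0  # posterior probability that beta == 0
    p_neg = 0.0   # posterior probability that beta < 0
    for pw, sd in zip(post_weights, sds):
        if sd == 0.0:
            p_zero += pw
            continue
        # Conjugate normal update within this component.
        shrink = sd * sd / (sd * sd + se * se)
        post_mean = shrink * betahat
        post_sd = math.sqrt(shrink) * se
        p_neg += pw * normal_cdf(-post_mean / post_sd)
    p_pos = 1.0 - p_zero - p_neg

    lfdr = p_zero
    # lfsr = min(P(beta >= 0), P(beta <= 0)) = p_zero + min(p_pos, p_neg).
    lfsr = p_zero + min(p_pos, p_neg)
    return lfdr, lfsr


weights = [0.5, 0.5]  # hypothetical prior weights: null, non-null
sds = [0.0, 2.0]      # point mass at zero, and N(0, 2^2)

good = lfdr_lfsr(3.0, 1.0, weights, sds)    # z = 3, good precision
poor = lfdr_lfsr(30.0, 10.0, weights, sds)  # z = 3, poor precision
print("good precision: lfdr=%.3f lfsr=%.3f" % good)
print("poor precision: lfdr=%.3f lfsr=%.3f" % poor)
```

Both observations have the same $z$ score and hence the same $p$ value, yet under this prior the poor-precision observation receives the larger lfsr because its likelihood is much flatter relative to the prior. By construction the lfsr is never smaller than the lfdr.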

Grants and funding

R01 HG002585/HG/NHGRI NIH HHS/United States
