Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 18;122(11):e2410849122.
doi: 10.1073/pnas.2410849122. Epub 2025 Mar 14.

Combining exchangeable P-values

Affiliations

Combining exchangeable P-values

Matteo Gasparin et al. Proc Natl Acad Sci U S A. .

Erratum in

Abstract

The problem of combining P-values is an old and fundamental one, and the classic assumption of independence is often violated or unverifiable in many applications. There are many well-known rules that can combine a set of arbitrarily dependent P-values (for the same hypothesis) into a single P-value. We show that essentially all these existing rules can be strictly improved when the P-values are exchangeable, or when external randomization is allowed (or both). For example, we derive randomized and/or exchangeable improvements of well-known rules like "twice the median" and "twice the average," as well as geometric and harmonic means. Exchangeable P-values are often produced one at a time (for example, under repeated tests involving data splitting), and our rules can combine them sequentially as they are produced, stopping when the combined P-values stabilize. Our work also improves rules for combining arbitrarily dependent P-values, since the latter becomes exchangeable if they are presented to the analyst in a random order. The main technical advance is to show that all existing combination rules can be obtained by calibrating the P-values to e-values (using an [Formula: see text]-dependent calibrator), averaging those e-values, converting to a level-[Formula: see text] test using Markov's inequality, and finally obtaining P-values by combining this family of tests; the improvements are delivered via recent randomized and exchangeable variants of Markov's inequality.

Keywords: dependent P-values; e-values; global null testing; multiple testing; randomization.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Combination of P-values using different ex-p-merging functions under high (Left) and low (Right) dependence. The performance of the different ex-p-merging functions is almost reversed in the two situations.
Fig. 2.
Fig. 2.
Combination of P-values using different ex-p-merging functions and different ordering based on the sample size. Non-ex-p-merging functions valid under arbitrary dependence are added for comparison. The ex-p-merging rules are more powerful if P-values are ordered in decreasing order with respect to the sample size.

References

    1. Fisher R. A., Statistical Methods for Research Workers (Oliver and Boyd, 1934), vol. 5.
    1. Pearson K., On a new method of determining “goodness of fit”. Biometrika 26, 425–442 (1934).
    1. Simes R. J., An improved Bonferroni procedure for multiple tests of significance. Biometrika 73, 751–754 (1986).
    1. Sarkar S. K., Some probability inequalities for ordered MTP2 random variables: A proof of the simes conjecture. Ann. Stat. 26, 494–504 (1998).
    1. Benjamini Y., Yekutieli D., The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188 (2001).

LinkOut - more resources