Combining exchangeable P-values

Matteo Gasparin^#¹, Ruodu Wang^#², Aaditya Ramdas^#³

Affiliations

¹ Department of Statistical Sciences, University of Padova, Padova 35121, Italy.
² Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada.
³ Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA 15213.

^# Contributed equally.

PMID: 40085658
PMCID: PMC11929381
DOI: 10.1073/pnas.2410849122

Matteo Gasparin et al. Proc Natl Acad Sci U S A. 2025 Mar 18;122(11):e2410849122. doi: 10.1073/pnas.2410849122. Epub 2025 Mar 14.

Erratum in

Correction for Gasparin et al., Combining exchangeable P-values.
[No authors listed]. Proc Natl Acad Sci U S A. 2025 Apr 22;122(16):e2507343122. doi: 10.1073/pnas.2507343122. Epub 2025 Apr 18. PMID: 40249792. Free PMC article. No abstract available.

Abstract

The problem of combining P-values is an old and fundamental one, and the classic assumption of independence is often violated or unverifiable in applications. There are many well-known rules that can combine a set of arbitrarily dependent P-values (for the same hypothesis) into a single P-value. We show that essentially all these existing rules can be strictly improved when the P-values are exchangeable, or when external randomization is allowed (or both). For example, we derive randomized and/or exchangeable improvements of well-known rules like "twice the median" and "twice the average," as well as geometric and harmonic means. Exchangeable P-values are often produced one at a time (for example, under repeated tests involving data splitting), and our rules can combine them sequentially as they are produced, stopping when the combined P-values stabilize. Our work also improves rules for combining arbitrarily dependent P-values, since the latter become exchangeable if they are presented to the analyst in a random order. The main technical advance is to show that all existing combination rules can be obtained by calibrating the P-values to e-values (using an α-dependent calibrator), averaging those e-values, converting to a level-α test using Markov's inequality, and finally obtaining P-values by combining this family of tests; the improvements are delivered via recent randomized and exchangeable variants of Markov's inequality.
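
The calibrate, average, and threshold pipeline described in the abstract can be illustrated with a minimal Python sketch for the "twice the average" rule. This is not the authors' code: the function names are hypothetical, and the calibrator f(p) = max(2 - 2p/α, 0)/α is an assumption, chosen because it integrates to 1 on [0, 1] and, without the truncation at zero, thresholding its average at 1/α reproduces the classical rule "reject when twice the average is at most α". The randomized and exchangeable tests below use the uniformly randomized and exchangeable forms of Markov's inequality, respectively.

```python
import random
from itertools import accumulate


def twice_average(pvals):
    """Classical 'twice the average' rule, valid under arbitrary dependence."""
    return min(1.0, 2.0 * sum(pvals) / len(pvals))


def calibrate(p, alpha):
    """Assumed alpha-dependent calibrator: f(p) = max(2 - 2p/alpha, 0) / alpha.

    It is nonincreasing in p and integrates to 1 over [0, 1], so f(P) has
    expectation at most 1 (an e-value) when P is a valid P-value.
    """
    return max(2.0 - 2.0 * p / alpha, 0.0) / alpha


def running_means(values):
    """Means of the first k values, for k = 1, ..., len(values)."""
    return [s / k for k, s in enumerate(accumulate(values), start=1)]


def reject_markov(pvals, alpha):
    """Level-alpha test under arbitrary dependence (plain Markov inequality)."""
    e = [calibrate(p, alpha) for p in pvals]
    return sum(e) / len(e) >= 1.0 / alpha


def reject_randomized(pvals, alpha, rng):
    """Randomized variant: the threshold 1/alpha is scaled by an independent U ~ Unif(0, 1)."""
    u = rng.random()
    e = [calibrate(p, alpha) for p in pvals]
    return sum(e) / len(e) >= u / alpha


def reject_exchangeable(pvals, alpha):
    """Exchangeable variant: reject if ANY running mean of the e-values reaches 1/alpha."""
    e = [calibrate(p, alpha) for p in pvals]
    return max(running_means(e)) >= 1.0 / alpha


if __name__ == "__main__":
    alpha = 0.05
    p_seq = [0.01, 0.5, 0.5]  # illustrative values only, not data from the paper
    print(reject_markov(p_seq, alpha))        # uses only the overall average
    print(reject_exchangeable(p_seq, alpha))  # may also reject at an early prefix
    print(reject_randomized(p_seq, alpha, random.Random(0)))
```

Because the overall mean is one of the running means and the randomized threshold never exceeds 1/α, both variants reject whenever the plain Markov test does. A combined P-value would then be the smallest α at which the chosen test rejects, which this sketch does not compute.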

Keywords: dependent P-values; e-values; global null testing; multiple testing; randomization.

Conflict of interest statement

Competing interests statement: The authors declare no competing interest.

Figures

**Fig. 1.**
Combination of P-values using different ex-p-merging functions under high (*Left*) and low (*Right*) dependence. The performance of the different ex-p-merging functions is almost reversed in the two situations.

**Fig. 2.**
Combination of P-values using different ex-p-merging functions and different ordering based on the sample size. Non-ex-p-merging functions valid under arbitrary dependence are added for comparison. The ex-p-merging rules are more powerful if P-values are ordered in decreasing order with respect to the sample size.
