Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024;33(2):463-476.
doi: 10.1080/10618600.2023.2270720. Epub 2023 Nov 27.

Accurate and Ultra-Efficient p-Value Calculation for Higher Criticism Tests

Affiliations

Accurate and Ultra-Efficient p-Value Calculation for Higher Criticism Tests

Wenjia Wang et al. J Comput Graph Stat. 2024.

Abstract

In modern data science, higher criticism (HC) method is effective for detecting rare and weak signals. The computation, however, has long been an issue when the number of p-values combined ( K ) and/or the number of repeated HC tests ( N ) are large. Some computing methods have been developed, but they all have significant shortcomings, especially when a stringent significance level is required. In this paper, we propose an accurate and highly efficient computing strategy for four variations of HC. Specifically, we propose an unbiased cross-entropy-based importance sampling method ( IS C E ) to benchmark all existing computing methods, and develop a modified SetTest method (MST) that resolves numerical issues of the existing SetTest approach. We further develop an ultra-fast approach (UFI) combining pre-calculated statistical tables and cubic spline interpolation. Finally, following extensive simulations, we provide a computing strategy integrating MST, UFI and other existing methods with R package "HCp" for virtually any K and small p-values ( 10 - 20 ). The method is applied to a COVID-19 disease surveillance example for spatio-temporal outbreak detection from case numbers of 804 days in 3,342 counties in the United States. Results confirm viability of the computing strategy for large-scale inferences. Supplementary materials for this article are available online.

Keywords: analytical approximation; asymptotic rare and weak model; higher criticism; importance sampling; p-value computation.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
(A) Left panel: the MST is recommended for THCRT of arbitrary truncation with K1000, while a hybrid of MST and Li-Siegmund is recommended for targeted p-value 103 and p-value <103 respectively when K is above 1000. Right panel: a hybrid strategy combining the MST for K100, naive Monte Carlo and the Li-Siegmund for targeted p-value 103 and p-value <103 respectively is recommended for THCRTM of arbitrary truncation under K>100. (B) In our “HCp” R package, statistical tables are pre-calculated when K2000 and target p-value >1014 for the specific HC tests: (i) THCRF and (ii) THCRT with k0=1,k1=K2 (left panel), as well as (iii) THCRM and (iv) THCRTM With k0=1,k1=K2 of target p-value >1012 (right panel), where the UFI method is applicable and prefered.
Fig. 2
Fig. 2
(A) illustrates the accuracy of the Barnett-Lin, SetTest, and MST method in computing small p-values of THCRF with K=500 benchmarked by ISCE method (the methods started with * in the legend is our proposed methods). The x-axis is the log value of THCRF statistic, and the y-axis is the corresponding log10 p-value. Similarly, (B) evaluates the accuracy of the SetTest and MST method benchmarked by the ISCE method for case (ii) THCRT of K=500. (C) and (D) shows the accuracy of the MST benchmarked by the ISCE method for THCRM and case (iv) THCRTM respectively under K=100. The sampling size for ISCE is M=104.
Fig. 3
Fig. 3
(A) and (C) illustrate the performance of the Li-Siegmund, MST, and naive Monte Carlo method with sampling size M=106 in estimating large p-values for THCRM and case (iv) THCRTM of K=100 respectively. The error bar of the naive Monte Carlo estimate represents the square root of mean square error at the original scale over 50 replications. The y-axis is the p-value at original scale. (B) and (D) illustrate the performance of Li-Siegmund, MST, and ISCE method with sampling size M=106 in estimating small p-values for THCRM and case (iv) THCRTM of K=100 respectively. The error bar of the ISCE estimate is the square root of mean square error at log10 scale over 50 replications and the y-axis represents the log10 p-value.
Fig. 4
Fig. 4
(A) compares the time (in second) consumed of computing the p-values of 100 different THCRF tests by the Barnett-Lin, SetTest, Li-Siegmund, ISCE, MST and UFI method with respect to varying K. The formula of the fitted polynomial curve for each method is labeled in the figure. (B) removes the Barnett-Lin method and compares the other five methods. (C) illustrates the consistent p-value estimate by the UFI method with the analytic truth by the MST method for THCRF of various K. The black straight line y=x is the reference representing that the UFI estimates are exactly the same as the analytic truth.
Fig. 5
Fig. 5
The maps show the significance level (log10 p-value) of the COVID-19 outbreak testing by the UFI method for each of the 3,342 counties in the US at three time periods: 03/22/2021-04/20/2021 (top); 11/17/2021-12/16/2021 (middle); 02/15/2022-03/16/2022 (bottom). There are more significant counties outbreaking COVID-19 at the end of 2021 and during the period from mid-February to mid-March in 2022.

Similar articles

References

    1. Barnett I, Mukherjee R, and Lin X. The generalized higher criticism for testing SNP-Set effects in genetic association studies. Journal of the American Statistical Association, 112, 06 2016. doi: 10.1080/01621459.2016.1192039. - DOI - PMC - PubMed
    1. Barnett IJ and Lin X. Analytical p-value calculation for the higher criticism test in finite-d problems. Biometrika, 101(4):964–970, 08 2014. ISSN 0006-3444. doi: 10.1093/biomet/asu033. URL 10.1093/biomet/asu033. - DOI - DOI - PMC - PubMed
    1. Berk R and Jones D. Goodness-of-fit test statistics that dominate the Kolmogorov statistics. Probability Theory and Related Fields, 47(1):47–59, Jan. 1979. ISSN 0178-8051. doi: 10.1007/BF00533250. - DOI
    1. Cai TT and Wu Y. Optimal detection of sparse mixtures against a given null distribution. IEEE Transactions on Information Theory, 60(4):2217–2232, 2014. doi: 10.1109/TIT.2014.2304295. - DOI
    1. Cayón L, Jin J, and Treaster A. Higher Criticism statistic: detecting and identifying non-Gaussianity in the WMAP first-year data. Monthly Notices of the Royal Astronomical Society, 362(3):826–832, 09 2005. ISSN 0035-8711. doi: 10.1111/j.1365-2966.2005.09277.x. URL 10.1111/j.1365-2966.2005.09277.x. - DOI - DOI

LinkOut - more resources