Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025;120(549):186-197.
doi: 10.1080/01621459.2024.2310314. Epub 2024 Mar 8.

Model-Free Statistical Inference on High-Dimensional Data

Affiliations

Model-Free Statistical Inference on High-Dimensional Data

Xu Guo et al. J Am Stat Assoc. 2025.

Abstract

This paper aims to develop an effective model-free inference procedure for high-dimensional data. We first reformulate the hypothesis testing problem via sufficient dimension reduction framework. With the aid of new reformulation, we propose a new test statistic and show that its asymptotic distribution is χ 2 distribution whose degree of freedom does not depend on the unknown population distribution. We further conduct power analysis under local alternative hypotheses. In addition, we study how to control the false discovery rate of the proposed χ 2 tests, which are correlated, to identify important predictors under a model-free framework. To this end, we propose a multiple testing procedure and establish its theoretical guarantees. Monte Carlo simulation studies are conducted to assess the performance of the proposed tests and an empirical analysis of a real-world data set is used to illustrate the proposed methodology.

Keywords: False discovery rate control; Marginal coordinate hypothesis; Orthogonality; Sufficient dimension reduction.

PubMed Disclaimer

Conflict of interest statement

Disclosure Statement The authors report there are no competing interests to declare.

Figures

Fig. 1
Fig. 1
The plot of empirical size and power with respect to h when the Lasso penalty is used. “t1” and “t2” represent randomly selected inactive variables.

Similar articles

References

    1. Azzalini A and Capitanio A (1999). Statistical applications of the multivariate skew normal distribution. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 61(3):579–602.
    1. Barber RF and Candès EJ (2015). Controlling the false discovery rate via knockoffs. The Annals of Statistics, 43(5):2055–2085.
    1. Barber RF, Candes EJ, and Samworth RJ (2020). Robust inference with knockoffs. Annals of Statistics, 48(3):1409–1431.
    1. Belloni A, Chernozhukov V, Chetverikov D, and Wei Y (2018). Uniformly valid post-regularization confidence regions for many functional parameters in z-estimation framework. Annals of Statistics, 46(6B):3643–3675. - PMC - PubMed
    1. Belloni A, Chernozhukov V, and Kato K (2015). Uniform post-selection inference for least absolute deviation regression and other z-estimation problems. Biometrika, 102(1):77–94.

LinkOut - more resources