Model-Free Statistical Inference on High-Dimensional Data
- PMID: 40641907
- PMCID: PMC12240534
- DOI: 10.1080/01621459.2024.2310314
Abstract
This paper aims to develop an effective model-free inference procedure for high-dimensional data. We first reformulate the hypothesis testing problem via the sufficient dimension reduction framework. With the aid of this reformulation, we propose a new test statistic and show that its asymptotic distribution is a chi-square distribution whose degrees of freedom do not depend on the unknown population distribution. We further conduct a power analysis under local alternative hypotheses. In addition, we study how to control the false discovery rate of the proposed tests, which are correlated, to identify important predictors under a model-free framework. To this end, we propose a multiple testing procedure and establish its theoretical guarantees. Monte Carlo simulation studies are conducted to assess the performance of the proposed tests, and an empirical analysis of a real-world data set is used to illustrate the proposed methodology.
Keywords: False discovery rate control; Marginal coordinate hypothesis; Orthogonality; Sufficient dimension reduction.
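As a generic illustration of false discovery rate control when the individual tests are correlated, the sketch below implements the Benjamini-Yekutieli step-up rule, which is valid under arbitrary dependence among p-values. This is a standard textbook procedure and is not the multiple testing procedure proposed in the paper, whose details are not given in the abstract. The function name `benjamini_yekutieli` and the default level `alpha=0.1` are assumptions made for illustration only.

```python
def benjamini_yekutieli(p_values, alpha=0.1):
    """Return a list of booleans, True where the hypothesis is rejected.

    Generic Benjamini-Yekutieli step-up rule (illustrative only; not the
    procedure proposed in the paper).
    """
    m = len(p_values)
    if m == 0:
        return []
    # Harmonic-sum correction: makes the step-up rule valid under
    # arbitrary dependence among the p-values.
    c_m = sum(1.0 / i for i in range(1, m + 1))
    # Indices of the hypotheses, sorted by increasing p-value.
    order = sorted(range(m), key=lambda j: p_values[j])
    # Find the largest 1-based rank k with p_(k) <= alpha * k / (m * c_m).
    k_max = 0
    for rank, j in enumerate(order, start=1):
        if p_values[j] <= alpha * rank / (m * c_m):
            k_max = rank
    # Reject the k_max hypotheses with the smallest p-values.
    reject = [False] * m
    for j in order[:k_max]:
        reject[j] = True
    return reject


# Hypothetical p-values, one per predictor being tested.
decisions = benjamini_yekutieli([0.5, 0.0001, 0.9, 0.0004], alpha=0.1)
print(decisions)
```

In the setting described above, each p-value would come from one marginal test of a predictor, and the rejected hypotheses would be the predictors flagged as important. The harmonic-sum correction makes this rule conservative, which is the price of not assuming anything about the correlation structure.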
Conflict of interest statement
Disclosure Statement: The authors report there are no competing interests to declare.