A Selective Overview of Variable Selection in High Dimensional Feature Space
- PMID: 21572976
- PMCID: PMC3092303
A Selective Overview of Variable Selection in High Dimensional Feature Space
Abstract
High dimensional statistical problems arise from diverse fields of scientific research and technological development. Variable selection plays a pivotal role in contemporary statistical learning and scientific discoveries. The traditional idea of best subset selection methods, which can be regarded as a specific form of penalized likelihood, is computationally too expensive for many modern statistical applications. Other forms of penalized likelihood methods have been successfully developed over the last decade to cope with high dimensionality. They have been widely applied for simultaneously selecting important variables and estimating their effects in high dimensional statistical inference. In this article, we present a brief account of the recent developments of theory, methods, and implementations for high dimensional variable selection. What limits of the dimensionality such methods can handle, what the role of penalty functions is, and what the statistical properties are rapidly drive the advances of the field. The properties of non-concave penalized likelihood and its roles in high dimensional statistical modeling are emphasized. We also review some recent advances in ultra-high dimensional variable selection, with emphasis on independence screening and two-scale methods.
Figures
References
-
- Abramovich F, Benjamini Y, Donoho D, Johnstone I. Adapting to unknown sparsity by controlling the false discovery rate. Ann. Statist. 2006;34:584–653.
-
- Akaike H. Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csaki F, editors. Second International Symposium on Information Theory. Budapest: Akademiai Kiado; 1973. pp. 267–281.
-
- Akaike H. A new look at the statistical model identification. IEEE Trans. Auto. Control. 1974;19:716–723.
-
- Antoniadis A. Smoothing noisy data with tapered coiflets series. Scand. J. Statist. 1996;23:313–330.
-
- Antoniadis A, Fan J. Regularization of wavelets approximations (with discussion) J. Amer. Statist. Assoc. 2001;96:939–967.
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources