Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Aug;57(8):5467-5484.
doi: 10.1109/TIT.2011.2158486.

Non-Concave Penalized Likelihood with NP-Dimensionality

Affiliations

Non-Concave Penalized Likelihood with NP-Dimensionality

Jianqing Fan et al. IEEE Trans Inf Theory. 2011 Aug.

Abstract

Penalized likelihood methods are fundamental to ultra-high dimensional variable selection. How high dimensionality such methods can handle remains largely unknown. In this paper, we show that in the context of generalized linear models, such methods possess model selection consistency with oracle properties even for dimensionality of Non-Polynomial (NP) order of sample size, for a class of penalized likelihood approaches using folded-concave penalty functions, which were introduced to ameliorate the bias problems of convex penalty functions. This fills a long-standing gap in the literature where the dimensionality is allowed to grow slowly with the sample size. Our results are also applicable to penalized likelihood with the L(1)-penalty, which is a convex function at the boundary of the class of folded-concave penalty functions under consideration. The coordinate optimization is implemented for finding the solution paths, whose performance is evaluated by a few simulation examples and the real data analysis.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Boxplots of PE, L2 loss, and #S over 100 simulations for all methods in logistic regression, where p = 25. The x-axis represents different methods. Top panel is for BIC and bottom panel is for SIC.
Figure 2
Figure 2
Boxplots of PE, L2 loss, and #S over 100 simulations for all methods in logistic regression, where p = 500 and 1000. The x-axis represents different methods. Top panel is for p = 500 and bottom panel is for p = 1000.

Similar articles

Cited by

References

    1. Antoniadis A, Fan J. Regularization of wavelets approximations (with discussion) J. Amer. Statist. Assoc. 2001;96:939–967.
    1. Bennett G. Probability inequalities for the sum of independent random variables. J. Amer. Statist. Assoc. 1962;57:33–45.
    1. Bickel PJ, Ritov Y, Tsybakov A. Simultaneous analysis of Lasso and Dantzig selector. Ann. Statist. 2009;37:1705–1732.
    1. Breiman L. Better subset regression using the non-negative garrote. Technometrics. 1995;37:373–384.
    1. Bunea F, Tsybakov A, Wegkamp MH. Sparsity oracle inequalities for the Lasso. Elec. Jour. Statist. 2007;1:169–194.

LinkOut - more resources