Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan;78(1):193-210.
doi: 10.1111/rssb.12108. Epub 2015 Feb 15.

The lasso for high dimensional regression with a possible change point

Affiliations

The lasso for high dimensional regression with a possible change point

Sokbae Lee et al. J R Stat Soc Series B Stat Methodol. 2016 Jan.

Abstract

We consider a high dimensional regression model with a possible change point due to a covariate threshold and develop the lasso estimator of regression coefficients as well as the threshold parameter. Our lasso estimator not only selects covariates but also selects a model between linear and threshold regression models. Under a sparsity assumption, we derive non-asymptotic oracle inequalities for both the prediction risk and the l1-estimation loss for regression coefficients. Since the lasso estimator selects variables simultaneously, we show that oracle inequalities can be established without pretesting the existence of the threshold effect. Furthermore, we establish conditions under which the estimation error of the unknown threshold parameter can be bounded by a factor that is nearly n-1 even when the number of regressors can be much larger than the sample size n. We illustrate the usefulness of our proposed estimation method via Monte Carlo simulations and an application to real data.

Keywords: Lasso; Oracle inequalities; Sample splitting; Sparsity; Threshold models.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Mean prediction errors and mean M(α^) (♦, τ=0.3; □, τ=0.4; ◯, τ=0.5; △, c=0): (a) M=100; (b) M=200; (c) M=400
Figure 2
Figure 2
Mean l1‐errors for α and τ (♦, τ=0.3; □, τ=0.4; ◯, τ=0.5; △, c=0): (a) M=100; (b) M=200; (c) M=400

References

    1. Barro, R . and Lee, J . (1994) Data set for a panel of 139 countries. Report National Bureau of Economic Research, Cambridge. (Available from http://admin.nber.org/pub/barro.lee/
    1. Barro, R . and Sala‐i‐Martin, X . (1995) Economic Growth. New York: McGraw‐Hill.
    1. Belloni, A. and Chernozhukov, V. (2011a) l1‐penalized quantile regression in high‐dimensional sparse models. Ann. Statist., 39, 82–130.
    1. Belloni, A . and Chernozhukov, V . (2011b) High dimensional sparse econometric models: an introduction In Inverse Problems and High‐dimensional Estimation (eds Alquier P., Gautier E. and Stoltz G.), pp. 121–156. Berlin: Springer.
    1. Bickel, P. J. , Ritov, Y. and Tsybakov, A. B. (2009) Simultaneous analysis of Lasso and Dantzig selector. Ann. Statist., 37, 1705–1732.

LinkOut - more resources