Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb;84(1):149-173.
doi: 10.1111/rssb.12479. Epub 2021 Nov 16.

Transfer Learning for High-Dimensional Linear Regression: Prediction, Estimation and Minimax Optimality

Affiliations

Transfer Learning for High-Dimensional Linear Regression: Prediction, Estimation and Minimax Optimality

Sai Li et al. J R Stat Soc Series B Stat Methodol. 2022 Feb.

Abstract

This paper considers estimation and prediction of a high-dimensional linear regression in the setting of transfer learning where, in addition to observations from the target model, auxiliary samples from different but possibly related regression models are available. When the set of informative auxiliary studies is known, an estimator and a predictor are proposed and their optimality is established. The optimal rates of convergence for prediction and estimation are faster than the corresponding rates without using the auxiliary samples. This implies that knowledge from the informative auxiliary samples can be transferred to improve the learning performance of the target problem. When the set of informative auxiliary samples is unknown, we propose a data-driven procedure for transfer learning, called Trans-Lasso, and show its robustness to non-informative auxiliary samples and its efficiency in knowledge transfer. The proposed procedures are demonstrated in numerical studies and are applied to a dataset concerning the associations among gene expressions. It is shown that Trans-Lasso leads to improved performance in gene expression prediction in a target tissue by incorporating data from multiple different tissues as auxiliary samples.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Estimation errors of the Ad hoc 1-transfer, Agg-Lasso, Lasso, Oracle Trans-Lasso, and Trans-Lasso with identity covariance matrices of the predictors. The two rows correspond to configurations (i) and (ii), respectively. The y-axis corresponds to bβ22 for some estimator b.
Figure 2.
Figure 2.
Estimation errors of the Ad hoc 1-transfer, Agg-Lasso, Lasso, Oracle Trans-Lasso, and Trans-Lasso with homogeneous covariance matrices. The two rows correspond to configurations (i) and (ii), respectively. The y-axis corresponds to bβ22 for some estimator b.
Figure 3.
Figure 3.
Estimation errors of the Ad hoc 1-transfer, Agg-Lasso, Lasso, Oracle Trans-Lasso, and Trans-Lasso with heterogeneous covariance matrices. The two rows correspond to configurations (i) and (ii), respectively. The y-axis corresponds to bβ22 for some estimator b.
Figure 4.
Figure 4.
Prediction errors of Agg-Lasso, Naive Trans-Lasso, Trans-Lasso, and Ad hoc 1-transfer relative to the Lasso evaluated via 5-fold cross validation for gene JAM2 in multiple tissues.
Figure 5.
Figure 5.
Prediction errors of Ad hoc 1-transfer, Agg-Lasso, Naive Trans-Lasso*, and Trans-Lasso relative to the Lasso for the 25 genes on chromosome 21 and in Module 137, in multiple target tissues. The Naive Trans-Lasso has two outliers for the tissue Cerebellum not showing in the figure with values 1.61 and 1.95.

References

    1. Agarwal A, Negahban S, and Wainwright MJ (2012). Noisy matrix decomposition via convex relaxation: Optimal rates in high dimensions. The Annals of Statistics 40(2), 1171–1197.
    1. Ahmed T and Bajwa WU (2019). Exsis: Extended sure independence screening for ultrahigh-dimensional linear models. Signal Processing 159, 33–48.
    1. Ando RK and Zhang T (2005). A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research 6, 1817–1853.
    1. Bastani H (2020). Predicting with proxies: Transfer learning in high dimension. Management Science 67(5), 2657–3320.
    1. Bühlmann P and van de Geer S (2015). High-dimensional inference in misspecified linear models. Electronic Journal of Statistics 9(1), 1449–1473.

LinkOut - more resources