Multi-Task Learning with Summary Statistics
- PMID: 39351341
- PMCID: PMC11440483
Abstract
Multi-task learning has emerged as a powerful machine learning paradigm for integrating data from multiple sources, leveraging similarities between tasks to improve overall model performance. However, the application of multi-task learning to real-world settings is hindered by data-sharing constraints, especially in healthcare settings. To address this challenge, we propose a flexible multi-task learning framework utilizing summary statistics from various sources. Additionally, we present an adaptive parameter selection approach based on a variant of Lepski's method, allowing for data-driven tuning parameter selection when only summary statistics are available. Our systematic non-asymptotic analysis characterizes the performance of the proposed methods under various regimes of the sample complexity and overlap. We demonstrate our theoretical findings and the performance of the method through extensive simulations. This work offers a more flexible tool for training related models across various domains, with practical implications in genetic risk prediction and many other fields.
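As a rough illustration of tuning from summary statistics alone, the sketch below fits a lasso using only a Gram matrix S = X'X/n and a cross-product vector r = X'y/n, then picks the penalty with a generic Lepski-type stopping rule. It is not the procedure proposed in the paper: the function names, the sup-norm comparison, the constant C = 0.75, and the single-task lasso objective are all assumptions made for the example.

```python
import numpy as np


def soft_threshold(z, lam):
    """Scalar soft-thresholding operator."""
    return np.sign(z) * max(abs(z) - lam, 0.0)


def lasso_from_summary(S, r, lam, n_iter=200, tol=1e-10):
    """Coordinate descent for 0.5 * b'Sb - r'b + lam * ||b||_1.

    Only the summary statistics S (p x p) and r (p,) are needed;
    no individual-level data enters the fit.
    """
    p = len(r)
    b = np.zeros(p)
    for _ in range(n_iter):
        b_old = b.copy()
        for j in range(p):
            # Correlation of feature j with the partial residual,
            # computed from S and r only (coordinate j excluded).
            rho = r[j] - S[j] @ b + S[j, j] * b[j]
            b[j] = soft_threshold(rho, lam) / S[j, j]
        if np.max(np.abs(b - b_old)) < tol:
            break
    return b


def lepski_select(S, r, lambdas, C=0.75):
    """Lepski-type selection (illustrative; C is an assumed constant).

    Walk the grid from the largest penalty downward and keep shrinking
    it while the new estimate stays within C * (lam_i + lam_k), in sup
    norm, of the estimate at every larger penalty lam_k. Stop at the
    first violation and return the last accepted penalty.
    """
    lams = np.sort(np.asarray(lambdas, dtype=float))[::-1]  # descending
    fits = [lasso_from_summary(S, r, lam) for lam in lams]
    chosen = 0
    for i in range(1, len(lams)):
        ok = all(
            np.max(np.abs(fits[i] - fits[k])) <= C * (lams[i] + lams[k])
            for k in range(i)
        )
        if not ok:
            break
        chosen = i
    return lams[chosen], fits[chosen]


if __name__ == "__main__":
    # Hypothetical toy data, used only to form the summary statistics.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 5))
    y = X @ np.array([1.0, -0.5, 0.0, 0.0, 0.0]) + 0.1 * rng.normal(size=200)
    S, r = X.T @ X / 200, X.T @ y / 200
    lam, beta = lepski_select(S, r, [0.02, 0.05, 0.1, 0.2, 0.4])
    print(lam, beta)
```

The rule never touches a held-out sample, which is the reason a Lepski-type comparison is attractive when cross-validation on individual-level data is unavailable.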