Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
- PMID: 37869564
- PMCID: PMC10589725
- DOI: 10.1016/j.xhgg.2023.100216
Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
Abstract
Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations' effect size estimates in order to improve TWASs for multiethnic or underrepresented populations.
Keywords: PrediXcan; genetics; genomics; human genetics; multi-ancestry GWAS; multivarite adaptive shrinkage; transcriptome prediction; transcriptome-wide association studies.
© 2023 The Author(s).
Conflict of interest statement
H.E.W. is a member of the HGG Advances Editorial Board.
Figures
References
-
- Buniello A., MacArthur J.A.L., Cerezo M., Harris L.W., Hayhurst J., Malangone C., McMahon A., Morales J., Mountjoy E., Sollis E., et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47:D1005–D1012. doi: 10.1093/nar/gky1120. - DOI - PMC - PubMed
-
- Morales J., Welter D., Bowler E.H., Cerezo M., Harris L.W., McMahon A.C., Hall P., Junkins H.A., Milano A., Hastings E., et al. A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog. Genome Biol. 2018;19:21. doi: 10.1186/s13059-018-1396-2. - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
- R01 HL105756/HL/NHLBI NIH HHS/United States
- P30 DK063491/DK/NIDDK NIH HHS/United States
- R15 HG009569/HG/NHGRI NIH HHS/United States
- UL1 TR000040/TR/NCATS NIH HHS/United States
- N01 HC095166/HL/NHLBI NIH HHS/United States
- P30 DK020595/DK/NIDDK NIH HHS/United States
- N01 HC095160/HL/NHLBI NIH HHS/United States
- 75N92020D00002/HL/NHLBI NIH HHS/United States
- HHSN268201500003C/HL/NHLBI NIH HHS/United States
- N01 HC095161/HL/NHLBI NIH HHS/United States
- 75N92020D00005/HL/NHLBI NIH HHS/United States
- N01 HC095168/HL/NHLBI NIH HHS/United States
- R01 HL120393/HL/NHLBI NIH HHS/United States
- U54 HG003067/HG/NHGRI NIH HHS/United States
- R01 HL135009/HL/NHLBI NIH HHS/United States
- UL1 TR001079/TR/NCATS NIH HHS/United States
- N01 HC095169/HL/NHLBI NIH HHS/United States
- R01 DK101921/DK/NIDDK NIH HHS/United States
- 75N92020D00001/HL/NHLBI NIH HHS/United States
- R01 HL101250/HL/NHLBI NIH HHS/United States
- N01 HC095167/HL/NHLBI NIH HHS/United States
- N01 HC095159/HL/NHLBI NIH HHS/United States
- 75N92020D00003/HL/NHLBI NIH HHS/United States
- UL1 TR001420/TR/NCATS NIH HHS/United States
- 75N92020D00004/HL/NHLBI NIH HHS/United States
- N01 HC095163/HL/NHLBI NIH HHS/United States
- 75N92020D00007/HL/NHLBI NIH HHS/United States
- HHSN268201500003I/HL/NHLBI NIH HHS/United States
- RF1 AG054474/AG/NIA NIH HHS/United States
- R01 HL126477/HL/NHLBI NIH HHS/United States
- 75N92020D00006/HL/NHLBI NIH HHS/United States
- R01 HL117626/HL/NHLBI NIH HHS/United States
- N01 HC095162/HL/NHLBI NIH HHS/United States
- UL1 TR001881/TR/NCATS NIH HHS/United States
- N01 HC095165/HL/NHLBI NIH HHS/United States
- N01 HC095164/HL/NHLBI NIH HHS/United States
