Multi-population genomic prediction using a multi-task Bayesian learning model
- PMID: 24884927
- PMCID: PMC4024655
- DOI: 10.1186/1471-2156-15-53
Multi-population genomic prediction using a multi-task Bayesian learning model
Abstract
Background: Genomic prediction in multiple populations can be viewed as a multi-task learning problem where tasks are to derive prediction equations for each population and multi-task learning property can be improved by sharing information across populations. The goal of this study was to develop a multi-task Bayesian learning model for multi-population genomic prediction with a strategy to effectively share information across populations. Simulation studies and real data from Holstein and Ayrshire dairy breeds with phenotypes on five milk production traits were used to evaluate the proposed multi-task Bayesian learning model and compare with a single-task model and a simple data pooling method.
Results: A multi-task Bayesian learning model was proposed for multi-population genomic prediction. Information was shared across populations through a common set of latent indicator variables while SNP effects were allowed to vary in different populations. Both simulation studies and real data analysis showed the effectiveness of the multi-task model in improving genomic prediction accuracy for the smaller Ayshire breed. Simulation studies suggested that the multi-task model was most effective when the number of QTL was small (n = 20), with an increase of accuracy by up to 0.09 when QTL effects were lowly correlated between two populations (ρ = 0.2), and up to 0.16 when QTL effects were highly correlated (ρ = 0.8). When QTL genotypes were included for training and validation, the improvements were 0.16 and 0.22, respectively, for scenarios of the low and high correlation of QTL effects between two populations. When the number of QTL was large (n = 200), improvement was small with a maximum of 0.02 when QTL genotypes were not included for genomic prediction. Reduction in accuracy was observed for the simple pooling method when the number of QTL was small and correlation of QTL effects between the two populations was low. For the real data, the multi-task model achieved an increase of accuracy between 0 and 0.07 in the Ayrshire validation set when 28,206 SNPs were used, while the simple data pooling method resulted in a reduction of accuracy for all traits except for protein percentage. When 246,668 SNPs were used, the accuracy achieved from the multi-task model increased by 0 to 0.03, while using the pooling method resulted in a reduction of accuracy by 0.01 to 0.09. In the Holstein population, the three methods had similar performance.
Conclusions: Results in this study suggest that the proposed multi-task Bayesian learning model for multi-population genomic prediction is effective and has the potential to improve the accuracy of genomic prediction.
Similar articles
-
Multibreed genomic prediction using multitrait genomic residual maximum likelihood and multitask Bayesian variable selection.J Dairy Sci. 2018 May;101(5):4279-4294. doi: 10.3168/jds.2017-13366. Epub 2018 Mar 15. J Dairy Sci. 2018. PMID: 29550121
-
A multi-trait Bayesian method for mapping QTL and genomic prediction.Genet Sel Evol. 2018 Mar 24;50(1):10. doi: 10.1186/s12711-018-0377-y. Genet Sel Evol. 2018. PMID: 29571285 Free PMC article.
-
Using selection index theory to estimate consistency of multi-locus linkage disequilibrium across populations.BMC Genet. 2015 Jul 19;16:87. doi: 10.1186/s12863-015-0252-6. BMC Genet. 2015. PMID: 26187501 Free PMC article.
-
Application of Bayesian genomic prediction methods to genome-wide association analyses.Genet Sel Evol. 2022 May 13;54(1):31. doi: 10.1186/s12711-022-00724-8. Genet Sel Evol. 2022. PMID: 35562659 Free PMC article. Review.
-
Bayesian Approaches in Exploring Gene-environment and Gene-gene Interactions: A Comprehensive Review.Cancer Genomics Proteomics. 2023 Dec;20(6suppl):669-678. doi: 10.21873/cgp.20414. Cancer Genomics Proteomics. 2023. PMID: 38035701 Free PMC article. Review.
Cited by
-
Improving Genomic Predictions in Multi-Breed Cattle Populations: A Comparative Analysis of BayesR and GBLUP Models.Genes (Basel). 2024 Feb 18;15(2):253. doi: 10.3390/genes15020253. Genes (Basel). 2024. PMID: 38397242 Free PMC article.
-
Genomic prediction for numerically small breeds, using models with pre-selected and differentially weighted markers.Genet Sel Evol. 2018 Oct 10;50(1):49. doi: 10.1186/s12711-018-0419-5. Genet Sel Evol. 2018. PMID: 30314431 Free PMC article.
-
Assessing the Potential for Genome-Assisted Breeding in Red Perilla Using Quantitative Trait Locus Analysis and Genomic Prediction.Genes (Basel). 2023 Nov 27;14(12):2137. doi: 10.3390/genes14122137. Genes (Basel). 2023. PMID: 38136959 Free PMC article.
-
Genetic Strategies for Enhancing Rooster Fertility in Tropical and Humid Climates: Challenges and Opportunities.Animals (Basel). 2025 Apr 10;15(8):1096. doi: 10.3390/ani15081096. Animals (Basel). 2025. PMID: 40281930 Free PMC article. Review.
-
Across population genomic prediction scenarios in which Bayesian variable selection outperforms GBLUP.BMC Genet. 2015 Dec 23;16:146. doi: 10.1186/s12863-015-0305-x. BMC Genet. 2015. PMID: 26698836 Free PMC article.
References
-
- Hayes BJ, Bowman PJ, Chamberlain AJ, Goddard ME. Invited review: genomic selection in dairy cattle: progress and challenges (vol 92, pg 433, 2009) J Dairy Sci. 2009;92(3):1313–1313. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources