LASSO with cross-validation for genomic selection
- PMID: 20122298
- DOI: 10.1017/S0016672309990334
LASSO with cross-validation for genomic selection
Abstract
We used a least absolute shrinkage and selection operator (LASSO) approach to estimate marker effects for genomic selection. The least angle regression (LARS) algorithm and cross-validation were used to define the best subset of markers to include in the model. The LASSO-LARS approach was tested on two data sets: a simulated data set with 5865 individuals and 6000 Single Nucleotide Polymorphisms (SNPs); and a mouse data set with 1885 individuals genotyped for 10 656 SNPs and phenotyped for a number of quantitative traits. In the simulated data, three approaches were used to split the reference population into training and validation subsets for cross-validation: random splitting across the whole population; random sampling of validation set from the last generation only, either within or across families. The highest accuracy was obtained by random splitting across the whole population. The accuracy of genomic estimated breeding values (GEBVs) in the candidate population obtained by LASSO-LARS was 0.89 with 156 explanatory SNPs. This value was higher than those obtained by Best Linear Unbiased Prediction (BLUP) and a Bayesian method (BayesA), which were 0.75 and 0.84, respectively. In the mouse data, 1600 individuals were randomly allocated to the reference population. The GEBVs for the remaining 285 individuals estimated by LASSO-LARS were more accurate than those obtained by BLUP and BayesA for weight at six weeks and slightly lower for growth rate and body length. It was concluded that LASSO-LARS approach is a good alternative method to estimate marker effects for genomic selection, particularly when the cost of genotyping can be reduced by using a limited subset of markers.
Similar articles
-
Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions.BMC Proc. 2012 May 21;6 Suppl 2(Suppl 2):S10. doi: 10.1186/1753-6561-6-S2-S10. Epub 2012 May 21. BMC Proc. 2012. PMID: 22640436 Free PMC article.
-
L2-Boosting algorithm applied to high-dimensional problems in genomic selection.Genet Res (Camb). 2010 Jun;92(3):227-37. doi: 10.1017/S0016672310000261. Genet Res (Camb). 2010. PMID: 20667166
-
Application of Bayesian least absolute shrinkage and selection operator (LASSO) and BayesCπ methods for genomic selection in French Holstein and Montbéliarde breeds.J Dairy Sci. 2013 Jan;96(1):575-91. doi: 10.3168/jds.2011-5225. Epub 2012 Nov 3. J Dairy Sci. 2013. PMID: 23127905
-
Comparison of methods for the implementation of genome-assisted evaluation of Spanish dairy cattle.J Dairy Sci. 2013 Jan;96(1):625-34. doi: 10.3168/jds.2012-5631. Epub 2012 Oct 24. J Dairy Sci. 2013. PMID: 23102955
-
Methods of plant breeding in the genome era.Genet Res (Camb). 2010 Dec;92(5-6):423-41. doi: 10.1017/S0016672310000583. Genet Res (Camb). 2010. PMID: 21429273 Review.
Cited by
-
SABO-ILSTSVR: a genomic prediction method based on improved least squares twin support vector regression.Front Genet. 2024 Jun 14;15:1415249. doi: 10.3389/fgene.2024.1415249. eCollection 2024. Front Genet. 2024. PMID: 38948357 Free PMC article.
-
Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions.BMC Proc. 2012 May 21;6 Suppl 2(Suppl 2):S10. doi: 10.1186/1753-6561-6-S2-S10. Epub 2012 May 21. BMC Proc. 2012. PMID: 22640436 Free PMC article.
-
PANOMICS meets germplasm.Plant Biotechnol J. 2020 Jul;18(7):1507-1525. doi: 10.1111/pbi.13372. Epub 2020 May 19. Plant Biotechnol J. 2020. PMID: 32163658 Free PMC article. Review.
-
Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation.Sci Rep. 2017 Oct 20;7(1):13678. doi: 10.1038/s41598-017-14070-z. Sci Rep. 2017. PMID: 29057969 Free PMC article.
-
Machine learning approaches reveal genomic regions associated with sugarcane brown rust resistance.Sci Rep. 2020 Nov 18;10(1):20057. doi: 10.1038/s41598-020-77063-5. Sci Rep. 2020. PMID: 33208862 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources