Efficient Use of Historical Data for Genomic Selection: A Case Study of Stem Rust Resistance in Wheat
- PMID: 33228293
- DOI: 10.3835/plantgenome2014.09.0046
Abstract
Genomic selection (GS) is a methodology that can improve crop breeding efficiency. To implement GS, a training population (TP) with phenotypic and genotypic data is required to train a statistical model used to predict genotyped selection candidates (SCs). A key factor impacting prediction accuracy is the relationship between the TP and the SCs. This study used empirical data for quantitative adult plant resistance to stem rust of wheat (Triticum aestivum L.) to investigate the utility of a historical TP (TPH) compared with a population-specific TP (TPPS), the potential for TPH optimization, and the utility of TPH data when data on close relatives are available for training. We found that, depending on the population size, a TPPS was 1.5 to 4.4 times more accurate than a TPH, and TPH optimization based on the mean of the generalized coefficient of determination or prediction error variance enabled the selection of subsets that led to significantly higher accuracy than randomly selected subsets. Retaining historical data when data on close relatives were available led to an 11.9% increase in accuracy, at best, and a 12% decrease in accuracy, at worst, depending on the heritability. We conclude that historical data could be used successfully to initiate a GS program, especially if the dataset is very large and of high heritability. Training population optimization would be useful for the identification of TPH subsets to phenotype additional traits. However, after model updating, discarding historical data may be warranted. More studies are needed to determine if these observations represent general trends.
© 2015 The Authors.
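The GS workflow summarized in the abstract (train a model on a genotyped and phenotyped TP, predict genotyped SCs, then measure accuracy) can be illustrated with a minimal sketch. It is not the authors' model or data: it assumes a plain ridge-regression marker model with a fixed, hand-picked shrinkage parameter `lam`, simulated 0/1/2 marker scores, and accuracy taken as the Pearson correlation between predicted and observed values. The function names `rr_blup_predict` and `prediction_accuracy` are hypothetical.

```python
import numpy as np


def rr_blup_predict(X_train, y_train, X_cand, lam=1.0):
    """Fit ridge-regression marker effects on the TP and predict the SCs.

    X_train, X_cand: marker matrices (individuals x markers).
    lam: shrinkage parameter (an assumption here; in practice it is
         usually estimated from variance components).
    """
    mu = y_train.mean()
    col_means = X_train.mean(axis=0)
    Xc = X_train - col_means  # center markers on the training population
    n_markers = Xc.shape[1]
    # Solve (Xc'Xc + lam * I) beta = Xc'(y - mu) for the marker effects.
    beta = np.linalg.solve(
        Xc.T @ Xc + lam * np.eye(n_markers), Xc.T @ (y_train - mu)
    )
    return mu + (X_cand - col_means) @ beta


def prediction_accuracy(predicted, observed):
    """Pearson correlation between predicted and observed values."""
    return float(np.corrcoef(predicted, observed)[0, 1])


# Simulated example data (purely illustrative, not from the study).
rng = np.random.default_rng(0)
n_train, n_cand, n_markers = 200, 50, 300
X = rng.integers(0, 3, size=(n_train + n_cand, n_markers)).astype(float)
true_effects = rng.normal(0.0, 0.1, n_markers)
g = X @ true_effects                               # simulated genetic values
y = g + rng.normal(0.0, g.std(), size=g.shape)     # heritability near 0.5

pred = rr_blup_predict(X[:n_train], y[:n_train], X[n_train:], lam=float(n_markers))
acc = prediction_accuracy(pred, y[n_train:])
print(f"prediction accuracy on selection candidates: {acc:.2f}")
```

Comparing a historical and a population-specific TP in this setup would amount to calling `rr_blup_predict` with different training rows and comparing the resulting accuracies on the same candidates.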