Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Mar;8(1):eplantgenome2014.09.0046.
doi: 10.3835/plantgenome2014.09.0046.

Efficient Use of Historical Data for Genomic Selection: A Case Study of Stem Rust Resistance in Wheat

Affiliations
Free article

Efficient Use of Historical Data for Genomic Selection: A Case Study of Stem Rust Resistance in Wheat

J Rutkoski et al. Plant Genome. 2015 Mar.
Free article

Abstract

Genomic selection (GS) is a methodology that can improve crop breeding efficiency. To implement GS, a training population (TP) with phenotypic and genotypic data is required to train a statistical model used to predict genotyped selection candidates (SCs). A key factor impacting prediction accuracy is the relationship between the TP and the SCs. This study used empirical data for quantitative adult plant resistance to stem rust of wheat (Triticum aestivum L.) to investigate the utility of a historical TP (TPH ) compared with a population-specific TP (TPPS ), the potential for TPH optimization, and the utility of TPH data when close relative data is available for training. We found that, depending on the population size, a TPPS was 1.5 to 4.4 times more accurate than a TPH , and TPH optimization based on the mean of the generalized coefficient of determination or prediction error variance enabled the selection of subsets that led to significantly higher accuracy than randomly selected subsets. Retaining historical data when data on close relatives were available lead to a 11.9% increase in accuracy, at best, and a 12% decrease in accuracy, at worst, depending on the heritability. We conclude that historical data could be used successfully to initiate a GS program, especially if the dataset is very large and of high heritability. Training population optimization would be useful for the identification of TPH subsets to phenotype additional traits. However, after model updating, discarding historical data may be warranted. More studies are needed to determine if these observations represent general trends.

PubMed Disclaimer

References

    1. Amin, N., van Duijn, C.M., Aulchenko, Y.S.. 2007. A genomic background based method for association analysis in related individuals. PLoS ONE 2(12):E1274. doi: 10.1371/journal.pone.0001274
    1. Asoro, F.G., Newell, M.A., Beavis, W.D., Scott, M.P., Tinker, N.A., Jannink, J.L.. 2011. Accuracy and training population design for genomic selection on quantitative traits in elite North American oats. Plant Gen. 4:132-144. doi: 10.3835/plantgenome2011.02.0007
    1. Astle, W., Balding, D.J.. 2009. Population structure and cryptic relatedness in genetic association studies. Stat. Sci. 24:451-471. doi: 10.1214/09-STS307http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PA...
    1. Bates, D., Maechler, M.. 2010. lme4: Linear mixed-effects models using S4 classes. http://cran.r-project.org/package=lme4 (accessed 1 Dec. 2013).
    1. Beeck, C.P., Cowling, W.A., Smith, A.B., Cullis, B.R.. 2010. Analysis of yield and oil from a series of canola breeding trials. Part I. Fitting factor analytic mixed models with pedigree information. Genome 53:992-1001. doi: 10.1139/G10-051http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PA...

LinkOut - more resources