Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 May;182(1):355-64.
doi: 10.1534/genetics.108.098277. Epub 2009 Mar 18.

Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a Barley case study

Affiliations

Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a Barley case study

Shengqiang Zhong et al. Genetics. 2009 May.

Abstract

We compared the accuracies of four genomic-selection prediction methods as affected by marker density, level of linkage disequilibrium (LD), quantitative trait locus (QTL) number, sample size, and level of replication in populations generated from multiple inbred lines. Marker data on 42 two-row spring barley inbred lines were used to simulate high and low LD populations from multiple inbred line crosses: the first included many small full-sib families and the second was derived from five generations of random mating. True breeding values (TBV) were simulated on the basis of 20 or 80 additive QTL. Methods used to derive genomic estimated breeding values (GEBV) were random regression best linear unbiased prediction (RR-BLUP), Bayes-B, a Bayesian shrinkage regression method, and BLUP from a mixed model analysis using a relationship matrix calculated from marker data. Using the best methods, accuracies of GEBV were comparable to accuracies from phenotype for predicting TBV without requiring the time and expense of field evaluation. We identified a trade-off between a method's ability to capture marker-QTL LD vs. marker-based relatedness of individuals. The Bayesian shrinkage regression method primarily captured LD, the BLUP methods captured relationships, while Bayes-B captured both. Under most of the study scenarios, mixed-model analysis using a marker-derived relationship matrix (BLUP) was more accurate than methods that directly estimated marker effects, suggesting that relationship information was more valuable than LD information. When markers were in strong LD with large-effect QTL, or when predictions were made on individuals several generations removed from the training data set, however, the ranking of method performance was reversed and BLUP had the lowest accuracy.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Decline of LD as measured by formula image against distance in centimorgans for all markers with minor allele frequency >0.2 (858 markers in total). (A–C) formula image in the original 42 lines, design 1 and design 2, respectively. Gray lines are smoothed running averages.
F<sc>igure</sc> 2.—
Figure 2.—
Correlation between simulated and predicted breeding values in individuals derived from one generation of randomly mating the training population (accuracy). (A and B) Analyses with dense markers. (C and D) Analyses with sparse markers. (B) Results with observed QTL. (A, C, and D) Results with unobserved QTL. The standard error for each point is small (<0.002) and is not shown. Note that the y-axis scale for B is different from that for A, C, and D.
F<sc>igure</sc> 3.—
Figure 3.—
Same as for Figure 2, but predictions are for individuals derived from four generations of randomly mating the training population.
F<sc>igure</sc> 4.—
Figure 4.—
Prediction accuracy in individuals derived from one generation of random mating with different population sizes under sparse markers with 80 QTL. All scenarios are under the 80 QTL setting. (A) Results with unobserved QTL. (B) Results with observed QTL. Note that the y-axis scale for A is different from that for B.

References

    1. Bernardo, R., and J. Yu, 2007. Prospects for genome-wide selection for quantitative traits in maize. Crop Sci. 47 1082–1090.
    1. Calus, M. P. L., and R. F. Veerkamp, 2007. Accuracy of breeding values when using and ignoring the polygenic effect in genomic breeding value estimation with a marker density of one SNP per cM. J. Anim. Breed. Genet. 124 362–368. - PubMed
    1. Falconer, D. S., and T. F. C. Mackay, 1996. Introduction to Quantitative Genetics. Longman, New York.
    1. Goddard, M. E., 2008. Genomic selection: prediction of accuracy and maximisation of long term response. Genetica (in press). - PubMed
    1. Habier, D., R. L. Fernando and J. C. M. Dekkers, 2007. The impact of genetic relationship information on genome-assisted breeding values. Genetics 177 2389–2397. - PMC - PubMed

Publication types