Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Apr;2(4):429-35.
doi: 10.1534/g3.111.001453. Epub 2012 Apr 1.

A common dataset for genomic analysis of livestock populations

A common dataset for genomic analysis of livestock populations

Matthew A Cleveland et al. G3 (Bethesda). 2012 Apr.

Abstract

Although common datasets are an important resource for the scientific community and can be used to address important questions, genomic datasets of a meaningful size have not generally been available in livestock species. We describe a pig dataset that PIC (a Genus company) has made available for comparing genomic prediction methods. We also describe genomic evaluation of the data using methods that PIC considers best practice for predicting and validating genomic breeding values, and we discuss the impact of data structure on accuracy. The dataset contains 3534 individuals with high-density genotypes, phenotypes, and estimated breeding values for five traits. Genomic breeding values were calculated using BayesB, with phenotypes and de-regressed breeding values, and using a single-step genomic BLUP approach that combines information from genotyped and un-genotyped animals. The genomic breeding value accuracy increased with increased trait heritability and with increased relationship between training and validation. In nearly all cases, BayesB using de-regressed breeding values outperformed the other approaches, but the single-step evaluation performed only slightly worse. This dataset was useful for comparing methods for genomic prediction using real data. Our results indicate that validation approaches accounting for relatedness between populations can correct for potential overestimation of genomic breeding value accuracies, with implications for genotyping strategies to carry out genomic selection programs.

Keywords: GenPred; cross-validation; genomic relationships; pig; shared data resources.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Accuracy (r2) of estimated breeding values for genotyped animals (N = 3534).
Figure 2
Figure 2
Genomic breeding value accuracy of the 6-fold cross-validation (XVal), for all traits, using a standard polygenic BLUP (BLUP), BayesB with phenotypes (BayesB_ph), the single-step approach (SStep), and BayesB with estimated breeding values (BayesB_ebv).
Figure 3
Figure 3
Genomic breeding value accuracy of the 6-fold cross-validation (XVal) for individuals with at least one parent (P), at least one offspring (O), at least one parent and one offspring (PO), or no parents or offspring (N) genotyped in the training set. All traits were analyzed with BayesB with phenotypes (BayesB_ph), the single-step approach (SStep), and BayesB with estimated breeding values (BayesB_ebv).
Figure 4
Figure 4
Genomic breeding value accuracy of the 6-fold cross-validation (XVal) for individuals with one parent (1P), both parents (2P), at least one offspring (O), at least one parent and one offspring (PO), or no parents or offspring (N) genotyped in the training set, for T4. The trait was analyzed with BayesB with phenotypes (BayesB_ph), the single-step approach (SStep), and BayesB with estimated breeding values (BayesB_ebv).
Figure 5
Figure 5
Genomic breeding value accuracy of the 6-fold cross-validation (XVal) for bins representing the number of validation animals having genomic relationship coefficients with animals in training that exceed 0.45, for T4. The trait was analyzed with BayesB with phenotypes (BayesB_ph), the single-step approach (SStep), and BayesB with estimated breeding values (BayesB_ebv).
Figure 6
Figure 6
Genomic breeding value accuracy of the validation set containing young animals (YoungVal), for all traits, using a standard polygenic BLUP (BLUP), BayesB with phenotypes (BayesB_ph), the single-step approach (SStep), and BayesB with estimated breeding values (BayesB_ebv).

References

    1. Clark S. A., Hickey J. M., van der Werf J. H. J., 2011. Different models of genetic variation and their effect on genomic evaluation. Genet. Sel. Evol. 48: 18. - PMC - PubMed
    1. Cleveland M. A., Forni S., Garrick D. J., Deeb N., 2010. Prediction of genomic breeding values in a commercial pig population. Proceedings of the 9th World Congress on Genetics Applied to Livestock Production. Leipzig, Germany, August 1–6, 2010, Paper 0266
    1. Daetwyler H. D., Villanueva B., Bijma P., Woolliams J. A., 2007. Inbreeding in genome-wide selection. J. Anim. Breed. Genet. 124: 369–376 - PubMed
    1. Dawber T. R., Meadors G. F., Moore F. E., Jr, 1951. Epidemiological approaches to heart disease: the Framingham Study. Am. J. Public Health 41: 279–286 - PMC - PubMed
    1. Deeb N., Forni S., Cleveland M. A., 2010. Linkage disequilibrium decay in commercial pigs. Proceedings of the International Plant and Animal Genome XVIII Conference. San Diego, CA, January 9–13, 2010, P602

LinkOut - more resources