Accuracy of genotype imputation in sheep breeds
- PMID: 22221027
- DOI: 10.1111/j.1365-2052.2011.02208.x
Accuracy of genotype imputation in sheep breeds
Abstract
Although genomic selection offers the prospect of improving the rate of genetic gain in meat, wool and dairy sheep breeding programs, the key constraint is likely to be the cost of genotyping. Potentially, this constraint can be overcome by genotyping selection candidates for a low density (low cost) panel of SNPs with sparse genotype coverage, imputing a much higher density of SNP genotypes using a densely genotyped reference population. These imputed genotypes would then be used with a prediction equation to produce genomic estimated breeding values. In the future, it may also be desirable to impute very dense marker genotypes or even whole genome re-sequence data from moderate density SNP panels. Such a strategy could lead to an accurate prediction of genomic estimated breeding values across breeds, for example. We used genotypes from 48 640 (50K) SNPs genotyped in four sheep breeds to investigate both the accuracy of imputation of the 50K SNPs from low density SNP panels, as well as prospects for imputing very dense or whole genome re-sequence data from the 50K SNPs (by leaving out a small number of the 50K SNPs at random). Accuracy of imputation was low if the sparse panel had less than 5000 (5K) markers. Across breeds, it was clear that the accuracy of imputing from sparse marker panels to 50K was higher if the genetic diversity within a breed was lower, such that relationships among animals in that breed were higher. The accuracy of imputation from sparse genotypes to 50K genotypes was higher when the imputation was performed within breed rather than when pooling all the data, despite the fact that the pooled reference set was much larger. For Border Leicesters, Poll Dorsets and White Suffolks, 5K sparse genotypes were sufficient to impute 50K with 80% accuracy. For Merinos, the accuracy of imputing 50K from 5K was lower at 71%, despite a large number of animals with full genotypes (2215) being used as a reference. For all breeds, the relationship of individuals to the reference explained up to 64% of the variation in accuracy of imputation, demonstrating that accuracy of imputation can be increased if sires and other ancestors of the individuals to be imputed are included in the reference population. The accuracy of imputation could also be increased if pedigree information was available and was used in tracking inheritance of large chromosome segments within families. In our study, we only considered methods of imputation based on population-wide linkage disequilibrium (largely because the pedigree for some of the populations was incomplete). Finally, in the scenarios designed to mimic imputation of high density or whole genome re-sequence data from the 50K panel, the accuracy of imputation was much higher (86-96%). This is promising, suggesting that in silico genome re-sequencing is possible in sheep if a suitable pool of key ancestors is sequenced for each breed.
© 2011 The Authors, Animal Genetics © 2011 Stichting International Foundation for Animal Genetics.
Similar articles
-
Design of a low-density SNP chip for the main Australian sheep breeds and its effect on imputation and genomic prediction accuracy.Anim Genet. 2015 Oct;46(5):544-56. doi: 10.1111/age.12340. Epub 2015 Sep 11. Anim Genet. 2015. PMID: 26360638
-
Imputation of genotypes with low-density chips and its effect on reliability of direct genomic values in Dutch Holstein cattle.J Dairy Sci. 2012 Feb;95(2):876-89. doi: 10.3168/jds.2011-4490. J Dairy Sci. 2012. PMID: 22281352
-
Assets of imputation to ultra-high density for productive and functional traits.J Dairy Sci. 2013 Sep;96(9):6047-58. doi: 10.3168/jds.2013-6793. Epub 2013 Jun 28. J Dairy Sci. 2013. PMID: 23810591
-
Evaluation of measures of correctness of genotype imputation in the context of genomic prediction: a review of livestock applications.Animal. 2014 Nov;8(11):1743-53. doi: 10.1017/S1751731114001803. Epub 2014 Jul 21. Animal. 2014. PMID: 25045914 Review.
-
Review: Opportunities and challenges for small populations of dairy cattle in the era of genomics.Animal. 2016 Jun;10(6):1050-60. doi: 10.1017/S1751731116000410. Epub 2016 Mar 9. Animal. 2016. PMID: 26957010 Review.
Cited by
-
Imputation of sequence level genotypes in the Franches-Montagnes horse breed.Genet Sel Evol. 2014 Oct 1;46(1):63. doi: 10.1186/s12711-014-0063-7. Genet Sel Evol. 2014. PMID: 25927638 Free PMC article.
-
Methods of tagSNP selection and other variables affecting imputation accuracy in swine.BMC Genet. 2013 Feb 21;14:8. doi: 10.1186/1471-2156-14-8. BMC Genet. 2013. PMID: 23433396 Free PMC article.
-
Identification of key ancestors of modern germplasm in a breeding program of maize.Theor Appl Genet. 2014 Dec;127(12):2545-53. doi: 10.1007/s00122-014-2396-6. Epub 2014 Sep 11. Theor Appl Genet. 2014. PMID: 25208647
-
The utility of low-density genotyping for imputation in the Thoroughbred horse.Genet Sel Evol. 2014 Feb 4;46(1):9. doi: 10.1186/1297-9686-46-9. Genet Sel Evol. 2014. PMID: 24495673 Free PMC article.
-
Genotype imputation accuracy in a F2 pig population using high density and low density SNP panels.BMC Genet. 2013 May 8;14:38. doi: 10.1186/1471-2156-14-38. BMC Genet. 2013. PMID: 23651538 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources