Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun;185(2):623-31.
doi: 10.1534/genetics.110.116590. Epub 2010 Mar 22.

Accurate prediction of genetic values for complex traits by whole-genome resequencing

Affiliations

Accurate prediction of genetic values for complex traits by whole-genome resequencing

Theo Meuwissen et al. Genetics. 2010 Jun.

Abstract

Whole-genome resequencing technology has improved rapidly during recent years and is expected to improve further such that the sequencing of an entire human genome sequence for $1000 is within reach. Our main aim here is to use whole-genome sequence data for the prediction of genetic values of individuals for complex traits and to explore the accuracy of such predictions. This is relevant for the fields of plant and animal breeding and, in human genetics, for the prediction of an individual's risk for complex diseases. Here, population history and genomic architectures were simulated under the Wright-Fisher population and infinite-sites mutation model, and prediction of genetic value was by the genomic selection approach, where a Bayesian nonlinear model was used to predict the effects of individual SNPs. The Bayesian model assumed a priori that only few SNPs are causative, i.e., have an effect different from zero. When using whole-genome sequence data, accuracies of prediction of genetic value were >40% increased relative to the use of dense approximately 30K SNP chips. At equal high density, the inclusion of the causative mutations yielded an extra increase of accuracy of 2.5-3.7%. Predictions of genetic value remained accurate even when the training and evaluation data were 10 generations apart. Best linear unbiased prediction (BLUP) of SNP effects does not take full advantage of the genome sequence data, and nonlinear predictions, such as the Bayesian method used here, are needed to achieve maximum accuracy. On the basis of theoretical work, the results could be extended to more realistic genome and population sizes.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
The accuracy of the predictions of total genetic value in the TEST1 data set as a function of the marker density (d) in SNPs per morgan. The densities evaluated were d = 1000, 11,000, and 33,000, and the training data contained T = 200 individuals.
F<sc>igure</sc> 2.—
Figure 2.—
The posterior probability of a SNP being fitted in the model and the simulated QTL variance plotted against the position along the chromosome for three randomly picked 3-QTL+ data sets (plotted are the 100 SNPs that surround the 3 causative SNPs, resulting in three segments in each graph (separated by formula image).
F<sc>igure</sc> 3.—
Figure 3.—
The posterior probability of fitting a causative SNP vs. the genetic variance explained by this SNP. (a and b) The 3-QTL+ and 30-QTL+ data, with T = 200 records; (c) the 6-QTL+ data with 400 records.

References

    1. Barrett, J. C., S. Hansoul, D. L. Nicolae, J. H. Cho, R. H. Duerr et al., 2008. Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat. Genet. 40 955–962. - PMC - PubMed
    1. Daetwyler, H. D., B. Villanueva and J. A. Woolliams, 2008. Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS ONE 3 e3395. - PMC - PubMed
    1. Erickson, D. L., C. B. Fenster, H. K. Stenoien and D. Price, 2004. Quantitative trait locus analyses and the study of evolutionary process. Mol. Ecol. 13 2505–2522. - PubMed
    1. Falconer, D., and T. Mackay, 1996. Introduction to Quantitative Genetics. Longman, London. - PMC - PubMed
    1. Goddard, M. E., 2009. Genomic selection: prediction of accuracy and maximisation of long term response. Genetica 136 245–257. - PubMed