Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jun;194(2):493-503.
doi: 10.1534/genetics.113.150227. Epub 2013 Mar 27.

Genomic predictability of interconnected biparental maize populations

Affiliations

Genomic predictability of interconnected biparental maize populations

Christian Riedelsheimer et al. Genetics. 2013 Jun.

Abstract

Intense structuring of plant breeding populations challenges the design of the training set (TS) in genomic selection (GS). An important open question is how the TS should be constructed from multiple related or unrelated small biparental families to predict progeny from individual crosses. Here, we used a set of five interconnected maize (Zea mays L.) populations of doubled-haploid (DH) lines derived from four parents to systematically investigate how the composition of the TS affects the prediction accuracy for lines from individual crosses. A total of 635 DH lines genotyped with 16,741 polymorphic SNPs were evaluated for five traits including Gibberella ear rot severity and three kernel yield component traits. The populations showed a genomic similarity pattern, which reflects the crossing scheme with a clear separation of full sibs, half sibs, and unrelated groups. Prediction accuracies within full-sib families of DH lines followed closely theoretical expectations, accounting for the influence of sample size and heritability of the trait. Prediction accuracies declined by 42% if full-sib DH lines were replaced by half-sib DH lines, but statistically significantly better results could be achieved if half-sib DH lines were available from both instead of only one parent of the validation population. Once both parents of the validation population were represented in the TS, including more crosses with a constant TS size did not increase accuracies. Unrelated crosses showing opposite linkage phases with the validation population resulted in negative or reduced prediction accuracies, if used alone or in combination with related families, respectively. We suggest identifying and excluding such crosses from the TS. Moreover, the observed variability among populations and traits suggests that these uncertainties must be taken into account in models optimizing the allocation of resources in GS.

Keywords: Fusarium graminearum; G-BLUP; GenPred; shared data resources; whole-genome prediction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Crossing scheme to obtain the five populations.
Figure 2
Figure 2
Heatmap showing IBS similarities (red, above diagonal) and genomic correlations (blue, below diagonal) among all genotypes. The numbers in the blocks refer to average IBS similarities within and between populations. Average genomic correlations in comparisons with linkage phase similarities can be found in Table S2. FS, full sibs; HS, half sibs; UR, unrelated.
Figure 3
Figure 3
(A–D) Prediction accuracies for within-population prediction using full sibs (1A). Shown are the average values for all traits obtained from 100 repetitions. The dashed lines indicate phenotypic accuracies calculated as the square root of the heritability.
Figure 4
Figure 4
(A and B) Observed vs. expected within-population prediction accuracy using full sibs (1A) averaged over (A) all populations except the smallest one (π2,4) and (B) traits. The expected prediction accuracy was calculated using the formula suggested by Daetwyler et al. (2010), assuming an average genome size L = 16.34 M and effective number of loci Me = 2NeL. Effective population size (Ne) was calculated using the harmonic mean approximation for two generations.
Figure 5
Figure 5
(A–E) Prediction accuracies for individual validation populations (VP) depending on the composition of the training set (TS) and the total TS size. Shown are the mean values over all traits, repetitions, and possible combinations to construct the TS. The dashed lines indicates the TS size at which a model-based statistical comparison of TS compositions was performed. See Table 1 for details about the different TS compositions. Results for individual traits can be found in Figure S4, Figure S5, Figure S6, Figure S7, and Figure S8.

References

    1. Albrecht T., Wimmer V., Auinger H.-J., Erbe M., Knaak C., et al. , 2011. Genome-based prediction of testcross values in maize. Theor. Appl. Genet. 123: 339–350. - PubMed
    1. Asoro F. G., Newell M. A., Beavis W. D., Scott M. P., Jannink J.-L., 2011. Accuracy and training population design for genomic selection on quantitative traits in elite North American oats. Plant Genome 4: 132–144.
    1. Astle W., Balding D. J., 2009. Population structure and cryptic relatedness in genetic association studies. Stat. Sci. 24: 451–471.
    1. Bernardo R., Yu J., 2009. Prospects for genomewide selection for quantitative traits in maize. Crop Sci. 47: 1082–1090.
    1. Bolduan C., Montes J.-M., Dhillon B. S., Mirdita V., Melchinger A. E., 2009. Determination of mycotoxin concentration by ELISA and near-infrared spectroscopy in fusarium-inoculated maize. Cereal Res. Commun. 37: 521–529.

Publication types

LinkOut - more resources