Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Apr;122(6):1149-60.
doi: 10.1007/s00122-010-1519-y. Epub 2011 Jan 11.

Effect of population structure corrections on the results of association mapping tests in complex maize diversity panels

Affiliations

Effect of population structure corrections on the results of association mapping tests in complex maize diversity panels

Sofiane Mezmouk et al. Theor Appl Genet. 2011 Apr.

Abstract

Association mapping of sequence polymorphisms underlying the phenotypic variability of quantitative agronomical traits is now a widely used method in plant genetics. However, due to the common presence of a complex genetic structure within the plant diversity panels, spurious associations are expected to be highly frequent. Several methods have thus been suggested to control for panel structure. They mainly rely on ad hoc criteria for selecting the number of ancestral groups; which is often not evident for the complex panels that are commonly used in maize. It was thus necessary to evaluate the effect of the selected structure models on the association mapping results. A real maize data set (342 maize inbred lines and 12,000 SNPs) was used for this study. The panel structure was estimated using both Bayesian and dimensional reduction methods, considering an increasing number of ancestral groups. Effect on association tests depends in particular on the number of ancestral groups and on the trait analyzed. The results also show that using a high number of ancestral groups leads to an over-corrected model in which all causal loci vanish. Finally the results of all models tested were combined in a meta-analysis approach. In this way, robust associations were highlighted for each analyzed trait.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Log-likelihood of the STRUCTURE software outputs as a function of the number of degrees of freedom absorbed by the structure model (filled diamond), and the mean of the log-likelihood of 20 STRUCTURE independent runs as a function of the number of degrees of freedom absorbed by the structure model (solid line). Outlier STRUCTURE runs with very low Log-likelihood values (until −2,356,000) induced a high mean decrease at 4 degrees of freedom (5 groups) and at more than 16 degrees of freedom (17 groups)
Fig. 2
Fig. 2
Values of the delta(K) criterion as a function of the number of degrees of freedom absorbed by the structure model
Fig. 3
Fig. 3
Neighbor joining tree of the STRUCTURE software outputs. The pairwise Euclidian distances were computed from the predicted allele frequencies from among the 381 STRUCTURE software outputs (no genetic structure model and 20 runs of a number of ancestral groups from 2 to 20). The surrounded part of the tree was zoomed; it corresponds to the outputs that are grouped in accordance with the number of ancestral groups (K)
Fig. 4
Fig. 4
BIC values of a male flowering time (MFT) and b thousand-kernel weight (TKW) as a function of the number of degrees of freedom absorbed by the structure model. The blue and red lines represent the most likely (Q 1) and second most likely (Q 2) STRUCTURE software models, respectively; the green line represents PCA (P) models and the light-blue line represents the MCA (M) models
Fig. 5
Fig. 5
Number of SNPs significantly associated with a male flowering time (MFT) and b thousand-kernel weight (TKW) as a function of the number of degrees of freedom absorbed by the structure model. The blue and red lines represent the most likely (Q 1) and second most likely (Q 2) STRUCTURE software models, respectively; the green line represents PCA (P) models and the light-blue line represents the MCA (M) models. The surrounded parts correspond to a zoom of the curves
Fig. 6
Fig. 6
Total number of significant SNPs over the four structure models tested (solid line curve) and percentage of SNPs common to all of them (dotted line curve) as a function of the number of degrees of freedom absorbed by the structure model

References

    1. Abdurakhmonov IY, Saha S, Jenkins JN, Buriev ZT, Shermatov SE, Scheffler BE, Pepper AE, Yu JZ, Kohel RJ, Abdukarimov A. Linkage disequilibrium based association mapping of fiber quality traits in G. hirsutum L. variety germplasm. Genetica. 2009;136:401–417. doi: 10.1007/s10709-008-9337-8. - DOI - PubMed
    1. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc. 1995;57:289–300.
    1. Buckler ES, Stevens NM. Maize origins, domestication, and selection. In: Motley TJ, Zerega N, Cross H, editors. Darwin’s harvest. New York: Columbia University Press; 2005. pp. 67–90.
    1. Cacharrón J, Saedler H, Theißen G. Expression of MADS box genes ZMM8 and ZMM14 during inflorescence development of Zea mays discriminates between the upper and the lower floret of each spikelet. Dev Genes Evol. 1999;209:411–420. doi: 10.1007/s004270050271. - DOI - PubMed
    1. Camus-Kulandaivelu L, Veyrieras JB, Madur D, Combes V, Fourmann M, Barraud S, Dubreuil P, Gouesnard B, Manicacci D, Charcosset A. Maize adaptation to temperate climate: relationship between population structure and polymorphism in the Dwarf8 gene. Genetics. 2006;172:2449–2463. doi: 10.1534/genetics.105.048603. - DOI - PMC - PubMed

Publication types

LinkOut - more resources