Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;121(2):337-51.
doi: 10.1007/s00122-010-1313-x. Epub 2010 Mar 17.

Codominant scoring of AFLP in association panels

Affiliations

Codominant scoring of AFLP in association panels

Gerrit Gort et al. Theor Appl Genet. 2010 Jul.

Abstract

A study on the codominant scoring of AFLP markers in association panels without prior knowledge on genotype probabilities is described. Bands are scored codominantly by fitting normal mixture models to band intensities, illustrating and optimizing existing methodology, which employs the EM-algorithm. We study features that improve the performance of the algorithm, and the unmixing in general, like parameter initialization, restrictions on parameters, data transformation, and outlier removal. Parameter restrictions include equal component variances, equal or nearly equal distances between component means, and mixing probabilities according to Hardy-Weinberg Equilibrium. Histogram visualization of band intensities with superimposed normal densities, and optional classification scores and other grouping information, assists further in the codominant scoring. We find empirical evidence favoring the square root transformation of the band intensity, as was found in segregating populations. Our approach provides posterior genotype probabilities for marker loci. These probabilities can form the basis for association mapping and are more useful than the standard scoring categories A, H, B, C, D. They can also be used to calculate predictors for additive and dominance effects. Diagnostics for data quality of AFLP markers are described: preference for three-component mixture model, good separation between component means, and lack of singletons for the component with highest mean. Software has been developed in R, containing the models for normal mixtures with facilitating features, and visualizations. The methods are applied to an association panel in tomato, comprising 1,175 polymorphic markers on 94 tomato hybrids, as part of a larger study within the Dutch Centre for BioSystems Genomics.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Histograms of band intensities of marker 1,039 with superimposed normal densities. Subplots a and b show color-coded hard classifications based on probability thresholds 0.95, and 0.98, respectively. In the last case, some observations are classified as unknown (Z)
Fig. 2
Fig. 2
Four examples of AFLP markers from the tomato data with histograms of band intensities, and well fitting normal mixture densities
Fig. 3
Fig. 3
Examples of features helping unmixing of marker intensities for the tomato data. Subplots 1ab deal with starting values of parameters; 2a1a2 restriction on σ: hetero- versus homoscedasticity; 2b1b2 restriction on μ: equidistant component means; 2c1c2 HWE restriction on π; 3a1a4 transformation of band intensity; 4a1a2 number of components of mixture model; 4b1b3 separation of group means; 4c1c2 outliers; five extra information in plot
Fig. 4
Fig. 4
Histogram and fitted normal mixtures with unrestricted πj (subplot a) and restricted πj according to HWE (subplot b)

Similar articles

Cited by

References

    1. van Berloo R, van Heusden S, Bovy A, Meijer-Dekens F, Lindhout P, van Eeuwijk F. Genetic research in a public–private research consortium: prospects for indirect use of Elite breeding germplasm in academic research. Euphytica. 2008;161:293–300. doi: 10.1007/s10681-007-9519-y. - DOI
    1. van Berloo R, Zhu AG, Ursem R, Verbakel H, Gort G, van Eeuwijk FA. Diversity and linkage disequilibrium analysis within a selected set of cultivated tomatoes. Theor Appl Genet. 2008;117:89–101. doi: 10.1007/s00122-008-0755-x. - DOI - PMC - PubMed
    1. Bezdek J. Pattern recognition with fuzzy objective function algorithms. New York: Plenum Press; 1981.
    1. Böhning D, Seidel W, Alf M, Garel B, Patilea V, Walther G. Advances in mixture models. Comput Stat Data Anal. 2007;51:5205–5210. doi: 10.1016/j.csda.2006.10.025. - DOI
    1. Castiglioni P, Ajmone-Marsan P, van Wijk R, Motto M. AFLP markers in a molecular linkage map of maize: codominant scoring and linkage group distribution. Theor Appl Genet. 1999;99:425–431. doi: 10.1007/s001220051253. - DOI - PubMed

Substances