Genetics. 2004 Aug;167(4):1915-28.
doi: 10.1534/genetics.103.015693.

Significance tests and weighted values for AFLP similarities, based on Arabidopsis in silico AFLP fragment length distributions

Wim J M Koopman et al. Genetics. 2004 Aug.

Abstract

Many AFLP studies include relatively unrelated genotypes that contribute noise to data sets instead of signal. We developed: (1) estimates of expected AFLP similarities between unrelated genotypes, (2) significance tests for AFLP similarities, enabling the detection of unrelated genotypes, and (3) weighted similarity coefficients, including band position information. Detection of unrelated genotypes and use of weighted similarity coefficients will make the analysis of AFLP data sets more informative and more reliable. Test statistics and weighted coefficients were developed for total numbers of shared bands and for Dice, Jaccard, Nei and Li, and simple matching (dis)similarity coefficients. Theoretical and in silico AFLP fragment length distributions (FLDs) were examined as a basis for the tests. The in silico AFLP FLD based on the Arabidopsis thaliana genome sequence was the most appropriate for angiosperms. The G + C content of the selective nucleotides in the in silico AFLP procedure significantly influenced the FLD. Therefore, separate test statistics were calculated for AFLP procedures with high, average, and low G + C contents in the selective nucleotides. The test statistics are generally applicable for angiosperms with a G + C content of approximately 35-40%, but represent conservative estimates for genotypes with higher G + C contents. For the latter, test statistics based on a rice genome sequence are more appropriate.
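For orientation, the unweighted coefficients named in the abstract can be sketched in a few lines of Python. This is only an illustration of the standard textbook definitions, assuming each genotype is scored as a binary band presence/absence vector over the same band positions; the function names and example profiles are hypothetical, and the sketch does not reproduce the paper's significance tests or its weighted coefficients. The Nei and Li coefficient is commonly written in the same form as Dice for band data, so it is not listed separately here.

```python
def band_counts(x, y):
    """Count band categories for two equal-length 0/1 band profiles.

    a: bands present in both, b: present only in x,
    c: present only in y,     d: absent in both.
    """
    if len(x) != len(y):
        raise ValueError("band profiles must have the same length")
    a = sum(1 for p, q in zip(x, y) if p and q)
    b = sum(1 for p, q in zip(x, y) if p and not q)
    c = sum(1 for p, q in zip(x, y) if not p and q)
    d = sum(1 for p, q in zip(x, y) if not p and not q)
    return a, b, c, d


def dice(x, y):
    """Dice similarity: 2a / (2a + b + c); shared absences are ignored."""
    a, b, c, _ = band_counts(x, y)
    if a + b + c == 0:
        raise ValueError("no bands present in either profile")
    return 2 * a / (2 * a + b + c)


def jaccard(x, y):
    """Jaccard similarity: a / (a + b + c); shared absences are ignored."""
    a, b, c, _ = band_counts(x, y)
    if a + b + c == 0:
        raise ValueError("no bands present in either profile")
    return a / (a + b + c)


def simple_matching(x, y):
    """Simple matching: (a + d) / (a + b + c + d); shared absences count."""
    a, b, c, d = band_counts(x, y)
    if a + b + c + d == 0:
        raise ValueError("empty band profiles")
    return (a + d) / (a + b + c + d)


# Hypothetical band profiles for two genotypes (1 = band present).
genotype_1 = [1, 1, 0, 1, 0, 0]
genotype_2 = [1, 0, 0, 1, 1, 0]

print(band_counts(genotype_1, genotype_2))
print(dice(genotype_1, genotype_2))
print(jaccard(genotype_1, genotype_2))
print(simple_matching(genotype_1, genotype_2))
```

Dice and Jaccard depend only on bands present in at least one genotype, whereas simple matching also counts shared absences, which is one reason the coefficients can rank the same pair of genotypes differently.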
