Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Jun 5:4:22.
doi: 10.1186/1471-2105-4-22. Epub 2003 Jun 5.

Poly: a quantitative analysis tool for simple sequence repeat (SSR) tracts in DNA

Affiliations

Poly: a quantitative analysis tool for simple sequence repeat (SSR) tracts in DNA

Jeff W Bizzaro et al. BMC Bioinformatics. .

Abstract

Background: Simple sequence repeats (SSRs), microsatellites or polymeric sequences are common in DNA and are important biologically. From mononucleotide to trinucleotide repeats and beyond, they can be found in long (> 6 repeating units) tracts and may be characterized by quantifying the frequencies in which they are found and their tract lengths. However, most of the existing computer programs that find SSR tracts do not include these methods.

Results: A computer program named Poly has been written not only to find SSR tracts but to analyze the results quantitatively.

Conclusions: Poly is significant in its use of non-standard, quantitative methods of analysis. And, with its flexible object model and data structure, Poly and its generated data can be used for even more sophisticated analyses.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Frequency plots (one per panel) for equivalent organism sequence data across a range of GC compositions: Dictyostelium discoideum (0.97 Mb at 26.% GC), Oryza sativa japonica (1.5 Mb at 45.% GC) and Chlamydomonas reinhardtii (0.35 Mb at 62.% GC), including a sequence produced by a random number generator with equal weights between bases (1.0 Mb at 50.% GC). The sequences are concatenated from GenBank document "source" sequences and analyzed by Poly as described in the text. These data are presented solely to illustrate the methods described here and not to describe new research.
Figure 2
Figure 2
Operation of the Poly algorithm on DNA sequences. Panel A illustrates the process for a window size of n = 1. Panel B illustrates the process for n = 3. The dotted lines show which bases are compared. White boxes represent bases which are part of an SSR tract, and black boxes represent those which are not.

Similar articles

Cited by

References

    1. Toth G, Gaspari Z, Jurka J. Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res. 2000;10:967–981. doi: 10.1101/gr.10.7.967. - DOI - PMC - PubMed
    1. Streisinger G, Okada Y, Emrich J, Newton J, Tsugita A, Terzaghi E, Inouye M. Frameshift mutations and the genetic code. Cold Spring Harb Symp Quant Biol. 1966;31:77–84. - PubMed
    1. Kunkel TA, Soni A. Mutagenesis by transient misalignment. J Biol Chem. 1988;263:14784–14789. - PubMed
    1. Marx KA, Hess ST, Blake RD. Characteristics of the large (dA).(dT) homopolymer tracts in D. discoideum gene flanking and intron sequences. J Biomol Struct Dyn. 1993;11:57–66. - PubMed
    1. Struhl K. Naturally occurring poly(dA-dT) sequences are upstream promoter elements for constitutive transcription in yeast. Proc Natl Acad Sci U S A. 1985;82:8419–8423. - PMC - PubMed

LinkOut - more resources