Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2004 Feb;2(1):24-31.
doi: 10.1016/s1672-0229(04)02004-2.

Analysis on frequency and density of microsatellites in coding sequences of several eukaryotic genomes

Affiliations
Comparative Study

Analysis on frequency and density of microsatellites in coding sequences of several eukaryotic genomes

Bin Li et al. Genomics Proteomics Bioinformatics. 2004 Feb.

Abstract

Microsatellites or simple sequence repeats (SSRs) have been found in most organisms during the last decade. Since large-scale sequences are being generated, especially those that can be used to search for microsatellites, the development of these markers is getting more convenient. Keeping SSRs in viewing the importance of the application, available CDS (coding sequences) or ESTs (expressed sequence tags) of some eukaryotic species were used to study the frequency and density of various types of microsatellites. On the basis of surveying CDS or EST sequences amounting to 66.6 Mb in silkworm, 37.2 Mb in fly, 20.8 Mb in mosquito, 60.0 Mb in mouse, 34.9 Mb in zebrafish and 33.5 Mb in Caenorhabditis elegans, the frequency of SSRs was 1/1.00 Kb in silkworm, 1/0.77 Kb in fly, 1/1.03 Kb in mosquito, 1/1.21 Kb in mouse, 1/1.25 Kb in zebrafish and 1/1.38 Kb in C. elegans. The overall average SSR frequency of these species is 1/1.07 Kb. Hexanucleotide repeats (64.5%-76.6%) are the most abundant class of SSR in the investigated species, followed by trimeric, dimeric, tetrameric, monomeric and pentameric repeats. Furthermore, the A-rich repeats are predominant in each type of SSRs, whereas G-rich repeats are rare in the coding regions.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Comparison of densities of each SSR across the eukaryotic genomes. The density is expressed in base-pairs of SSRs sequence per megabase-pairs of dataset sequences.
Fig. 2
Fig. 2
Comparative analysis of redundant and non-redundant SSRs in silkworm EST set. Redundant SSRs were identified in total set of ESTs (116,914), and non-redundant SSRs were identified in non-redundant set of ESTs (14,081) and were selected after assembling.
Fig. 3
Fig. 3
Distribution of SSRs in eukaryotic genomes. In all species hexameric repeats are the most frequent followed by trimeric repeats, while penatameric repeats are under 2% of the total number of SSRs.
Fig. 4
Fig. 4
An example of the distribution of microsatellites of different repeat units in ESTs of silkworm. The number of SSRs in a particular class decreases with the increasing number of repeat units.
Fig. 5
Fig. 5
Density of different dimeric repeats per million base pairs of CDS or ESTs sequences in different genomes.
Fig. 6
Fig. 6
Density of different trimeric repeats per million base pairs of CDS or ESTs sequences in different genomes.
Fig. 7
Fig. 7
Density of different tetrameric repeats per million base pairs of CDS or ESTs sequences in different genomes.

References

    1. Dib C. A comprehensive genetic map of the human genome based on 5,264 microsatellites. Nature. 1996;380:149–152. - PubMed
    1. Kashi Y. Simple sequence repeats as a source of quantitative genetic variation. Trends Genet. 1997;13:74–78. - PubMed
    1. International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. - PubMed
    1. Subramanian S. Genome-wide analysis of microsatellite repeats in humans: their abundance and density in specific genomic regions. Genome Biol. 2003;4:R13. - PMC - PubMed
    1. Yu J. Inconsistency between SSR groupings and genetic backgrounds of white corn inbreds. Maydica. 2001;46:133–139.

Publication types