Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 Apr:49:70-78.
doi: 10.1016/j.gde.2018.03.003. Epub 2018 Mar 23.

Satellite DNA evolution: old ideas, new approaches

Affiliations
Review

Satellite DNA evolution: old ideas, new approaches

Sarah Sander Lower et al. Curr Opin Genet Dev. 2018 Apr.

Abstract

A substantial portion of the genomes of most multicellular eukaryotes consists of large arrays of tandemly repeated sequence, collectively called satellite DNA. The processes generating and maintaining different satellite DNA abundances across lineages are important to understand as satellites have been linked to chromosome mis-segregation, disease phenotypes, and reproductive isolation between species. While much theory has been developed to describe satellite evolution, empirical tests of these models have fallen short because of the challenges in assessing satellite repeat regions of the genome. Advances in computational tools and sequencing technologies now enable identification and quantification of satellite sequences genome-wide. Here, we describe some of these tools and how their applications are furthering our knowledge of satellite evolution and function.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Assembly-free strategies for identifying and analyzing repetitive sequences
A) Alignment based approaches generally collapse repeat-derived reads onto known repeat consensus sequences (ConTExt, McGurk and Barbash, bioRxiv doi: 10.1101/158386). Copy number can be inferred from the alignment depth, while sequence polymorphism can be assessed from the collapsed reads. The large gray rectangle represents consensus sequence, while smaller black rectangles represent individual sequencing reads. Red positions within reads indicate sequence polymorphism. B) Kmer-based approaches decompose sequencing reads into overlapping subsequences of length k. Reads derived from simple satellites will be enriched for a small set of kmers. These kmers can be quantified (k-Seek [22]) or used in more abstract representations of reads [23]. C) Graph-based methods construct representations of repeats using sequence similarity. The nodes in such graphs are sequences: kmers (De Bruijin graphs), sequencing reads (e.g. RepeatExplorer [19]), or entire repeats [44]. The edges reflect neighboring relationships (sequence composition similarity). Because of their tandem nature, satellites often present as as circular graphs. Distinct repeat families can be identified as clusters of nodes. Here, two distinct satellites (the separate circular graphs, left and center) as well as a non-tandem sequence (right) are depicted.
None
Ectopic recombination between arrays results in copy number and sequence

References

    1. John B, Miklos GL. Functional aspects of satellite DNA and heterochromatin. Int Rev Cytol. 1979;58:1–114. - PubMed
    1. Choo KH, Vissel B, Earle E. Evolution of α-satellite DNA on human acrocentric chromosomes. Genomics. 5:332–344. 1989/8. - PubMed
    1. Camacho JPM, Ruiz-Ruano FJ, Martín-Blázquez R, López-León MD, Cabrero J, Lorite P, Cabral-de-Mello DC, Bakkali M. A step to the gigantic genome of the desert locust: chromosome sizes and repeated DNAs. Chromosoma. 2015;124:263–275. - PubMed
    1. Kit S. Equilibrium sedimentation in density gradients of DNA preparations from animal tissues. J Mol Biol. 1961;3:711–IN2. - PubMed
    1. Khost DE, Eickbush DG, Larracuente AM. Single-molecule sequencing resolves the detailed structure of complex satellite DNA loci in Drosophila melanogaster. Genome Res. 2017;27:709–721. The authors employ PacBio sequencing of Drosophila melanogaster to achieve detailed assemblies of the major autosomal complex satellite DNA loci, particularly Responder and 1.688 gm/cm3. They show that PacBio assembly with error correction can significantly extend contigs into repetitive pericentromeric regions. They urge careful examination of assemblies across multiple parameters, using orthogonal molecular and genomic evidence to validate assemblies to ensure accuracy. Their data support models of TE gain at the ends of arrays and are consistent with the homogenizing effect of concerted evolution. - PMC - PubMed

Publication types

LinkOut - more resources