Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 1;8(12):3485-3507.
doi: 10.1093/gbe/evw247.

The Evolution of LINE-1 in Vertebrates

Affiliations

The Evolution of LINE-1 in Vertebrates

Stéphane Boissinot et al. Genome Biol Evol. .

Abstract

The abundance and diversity of the LINE-1 (L1) retrotransposon differ greatly among vertebrates. Mammalian genomes contain hundreds of thousands L1s that have accumulated since the origin of mammals. A single group of very similar elements is active at a time in mammals, thus a single lineage of active families has evolved in this group. In contrast, non-mammalian genomes (fish, amphibians, reptiles) harbor a large diversity of concurrently transposing families, which are all represented by very small number of recently inserted copies. Why the pattern of diversity and abundance of L1 is so different among vertebrates remains unknown. To address this issue, we performed a detailed analysis of the evolution of active L1 in 14 mammals and in 3 non-mammalian vertebrate model species. We examined the evolution of base composition and codon bias, the general structure, and the evolution of the different domains of L1 (5′UTR, ORF1, ORF2, 3′UTR). L1s differ substantially in length, base composition, and structure among vertebrates. The most variation is found in the 5′UTR, which is longer in amniotes, and in the ORF1, which tend to evolve faster in mammals. The highly divergent L1 families of lizard, frog, and fish share species-specific features suggesting that they are subjected to the same functional constraints imposed by their host. The relative conservation of the 5′UTR and ORF1 in non-mammalian vertebrates suggests that the repression of transposition by the host does not act in a sequence-specific manner and did not result in an arms race, as is observed in mammals.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
Pattern of evolution of L1 families in mammals and non-mammals. The phylogenies are ML trees based on the data of Khan et al. (2006) and Novick et al. (2009). The number of copy for each family is indicated in bold. (A) This phylogeny represents the evolution of L1 families in human and demonstrates the ladder-like mode of evolution typical of mammals. (B) This phylogeny is based on lizard L1 families (Novick et al. 2009) and is typical of non-mammalian vertebrates (reptiles, amphibians and fish).
F<sc>ig</sc>. 2.—
Fig. 2.—
(A) Typical structure of human and murine rodents full-length L1 elements (CCD = Coiled-coil domain; RRM = RNA recognition motif; CTD = C-terminal domain; EN = Endonuclease domain; RT = Reverse transcriptase domain). (B) Schematic structure of full-length L1 families in mammals, lizard, frog and zebrafish.
F<sc>ig</sc>. 3.—
Fig. 3.—
Maximum likelihood phylogeny of L1 families based on ORF2 amino acid sequences.
F<sc>ig</sc>. 4.—
Fig. 4.—
Evolution of the mammalian IGR. The figure suggests that the ancestor of mammals had an IGR that was lost after the split between afrotheria (elephant and hyrax) and other mammals and that an IGR was regained in pig. The branch-lengths on the phylogeny are not up to scale.
F<sc>ig</sc>. 5.—
Fig. 5.—
Base composition at the three codon positions for ORF1 and ORF2.
F<sc>ig</sc>. 6.—
Fig. 6.—
Frequency of amino acids in ORF1 and ORF2 for mammals, lizard, frog, and zebrafish.
F<sc>ig</sc>. 7.—
Fig. 7.—
Dotmatcher analysis of the horse, elephant, dog, and cow 5′UTR against themselves. Note the long tandem duplication in horse and elephant and the repeats rich region of the dog and cow 5′UTRs (framed with blue boxes).
F<sc>ig</sc>. 8.—
Fig. 8.—
Alignment of the 5′ termini of L1 in mammals (A), lizard clade 1 (B), lizard clade 2 (C), frog (D), and zebrafish (E). The length of the alignments varies among groups since the length of the 5′ termini that could be aligned differed.
F<sc>ig</sc>. 9.—
Fig. 9.—
Amino acid alignment of the RRM and CTD. The two RRMs are boxed in blue, the amino acids forming the stabilizing salt bridge are indicated with orange arrows, the residues providing RNA-binding side chains are indicated with green arrows, and the PDPK docking sites are boxed in red and the PP1 docking site in purple.
F<sc>ig</sc>. 10.—
Fig. 10.—
Schematic structure of the CCD of ORF1. The structure of the coiled coils is based on the analysis with a 28 residues window width.
F<sc>ig</sc>. 11.—
Fig. 11.—
(A) Schematic structure of the IGR showing the position of the predicted IRES. (B) RNA structure of the predicted IRES of the elephant, frog L1-15 and zebrafish L1-1A compared with the IRES of a dicistroviridae, the cripavirus-1 infecting the insect Homalodisca coagulata (GenBank accession number KT207917).

References

    1. Adey NB, Schichman SA, et al. 1994. Rodent L1 evolution has been driven by a single dominant lineage that has repeatedly acquired new transcriptional regulatory sequences. Mol Biol Evol. 11:778–789. - PubMed
    1. Adey NB, Tollefsbol TO, Sparks AB, Edgell MH, Hutchison CA., III. 1994. Molecular resurrection of an extinct ancestral promoter for mouse L1. Proc Natl Acad Sci U S A. 91:1569–1573. - PMC - PubMed
    1. Alfoldi J, et al. 2011. The genome of the green anole lizard and a comparative analysis with birds and mammals. Nature 477:587–591. - PMC - PubMed
    1. Alisch RS, Garcia-Perez JL, Muotri AR, Gage FH, Moran JV. 2006. Unconventional translation of mammalian LINE-1 retrotransposons. Genes Dev. 20:210–224. - PMC - PubMed
    1. Athanikar JN, Badge RM, Moran JV. 2004. A YY1-binding site is required for accurate human LINE-1 transcription initiation. Nucleic Acids Res. 32:3846–3855. - PMC - PubMed

LinkOut - more resources