Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Feb 1;1(1):7.
doi: 10.1186/1759-8753-1-7.

Heads or tails: L1 insertion-associated 5' homopolymeric sequences

Affiliations

Heads or tails: L1 insertion-associated 5' homopolymeric sequences

Thomas J Meyer et al. Mob DNA. .

Abstract

Background: L1s are one of the most successful autonomous mobile elements in primate genomes. These elements comprise as much as 17% of primate genomes with the majority of insertions occurring via target primed reverse transcription (TPRT). Twin priming, a variant of TPRT, can result in unusual DNA sequence architecture. These insertions appear to be inverted, truncated L1s flanked by target site duplications.

Results: We report on loci with sequence architecture consistent with variants of the twin priming mechanism and introduce dual priming, a mechanism that could generate similar sequence characteristics. These insertions take the form of truncated L1s with hallmarks of classical TPRT insertions but having a poly(T) simple repeat at the 5' end of the insertion. We identified loci using computational analyses of the human, chimpanzee, orangutan, rhesus macaque and marmoset genomes. Insertion site characteristics for all putative loci were experimentally verified.

Conclusions: The 39 loci that passed our computational and experimental screens probably represent inversion-deletion events which resulted in a 5' inverted poly(A) tail. Based on our observations of these loci and their local sequence properties, we conclude that they most probably represent twin priming events with unusually short non-inverted portions. We postulate that dual priming could, theoretically, produce the same patterns. The resulting homopolymeric stretches associated with these insertion events may promote genomic instability and create potential target sites for future retrotransposition events.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Classical target primed reverse transcription (TPRT), twin priming, variants of twin priming and dual priming mechanisms. (a) A schematic of classical TPRT. The poly(A) tail of an L1 mRNA anneals to the target site created by L1 endonuclease. L1 reverse transcription (RT) primes at the target site and synthesizes the bottom-strand cDNA. A subsequent second-strand nick and synthesis results in an L1 insertion with a 3' poly(A) flanked by TSDs. (b) Twin priming. In this variant of TPRT, after the second-strand nick, a site internal to the mRNA anneals to the top strand overhang. A second RT molecule primes at this site, generating an inverted L1 cDNA. (c) This twin priming variant involves the disengagement of the first RT before reaching the end of the poly(A) tail, resulting in an insertion with a 5' poly(T) stretch, but lacking a 3' poly(A) tail. Like classical twin priming, this mechanism results in an inverted L1 structure. (d) A second twin priming variant creates an insertion with both a 3' poly(A) tail and a 5' poly(T) stretch. The first RT falls off before reaching the end of the poly(A) tail. (e) Dual priming. Classical TPRT involving the first mRNA begins on the first strand. After the second strand nick, a second mRNA anneals to the second strand and undergoes classical TPRT. Note that this panel is rotated 180° relative to the orientation of all other panels. This is done to show that the resulting insertion will appear the same to computational filters as the above twin priming variant.
Figure 2
Figure 2
Investigation of candidate loci and variations within the homopolymeric stretches. (a) A triple alignment of pT684 to two outgroup species, the rhesus macaque and the common marmoset. The TSDs are highlighted in grey, the poly(T) stretch in green, and the L1 is highlighted in blue. (b) A gel chromatograph of polymerase chain reaction products depicting an insertion present in humans, chimpanzees, gorillas and orangutans, but absent in rhesus macaque and owl monkey. (c) Internal primers were designed around the poly(T) stretches for all human-specific loci; two loci are shown here. For each locus, HeLa DNA and a mixture of the DNA of 80 human individuals was run out on a 4% agarose gel with 100 bp and 20 bp ladders. No within-species variation in poly(T) length was observed.
Figure 3
Figure 3
Alignment of candidate L1s to their L1 consensus sequences. Schematic of the position of each candidate L1 when aligned against an L1 consensus sequence. Stars indicate that the 3' end of the locus aligns to a portion of the poly(A) tail in the consensus. Loci are color-coded to indicate in which species each was found.
Figure 4
Figure 4
Microhomology and comparison of insertion site characteristics of each locus. (a) An analysis of the microhomology of the six nucleotides adjacent to each insertion junction (with '1' indicating the nucleotide closest to the insert) to the corresponding sequence in the putative mRNA. Exclusion of a junction from analysis is indicated by a (-) and positions where microhomology is found are shaded grey. Those positions at which significant microhomology were found are highlighted in blue. (b) A binomial distribution analysis of the 6 bp at each junction revealed significant microhomology at both the 3' and 5' junctions of the insertions. No significant microhomology was found at the internal junction. P-values highlighted in blue are significant at p < 0.001. (c) A WebLogo analysis of the 6 bp found at the 3'junction. The logo supports our finding of microhomology at this junction, and is consistent with the expected motif at the L1 endonuclease cleavage site.
Figure 5
Figure 5
L1 endonuclease (EN) cleavage site analyses at the 5' and 3' junctions. For both the 5' and 3' target sites of each locus, the last four nucleotides of the target site and first nucleotide of the flanking sequence were compared to the canonical L1 EN cleavage motif (5'-TTTT/A-3'). To investigate the possibility that the candidate L1s were inserted in the antisense orientation, the 5' target site was reverse complemented and analysed. The black bars show the frequency of each divergence value at the 3' target site among our 39 loci, while the blue bars show values for the 5' target sites. The 3' target sites show more divergence from the typical EN cleavage motif than the 5' target site.

Similar articles

Cited by

References

    1. Smit AF, Toth G, Riggs AD, Jurka J. Ancestral, mammalian-wide subfamilies of LINE-1 repetitive sequences. J Mol Biol. 1995;246:401–417. doi: 10.1006/jmbi.1994.0095. - DOI - PubMed
    1. Cordaux R, Batzer MA. The impact of retrotransponsons on human genome evolution. Nature Reviews Genetics. 2009;10:691–703. doi: 10.1038/nrg2640. - DOI - PMC - PubMed
    1. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. - DOI - PubMed
    1. Brouha B, Schustak J, Badge RM, Lutz-Prigge S, Farley AH, Moran JV, Kazazian HH Jr. Hot L1s account for the bulk of retrotransposition in the human population. Proc Natl Acad Sci USA. 2003;100:5280–5285. doi: 10.1073/pnas.0831042100. - DOI - PMC - PubMed
    1. Smit AF. The origin of interspersed repeats in the human genome. Curr Opin Genet Dev. 1996;6:743–748. doi: 10.1016/S0959-437X(96)80030-X. - DOI - PubMed

LinkOut - more resources