Molecular archeology of L1 insertions in the human genome
- PMID: 12372140
- PMCID: PMC134481
- DOI: 10.1186/gb-2002-3-10-research0052
Abstract
Background: As the rough draft of the human genome sequence nears a finished product and other genome-sequencing projects accumulate sequence data exponentially, bioinformatics is emerging as an important tool for studies of transposon biology. In particular, L1 elements exhibit a variety of sequence structures after insertion into the human genome that are amenable to computational analysis. We carried out a detailed analysis of the anatomy and distribution of L1 elements in the human genome using a new computer program, TSDfinder, designed to identify transposon boundaries precisely.
Results: Structural variants of L1 elements shared similar trends in the length and quality of their target site duplications (TSDs) and poly(A) tails. Furthermore, we found no correlation between the composition and genomic location of the pre-insertion locus and the resulting anatomy of the L1 insertion. We verified that L1 insertions with TSDs have the 5'-TTAAAA-3' cleavage site associated with L1 endonuclease activity. In addition, the second target DNA cut required for L1 insertion weakly matches the consensus pattern TTAAAA. On the other hand, the L1-internal breakpoints of deleted and inverted L1 elements do not resemble L1 endonuclease cleavage sites. Finally, the genome sequence data indicate that whereas singly inverted elements are common, doubly inverted elements are almost never found.
Conclusions: The sequence data give no indication that the creation of L1 structural variants depends on characteristics of the insertion locus. In addition, the formation of 5' truncated and 5' inverted L1s is probably not due to the action of the L1 endonuclease.
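To illustrate what a target site duplication looks like in sequence terms, here is a minimal Python sketch. It is not the TSDfinder algorithm, which the abstract does not describe. The function name `longest_flanking_repeat` and the `max_len` parameter are hypothetical. The sketch assumes the two flanks directly abut the insertion (with any poly(A) tail already trimmed from the 3' side) and considers exact matches only.

```python
def longest_flanking_repeat(upstream: str, downstream: str, max_len: int = 30) -> str:
    """Return the longest string that both ends `upstream` and starts `downstream`.

    Assumption: `upstream` is the genomic sequence immediately 5' of the
    insertion and `downstream` is the sequence immediately 3' of it.
    An exact direct repeat at these two positions is a candidate TSD.
    """
    upstream = upstream.upper()
    downstream = downstream.upper()
    limit = min(max_len, len(upstream), len(downstream))
    # Try the longest candidate first so the first hit is the longest repeat.
    for k in range(limit, 0, -1):
        if upstream[-k:] == downstream[:k]:
            return upstream[-k:]
    return ""


if __name__ == "__main__":
    # Made-up flanks sharing a 12-bp direct repeat; not real genome data.
    tsd = "AAGATTCTGACC"
    print(longest_flanking_repeat("CCGTT" + tsd, tsd + "TGGCA"))
```

A real boundary finder would also need to tolerate mismatches, search a window rather than only the exact edges, and resolve the ambiguity between a poly(A) tail and a TSD that itself begins with adenines. The sketch leaves all of that out.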
