Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Oct 12;12(10):1583.
doi: 10.3390/life12101583.

Human Retrotransposons and Effective Computational Detection Methods for Next-Generation Sequencing Data

Affiliations
Review

Human Retrotransposons and Effective Computational Detection Methods for Next-Generation Sequencing Data

Haeun Lee et al. Life (Basel). .

Abstract

Transposable elements (TEs) are classified into two classes according to their mobilization mechanism. Compared to DNA transposons that move by the "cut and paste" mechanism, retrotransposons mobilize via the "copy and paste" method. They have been an essential research topic because some of the active elements, such as Long interspersed element 1 (LINE-1), Alu, and SVA elements, have contributed to the genetic diversity of primates beyond humans. In addition, they can cause genetic disorders by altering gene expression and generating structural variations (SVs). The development and rapid technological advances in next-generation sequencing (NGS) have led to new perspectives on detecting retrotransposon-mediated SVs, especially insertions. Moreover, various computational methods have been developed based on NGS data to precisely detect the insertions and deletions in the human genome. Therefore, this review discusses details about the recently studied and utilized NGS technologies and the effective computational approaches for discovering retrotransposons through it. The final part covers a diverse range of computational methods for detecting retrotransposon insertions with human NGS data. This review will give researchers insights into understanding the TEs and how to investigate them and find connections with research interests.

Keywords: computational tools; next-generation sequencing (NGS); retrotransposons; transposable elements.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Composition of non-LTR retrotransposons in the human. Non-LTR retrotransposons constitute approximately 34% of the entire human genome. A total of 17% of L1 elements, 11% of Alu elements, and 0.2% of SVAs belong to non-LTR retrotransposons. As some of the elements are still active in humans, they cause genetic disorders and contribute to diversity.
Figure 2
Figure 2
Structure of L1, Alu, and SVA elements. (a) L1 elements have ~6 kb length. ORF1 in L1 element encodes RNA-binding protein. ORF2 encodes endonuclease and reverse transcriptase for self-mobilization. (b) Alu elements have a dimeric structure with a length of 300 bp. The two monomers are present on both sides of the A-rich region. (c) The canonical length of SVA elements is 2 kb. They contain five distinctive regions.
Figure 3
Figure 3
Illustration of discordant read, read-pair and soft-clipped reads. (a) A discordant read indicates that one-end read is fully mapped to the reference, but another end is not mapped to the reference. (b) A read pair provides putative insertion site and gives information about breakpoint interval based on 5′cluster at one end and 3′cluster at the other end. (c) A soft-clipped read (split read) refers a mate pair where one part is partially mapped to the reference. Hence, the truncated read contains both reference and novel insertion sequences.

References

    1. Cordaux R., Batzer M.A. The impact of retrotransposons on human genome evolution. Nat. Rev. Genet. 2009;10:691–703. doi: 10.1038/nrg2640. - DOI - PMC - PubMed
    1. McClintock B. Controlling elements and the gene. Cold Spring Harb. Symp. Quant. Biol. 1956;21:197–216. doi: 10.1101/SQB.1956.021.01.017. - DOI - PubMed
    1. de Koning A.P.J., Gu W.J., Castoe T.A., Batzer M.A., Pollock D.D. Repetitive Elements May Comprise Over Two-Thirds of the Human Genome. PLoS Genet. 2011;7:e1002384. doi: 10.1371/journal.pgen.1002384. - DOI - PMC - PubMed
    1. Pheasant M., Mattick J.S. Raising the estimate of functional human sequences. Genome Res. 2007;17:1245–1253. doi: 10.1101/gr.6406307. - DOI - PubMed
    1. Kazazian H.H. Mobile elements: Drivers of genome evolution. Science. 2004;303:1626–1632. doi: 10.1126/science.1089670. - DOI - PubMed

LinkOut - more resources