Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015;14(9):1476-84.
doi: 10.1080/15384101.2015.1022696.

New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome

Affiliations

New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome

Andrew Garazha et al. Cell Cycle. 2015.

Abstract

Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of "domestication" of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.

Keywords: Database; LTR retrotransposon; endogenous retrovirus; genome browser; human genome; transcription factor binding site.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Correlation between NDT and RT+ for all ERV/LR families. Each data point represents a separate ERV/LR family. NDT is the normalized density of TFBS, and RT+ is the proportion of TFBS-containing elements in a family. “TFBS+/all” means RT+, and “TNT/all” means NDT.
Figure 2.
Figure 2.
Distribution of the individual TFBS and DHS-containing ERV/LR elements is dependent on the number of TFBS per single element. “TFBS consisted HERV” means individual ERV/LR elements containing TFBS, shown in red; “TFBS & DNase I consisted HERV” means individual ERV/LR elements containing both TFBS and DNase I hypersensitivity site(s), shown in blue.
Figure 3.
Figure 3.
Profiling of LTR enhancer activity in luciferase reporter experiments. (A) Schematic representation of the luciferase reporter constructs. Filled arrow – individual LTR elements tested in this assay; empty arrow – SV40 promoter; black bar – luciferase gene; (B) relative enhancer activities for LTR elements 1–12, established in a dual-luciferase assay. Data show means ± standard deviations of 4 independent experiments. Data is shown for the cell lines Tera-1, NT2/D1, A549, NGP127 and HepG2.
Figure 4.
Figure 4.
Proportion of TFBS-containing elements in correlation with the divergence of each ERV/LR family from their consensus sequences. Each data point represents a separate ERV/LR family. “TFBS+/all” means RT+. The divergence is shown as a millidiv score, with each unit equal to one substitution per 1000 nucleotides.
Figure 5.
Figure 5.
Distribution of TFBS for the particular transcription factor proteins among all the ERV/LR elements, in correlation with the divergence of the respective ERV/LR elements from their consensus sequence. The distribution is shown for NF-YA (red) and Rad21 (blue) transcription factor proteins. The divergence is shown as a millidiv score, with each unit equal to one substitution per 1000 nucleotides. The Y-axis is arbitrary and is customized for each transcription factor.
Figure 6.
Figure 6.
Screen shot of the representative HERV/LR browser output page. The user settings were “LTR5Hs” as the repeat family, “chr1” as the chromosome number. The Browser displays all the LTR5Hs inserts on the 1st chromosome, featuring TFBS – positive elements. An option is shown “show list of browser HERVs” that enables listing all the elements of a selected category on the browser screen, as a table supplemented by hyperlinks to the structure of particular each element. The zooming tool is enabled to facilitate navigation.

Similar articles

Cited by

References

    1. Sverdlov ED. Retroviruses and primate evolution. Bioessays 2000; 2:161-71; http://dx.doi.org/10.1002/(SICI)1521-1878(200002)22:2%3c161::AID-BIES7%3... - DOI - PubMed
    1. Buzdin A. Human-specific endogenous retroviruses. Scientific World J 2007; 7:1848-68; PMID:18060323; http://dx.doi.org/10.1100/tsw.2007.270 - DOI - PMC - PubMed
    1. Hohn O, Hanke K, Bannert N. HERV-K(HML-2) the best preserved family of HERVs:Endogenization, expression, and implications in health and disease. Front Oncol 2013; 3:246; PMID:24066280; http://dx.doi.org/10.3389/fonc.2013.00246 - DOI - PMC - PubMed
    1. Dewannieux M, Heidmann T. Endogenous retroviruses:acquisition, amplification and taming of genome invaders. Curr Opin Virol 2013; 6:646-56; http://dx.doi.org/10.1016/j.coviro.2013.08.005 - DOI - PubMed
    1. Buzdin A, Ustyugova S, Khodosevich K, Mamedov I, Lebedev Y, Hunsmann G, Sverdlov E. Human-specific subfamilies of HERV-K (HML-2) long terminal repeats:three master genes were active simultaneously during branching of hominoid lineages. Genomics 2003; 81(2):149-56; PMID:12620392; http://dx.doi.org/10.1016/S0888-7543(02)00027-7 - DOI - PubMed

Publication types

MeSH terms

LinkOut - more resources