Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Sep;24(9):1332-1340.
doi: 10.1038/s41556-022-00970-4. Epub 2022 Aug 25.

Mammalian genome innovation through transposon domestication

Affiliations
Review

Mammalian genome innovation through transposon domestication

Andrew J Modzelewski et al. Nat Cell Biol. 2022 Sep.

Abstract

Since the discovery of transposons, their sheer abundance in host genomes has puzzled many. While historically viewed as largely harmless 'parasitic' DNAs during evolution, transposons are not a mere record of ancient genome invasion. Instead, nearly every element of transposon biology has been integrated into host biology. Here we review how host genome sequences introduced by transposon activities provide raw material for genome innovation and document the distinct evolutionary path of each species.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests

The authors declare no competing interests.

Figures

Figure 1.
Figure 1.. Transposon domestication contributes to host biology.
Transposon domestication provides new mechanisms for host genome innovation in diverse developmental and physiological processes, generating numerous gene regulatory elements, functional ncRNAs and protein-coding genes,,,,,,,,,,,,–,,,,,,,,,,–. The diagram, while likely representing the tip of an iceberg, summarizes key studies that characterize the in vivo validated transposon functions in the host genomes.
Figure 2.
Figure 2.. Transposon-derived gene regulatory elements diversify host gene isoforms and enrich expression regulation modality.
Transposon-derived sequences contribute to gene enhancers, promoters, exons, terminators, splicing donors/acceptors, and chromatin boundaries, regulating the structure and expression of proximal host gene isoforms. TE domestication expands gene regulatory modality, enriches transcript diversity, and diversifies functional reservoirs in host genomes. Pink squares, TE elements; blue squares, protein coding exons or protein coding genes; red hexagons, CTCF; yellow star, a gene regulatory element.
Figure 3.
Figure 3.. Transposons confer unique modes of cis-gene regulation in host genomes.
Compared to other gene regulatory sequences, transposons have distinct properties, as they are repetitive in nature and frequently species-specific. Species-specific gene regulation occurs when distinct TEs integrate proximal to homologous genes across species, generating a unique expression pattern in each species (top). Convergent gene regulation occurs when distinct TE insertions across species converge on the same regulatory principal to yield nearly identical expression patterns (middle). Coordinated gene regulation occurs when related transposon elements from the same TE family spread in a given host genome and land proximal to a cohort of host genes to coordinate their expression (bottom).
Figure 4:
Figure 4:. Co-option of transposon-encoded proteins contributes to new host biology.
A diagram illustrates the functional parallel between the retroviral Gag, Pol and Env proteins and their domesticated counterparts encoded by retrotransposons. Retroviral life cycle (top) begins when retroviruses infect the host cells and integrate into the host genome. Subsequently, the host machineries drives the expression of viral Gag, Pol and Env, allowing the retrovirus to mature before released from the host cells. Here, we show examples of domesticated Gag and Env genes (bottom), which are repurposed for neuronal functions, host defense, and placenta development. The remarkable modern innovations conferred by retrotransposon encoded proteins can be traced back to their proviral functions.

References

    1. McCLINTOCK B The Origin and Behavior of Mutable Loci in Maize. Proc Natl Acad Sci U S A 36, 344 (1950). - PMC - PubMed
    1. Britten RJ & Davidson EH Gene Regulation for Higher Cells: A Theory. Science (1979) 165, 349–357 (1969). - PubMed
    1. Nurk S et al. The complete sequence of a human genome. bioRxiv 2021.05.26.445798 (2021) doi:10.1101/2021.05.26.445798. - DOI
    1. Lander ES et al. Initial sequencing and analysis of the human genome. Nature 2001 409:6822 409, 860–921 (2001). - PubMed
    1. Craig Venter J et al. The sequence of the human genome. Science (1979) 291, 1304–1351 (2001). - PubMed

Publication types

Substances