Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comment
. 2017 Aug 1;7(4):1-9.
doi: 10.1080/2159256X.2017.1362494. eCollection 2017.

Proteomics technique opens new frontiers in mobilome research

Affiliations
Comment

Proteomics technique opens new frontiers in mobilome research

Andrew D Davidson et al. Mob Genet Elements. .

Abstract

A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.

Keywords: Aedes aegypti; LTR retrotransposon; endogenous retrovirus; mobilome; proteomics informed by transcriptomics (PIT); repetitive element; transposon proteomics.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
PIT is a superior proteomic method for detecting proteins derived from mobile genetic elements. (A) ‘Standard’ workflow for global proteomic analysis. Purified proteins are separated by gel electrophoresis, tryptically digested into <20-residue peptides, and analyzed by LC-MS/MS. Detected peptides are identified from their respective spectra using predicted proteins, peptides, and spectra ultimately derived from genome annotation and/or transposons reference databases. (B) PIT workflow. Experimental samples are split into protein (processed as described in A) and RNA; the latter is subjected to RNA sequencing. Short RNA-Seq reads are bioinformatically assembled into transcripts de novo (without referring to a reference genome), and proteins, peptides, and spectra are predicted from these transcripts to produce a bespoke reference database for identifying peptides. In A & B, solid arrows indicate ‘wet’ experiments; dashed arrows indicate ‘dry’ (bioinformatic) analyses. (C) Bioinformatic identification of transposable elements expressing protein, in this case ‘transposon A’ (LTR retrotransposon used for illustration purposes). (i) Short peptides detected by LC-MS/MS often match multiple transposons sharing short stretches of perfect amino acid conservation, and individual elements deviate from the database consensus, complicating the assignment of peptides to their correct transposon. (ii) In PIT, detected peptides perfectly match their experimentally verified transcripts, allowing protein-producing mobile genetic elements to be accurately identified. When combined with a perfectly matched genome sequence, the precise genomic location of protein-producing transposons can also be determined (see Fig. 2).
Figure 2.
Figure 2.
PIT can identify the specific unique genomic copy of an element that is expressing protein. (A) Number of genomic transposon sequences that precisely match PIT transcripts associated with 2 or more peptides. Transcripts with only one genomic match (purple bars) can be mapped to their exact source. Numbers (x axis) correspond to PIT hit IDs from our associated paper. (B) Physical chromosome map for Ae. aegypti; protein-expressing transposons for which the genomic source could be identified are annotated.

Comment on

Similar articles

Cited by

References

    1. Alzohairy AM, Gyulai G, Jansen RK, Bahieldin A. Transposable elements domesticated and neofunctionalized by eukaryotic genomes. Plasmid. 2013;69:1-15. doi:10.1016/j.plasmid.2012.08.001. PMID:22960324 - DOI - PubMed
    1. Maringer K, Yousuf A, Heesom KJ, Fan J, Lee D, Fernandez-Sesma A, Bessant C, Matthews DA, Davidson AD. Proteomics informed by transcriptomics for characterising active transposable elements and genome annotation in Aedes aegypti. BMC Genomics. 2017;18:1-18. doi:10.1186/s12864-016-3432-5. PMID:28049423 - DOI - PMC - PubMed
    1. Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P, Chalhoub B, Flavell A, Leroy P, Morgante M, Panaud O, et al.. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 2007;8:973-82. doi:10.1038/nrg2165. PMID:17984973 - DOI - PubMed
    1. Nefedova L, Kim A. Mechanisms of LTR-Retroelement transposition: lessons from drosophila melanogaster. Viruses. 2017;9:81. doi:10.3390/v9040081 - DOI - PMC - PubMed
    1. Han JS. Non-long terminal repeat (non-LTR) retrotransposons: mechanisms, recent developments, and unanswered questions. Mob DNA. 2010;1:15. doi:10.1186/1759-8753-1-15. PMID:20462415 - DOI - PMC - PubMed

LinkOut - more resources