Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Dec;23(12):2042-52.
doi: 10.1101/gr.154625.113. Epub 2013 Sep 11.

Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division

Affiliations

Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division

Alexej Abyzov et al. Genome Res. 2013 Dec.

Abstract

In primates and other animals, reverse transcription of mRNA followed by genomic integration creates retroduplications. Expressed retroduplications are either "retrogenes" coding for functioning proteins, or expressed "processed pseudogenes," which can function as noncoding RNAs. To date, little is known about the variation in retroduplications in terms of their presence or absence across individuals in the human population. We have developed new methodologies that allow us to identify "novel" retroduplications (i.e., those not present in the reference genome), to find their insertion points, and to genotype them. Using these methods, we catalogued and analyzed 174 retroduplication variants in almost one thousand humans, which were sequenced as part of Phase 1 of The 1000 Genomes Project Consortium. The accuracy of our data set was corroborated by (1) multiple lines of sequencing evidence for retroduplication (e.g., depth of coverage in exons vs. introns), (2) experimental validation, and (3) the fact that we can reconstruct a correct phylogenetic tree of human subpopulations based solely on retroduplications. We also show that parent genes of retroduplication variants tend to be expressed at the M-to-G1 transition in the cell cycle and that M-to-G1 expressed genes have more copies of fixed retroduplications than genes expressed at other times. These findings suggest that cell division is coupled to retrotransposition and, perhaps, is even a requirement for it.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Approach for novel retroduplication discovery. (A) If an analyzed genome has an unknown (i.e., absent from reference genome) retroduplication, then sequencing reads originating from the retroduplication can be used for it to be discovered. (B) Reads aligned to the reference genome provide three lines of evidence for the novel retroduplication: reads clustering around the insertion point, increased read depth in exons, and mapping of unaligned reads to a splice-junction library. (C) The existence of a novel retroduplication for the SKA3 gene in the CEU trio is supported by the three lines of evidence. The retroduplication is polymorphic as it is not present in the mother's genome. (D) PCR validation strategy. Two sets of primers test for the presence of a splice junction and for the insertion point, respectively. (E) Existence of novel retroduplication for SKA3 is validated in the daughter's and father's genomes but not in the mother's. (F) The novel retroduplication for SKA3 is polymorphic in the CEU population, as PCR across the insertion point yields a product in only some of the individuals tested.
Figure 2.
Figure 2.
Frequency of novel retroduplications by populations. Most of the novel retroduplications are discovered in only one population (due to conservative calling) but are present in a few more as evident from genotyping. The phylogenetic tree was constructed based on the overlap of novel retroduplications between different populations. The tree shows that, except in one case, populations separate perfectly by continental groups. Outlier clustering of the Finnish population (FIN) is likely due to its distinct data properties allowing discovery of more unique retroduplications (see text). Admixed populations and the Iberian population (with just a few sequenced individuals) were excluded from phylogenetic analysis.
Figure 3.
Figure 3.
Enrichment of parent genes for expression at different phases of the cell cycle. A list of periodic genes was produced previously (Whitfield et al. 2002) and downloaded from Cyclebase (Gauthier et al. 2010). (A) Genes with RDVs and recent retroduplication in the reference genome are significantly enriched (with P-values = 0.012 and 0.008, respectively, and denoted by [*]), for expression in the M and M/G1 cell cycle phases. M/G1 represents genes with uncertain phase assignment close to the M-to-G1 transition due to measurement imprecision. During M/G1, cell division occurs (red horizontal bar). Due to the saturation effect (see text), the enrichment for parent genes being expressed during M or M/G1 is not obvious when analyzing all known retroduplications. (B) Average number of retroduplications in the reference genome per gene (y-axis) is depicted for periodic genes with maximum expression at particular cell cycle phases (x-axis). Genes expressed in M and M/G1 phases generate significantly more (P-value = 0.0047) retroduplications than genes expressed during other phases, suggesting that cell cycle timing directly relates to retroduplication frequency.

Similar articles

Cited by

References

    1. The 1000 Genomes Project Consortium 2012. An integrated map of genetic variation from 1,092 human genomes. Nature 491: 56–65 - PMC - PubMed
    1. Abyzov A, Gerstein M 2011. AGE: Defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision. Bioinformatics 27: 595–603 - PMC - PubMed
    1. Abyzov A, Urban AE, Snyder M, Gerstein M 2011. CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res 21: 974–984 - PMC - PubMed
    1. Baertsch R, Diekhans M, Kent WJ, Haussler D, Brosius J 2008. Retrocopy contributions to the evolution of the human genome. BMC Genomics 9: 466. - PMC - PubMed
    1. Bai Y, Casola C, Feschotte C, Betrán E 2007. Comparative genomics reveals a constant rate of origination and convergent acquisition of functional retrogenes in Drosophila. Genome Biol 8: R11. - PMC - PubMed

Publication types

LinkOut - more resources