Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct 11;10(1):4630.
doi: 10.1038/s41467-019-12520-y.

Contribution of retrotransposition to developmental disorders

Affiliations

Contribution of retrotransposition to developmental disorders

Eugene J Gardner et al. Nat Commun. .

Abstract

Mobile genetic Elements (MEs) are segments of DNA which can copy themselves and other transcribed sequences through the process of retrotransposition (RT). In humans several disorders have been attributed to RT, but the role of RT in severe developmental disorders (DD) has not yet been explored. Here we identify RT-derived events in 9738 exome sequenced trios with DD-affected probands. We ascertain 9 de novo MEs, 4 of which are likely causative of the patient's symptoms (0.04%), as well as 2 de novo gene retroduplications. Beyond identifying likely diagnostic RT events, we estimate genome-wide germline ME mutation rate and selective constraint and demonstrate that coding RT events have signatures of purifying selection equivalent to those of truncating mutations. Overall, our analysis represents a comprehensive interrogation of the impact of retrotransposition on protein coding genes and a framework for future evolutionary and disease studies.

PubMed Disclaimer

Conflict of interest statement

M.E.H. is a co-founder of, consultant to, and holds shares in, Congenica Ltd, a genetics diagnostic company. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The DDD RT call set. ae Histograms of the total number of variants per individual for the four classes of RT events identified in the DDD cohort (Alu: blue; L1: green; SVA: orange; PPGs: red; combined RT events: grey) in size one bins. f Allele frequency distributions for the RT classes depicted in ae in log10 allele frequency bins. g Insert size estimates provided by MELT for the MEI classes ascertained in this study in log10 insert size bins. All plots only include variants from unaffected parents
Fig. 2
Fig. 2
Coding constraint on MEIs. a Cumulative consequence annotations for Alu, L1, and SVA MEIs in all samples (n = 28,132 individuals) analyzed. The majority of variants identified in this study fell within the noncoding space (either an enhancer or intron). b Comparison of constraint between MEIs and SNVs in unaffected parents. To compare the impact of exonic and intronic Alu (blue) and all MEIs (grey) to varying classes of SNVs (black), we used two metrics: the proportion of variants in genes that have been identified as LoF intolerant as gauged by pLI-score (x-axis) and the proportion of variants identified in only one individual (i.e., singletons; y-axis). Error bars indicate 95% confidence intervals based on population proportion; confidence intervals were calculated for SNVs, but are too small to appear at the resolution displayed in this figure
Fig. 3
Fig. 3
RT-derived de novos in the DDD. We identified a total of nine de novo MEIs, four of which disrupted the protein-coding sequence of a known DD gene: a SETD5, b NSD1, c MEF2C, and d ARID2. Shown in each panel is a diagram of the affected gene (blue model) with the relevant insertion indicated with a colored bubble. To the right are PCR validations confirming the de novo status of each mutation; a positive result is indicated by a raised secondary band present only in the proband sample (red arrow). e Circos diagram and PCR results for two identified germ-line de novo PPGs. For each de novo PPG shown is a diagram of the donor gene (gene model), location of duplication as PPG (directional arrow), and new insertion site. Exons from the donor gene included in the PPG are indicated by brackets underneath the donor gene model. To confirm PPG presence, PCR was performed (Methods) on proband, paternal, and maternal gDNA (sample in each lane is shown by pedigree). The band which represents the PPG is marked with a red arrow, and was confirmed via capillary sequencing (Supplementary Fig. 12). Dashed lines indicate intergenic regions, all genes models are shown in sense orientation, and PPG gene diagrams are not to scale
Fig. 4
Fig. 4
Estimating enrichment of deleterious MEIs. Depicted are the total number of expected (black) and observed (red) de novo mutations observed in exons (a) and enhancers (b) for all, high pLI (pLI > 0.9), and known monoallelic DD (MA DDG2P) genes. Expectation is based on the Poisson distribution of 100 simulations utilizing the neutral mutation rate (1.2 × 10–11 μ). P-values are based on the Poisson distribution and used to determine statistical deviation of observed to expected de novo counts for exons and enhancers

References

    1. Mills RE, Bennett EA, Iskow RC, Devine SE. Which transposable elements are active in the human genome? Trends Genet. 2007;23:183–191. doi: 10.1016/j.tig.2007.02.006. - DOI - PubMed
    1. Zhang Z, Harrison PM, Liu Y, Gerstein M. Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome. Genome Res. 2003;13:2541–2558. doi: 10.1101/gr.1429003. - DOI - PMC - PubMed
    1. Stewart C, et al. A comprehensive map of mobile element insertion polymorphisms in humans. PLoS Genet. 2011;7:e1002236. doi: 10.1371/journal.pgen.1002236. - DOI - PMC - PubMed
    1. Sudmant PH, et al. An integrated map of structural variation in 2,504 human genomes. Nature. 2015;526:75–81. doi: 10.1038/nature15394. - DOI - PMC - PubMed
    1. Gardner EJ, et al. The mobile element locator tool (MELT): population-scale mobile element discovery and biology. Genome Res. 2017;27:1916–1929. doi: 10.1101/gr.218032.116. - DOI - PMC - PubMed

Publication types