Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Sep 22:3:e1269.
doi: 10.7717/peerj.1269. eCollection 2015.

PopAlu: population-scale detection of Alu polymorphisms

Affiliations

PopAlu: population-scale detection of Alu polymorphisms

Yu Qian et al. PeerJ. .

Abstract

Alu elements are sequences of approximately 300 basepairs that together comprise more than 10% of the human genome. Due to their recent origin in primate evolution some Alu elements are polymorphic in humans, present in some individuals while absent in others. We present PopAlu, a tool to detect polymorphic Alu elements on a population scale from paired-end sequencing data. PopAlu uses read pair distance and orientation as well as split reads to identify the location and precise breakpoints of polymorphic Alus. Genotype calling enables us to differentiate between homozygous and heterozygous carriers, making the output of PopAlu suitable for use in downstream analyses such as genome-wide association studies (GWAS). We show on a simulated dataset that PopAlu calls Alu elements inserted and deleted with respect to a reference genome with high accuracy and high precision. Our analysis of real data of a human trio from the 1000 Genomes Project confirms that PopAlu is able to produce highly accurate genotype calls. To our knowledge, PopAlu is the first tool that identifies polymorphic Alu elements from multiple individuals simultaneously, pinpoints the precise breakpoints and calls genotypes with high accuracy.

Keywords: Alu elements; Mobile element insertion; Paired-end sequencing; Polymorphism genotyping; Structural variation.

PubMed Disclaimer

Conflict of interest statement

Birte Kehr and Bjarni V. Halldórsson are employees of deCODE genetics/Amgen.

Figures

Figure 1
Figure 1. Example read alignments at an Alu deletion site.
Arrows show read directions. The blue part of the reads can be mapped to the reference outside of the Alu and the red part can be mapped to the Alu. (A) shows example reads from a haplotype that carries allele H1. (B) shows example reads from a haplotype that carries allele H0. A heterozygote diploid can have reads shown in both (A) and (B).
Figure 2
Figure 2. Example read alignment at an Alu insertion site.
Arrows show read directions. The blue part of the reads can be mapped to the reference and the red parts are clipped or mapped somewhere else in the reference. (A) shows example reads from a non-Alu haplotype. (B) shows example reads from an Alu insertion haplotype. A heterozygote diploid can have reads shown in both (A) and (B).
Figure 3
Figure 3. Example instance of our two-level voting system that determines the exact breakpoints of an Alu insertion.
At the first level, split-reads vote for a left and right breakpoint position within each individual. At the second level, individuals vote for the positions that received the largest numbers of votes at the first level to choose the final breakpoint positions AL and AR. In this example, position b is elected as AR and position e is elected as AL.

References

    1. Batzer M, Deininger P. Alu repeats and human genomic diversity. Nature Reviews Genetics. 2002;3(5):370–379. doi: 10.1038/nrg798. - DOI - PubMed
    1. Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nature Reviews Genetics. 2009;10(10):691–703. doi: 10.1038/nrg2640. - DOI - PMC - PubMed
    1. David M, Mustafa H, Brudno M. Detecting Alu insertions from high-throughput sequencing data. Nucleic Acids Research. 2013;41(17):e1269. doi: 10.1093/nar/gkt612. - DOI - PMC - PubMed
    1. De Andrade A, Wang M, Bonaldo MF, Xie H, Soares MB. Genetic and epigenetic variations contributed by Alu retrotransposition. BMC Genomics. 2011;12(1):617. doi: 10.1186/1471-2164-12-617. - DOI - PMC - PubMed
    1. Deininger P. Alu elements: know the SINEs. Genome Biology. 2011;12(12) doi: 10.1186/gb-2011-12-12-236. Article 236. - DOI - PMC - PubMed

LinkOut - more resources