Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May 22;15(1):394.
doi: 10.1186/1471-2164-15-394.

GAP-Seq: a method for identification of DNA palindromes

Affiliations

GAP-Seq: a method for identification of DNA palindromes

Hui Yang et al. BMC Genomics. .

Abstract

Background: Closely spaced long inverted repeats, also known as DNA palindromes, can undergo intrastrand annealing to form DNA hairpins. The ability to form these hairpins results in genome instability, difficulties in maintaining clones in Escherichia coli and major problems for most DNA sequencing approaches. Because of their role in genomic instability and gene amplification in some human cancers, it is important to develop systematic approaches to detect and characterize DNA palindromes.

Results: We developed a new protocol to identify palindromes that couples the S1 nuclease treated Cot0 DNA (GAPF) with high-throughput sequencing (GAP-Seq). Unlike earlier protocols, it does not involve restriction enzymatic digestion prior to DNA snap-back thereby preserving longer DNA sequences. It also indicates the location of the novel junction, which can then be recovered. Using MCF-7 breast cancer cell line as the proof-of-principle analysis, we have identified 35 palindrome candidates and physically characterized the top 5 candidates and their junctions. Because this protocol eliminates many of the false positives that plague earlier techniques, we have improved palindrome identification.

Conclusions: The GAP-Seq approach underscores the importance of developing new tools for identifying and characterizing palindromes, and provides a new strategy to systematically assess palindromes in genomes. It will be useful for studying human cancers and other diseases associated with palindromes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
High-throughput next generation sequencing of fast annealing DNA treated by S1 nuclease. (A) Sequencing library construction. We prepared samples for sequencing based on efficient intrastrand base pairing of palindromes. Briefly, the genomic DNA was denatured and rapidly reannealed. After single-strand specific S1 nuclease digestion, DNA was extracted by phenol-chloroform and libraries were prepared for 454 and Illumina sequencing. (B) Data analysis. 1: Mapping of unique reads to human reference genome (NCBI36/hg18). Black lines represent mapped unique reads. Hatched rectangles represent regions masked by RepeatMasker (M). 2: Assembly of uniquely mapped reads to identify contiguous regions (contigs, black rectangles), and calculation of base read ratio (B) of each contig. For a contig region “c” with n mapped reads, formula image . 3: Clustering two or more contigs with a base read ratio >1.5 that are within 7.5 kb of each other to make a joined contig. 4: Determining Rank “R” that is the sum of all read lengths in the joined contig divided by the length of the joined contig minus the masked regions formula image.
Figure 2
Figure 2
Palindrome mapping strategy. (A) Read density distribution in Chr15q21.1: 47,529,204-47,550,373 region shown as 1 kb bins. (B) qPCR analysis to monitor for palindrome enrichment and determine the directionality of the Chr15q21.1 palindrome. We calculated the amount of depletion of a specific TaqMan primer set region based on Ct value before and after GAPF protocol in both IMR-90 and MCF-7 samples. The fold enrichment is based on comparing the fold depletion among different primer sets (P1, P2, P3 and P4) relative to a single copy sequence in the genome (RAD52). The location of TaqMan primer sets P1, P2, P3 and P4 is indicated in Figure 2C. (C) Map of genomic region Chr15: 47,520,000-47,550,000 with restriction sites and primer locations. (D) Southern blot analysis. Genomic DNA IMR-90 and MCF-7 cells was digested with BamHI, BglII or NcoI. Asterisks (*) mark the rearranged bands from MCF-7 genomic DNA. (E) Snap-back (SB) southern blots of BamHI digested IMR-90 and MCF-7 DNA. Arrowhead indicates the half sized fragments after snap-back in MCF-7. (F) Inversion-PCR. Three primers all from the same strand in normal genomic DNA were used for PCR (Primers 1–3). Since primers 1 and 2 are located in the palindromic region, they can also be used as reverse primers. Because primer 3 is in the spacer, it is able to produce a PCR product with primer 1 or 3 containing the novel junction “J” as indicated in the figure.
Figure 3
Figure 3
Schematics and sequence of palindrome junctions. Sequence analysis of the palindromic junctions identified 7 novel junctions. In each of the aligned breakpoint sequences, lowercase letters are Repeat-masker masked sequences. Uppercase letters represent unique sequences. Microhomology at the breakpoints is shown as bold letters. Insertion of Chr16 fragment at Chr15q21.1 spacer is shown as a black rectangle and deletion at Chr8q21.2 spacer is shown as a triangle in the schematics.
Figure 4
Figure 4
Summary of Affymetrix SNP6 copy number gains and palindrome candidates from Roche 454 sequencing of MCF-7. Key: red triangle-palindrome candidates from Roche 454 sequencing; green line-copy number gains based on Affymetrix SNP6 analysis; blue rectangle-centromere region. The data used to generate this figure are shown in Additional file 4: Table S3.

Similar articles

Cited by

References

    1. Lewis SM, Cote AG. Palindromes and genomic stress fractures: bracing and repairing the damage. DNA Repair (Amst) 2006;5(9–10):1146–1160. doi: 10.1016/j.dnarep.2006.05.014. - DOI - PubMed
    1. Lobachev KS, Rattray A, Narayanan V. Hairpin- and cruciform-mediated chromosome breakage: causes and consequences in eukaryotic cells. Front Biosci. 2007;12:4208–4220. doi: 10.2741/2381. - DOI - PubMed
    1. Inagaki H, Ohye T, Kogo H, Kato T, Bolor H, Taniguchi M, Shaikh TH, Emanuel BS, Kurahashi H. Chromosomal instability mediated by non-B DNA: cruciform conformation and not DNA sequence is responsible for recurrent translocation in humans. Genome Res. 2009;19(2):191–198. doi: 10.1101/gr.079244.108. - DOI - PMC - PubMed
    1. Leach DR. Long DNA palindromes, cruciform structures, genetic instability and secondary structure repair. Bioessays. 1994;16(12):893–900. doi: 10.1002/bies.950161207. - DOI - PubMed
    1. Tanaka H, Yao MC. Palindromic gene amplification–an evolutionarily conserved role for DNA inverted repeats in the genome. Nat Rev Cancer. 2009;9(3):216–224. doi: 10.1038/nrc2591. - DOI - PubMed

Publication types