Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Feb;18(1):31-8.
doi: 10.1093/dnares/dsq029. Epub 2010 Dec 16.

Multiplex target enrichment using DNA indexing for ultra-high throughput SNP detection

Affiliations

Multiplex target enrichment using DNA indexing for ultra-high throughput SNP detection

Elaine M Kenny et al. DNA Res. 2011 Feb.

Abstract

Screening large numbers of target regions in multiple DNA samples for sequence variation is an important application of next-generation sequencing but an efficient method to enrich the samples in parallel has yet to be reported. We describe an advanced method that combines DNA samples using indexes or barcodes prior to target enrichment to facilitate this type of experiment. Sequencing libraries for multiple individual DNA samples, each incorporating a unique 6-bp index, are combined in equal quantities, enriched using a single in-solution target enrichment assay and sequenced in a single reaction. Sequence reads are parsed based on the index, allowing sequence analysis of individual samples. We show that the use of indexed samples does not impact on the efficiency of the enrichment reaction. For three- and nine-indexed HapMap DNA samples, the method was found to be highly accurate for SNP identification. Even with sequence coverage as low as 8x, 99% of sequence SNP calls were concordant with known genotypes. Within a single experiment, this method can sequence the exonic regions of hundreds of genes in tens of samples for sequence and structural variation using as little as 1 μg of input DNA per sample.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Experimental Design. Genomic DNA from nine HapMap samples was chosen for the study (three trio families). DNA from one of the samples (NA11881) was prepared twice (with and without an indexed adapter), target enriched and sequenced separately as single samples (non-indexed sample and one indexed sample in Step 1 and enriched libraries 1 and 2 in Step 2). One trio family (NA11881, NA11882 and NA10859; all indexed) was pooled after the Illumina genomic DNA sample prep and enriched together using one SureSelect enrichment reaction to produce the enriched library 3 sample. Indexed DNA from all nine samples was also pooled after the Illumina genomic DNA sample prep and enriched together using one SureSelect enrichment reaction to produce the enriched library 4 sample. Note: enriched libraries 3 and 4 were also sequenced using 80 bp reads to generate additional data for validation of the method for SNP detection.
Figure 2.
Figure 2.
Sequence coverage across on-target and off-target regions. Sequence coverage is plotted for the single non-indexed and indexed samples at an on-target site (PTBP2 on chromosome 1; A) and at an off-target site (chromosome 12; B). Inclusion of the index does not dramatically change the pattern of sequence coverage at on-target or off-target regions. The higher sequence coverage observed for the non-indexed sample compared with the indexed sample reflects the larger number of clusters that passed QC filtering during the sequence run (Supplementary Table SB).
Figure 3.
Figure 3.
Percentage of sequence reads per indexed sample in sequenced libraries. Percentage distribution per sample of sequence reads (pre-alignment to reference genome; 40 and 80 bp data combined) for the three-index (A) and the nine-index (B) sample libraries. The relative underperformance of sample NA10859 in the three-index library is not observed in the nine-index library and is unlikely to be due to a systemtaic problem with the ACACAT index.

References

    1. Dahl F., Gullberg M., Stenberg J., Landegren U., Nilsson M. Multiplex amplification enabled by selective circularization of large sets of genomic DNA fragments. Nucleic Acids Res. 2005;33:e71. doi:10.1093/nar/gni070. - DOI - PMC - PubMed
    1. Landegren U., Schallmeiner E., Nilsson M., et al. Molecular tools for a molecular medicine: analyzing genes, transcripts and proteins using padlock and proximity probes. J. Mol. Recognit. 2004;17:194–7. doi:10.1002/jmr.664. - DOI - PubMed
    1. Li J.B., Gao Y., Aach J., et al. Multiplex padlock targeted sequencing reveals human hypermutable CpG variations. Genome Res. 2009;19:1606–15. doi:10.1101/gr.092213.109. - DOI - PMC - PubMed
    1. Lovett M., Kere J., Hinton L.M. Direct selection: a method for the isolation of cDNAs encoded by large genomic regions. Proc. Natl Acad. Sci. USA. 1991;88:9628–32. doi:10.1073/pnas.88.21.9628. - DOI - PMC - PubMed
    1. Porreca G.J., Zhang K., Li J.B., et al. Multiplex amplification of large sets of human exons. Nat. Methods. 2007;4:931–6. doi:10.1038/nmeth1110. - DOI - PubMed

Publication types

MeSH terms