Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;6(6):e21088.
doi: 10.1371/journal.pone.0021088. Epub 2011 Jun 30.

A flexible approach for highly multiplexed candidate gene targeted resequencing

Affiliations

A flexible approach for highly multiplexed candidate gene targeted resequencing

Georges Natsoulis et al. PLoS One. 2011.

Abstract

We have developed an integrated strategy for targeted resequencing and analysis of gene subsets from the human exome for variants. Our capture technology is geared towards resequencing gene subsets substantially larger than can be done efficiently with simplex or multiplex PCR but smaller in scale than exome sequencing. We describe all the steps from the initial capture assay to single nucleotide variant (SNV) discovery. The capture methodology uses in-solution 80-mer oligonucleotides. To provide optimal flexibility in choosing human gene targets, we designed an in silico set of oligonucleotides, the Human OligoExome, that covers the gene exons annotated by the Consensus Coding Sequencing Project (CCDS). This resource is openly available as an Internet accessible database where one can download capture oligonucleotides sequences for any CCDS gene and design custom capture assays. Using this resource, we demonstrated the flexibility of this assay by custom designing capture assays ranging from 10 to over 100 gene targets with total capture sizes from over 100 Kilobases to nearly one Megabase. We established a method to reduce capture variability and incorporated indexing schemes to increase sample throughput. Our approach has multiple applications that include but are not limited to population targeted resequencing studies of specific gene subsets, validation of variants discovered in whole genome sequencing surveys and possible diagnostic analysis of disease gene subsets. We also present a cost analysis demonstrating its cost-effectiveness for large population studies.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Adjustment of capture oligonucleotides performance.
Pre- and post-adjustment capture oligonucleotides performance of capture assays 1 and 2 are shown. Capture assay 1's target size was 102.48 Kb and this intermediate version of capture assay 2 covered 616 Kb. The Y axis shows the proportions of bases across the target whose fold-coverage can be sorted into each order of magnitude before and after capture adjustment. Nominally, we opted for a sequencing depth between 100 and 1,000 as an adequate representation. In both assays, the proportion of bases whose FC is less than 100 drops significantly; in the case of capture assay 2, the number of bases with excessively high FC has dropped significantly as well.
Figure 2
Figure 2. Evaluation of Alu sequence in non-specific capture.
Ten-thousand consensus length (297 bases) Alu sequences were randomly selected and aligned. The percentage of Alu sequences containing MseI, BfaI, SauIIIA and CviQI sites along the multiple alignment positions is shown. The four most prevalent restriction sites are SauIIIA sites. The two most frequent amongst these are present in 50 to 75% of the Alu sequences. We note that the alignment sequence is much longer than many individual Alu sequences because of insertions and deletions.
Figure 3
Figure 3. Comparison of targeted resequencing of independent samples.
We show an example of a 1,049 base captured region, occurring between coordinates 11096583 and 11097631 of chromosome 1. The fold-coverage from the three samples has been normalized by taking the ratio of fold-coverage at each position to the median depth for the sample, and then taking the log10 of that ratio. Purple lines indicate a capture oligonucleotide's target. The exons are indicated by the blue lines. Vertical lines, extending from the beginning and end of each captured amplicon, show that the discontinuities in depth are associated with the ends of captured targets.

References

    1. Bentley D, Balasubramanian S, Swerdlow H, Smith G, Milton J, et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. - PMC - PubMed
    1. Harris TD, Buzby PR, Babcock H, Beer E, Bowers J, et al. Single-molecule DNA sequencing of a viral genome. Science. 2008;320:106–109. - PubMed
    1. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005;437:376–380. - PMC - PubMed
    1. Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science. 2005;309:1728–1732. - PubMed
    1. Bentley DR. Whole-genome re-sequencing. Curr Opin Genet Dev. 2006;16:545–552. - PubMed

Publication types