Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2010;11(12):R119.
doi: 10.1186/gb-2010-11-12-r119. Epub 2010 Dec 8.

Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition

Affiliations
Comparative Study

Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition

Andrew Adey et al. Genome Biol. 2010.

Abstract

We characterize and extend a highly efficient method for constructing shotgun fragment libraries in which transposase catalyzes in vitro DNA fragmentation and adaptor incorporation simultaneously. We apply this method to sequencing a human genome and find that coverage biases are comparable to those of conventional protocols. We also extend its capabilities by developing protocols for sub-nanogram library construction, exome capture from 50 ng of input DNA, PCR-free and colony PCR library construction, and 96-plex sample indexing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Methods for constructing in vitro fragment libraries. (a) In the conventional protocol, mechanical or endonuclease fragmentation is followed by end-polishing, A-tailing, adaptor ligation and PCR. (b) With transposase-mediated adaptor insertion, fragmentation and adaptor insertion occur in a single 5-min in vitro step, followed by PCR. For both methods, a primer-embedded sample-specific barcode can be incorporated during PCR amplification (black triangle). Dark blue: Genomic DNA. Light green: End repaired sequence. Red: A-tail. Magenta/dark green and purple/dark green: Adaptors. Mid blue/brown/orange: Transposase adaptors. Cyan/light green triangles: Endonuclease fragmentation. Grey curved dotted lines: Sonication. Grey hexagon: Transposase.
Figure 2
Figure 2
Schematic of steps associated with different library preparation methods. Transposase-catalyzed adaptor insertion significantly reduces the number of steps and time associated with library construction (green path).
Figure 3
Figure 3
Comparison of coverage bias. (a) Coverage distribution across the E. coli genome with transposase (blue), sonication (red), and endonuclease (green) methods (solid lines) and replicates (dotted lines), normalized for total sequencing depth. (b) Coverage distribution across the PA1 and CRW10 bacteriophage genomes with transposase (blue), nebulization (red), and endonuclease (green) methods (dotted lines represent replicate libraries). (c) G+C bias for E. coli was assessed by calculating G+C content of the reference in 500 bp bins and plotting the coverage in each for transposase (blue), sonication (red), and endonuclease (green) methods, all of which show an approximately equivalent bias against the extremes.
Figure 4
Figure 4
Insert size showing steric hindrance. (a) Insert size was generated from libraries spiked into a paired-end 101 bp run resulting in a large proportion of reads reading into the adaptor sequence. Tails of reads were then aligned to one another to discern the insert size between adaptors, resulting in a mapping-independent insert size at the lower extreme. All reads with an insert size less than 25 bp were PCR artifacts. (b) The noticeable drop below 40 bp is consistent with a model for complete saturation of transposition events on a given stretch of DNA. The roughly 110 Å transposase homodimer (grey) is bound to genomic DNA (blue), such that the core of the enzyme acts on a 9 bp region drawn out to 41 Å as well as approximately 10 additional bases of DNA flanking either side (~34 Å each) that are essentially protected from a subsequent transposase attack due to steric hindrance. Since the core region is duplicated in the process, the minimum spacing of transposition events is approximately 38 bp.
Figure 5
Figure 5
Sequence coverage of human and Drosophila. (a) Coverage distribution as a percentage of the genome for human (YH1) using transposase (dark blue, autosomes; light blue, sex chromosomes) and sonication [15] (down-sampled to equivalent coverage; red, autosomes; orange, sex chromosomes) methods. Poisson (no bias) distributions (gray) with λ = 12 (sex chromosomes) and λ = 24 (autosomes) are also shown. Poisson distribution is the expected if there were absolutely no bias. (b) Coverage distribution as a percentage of Drosophila autosomes using transposase (blue, down-sampled to equivalent coverage) and sonication (Drosophila Population Genomics Project (DPGP), red) methods, as well as Poisson distribution with λ = 12 (gray). (c,d) Coverage with respect to G+C content of the reference in 10 kb or 1 kb bins for (c) human (YH1) and (d) Drosophila genomes respectively, for transposase (blue) and sonication (red) methods at comparable global genomic coverage.
Figure 6
Figure 6
Library complexity. Library complexity for each library shown by incremental, random sampling of 50,000 reads, without replacement, and plotting (on log-log scale) the number of uniquely occurring read-pairs with respect to total number of sampled read-pairs. Species: DM, Drosophila; EC, E. coli; NA18507 and YH1, human. Methods for fragmentation: End., endonuclease; Son., sonication; Tr., transposase. Size selection ranges are given for the YH1 libraries (all these were generated using transposase. Libararies ending in "2" are replicate libraries. 100% uniqueis in gray, i.e. the distribution if there were no duplicates of any sort.
Figure 7
Figure 7
PCR-free reduction in G+C coverage bias and direct-from-colony coverage distribution. (a) Coverage with respect to G+C content in E. coli with and without PCR was assessed by calculating G+C content of the reference in 500 bp bins and plotting the coverage in each for transposase after PCR (left) and with the PCR-free method (right). A significant reduction in coverage bias at the extremes of G+C content is observed. (b) Coverage distribution for E. coli library prepared directly from cell lysate without purification.

References

    1. Kahvejian A, Quackenbush J, Thompson JF. What would you do if you could sequence everything? Nat Biotechnol. 2008;26:1125–1133. doi: 10.1038/nbt1494. - DOI - PMC - PubMed
    1. Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol. 2008;26:1135–1145. doi: 10.1038/nbt1486. - DOI - PubMed
    1. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, Hall KP, Evers DJ, Barnes CL, Bignell HR, Boutell JM, Bryant J, Carter RJ, Keira Cheetham R, Cox AJ, Ellis DJ, Flatbush MR, Gormley NA, Humphray SJ, Irving LJ, Karbelashvili MS, Kirk SM, Li H, Liu X, Maisinger KS, Murray LJ, Obradovic B, Ost T, Parkinson ML, Pratt MR. et al.Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. doi: 10.1038/nature07517. - DOI - PMC - PubMed
    1. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J. et al.Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005;437:376–380. - PMC - PubMed
    1. McKernan KJ, Peckham HE, Costa GL, McLaughlin SF, Fu Y, Tsung EF, Clouser CR, Duncan C, Ichikawa JK, Lee CC, Zhang Z, Ranade SS, Dimalanta ET, Hyland FC, Sokolsky TD, Zhang L, Sheridan A, Fu H, Hendrickson CL, Li B, Kotler L, Stuart JR, Malek JA, Manning JM, Antipova AA, Perez DS, Moore MP, Hayashibara KC, Lyons MR, Beaudoin RE. et al.Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. Genome Res. 2009;19:1527–1541. doi: 10.1101/gr.091868.109. - DOI - PMC - PubMed

Publication types

LinkOut - more resources