. 2015 May 22;10(5):e0128036.

doi: 10.1371/journal.pone.0128036. eCollection 2015.

Inexpensive multiplexed library preparation for megabase-sized genomes

Michael Baym¹, Sergey Kryazhimskiy², Tami D Lieberman¹, Hattie Chung¹, Michael M Desai³, Roy Kishony⁴

Affiliations

¹ Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America.
² Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America; FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America.
³ Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America; FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America; Department of Physics, Harvard University, Cambridge, Massachusetts, United States of America.
⁴ Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America; Faculty of Biology and Department of Computer Science, Technion-Israel Institute of Technology, Haifa, Israel.

PMID: 26000737
PMCID: PMC4441430
DOI: 10.1371/journal.pone.0128036

Inexpensive multiplexed library preparation for megabase-sized genomes

Michael Baym et al. PLoS One. 2015.

. 2015 May 22;10(5):e0128036.

doi: 10.1371/journal.pone.0128036. eCollection 2015.

Authors

Michael Baym¹, Sergey Kryazhimskiy², Tami D Lieberman¹, Hattie Chung¹, Michael M Desai³, Roy Kishony⁴

Affiliations

¹ Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America.
² Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America; FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America.
³ Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America; FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America; Department of Physics, Harvard University, Cambridge, Massachusetts, United States of America.
⁴ Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America; Faculty of Biology and Department of Computer Science, Technion-Israel Institute of Technology, Haifa, Israel.

PMID: 26000737
PMCID: PMC4441430
DOI: 10.1371/journal.pone.0128036

Erratum in

Correction: Inexpensive Multiplexed Library Preparation for Megabase-Sized Genomes.
PLOS ONE Staff. PLOS ONE Staff. PLoS One. 2015 Jun 18;10(6):e0131262. doi: 10.1371/journal.pone.0131262. eCollection 2015. PLoS One. 2015. PMID: 26086768 Free PMC article. No abstract available.

Abstract

Whole-genome sequencing has become an indispensible tool of modern biology. However, the cost of sample preparation relative to the cost of sequencing remains high, especially for small genomes where the former is dominant. Here we present a protocol for rapid and inexpensive preparation of hundreds of multiplexed genomic libraries for Illumina sequencing. By carrying out the Nextera tagmentation reaction in small volumes, replacing costly reagents with cheaper equivalents, and omitting unnecessary steps, we achieve a cost of library preparation of $8 per sample, approximately 6 times cheaper than the standard Nextera XT protocol. Furthermore, our procedure takes less than 5 hours for 96 samples. Several hundred samples can then be pooled on the same HiSeq lane via custom barcodes. Our method will be useful for re-sequencing of microbial or viral genomes, including those from evolution experiments, genetic screens, and environmental samples, as well as for other sequencing applications including large amplicon, open chromosome, artificial chromosomes, and RNA sequencing.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: This study was partially funding by Hoffman-LaRoche (RK). There are no further declarations relating to employment, consultancy, patents, products in development, or marketed products. This does not alter the authors' adherence to PLOS ONE policies on sharing data and materials. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Figures

**Fig 1. Schematic of library preparation workflow.**

**Fig 2. Dependence of fragment size distribution on input gDNA concentration and bead volume.**
DNA fragment size distribution is affected by starting genomic DNA concentration (rows) as described in Module 1 as well as the relative amount of bead buffer used in PCR clean-up (columns) as described in Module 4. Size distribution is measured by BioAnalyzer and reported in fluorescence units. Data is from *Stenotrophomonas maltophilia* (67% GC, 4.8 Mb genome). At high initial gDNA concentration (1.25 ng/μl) the fragment distribution is right-skewed, though this anomalous peak does not appear to significantly affect sequencing output.

**Fig 3. Distribution of the fraction of unique reads over 261 E. *coli* samples.**
Samples had between 0.2 and 2.8 million reads with 90% of samples having over 1 million reads. Raw reads were filtered and then aligned to a reference genome using bowtie2. Unique reads are those that appear only once in the alignment for a particular sample. These are the reads that remain after use of the rmdup tool in samtools. Non-unique reads arise primarily when the same tagmented fragment is amplified during PCR. A low fraction of non-unique reads implies a diversity of fragments after tagmentation, and that errors introduced during PCR will not reach high frequencies.

**Fig 4. Correspondence between BioAnalyzer traces and the length distribution of aligned reads.**
Panels A-C show three representative BioAnalyzer traces from three sample preparations of S. *maltophilia*. Panels D-F show the corresponding estimated fragment-size distributions (black) and the actual distributions of fragment lengths imputed from alignment to the reference genome (blue). A BioAnalyzer trace *f(x)* shows fluorescence f at fragment length x. However, we are interested in *n(x)*, the (relative) number of fragments n of length x. Since fluorescence of a DNA fragment is proportional to its length, *n(x)* ∝ *f(x) / x*. Note that sequencing can be successful despite the presence of apparently very long fragments (which are likely heteroduplexes) in the BioAnalyzer traces (Panels C and F).

**Fig 5. Size and number of aligned fragments as a function of post-PCR DNA concentration.**
Data is from two plates of E. *coli* samples, with 83 and 95 samples per plate (S2 Table). Input gDNA concentrations ranged from 2 to 25ng/μl, and were standardized to 0.5ng/μl. Based on estimated fragment-length distributions, Plates 1 and 2 were pooled in mass ratio 0.8:1. In this preparation, 2 samples (1%) failed to yield libraries, and 17 (10%) produced low, but usable, numbers of reads (between 0.1 and 0.4 million)

See this image and copyright information in PMC

References

1. Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, et al. (2004) Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 428: 37–43. 10.1038/nature02340 - DOI - PubMed
1. Hegreness M, Kishony R (2007) Analysis of genetic systems using experimental evolution and whole-genome sequencing. Genome Biology 8: 201 10.1186/gb-2007-8-1-201 - DOI - PMC - PubMed
1. Mardis ER (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. - PubMed
1. Bik HM, Porazinska DL, Creer S, Caporaso JG, Knight R, et al. (2012) Sequencing our way towards understanding global eukaryotic biodiversity. Trends in Ecology and Evolution 27: 233–243. 10.1016/j.tree.2011.11.010 - DOI - PMC - PubMed
1. Barrick JE, Lenski RE (2013) Genome dynamics during experimental evolution. Nat Rev Genet 14: 827–839. 10.1038/nrg3564 - DOI - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Inexpensive multiplexed library preparation for megabase-sized genomes

Affiliations

Inexpensive multiplexed library preparation for megabase-sized genomes

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources