. 2014 Jun 2;9(6):e97379.

doi: 10.1371/journal.pone.0097379. eCollection 2014.

Direct squencing from the minimal number of DNA molecules needed to fill a 454 picotiterplate

Mária Džunková¹, Marc Garcia-Garcerà², Llúcia Martínez-Priego³, Giussepe D'Auria¹, Francesc Calafell⁴, Andrés Moya¹

Affiliations

¹ Área de Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública), Valencia, Spain; Instituto Cavanilles de Biodiversidad y Biología Evolutiva, Universitat de València, Valencia, Spain; CIBER en Epidemiología y Salud Pública (CIBEResp), Madrid, Spain.
² Área de Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública), Valencia, Spain; Instituto Cavanilles de Biodiversidad y Biología Evolutiva, Universitat de València, Valencia, Spain; Institut de Biologia Evolutiva, CSIC-Universitat Pompeu Fabra, Barcelona, Spain.
³ Área de Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública), Valencia, Spain; Instituto Cavanilles de Biodiversidad y Biología Evolutiva, Universitat de València, Valencia, Spain.
⁴ Institut de Biologia Evolutiva, CSIC-Universitat Pompeu Fabra, Barcelona, Spain.

PMID: 24887077
PMCID: PMC4041646
DOI: 10.1371/journal.pone.0097379

Direct squencing from the minimal number of DNA molecules needed to fill a 454 picotiterplate

Mária Džunková et al. PLoS One. 2014.

. 2014 Jun 2;9(6):e97379.

doi: 10.1371/journal.pone.0097379. eCollection 2014.

Authors

Mária Džunková¹, Marc Garcia-Garcerà², Llúcia Martínez-Priego³, Giussepe D'Auria¹, Francesc Calafell⁴, Andrés Moya¹

Affiliations

¹ Área de Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública), Valencia, Spain; Instituto Cavanilles de Biodiversidad y Biología Evolutiva, Universitat de València, Valencia, Spain; CIBER en Epidemiología y Salud Pública (CIBEResp), Madrid, Spain.
² Área de Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública), Valencia, Spain; Instituto Cavanilles de Biodiversidad y Biología Evolutiva, Universitat de València, Valencia, Spain; Institut de Biologia Evolutiva, CSIC-Universitat Pompeu Fabra, Barcelona, Spain.
³ Área de Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública), Valencia, Spain; Instituto Cavanilles de Biodiversidad y Biología Evolutiva, Universitat de València, Valencia, Spain.
⁴ Institut de Biologia Evolutiva, CSIC-Universitat Pompeu Fabra, Barcelona, Spain.

PMID: 24887077
PMCID: PMC4041646
DOI: 10.1371/journal.pone.0097379

Erratum in

PLoS One. 2014;9(7):e102719

Abstract

The large amount of DNA needed to prepare a library in next generation sequencing protocols hinders direct sequencing of small DNA samples. This limitation is usually overcome by the enrichment of such samples with whole genome amplification (WGA), mostly by multiple displacement amplification (MDA) based on φ29 polymerase. However, this technique can be biased by the GC content of the sample and is prone to the development of chimeras as well as contamination during enrichment, which contributes to undesired noise during sequence data analysis, and also hampers the proper functional and/or taxonomic assignments. An alternative to MDA is direct DNA sequencing (DS), which represents the theoretical gold standard in genome sequencing. In this work, we explore the possibility of sequencing the genome of Escherichia coli fs 24 from the minimum number of DNA molecules required for pyrosequencing, according to the notion of one-bead-one-molecule. Using an optimized protocol for DS, we constructed a shotgun library containing the minimum number of DNA molecules needed to fill a selected region of a picotiterplate. We gathered most of the reference genome extension with uniform coverage. We compared the DS method with MDA applied to the same amount of starting DNA. As expected, MDA yielded a sparse and biased read distribution, with a very high amount of unassigned and unspecific DNA amplifications. The optimized DS protocol allows unbiased sequencing to be performed from samples with a very small amount of DNA.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Figure 1. Flowchart of the minimal library preparation protocol.**
Panel A: The experimental work started with cell sorting, where 20,000 cells were separated in two replicates to confirm the whole experiment. The DNA from 20,000 cells was extracted and split into halves, where one half was amplified with GenomiPhi (MDA) and a second half was processed without whole genome amplification (DS). The shotgun libraries were prepared with the same alternative protocol for the both MDA and DSsamples. Library quality control points were the test PCR with emPCR primers to prove the removal of self-ligated adaptors and the library concentration checking with qPCR. The MDAsample and DSsample with different MIDs in two repetitions were combined into two sequencing runs as sho! wn in the scheme. Panel B: DNA amount requirements in the standard Rapid Library Preparation Method Manual GS FLX+ Series – XL+ (May 2011) compared with the amounts actually needed for sequencing on a selected PTP region. The minimal amount of prepared library required for proceeding to emPCR step in the standard 454 protocol may lose 99% of input DNA during the library preparation step. Then, this amount is diluted to a working stock of 10–7 molecules, defined as the best starting point to perform the emPCR titration step. However, if the exact number of molecules is quantified with qPCR, the emPCR titration step can be omitted, so actually only 0.13 pg of prepared library are needed for sequencing on 1/8 region of PTP (equivalent to 340,000 ssDNA molecules). This allows to use an alternative shotgun protocol where the DNA losses are reduced.

**Figure 2. Results of *E. coli* genome mapping and blast to NCBI database.**
Proportions (in %) of Mbp mapped by SSAHA2 to *E. coli* genome are shown for MDA and DS sequences, separately for each sequencing run. It can be observed that the percentage of mapped DS reads were significantly higher than the MDA reads. The reads that were not mapped to *E. coli* were analyzed by blast in “nr” database. However, most reads remained unidentified, especially in the case of MDA.

**Figure 3. Distribution of coverage throughout the *E. coli* genome.**
The comparison of the genome coverage obtained by MDA and DS methods. The genome coverage of MDA reads was characterized by unequal distribution with many gaps and several areas with extremely high coverage (up to 121 x), while the highest coverage obtained by DS was only 15 x and it was better distributed throughout the whole genome.

**Figure 4. Clustering analysis of the k-mer abundance distribution.**
Comparison of the relative abundances of 6-mer in the different datasets using hierarchical clustering. As observed, the most likely conformation shows aggregation of *E. coli* with DS methodology, while *B. subtilis* is associated with MDA.

See this image and copyright information in PMC

Cited by

Exploring the human microbiome from multiple perspectives: factors altering its composition and function.
Rojo D, Méndez-García C, Raczkowska BA, Bargiela R, Moya A, Ferrer M, Barbas C. Rojo D, et al. FEMS Microbiol Rev. 2017 Jul 1;41(4):453-478. doi: 10.1093/femsre/fuw046. FEMS Microbiol Rev. 2017. PMID: 28333226 Free PMC article. Review.
Expanding a Wastewater-Based Surveillance Methodology for DNA Isolation from a Workflow Optimized for SARS-CoV-2 RNA Quantification.
Babler KM, Sharkey ME, Amirali A, Boone MM, Comerford S, Currall BB, Grills GS, Laine J, Mason CE, Reding B, Schürer S, Stevenson M, Vidović D, Williams SL, Solo-Gabriele HM. Babler KM, et al. J Biomol Tech. 2023 Dec 20;34(4):3fc1f5fe.dfa8d906. doi: 10.7171/3fc1f5fe.dfa8d906. eCollection 2023 Dec. J Biomol Tech. 2023. PMID: 38268997 Free PMC article.
CleanBar: a versatile demultiplexing tool for split-and-pool barcoding in single-cell omics.
Arnau V, Ortiz-Maiques A, Valero-Tebar J, Mora-Quilis L, Kurmauskaite V, Campos Dopazo L, Domingo-Calap P, Džunková M. Arnau V, et al. ISME Commun. 2025 Aug 1;5(1):ycaf134. doi: 10.1093/ismeco/ycaf134. eCollection 2025 Jan. ISME Commun. 2025. PMID: 40860566 Free PMC article.
Metagenomic assessment of the interplay between the environment and the genetic diversification of Acinetobacter.
Garcia-Garcera M, Touchon M, Brisse S, Rocha EPC. Garcia-Garcera M, et al. Environ Microbiol. 2017 Dec;19(12):5010-5024. doi: 10.1111/1462-2920.13949. Epub 2017 Dec 1. Environ Microbiol. 2017. PMID: 28967182 Free PMC article.
A simple, reproducible and cost-effective procedure to analyse gut phageome: from phage isolation to bioinformatic approach.
d'Humières C, Touchon M, Dion S, Cury J, Ghozlane A, Garcia-Garcera M, Bouchier C, Ma L, Denamur E, P C Rocha E. d'Humières C, et al. Sci Rep. 2019 Aug 5;9(1):11331. doi: 10.1038/s41598-019-47656-w. Sci Rep. 2019. PMID: 31383878 Free PMC article.

See all "Cited by" articles

References

1. Wolinsky H (2007) The thousand-dollar genome. genetic brinkmanship or personalized medicine? EMBO Rep 8: 900–903. - PMC - PubMed
1. Binga EK, Lasken RS, Neufeld JD (2008) Something from (almost) nothing: the impact of multiple displacement amplification on microbial ecology. ISME J 2: 233–241. - PubMed
1. Vlček Č, Pačes V (1986) Nucleotide sequence of the late region of bacillus phage phi 29 completes the 19285-bp sequence of phi 29 genome. Comparison with the homologous sequence of phage PZA. Gene 46: 215–225. - PubMed
1. Dean FB, Nelson JR, Giesler TL, Lasken RS (2001) Rapid amplification of plasmid and phage DNA using Phi29 DNA polymerase and multiply-primed rolling circle amplification. Genome Res 11: 1095–1099. - PMC - PubMed
1. Paez JG, Lin M, Beroukhim R, Lee JC, Zhao X, et al. (2004) Genome coverage and sequence fidelity of phi 29 polymerase-based multiple strand displacement whole genome amplification. Nucleic Acids Res 32: e71. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Associated data

SRA/ERP003418

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Research Materials
- NCI CPTC Antibody Characterization Program
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Direct squencing from the minimal number of DNA molecules needed to fill a 454 picotiterplate

Affiliations

Direct squencing from the minimal number of DNA molecules needed to fill a 454 picotiterplate

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Associated data

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials

Miscellaneous

Erratum in

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Associated data

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials

Miscellaneous