A new strategy for genome assembly using short sequence reads and reduced representation libraries
- PMID: 20123915
- PMCID: PMC2813480
- DOI: 10.1101/gr.097956.109
Abstract
We have developed a novel approach for using massively parallel short-read sequencing to generate fast and inexpensive de novo genomic assemblies comparable to those generated by capillary-based methods. The ultrashort (<100-base) sequences generated by this technology pose specific biological and computational challenges for de novo assembly of large genomes. To account for this, we devised a method for experimentally partitioning the genome using reduced representation (RR) libraries prior to assembly. We use two restriction enzymes independently to create a series of overlapping fragment libraries, each containing a tractable subset of the genome. Together, these libraries allow us to reassemble the entire genome without the need for a reference sequence. As proof of concept, we applied this approach to sequence and assemble the majority of the 125-Mb Drosophila melanogaster genome. We subsequently demonstrate the accuracy of our assembly method with meaningful comparisons against the currently available D. melanogaster reference genome (dm3). The ease of assembly and accuracy for comparative genomics suggest that our approach will scale to future mammalian genome-sequencing efforts, saving both time and money without sacrificing quality.
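The partitioning idea in the abstract can be illustrated with a small in silico digest. This is a minimal sketch, not the authors' pipeline: the abstract does not name the two enzymes, so the EcoRI (GAATTC) and HindIII (AAGCTT) recognition sites used here are illustrative choices, and the size-bin edges, the function names (`digest`, `size_bins`, `spanned_cut_sites`), and the random test genome are all assumptions. Size selection is the usual way a reduced representation library is defined, but the specific fractions are hypothetical. The last function shows one reading of why two independent digests help: a fragment from one digest can span a cut site of the other, so the two sets of fragments overlap.

```python
import random


def digest(sequence, site, cut_offset=0):
    """Cut `sequence` at every occurrence of `site`.

    Returns fragments as (start, end) half-open intervals. `cut_offset` is the
    position of the cut within the recognition site (assumed single-strand model).
    """
    fragments = []
    start = 0
    pos = sequence.find(site)
    while pos != -1:
        cut = pos + cut_offset
        if cut > start:  # skip an empty fragment at the very start
            fragments.append((start, cut))
            start = cut
        pos = sequence.find(site, pos + 1)
    if start < len(sequence):
        fragments.append((start, len(sequence)))
    return fragments


def size_bins(fragments, edges):
    """Group fragments into libraries by length.

    `edges` [a, b, c] defines the bins [a, b) and [b, c); these bin edges are
    hypothetical and stand in for gel-based size selection.
    """
    libraries = {(lo, hi): [] for lo, hi in zip(edges, edges[1:])}
    for start, end in fragments:
        length = end - start
        for lo, hi in libraries:
            if lo <= length < hi:
                libraries[(lo, hi)].append((start, end))
                break
    return libraries


def spanned_cut_sites(fragments_a, fragments_b):
    """Count internal cut sites of digest A lying strictly inside a fragment of digest B."""
    # The last fragment ends at the end of the sequence, which is not a cut site.
    cuts_a = [end for _, end in fragments_a[:-1]]
    return sum(
        any(start < cut < end for start, end in fragments_b)
        for cut in cuts_a
    )


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the toy genome is reproducible
    genome = "".join(rng.choice("ACGT") for _ in range(200_000))

    frags_a = digest(genome, "GAATTC", cut_offset=1)  # EcoRI site, illustrative
    frags_b = digest(genome, "AAGCTT", cut_offset=1)  # HindIII site, illustrative

    for name, frags in (("enzyme A", frags_a), ("enzyme B", frags_b)):
        libs = size_bins(frags, [500, 2000, 5000, 20000])
        sizes = {bin_: len(members) for bin_, members in libs.items()}
        print(name, "fragments:", len(frags), "per size bin:", sizes)

    print("A cut sites spanned by B fragments:", spanned_cut_sites(frags_a, frags_b))
```

Each size bin holds only a fraction of the fragments, which is the sense in which a library is a tractable subset of the genome. Fragments outside every bin are simply left out in this sketch.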