De novo assembly of human genomes with massively parallel short read sequencing
- PMID: 20019144
- PMCID: PMC2813482
- DOI: 10.1101/gr.097261.109
Abstract
Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the reads they produce are very short, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving N50 contig sizes of 7.4 and 5.9 kilobases (kb) and N50 scaffold sizes of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.
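The N50 figures quoted in the abstract follow the usual definition: the length L such that contigs (or scaffolds) of length at least L together cover at least half of the total assembled length. A minimal sketch of that calculation is given below. The function name `n50` and the toy contig lengths are illustrative assumptions, not taken from the paper or its software.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of contig or scaffold lengths.

    Hypothetical helper for illustration: sorts lengths from longest to
    shortest and returns the length at which the running sum first
    reaches half of the total assembled length.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50 requires at least one length")
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable: the running sum always reaches the total")


if __name__ == "__main__":
    # Toy contig lengths in base pairs (made up for this example).
    toy_contigs = [80, 70, 50, 40, 30, 20]
    print(n50(toy_contigs))
```

With the toy lengths above, the total is 290, and the two longest contigs (80 + 70 = 150) already cover more than half of it, so the N50 is 70. The same function applies to scaffold lengths.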