The Theory and Practice of Genome Sequence Assembly
- PMID: 25939056
- DOI: 10.1146/annurev-genom-090314-050032
The Theory and Practice of Genome Sequence Assembly
Abstract
The current genomic revolution was made possible by joint advances in genome sequencing technologies and computational approaches for analyzing sequence data. The close interaction between biologists and computational scientists is perhaps most apparent in the development of approaches for sequencing entire genomes, a feat that would not be possible without sophisticated computational tools called genome assemblers (short for genome sequence assemblers). Here, we survey the key developments in algorithms for assembling genome sequences since the development of the first DNA sequencing methods more than 35 years ago.
Keywords: algorithm; bioinformatics; genome sequencing; sequence assembly; shotgun sequencing.
Similar articles
-
Whole genome assembly from 454 sequencing output via modified DNA graph concept.Comput Biol Chem. 2009 Jun;33(3):224-30. doi: 10.1016/j.compbiolchem.2009.04.005. Epub 2009 May 3. Comput Biol Chem. 2009. PMID: 19477687
-
Bioinformatics software for biologists in the genomics era.Bioinformatics. 2007 Jul 15;23(14):1713-7. doi: 10.1093/bioinformatics/btm239. Epub 2007 May 7. Bioinformatics. 2007. PMID: 17485425
-
Comparative analysis of algorithms for whole-genome assembly of pyrosequencing data.Brief Bioinform. 2012 May;13(3):269-80. doi: 10.1093/bib/bbr063. Epub 2011 Oct 21. Brief Bioinform. 2012. PMID: 22021898 Review.
-
PGA4genomics for comparative genome assembly based on genetic algorithm optimization.Genomics. 2009 Oct;94(4):284-6. doi: 10.1016/j.ygeno.2009.06.006. Epub 2009 Jun 30. Genomics. 2009. PMID: 19573591
-
Whole genome sequencing.Methods Mol Biol. 2010;628:215-26. doi: 10.1007/978-1-60327-367-1_12. Methods Mol Biol. 2010. PMID: 20238084 Review.
Cited by
-
In Silico Whole Genome Sequencer and Analyzer (iWGS): a Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies.G3 (Bethesda). 2016 Nov 8;6(11):3655-3662. doi: 10.1534/g3.116.034249. G3 (Bethesda). 2016. PMID: 27638685 Free PMC article.
-
A haplotype-aware de novo assembly of related individuals using pedigree sequence graph.Bioinformatics. 2020 Apr 15;36(8):2385-2392. doi: 10.1093/bioinformatics/btz942. Bioinformatics. 2020. PMID: 31860070 Free PMC article.
-
SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies.Genome Biol. 2019 Dec 16;20(1):277. doi: 10.1186/s13059-019-1911-0. Genome Biol. 2019. PMID: 31842948 Free PMC article.
-
Modern technologies and algorithms for scaffolding assembled genomes.PLoS Comput Biol. 2019 Jun 5;15(6):e1006994. doi: 10.1371/journal.pcbi.1006994. eCollection 2019 Jun. PLoS Comput Biol. 2019. PMID: 31166948 Free PMC article. Review.
-
Next generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities.Brief Bioinform. 2021 Mar 22;22(2):616-630. doi: 10.1093/bib/bbaa297. Brief Bioinform. 2021. PMID: 33279989 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources