Review of general algorithmic features for genome assemblers for next generation sequencers
- PMID: 22768980
- PMCID: PMC5054208
- DOI: 10.1016/j.gpb.2012.05.006
Review of general algorithmic features for genome assemblers for next generation sequencers
Abstract
In the realm of bioinformatics and computational biology, the most rudimentary data upon which all the analysis is built is the sequence data of genes, proteins and RNA. The sequence data of the entire genome is the solution to the genome assembly problem. The scope of this contribution is to provide an overview on the art of problem-solving applied within the domain of genome assembly in the next-generation sequencing (NGS) platforms. This article discusses the major genome assemblers that were proposed in the literature during the past decade by outlining their basic working principles. It is intended to act as a qualitative, not a quantitative, tutorial to all working on genome assemblers pertaining to the next generation of sequencers. We discuss the theoretical aspects of various genome assemblers, identifying their working schemes. We also discuss briefly the direction in which the area is headed towards along with discussing core issues on software simplicity.
Copyright © 2012 Beijing Institute of Genomics, Chinese Academy of Sciences. Published by Elsevier Ltd. All rights reserved.
Figures





























Similar articles
-
Next-generation sequence assembly: four stages of data processing and computational challenges.PLoS Comput Biol. 2013;9(12):e1003345. doi: 10.1371/journal.pcbi.1003345. Epub 2013 Dec 12. PLoS Comput Biol. 2013. PMID: 24348224 Free PMC article. Review.
-
Genome assembly reborn: recent computational challenges.Brief Bioinform. 2009 Jul;10(4):354-66. doi: 10.1093/bib/bbp026. Epub 2009 May 29. Brief Bioinform. 2009. PMID: 19482960 Free PMC article.
-
Sequence assembly using next generation sequencing data--challenges and solutions.Sci China Life Sci. 2014 Nov;57(11):1140-8. doi: 10.1007/s11427-014-4752-9. Epub 2014 Oct 17. Sci China Life Sci. 2014. PMID: 25326069 Review.
-
A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies.PLoS One. 2011 Mar 14;6(3):e17915. doi: 10.1371/journal.pone.0017915. PLoS One. 2011. PMID: 21423806 Free PMC article.
-
Comparing de novo genome assembly: the long and short of it.PLoS One. 2011 Apr 29;6(4):e19175. doi: 10.1371/journal.pone.0019175. PLoS One. 2011. PMID: 21559467 Free PMC article.
Cited by
-
Optimal reference sequence selection for genome assembly using minimum description length principle.EURASIP J Bioinform Syst Biol. 2012 Nov 27;2012(1):18. doi: 10.1186/1687-4153-2012-18. EURASIP J Bioinform Syst Biol. 2012. PMID: 23186305 Free PMC article.
-
Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.Funct Integr Genomics. 2022 Feb;22(1):3-26. doi: 10.1007/s10142-021-00810-y. Epub 2021 Oct 18. Funct Integr Genomics. 2022. PMID: 34657989 Review.
-
GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data.Front Microbiol. 2016 Mar 4;7:269. doi: 10.3389/fmicb.2016.00269. eCollection 2016. Front Microbiol. 2016. PMID: 26973638 Free PMC article.
-
SplitStrains, a tool to identify and separate mixed Mycobacterium tuberculosis infections from WGS data.Microb Genom. 2021 Jun;7(6):000607. doi: 10.1099/mgen.0.000607. Microb Genom. 2021. PMID: 34165419 Free PMC article.
-
The A, C, G, and T of Genome Assembly.Biomed Res Int. 2016;2016:6329217. doi: 10.1155/2016/6329217. Epub 2016 May 10. Biomed Res Int. 2016. PMID: 27247941 Free PMC article. Review.
References
-
- Oxford Molecular Group PLC. AssemblyLIGN 1.0. 9. Oxford, United Kingdom: Oxford Molecular Group PLC; 1998.
-
- Broveak T. Geneworks. Biotechnol Software Internet J. 1996;13:1114.
-
- Parker S. Autoassembler sequence assembly software. Methods Mol Biol. 1997;70:107–118. - PubMed
-
- Swindell S.R., Plasterer T.N. SEQMAN. Contig assembly. Methods Mol Biol. 1997;70:75–89. - PubMed
-
- Bromberg C. Gene Codes Corporation; 1995. Sequencher.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous