Computational complexity of algorithms for sequence comparison, short-read assembly and genome alignment
- PMID: 28392341
- DOI: 10.1016/j.biosystems.2017.03.003
Computational complexity of algorithms for sequence comparison, short-read assembly and genome alignment
Abstract
A multitude of algorithms for sequence comparison, short-read assembly and whole-genome alignment have been developed in the general context of molecular biology, to support technology development for high-throughput sequencing, numerous applications in genome biology and fundamental research on comparative genomics. The computational complexity of these algorithms has been previously reported in original research papers, yet this often neglected property has not been reviewed previously in a systematic manner and for a wider audience. We provide a review of space and time complexity of key sequence analysis algorithms and highlight their properties in a comprehensive manner, in order to identify potential opportunities for further research in algorithm or data structure optimization. The complexity aspect is poised to become pivotal as we will be facing challenges related to the continuous increase of genomic data on unprecedented scales and complexity in the foreseeable future, when robust biological simulation at the cell level and above becomes a reality.
Keywords: Bioinformatics algorithms; Computational complexity; Genome alignment; Sequence comparison; Short-read assembly.
Copyright © 2017 Elsevier B.V. All rights reserved.
Similar articles
-
Comparative analysis of algorithms for next-generation sequencing read alignment.Bioinformatics. 2011 Oct 15;27(20):2790-6. doi: 10.1093/bioinformatics/btr477. Epub 2011 Aug 19. Bioinformatics. 2011. PMID: 21856737
-
Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence.Nat Prod Rep. 2013 Apr;30(4):490-500. doi: 10.1039/c3np20099j. Nat Prod Rep. 2013. PMID: 23377493 Review.
-
Michael Waterman's Contributions to Computational Biology and Bioinformatics.J Comput Biol. 2022 Jul;29(7):601-615. doi: 10.1089/cmb.2022.29066.pp. Epub 2022 Jun 21. J Comput Biol. 2022. PMID: 35727100 Review.
-
Short Read Alignment Using SOAP2.Methods Mol Biol. 2016;1374:241-52. doi: 10.1007/978-1-4939-3167-5_13. Methods Mol Biol. 2016. PMID: 26519410
-
The present and future of de novo whole-genome assembly.Brief Bioinform. 2018 Jan 1;19(1):23-40. doi: 10.1093/bib/bbw096. Brief Bioinform. 2018. PMID: 27742661 Review.
Cited by
-
How Machine Learning and Statistical Models Advance Molecular Diagnostics of Rare Disorders Via Analysis of RNA Sequencing Data.Front Mol Biosci. 2021 Jun 1;8:647277. doi: 10.3389/fmolb.2021.647277. eCollection 2021. Front Mol Biosci. 2021. PMID: 34141720 Free PMC article. Review.
-
HELIOS: High-speed sequence alignment in optics.PLoS Comput Biol. 2022 Nov 21;18(11):e1010665. doi: 10.1371/journal.pcbi.1010665. eCollection 2022 Nov. PLoS Comput Biol. 2022. PMID: 36409684 Free PMC article.
-
Genome assembly composition of the String "ACGT" array: a review of data structure accuracy and performance challenges.PeerJ Comput Sci. 2023 Jul 13;9:e1180. doi: 10.7717/peerj-cs.1180. eCollection 2023. PeerJ Comput Sci. 2023. PMID: 37547391 Free PMC article.
-
Developing computational biology at meridian 23° E, and a little eastwards.J Biol Res (Thessalon). 2018 Nov 14;25:18. doi: 10.1186/s40709-018-0091-5. eCollection 2018 Dec. J Biol Res (Thessalon). 2018. PMID: 30460210 Free PMC article.
-
Athena: Automated Tuning of k-mer based Genomic Error Correction Algorithms using Language Models.Sci Rep. 2019 Nov 6;9(1):16157. doi: 10.1038/s41598-019-52196-4. Sci Rep. 2019. PMID: 31695060 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources