Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features
- PMID: 19103665
- PMCID: PMC2647288
- DOI: 10.1093/nar/gkn1006
Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features
Abstract
In the growing field of genomics, multiple alignment programs are confronted with ever increasing amounts of data. To address this growing issue we have dramatically improved the running time and memory requirement of Kalign, while maintaining its high alignment accuracy. Kalign version 2 also supports nucleotide alignment, and a newly introduced extension allows for external sequence annotation to be included into the alignment procedure. We demonstrate that Kalign2 is exceptionally fast and memory-efficient, permitting accurate alignment of very large numbers of sequences. The accuracy of Kalign2 compares well to the best methods in the case of protein alignments while its accuracy on nucleotide alignments is generally superior. In addition, we demonstrate the potential of using known or predicted sequence annotation to improve the alignment accuracy. Kalign2 is freely available for download from the Kalign web site (http://msa.sbc.su.se/).
Figures



Similar articles
-
KalignP: improved multiple sequence alignments using position specific gap penalties in Kalign2.Bioinformatics. 2011 Jun 15;27(12):1702-3. doi: 10.1093/bioinformatics/btr235. Epub 2011 Apr 19. Bioinformatics. 2011. PMID: 21505030 Free PMC article.
-
Kalign--an accurate and fast multiple sequence alignment algorithm.BMC Bioinformatics. 2005 Dec 12;6:298. doi: 10.1186/1471-2105-6-298. BMC Bioinformatics. 2005. PMID: 16343337 Free PMC article.
-
Kalign, Kalignvu and Mumsa: web servers for multiple sequence alignment.Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W596-9. doi: 10.1093/nar/gkl191. Nucleic Acids Res. 2006. PMID: 16845078 Free PMC article.
-
Protein multiple sequence alignment benchmarking through secondary structure prediction.Bioinformatics. 2017 May 1;33(9):1331-1337. doi: 10.1093/bioinformatics/btw840. Bioinformatics. 2017. PMID: 28093407 Free PMC article.
-
PicXAA-Web: a web-based platform for non-progressive maximum expected accuracy alignment of multiple biological sequences.Nucleic Acids Res. 2011 Jul;39(Web Server issue):W8-12. doi: 10.1093/nar/gkr244. Epub 2011 Apr 22. Nucleic Acids Res. 2011. PMID: 21515632 Free PMC article.
Cited by
-
Apprehending the NAD+-ADPr-Dependent Systems in the Virus World.Viruses. 2022 Sep 7;14(9):1977. doi: 10.3390/v14091977. Viruses. 2022. PMID: 36146784 Free PMC article.
-
ALOG domains: provenance of plant homeotic and developmental regulators from the DNA-binding domain of a novel class of DIRS1-type retroposons.Biol Direct. 2012 Nov 12;7:39. doi: 10.1186/1745-6150-7-39. Biol Direct. 2012. PMID: 23146749 Free PMC article.
-
Evolutionarily ancient BAH-PHD protein mediates Polycomb silencing.Proc Natl Acad Sci U S A. 2020 May 26;117(21):11614-11623. doi: 10.1073/pnas.1918776117. Epub 2020 May 11. Proc Natl Acad Sci U S A. 2020. PMID: 32393638 Free PMC article.
-
Profile Comparer Extended: phylogeny of lytic polysaccharide monooxygenase families using profile hidden Markov model alignments.F1000Res. 2019 Oct 31;8:1834. doi: 10.12688/f1000research.21104.1. eCollection 2019. F1000Res. 2019. PMID: 31956399 Free PMC article.
-
Domains in Action: Understanding Ddi1's Diverse Functions in the Ubiquitin-Proteasome System.Int J Mol Sci. 2024 Apr 6;25(7):4080. doi: 10.3390/ijms25074080. Int J Mol Sci. 2024. PMID: 38612889 Free PMC article. Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases