MAFFT version 5: improvement in accuracy of multiple sequence alignment
- PMID: 15661851
- PMCID: PMC548345
- DOI: 10.1093/nar/gki198
Abstract
The accuracy of the multiple sequence alignment program MAFFT has been improved. The new version (5.3) of MAFFT offers new iterative refinement options, H-INS-i, F-INS-i and G-INS-i, in which pairwise alignment information is incorporated into the objective function. These new options showed higher accuracy than currently available methods, including TCoffee version 2 and CLUSTAL W, in benchmark tests consisting of alignments of >50 sequences. Like the previously available options, the new options can handle hundreds of sequences on a standard desktop computer. We also examined the effect of the number of homologues included in an alignment. For a multiple alignment consisting of approximately 8 sequences with low similarity, the accuracy improved by 2-10 percentage points when the sequences were aligned together with dozens of their close homologues (E-value < 10^-5 to 10^-20) collected from a database. Such improvement was generally observed for most methods, but was remarkably large for the new options of MAFFT proposed here. We therefore made a Ruby script, mafftE.rb, which aligns the input sequences together with their close homologues collected from SwissProt using NCBI-BLAST.
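The homologue-collection step described in the abstract can be illustrated with a minimal Python sketch. It is not the mafftE.rb implementation: it only assumes that BLAST hits are available as 12-column tab-separated lines (query ID, subject ID, ..., E-value in column 11, bit score in column 12) and keeps hits below an E-value cutoff. The function name `select_close_homologues`, the default cutoff of 1e-10 (one value inside the 10^-5 to 10^-20 range quoted above) and the cap of 50 hits are hypothetical choices made for illustration.

```python
from typing import Dict, Iterable, List


def parse_evalue(text: str) -> float:
    """Parse an E-value string; tolerate the 'e-100' form without a leading digit."""
    return float("1" + text if text.startswith("e") else text)


def select_close_homologues(
    blast_lines: Iterable[str],
    max_evalue: float = 1e-10,      # hypothetical cutoff, not taken from the paper
    max_hits_per_query: int = 50,   # hypothetical cap ("dozens" of homologues)
) -> Dict[str, List[str]]:
    """Return, for each query, subject IDs with E-value < max_evalue, best first."""
    best: Dict[str, Dict[str, float]] = {}
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if len(fields) < 12:
            continue  # not a 12-column tabular hit line
        query, subject = fields[0], fields[1]
        evalue = parse_evalue(fields[10])
        if subject == query or evalue >= max_evalue:
            continue  # drop self-hits and hits above the cutoff
        hits = best.setdefault(query, {})
        # A subject can appear in several rows; keep its lowest E-value.
        if subject not in hits or evalue < hits[subject]:
            hits[subject] = evalue
    return {
        query: [s for s, _ in sorted(hits.items(), key=lambda kv: (kv[1], kv[0]))][
            :max_hits_per_query
        ]
        for query, hits in best.items()
    }


# Made-up hit lines, for illustration only.
example = [
    "q1\tsp|A\t90.0\t100\t10\t0\t1\t100\t1\t100\t1e-30\t200",
    "q1\tsp|B\t40.0\t100\t60\t0\t1\t100\t1\t100\t1e-3\t50",
    "q1\tsp|C\t70.0\t100\t30\t0\t1\t100\t1\t100\t1e-12\t120",
]
print(select_close_homologues(example))  # {'q1': ['sp|A', 'sp|C']}
```

The selected IDs would then be fetched from the database and aligned together with the input sequences; that step is left out here because it depends on the local BLAST and MAFFT installation.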
Figures
Figure legend (fragment), command-line arguments used: TCoffee, default; PROBCONS, default; CLUSTAL W, default; MUSCLE-i, muscle -maxiters 16; MUSCLE-2, muscle -maxiters 1; MUSCLE-fast, muscle -sv -maxiters 1 -diags1 -distance1 kbit20_3.