Parallelization of MAFFT for large-scale multiple sequence alignments
- PMID: 29506019
- PMCID: PMC6041967
- DOI: 10.1093/bioinformatics/bty121
Parallelization of MAFFT for large-scale multiple sequence alignments
Abstract
Summary: We report an update for the MAFFT multiple sequence alignment program to enable parallel calculation of large numbers of sequences. The G-INS-1 option of MAFFT was recently reported to have higher accuracy than other methods for large data, but this method has been impractical for most large-scale analyses, due to the requirement of large computational resources. We introduce a scalable variant, G-large-INS-1, which has equivalent accuracy to G-INS-1 and is applicable to 50 000 or more sequences.
Availability and implementation: This feature is available in MAFFT versions 7.355 or later at https://mafft.cbrc.jp/alignment/software/mpi.html.
Supplementary information: Supplementary data are available at Bioinformatics online.
Figures

References
-
- Glöckner F.O. et al. (2017) 25 years of serving the community with ribosomal RNA gene reference databases and tools. J. Biotechnol., 261, 169–176. - PubMed
-
- González-Domínguez J. et al. (2016) MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems. Bioinformatics, 32, 3826–3828. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources