Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec 15;35(24):5315-5317.
doi: 10.1093/bioinformatics/btz572.

Tailor-made multiple sequence alignments using the PRALINE 2 alignment toolkit

Affiliations

Tailor-made multiple sequence alignments using the PRALINE 2 alignment toolkit

Maurits J J Dijkstra et al. Bioinformatics. .

Abstract

Summary: PRALINE 2 is a toolkit for custom multiple sequence alignment workflows. It can be used to incorporate sequence annotations, such as secondary structure or (DNA) motifs, into the alignment scoring, as well as to customize many other aspects of a progressive multiple alignment workflow.

Availability and implementation: PRALINE 2 is implemented in Python and available as open source software on GitHub: https://github.com/ibivu/PRALINE/.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Overview of the PRALINE 2 algorithm, showing how an alignment can be improved by incorporating sequence annotations. (A) Three input amino acid sequences alongside two annotation tracks: the 3-state secondary structure (C, E, H) and an annotation predicting whether an N-terminal glycosylation site exists at a position (M) or not (*). A pair of columns, shown in red and green, is tracked throughout the steps of the algorithm. (B) Scoring in more detail. The three types of symbols contribute independently to the total score: amino acids are scored by BLOSUM62, secondary structure by a 5/-5 match/mismatch scheme, and, if both positions are a glycosylation motif, a score boost of 20 is applied. (C) The resulting alignment (amino acid sequences only); note that the motifs, shown in yellow, are correctly aligned, due to the motif scoring

References

    1. Alva V. et al. (2016) The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis. Nucleic Acids Res., 44, W410–W415. - PMC - PubMed
    1. Dijkstra M. et al. (2018) Motif-Aware PRALINE: improving the alignment of motif regions. PLoS Comput. Biol., 14, e1006547.. - PMC - PubMed
    1. Henikoff S., Henikoff J.G. (1992) Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA, 89, 10915–10919. - PMC - PubMed
    1. Heringa J. (1999) Two strategies for sequence comparison: profile-preprocessed and secondary structure-induced multiple alignment. Comput. Chem., 23, 341–364. - PubMed
    1. Hogeweg P., Hesper B. (1984) The alignment of sets of sequences and the construction of phyletic trees: an integrated method. J. Mol. Evol., 20, 175–186. - PubMed