A strategy for the rapid multiple alignment of protein sequences. Confidence levels from tertiary structure comparisons
- PMID: 3430611
- DOI: 10.1016/0022-2836(87)90316-0
Abstract
An algorithm is presented for the multiple alignment of protein sequences that is both accurate and rapid computationally. The approach is based on the conventional dynamic-programming method of pairwise alignment. Initially, two sequences are aligned, then the third sequence is aligned against the alignment of both sequences one and two. Similarly, the fourth sequence is aligned against one, two and three. This is repeated until all sequences have been aligned. Iteration is then performed to yield a final alignment. The accuracy of sequence alignment is evaluated from alignment of the secondary structures in a family of proteins. For the globins, the multiple alignment was on average 99% accurate compared to 90% for pairwise comparison of sequences. For the alignment of immunoglobulin constant and variable domains, the use of many sequences yielded an alignment of 63% average accuracy compared to 41% average for individual variable/constant alignments. The multiple alignment algorithm yields an assignment of disulphide connectivity in mammalian serotransferrin that is consistent with crystallographic data, whereas pairwise alignments give an alternative assignment.
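The build-up step described in the abstract (align two sequences, then align each further sequence against the alignment so far) can be illustrated with a minimal Python sketch. The abstract gives no scoring scheme, so everything here is an assumption: the function names `column_score`, `align_to_alignment` and `progressive_align` are hypothetical, the match/mismatch/gap values are arbitrary, a new residue is scored against a column by averaging over the column's non-gap residues, and the input order is used as given. The final iteration step mentioned in the abstract is not attempted.

```python
GAP = "-"

def column_score(column, residue, match=1, mismatch=-1):
    """Average score of one residue against the non-gap residues of a column.
    (Assumed toy scoring, not the scheme used in the paper.)"""
    residues = [c for c in column if c != GAP]
    if not residues:
        return 0.0
    return sum(match if c == residue else mismatch for c in residues) / len(residues)

def align_to_alignment(alignment, seq, gap=-2):
    """Globally align `seq` against an existing alignment (equal-length rows)
    with Needleman-Wunsch style dynamic programming and a linear gap penalty."""
    n_rows = len(alignment)
    columns = list(zip(*alignment))
    n, m = len(columns), len(seq)

    # score[i][j]: best score for the first i columns vs the first j residues
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + column_score(columns[i - 1], seq[j - 1]),
                score[i - 1][j] + gap,   # gap in the new sequence
                score[i][j - 1] + gap,   # gap column in the existing alignment
            )

    # Trace back from the bottom-right corner, rebuilding every row.
    rows = [[] for _ in range(n_rows)]
    new_row = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and score[i][j]
                == score[i - 1][j - 1] + column_score(columns[i - 1], seq[j - 1])):
            col, res = columns[i - 1], seq[j - 1]
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            col, res = columns[i - 1], GAP
            i -= 1
        else:
            col, res = (GAP,) * n_rows, seq[j - 1]
            j -= 1
        for row, c in zip(rows, col):
            row.append(c)
        new_row.append(res)

    aligned = ["".join(reversed(r)) for r in rows]
    aligned.append("".join(reversed(new_row)))
    return aligned

def progressive_align(seqs):
    """Add sequences one at a time to a growing alignment, in input order."""
    alignment = [seqs[0]]
    for seq in seqs[1:]:
        alignment = align_to_alignment(alignment, seq)
    return alignment

for row in progressive_align(["ACGT", "ACGT", "ACT"]):
    print(row)
```

Because each step is an ordinary two-way dynamic-programming problem (alignment columns against one sequence), the cost grows with the number of sequences rather than exponentially with it, which is the sense in which such a strategy is rapid. A realistic version would use an amino-acid substitution matrix instead of the toy match/mismatch values assumed here.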