Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels
- PMID: 1409577
- DOI: 10.1002/prot.340140216
Abstract
An algorithm is presented for the accurate and rapid generation of multiple protein sequence alignments from tertiary structure comparisons. A preliminary multiple sequence alignment is performed using sequence information, which then determines an initial superposition of the structures. A structure comparison algorithm is applied to all pairs of proteins in the superimposed set and a similarity tree calculated. Multiple sequence alignments are then generated by following the tree from the branches to the root. At each branchpoint of the tree, a structure-based sequence alignment and coordinate transformations are output, with the multiple alignment of all structures output at the root. The algorithm encoded in STAMP (STructural Alignment of Multiple Proteins) is shown to give alignments in good agreement with published structural accounts within the dehydrogenase fold domains, globins, and serine proteinases. In order to reduce the need for visual verification, two similarity indices are introduced to determine the quality of each generated structural alignment. Sc quantifies the global structural similarity between pairs or groups of proteins, whereas Pij' provides a normalized measure of the confidence in the alignment of each residue. STAMP alignments have the quality of each alignment characterized by Sc and Pij' values and thus provide a reproducible resource for studies of residue conservation within structural motifs.
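The tree-guided ordering described in the abstract can be illustrated with a minimal sketch. It is not the STAMP implementation: the abstract does not say how the similarity tree is calculated, so average-linkage clustering is assumed here, and the function name `build_merge_order`, the structure labels, and the score values are all hypothetical. The sketch only shows the order in which structures or groups would be combined, from the most similar pair at the branches to the full set at the root. The structure comparison and the Sc and Pij' calculations are not reproduced.

```python
from itertools import combinations


def build_merge_order(names, score):
    """Return the merges of a similarity tree, from the branches to the root.

    names: list of structure labels (hypothetical).
    score: dict mapping frozenset({a, b}) to a pairwise similarity,
           where higher means more similar (hypothetical values).
    Assumption: average linkage; the source does not state the method.
    """
    clusters = [(name,) for name in names]
    merges = []

    def average_similarity(a, b):
        # Mean pairwise similarity between the members of two clusters.
        total = sum(score[frozenset((x, y))] for x in a for y in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        # Pick the most similar pair of clusters and join them.
        a, b = max(combinations(clusters, 2),
                   key=lambda pair: average_similarity(*pair))
        clusters.remove(a)
        clusters.remove(b)
        clusters.append(a + b)
        merges.append((a, b))  # one branchpoint of the tree
    return merges


if __name__ == "__main__":
    # Hypothetical pairwise similarities for three structures.
    example_scores = {
        frozenset(("A", "B")): 9.0,
        frozenset(("A", "C")): 2.0,
        frozenset(("B", "C")): 3.0,
    }
    for left, right in build_merge_order(["A", "B", "C"], example_scores):
        print("align", left, "with", right)
```

Each tuple in the returned list corresponds to one branchpoint, where a structure-based alignment of the two groups would be produced. The final tuple corresponds to the root, where all structures are aligned together.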
Similar articles
- An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments. J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975. PMID: 10966778
- MUSTANG: a multiple structural alignment algorithm. Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921. PMID: 16736488
- CAALIGN: a program for pairwise and multiple protein-structure alignment. Acta Crystallogr D Biol Crystallogr. 2007 Apr;63(Pt 4):514-25. doi: 10.1107/S0907444907000844. Epub 2007 Mar 16. PMID: 17372357
- A sequence similarity search algorithm based on a probabilistic interpretation of an alignment scoring system. Proc Int Conf Intell Syst Mol Biol. 1996;4:44-51. PMID: 8877503 Review.
- Determination of reliable regions in protein sequence alignments. Protein Eng. 1990 Jul;3(7):565-9. doi: 10.1093/protein/3.7.565. PMID: 2217130 Review.
Cited by
- Structural insights into putative molybdenum cofactor biosynthesis protein C (MoaC2) from Mycobacterium tuberculosis H37Rv. PLoS One. 2013;8(3):e58333. doi: 10.1371/journal.pone.0058333. Epub 2013 Mar 19. PMID: 23526978 Free PMC article.
- Structure of Aichi Virus 1 and Its Empty Particle: Clues to Kobuvirus Genome Release Mechanism. J Virol. 2016 Nov 14;90(23):10800-10810. doi: 10.1128/JVI.01601-16. Print 2016 Dec 1. PMID: 27681122 Free PMC article.
- Structure-function analysis of Sedolisins: evolution of tripeptidyl peptidase and endopeptidase subfamilies in fungi. BMC Bioinformatics. 2018 Dec 4;19(1):464. doi: 10.1186/s12859-018-2404-y. PMID: 30514213 Free PMC article.
- Structural similarity to bridge sequence space: finding new families on the bridges. Protein Sci. 2005 May;14(5):1305-14. doi: 10.1110/ps.041187405. PMID: 15840833 Free PMC article.
- A proximity-based in silico approach to identify redox-labile disulfide bonds: The example of FVIII. PLoS One. 2022 Feb 7;17(2):e0262409. doi: 10.1371/journal.pone.0262409. eCollection 2022. PMID: 35130281 Free PMC article.