Evolutionary divergence plots of homologous proteins
- PMID: 1520737
- DOI: 10.1016/0300-9084(92)90157-a
Abstract
A simple and efficient method is described for quantitatively analyzing multiple protein sequence alignments and for finding both the most conserved blocks and the maxima of divergence within the set of aligned sequences. It consists of calculating the mean distance and the root-mean-square distance in each column of the multiple alignment, averaging the values in a window of defined length, and plotting the results as a function of the position of the window. Due attention is paid to the presence of gaps in the columns. Examples are provided using the sequences of several cytochromes c, serine proteases, lysozymes and globins. Two distance matrices are compared, namely the matrix derived by Gribskov and Burgess from the Dayhoff matrix, and the Risler Structural Superposition Matrix. In each case, the divergence plots effectively point to the specific residues known to be essential for the catalytic activity of the proteins. In addition, the regions of maximum divergence are clearly delineated. Interestingly, they are generally observed in positions immediately flanking the most conserved blocks. The method should therefore be useful for delineating the peptide segments that are good candidates for site-directed mutagenesis and for visualizing the evolutionary constraints along homologous polypeptide chains.
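The column-wise calculation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several choices here are assumptions: the distances are taken over all residue pairs in a column, pairs involving a gap are simply skipped (the abstract does not say how gaps are treated), and `identity_distance` is a hypothetical toy stand-in for a real distance matrix such as the Gribskov-Burgess or Risler matrices. The function names `column_stats` and `divergence_profile` are likewise illustrative.

```python
import math
from itertools import combinations

GAP = "-"  # assumed gap symbol in the alignment


def identity_distance(a, b):
    """Toy distance (assumption): 0 for identical residues, 1 otherwise.

    A real analysis would look up a residue-residue distance matrix here.
    """
    return 0.0 if a == b else 1.0


def column_stats(column, distance=identity_distance):
    """Return (mean, rms) of pairwise distances in one alignment column.

    Gap characters are ignored (an assumption of this sketch). Returns None
    when fewer than two residues remain, so no pair can be formed.
    """
    residues = [r for r in column if r != GAP]
    values = [distance(a, b) for a, b in combinations(residues, 2)]
    if not values:
        return None
    mean = sum(values) / len(values)
    rms = math.sqrt(sum(v * v for v in values) / len(values))
    return mean, rms


def divergence_profile(alignment, window=3, distance=identity_distance):
    """Average per-column mean and rms distances over a sliding window.

    Returns a list of (window_start, mean, rms) tuples, one per window
    position. Columns without usable pairs are left out of the average;
    a window with no usable column yields (window_start, None, None).
    """
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    stats = [
        column_stats([seq[i] for seq in alignment], distance)
        for i in range(length)
    ]
    profile = []
    for start in range(length - window + 1):
        usable = [s for s in stats[start:start + window] if s is not None]
        if usable:
            mean = sum(s[0] for s in usable) / len(usable)
            rms = sum(s[1] for s in usable) / len(usable)
            profile.append((start, mean, rms))
        else:
            profile.append((start, None, None))
    return profile
```

Calling `divergence_profile(["ACDE", "ACDF", "AC-G"], window=2)` returns one tuple per window position. Plotting the second and third elements against the first gives a divergence plot in the spirit of the method, with low values marking conserved blocks and high values marking divergent regions.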