HomologyPlot: searching for homology to a family of proteins using a database of unique conserved patterns
- PMID: 8064334
- DOI: 10.1007/BF00119867
HomologyPlot: searching for homology to a family of proteins using a database of unique conserved patterns
Abstract
A new database of conserved amino acid residues is derived from the multiple sequence alignment of over 84 families of protein sequences that have been reported in the literature. This database contains sequences of conserved hydrophobic core patterns which are probably important for structure and function, since they are conserved for most sequences in that family. This database differs from other single-motif or signature databases reported previously, since it contains multiple patterns for each family. The new database is used to align a new sequence with the conserved regions of a family. This is analogous to reports in the literature where multiple sequence alignments are used to improve a sequence alignment. A program called HomologyPlot (suitable for IBM or compatible computers) uses this database to find homology of a new sequence to a family of protein sequences. There are several advantages to using multiple patterns. First, the program correctly identifies a new sequence as a member of a known family. Second, the search of the entire database is rapid and requires less than one minute. This is similar to performing a multiple sequence alignment of a new sequence to all of the known protein family sequences. Third, the alignment of a new sequence to family members is reliable and can reproduce the alignment of conserved regions already described in the literature. The speed and efficiency of this method is enhanced, since there is no need to score for insertions or deletions as is done in the more commonly used sequence alignment methods. In this method only the patterns are aligned. HomologyPlot also provides general information on each family, as well as a listing of patterns in a family.
Similar articles
-
Using CLUSTAL for multiple sequence alignments.Methods Enzymol. 1996;266:383-402. doi: 10.1016/s0076-6879(96)66024-8. Methods Enzymol. 1996. PMID: 8743695
-
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975. J Mol Biol. 2000. PMID: 10966778
-
Identification of functionally conserved residues with the use of entropy-variability plots.Proteins. 2003 Sep 1;52(4):544-52. doi: 10.1002/prot.10490. Proteins. 2003. PMID: 12910454
-
Subtilases: the superfamily of subtilisin-like serine proteases.Protein Sci. 1997 Mar;6(3):501-23. doi: 10.1002/pro.5560060301. Protein Sci. 1997. PMID: 9070434 Free PMC article. Review.
-
Homology modelling and protein engineering strategy of subtilases, the family of subtilisin-like serine proteinases.Protein Eng. 1991 Oct;4(7):719-37. doi: 10.1093/protein/4.7.719. Protein Eng. 1991. PMID: 1798697 Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Other Literature Sources