Statistical significance of sequence patterns in proteins
- PMID: 7583634
- DOI: 10.1016/0959-440x(95)80098-0
Abstract
I discuss three recent developments in sequence analysis by the statistical method of scores. The first is the identification of segments of high aggregate score in a single protein sequence; charge clusters and hyper-charge runs are prime examples. Proteins containing hyper-charge runs are principally associated with DNA and RNA processing, chromatin structure, ion storage and exchange, and protein complex assembly. The second is protein sequence comparison that identifies common segments with high total similarity scores, illustrated by comparisons within the family of prokaryotic heat shock 70 kDa proteins. The third is the application of scoring protocols to the inverse folding problem.
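To make the first idea concrete, here is a minimal sketch of finding the segment of highest aggregate score in a single sequence. The scoring scheme (charged residues D, E, K, R score +2, all others -1) and the function names are illustrative assumptions, not the scores or procedure used in the article, and the sketch does not assess statistical significance.

```python
# Hypothetical scoring scheme (an assumption, not taken from the article):
# charged residues score +2, every other residue scores -1.
CHARGED = set("DEKR")


def residue_score(aa: str) -> int:
    """Score one residue under the illustrative charge scheme."""
    return 2 if aa in CHARGED else -1


def max_scoring_segment(seq: str):
    """Return (score, start, end) of the highest-scoring contiguous segment.

    `end` is exclusive, so the segment is seq[start:end]. An empty segment
    with score 0 is returned when no residue scores positively.
    """
    best_score, best_start, best_end = 0, 0, 0
    run_score, run_start = 0, 0
    for i, aa in enumerate(seq):
        run_score += residue_score(aa)
        if run_score <= 0:
            # A non-positive running total cannot help a later segment: restart.
            run_score, run_start = 0, i + 1
        elif run_score > best_score:
            best_score, best_start, best_end = run_score, run_start, i + 1
    return best_score, best_start, best_end


if __name__ == "__main__":
    seq = "GASGLKKEEDRKAGSL"  # made-up sequence for illustration
    score, start, end = max_scoring_segment(seq)
    print(score, seq[start:end])
```

On the made-up sequence above, the sketch picks out the charged stretch `KKEEDRK`. Deciding whether such a score is unusually high for a given sequence length and composition is a separate step that this example leaves out.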
Similar articles
- Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci U S A. 1993 Jun 15;90(12):5873-7. doi: 10.1073/pnas.90.12.5873. PMID: 8390686. Free PMC article.
- A model for statistical significance of local similarities in structure. J Mol Biol. 2003 Mar 7;326(5):1307-16. doi: 10.1016/s0022-2836(03)00045-7. PMID: 12595245. Review.
- Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci U S A. 1990 Mar;87(6):2264-8. doi: 10.1073/pnas.87.6.2264. PMID: 2315319. Free PMC article.
- Detecting periodic patterns in biological sequences. Bioinformatics. 1998;14(6):498-507. doi: 10.1093/bioinformatics/14.6.498. PMID: 9694988.
- Three-dimensional profiles for analysing protein sequence-structure relationships. Faraday Discuss. 1992;(93):25-34. PMID: 1290936. Review.
Cited by
- Phototactic migration of Dictyostelium cells is linked to a new type of gelsolin-related protein. Mol Biol Cell. 1999 Jan;10(1):161-78. doi: 10.1091/mbc.10.1.161. PMID: 9880334. Free PMC article.
- ProtRepeatsDB: a database of amino acid repeats in genomes. BMC Bioinformatics. 2006 Jul 7;7:336. doi: 10.1186/1471-2105-7-336. PMID: 16827924. Free PMC article.
- n-Gram characterization of genomic islands in bacterial genomes. Comput Methods Programs Biomed. 2009 Mar;93(3):241-56. doi: 10.1016/j.cmpb.2008.10.014. Epub 2008 Dec 19. PMID: 19101056. Free PMC article.
- Compositional biases of bacterial genomes and evolutionary implications. J Bacteriol. 1997 Jun;179(12):3899-913. doi: 10.1128/jb.179.12.3899-3913.1997. PMID: 9190805. Free PMC article.
- How are close residues of protein structures distributed in primary sequence? Proc Natl Acad Sci U S A. 1995 Dec 19;92(26):12136-40. doi: 10.1073/pnas.92.26.12136. PMID: 8618859. Free PMC article.