Eigenvalue analysis of amino acid substitution matrices reveals a sharp transition of the mode of sequence conservation in proteins
- PMID: 15130930
- DOI: 10.1093/bioinformatics/bth297
Eigenvalue analysis of amino acid substitution matrices reveals a sharp transition of the mode of sequence conservation in proteins
Abstract
The pattern of amino acid substitutions and sequence conservation over many structure-based alignments of protein sequences was analyzed as a function of percentage sequence identity. The statistics of the amino acid substitutions were converted into the form of log-odds amino acid substitution matrices to which eigenvalue decomposition was applied. It was found that the most important component of the substitution matrices exhibited a sharp transition at the sequence identity of 30-35%, which coincides with the twilight zone. Above the transition point, the most dominant component is related to the mutability of amino acids and it acts to disfavor any substitutions, whereas below the transition point, the most dominant component is related to the hydrophobicity of amino acids and substitutions between residues of similar hydrophobic character are positively favored. Implications for protein evolution and sequence analysis are discussed.
Similar articles
-
Periodic distributions of hydrophobic amino acids allows the definition of fundamental building blocks to align distantly related proteins.Proteins. 2007 May 15;67(3):695-708. doi: 10.1002/prot.21319. Proteins. 2007. PMID: 17299747
-
Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions.Bioinformatics. 2006 Feb 15;22(4):413-22. doi: 10.1093/bioinformatics/bti828. Epub 2005 Dec 13. Bioinformatics. 2006. PMID: 16352653
-
An examination of the conservation of surface patch polarity for proteins.Bioinformatics. 2004 Sep 22;20(14):2197-204. doi: 10.1093/bioinformatics/bth218. Epub 2004 Apr 8. Bioinformatics. 2004. PMID: 15073014
-
Scoring residue conservation.Proteins. 2002 Aug 1;48(2):227-41. doi: 10.1002/prot.10146. Proteins. 2002. PMID: 12112692 Review.
-
The WWWH of remote homolog detection: the state of the art.Brief Bioinform. 2007 Mar;8(2):78-87. doi: 10.1093/bib/bbl032. Epub 2006 Sep 26. Brief Bioinform. 2007. PMID: 17003074 Review.
Cited by
-
Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.BMC Bioinformatics. 2010 Jan 4;11:4. doi: 10.1186/1471-2105-11-4. BMC Bioinformatics. 2010. PMID: 20047649 Free PMC article.
-
Deep Learning for Protein-Protein Interaction Site Prediction.Methods Mol Biol. 2021;2361:263-288. doi: 10.1007/978-1-0716-1641-3_16. Methods Mol Biol. 2021. PMID: 34236667
-
Comprehensive analysis of the mouse cytochrome P450 family responsible for omega-3 epoxidation of eicosapentaenoic acid.Sci Rep. 2018 May 21;8(1):7954. doi: 10.1038/s41598-018-26325-4. Sci Rep. 2018. PMID: 29784972 Free PMC article.
-
Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness.BMC Biol. 2020 Oct 20;18(1):146. doi: 10.1186/s12915-020-00870-9. BMC Biol. 2020. PMID: 33081759 Free PMC article.
-
Structure of Nanobody Nb23.Molecules. 2021 Jun 11;26(12):3567. doi: 10.3390/molecules26123567. Molecules. 2021. PMID: 34207949 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources