Amino acid substitution matrices from an information theoretic perspective
- PMID: 2051488
- PMCID: PMC7130686
- DOI: 10.1016/0022-2836(91)90193-a
Amino acid substitution matrices from an information theoretic perspective
Abstract
Protein sequence alignments have become an important tool for molecular biologists. Local alignments are frequently constructed with the aid of a "substitution score matrix" that specifies a score for aligning each pair of amino acid residues. Over the years, many different substitution matrices have been proposed, based on a wide variety of rationales. Statistical results, however, demonstrate that any such matrix is implicitly a "log-odds" matrix, with a specific target distribution for aligned pairs of amino acid residues. In the light of information theory, it is possible to express the scores of a substitution matrix in bits and to see that different matrices are better adapted to different purposes. The most widely used matrix for protein sequence comparison has been the PAM-250 matrix. It is argued that for database searches the PAM-120 matrix generally is more appropriate, while for comparing two specific proteins with suspected homology the PAM-200 matrix is indicated. Examples discussed include the lipocalins, human alpha 1 B-glycoprotein, the cystic fibrosis transmembrane conductance regulator and the globins.
Similar articles
-
A protein alignment scoring system sensitive at all evolutionary distances.J Mol Evol. 1993 Mar;36(3):290-300. doi: 10.1007/BF00160485. J Mol Evol. 1993. PMID: 8483166
-
Optimizing substitution matrices by separating score distributions.Bioinformatics. 2004 Apr 12;20(6):863-73. doi: 10.1093/bioinformatics/btg494. Epub 2004 Jan 29. Bioinformatics. 2004. PMID: 14752003
-
The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions.Bioinformatics. 2005 Apr 1;21(7):902-11. doi: 10.1093/bioinformatics/bti070. Epub 2004 Oct 27. Bioinformatics. 2005. PMID: 15509610
-
Protein database searches using compositionally adjusted substitution matrices.FEBS J. 2005 Oct;272(20):5101-9. doi: 10.1111/j.1742-4658.2005.04945.x. FEBS J. 2005. PMID: 16218944 Free PMC article. Review.
-
Substitution scoring matrices for proteins - An overview.Protein Sci. 2020 Nov;29(11):2150-2163. doi: 10.1002/pro.3954. Epub 2020 Oct 12. Protein Sci. 2020. PMID: 32954566 Free PMC article. Review.
Cited by
-
Zar1 represses translation in Xenopus oocytes and binds to the TCS in maternal mRNAs with different characteristics than Zar2.Biochim Biophys Acta. 2013 Oct;1829(10):1034-46. doi: 10.1016/j.bbagrm.2013.06.001. Epub 2013 Jul 1. Biochim Biophys Acta. 2013. PMID: 23827238 Free PMC article.
-
BRONCO: Biomedical entity Relation ONcology COrpus for extracting gene-variant-disease-drug relations.Database (Oxford). 2016 Apr 13;2016:baw043. doi: 10.1093/database/baw043. Print 2016. Database (Oxford). 2016. PMID: 27074804 Free PMC article.
-
Genomic analysis of a 1 Mb region near the telomere of Hessian fly chromosome X2 and avirulence gene vH13.BMC Genomics. 2006 Jan 16;7:7. doi: 10.1186/1471-2164-7-7. BMC Genomics. 2006. PMID: 16412254 Free PMC article.
-
Large-scale trends in the evolution of gene structures within 11 animal genomes.PLoS Comput Biol. 2006 Mar;2(3):e15. doi: 10.1371/journal.pcbi.0020015. Epub 2006 Mar 3. PLoS Comput Biol. 2006. PMID: 16518452 Free PMC article.
-
SIRT1 gene expression upon genotoxic damage is regulated by APE1 through nCaRE-promoter elements.Mol Biol Cell. 2014 Feb;25(4):532-47. doi: 10.1091/mbc.E13-05-0286. Epub 2013 Dec 19. Mol Biol Cell. 2014. PMID: 24356447 Free PMC article.
References
-
- Altschul S.F., Erickson B.W. A nonlinear measure of subalignment similarity and its significance levels. Bull. Math. Biol. 1986;48:617–632. - PubMed
-
- Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed
-
- Argos P. A sensitive procedure to compare amino acid sequences. J. Mol. Biol. 1987;193:385–396. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous