Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method
- PMID: 11752185
- DOI: 10.1093/oxfordjournals.molbev.a003985
Estimating amino acid substitution models: a comparison of Dayhoff's estimator, the resolvent approach and a maximum likelihood method
Abstract
Evolution of proteins is generally modeled as a Markov process acting on each site of the sequence. Replacement frequencies need to be estimated based on sequence alignments. Here we compare three approaches: First, the original method by Dayhoff, Schwartz, and Orcutt (1978) Atlas Protein Seq. Struc. 5:345-352, secondly, the resolvent method (RV) by Müller and Vingron (2000) J. Comput. Biol. 7(6):761-776, and finally a maximum likelihood approach (ML) developed in this paper. We evaluate the methods using a highly divergent and inhomogeneous set of sequence alignments as an input to the estimation procedure. ML is the method of choice for small sets of input data. Although the RV method is computationally much less demanding it performs only slightly worse than ML. Therefore, it is perfectly appropriate for large-scale applications.
Similar articles
-
Modeling amino acid replacement.J Comput Biol. 2000;7(6):761-76. doi: 10.1089/10665270050514918. J Comput Biol. 2000. PMID: 11382360
-
Efficient methods for estimating amino acid replacement rates.J Mol Evol. 2006 Jun;62(6):663-73. doi: 10.1007/s00239-004-0113-9. Epub 2006 Apr 28. J Mol Evol. 2006. PMID: 16752207
-
Scoredist: a simple and robust protein sequence distance estimator.BMC Bioinformatics. 2005 Apr 27;6:108. doi: 10.1186/1471-2105-6-108. BMC Bioinformatics. 2005. PMID: 15857510 Free PMC article.
-
Models of molecular evolution and phylogeny.Genome Res. 1998 Dec;8(12):1233-44. doi: 10.1101/gr.8.12.1233. Genome Res. 1998. PMID: 9872979 Review.
-
Functional Proteins from Short Peptides: Dayhoff's Hypothesis Turns 50.Angew Chem Int Ed Engl. 2016 Dec 23;55(52):15966-15971. doi: 10.1002/anie.201609977. Epub 2016 Nov 16. Angew Chem Int Ed Engl. 2016. PMID: 27865046 Review.
Cited by
-
The ranging of amino acids substitution matrices of various types in accordance with the alignment accuracy criterion.BMC Bioinformatics. 2020 Sep 14;21(Suppl 11):294. doi: 10.1186/s12859-020-03616-0. BMC Bioinformatics. 2020. PMID: 32921315 Free PMC article.
-
Finding regulatory elements and regulatory motifs: a general probabilistic framework.BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-8-S6-S4. BMC Bioinformatics. 2007. PMID: 17903285 Free PMC article. Review.
-
Optimizing substitution matrix choice and gap parameters for sequence alignment.BMC Bioinformatics. 2009 Dec 2;10:396. doi: 10.1186/1471-2105-10-396. BMC Bioinformatics. 2009. PMID: 19954534 Free PMC article.
-
Homology-extended sequence alignment.Nucleic Acids Res. 2005 Feb 7;33(3):816-24. doi: 10.1093/nar/gki233. Print 2005. Nucleic Acids Res. 2005. PMID: 15699183 Free PMC article.
-
On the reliability and the limits of inference of amino acid sequence alignments.Bioinformatics. 2022 Jun 24;38(Suppl 1):i255-i263. doi: 10.1093/bioinformatics/btac247. Bioinformatics. 2022. PMID: 35758808 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources