FLU, an amino acid substitution model for influenza proteins
- PMID: 20384985
- PMCID: PMC2873421
- DOI: 10.1186/1471-2148-10-99
FLU, an amino acid substitution model for influenza proteins
Abstract
Background: The amino acid substitution model is the core component of many protein analysis systems such as sequence similarity search, sequence alignment, and phylogenetic inference. Although several general amino acid substitution models have been estimated from large and diverse protein databases, they remain inappropriate for analyzing specific species, e.g., viruses. Emerging epidemics of influenza viruses raise the need for comprehensive studies of these dangerous viruses. We propose an influenza-specific amino acid substitution model to enhance the understanding of the evolution of influenza viruses.
Results: A maximum likelihood approach was applied to estimate an amino acid substitution model (FLU) from approximately 113,000 influenza protein sequences, consisting of approximately 20 million residues. FLU outperforms 14 widely used models in constructing maximum likelihood phylogenetic trees for the majority of influenza protein alignments. On average, FLU gains approximately 42 log likelihood points with an alignment of 300 sites. Moreover, topologies of trees constructed using FLU and other models are frequently different. FLU does indeed have an impact on likelihood improvement as well as tree topologies. It was implemented in PhyML and can be downloaded from ftp://ftp.sanger.ac.uk/pub/1000genomes/lsq/FLU or included in PhyML 3.0 server at http://www.atgc-montpellier.fr/phyml/.
Conclusions: FLU should be useful for any influenza protein analysis system which requires an accurate description of amino acid substitutions.
Figures





Similar articles
-
Superiority of a mechanistic codon substitution model even for protein sequences in phylogenetic analysis.BMC Evol Biol. 2013 Nov 21;13:257. doi: 10.1186/1471-2148-13-257. BMC Evol Biol. 2013. PMID: 24256155 Free PMC article.
-
An improved general amino acid replacement matrix.Mol Biol Evol. 2008 Jul;25(7):1307-20. doi: 10.1093/molbev/msn067. Epub 2008 Mar 26. Mol Biol Evol. 2008. PMID: 18367465
-
Phylogenetic mixture models for proteins.Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3965-76. doi: 10.1098/rstb.2008.0180. Philos Trans R Soc Lond B Biol Sci. 2008. PMID: 18852096 Free PMC article.
-
QMaker: Fast and Accurate Method to Estimate Empirical Models of Protein Evolution.Syst Biol. 2021 Aug 11;70(5):1046-1060. doi: 10.1093/sysbio/syab010. Syst Biol. 2021. PMID: 33616668 Free PMC article.
-
Modeling protein evolution with several amino acid replacement matrices depending on site rates.Mol Biol Evol. 2012 Oct;29(10):2921-36. doi: 10.1093/molbev/mss112. Epub 2012 Apr 6. Mol Biol Evol. 2012. PMID: 22491036
Cited by
-
Simulation data for the estimation of numerical constants for approximating pairwise evolutionary distances between amino acid sequences.Data Brief. 2019 Jul 8;25:104212. doi: 10.1016/j.dib.2019.104212. eCollection 2019 Aug. Data Brief. 2019. PMID: 31440543 Free PMC article.
-
Evolutionary Insights from Association Rule Mining of Co-Occurring Mutations in Influenza Hemagglutinin and Neuraminidase.Viruses. 2024 Sep 25;16(10):1515. doi: 10.3390/v16101515. Viruses. 2024. PMID: 39459850 Free PMC article.
-
Superiority of a mechanistic codon substitution model even for protein sequences in phylogenetic analysis.BMC Evol Biol. 2013 Nov 21;13:257. doi: 10.1186/1471-2148-13-257. BMC Evol Biol. 2013. PMID: 24256155 Free PMC article.
-
Viral suppressors of the RIG-I-mediated interferon response are pre-packaged in influenza virions.Nat Commun. 2014 Dec 9;5:5645. doi: 10.1038/ncomms6645. Nat Commun. 2014. PMID: 25487526 Free PMC article.
-
A Guide to Phylogenomic Inference.Methods Mol Biol. 2024;2802:267-345. doi: 10.1007/978-1-0716-3838-5_11. Methods Mol Biol. 2024. PMID: 38819564
References
-
- Felsenstein J. Infering Phylogenies. Sunderland, Massachusetts, US: Sinauer Associates; 2004.
-
- Ziheng Y. Computational Molecular Evolution. 1. Oxford, UK: Oxford University Press; 2006.
-
- Opperdoes FR. In: The Phylogenetics Handbook A Practical Approach to DNA and Protein Phylogeny. Salemi M, Vandamme AM, editor. Cambridge: Cambridge University Press; 2003. Phylogenetic analysis using protein sequences; pp. 207–235.
-
- Setubal C, Meidanis J. Introduction to Computational Molecular Biology. 1. Boston, Massachusetts, US: PWS Publishing; 1997.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources