Statistical properties of the branch-site test of positive selection
- PMID: 21087944
- DOI: 10.1093/molbev/msq303
Statistical properties of the branch-site test of positive selection
Abstract
The branch-site test is a likelihood ratio test to detect positive selection along prespecified lineages on a phylogeny that affects only a subset of codons in a protein-coding gene, with positive selection indicated by accelerated nonsynonymous substitutions (with ω = d(N)/d(S) > 1). This test may have more power than earlier methods, which average nucleotide substitution rates over sites in the protein and/or over branches on the tree. However, a few recent studies questioned the statistical basis of the test and claimed that the test generated too many false positives. In this paper, we examine the null distribution of the test and conduct a computer simulation to examine the false-positive rate and the power of the test. The results suggest that the asymptotic theory is reliable for typical data sets, and indeed in our simulations, the large-sample null distribution was reliable with as few as 20-50 codons in the alignment. We examined the impact of sequence length, the strength of positive selection, and the proportion of sites under positive selection on the power of the branch-site test. We found that the test was far more powerful in detecting episodic positive selection than branch-based tests, which average substitution rates over all codons in the gene and thus miss the signal when most codons are under strong selective constraint. Recent claims of statistical problems with the branch-site test are due to misinterpretations of simulation results. Our results, as well as previous simulation studies that have demonstrated the robustness of the test, suggest that the branch-site test may be a useful tool for detecting episodic positive selection and for generating biological hypotheses for mutation studies and functional analyses. The test is sensitive to sequence and alignment errors and caution should be exercised concerning its use when data quality is in doubt.
Similar articles
-
The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection.Mol Biol Evol. 2010 Oct;27(10):2257-67. doi: 10.1093/molbev/msq115. Epub 2010 May 5. Mol Biol Evol. 2010. PMID: 20447933
-
Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites.Mol Biol Evol. 2007 May;24(5):1219-28. doi: 10.1093/molbev/msm042. Epub 2007 Mar 5. Mol Biol Evol. 2007. PMID: 17339634
-
Frequent false detection of positive selection by the likelihood method with branch-site models.Mol Biol Evol. 2004 Jul;21(7):1332-9. doi: 10.1093/molbev/msh117. Epub 2004 Mar 10. Mol Biol Evol. 2004. PMID: 15014150
-
The quest for natural selection in the age of comparative genomics.Heredity (Edinb). 2007 Dec;99(6):567-79. doi: 10.1038/sj.hdy.6801052. Epub 2007 Sep 12. Heredity (Edinb). 2007. PMID: 17848974 Review.
-
Models of coding sequence evolution.Brief Bioinform. 2009 Jan;10(1):97-109. doi: 10.1093/bib/bbn049. Epub 2008 Oct 29. Brief Bioinform. 2009. PMID: 18971241 Free PMC article. Review.
Cited by
-
Adaptive Evolution as a Predictor of Species-Specific Innate Immune Response.Mol Biol Evol. 2015 Jul;32(7):1717-29. doi: 10.1093/molbev/msv051. Epub 2015 Mar 10. Mol Biol Evol. 2015. PMID: 25758009 Free PMC article.
-
Neocortical development as an evolutionary platform for intragenomic conflict.Front Neuroanat. 2013 Apr 9;7:2. doi: 10.3389/fnana.2013.00002. eCollection 2013. Front Neuroanat. 2013. PMID: 23576960 Free PMC article.
-
Rapidly evolving changes and gene loss associated with host switching in Corynebacterium pseudotuberculosis.PLoS One. 2018 Nov 12;13(11):e0207304. doi: 10.1371/journal.pone.0207304. eCollection 2018. PLoS One. 2018. PMID: 30419061 Free PMC article.
-
Orthologous Divergence and Paralogous Anticonvergence in Molecular Evolution of Triplicated Green Opsin Genes in Medaka Fish, Genus Oryzias.Genome Biol Evol. 2020 Jun 1;12(6):911-923. doi: 10.1093/gbe/evaa111. Genome Biol Evol. 2020. PMID: 32467976 Free PMC article.
-
Long-lived rodents reveal signatures of positive selection in genes associated with lifespan.PLoS Genet. 2018 Mar 23;14(3):e1007272. doi: 10.1371/journal.pgen.1007272. eCollection 2018 Mar. PLoS Genet. 2018. PMID: 29570707 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources