Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites
- PMID: 17339634
- DOI: 10.1093/molbev/msm042
Abstract
Detection of positive Darwinian selection has become ever more important with the rapid growth of genomic data sets. Recent branch-site models of codon substitution account for variation of selective pressure over branches on the tree and across sites in the sequence and provide a means to detect short episodes of molecular adaptation affecting just a few sites. In likelihood ratio tests based on such models, the branches to be tested for positive selection have to be specified a priori. In the absence of a biological hypothesis to designate so-called foreground branches, one may test many branches, but a correction for multiple testing becomes necessary. In this paper, we employ computer simulation to evaluate the performance of 6 multiple test correction procedures when the branch-site models are used to test every branch on the phylogeny for positive selection. Four of the methods control the familywise error rates (FWERs), whereas the other 2 control the false discovery rate (FDR). We found that all correction procedures achieved acceptable FWER except for extremely divergent sequences and serious model violations, when the test may become unreliable. The power of the test to detect positive selection is influenced by the strength of selection and the sequence divergence, with the highest power observed at intermediate divergences. The 4 correction procedures that control the FWER had similar power. We recommend Rom's procedure for its slightly higher power, but the simple Bonferroni correction is useable as well. The 2 correction procedures that control the FDR had slightly more power and also higher FWER. We demonstrate the multiple test procedures by analyzing gene sequences from the extracellular domain of the cluster of differentiation 2 (CD2) gene from 10 mammalian species. Both our simulation and real data analysis suggest that the multiple test procedures are useful when multiple branches have to be tested on the same data set.
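To make the contrast between FWER and FDR control concrete, here is a minimal sketch of two correction procedures applied to one p-value per tested branch. The Bonferroni correction is named in the abstract. The Benjamini-Hochberg step-up procedure is used here only as a familiar example of FDR control and is an assumption, since the abstract does not say which FDR procedures were evaluated. The branch names and p-values are hypothetical and are not results from the paper.

```python
def bonferroni(p_values, alpha=0.05):
    """FWER control: reject H0 for a branch only if p <= alpha / m."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def benjamini_hochberg(p_values, alpha=0.05):
    """FDR control (step-up): find the largest rank k such that
    p_(k) <= (k / m) * alpha, then reject the k smallest p-values."""
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p.
    order = sorted(range(m), key=lambda i: p_values[i])
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            k_max = rank
    reject = [False] * m
    for idx in order[:k_max]:
        reject[idx] = True
    return reject


# Hypothetical p-values, one per branch tested as foreground.
branch_p = {
    "branch_A": 0.0004,
    "branch_B": 0.009,
    "branch_C": 0.02,
    "branch_D": 0.04,
    "branch_E": 0.30,
    "branch_F": 0.75,
}

names = list(branch_p)
p_vals = [branch_p[n] for n in names]

bonf = bonferroni(p_vals)
bh = benjamini_hochberg(p_vals)

for name, p, rb, rh in zip(names, p_vals, bonf, bh):
    print(f"{name}: p={p:<7} Bonferroni={rb!s:<5} BH={rh}")
```

With these made-up inputs, Bonferroni rejects only branch_A, while Benjamini-Hochberg rejects branch_A, branch_B, and branch_C. This matches the general pattern the abstract describes, in which FDR-controlling procedures have somewhat more power at the cost of a higher familywise error rate.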