False-positive selection identified by ML-based methods: examples from the Sig1 gene of the diatom Thalassiosira weissflogii and the tax gene of a human T-cell lymphotropic virus
- PMID: 15014169
- DOI: 10.1093/molbev/msh098
False-positive selection identified by ML-based methods: examples from the Sig1 gene of the diatom Thalassiosira weissflogii and the tax gene of a human T-cell lymphotropic virus
Abstract
Sexually induced gene 1 (Sig1) in the centric diatom Thalassiosira weissflogii is considered to encode a gamete recognition protein. Sorhannus (2003) analyzed nucleotide sequences of Sig1 using parsimony analysis and the maximum-likelihood (ML)-based Bayesian method for inferring positive selection at single amino acid sites and reported that positively selected sites were detected by the latter method but not by the former. He then concluded that for this type of study, the ML-based method is more reliable than parsimony analysis. Here we show that his results apparently represent false-positive cases of the ML-based method and that there is no solid evidence that this gene contains positively selected sites. We further demonstrate that in the tax gene of human T-cell lymphotropic virus type I (HTLV-I), all codon sites, including invariable sites, can be inferred as positively selected sites by the ML-based method. These observations indicate that the ML-based method may produce many false-positive sites. One of the main reasons for the occurrence of false positives is that in the ML-based method, codon sites are grouped into several categories, with different nonsynonymous/synonymous rate ratios (omegas), on a purely statistical basis, and positive selection is inferred indirectly by examining whether the average omega for each category is greater than 1. In parsimony analysis, however, the evolutionary change of nucleotides at each codon site is examined. For this reason, parsimony-based methods rarely produce false positives and are safer than ML-based methods for detecting positive selection at individual codon sites, although a large number of sequences are necessary.
Similar articles
-
The effect of positive selection on a sexual reproduction gene in Thalassiosira weissflogii (Bacillariophyta): results obtained from maximum-likelihood and parsimony-based methods.Mol Biol Evol. 2003 Aug;20(8):1326-8. doi: 10.1093/molbev/msg145. Epub 2003 May 30. Mol Biol Evol. 2003. PMID: 12777506
-
Statistical properties of the methods for detecting positively selected amino acid sites.Gene. 2006 Jan 3;365:125-9. doi: 10.1016/j.gene.2005.09.002. Epub 2005 Oct 26. Gene. 2006. PMID: 16256279
-
Likelihood-based clustering (LiBaC) for codon models, a method for grouping sites according to similarities in the underlying process of evolution.Mol Biol Evol. 2008 Sep;25(9):1995-2007. doi: 10.1093/molbev/msn145. Epub 2008 Jun 26. Mol Biol Evol. 2008. PMID: 18586695
-
Phylogenetic analysis in molecular evolutionary genetics.Annu Rev Genet. 1996;30:371-403. doi: 10.1146/annurev.genet.30.1.371. Annu Rev Genet. 1996. PMID: 8982459 Review.
-
Clinically meaningful change: false positives in the estimation of individual change.Psychol Assess. 2014 Jun;26(2):370-83. doi: 10.1037/a0035419. Epub 2014 Feb 3. Psychol Assess. 2014. PMID: 24490678 Review.
Cited by
-
Global Diversity within and between Human Herpesvirus 1 and 2 Glycoproteins.J Virol. 2015 Aug;89(16):8206-18. doi: 10.1128/JVI.01302-15. Epub 2015 May 27. J Virol. 2015. PMID: 26018161 Free PMC article.
-
Molecular evolution of glutathione S-transferases in the genus Drosophila.Genetics. 2007 Nov;177(3):1363-75. doi: 10.1534/genetics.107.075838. Genetics. 2007. PMID: 18039872 Free PMC article.
-
Pervasive adaptive evolution in primate seminal proteins.PLoS Genet. 2005 Sep;1(3):e35. doi: 10.1371/journal.pgen.0010035. PLoS Genet. 2005. PMID: 16170411 Free PMC article.
-
Rapid adaptive evolution of the tumor suppressor gene Pten in an insect lineage.J Mol Evol. 2006 Jun;62(6):738-44. doi: 10.1007/s00239-005-0002-x. Epub 2006 Apr 11. J Mol Evol. 2006. PMID: 16612540
-
Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates.BMC Evol Biol. 2019 Jan 14;19(1):22. doi: 10.1186/s12862-018-1326-7. BMC Evol Biol. 2019. PMID: 30642241 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources