Comparative Study
Genetics. 2003 Jul;164(3):1229-36.
doi: 10.1093/genetics/164.3.1229.

Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites

Maria Anisimova et al. Genetics. 2003 Jul.

Abstract

Maximum-likelihood methods based on models of codon substitution that account for heterogeneous selective pressures across sites have proved powerful in detecting positive selection in protein-coding DNA sequences. These methods are phylogeny based and do not account for the effects of recombination. When recombination occurs, as in population data, no single tree topology can describe the evolutionary history of the whole sequence. This violation of assumptions raises serious concerns about the likelihood method for detecting positive selection. Here we use computer simulation to evaluate the reliability of the likelihood-ratio test (LRT) for positive selection in the presence of recombination. We examine three tests based on different models of variable selective pressures among sites. Sequences are simulated using a coalescent model with recombination and analyzed using codon-based likelihood models that ignore recombination. We find that the LRT is robust to low levels of recombination (fewer than three recombination events in the history of a sample of 10 sequences). However, at higher levels of recombination, the type I error rate can be as high as 90%, especially when the null model in the LRT is unrealistic, and the test often mistakes recombination for evidence of positive selection. The test that compares the more realistic models M7 (beta) and M8 (beta and omega) is more robust to recombination. Here the null model M7 allows the omega ratio (omega = dN/dS) to vary between 0 and 1 across sites (and so does not account for positive selection), whereas the alternative model M8 adds a discrete class of sites whose omega can be estimated to be >1 (and thus accounts for positive selection). Identification of sites under positive selection by the empirical Bayes method appears to be less affected by recombination than the LRT.
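As an illustration of the M7 versus M8 comparison described in the abstract, the following is a minimal sketch of a likelihood-ratio test and of an empirical type I error rate computed over simulated replicates. It is not the authors' code. It assumes the conventional chi-square approximation with 2 degrees of freedom (M8 has two more free parameters than M7), and the function names and the log-likelihood values in the usage note are hypothetical. In practice the log-likelihoods would come from fitting both models to the same alignment.

```python
import math


def lrt_statistic(lnl_null, lnl_alt):
    """Return 2 * (lnL_alt - lnL_null), floored at 0 for nested models."""
    return max(0.0, 2.0 * (lnl_alt - lnl_null))


def chi2_sf_df2(x):
    """Upper-tail probability of a chi-square variable with 2 degrees of
    freedom; for df = 2 this has the closed form exp(-x / 2)."""
    return math.exp(-x / 2.0)


def lrt_m7_m8(lnl_m7, lnl_m8, alpha=0.05):
    """Compare M7 (null) with M8 (alternative).

    Assumption: the statistic is referred to a chi-square distribution
    with 2 degrees of freedom. Returns (statistic, p_value, reject_null).
    """
    stat = lrt_statistic(lnl_m7, lnl_m8)
    p_value = chi2_sf_df2(stat)
    return stat, p_value, p_value < alpha


def type_one_error_rate(rejections):
    """Fraction of replicates simulated WITHOUT positive selection in which
    the LRT nevertheless rejected the null model."""
    rejections = list(rejections)
    if not rejections:
        raise ValueError("need at least one replicate")
    return sum(1 for r in rejections if r) / len(rejections)
```

For example, with hypothetical log-likelihoods `lnl_m7 = -1000.0` and `lnl_m8 = -995.0`, `lrt_m7_m8` gives a statistic of 10.0 and rejects M7 at the 5% level. Passing the per-replicate rejection flags from data simulated with no positive selection to `type_one_error_rate` gives the quantity that the abstract reports can reach 90% under high recombination.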
