Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2006 Apr;23(4):817-27.
doi: 10.1093/molbev/msj102. Epub 2006 Feb 1.

Recombination estimation under complex evolutionary models with the coalescent composite-likelihood method

Affiliations
Comparative Study

Recombination estimation under complex evolutionary models with the coalescent composite-likelihood method

Antonio Carvajal-Rodríguez et al. Mol Biol Evol. 2006 Apr.

Abstract

The composite-likelihood estimator (CLE) of the population recombination rate considers only sites with exactly two alleles under a finite-sites mutation model (McVean, G. A. T., P. Awadalla, and P. Fearnhead. 2002. A coalescent-based method for detecting and estimating recombination from gene sequences. Genetics 160:1231-1241). While in such a model the identity of alleles is not considered, the CLE has been shown to be robust to minor misspecification of the underlying mutational model. However, there are many situations where the putative mutation and demographic history can be quite complex. One good example is rapidly evolving pathogens, like HIV-1. First we evaluated the performance of the CLE and the likelihood permutation test (LPT) under more complex, realistic models, including a general time reversible (GTR) substitution model, rate heterogeneity among sites (Gamma), positive selection, population growth, population structure, and noncontemporaneous sampling. Second, we relaxed some of the assumptions of the CLE allowing for a four-allele, GTR + Gamma model in an attempt to use the data more efficiently. Through simulations and the analysis of real data, we concluded that the CLE is robust to severe misspecifications of the substitution model, but underestimates the recombination rate in the presence of exponential growth, population mixture, selection, or noncontemporaneous sampling. In such cases, the use of more complex models slightly increases performance in some occasions, especially in the case of the LPT. Thus, our results provide for a more robust application of the estimation of recombination rates.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Relative power and false-positive rate of the LPT. Performance of the LPT under the JC2 and JCall is compared to the best results obtained by any of the 14 recombination detection methods evaluated by Posada and Crandall (2001b). (A) Power to detect recombination when θ = 0.01 (B) Power to detect recombination when θ = 0.05 (C) False-positive rate when θ = 0.01 (D) False-positive rate when θ = 0.05. Gamma shape is the shape of the gamma distribution for rate variation among sites (inf = infinity).

References

    1. Anderson JP, Rodrigo AG, Learn GH, Wang Y, Weinstock H, Kalish ML, Robbins KE, Hood L, Mullins JI. Substitution model of sequence evolution for the human immunodeficiency virus type 1 subtype B gp120 gene over the C2-V5 region. J Mol Evol. 2001;53:55–68. - PubMed
    1. Anisimova M, Nielsen R, Yang Z. Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. Genetics. 2003;164:1229–1236. - PMC - PubMed
    1. Awadalla P. The evolutionary genomics of pathogen recombination. Nat Rev Genet. 2003;4:50–60. - PubMed
    1. Barton NH, Etheridge AM. The effect of selection on genealogies. Genetics. 2004;166:1115–1131. - PMC - PubMed
    1. Buendia P, Narasimhan G. MinPD: distance-based phylogenetic analysis and recombination detection of serially-sampled HIV quasispecies. Proceedings of the IEEE Computer Society Bioinformatics Conference; Stanford, Calif. 2004. - PMC - PubMed

Publication types