Inferring Indel Parameters using a Simulation-based Approach
- PMID: 26537226
- PMCID: PMC4700945
- DOI: 10.1093/gbe/evv212
Inferring Indel Parameters using a Simulation-based Approach
Abstract
In this study, we present a novel methodology to infer indel parameters from multiple sequence alignments (MSAs) based on simulations. Our algorithm searches for the set of evolutionary parameters describing indel dynamics which best fits a given input MSA. In each step of the search, we use parametric bootstraps and the Mahalanobis distance to estimate how well a proposed set of parameters fits input data. Using simulations, we demonstrate that our methodology can accurately infer the indel parameters for a large variety of plausible settings. Moreover, using our methodology, we show that indel parameters substantially vary between three genomic data sets: Mammals, bacteria, and retroviruses. Finally, we demonstrate how our methodology can be used to simulate MSAs based on indel parameters inferred from real data sets.
Keywords: Mahalanobis distance; alignments; indels; phylogeny; simulations.
© The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Figures
References
-
- Abhiman S, Daub CO, Sonnhammer EL. 2006. Prediction of function divergence in protein families using the substitution rate variation parameter alpha. Mol Biol Evol. 23:1406–1413. - PubMed
-
- Barry D, Hartigan JA. 1987. Asynchronous distance between homologous DNA sequences. Biometrics 43:261–276. - PubMed
-
- Bay RA, Bielawski JP. 2011. Recombination detection under evolutionary scenarios relevant to functional divergence. J Mol Evol. 73:273–286. - PubMed
-
- Bay RA, Bielawski JP. 2013. Inference of functional divergence among proteins when the evolutionary process is non-stationary. J Mol Evol. 76:205–215. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
