Alignment Modulates Ancestral Sequence Reconstruction Accuracy
- PMID: 29618097
- PMCID: PMC5995191
- DOI: 10.1093/molbev/msy055
Alignment Modulates Ancestral Sequence Reconstruction Accuracy
Abstract
Accurate reconstruction of ancestral states is a critical evolutionary analysis when studying ancient proteins and comparing biochemical properties between parental or extinct species and their extant relatives. It relies on multiple sequence alignment (MSA) which may introduce biases, and it remains unknown how MSA methodological approaches impact ancestral sequence reconstruction (ASR). Here, we investigate how MSA methodology modulates ASR using a simulation study of various evolutionary scenarios. We evaluate the accuracy of ancestral protein sequence reconstruction for simulated data and compare reconstruction outcomes using different alignment methods. Our results reveal biases introduced not only by aligner algorithms and assumptions, but also tree topology and the rate of insertions and deletions. Under many conditions we find no substantial differences between the MSAs. However, increasing the difficulty for the aligners can significantly impact ASR. The MAFFT consistency aligners and PRANK variants exhibit the best performance, whereas FSA displays limited performance. We also discover a bias towards reconstructed sequences longer than the true ancestors, deriving from a preference for inferring insertions, in almost all MSA methodological approaches. In addition, we find measures of MSA quality generally correlate highly with reconstruction accuracy. Thus, we show MSA methodological differences can affect the quality of reconstructions and propose MSA methods should be selected with care to accurately determine ancestral states with confidence.
Figures
References
-
- Akanuma S, Iwami S, Yokoi T, Nakamura N, Watanabe H, Yokobori S, Yamagishi A.. 2011. Phylogeny-based design of a B-subunit of DNA gyrase and its ATPase domain using a small set of homologous amino acid sequences. J Mol Biol. 4122: 212–225. - PubMed
-
- Akanuma S, Yokobori S, Nakajima Y, Bessho M, Yamagishi A.. 2015. Robustness of predictions of extremely thermally stable proteins in ancient organisms. Evolution 6911: 2954–2962. - PubMed
-
- Anisimova M, Cannarozzi GM, Liberles DA.. 2010. Finding the balance between the mathematical and biological optima in multiple sequence alignment. Trends Evol Biol. 21: 7–48.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
