Separation of phylogenetic and functional associations in biological sequences by using the parametric bootstrap
- PMID: 10725404
- PMCID: PMC16231
- DOI: 10.1073/pnas.97.7.3288
Abstract
Quantitative analyses of biological sequences generally proceed under the assumption that individual DNA or protein sequence elements vary independently. However, this assumption is not biologically realistic, because sequence elements often covary as a result of common ancestry and of structural or functional constraints. We calculated intersite associations among aligned protein sequences by using mutual information. To discriminate associations resulting from common ancestry from those resulting from structural or functional constraints, we used a parametric bootstrap algorithm to construct replicate data sets. These replicate data sets are expected to contain intersite associations that result solely from phylogeny. By comparing the distribution of our association statistic for the replicate data against the value calculated for the empirical data, we were able to assign a probability that two sites covaried because of structural or functional constraint rather than phylogeny. We tested our method by using an alignment of 237 basic helix-loop-helix (bHLH) protein domains. Comparison of our results against a solved three-dimensional structure confirmed the identification of several sites important to the function and structure of the bHLH domain. This analytical procedure has broad utility as a first step in identifying sites that are important to biological macromolecular structure and function when a solved structure is unavailable.
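The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the replicate alignments have already been simulated along a phylogeny by some external parametric bootstrap step, which is not shown. The function names `mutual_information`, `column`, and `bootstrap_p_value` are hypothetical, and the add-one correction in the empirical probability is an assumption of this sketch rather than something the abstract specifies.

```python
import math
from collections import Counter


def mutual_information(col_i, col_j):
    """Mutual information (in bits) between two alignment columns.

    col_i and col_j hold the residues observed at two sites,
    one entry per sequence, in the same sequence order.
    """
    n = len(col_i)
    if n == 0 or n != len(col_j):
        raise ValueError("columns must be non-empty and of equal length")
    count_i = Counter(col_i)
    count_j = Counter(col_j)
    count_ij = Counter(zip(col_i, col_j))
    mi = 0.0
    for (a, b), n_ab in count_ij.items():
        # p(a,b) * log2(p(a,b) / (p(a) * p(b))), expressed with raw counts
        mi += (n_ab / n) * math.log2(n_ab * n / (count_i[a] * count_j[b]))
    return mi


def column(alignment, site):
    """Extract one site (column) from an alignment given as a list of strings."""
    return [seq[site] for seq in alignment]


def bootstrap_p_value(observed, replicates, i, j):
    """Compare the observed MI at sites (i, j) with its replicate distribution.

    Returns the observed MI and the fraction of replicates whose MI is at
    least as large. The +1 terms are an assumed small-sample correction.
    """
    obs = mutual_information(column(observed, i), column(observed, j))
    null = [mutual_information(column(r, i), column(r, j)) for r in replicates]
    exceed = sum(1 for value in null if value >= obs)
    return obs, (exceed + 1) / (len(null) + 1)


if __name__ == "__main__":
    # Toy data only: four two-residue "sequences", not real bHLH domains.
    observed = ["AK", "AK", "GE", "GE"]
    replicates = [
        ["AK", "AE", "GK", "GE"],
        ["AK", "AK", "GE", "GE"],
        ["AK", "AK", "GK", "GE"],
    ]
    mi, p = bootstrap_p_value(observed, replicates, 0, 1)
    print(f"observed MI = {mi:.3f} bits, empirical p = {p:.3f}")
```

A small value of the returned fraction means the observed association is rarely matched by replicates that carry phylogenetic signal alone, which is the kind of evidence the abstract attributes to structural or functional constraint. In practice many more replicates than three would be needed for a usable estimate.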