An interactive visualization tool to explore the biophysical properties of amino acids and their contribution to substitution matrices
- PMID: 16817972
- PMCID: PMC1524819
- DOI: 10.1186/1471-2105-7-329
An interactive visualization tool to explore the biophysical properties of amino acids and their contribution to substitution matrices
Abstract
Background: Quantitative descriptions of amino acid similarity, expressed as probabilistic models of evolutionary interchangeability, are central to many mainstream bioinformatic procedures such as sequence alignment, homology searching, and protein structural prediction. Here we present a web-based, user-friendly analysis tool that allows any researcher to quickly and easily visualize relationships between these bioinformatic metrics and to explore their relationships to underlying indices of amino acid molecular descriptors.
Results: We demonstrate the three fundamental types of question that our software can address by taking as a specific example the connections between 49 measures of amino acid biophysical properties (e.g., size, charge and hydrophobicity), a generalized model of amino acid substitution (as represented by the PAM74-100 matrix), and the mutational distance that separates amino acids within the standard genetic code (i.e., the number of point mutations required for interconversion during protein evolution). We show that our software allows a user to recapture the insights from several key publications on these topics in just a few minutes.
Conclusion: Our software facilitates rapid, interactive exploration of three interconnected topics: (i) the multidimensional molecular descriptors of the twenty proteinaceous amino acids, (ii) the correlation of these biophysical measurements with observed patterns of amino acid substitution, and (iii) the causal basis for differences between any two observed patterns of amino acid substitution. This software acts as an intuitive bioinformatic exploration tool that can guide more comprehensive statistical analyses relating to a diverse array of specific research questions.
Figures




Similar articles
-
Periodic distributions of hydrophobic amino acids allows the definition of fundamental building blocks to align distantly related proteins.Proteins. 2007 May 15;67(3):695-708. doi: 10.1002/prot.21319. Proteins. 2007. PMID: 17299747
-
PR2ALIGN: a stand-alone software program and a web-server for protein sequence alignment using weighted biochemical properties of amino acids.BMC Res Notes. 2015 May 7;8:187. doi: 10.1186/s13104-015-1152-6. BMC Res Notes. 2015. PMID: 25947299 Free PMC article.
-
The ranging of amino acids substitution matrices of various types in accordance with the alignment accuracy criterion.BMC Bioinformatics. 2020 Sep 14;21(Suppl 11):294. doi: 10.1186/s12859-020-03616-0. BMC Bioinformatics. 2020. PMID: 32921315 Free PMC article.
-
COPid: composition based protein identification.In Silico Biol. 2008;8(2):121-8. In Silico Biol. 2008. PMID: 18928200
-
State-of-the-art bioinformatics protein structure prediction tools (Review).Int J Mol Med. 2011 Sep;28(3):295-310. doi: 10.3892/ijmm.2011.705. Epub 2011 May 23. Int J Mol Med. 2011. PMID: 21617841 Review.
Cited by
-
Insights into Protein Sequence and Structure-Derived Features Mediating 3D Domain Swapping Mechanism using Support Vector Machine Based Approach.Bioinform Biol Insights. 2010 Jun 17;4:33-42. doi: 10.4137/bbi.s4464. Bioinform Biol Insights. 2010. PMID: 20634983 Free PMC article.
-
Predicting Flavonoid UGT Regioselectivity.Adv Bioinformatics. 2011;2011:506583. doi: 10.1155/2011/506583. Epub 2011 Jun 30. Adv Bioinformatics. 2011. PMID: 21747849 Free PMC article.
-
Computational approach to unravel the impact of missense mutations of proteins (D2HGDH and IDH2) causing D-2-hydroxyglutaric aciduria 2.Metab Brain Dis. 2018 Oct;33(5):1699-1710. doi: 10.1007/s11011-018-0278-3. Epub 2018 Jul 9. Metab Brain Dis. 2018. PMID: 29987523
-
Mu-8: visualizing differences between proteins and their families.BMC Proc. 2014 Aug 28;8(Suppl 2 Proceedings of the 3rd Annual Symposium on Biologica):S5. doi: 10.1186/1753-6561-8-S2-S5. eCollection 2014. BMC Proc. 2014. PMID: 25237392 Free PMC article.
-
Bioinformatics classification of mutations in patients with Mucopolysaccharidosis IIIA.Metab Brain Dis. 2019 Dec;34(6):1577-1594. doi: 10.1007/s11011-019-00465-6. Epub 2019 Aug 5. Metab Brain Dis. 2019. PMID: 31385193 Free PMC article.
References
-
- Henikoff S, Henikoff JG. Performance evaluation of amino acid substitution matrices. Proteins. 1993;17:49–61. - PubMed
-
- Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ. Multiple sequence alignment with Clustal X. Trends Biochem Sci. 1998;23:403–405. - PubMed
-
- Tress M, Ezkurdia I, Grana O, Lopez G, Valencia A. Assessment of predictions submitted for the CASP6 comparative modelling category. Proteins. 2005. - PubMed
-
- Vilim RB, Cunningham RM, Lu B, Kheradpour P, Stevens FJ. Fold-specific substitution matrices for protein classification. Bioinformatics. 2004;20:847–853. - PubMed
-
- Teodorescu O, Galor T, Pillardy J, Elber R. Enriching the sequence substitution matrix by structural information. Proteins. 2004;54:41–48. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources