A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
- PMID: 17880712
- PMCID: PMC2216042
- DOI: 10.1186/1471-2105-8-351
A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
Abstract
Background: Odorant binding proteins (OBPs) are believed to shuttle odorants from the environment to the underlying odorant receptors, for which they could potentially serve as odorant presenters. Although several sequence based search methods have been exploited for protein family prediction, less effort has been devoted to the prediction of OBPs from sequence data and this area is more challenging due to poor sequence identity between these proteins.
Results: In this paper, we propose a new algorithm that uses Regularized Least Squares Classifier (RLSC) in conjunction with multiple physicochemical properties of amino acids to predict odorant-binding proteins. The algorithm was applied to the dataset derived from Pfam and GenDiS database and we obtained overall prediction accuracy of 97.7% (94.5% and 98.4% for positive and negative classes respectively).
Conclusion: Our study suggests that RLSC is potentially useful for predicting the odorant binding proteins from sequence-derived properties irrespective of sequence similarity. Our method predicts 92.8% of 56 odorant binding proteins non-homologous to any protein in the swissprot database and 97.1% of the 414 independent dataset proteins, suggesting the usefulness of RLSC method for facilitating the prediction of odorant binding proteins from sequence information.
Figures
Similar articles
-
Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.Bioinformatics. 2007 Dec 1;23(23):3147-54. doi: 10.1093/bioinformatics/btm505. Epub 2007 Oct 17. Bioinformatics. 2007. PMID: 17942444
-
High-throughput identification of interacting protein-protein binding sites.BMC Bioinformatics. 2007 Jun 27;8:223. doi: 10.1186/1471-2105-8-223. BMC Bioinformatics. 2007. PMID: 17594507 Free PMC article.
-
Quantitative prediction of mouse class I MHC peptide binding affinity using support vector machine regression (SVR) models.BMC Bioinformatics. 2006 Mar 31;7:182. doi: 10.1186/1471-2105-7-182. BMC Bioinformatics. 2006. PMID: 16579851 Free PMC article.
-
Odorant-binding proteins: structural aspects.Ann N Y Acad Sci. 1998 Nov 30;855:281-93. doi: 10.1111/j.1749-6632.1998.tb10584.x. Ann N Y Acad Sci. 1998. PMID: 9929622 Review.
-
Analyzing molecular interactions.Curr Protoc Bioinformatics. 2003 May;Chapter 8:Unit8.1. doi: 10.1002/0471250953.bi0801s01. Curr Protoc Bioinformatics. 2003. PMID: 18428708 Review.
Cited by
-
DOR - a Database of Olfactory Receptors - Integrated Repository for Sequence and Secondary Structural Information of Olfactory Receptors in Selected Eukaryotic Genomes.Bioinform Biol Insights. 2014 Jun 12;8:147-58. doi: 10.4137/BBI.S14858. eCollection 2014. Bioinform Biol Insights. 2014. PMID: 25002814 Free PMC article.
-
Prediction of lysine ubiquitylation with ensemble classifier and feature selection.Int J Mol Sci. 2011;12(12):8347-61. doi: 10.3390/ijms12128347. Epub 2011 Nov 28. Int J Mol Sci. 2011. PMID: 22272076 Free PMC article.
-
Insights into Protein Sequence and Structure-Derived Features Mediating 3D Domain Swapping Mechanism using Support Vector Machine Based Approach.Bioinform Biol Insights. 2010 Jun 17;4:33-42. doi: 10.4137/bbi.s4464. Bioinform Biol Insights. 2010. PMID: 20634983 Free PMC article.
-
Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.Nucleic Acids Res. 2008 May;36(9):3025-30. doi: 10.1093/nar/gkn159. Epub 2008 Apr 4. Nucleic Acids Res. 2008. PMID: 18390576 Free PMC article.
-
Large-scale identification of odorant-binding proteins and chemosensory proteins from expressed sequence tags in insects.BMC Genomics. 2009 Dec 25;10:632. doi: 10.1186/1471-2164-10-632. BMC Genomics. 2009. PMID: 20034407 Free PMC article.
References
-
- Buck L, Axel R. A novel multigene family may encode odorant receptors: a molecular basis for odor recognition. Cell. 1991;65:175–187. - PubMed
-
- Ache BW. Towards a common strategy for transducing olfactory information. Semin Cell Biol. 1994;5:55–63. - PubMed
-
- Hildebrand JG, Shepherd GM. Mechanisms of olfactory discrimination: Converging evidence for common principles across phyla. Ann Rev Neurosci. 1997;20:595–631. - PubMed
-
- Pelosi P. Perireceptor events in olfaction. J Neurobiol. 1996;30:3–19. - PubMed
-
- Vogt RG, Riddiford LM. Pheromone binding and inactivation by moth antennae. Nature. 1981;293:161–163. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources