2007 Sep 19;8:351.
doi: 10.1186/1471-2105-8-351.

A machine learning approach for the identification of odorant binding proteins from sequence-derived properties

Ganesan Pugalenthi et al. BMC Bioinformatics.

Abstract

Background: Odorant binding proteins (OBPs) are believed to shuttle odorants from the environment to the underlying odorant receptors, for which they could potentially serve as odorant presenters. Although several sequence-based search methods have been exploited for protein family prediction, less effort has been devoted to predicting OBPs from sequence data. This task is more challenging because sequence identity between these proteins is poor.

Results: In this paper, we propose a new algorithm that uses a Regularized Least Squares Classifier (RLSC) in conjunction with multiple physicochemical properties of amino acids to predict odorant-binding proteins. The algorithm was applied to a dataset derived from the Pfam and GenDiS databases and obtained an overall prediction accuracy of 97.7% (94.5% and 98.4% for the positive and negative classes, respectively).
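To make the classifier concrete, here is a minimal sketch of a kernel RLSC in its standard textbook form: solve (K + λnI)c = y on the training set, then classify by the sign of the kernel expansion. The Gaussian kernel, the values of `lam` and `gamma`, the function names, and the toy feature vectors are all assumptions for illustration; the abstract does not state which kernel, parameters, or features the authors used.

```python
import numpy as np

def rbf_kernel(A, B, gamma=0.5):
    # Gaussian kernel between rows of A and rows of B (kernel choice is an assumption).
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def rlsc_fit(X, y, lam=1e-2, gamma=0.5):
    # Regularized least squares: solve (K + lam * n * I) c = y for the coefficients c.
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)
    return np.linalg.solve(K + lam * n * np.eye(n), y)

def rlsc_predict(X_train, c, X_new, gamma=0.5):
    # The sign of the kernel expansion gives the class label (+1 or -1).
    scores = rbf_kernel(X_new, X_train, gamma) @ c
    return np.where(scores >= 0, 1, -1)

# Toy stand-in for sequence-derived feature vectors (hypothetical data, not from the paper).
rng = np.random.default_rng(0)
X_pos = rng.normal(2.0, 0.3, size=(20, 3))    # stand-in for OBP feature vectors
X_neg = rng.normal(-2.0, 0.3, size=(20, 3))   # stand-in for non-OBP feature vectors
X = np.vstack([X_pos, X_neg])
y = np.concatenate([np.ones(20), -np.ones(20)])

c = rlsc_fit(X, y)
labels = rlsc_predict(X, c, X)
```

Training reduces to a single linear solve, so the cost is dominated by the n × n kernel matrix; in practice `lam` and `gamma` would be chosen by cross-validation.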

Conclusion: Our study suggests that RLSC is potentially useful for predicting odorant binding proteins from sequence-derived properties irrespective of sequence similarity. Our method correctly predicts 92.8% of 56 odorant binding proteins that are non-homologous to any protein in the Swiss-Prot database and 97.1% of the 414 proteins in an independent dataset, suggesting that the RLSC method can facilitate the prediction of odorant binding proteins from sequence information.

Figure 1
Description of the feature selection method. Redundant features are sequentially removed until the number of remaining features reaches a pre-defined number.
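The sequential removal described in the caption can be sketched as a backward-elimination loop. The caption does not say how redundancy is measured, so this sketch assumes absolute Pearson correlation between feature columns; the function name `remove_redundant_features` and the tie-breaking rule are hypothetical.

```python
import numpy as np

def remove_redundant_features(X, n_keep):
    """Drop one feature at a time until n_keep columns remain.

    Redundancy is measured by absolute Pearson correlation (an assumption,
    not the paper's stated criterion). Columns are assumed to be non-constant.
    Returns the indices of the retained columns.
    """
    if not 1 <= n_keep <= X.shape[1]:
        raise ValueError("n_keep must be between 1 and the number of features")
    kept = list(range(X.shape[1]))
    while len(kept) > n_keep:
        corr = np.abs(np.corrcoef(X[:, kept], rowvar=False))
        np.fill_diagonal(corr, 0.0)
        # Locate the most strongly correlated pair of remaining features.
        i, j = np.unravel_index(np.argmax(corr), corr.shape)
        # Of that pair, drop the one more correlated with the remaining features overall.
        drop = i if corr[i].mean() >= corr[j].mean() else j
        kept.pop(drop)
    return kept
```

For example, if one column is a near copy of another, calling `remove_redundant_features(X, X.shape[1] - 1)` discards one member of that pair and leaves the unrelated columns in place.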
