Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Jul 1;32(Web Server issue):W383-9.
doi: 10.1093/nar/gkh416.

GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors

Affiliations

GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors

Manoj Bhasin et al. Nucleic Acids Res. .

Abstract

G-protein coupled receptors (GPCRs) belong to one of the largest superfamilies of membrane proteins and are important targets for drug design. In this study, a support vector machine (SVM)-based method, GPCRpred, has been developed for predicting families and subfamilies of GPCRs from the dipeptide composition of proteins. The dataset used in this study for training and testing was obtained from http://www.soe.ucsc.edu/research/compbio/gpcr/. The method classified GPCRs and non-GPCRs with an accuracy of 99.5% when evaluated using 5-fold cross-validation. The method is further able to predict five major classes or families of GPCRs with an overall Matthew's correlation coefficient (MCC) and accuracy of 0.81 and 97.5% respectively. In recognizing the subfamilies of the rhodopsin-like family, the method achieved an average MCC and accuracy of 0.97 and 97.3% respectively. The method achieved overall accuracy of 91.3% and 96.4% at family and subfamily level respectively when evaluated on an independent/blind dataset of 650 GPCRs. A server for recognition and classification of GPCRs based on multiclass SVMs has been set up at http://www.imtech.res.in/raghava/gpcrpred/. We have also suggested subfamilies for 42 sequences which were previously identified as unclassified ClassA GPCRs. The supplementary information is available at http://www.imtech.res.in/raghava/gpcrpred/info.html.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Diagrammatic view of the three-step strategy used to predict the subfamilies of GPCRs.
Figure 2
Figure 2
The GPCRpred home page and result page. (A) The GPCRpred homepage showing the principle features of the interface. (B) A GPCRpred results page showing a summary of the submitted sequence and final prediction results.

References

    1. Elrod D.W. and Chou,K.C. (2002) A study on the correlation of G-protein-coupled receptor types with amino acid composition. Protein Eng., 15, 713–715. - PubMed
    1. Horn F., Vriend,G. and Cohen,F.E. (2001) Collecting and harvesting biological data: the GPCRDB and NucleaRDB information systems. Nucleic Acids Res., 29, 346–349. - PMC - PubMed
    1. Attwood T.K., Croning,M.D. and Gaulton,A. (2002) Deriving structural and functional insights from a ligand-based hierarchical classification of G protein-coupled receptors. Protein Eng., 15, 7–12. - PubMed
    1. Sadowski M.I. and Parish,J.H. (2003) Automated generation and refinement of protein signatures: case study with G-protein coupled receptors. Bioinformatics. 19, 727–734. - PubMed
    1. Karchin R., Karplus,K. and Haussler,D. (2002) Classifying G-protein coupled receptors with support vector machines. Bioinformatics, 18,147–159. - PubMed

Publication types