Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Oct;18(10):636-44.
doi: 10.1089/omi.2014.0073. Epub 2014 Aug 18.

GPCRsort-responding to the next generation sequencing data challenge: prediction of G protein-coupled receptor classes using only structural region lengths

Affiliations

GPCRsort-responding to the next generation sequencing data challenge: prediction of G protein-coupled receptor classes using only structural region lengths

Mehmet Emre Sahin et al. OMICS. 2014 Oct.

Abstract

Next generation sequencing (NGS) and the attendant data deluge are increasingly impacting molecular life sciences research. Chief among the challenges and opportunities is to enhance our ability to classify molecular target data into meaningful and cohesive systematic nomenclature. In this vein, the G protein-coupled receptors (GPCRs) are the largest and most divergent receptor family that plays a crucial role in a host of pathophysiological pathways. For the pharmaceutical industry, GPCRs are a major drug target and it is estimated that 60%-70% of all medicines in development today target GPCRs. Hence, they require an efficient and rapid classification to group the members according to their functions. In addition to NGS and the Big Data challenge we currently face, an emerging number of orphan GPCRs further demand for novel, rapid, and accurate classification of the receptors since the current classification tools are inadequate and slow. This study presents the development of a new classification tool for GPCRs using the structural features derived from their primary sequences: GPCRsort. Comparison experiments with the current known GPCR classification techniques showed that GPCRsort is able to rapidly (in the order of minutes) classify uncharacterized GPCRs with 97.3% accuracy, whereas the best available technique's accuracy is 90.7%. GPCRsort is available in the public domain for postgenomics life scientists engaged in GPCR research with NGS: http://bioserver.ceng.metu.edu.tr/GPCRSort .

PubMed Disclaimer

Figures

<b>FIG. 1.</b>
FIG. 1.
Schematic diagram of a GPCR.
<b>FIG. 2.</b>
FIG. 2.
Comparison of running times of methods. GPCRBind running time is taken from Cobanoglu et al., (2011).

References

    1. Bhasin M, and Raghava GP. (2004). GPCRpred: An SVM-based method for prediction of families and subfamilies of G-protein coupled receptors. Nucleic Acids Res 32, 383–389 - PMC - PubMed
    1. Breiman L. (1996). Bagging Predictors. Mach Learn 24, 123–140
    1. Breiman L. (2001). Random Forests. Mach Learn 45, 5–32
    1. Cardoso JC, Pinto VC, Vieira FA, Clark MS, and Power DM. (2006). Evolution of secretin family GPCR members in the metazoa. BMC Evol Biol 6, 108. - PMC - PubMed
    1. Caruana R, Karampatziakis N, and Yessenalina A. (2008). An empirical evaluation of supervised learning in high dimensions. Proc Int Conf Mach Learn 25, 96–103

Publication types

Substances

LinkOut - more resources