GPCR-PEnDB: a database of protein sequences and derived features to facilitate prediction and classification of G protein-coupled receptors
- PMID: 33216895
- PMCID: PMC7678784
- DOI: 10.1093/database/baaa087
GPCR-PEnDB: a database of protein sequences and derived features to facilitate prediction and classification of G protein-coupled receptors
Abstract
G protein-coupled receptors (GPCRs) constitute the largest group of membrane receptor proteins in eukaryotes. Due to their significant roles in various physiological processes such as vision, smell and inflammation, GPCRs are the targets of many prescription drugs. However, the functional and sequence diversity of GPCRs has kept their prediction and classification based on amino acid sequence data as a challenging bioinformatics problem. There are existing computational approaches, mainly using machine learning and statistical methods, to predict and classify GPCRs based on amino acid sequence and sequence derived features. In this paper, we describe a searchable MySQL database, named GPCR-PEnDB (GPCR Prediction Ensemble Database), of confirmed GPCRs and non-GPCRs. It was constructed with the goal of allowing users to conveniently access useful information of GPCRs in a wide range of organisms and to compile reliable training and testing datasets for different combinations of computational tools. This database currently contains 3129 confirmed GPCR and 3575 non-GPCR sequences collected from the UniProtKB/Swiss-Prot protein database, encompassing over 1200 species. The non-GPCR entries include transmembrane proteins for evaluating various prediction programs' abilities to distinguish GPCRs from other transmembrane proteins. Each protein is linked to information about its source organism, classification, sequence lengths and composition, and other derived sequence features. We present examples of using this database along with its graphical user interface, to query for GPCRs with specific sequence properties and to compare the accuracies of five tools for GPCR prediction. This initial version of GPCR-PEnDB will provide a framework for future extensions to include additional sequence and feature data to facilitate the design and assessment of software tools and experimental studies to help understand the functional roles of GPCRs. Database URL: gpcr.utep.edu/database.
© The Author(s) 2020. Published by Oxford University Press.
Figures






Similar articles
-
GPCR-MPredictor: multi-level prediction of G protein-coupled receptors using genetic ensemble.Amino Acids. 2012 May;42(5):1809-23. doi: 10.1007/s00726-011-0902-6. Epub 2011 Apr 20. Amino Acids. 2012. PMID: 21505826
-
An improved classification of G-protein-coupled receptors using sequence-derived features.BMC Bioinformatics. 2010 Aug 9;11:420. doi: 10.1186/1471-2105-11-420. BMC Bioinformatics. 2010. PMID: 20696050 Free PMC article.
-
Bioinformatics tools for predicting GPCR gene functions.Adv Exp Med Biol. 2014;796:205-24. doi: 10.1007/978-94-007-7423-0_10. Adv Exp Med Biol. 2014. PMID: 24158807 Review.
-
A method for the prediction of GPCRs coupling specificity to G-proteins using refined profile Hidden Markov Models.BMC Bioinformatics. 2005 Apr 22;6:104. doi: 10.1186/1471-2105-6-104. BMC Bioinformatics. 2005. PMID: 15847681 Free PMC article.
-
Prediction and targeting of GPCR oligomer interfaces.Prog Mol Biol Transl Sci. 2020;169:105-149. doi: 10.1016/bs.pmbts.2019.11.007. Epub 2020 Jan 6. Prog Mol Biol Transl Sci. 2020. PMID: 31952684 Review.
Cited by
-
Stringent in-silico identification of putative G-protein-coupled receptors (GPCRs) of the entomopathogenic nematode Heterorhabditis bacteriophora.J Nematol. 2023 Nov 23;55(1):20230038. doi: 10.2478/jofnem-2023-0038. eCollection 2023 Feb. J Nematol. 2023. PMID: 38026552 Free PMC article.
-
Rapid transcriptomic and physiological changes in the freshwater pennate diatom Mayamaea pseudoterrestris in response to copper exposure.DNA Res. 2022 Dec 1;29(6):dsac037. doi: 10.1093/dnares/dsac037. DNA Res. 2022. PMID: 36197113 Free PMC article.
-
Chromosome-level assembly of the Phytophthora agathidicida genome reveals adaptation in effector gene families.Front Microbiol. 2022 Nov 2;13:1038444. doi: 10.3389/fmicb.2022.1038444. eCollection 2022. Front Microbiol. 2022. PMID: 36406440 Free PMC article.
-
AnnoPRO: a strategy for protein function annotation based on multi-scale protein representation and a hybrid deep learning of dual-path encoding.Genome Biol. 2024 Feb 1;25(1):41. doi: 10.1186/s13059-024-03166-1. Genome Biol. 2024. PMID: 38303023 Free PMC article.
-
What Makes GPCRs from Different Families Bind to the Same Ligand?Biomolecules. 2022 Jun 21;12(7):863. doi: 10.3390/biom12070863. Biomolecules. 2022. PMID: 35883418 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources