PLoS One. 2012;7(2):e30869.

doi: 10.1371/journal.pone.0030869. Epub 2012 Feb 21.

iNR-PhysChem: a sequence-based predictor for identifying nuclear receptors and their subfamilies via physical-chemical property matrix

Xuan Xiao¹, Pu Wang, Kuo-Chen Chou

Affiliation

¹ Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen, China. xiaoxuan0326@yahoo.com.cn

PMID: 22363503
PMCID: PMC3283608
DOI: 10.1371/journal.pone.0030869

Abstract

Nuclear receptors (NRs) form a family of ligand-activated transcription factors that regulate a wide variety of biological processes, such as homeostasis, reproduction, development, and metabolism. The human genome contains 48 genes encoding NRs. These receptors have become one of the most important targets for therapeutic drug development. According to their different action mechanisms or functions, NRs have been classified into seven subfamilies. With the avalanche of protein sequences generated in the postgenomic age, we face the following challenging problems. Given an uncharacterized protein sequence, how can we identify whether it is a nuclear receptor? If it is, to which subfamily does it belong? To address these problems, we developed a predictor called iNR-PhysChem, in which the protein samples were expressed by a novel mode of pseudo amino acid composition (PseAAC) whose components were derived from a physical-chemical matrix via a series of auto-covariance and cross-covariance transformations. The overall success rate achieved by iNR-PhysChem was over 98% in identifying NRs or non-NRs, and over 92% in identifying NRs among the following seven subfamilies: NR1 (thyroid hormone like), NR2 (HNF4-like), NR3 (estrogen like), NR4 (nerve growth factor IB-like), NR5 (fushi tarazu-F1 like), NR6 (germ cell nuclear factor like), and NR0 (knirps like). These rates were derived by jackknife tests on a stringent benchmark dataset in which no protein sequence has ≥60% pairwise sequence identity to any other in the same subset. As a user-friendly web-server, iNR-PhysChem is freely accessible to the public at either http://www.jci-bioinfo.cn/iNR-PhysChem or http://icpr.jci.edu.cn/bioinfo/iNR-PhysChem. A step-by-step guide is also provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics involved in developing the predictor. It is anticipated that iNR-PhysChem may become a useful high-throughput tool for both basic research and drug design.
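The auto-covariance and cross-covariance transformations mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes a common textbook formulation (mean-centred products averaged over all position pairs separated by a given lag); the exact definitions, normalization, and physical-chemical properties used by iNR-PhysChem are given in the full paper and may differ. The function names and the toy numbers are hypothetical and serve illustration only.

```python
from statistics import fmean


def auto_covariance(profile, lag):
    """Coupling of one property profile with itself at the given lag."""
    n = len(profile)
    if not 0 < lag < n:
        raise ValueError("lag must satisfy 0 < lag < len(profile)")
    mean = fmean(profile)
    total = sum((profile[i] - mean) * (profile[i + lag] - mean)
                for i in range(n - lag))
    return total / (n - lag)


def cross_covariance(profile_a, profile_b, lag):
    """Coupling between two different property profiles at the given lag."""
    n = len(profile_a)
    if len(profile_b) != n:
        raise ValueError("profiles must have the same length")
    if not 0 < lag < n:
        raise ValueError("lag must satisfy 0 < lag < len(profile)")
    mean_a, mean_b = fmean(profile_a), fmean(profile_b)
    total = sum((profile_a[i] - mean_a) * (profile_b[i + lag] - mean_b)
                for i in range(n - lag))
    return total / (n - lag)


def covariance_features(matrix, max_lag):
    """Flatten all auto- and cross-covariance terms into one feature vector.

    `matrix` holds one row per property and one column per residue.
    The result has max_lag * N * N entries for N properties.
    """
    features = []
    for lag in range(1, max_lag + 1):
        for row in matrix:
            features.append(auto_covariance(row, lag))
        for j, row_a in enumerate(matrix):
            for k, row_b in enumerate(matrix):
                if j != k:
                    features.append(cross_covariance(row_a, row_b, lag))
    return features


# Toy matrix: 2 made-up property profiles over a 5-residue sequence.
toy_matrix = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [0.5, -0.5, 0.5, -0.5, 0.5],
]
vector = covariance_features(toy_matrix, max_lag=2)
print(len(vector))
```

Whatever the length of the input sequence, the resulting vector has a fixed length, which is what allows proteins of different lengths to be fed to the same classifier.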

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Figure 1. An illustration to show two types of covariance.**
(a) The auto-covariance refers to the coupling between two subsequences from the same sequence when they are separated by unit. (b) The cross-covariance refers to the coupling between two subsequences from two different sequences, as indicated by the two open curly braces.

**Figure 2. A flowchart to show the prediction process of iNR-PhysChem.**
T1 represents the benchmark dataset for training the 1st-level prediction; T2 represents the benchmark dataset for training the 2nd-level prediction. See the text for further explanation.

**Figure 3. An illustration of the predicted results falling into four different quadrants.**
(I) TP, the true positive quadrant (green), for correct prediction of the positive dataset; (II) FP, the false positive quadrant (red), for incorrect prediction of the negative dataset; (III) TN, the true negative quadrant (blue), for correct prediction of the negative dataset; and (IV) FN, the false negative quadrant (pink), for incorrect prediction of the positive dataset.
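The four quadrants of Figure 3 are the usual starting point for summary scores of a binary predictor. The sketch below computes sensitivity, specificity, accuracy, and the Matthews correlation coefficient from the four counts using their standard definitions. Which of these scores the paper actually reports is not stated on this page, so the selection is an assumption, and the counts in the usage line are made up for illustration.

```python
from math import sqrt


def confusion_metrics(tp, fp, tn, fn):
    """Standard scores derived from the four quadrant counts."""
    total = tp + fp + tn + fn
    # Guard each ratio against an empty denominator.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / total if total else 0.0
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Made-up counts, for illustration only.
scores = confusion_metrics(tp=40, fp=10, tn=45, fn=5)
print(scores)
```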

**Figure 4. A semi-screenshot of the top page of iNR-PhysChem.**
The web-server is at either http://www.jci-bioinfo.cn/iNR-PhysChem or http://icpr.jci.edu.cn/bioinfo/iNR-PhysChem.

**Figure 5. The 3D graph to show the success rates by the 5-fold cross-validation with different values of C and in the SVM engine.**
(a) The results obtained for the 1st-level prediction. (b) The results obtained for the 2nd-level prediction.

Cited by

Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.
Jahandideh S, Srinivasasainagendra V, Zhi D. J Theor Biol. 2012 Nov 7;312:65-75. doi: 10.1016/j.jtbi.2012.07.013. Epub 2012 Aug 3. PMID: 22884576. Free PMC article.
Accurate prediction of nuclear receptors with conjoint triad feature.
Wang H, Hu X. BMC Bioinformatics. 2015 Dec 3;16:402. doi: 10.1186/s12859-015-0828-1. PMID: 26630876. Free PMC article.
iCataly-PseAAC: Identification of Enzymes Catalytic Sites Using Sequence Evolution Information with Grey Model GM (2,1).
Xiao X, Hui MJ, Liu Z, Qiu WR. J Membr Biol. 2015 Dec;248(6):1033-41. doi: 10.1007/s00232-015-9815-8. Epub 2015 Jun 16. PMID: 26077845.
iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.
Liu B, Xu J, Lan X, Xu R, Zhou J, Wang X, Chou KC. PLoS One. 2014 Sep 3;9(9):e106691. doi: 10.1371/journal.pone.0106691. eCollection 2014. PMID: 25184541. Free PMC article.
PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms.
Sengupta K, Saha S, Halder AK, Chatterjee P, Nasipuri M, Basu S, Plewczynski D. Front Genet. 2022 Sep 29;13:969915. doi: 10.3389/fgene.2022.969915. eCollection 2022. PMID: 36246645. Free PMC article.
