. 2015 Jul 15;10(7):e0133260.

doi: 10.1371/journal.pone.0133260. eCollection 2015.

SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues

Xiaoxia Yang¹, Jia Wang¹, Jun Sun¹, Rong Liu¹

Affiliations

Affiliation

¹ Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei, People's Republic of China.

PMID: 26176857
PMCID: PMC4503397
DOI: 10.1371/journal.pone.0133260

SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues

Xiaoxia Yang et al. PLoS One. 2015.

. 2015 Jul 15;10(7):e0133260.

doi: 10.1371/journal.pone.0133260. eCollection 2015.

Authors

Xiaoxia Yang¹, Jia Wang¹, Jun Sun¹, Rong Liu¹

Affiliation

¹ Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei, People's Republic of China.

PMID: 26176857
PMCID: PMC4503397
DOI: 10.1371/journal.pone.0133260

Abstract

Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Fig 1. Flowchart of our SNBRFinder algorithm.**
SNBRFinder is a sequence-based hybrid prediction algotirhm comprising a feature-based predictor SNBRFinder^F and a template-based predictor SNBRFinder^T. SNBRFinder^F was built using the support vector machine algorithm whose inputs include comprehensive sequence descriptors and SNBRFinder^T was implemented with the sequence alignment algorithm based on profile hidden Markov models.

**Fig 2. HHscore distribution of optimal templates for DB312 and RB264.**
The HHscore produced by HHblits ranges from 0 to 100% and is used to measure the similarity between the query sequence and its optimal template.

**Fig 3. Comparison of our algorithms and existing approaches on three datasets.**
(A) DB33, (B) RB49, (C) RB44. In (A) and (B), Accuracy1 = (TP+TN)/(TP+TN+FP+FN) and Accuracy2 = (Sensitivity+Specificity)/2. In (C), the AUC values of SNBRFinder^T, HomPRIP, and PRBR are not provided, because the outputs of these three predictors are binary values. With the exception of SNBRFinder and RNABindRPlus (including the component predictors), the evaluation measures of the other approaches are derived from the recent review articles.

**Fig 4. Distribution of the ratio of positive predictions for non-nucleic acid binding and nucleic acid binding proteins.**
(A) NB250 annotated by our predictors trained with DB312, (B) NB250 annotated by our predictors trained with RB264. The solid bars represent the prediction results of non-nucleic acid binding sequences, while the hollow bars represent the prediction results of nucleic acid binding sequences.

**Fig 5. Snapshots of SNBRFinder web server.**
The submission page allows users to input mutiple protein sequences and specify the binding nucleic acid type. When the submitted job is finished, SNBRFinder will demonstrate the prediction results from three perspectives. The first section provides summary information about the query sequence and its optimal template. The second section is graphical representation of the prediction results. The last section includes details about the prediction results such as the outputs from our three predictors.

See this image and copyright information in PMC

Cited by

Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences.
Basu S, Yu J, Kihara D, Kurgan L. Basu S, et al. Brief Bioinform. 2024 Nov 22;26(1):bbaf016. doi: 10.1093/bib/bbaf016. Brief Bioinform. 2024. PMID: 39833102 Free PMC article. Review.
Predictive modeling of moonlighting DNA-binding proteins.
Varghese DM, Nussinov R, Ahmad S. Varghese DM, et al. NAR Genom Bioinform. 2022 Dec 2;4(4):lqac091. doi: 10.1093/nargab/lqac091. eCollection 2022 Dec. NAR Genom Bioinform. 2022. PMID: 36474806 Free PMC article.
Precise prediction of phase-separation key residues by machine learning.
Sun J, Qu J, Zhao C, Zhang X, Liu X, Wang J, Wei C, Liu X, Wang M, Zeng P, Tang X, Ling X, Qing L, Jiang S, Chen J, Chen TSR, Kuang Y, Gao J, Zeng X, Huang D, Yuan Y, Fan L, Yu H, Ding J. Sun J, et al. Nat Commun. 2024 Mar 26;15(1):2662. doi: 10.1038/s41467-024-46901-9. Nat Commun. 2024. PMID: 38531854 Free PMC article.
A boosting approach for prediction of protein-RNA binding residues.
Tang Y, Liu D, Wang Z, Wen T, Deng L. Tang Y, et al. BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):465. doi: 10.1186/s12859-017-1879-2. BMC Bioinformatics. 2017. PMID: 29219069 Free PMC article.
Multi-Agent Systems for Resource Allocation and Scheduling in a Smart Grid.
Binyamin SS, Ben Slama S. Binyamin SS, et al. Sensors (Basel). 2022 Oct 22;22(21):8099. doi: 10.3390/s22218099. Sensors (Basel). 2022. PMID: 36365795 Free PMC article. Review.

See all "Cited by" articles

References

1. Chen Y, Varani G. Protein families and RNA recognition. FEBS J. 2005;272: 2088–97. - PubMed
1. Gangloff S, Soustelle C, Fabre F. Homologous recombination is responsible for cell death in the absence of the Sgs1 and Srs2 helicases. Nature genetics. 2000;25: 192–4. - PubMed
1. Ahmad S, Gromiha MM, Sarai A. Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information. Bioinformatics. 2004;20: 477–86. - PubMed
1. Chen YC, Lim C. Predicting RNA-binding sites from the protein structure based on electrostatics, evolution and geometry. Nucleic Acids Res. 2008;36: e29 10.1093/nar/gkn008 - DOI - PMC - PubMed
1. Chen YC, Sargsyan K, Wright JD, Huang YS, Lim C. Identifying RNA-binding residues based on evolutionary conserved structural and energetic features. Nucleic Acids Res. 2014;42: e15 10.1093/nar/gkt1299 - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues

Affiliation

SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources