DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning
- PMID: 34905768
- DOI: 10.1093/bib/bbab521
DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning
Abstract
Proteins with intrinsically disordered regions (IDRs) are common among eukaryotes. Many IDRs interact with nucleic acids and proteins. Annotation of these interactions is supported by computational predictors, but to date, only one tool that predicts interactions with nucleic acids was released, and recent assessments demonstrate that current predictors offer modest levels of accuracy. We have developed DeepDISOBind, an innovative deep multi-task architecture that accurately predicts deoxyribonucleic acid (DNA)-, ribonucleic acid (RNA)- and protein-binding IDRs from protein sequences. DeepDISOBind relies on an information-rich sequence profile that is processed by an innovative multi-task deep neural network, where subsequent layers are gradually specialized to predict interactions with specific partner types. The common input layer links to a layer that differentiates protein- and nucleic acid-binding, which further links to layers that discriminate between DNA and RNA interactions. Empirical tests show that this multi-task design provides statistically significant gains in predictive quality across the three partner types when compared to a single-task design and a representative selection of the existing methods that cover both disorder- and structure-trained tools. Analysis of the predictions on the human proteome reveals that DeepDISOBind predictions can be encoded into protein-level propensities that accurately predict DNA- and RNA-binding proteins and protein hubs. DeepDISOBind is available at https://www.csuligroup.com/DeepDISOBind/.
Keywords: deep learning; intrinsic disorder; protein–nucleic acids interactions; protein–protein interactions.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
-
Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins.Biomolecules. 2020 Dec 4;10(12):1636. doi: 10.3390/biom10121636. Biomolecules. 2020. PMID: 33291838 Free PMC article. Review.
-
High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder.Nucleic Acids Res. 2015 Oct 15;43(18):e121. doi: 10.1093/nar/gkv585. Epub 2015 Jun 24. Nucleic Acids Res. 2015. PMID: 26109352 Free PMC article.
-
Prediction of Disordered RNA, DNA, and Protein Binding Regions Using DisoRDPbind.Methods Mol Biol. 2017;1484:187-203. doi: 10.1007/978-1-4939-6406-2_14. Methods Mol Biol. 2017. PMID: 27787828
-
Computational insights into intrinsically disordered regions in protein-nucleic acid complexes.Int J Biol Macromol. 2024 Oct;277(Pt 1):134021. doi: 10.1016/j.ijbiomac.2024.134021. Epub 2024 Jul 19. Int J Biol Macromol. 2024. PMID: 39032884
-
Computational prediction of functions of intrinsically disordered regions.Prog Mol Biol Transl Sci. 2019;166:341-369. doi: 10.1016/bs.pmbts.2019.04.006. Epub 2019 May 20. Prog Mol Biol Transl Sci. 2019. PMID: 31521235 Review.
Cited by
-
Computational Prediction of Linear Interacting Peptides.Methods Mol Biol. 2025;2867:233-245. doi: 10.1007/978-1-0716-4196-5_14. Methods Mol Biol. 2025. PMID: 39576585
-
A deep learning method for predicting interactions for intrinsically disordered regions of proteins.bioRxiv [Preprint]. 2025 Jan 22:2024.12.19.629373. doi: 10.1101/2024.12.19.629373. bioRxiv. 2025. PMID: 39763873 Free PMC article. Preprint.
-
pyRBDome: a comprehensive computational platform for enhancing RNA-binding proteome data.Life Sci Alliance. 2024 Jul 30;7(10):e202402787. doi: 10.26508/lsa.202402787. Print 2024 Oct. Life Sci Alliance. 2024. PMID: 39079742 Free PMC article.
-
Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences.Brief Bioinform. 2024 Nov 22;26(1):bbaf016. doi: 10.1093/bib/bbaf016. Brief Bioinform. 2024. PMID: 39833102 Free PMC article. Review.
-
Learning Biophysical Dynamics with Protein Language Models.bioRxiv [Preprint]. 2025 Jul 15:2024.10.11.617911. doi: 10.1101/2024.10.11.617911. bioRxiv. 2025. PMID: 39464109 Free PMC article. Preprint.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources