Inferring boundary information of discontinuous-domain proteins
- PMID: 18779100
- DOI: 10.1109/TNB.2008.2002283
Inferring boundary information of discontinuous-domain proteins
Abstract
Wetlaufer introduced the classification of domains into continuous and discontinuous. Continuous domains form from a single-chain segment and discontinuous domains are composed of two or more chain segments. Richardson identified approximately 100 domains in her review. Her assignment was based on the concepts that the domain would be independently stable and/or could undergo rigid-body-like movements with respect to the entire protein. There are now several instances where structurally similar domains occur in different proteins in the absence of noticeable sequence similarity. Possibly, the most notable of such domains is the trios-phosphate isomerase (TIM) barrel. With the increase in the number of known sequences, computer algorithms are required to identify the discontinuous domain of an unknown protein chain in order to determine its structure and function. We have developed a novel algorithm for discontinuous-domain boundary prediction based on a machine learning algorithm and interresidue contact interactions values. We have used 415 proteins, including 100 discontinuous-domain chains for training. There is no method available that is designed solely on a sequence based for the prediction of discontinuous domain. DomainDiscovery performed significantly well compared to the structure-based methods like structural classification of proteins (SCOP), class, architecture, topology and homologous superfamily (CATH), and DOMain MAKer (DOMAK).
Similar articles
-
Identification of putative domain linkers by a neural network - application to a large sequence database.BMC Bioinformatics. 2006 Jun 27;7:323. doi: 10.1186/1471-2105-7-323. BMC Bioinformatics. 2006. PMID: 16800897 Free PMC article.
-
Modelling interaction sites in protein domains with interaction profile hidden Markov models.Bioinformatics. 2006 Dec 1;22(23):2851-7. doi: 10.1093/bioinformatics/btl486. Epub 2006 Sep 25. Bioinformatics. 2006. PMID: 17000753
-
Mining sequential patterns for protein fold recognition.J Biomed Inform. 2008 Feb;41(1):165-79. doi: 10.1016/j.jbi.2007.05.004. Epub 2007 May 17. J Biomed Inform. 2008. PMID: 17573243
-
Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity.Proteomics. 2006 Jul;6(14):4023-37. doi: 10.1002/pmic.200500938. Proteomics. 2006. PMID: 16791826 Review.
-
Structure-based function prediction: approaches and applications.Brief Funct Genomic Proteomic. 2008 Jul;7(4):291-302. doi: 10.1093/bfgp/eln030. Epub 2008 Jul 3. Brief Funct Genomic Proteomic. 2008. PMID: 18599513 Review.
Cited by
-
Extending Protein Domain Boundary Predictors to Detect Discontinuous Domains.PLoS One. 2015 Oct 26;10(10):e0141541. doi: 10.1371/journal.pone.0141541. eCollection 2015. PLoS One. 2015. PMID: 26502173 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources