Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index
- PMID: 17254311
- PMCID: PMC1764483
- DOI: 10.1186/1471-2105-7-S5-S6
Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index
Abstract
Background: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure--by using sequence information only--is an essential step in many types of protein analyses. In this present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence.
Results: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domains proteins.
Conclusion: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing support vector machine with benchmark_2 dataset.
Figures





Similar articles
-
DomNet: protein domain boundary prediction using enhanced general regression network and new profiles.IEEE Trans Nanobioscience. 2008 Jun;7(2):172-81. doi: 10.1109/TNB.2008.2000747. IEEE Trans Nanobioscience. 2008. PMID: 18556265
-
Armadillo: domain boundary prediction by amino acid composition.J Mol Biol. 2005 Jul 29;350(5):1061-73. doi: 10.1016/j.jmb.2005.05.037. J Mol Biol. 2005. PMID: 15978619
-
Improved general regression network for protein domain boundary prediction.BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2105-9-S1-S12. BMC Bioinformatics. 2008. PMID: 18315843 Free PMC article.
-
Inferring boundary information of discontinuous-domain proteins.IEEE Trans Nanobioscience. 2008 Sep;7(3):200-5. doi: 10.1109/TNB.2008.2002283. IEEE Trans Nanobioscience. 2008. PMID: 18779100
-
Prediction of number and position of domain boundaries in multi-domain proteins by use of amino acid sequence alone.Curr Protein Pept Sci. 2007 Apr;8(2):189-95. doi: 10.2174/138920307780363460. Curr Protein Pept Sci. 2007. PMID: 17430200 Review.
Cited by
-
DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.PLoS One. 2013 Apr 11;8(4):e60559. doi: 10.1371/journal.pone.0060559. Print 2013. PLoS One. 2013. PMID: 23593247 Free PMC article.
-
DomSVR: domain boundary prediction with support vector regression from sequence information alone.Amino Acids. 2010 Aug;39(3):713-26. doi: 10.1007/s00726-010-0506-6. Epub 2010 Feb 18. Amino Acids. 2010. PMID: 20165918 Free PMC article.
-
Hierarchical kernel mixture models for the prediction of AIDS disease progression using HIV structural gp120 profiles.BMC Genomics. 2010 Dec 2;11 Suppl 4(Suppl 4):S22. doi: 10.1186/1471-2164-11-S4-S22. BMC Genomics. 2010. PMID: 21143806 Free PMC article.
-
Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.BMC Bioinformatics. 2022 Jul 19;23(1):283. doi: 10.1186/s12859-022-04829-1. BMC Bioinformatics. 2022. PMID: 35854211 Free PMC article.
-
OPUS-Dom: applying the folding-based method VECFOLD to determine protein domain boundaries.J Mol Biol. 2009 Jan 30;385(4):1314-29. doi: 10.1016/j.jmb.2008.10.093. Epub 2008 Nov 10. J Mol Biol. 2009. PMID: 19026662 Free PMC article.
References
-
- Veretnik S, Shindyalov IN. In: Computational Methods for Domain Partitioning in Protein Structures" in Computational Methods for Protein Structure and Modeling. Xu Y, Xu D, Liang J, editor. Springer-Verlag; 2006. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources