Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 May;16(5):947-55.
doi: 10.1110/ps.062597307.

DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile

Affiliations

DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile

Hongyi Zhou et al. Protein Sci. 2007 May.

Abstract

Dividing protein structures into domains is proven useful for more accurate structural and functional characterization of proteins. Here, we develop a method, called DDOMAIN, that divides structure into DOMAINs using a normalized contact-based domain-domain interaction profile. Results of DDOMAIN are compared to AUTHORS annotations (domain definitions are given by the authors who solved protein structures), as well as to popular SCOP and CATH annotations by human experts and automatic programs. DDOMAIN's automatic annotations are most consistent with the AUTHORS annotations (90% agreement in number of domains and 88% agreement in both number of domains and at least 85% overlap in domain assignment of residues) if its three adjustable parameters are trained by the AUTHORS annotations. By comparison, the agreement is 83% (81% with at least 85% overlap criterion) between SCOP-trained DDOMAIN and SCOP annotations and 77% (73%) between CATH-trained DDOMAIN and CATH annotations. The agreement between DDOMAIN and AUTHORS annotations goes beyond single-domain proteins (97%, 82%, and 56% for single-, two-, and three-domain proteins, respectively). For an "easy" data set of proteins whose CATH and SCOP annotations agree with each other in number of domains, the agreement is 90% (89%) between "easy-set"-trained DDOMAIN and CATH/SCOP annotations. The consistency between SCOP-trained DDOMAIN and SCOP annotations is superior to two other recently developed, SCOP-trained, automatic methods PDP (protein domain parser), and DomainParser 2. We also tested a simple consensus method made of PDP, DomainParser 2, and DDOMAIN and a different version of DDOMAIN based on a more sophisticated statistical energy function. The DDOMAIN server and its executable are available in the services section on http://sparks.informatics.iupui.edu.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The contact-based interaction profile of chain A of protein 1gg3 is shown along with the domains identified by DDOMAIN (solid line), SCOP (dashed line), and CATH (dash dotted line). The residue index here is different from the actual residue number in the PDB structure file because only residues with coordinates are numbered.
Figure 2.
Figure 2.
The success rates for the SCOP set as a function of one of the three parameters (Elowcutoff, Eexcesscutoff, and Lcut) in DDOMAIN with two other parameters fixed. For Eexcesscutoff (dashed curve in the top figure), Elowcutoff and Lcut are fixed at 0.53 and 34, respectively. For Elowcutoff (solid curve in the top figure), Eexcesscutoff and Lcut are fixed at 0.36 and 34, respectively. For Lcut (solid curve at the bottom figure), Elowcutoff and Eexcesscutoff are fixed at 0.53 and 0.36, respectively. This figure explores the variation of the parameters around the optimized values (0.53, 0.36, and 34 for Elowcutoff, Eexcesscutoff, and Lcut, respectively).

References

    1. Alexandrov N. and Shindyalov, I. 2003. PDP: Protein domain parser. Bioinformatics 19: 429–430. - PubMed
    1. Bateman A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L.L., et al. 2004. The pfam protein families database. Nucleic Acids Res. 32: D138–D141. - PMC - PubMed
    1. Chivian D., Kim, D.E., Malmstrom, L., Bradley, P., Robertson, T., Murphy, P., Strauss, C.E., Bonneau, R., Rohl, C.A., and Baker, D. 2003. Automated prediction of CASP-5 structures using the Robetta server. Proteins 53: 524–533. - PubMed
    1. Crippen G.M. 1978. The tree structural organization of proteins. J. Mol. Biol. 126: 315–332. - PubMed
    1. Guo J.T., Xu, D., Kim, D., and Xu, Y. 2003. Improving the performance of DomainParser for structural domain partition using neural network. Nucleic Acids Res. 31: 944–952. - PMC - PubMed

Publication types

LinkOut - more resources