Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Dec 3;10 Suppl 3(Suppl 3):S23.
doi: 10.1186/1471-2164-10-S3-S23.

DNA-binding residues and binding mode prediction with binding-mechanism concerned models

Affiliations

DNA-binding residues and binding mode prediction with binding-mechanism concerned models

Yu-Feng Huang et al. BMC Genomics. .

Abstract

Background: Protein-DNA interactions are essential for fundamental biological activities including DNA transcription, replication, packaging, repair and rearrangement. Proteins interacting with DNA can be classified into two categories of binding mechanisms - sequence-specific and non-specific binding. Protein-DNA specific binding provides a mechanism to recognize correct nucleotide base pairs for sequence-specific identification. Protein-DNA non-specific binding shows sequence independent interaction for accelerated targeting by interacting with DNA backbone. Both sequence-specific and non-specific binding residues contribute to their roles for interaction.

Results: The proposed framework has two stage predictors: DNA-binding residues prediction and binding mode prediction. In the first stage - DNA-binding residues prediction, the predictor for DNA specific binding residues achieves 96.45% accuracy with 50.14% sensitivity, 99.31% specificity, 81.70% precision, and 62.15% F-measure. The predictor for DNA non-specific binding residues achieves 89.14% accuracy with 53.06% sensitivity, 95.25% specificity, 65.47% precision, and 58.62% F-measure. While combining prediction results of sequence-specific and non-specific binding residues with OR operation, the predictor achieves 89.26% accuracy with 56.86% sensitivity, 95.63% specificity, 71.92% precision, and 63.51% F-measure. In the second stage, protein-DNA binding mode prediction achieves 75.83% accuracy while using support vector machine with multi-class prediction.

Conclusion: This article presents the design of a sequence based predictor aiming to identify sequence-specific and non-specific binding residues in a transcription factor with DNA binding-mechanism concerned. The protein-DNA binding mode prediction was introduced to help improve DNA-binding residues prediction. In addition, the results of this study will help with the design of binding-mechanism concerned predictors for other families of proteins interacting with DNA.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sequence-specific and non-specific binding residues of PDB 2PRT:A. Residues colored by red are sequence-specific binding residues. Residues colored by blue are non-specific binding residues. Residues colored by purple are both sequence-specific and non-specific binding residues.
Figure 2
Figure 2
A difficult case (PDB ID 1LMB:4) of binding residue prediction, which can be enhanced with the best aligned template of correct predicted protein-DNA binding mode. Residues colored by red means false positive. Residues colored by blue means false negative. Residues colored by green means true positive. (a) Prediction of sequence-specific binding residues. (b) Prediction of non-specific binding residues. (c) Combination of sequence-specific and non-specific binding residues prediction. (d) Enhanced prediction with the best aligned template of correct protein-DNA binding mode prediction.
Figure 3
Figure 3
Overall framework for DNA-binding residues prediction.

Similar articles

Cited by

References

    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. 2000;28(1):235–242. doi: 10.1093/nar/28.1.235. - DOI - PMC - PubMed
    1. Berman HM, Olson WK, Beveridge DL, Westbrook J, Gelbin A, Demeny T, Hsieh SH, Srinivasan AR, Schneider B. The nucleic acid database. A comprehensive relational database of three-dimensional structures of nucleic acids. Biophys J. 1992;63(3):751–759. doi: 10.1016/S0006-3495(92)81649-1. - DOI - PMC - PubMed
    1. Reddy CK, Das A, Jayaram B. Do water molecules mediate protein-DNA recognition? J Mol Biol. 2001;314(3):619–632. doi: 10.1006/jmbi.2001.5154. - DOI - PubMed
    1. Luscombe NM, Thornton JM. Protein-DNA interactions: amino acid conservation and the effects of mutations on binding specificity. J Mol Biol. 2002;320(5):991–1009. doi: 10.1016/S0022-2836(02)00571-5. - DOI - PubMed
    1. Luscombe NM, Laskowski RA, Thornton JM. Amino acid-base interactions: a three-dimensional analysis of protein-DNA interactions at an atomic level. Nucleic Acids Res. 2001;29(13):2860–2874. doi: 10.1093/nar/29.13.2860. - DOI - PMC - PubMed

Publication types

LinkOut - more resources