Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jul 1;34(Web Server issue):W243-8.
doi: 10.1093/nar/gkl298.

BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences

Affiliations

BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences

Liangjiang Wang et al. Nucleic Acids Res. .

Abstract

BindN (http://bioinformatics.ksu.edu/bindn/) takes an amino acid sequence as input and predicts potential DNA or RNA-binding residues with support vector machines (SVMs). Protein datasets with known DNA or RNA-binding residues were selected from the Protein Data Bank (PDB), and SVM models were constructed using data instances encoded with three sequence features, including the side chain pK(a) value, hydrophobicity index and molecular mass of an amino acid. The results suggest that DNA-binding residues can be predicted at 69.40% sensitivity and 70.47% specificity, while prediction of RNA-binding residues achieves 66.28% sensitivity and 69.84% specificity. When compared with previous studies, the SVM models appear to be more accurate and more efficient for online predictions. BindN provides a useful tool for understanding the function of DNA and RNA-binding proteins based on primary sequence data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
ROC curves for prediction of DNA and RNA-binding residues with SVMs.
Figure 2
Figure 2
Representative prediction results shown in the context of three-dimensional structures. In each complex, the correctly predicted binding residues (true positives) are in red and spacefill; the correctly predicted non-binding residues (true negatives) are in green and wireframe; the binding residues but predicted as negatives (false negatives) are in blue and spacefill; the non-binding residues but predicted as positives (false positives) are in yellow and spacefill; the nucleic acid molecule is shown in purple. (a) Putative DNA-binding residues predicted for the mouse ETS-1 transcription factor. The structure (PDB ID: 1K79) includes residues 331–440 of the ETS-1 protein. Chain D of 1K79 was used as the input sequence to BindN with the expected specificity set to 90%. (b) Putative RNA-binding residues predicted for the box C/D RNA-binding domain of the archaeal protein L7Ae. Chain B of the structure (PDB ID: 1RLG) was used for BindN prediction with the expected specificity set to 90%.
Figure 3
Figure 3
Sample output from the BindN server. Putative DNA-binding residues were predicted for the Arabidopsis transcription factor WRKY1 (residues 301–380). Most of the positive predictions are located in the putative WRKY DNA-binding domain, for which structural data are still not available.

References

    1. Ptashne M. Regulation of transcription: from lambda to eukaryotes. Trends Biochem. Sci. 2005;30:275–279. - PubMed
    1. Noller H.F. RNA structure: reading the ribosome. Science. 2005;309:1508–1514. - PubMed
    1. Hertel K.J., Graveley B.R. RS domains contact the pre-mRNA throughout spliceosome assembly. Trends Biochem. Sci. 2005;30:115–118. - PubMed
    1. Jones S., Daley D.T.A., Luscombe N.M., Berman H.M., Thornton J.M. Protein–RNA interactions: a structural analysis. Nucleic Acids Res. 2001;29:943–954. - PMC - PubMed
    1. Luscombe N.M., Laskowski R.A., Thornton J.M. Amino acid-base interactions: a three-dimensional analysis of protein–DNA interactions at an atomic level. Nucleic Acids Res. 2001;29:2860–2874. - PMC - PubMed

Publication types