. 2010 Mar 30:11:160.

doi: 10.1186/1471-2105-11-160.

Identification of NAD interacting residues in proteins

Hifzur R Ansari¹, Gajendra P S Raghava

Affiliations

PMID: 20353553
PMCID: PMC2853471
DOI: 10.1186/1471-2105-11-160

Identification of NAD interacting residues in proteins

Hifzur R Ansari et al. BMC Bioinformatics. 2010.

. 2010 Mar 30:11:160.

doi: 10.1186/1471-2105-11-160.

Authors

Hifzur R Ansari¹, Gajendra P S Raghava

Affiliation

¹ Institute of Microbial Technology, Sector 39A, Chandigarh, 160036, India.

PMID: 20353553
PMCID: PMC2853471
DOI: 10.1186/1471-2105-11-160

Abstract

Background: Small molecular cofactors or ligands play a crucial role in the proper functioning of cells. Accurate annotation of their target proteins and binding sites is required for the complete understanding of reaction mechanisms. Nicotinamide adenine dinucleotide (NAD+ or NAD) is one of the most commonly used organic cofactors in living cells, which plays a critical role in cellular metabolism, storage and regulatory processes. In the past, several NAD binding proteins (NADBP) have been reported in the literature, which are responsible for a wide-range of activities in the cell. Attempts have been made to derive a rule for the binding of NAD+ to its target proteins. However, so far an efficient model could not be derived due to the time consuming process of structure determination, and limitations of similarity based approaches. Thus a sequence and non-similarity based method is needed to characterize the NAD binding sites to help in the annotation. In this study attempts have been made to predict NAD binding proteins and their interacting residues (NIRs) from amino acid sequence using bioinformatics tools.

Results: We extracted 1556 proteins chains from 555 NAD binding proteins whose structure is available in Protein Data Bank. Then we removed all redundant protein chains and finally obtained 195 non-redundant NAD binding protein chains, where no two chains have more than 40% sequence identity. In this study all models were developed and evaluated using five-fold cross validation technique on the above dataset of 195 NAD binding proteins. While certain type of residues are preferred (e.g. Gly, Tyr, Thr, His) in NAD interaction, residues like Ala, Glu, Leu, Lys are not preferred. A support vector machine (SVM) based method has been developed using various window lengths of amino acid sequence for predicting NAD interacting residues and obtained maximum Matthew's correlation coefficient (MCC) 0.47 with accuracy 74.13% at window length 17. We also developed a SVM based method using evolutionary information in the form of position specific scoring matrix (PSSM) and obtained maximum MCC 0.75 with accuracy 87.25%.

Conclusion: For the first time a sequence-based method has been developed for the prediction of NAD binding proteins and their interacting residues, in the absence of any prior structural information. The present model will aid in the understanding of NAD+ dependent mechanisms of action in the cell. To provide service to the scientific community, we have developed a user-friendly web server, which is available from URL http://www.imtech.res.in/raghava/nadbinder/.

PubMed Disclaimer

Figures

**Figure 1**
**Percentage composition of NAD interacting and non-interacting residues**.

**Figure 2**
**Structure of human Aldose reductase (**2ACS) **showing prediction of NAD interacting residues by NADbinder**. NAD shown in magenta, True positives in red and False positives in blue colour (only the portion of protein with residue mentioned is shown here).

**Figure 3**
**ROC Plot for SVM models developed using single sequence (binary pattern) for window size from 3 to 21**. (W indicates the window length and value in bracket shows Area under curve).

**Figure 4**
**ROC Plot for PSSM based SVM models developed using window size from 3 to 21**. (W indicates the window length and value in bracket shows Area under curve).

See this image and copyright information in PMC

References

1. Reeves GA, Talavera D, Thornton JM. Genome and proteome annotation: organization, interpretation and integration. J R Soc Interface. 2009;6(31):129–147. doi: 10.1098/rsif.2008.0341. - DOI - PMC - PubMed
1. Porter CT, Bartlett GJ, Thornton JM. The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data. Nucleic Acids Res. 2004. pp. D129–133. - DOI - PMC - PubMed
1. Holliday GL, Almonacid DE, Bartlett GJ, O'Boyle NM, Torrance JW, Murray-Rust P, Mitchell JB, Thornton JM. MACiE (Mechanism, Annotation and Classification in Enzymes): novel tools for searching catalytic mechanisms. Nucleic Acids Res. 2007. pp. D515–520. - DOI - PMC - PubMed
1. Bashton M, Nobeli I, Thornton JM. PROCOGNATE: a cognate ligand domain mapping for enzymes. Nucleic Acids Res. 2008. pp. D618–622. - PMC - PubMed
1. Talavera D, Laskowski RA, Thornton JM. WSsas: a web service for the annotation of functional residues through structural homologues. Bioinformatics. 2009;25(9):1192–1194. doi: 10.1093/bioinformatics/btp116. - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
Molecular Biology Databases
- NIAID Data Ecosystem - Find datasets on Infectious and Immune-mediated Diseases
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Identification of NAD interacting residues in proteins

Affiliation

Identification of NAD interacting residues in proteins

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Molecular Biology Databases

Miscellaneous