Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 Aug 5;33(14):4455-65.
doi: 10.1093/nar/gki755. Print 2005.

Predicting specificity-determining residues in two large eukaryotic transcription factor families

Affiliations
Comparative Study

Predicting specificity-determining residues in two large eukaryotic transcription factor families

Jason E Donald et al. Nucleic Acids Res. .

Abstract

Certain amino acid residues in a protein, when mutated, change the protein's function. We present an improved method of finding these specificity-determining positions that uses all the protein sequence data available for a family of homologous proteins. We study in detail two families of eukaryotic transcription factors, basic leucine zippers and nuclear receptors, because of the large amount of sequences and experimental data available. These protein families also have a clear definition of functional specificity: DNA-binding specificity. We compare our results to three other methods, including the evolutionary trace algorithm and a method that depends on orthology relationships. All of the predictions are compared to the available mutational and crystallographic data. We find that our method provides superior predictions of the known specificity-determining residues and also predicts residue positions within these families that deserve further study for their roles in functional specificity.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Plot of score for basic leucine zipper DNA base specificity-determining residues versus the number of predictions considered. The vertical lines represent the number of the last prediction with a Z-score >3.0.
Figure 2
Figure 2
Plot of score for nuclear receptor DNA base specificity-determining residues versus the number of predictions considered. The vertical lines represent the number of the last prediction with a Z-score >3.0.
Figure 3
Figure 3
Plot of giant component size and score for basic leucine zipper DNA base specificity-determining residues predicted using the top eight predictions for all possible different sequence identity cutoffs.
Figure 4
Figure 4
Plot of giant component size and score for nuclear receptor DNA base specificity-determining residues predicted using the top ten predictions for all possible different sequence identity cutoffs.
Figure 5
Figure 5
Crystal structure of GCN4 bound to DNA (pdb:1YSA) (62). The eight predicted residues for the basic leucine zipper family are shown in VDW representation. The top ranked residue is colored red and the eighth is colored blue, with the colors shifted stepwise from red to blue for residues two through seven. The protein is shown in ribbon representation (colored black) and the DNA in lines representation (black). Figure made using VMD (63).
Figure 6
Figure 6
Crystal structure of GCR bound to DNA (pdb:1R4R) (49). The ten predicted residues for the nuclear receptor family are shown in VDW representation. The top ranked residue is colored red and the tenth is colored blue, with the colors shifted stepwise from red to blue for residues two through nine. The protein is shown in ribbon representation (colored black) and the DNA in lines representation (black). Figure made using VMD (63).

References

    1. Suckow M., von Wilcken-Bergmann B., Muller-Hill B. Identification of three residues in the basic regions of the bZIP proteins GCN4, C/EBP and TAF-1 that are involved in specific DNA binding. EMBO J. 1993;12:1193–1200. - PMC - PubMed
    1. Tian W., Arakaki A.K., Skolnick J. EFICAz: a comprehensive approach for accurate genome-scale enzyme function inference. Nucleic Acids Res. 2004;32:6226–6239. - PMC - PubMed
    1. Lichtarge O., Bourne H.R., Cohen F.E. An evolutionary trace method defines binding surfaces common to protein families. J. Mol. Biol. 1996;257:342–358. - PubMed
    1. Lichtarge O., Yamamoto K.R., Cohen F.E. Identification of functional surfaces of the zinc binding domains of intracellular receptors. J. Mol. Biol. 1997;274:325–337. - PubMed
    1. Mirny L.A., Gelfand M.S. Using orthologous and paralogous proteins to identify specificity-determining residues in bacterial transcription factors. J. Mol. Biol. 2002;321:7–20. - PubMed

Publication types

MeSH terms

Substances