Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug;84(8):1147-61.
doi: 10.1002/prot.25061. Epub 2016 Jun 15.

Statistical analysis of structural determinants for protein-DNA-binding specificity

Affiliations

Statistical analysis of structural determinants for protein-DNA-binding specificity

Rosario I Corona et al. Proteins. 2016 Aug.

Abstract

DNA-binding proteins play critical roles in biological processes including gene expression, DNA packaging and DNA repair. They bind to DNA target sequences with different degrees of binding specificity, ranging from highly specific (HS) to nonspecific (NS). Alterations of DNA-binding specificity, due to either genetic variation or somatic mutations, can lead to various diseases. In this study, a comparative analysis of protein-DNA complex structures was carried out to investigate the structural features that contribute to binding specificity. Protein-DNA complexes were grouped into three general classes based on degrees of binding specificity: HS, multispecific (MS), and NS. Our results show a clear trend of structural features among the three classes, including amino acid binding propensities, simple and complex hydrogen bonds, major/minor groove and base contacts, and DNA shape. We found that aspartate is enriched in HS DNA binding proteins and predominately binds to a cytosine through a single hydrogen bond or two consecutive cytosines through bidentate hydrogen bonds. Aromatic residues, histidine and tyrosine, are highly enriched in the HS and MS groups and may contribute to specific binding through different mechanisms. To further investigate the role of protein flexibility in specific protein-DNA recognition, we analyzed the conformational changes between the bound and unbound states of DNA-binding proteins and structural variations. The results indicate that HS and MS DNA-binding domains have larger conformational changes upon DNA-binding and larger degree of flexibility in both bound and unbound states. Proteins 2016; 84:1147-1161. © 2016 Wiley Periodicals, Inc.

Keywords: DNA shape; aromatic residue; aspartate; conformational changes; flexibility; hydrogen bond; structural variations; transcription factor; π-interaction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart for compiling the non-redundant datasets of DNA-binding domains.
Figure 2
Figure 2
Residue-base contacts in protein-DNA complexes. (A) Amino acid propensities for DNA base interaction in HS (red), MS (green) and NS (blue) groups; (B) Percentage of major (red) and minor groove (cyan) contacts.
Figure 3
Figure 3
Aspartate forms one bidentate hydrogen bond with two consecutive cytosine bases and one single hydrogen bond with a distant cytosine, via the major groove, in endonuclease NgoMIV (PDB ID: 4abt).
Figure 4
Figure 4
Comparison of protein-DNA interactions. (A) Protein-DNA contact area (PDCA); (B) number of residue-base contacts (NRBC); and (C) NRBC density, NRBC normalized to the total contact area (PDCA). ***p<0.001; **p<0.01; *p<0.05.
Figure 5
Figure 5
Comparison of DNA base/backbone and major/minor groove contacts. (A) Percentage of DNA backbone-only and DNA base contacts; (B) percentage of major and minor groove contacts; (C) number of major groove contacts; and (D) number of minor groove contacts. ***p<0.001; **p<0.01; *p<0.05.
Figure 6
Figure 6
Hydrogen bonds between proteins and DNA. (A) Number of hydrogen bonds between protein side-chains and DNA (PDHB); (B) number of hydrogen bonds between protein side-chains and DNA bases (PBHB); and (C) number of residues that form bidentate hydrogen bonds. ***p<0.001; **p<0.01; *p<0.05.
Figure 7
Figure 7
Comparison of DNA shape features. Median (A) propeller, (B) opening, (C) rise, and (D) roll per structure. The shape features are calculated using 3DNA . ***p<0.001; ** p<0.01; * p<0.05.
Figure 8
Figure 8
Conformational changes upon DNA-binding. Cα RMSD between the bound and unbound structures in the pairNR30 dataset using (A) all residues and (B) binding residues only. Median Δχ1 (C) and MAD Δχ1 (D) per domain. *** p<0.001; ** p<0.01; * p<0.05.
Figure 9
Figure 9
Comparison of structural variations in the multiHolo dataset for median RMSD (A) and MAD RMSD (B); in the multiApo dataset for median RMSD (C) and MAD RMSD (D); in the multiApoHolo dataset for median RMSD (E) and MAD RMSD (F). ***p<0.001; ** p<0.01; * p<0.05.

Similar articles

Cited by

References

    1. Schott JJ, Benson DW, Basson CT, Pease W, Silberbach GM, Moak JP, Maron BJ, Seidman CE, Seidman JG. Congenital heart disease caused by mutations in the transcription factor NKX2-5. Science. 1998;281(5373):108–111. - PubMed
    1. Filippova GN, Qi CF, Ulmer JE, Moore JM, Ward MD, Hu YJ, Loukinov DI, Pugacheva EM, Klenova EM, Grundy PE, Feinberg AP, Cleton-Jansen AM, Moerland EW, Cornelisse CJ, Suzuki H, Komiya A, Lindblom A, Dorion-Bonnet F, Neiman PE, Morse HC, 3rd, Collins SJ, Lobanenkov VV. Tumor-associated zinc finger mutations in the CTCF transcription factor selectively alter tts DNA-binding specificity. Cancer research. 2002;62(1):48–52. - PubMed
    1. Luscombe NM, Thornton JM. Protein-DNA interactions: amino acid conservation and the effects of mutations on binding specificity. Journal of molecular biology. 2002;320(5):991–1009. - PubMed
    1. Latchman DS. Transcription-factor mutations and disease. The New England journal of medicine. 1996;334(1):28–33. - PubMed
    1. Gohler T, Jager S, Warnecke G, Yasuda H, Kim E, Deppert W. Mutant p53 proteins bind DNA in a DNA structure-selective mode. Nucleic acids research. 2005;33(3):1087–1100. - PMC - PubMed

Publication types

LinkOut - more resources