Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 May 3;102(18):6401-6.
doi: 10.1073/pnas.0408964102. Epub 2005 Apr 25.

Sequence signatures and the probabilistic identification of proteins in the Myc-Max-Mad network

Affiliations
Comparative Study

Sequence signatures and the probabilistic identification of proteins in the Myc-Max-Mad network

William R Atchley et al. Proc Natl Acad Sci U S A. .

Abstract

Accurate identification of specific groups of proteins by their amino acid sequence is an important goal in genome research. Here we combine information theory with fuzzy logic search procedures to identify sequence signatures or predictive motifs for members of the Myc-Max-Mad transcription factor network. Myc is a well known oncoprotein, and this family is involved in cell proliferation, apoptosis, and differentiation. We describe a small set of amino acid sites from the N-terminal portion of the basic helix-loop-helix (bHLH) domain that provide very accurate sequence signatures for the Myc-Max-Mad transcription factor network and three of its member proteins. A predictive motif involving 28 contiguous bHLH sequence elements found 337 network proteins in the GenBank NR database with no mismatches or misidentifications. This motif also identifies at least one previously unknown fungal protein with strong affinity to the Myc-Max-Mad network. Another motif found 96% of known Myc protein sequences with only a single mismatch, including sequences from genomes previously not thought to contain Myc proteins. The predictive motif for Myc is very similar to the ancestral sequence for the Myc group estimated from phylogenetic analyses. Based on available crystal structure studies, this motif is discussed in terms of its functional consequences. Our results provide insight into evolutionary diversification of DNA binding and dimerization in a well characterized family of regulatory proteins and provide a method of identifying signature motifs in protein families.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Entropy (sequence variability) profile of the bHLH domain from the Myc-Mad-Mad network.
Fig. 2.
Fig. 2.
Alignment of Myc, Max, and Mad together with four putative fungal Myc sequences, visualized by genedoc (www.psc.edu/biomed/genedoc), with conserved sites indicated by differential shading. Two gaps separate the various labeled functional domains. Heptad leucine repeats in the zipper are labeled with a bold plus (+) sign.
Fig. 3.
Fig. 3.
A comparison of the AF method, meme, and hmmer (discussed in the text).

References

    1. Atchley, W. R., Terhalle, W. & Dress, A. (1999) J. Mol. Evol. 48, 501-516. - PubMed
    1. Grandori, C., Cowley, S. M., James, L. P. & Eisenman, R. N. (2000) Annu. Rev. Cell Dev. Biol. 16, 653-699. - PubMed
    1. Levens, D. L. (2003) Genes Dev. 17, 1071-1077. - PubMed
    1. Luscher, B. (2001) Gene 277, 1-14. - PubMed
    1. Nasi, S., Ciarapica, R., Jucker, R., Rosati, J. & Soucek, L. (2001) FEBS Lett. 490, 153-162. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources