Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Apr;27(4):862-74.
doi: 10.1093/molbev/msp288. Epub 2009 Nov 25.

Origin and diversification of basic-helix-loop-helix proteins in plants

Affiliations

Origin and diversification of basic-helix-loop-helix proteins in plants

Nuno Pires et al. Mol Biol Evol. 2010 Apr.

Abstract

Basic helix-loop-helix (bHLH) proteins are a class of transcription factors found throughout eukaryotic organisms. Classification of the complete sets of bHLH proteins in the sequenced genomes of Arabidopsis thaliana and Oryza sativa (rice) has defined the diversity of these proteins among flowering plants. However, the evolutionary relationships of different plant bHLH groups and the diversity of bHLH proteins in more ancestral groups of plants are currently unknown. In this study, we use whole-genome sequences from nine species of land plants and algae to define the relationships between these proteins in plants. We show that few (less than 5) bHLH proteins are encoded in the genomes of chlorophytes and red algae. In contrast, many bHLH proteins (100-170) are encoded in the genomes of land plants (embryophytes). Phylogenetic analyses suggest that plant bHLH proteins are monophyletic and constitute 26 subfamilies. Twenty of these subfamilies existed in the common ancestors of extant mosses and vascular plants, whereas six further subfamilies evolved among the vascular plants. In addition to the conserved bHLH domains, most subfamilies are characterized by the presence of highly conserved short amino acid motifs. We conclude that much of the diversity of plant bHLH proteins was established in early land plants, over 440 million years ago.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.
FIG. 1.
Phylogenetic relationships of the species used in this study. The total number of bHLH proteins found in the genome of each species is indicated. The cladogram is based on the current view of plant and eukaryotic phylogeny (Baldauf 2003; Lewis and McCourt 2004; Rodríguez-Ezpeleta et al. 2005); aRobinson and Lopes (2000); bSimionato et al. (2007).
F<sc>IG</sc>. 2.
FIG. 2.
Alignment of the bHLH domain of representative plant proteins. A representative of each of the 26 subfamilies of plant bHLH is shown, together with the human protein Max, a well-characterized bHLH protein. The shaded boxes indicate the position of the DNA-binding basic region, the two α-helixes, and the variable loop region (Ferré-D'Amaré et al. 1993). The numbering of the amino acids follows (Atchley and Fitch 1997). This is a subset of the full alignment with all the proteins used in this study (supplementary fig. 1, Supplementary Material online).
F<sc>IG</sc>. 3.
FIG. 3.
Twenty subfamilies of bHLH were already established in the common ancestral of vascular plants and mosses. Maximum likelihood analysis of 544 plant bHLH, shown as an unrooted cladogram. The blue balloons delineate the 26 subfamilies of plant bHLH proteins. Colored dots symbolize the species to which the proteins in each group belong (yellow: Oryza sativa [monocot]; red: Arabidopsis thaliana [eudicot]; green: Selaginella moellendorffii [lycophyte]; blue: Physcomitrella patens [moss]; purple: Volvox carteri, Chlamydomonas reinhardtii, Chlorella vulgaris, Ostreococcus tauri, and Cyanidioschyzon merolae (chlorophytes and red algae). A full tree with protein names, proportional branch lengths, and clade support values is given in supplementary fig. S2 (Supplementary Material online).
F<sc>IG</sc>. 4.
FIG. 4.
Plant bHLH do not group with other eukaryote bHLH. A Bayesian analysis was performed on an alignment of the bHLH sequence of one representative of each of the 26 subfamilies of plant bHLH, all the chlorophyte and red algae proteins, 5 proteins found in diatom Thalassiosira pseudonana, 8 Saccharomyces cerevisiae proteins, and representatives of 5 major groups of metazoan bHLH in the sponge Amphimedon queenslandica and Homo sapiens. The tree is unrooted. The numbers in the clades are posterior probability values; clades with less than 50% support were collapsed.
F<sc>IG</sc>. 5.
FIG. 5.
Non-bHLH amino acid motifs are highly conserved in each bHLH subfamily. An idealized representation of a typical member of each bHLH subfamily is shown, with the bHLH domain and other conserved motifs drawn as shaded boxes. The diagrams are not drawn to scale. The sequences of each motif in individual proteins are given in supplementary table S2 (Supplementary Material online).

Comment in

Similar articles

Cited by

References

    1. Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21:2104–2105. - PubMed
    1. Abe H, Urao T, Ito T, Seki M, Shinozaki K, Yamaguchi-Shinozaki K. Arabidopsis AtMYC2 (bHLH) and AtMYB2 (MYB) function as transcriptional activators in abscisic acid signaling. Plant Cell. 2003;15:63–78. - PMC - PubMed
    1. Anantharaman V, Koonin EV, Aravind L. Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains. J Mol Biol. 2001;307:1271–1292. - PubMed
    1. Armbrust EV, Berges JA, Bowler C, et al. (45 co-authors) The genome of the diatom Thalassiosira Pseudonana: ecology, evolution, and metabolism. Science. 2004;306:79–86. - PubMed
    1. Atchley WR, Fitch WM. A natural classification of the basic helix-loop-helix class of transcription factors. Proc Natl Acad Sci USA. 1997;94:5172–5176. - PMC - PubMed

Publication types

MeSH terms