Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Mar 1;9(3):777-789.
doi: 10.1093/gbe/evx041.

Cross-Species Genome-Wide Identification of Evolutionary Conserved MicroProteins

Affiliations

Cross-Species Genome-Wide Identification of Evolutionary Conserved MicroProteins

Daniel Straub et al. Genome Biol Evol. .

Abstract

MicroProteins are small single-domain proteins that act by engaging their targets into different, sometimes nonproductive protein complexes. In order to identify novel microProteins in any sequenced genome of interest, we have developed miPFinder, a program that identifies and classifies potential microProteins. In the past years, several microProteins have been discovered in plants where they are mainly involved in the regulation of development by fine-tuning transcription factor activities. The miPFinder algorithm identifies all up to date known plant microProteins and extends the microProtein concept beyond transcription factors to other protein families. Here, we reveal potential microProtein candidates in several plant and animal reference genomes. A large number of these microProteins are species-specific while others evolved early and are evolutionary highly conserved. Most known microProtein genes originated from large ancestral genes by gene duplication, mutation and subsequent degradation. Gene ontology analysis shows that putative microProtein ancestors are often located in the nucleus, and involved in DNA binding and formation of protein complexes. Additionally, microProtein candidates act in plant transcriptional regulation, signal transduction and anatomical structure development. MiPFinder is freely available to find microProteins in any genome and will aid in the identification of novel microProteins in plants and animals.

Keywords: metazoa; miPFinder; microProteins; plants; protein–protein interaction.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
Flow chart miPFinder. Mandatory steps are with a light gray background. Orange, databases; green, data packages; gray, tools; blue, lists; white, custom functions.
F<sc>ig</sc>. 2.—
Fig. 2.—
Circos plot of individual microProtein candidates. Links indicate conservation between species based on OrthoFinder. Red, in all 11 species; dark blue, exclusively in all five metazoans; light blue, only in metazoans; dark green, exclusively in all six plants; light green, only in plants.
F<sc>ig</sc>. 3.—
Fig. 3.—
MicroProtein candidates in transcription factor families in metazoans. The presence of microProtein candidates in human (upper left, red), mouse (upper right, blue), zebrafish (right, yellow), fruit fly (bottom, green) and roundworm (left, gray) in the respective transcription factor family is indicated as bold line.
F<sc>ig</sc>. 4.—
Fig. 4.—
Gene Ontology and protein class analysis of microProtein-subsets. For all sets, only the most significant ancestor of a microProtein candidate family was analyzed. (A and B) The subsets represent microProtein candidate families with the following conservation in: a: all species; b: all plants; c: all dicots; d: some dicots; e: all monocots; f: some monocots; g: some plants; h: all metazoa; i: all vertebrates; j: some vertebrates; k: nonvertebrates; l: some metazoa. (A) The GO terms are sorted in order of their descending average abundance of all subsets and color coded by their subset specific percent of genes with GO annotation. (B) Selected GO terms extracted from A as indicated by dashed lines. NF, not found. (C) Protein classes that are regulated by Arabidopsis (left) and human (right) high probability microProteins.

References

    1. Bedoya F, Sandler LL, Harton JA. 2007. Pyrin-only protein 2 modulates NF-kappaB and disrupts ASC:CLR interactions. J Immunol. 178:3837–3845. - PubMed
    1. Benezra R, Davis RL, Lockshon D, Turner DL, Weintraub H. 1990. The protein Id: a negative regulator of helix-loop-helix DNA binding proteins. Cell 61:49–59. - PubMed
    1. Brauer PM, et al. 2011. The alternative splice variant of protein tyrosine kinase 6 negatively regulates growth and enhances PTK6-mediated inhibition of beta-catenin. PLoS One 6:e14789.. - PMC - PubMed
    1. Brauer PM, Tyner AL. 2010. Building a better understanding of the intracellular tyrosine kinase PTK6 – BRK by BRK. Biochim Biophys Acta. 1806:66–73. - PMC - PubMed
    1. Camacho C, et al. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421.. - PMC - PubMed

Publication types