Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Apr;7(4):e1002029.
doi: 10.1371/journal.pcbi.1002029. Epub 2011 Apr 7.

Modification of gene duplicability during the evolution of protein interaction network

Affiliations

Modification of gene duplicability during the evolution of protein interaction network

Matteo D'Antonio et al. PLoS Comput Biol. 2011 Apr.

Abstract

Duplications of genes encoding highly connected and essential proteins are selected against in several species but not in human, where duplicated genes encode highly connected proteins. To understand when and how gene duplicability changed in evolution, we compare gene and network properties in four species (Escherichia coli, yeast, fly, and human) that are representative of the increase in evolutionary complexity, defined as progressive growth in the number of genes, cells, and cell types. We find that the origin and conservation of a gene significantly correlates with the properties of the encoded protein in the protein-protein interaction network. All four species preserve a core of singleton and central hubs that originated early in evolution, are highly conserved, and accomplish basic biological functions. Another group of hubs appeared in metazoans and duplicated in vertebrates, mostly through vertebrate-specific whole genome duplication. Such recent and duplicated hubs are frequently targets of microRNAs and show tissue-selective expression, suggesting that these are alternative mechanisms to control their dosage. Our study shows how networks modified during evolution and contributes to explaining the occurrence of somatic genetic diseases, such as cancer, in terms of network perturbations.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Four-step pipeline to assign gene origin, conservation, and duplicability.
(A) All unique genes of the four species are assigned to clusters of orthologs with different inclusiveness. (B) All 373 species present in EggNOG are associated with seven internal nodes of the tree of life. (C) Orthologs and paralogs of each gene are identified in the seven internal nodes. (D, E, F) These pieces of information are combined to identify origin, conservation, and duplicability of each gene. LUCA, last universal common ancestor.
Figure 2
Figure 2. Origin, conservation, and duplicability of genes in evolution.
(A) The percentage of genes that originated at each internal nodes of the tree of life is shown for the four species used in the analysis, and for seven additional species. The group-specific nodes correspond to primates for H. sapiens, rodents for M. musculus, birds for Gallus gallus, fishes for D. rerio, nematodes for C. elegans, insects for D. melanogaster and A. mellifera, fungi for S. cerevisiae and Schizosaccharomyces pombe, and bacteria for E. coli and Bacillus subtilis. The lack of specific genes for C. elegans, G. gallus and M. musculus is likely an artifact due to presence in EggNOG of few species for the corresponding group-specific nodes. LUCA, last universal common ancestor; euk, eukaryotes; opi, opisthokonts; met, metazoans; ver, vertebrates. (B) The percentage of genes that have the same conservation is shown for each species. Conservation is measured as the number of internal nodes where no ortholog is found since the gene appeared. In all species, conservation ranges from 0 (i.e. no missing node) to 5 (i.e. the gene originated with LUCA and has orthologs only in prokaryotes and in the group-specific cluster). Since only few genes have conservation 5, we grouped them with genes with conservation 4. (C) The percentage of singleton and duplicated genes is shown for all eleven species.
Figure 3
Figure 3. Relationship between gene origin and conservation and network properties.
Degree (connectivity) and betweenness (centrality) are compared between (A) proteins that originated at a given node and younger or older proteins; and (B) proteins with a given conservation and less or more conserved proteins. In both analyses, the differences are assessed with the Wilcoxon test and the resulting p-values are transformed into heatmaps. Each square represents genes that originated at a given internal node or with a given level of conservation. The color represents the p-value. Red is associated with more connected or more central proteins, green is associated with less connected or less central proteins. The lower bound of p-values is set equal to 10−3.
Figure 4
Figure 4. Properties of ancient and recent hubs.
(A) Degree (connectivity) and betweenness (centrality) of proteins encoded by duplicated and singleton genes of same age are compared using the Wilcoxon test and the obtained p-values are transformed into heatmaps. Each square represents genes that originated at a given internal node and the color represents the p-value. Red indicates that duplicated genes encode significantly more connected or more central proteins than singleton proteins; green indicates that proteins encoded by singleton genes are significantly more connected or more central than duplicated proteins. The lower bound of p-values is set equal to 10−3. (B) Functional differences are analyzed between (1) ancestral and recent human hubs; (2) all ancestral and all recent human genes; (3) all singletons and all duplicated human genes. For each comparison, significance is assessed with Fisher's exact test and the p-values are adjusted for the False Discovery Rate (FDR). Vertical bars correspond to individual GO terms that are further grouped into 12 functional categories. Blue bars represent the enrichment of duplicated, recent genes, or hubs, orange represents the enrichment of singletons, ancient genes, or hubs.
Figure 5
Figure 5. Dosage regulations of human hubs.
(A) The fraction of human duplicated hubs that are ohnologs, miRNA targets, and tissue-selective genes is compared to the corresponding fraction of singleton hubs. Although the main contribution is due to ohnologs (B), the enrichment still remains detectable when miRNA targets (C) and tissue-selective genes (D) are considered separately. Small-scale duplications refer to duplicated hubs that are not the result of whole-genome duplication (i.e. they are not within the dataset of ohnologs). Since the number of hubs that originated with opisthokonts and primates is only 43 and 17, we group them with hubs that originated with eukaryotes and mammals, respectively. * significant enrichment when compared to older genes (Fisher's exact test).
Figure 6
Figure 6. Dosage regulation of the atrophin genes.
Atrophins are metazoan-specific genes that underwent duplication in vertebrates. The fly ortholog Atro is highly dosage sensitive: increased and reduced expression due to modifications of miR-8 lead to neurogenerative and survival defects , . Rere, one of the two vertebrate atrophin paralogs, is target of mir200b and miR-429, the vertebrate counterparts of miR-8. Dosage modifications of Rere lead to re-localization of the other paralog, ATN1, in the nucleus, upon direct binding . Interestingly, ATN1 is the gene responsible for the dentatorubral-pallidoluysian atrophy (DRPLA) .

Similar articles

Cited by

References

    1. Papp B, Pal C, Hurst LD. Dosage sensitivity and the evolution of gene families in yeast. Nature. 2003;424:194–197. - PubMed
    1. Hughes AL, Friedman R. Gene duplication and the properties of biological networks. J Mol Evol. 2005;61:758–764. - PMC - PubMed
    1. Prachumwat A, Li WH. Protein function, connectivity, and duplicability in yeast. Mol Biol Evol. 2006;23:30–39. - PubMed
    1. Yang J, Lusk R, Li WH. Organismal complexity, protein complexity, and gene duplicability. Proc Natl Acad Sci U S A. 2003;100:15661–15665. - PMC - PubMed
    1. Makino T, Hokamp K, McLysaght A. The complex relationship of gene duplication and essentiality. Trends Genet. 2009;25:152–155. - PubMed

Publication types