Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2007;8(10):R209.
doi: 10.1186/gb-2007-8-10-r209.

All duplicates are not equal: the difference between small-scale and genome duplication

Affiliations
Comparative Study

All duplicates are not equal: the difference between small-scale and genome duplication

Luke Hakes et al. Genome Biol. 2007.

Abstract

Background: Genes in populations are in constant flux, being gained through duplication and occasionally retained or, more frequently, lost from the genome. In this study we compare pairs of identifiable gene duplicates generated by small-scale (predominantly single-gene) duplications with those created by a large-scale gene duplication event (whole-genome duplication) in the yeast Saccharomyces cerevisiae.

Results: We find a number of quantifiable differences between these data sets. Whole-genome duplicates tend to exhibit less profound phenotypic effects when deleted, are functionally less divergent, and are associated with a different set of functions than their small-scale duplicate counterparts. At first sight, either of these latter two features could provide a plausible mechanism by which the difference in dispensability might arise. However, we uncover no evidence suggesting that this is the case. We find that the difference in dispensability observed between the two duplicate types is limited to gene products found within protein complexes, and probably results from differences in the relative strength of the evolutionary pressures present following each type of duplication event.

Conclusion: Genes, and the proteins they specify, originating from small-scale and whole-genome duplication events differ in quantifiable ways. We infer that this is not due to their association with different functional categories; rather, it is a direct result of biases in gene retention.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of the shared interaction ratio for duplicate gene products and random protein pairs. Whole-genome duplicates (WGDs) are illustrated in blue and small-scale duplicates (SSDs) are illustrated in red. Mean shared interaction ratio r is plotted against gene sequence divergence measured by non-synonymous substitution rate (Ka). The dashed lines indicate the average shared interaction ratio for WGDs (blue), SSDs (red), and pairs of proteins selected at random from the genome (black). Error bars show standard errors on the mean of r for each bin.
Figure 2
Figure 2
Relationship between semantic distance and the proportion of pairs within each duplicate set. Whole-genome duplicates (WGDs) are illustrated in blue, small-scale duplicates (SSDs) in red, and random gene pairings in gray. A higher semantic distance indicates greater functional divergence.
Figure 3
Figure 3
Visualization of the two sets of duplicates on a semantic distance network. (a) The yeast proteome is distributed spatially according to semantic distance, with six high-level functional classes highlighted in different colors that are either over-represented or under-represented in the whole-genome duplicate (WGD) or small-scale duplicate (SSD) sets (see Table 1). (b) WGDs are shown in blue and SSDs in red; the same six functional classes are highlighted. The products of the two types of duplicate gene have a tendency to occupy separate areas of semantic space, indicating involvement in different functions.
Figure 4
Figure 4
Relationship between semantic distance, duplicate set and complex membership. The proportion of duplicate pairs having a certain level of functional divergence as measured by semantic distance for the following: pairs of complex-forming whole-genome duplicate (WGD; dark blue), complex-forming small-scale duplicate (SSD; red), non-complex-forming WGD (light blue), and non-complex-forming SSD (pink) proteins. Significant differences in the degree of functional divergence between the pairs in the two categories (complex and non-complex) are observed. No significant difference between the semantic distances of pairs of SSDs found in complexes and complex-forming WGD pairs is observed; nor, indeed, is there any difference between SSD pairs not in complexes and WGD pairs not found within complexes.

Similar articles

Cited by

References

    1. Nei M. Gene duplication and nucleotide substitution in evolution. Nature. 1969;221:40–42. doi: 10.1038/221040a0. - DOI - PubMed
    1. Ohno S. Evolution by Gene Duplication. London, New York: Allen & Unwin, Springer-Verlag; 1970.
    1. Davis JC, Petrov DA. Do disparate mechanisms of duplication add similar genes to the genome? Trends Genet. 2005;21:548–551. doi: 10.1016/j.tig.2005.07.008. - DOI - PubMed
    1. Guan Y, Dunham MJ, Troyanskaya OG. Functional analysis of gene duplications in Saccharomyces cerevisiae. Genetics. 2007;175:933–943. doi: 10.1534/genetics.106.064329. - DOI - PMC - PubMed
    1. Gu X, Zhang Z, Huang W. Rapid evolution of expression and regulatory divergences after yeast gene duplication. Proc Natl Acad Sci USA. 2005;102:707–712. doi: 10.1073/pnas.0409186102. - DOI - PMC - PubMed

Publication types

LinkOut - more resources