Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan 31;17(1):38.
doi: 10.1186/s12862-016-0848-0.

Analysis of a mechanistic Markov model for gene duplicates evolving under subfunctionalization

Affiliations

Analysis of a mechanistic Markov model for gene duplicates evolving under subfunctionalization

Tristan L Stark et al. BMC Evol Biol. .

Abstract

Background: Gene duplication has been identified as a key process driving functional change in many genomes. Several biological models exist for the evolution of a pair of duplicates after a duplication event, and it is believed that gene duplicates can evolve in different ways, according to one process, or a mix of processes. Subfunctionalization is one such process, under which the two duplicates can be preserved by dividing up the function of the original gene between them. Analysis of genomic data using subfunctionalization and related processes has thus far been relatively coarse-grained, with mathematical treatments usually focusing on the phenomenological features of gene duplicate evolution.

Results: Here, we develop and analyze a mathematical model using the mechanics of subfunctionalization and the assumption of Poisson rates of mutation. By making use of the results from the literature on the Phase-Type distribution, we are able to derive exact analytical results for the model. The main advantage of the mechanistic model is that it leads to testable predictions of the phenomenological behavior (instead of building this behavior into the model a priori), and allows for the estimation of biologically meaningful parameters. We fit the survival function implied by this model to real genome data (Homo sapiens, Mus musculus, Rattus norvegicus and Canis familiaris), and compare the fit against commonly used phenomenological survival functions. We estimate the number of regulatory regions, and rates of mutation (relative to silent site mutation) in the coding and regulatory regions. We find that for the four genomes tested the subfunctionalization model predicts that duplicates most-likely have just a few regulatory regions, and the rate of mutation in the coding region is around 5-10 times greater than the rate in the regulatory regions. This is the first model-based estimate of the number of regulatory regions in duplicates.

Conclusions: Strong agreement between empirical results and the predictions of our model suggest that subfunctionalization provides a consistent explanation for the evolution of many gene duplicates.

Keywords: Continuous-time Markov chain (CTMC); Gene duplication; Neofunctionalization; Nonfunctionalization; Phase-type distribution; Pseudogenization; Subfunctionalization.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The (biological) transition diagram for z=4. Regions hit by null mutation are coloured red, and regions which are protected by selective pressure are coloured yellow
Fig. 2
Fig. 2
Pseudogenization rate hPz(t) with for z=12 γ less than a, greater than b and equal to c γ crit. Panel d shows the overall shape of hPz(t), with negative values of t included. a Pseudogenization rate hPz(t) with γ<γcritz. The change in concavity occurs at t≈2.7. As such, the sigmoidal nature of the function is apparent - we see an initially slowly decreasing hazard rate which decreases more and more rapidly up to the change in concavity, after which the decline in the hazard rate slows, and approaches the asymptote at zero. b Pseudogenization rate hPz(t) with γ>γcritz. Here the change in concavity occurs for some t<0, and hence cannot be seen in real, physical time (t>0). The shape is not obviously sigmoidal, and looks similar to that of an exponential decay. The rate of pseudogenization is initially declining rapidly, before approaching its asymptote at zero. c Pseudogenization rate hPz(t) with γ=γcritz. Here the change in concavity occurs at exactly t=0. This is qualitatively similar to the case in panel b, with the pseudogenization rate rapidly declining, and the decline becoming slower as the rate approaches its asymptote at zero. d Pseudogenization rate hPz(t) taken as a function over all . This gives a complete picture of the shape of the pseudogenization rate function. Smaller values of γ translate the graph to the right, and result in a longer initial period of slowly declining pseudogenization rate. If γ>γ crit, the point of inflection occurs to the left of t=0, and we see behaviour similar to panel a
Fig. 3
Fig. 3
Critical values γcritz for various values of z. When ur/ucγcritz the change in concavity for the pseudogenization rate function will occur in positive time. Otherwise, the change occurs in negative t and the sigmoidal nature of the function will not be apparent when plotted for t>0
Fig. 4
Fig. 4
Maximum likelihood estimates for γ=u r/u c for each z=2,3,…,20 for four mammalian species. a γ vs z in the MLE for Canis familiaris. b γ vs z in the MLE for Homo sapiens c γ vs z in the MLE for Rattus norvegicus d γ vs z in the MLE for Mus musculus
Fig. 5
Fig. 5
The profile likelihood curve for u c in the Rattus norvegicus genome

References

    1. Ohno S. The enormous diversity in genome sizes of fish as a reflection of nature’s extensive experiments with gene duplication. Trans Am Fish Soc. 1970;99(1):120–130. doi: 10.1577/1548-8659(1970)99<120:TEDIGS>2.0.CO;2. - DOI
    1. Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290(5494):1151–1155. doi: 10.1126/science.290.5494.1151. - DOI - PubMed
    1. Innan H, Kondrashov F. The evolution of gene duplications: classifying and distinguishing between models. Nat Rev Genet. 2010;11(2):97–108. - PubMed
    1. Liberles DA, Kolesov G, Dittmar K. Evolution after Gene Duplication. In: Dittmar K, Liberles D, editors. Hoboken: Wiley : 2010. doi:10.1002/9780470619902.ch1. - DOI
    1. Force A, Lynch M, Pickett FB, Amores A, Yan Yl. Postlethwait J. Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999;151(4):1531–1545. - PMC - PubMed

LinkOut - more resources