Analytical expression of the purine/pyrimidine codon probability after and before random mutations
- PMID: 8281128
- DOI: 10.1007/BF02460698
Analytical expression of the purine/pyrimidine codon probability after and before random mutations
Abstract
Recently, we proposed a new model of DNA sequence evolution (Arquès and Michel. 1990b. Bull. math. Biol. 52, 741-772) according to which actual genes on the purine/pyrimidine (R/Y) alphabet (R = purine = adenine or guanine, Y = pyrimidine = cytosine or thymine) are the result of two successive evolutionary genetic processes: (i) a mixing (independent) process of non-random oligonucleotides (words of base length less than 10: YRY(N)6, YRYRYR and YRYYRY are so far identified; N = R or Y) leading to primitive genes (words of several hundreds of base length) and followed by (ii) a random mutation process, i.e., transformations of a base R (respectively Y) into the base Y (respectively R) at random sites in these primitive genes. Following this model the problem investigated here is the study of the variation of the 8 R/Y codon probabilities RRR, ..., YYY under random mutations. Two analytical expressions solved here allow analysis of this variation in the classical evolutionary sense (from the past to the present, i.e., after random mutations), but also in the inverted evolutionary sense (from the present to the past, i.e., before random mutations). Different properties are also derived from these formulae. Finally, a few applications of these formulae are presented. They prove the proposition in Arquès and Michel (1990b. Bull. math. Biol. 52, 741-772), Section 3.3.2, with the existence of a maximal mean number of random mutations per base of the order 0.3 in the protein coding genes. They also confirm the mixing process of oligonucleotides by excluding the purine/pyrimidine contiguous and alternating tracts from the formation process of primitive genes.
Similar articles
-
Analytical expression of the purine/pyrimidine autocorrelation function after and before random mutations.Math Biosci. 1994 Sep;123(1):103-25. doi: 10.1016/0025-5564(94)90020-5. Math Biosci. 1994. PMID: 7949744
-
Identification and simulation of new non-random statistical properties common to different populations of eukaryotic non-coding genes.J Theor Biol. 1993 Apr 7;161(3):329-42. doi: 10.1006/jtbi.1993.1059. J Theor Biol. 1993. PMID: 8331957
-
A model of DNA sequence evolution.Bull Math Biol. 1990;52(6):741-72. doi: 10.1007/BF02460807. Bull Math Biol. 1990. PMID: 2279193
-
Identification and simulation of shifted periodicities common to protein coding genes of eukaryotes, prokaryotes and viruses.J Theor Biol. 1995 Feb 7;172(3):279-91. doi: 10.1006/jtbi.1995.0024. J Theor Biol. 1995. PMID: 7715198
-
Analytical solutions of the dinucleotide probability after and before random mutations.J Theor Biol. 1995 Aug 21;175(4):533-44. doi: 10.1006/jtbi.1995.0161. J Theor Biol. 1995. PMID: 7475089
Cited by
-
Phylogenetic inference with weighted codon evolutionary distances.J Mol Evol. 2009 Apr;68(4):377-92. doi: 10.1007/s00239-009-9212-y. Epub 2009 Mar 24. J Mol Evol. 2009. PMID: 19308635
-
Solving the master equation for Indels.BMC Bioinformatics. 2017 May 12;18(1):255. doi: 10.1186/s12859-017-1665-1. BMC Bioinformatics. 2017. PMID: 28494756 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Miscellaneous