A model of DNA sequence evolution
- PMID: 2279193
- DOI: 10.1007/BF02460807
A model of DNA sequence evolution
Abstract
Statistical studies of gene populations on the purine/pyrimidine alphabet have shown that the mean occurrence probability of the i-motif YRY(N)iYRY (R = purine, Y = pyrimidine, N = R or Y) is not uniform by varying i in the range, but presents a maximum at i = 6 in the following populations: protein coding genes of eukaryotes, prokaryotes, chloroplasts and mitochondria, and also viral introns, ribosomal RNA genes and transfer RNA genes (Arquès and Michel, 1987b, J. theor. Biol. 128, 457-461). From the "universality" of this observation, we suggested that the oligonucleotide YRY(N)6 is a primitive one and that it has a central function in DNA sequence evolution (Arquès and Michel, 1987b, J. theor. Biol. 128, 457-461). Following this idea, we introduce a concept of a model of DNA sequence evolution which will be validated according to a schema presented in three parts. In the first part, using the last version of the gene database, the YRY(N)6YRY preferential occurrence (maximum at i = 6) is confirmed for the populations mentioned above and is extended to some newly analysed populations: chloroplast introns, chloroplast 5' regions, mitochondrial 5' regions and small nuclear RNA genes. On the other hand, the YRY(N)6YRY preferential occurrence and periodicities are used in order to classify 18 gene populations. In the second part, we will demonstrate that several statistical features characterizing different gene populations (in particular the YRY(N)6YRY preferential occurrence and the periodicities) can be retrieved from a simple Markov model based on the mixing of the two oligonucleotides YRY(N)6 and YRY(N)3 and based on the percentages of RYR and YRY in the unspecified trinucleotides (N)3 of YRY(N)6 and YRY(N)3. Several properties are identified and prove in particular that the oligonucleotide mixing is an independent process and that several different features are functions of a unique parameter. In the third part, the return of the model to the reality shows a strong correlation between reality and simulation concerning the presence of a large alternating purine/pyrimidine stretches and of periodicities. It also contributes to a greater understanding of biological reality, e.g. the presence or the absence of large alternating purine/pyrimidine stretches can be explained as being a simple consequence of the mixing of two particular oligonucleotides. Finally, we believe that such an approach is the first step toward a unified model of DNA sequence evolution allowing the molecular understanding of both the origin of life and the actual biological reality.
Similar articles
-
Analytical expression of the purine/pyrimidine autocorrelation function after and before random mutations.Math Biosci. 1994 Sep;123(1):103-25. doi: 10.1016/0025-5564(94)90020-5. Math Biosci. 1994. PMID: 7949744
-
Identification and simulation of shifted periodicities common to protein coding genes of eukaryotes, prokaryotes and viruses.J Theor Biol. 1995 Feb 7;172(3):279-91. doi: 10.1006/jtbi.1995.0024. J Theor Biol. 1995. PMID: 7715198
-
Identification and simulation of new non-random statistical properties common to different populations of eukaryotic non-coding genes.J Theor Biol. 1993 Apr 7;161(3):329-42. doi: 10.1006/jtbi.1993.1059. J Theor Biol. 1993. PMID: 8331957
-
A simulation of the genetic periodicities modulo 2 and 3 with processes of nucleotide insertions and deletions.J Theor Biol. 1992 May 7;156(1):113-27. doi: 10.1016/s0022-5193(05)80659-9. J Theor Biol. 1992. PMID: 1379311
-
Chloroplast and mitochondrial genomes from a liverwort, Marchantia polymorpha--gene organization and molecular evolution.Biosci Biotechnol Biochem. 1996 Jan;60(1):16-24. doi: 10.1271/bbb.60.16. Biosci Biotechnol Biochem. 1996. PMID: 8824820 Review.
Cited by
-
Identification of a circular code periodicity in the bacterial ribosome: origin of codon periodicity in genes?RNA Biol. 2020 Apr;17(4):571-583. doi: 10.1080/15476286.2020.1719311. Epub 2020 Feb 11. RNA Biol. 2020. PMID: 31960748 Free PMC article.
-
Analytical expression of the purine/pyrimidine codon probability after and before random mutations.Bull Math Biol. 1993 Nov;55(6):1025-38. doi: 10.1007/BF02460698. Bull Math Biol. 1993. PMID: 8281128
-
Structural and thermodynamic properties of DNA uncover different evolutionary histories.J Mol Evol. 1995 Jun;40(6):698-704. doi: 10.1007/BF00160519. J Mol Evol. 1995. PMID: 7643419