Doublet frequencies in sequenced nucleic acids
- PMID: 1107565
- DOI: 10.1007/BF01732535
Doublet frequencies in sequenced nucleic acids
Abstract
A doublet frequency count (set of frequencies of the 16 possible two-base sequences) can be calculated from the experimentally determined overall sequence of a nucleic acid. In this paper, a statistical methodology is developed for comparing such counts with random, with others of the same type or with doublet proportions found in whole DNAs. The methods are applied to two major categories of sequenced RNAs. It is found that vertebrate ribosomal and transfer RNAs show significant differences from the overall vertebrate DNA pattern, especially in the frequency of the doublet CG. Bacterial rRNA and tRNA, on the other hand, show less dissimilarity from total DNA. In the RNA of the small bacteriophage MS2, the doublet frequencies of the translated regions of the genome resemble those in the host E. coli, whereas those in the intercistronic regions differ substantially. All these findings are discussed in relation to the origin, evolution and selection of the nucleic acids concerned.
Similar articles
-
Base composition of rapidly-labelled RNA in E. coli undergoing thymineless death.Biochem Biophys Res Commun. 1967 Mar 9;26(5):532-8. doi: 10.1016/0006-291x(67)90097-6. Biochem Biophys Res Commun. 1967. PMID: 4860538 No abstract available.
-
Investigation of the secondary structure of Escherichia coli 5 S RNA by high-resolution nuclear magnetic resonance.J Mol Biol. 1974 Aug 25;87(4):755-74. doi: 10.1016/0022-2836(74)90083-7. J Mol Biol. 1974. PMID: 4610155 No abstract available.
-
Sequential degradation of nucleic acids. Degradation of Escherichia coli B phenylalanine transfer ribonucleic acid.Biochemistry. 1969 Aug;8(8):3254-60. doi: 10.1021/bi00836a018. Biochemistry. 1969. PMID: 4309204 No abstract available.
-
[Primary and spatial structure of tRNA].Mol Biol (Mosk). 1984 Sep-Oct;18(5):1233-48. Mol Biol (Mosk). 1984. PMID: 6209547 Review. Russian.
-
[Thiolation of nucleic acids].Pharmazie. 1969 May;24(5):241-4. Pharmazie. 1969. PMID: 4980527 Review. German. No abstract available.
Cited by
-
DNA sequence of the constant gene region of the mouse immunoglobulin kappa chain.Nucleic Acids Res. 1981 Feb 25;9(4):971-81. doi: 10.1093/nar/9.4.971. Nucleic Acids Res. 1981. PMID: 6785724 Free PMC article.
-
Hierarchical analysis of influenza A hemagglutinin gene sequences.Nucleic Acids Res. 1982 Sep 11;10(17):5375-89. doi: 10.1093/nar/10.17.5375. Nucleic Acids Res. 1982. PMID: 7145705 Free PMC article.
-
DNA methylation and the frequency of CpG in animal DNA.Nucleic Acids Res. 1980 Apr 11;8(7):1499-504. doi: 10.1093/nar/8.7.1499. Nucleic Acids Res. 1980. PMID: 6253938 Free PMC article.
-
Markov chain analysis finds a significant influence of neighboring bases on the occurrence of a base in eucaryotic nuclear DNA sequences both protein-coding and noncoding.J Mol Evol. 1984-1985;21(3):278-88. doi: 10.1007/BF02102360. J Mol Evol. 1984. PMID: 6443131
-
Doublet frequencies and codon weighting in the DNA of Escherichia coli and its phages.J Mol Evol. 1976 Aug 3;8(2):117-35. doi: 10.1007/BF01739098. J Mol Evol. 1976. PMID: 787545