The application of Markov chain analysis to oligonucleotide frequency prediction and physical mapping of Drosophila melanogaster
- PMID: 1641330
- PMCID: PMC334014
- DOI: 10.1093/nar/20.14.3651
The application of Markov chain analysis to oligonucleotide frequency prediction and physical mapping of Drosophila melanogaster
Abstract
Here we compare several methods for predicting oligonucleotide frequencies in 691 kb of Drosophila melanogaster DNA. As in previous work on Escherichia coli and Saccharomyces cerevisiae, a relatively simple equation based on tetranucleotide frequencies can be used in predicting frequencies of higher order oligonucleotides. For example, the mean of observed/expected abundances of 4,096 hexamers was 1.07 with a sample standard deviation of .55. This simple predictor arises by considering each base on the sense strand of D. melanogaster to depend only on the three bases 5' to it (a 3rd order Markov chain) and is more accurate than the random predictor. This equation is useful in predicting restriction enzyme fragment sizes, selecting restriction enzymes that cut preferentially in coding vs noncoding regions, and in selecting probes to fingerprint clones in contig mapping. Once again, this equation well predicts the occurrence of higher order oligonucleotides, supporting our hypothesis that this predictor holds in evolutionarily diverse organisms. When ranked from highest to lowest abundance, the observed frequencies of oligomers of a given length are closely tracked by the predicted abundances of a 3rd order Markov chain. Through use of the dependence of oligomer frequencies on base composition, we report a list of oligomers that will be useful for the completion of a cosmid physical map of D. melanogaster. Presently, the library is such that it will be possible to construct large contigs using only 30 oligonucleotide probes to fingerprint cosmids.
Similar articles
-
Mono- through hexanucleotide composition of the sense strand of yeast DNA: a Markov chain analysis.Nucleic Acids Res. 1988 Jul 25;16(14B):7145-58. doi: 10.1093/nar/16.14.7145. Nucleic Acids Res. 1988. PMID: 3043378 Free PMC article.
-
Mono- through hexanucleotide composition of the Escherichia coli genome: a Markov chain analysis.Nucleic Acids Res. 1987 Mar 25;15(6):2611-26. doi: 10.1093/nar/15.6.2611. Nucleic Acids Res. 1987. PMID: 3550699 Free PMC article.
-
Analysis of two cosmid clones from chromosome 4 of Drosophila melanogaster reveals two new genes amid an unusual arrangement of repeated sequences.Genome Res. 1999 Feb;9(2):137-49. Genome Res. 1999. PMID: 10022978 Free PMC article.
-
DNA fingerprinting of eukaryote genomes by synthetic oligodeoxyribonucleotide probes.Indian J Biochem Biophys. 1991 Feb;28(1):1-9. Indian J Biochem Biophys. 1991. PMID: 2055594 Review.
-
Isolation and characterization of DNA probes for human chromosome 21.Prog Clin Biol Res. 1990;360:53-67. Prog Clin Biol Res. 1990. PMID: 2247511 Review.
Cited by
-
Counting stem cells: methodological constraints.Nat Methods. 2012 May 30;9(6):567-74. doi: 10.1038/nmeth.2043. Nat Methods. 2012. PMID: 22669654
-
Over- and underrepresentation of short DNA words in herpesvirus genomes.J Comput Biol. 1996 Fall;3(3):345-60. doi: 10.1089/cmb.1996.3.345. J Comput Biol. 1996. PMID: 8891954 Free PMC article.
-
On the consistency of a physical mapping method to reconstruct a chromosome in vitro.Genetics. 1996 Jan;142(1):267-84. doi: 10.1093/genetics/142.1.267. Genetics. 1996. PMID: 8770604 Free PMC article.
-
Atypical regions in large genomic DNA sequences.Proc Natl Acad Sci U S A. 1994 Jul 19;91(15):7134-8. doi: 10.1073/pnas.91.15.7134. Proc Natl Acad Sci U S A. 1994. PMID: 8041759 Free PMC article.
-
Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.Microbiol Mol Biol Rev. 2009 Sep;73(3):481-509, Table of Contents. doi: 10.1128/MMBR.00037-08. Microbiol Mol Biol Rev. 2009. PMID: 19721087 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Research Materials