Highly recurring sequence elements identified in eukaryotic DNAs by computer analysis are often homologous to regulatory sequences or protein binding sites
- PMID: 3822840
- PMCID: PMC340585
- DOI: 10.1093/nar/15.4.1835
Highly recurring sequence elements identified in eukaryotic DNAs by computer analysis are often homologous to regulatory sequences or protein binding sites
Abstract
We have used computer assisted dot matrix and oligonucleotide frequency analyses to identify highly recurring sequence elements of 7-11 base pairs in eukaryotic genes and viral DNAs. Such elements are found much more frequently than expected, often with an average spacing of a few hundred base pairs. Furthermore, the most abundant repetitive elements observed in the ovalbumin locus, the beta-globin gene cluster, the metallothionein gene and the viral genomes of SV40, polyoma, Herpes simplex-1 and Mouse Mammary Tumor Virus were sequences shown previously to be protein binding sites or sequences important for regulating gene expression. These sequences were present in both exons and introns as well as promoter regions. These observations suggest that such sequences are often highly overrepresented within the specific gene segments with which they are associated. Computer analysis of other genetic units, including viral genomes and oncogenes, has identified a number of highly recurring sequence elements that could serve similar regulatory or protein-binding functions. A model for the role of such reiterated sequence elements in DNA organization and function is presented.
Similar articles
-
Sequence organization in regulatory regions of DNA of minute virus of mice.Virus Genes. 1989 Mar;2(2):167-82. doi: 10.1007/BF00315260. Virus Genes. 1989. PMID: 2541561
-
Computer tool FUNSITE for analysis of eukaryotic regulatory genomic sequences.Proc Int Conf Intell Syst Mol Biol. 1995;3:197-205. Proc Int Conf Intell Syst Mol Biol. 1995. PMID: 7584437
-
Nucleotide sequence analysis of avian retroviruses: structural similarities with transposable elements.Fed Proc. 1982 Aug;41(10):2659-61. Fed Proc. 1982. PMID: 6286367
-
Use of long sequence alignments to study the evolution and regulation of mammalian globin gene clusters.Mol Biol Evol. 1993 Jan;10(1):73-102. doi: 10.1093/oxfordjournals.molbev.a039991. Mol Biol Evol. 1993. PMID: 8383794 Review.
-
Sequence organization of animal nuclear DNA.Hum Genet. 1980;55(1):1-18. doi: 10.1007/BF00329120. Hum Genet. 1980. PMID: 6256281 Review.
Cited by
-
Sequence organization in regulatory regions of DNA of minute virus of mice.Virus Genes. 1989 Mar;2(2):167-82. doi: 10.1007/BF00315260. Virus Genes. 1989. PMID: 2541561
-
Potential genetic functions of tandem repeated DNA sequence blocks in the human genome are based on a highly conserved "chromatin folding code".Hum Genet. 1990 Mar;84(4):301-36. doi: 10.1007/BF00196228. Hum Genet. 1990. PMID: 2407640 Review.
-
Complex interaction of yeast nuclear proteins with the enhancer/promoter region of SV40.Curr Genet. 1991 Nov;20(5):359-63. doi: 10.1007/BF00317062. Curr Genet. 1991. PMID: 1666981
-
Attachment of DNA to the nucleoskeleton of HeLa cells examined using physiological conditions.Nucleic Acids Res. 1990 Aug 11;18(15):4385-93. doi: 10.1093/nar/18.15.4385. Nucleic Acids Res. 1990. PMID: 2167466 Free PMC article.
-
Statistical analysis of nucleotide sequences.Nucleic Acids Res. 1990 Nov 25;18(22):6641-7. doi: 10.1093/nar/18.22.6641. Nucleic Acids Res. 1990. PMID: 2251125 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical