Distribution of genes in the genome of Arabidopsis thaliana and its implications for the genome organization of plants
- PMID: 9707597
- PMCID: PMC21458
- DOI: 10.1073/pnas.95.17.10044
Distribution of genes in the genome of Arabidopsis thaliana and its implications for the genome organization of plants
Abstract
Previous work has shown that, in the large genomes of three Gramineae [rice, maize, and barley: 415, 2,500, and 5,300 megabases (Mb), respectively] most genes are clustered in long DNA segments (collectively called the "gene space") that represent a small fraction (12-24%) of nuclear DNA, cover a very narrow (0.8-1.6%) GC range, and are separated by vast expanses of gene-empty sequences. In the present work, we have analyzed the small (ca. 120 Mb) nuclear genome of Arabidopsis thaliana and shown that its organization is drastically different from that of the genomes of Gramineae. Indeed, (i) genes are distributed over about 85% of the main band of DNA in CsCl and cover an 8% GC range; (ii) ORFs are fairly evenly distributed in long (>50 kb) sequences from GenBank that amount to about 10 Mb; and (iii) the GC levels of protein-coding sequences (and of their third codon positions) are correlated with the GC levels of their flanking sequences. The different pattern of gene distribution of Arabidopsis compared with Gramineae appears to be because the genomes of the latter comprise (i) many large gene-empty regions separating gene clusters and (ii) abundant transposons in the intergenic sequences of gene clusters. Both sequences are absent or very scarce in the Arabidopsis genome. These observations provide a comparative view of angiosperm genome organization.
Figures




Similar articles
-
The distribution of T-DNA in the genomes of transgenic Arabidopsis and rice.FEBS Lett. 2000 Apr 14;471(2-3):161-4. doi: 10.1016/s0014-5793(00)01393-4. FEBS Lett. 2000. PMID: 10767414
-
The distribution of genes in the genomes of Gramineae.Proc Natl Acad Sci U S A. 1997 Jun 24;94(13):6857-61. doi: 10.1073/pnas.94.13.6857. Proc Natl Acad Sci U S A. 1997. PMID: 9192656 Free PMC article.
-
The gene distribution in the genomes of pea, tomato and date palm.FEBS Lett. 1999 Dec 10;463(1-2):139-42. doi: 10.1016/s0014-5793(99)01587-2. FEBS Lett. 1999. PMID: 10601654
-
Colinearity and gene density in grass genomes.Trends Plant Sci. 2000 Jun;5(6):246-51. doi: 10.1016/s1360-1385(00)01629-0. Trends Plant Sci. 2000. PMID: 10838615 Review.
-
Angiosperm mitochondrial genomes and mutations.Mitochondrion. 2008 Jan;8(1):5-14. doi: 10.1016/j.mito.2007.10.006. Epub 2007 Nov 4. Mitochondrion. 2008. PMID: 18065297 Review.
Cited by
-
Two classes of genes in plants.Genetics. 2000 Apr;154(4):1819-25. doi: 10.1093/genetics/154.4.1819. Genetics. 2000. PMID: 10747072 Free PMC article.
-
Targeted analysis of orthologous phytochrome A regions of the sorghum, maize, and rice genomes using comparative gene-island sequencing.Plant Physiol. 2002 Dec;130(4):1614-25. doi: 10.1104/pp.012567. Plant Physiol. 2002. PMID: 12481045 Free PMC article.
-
Gene content and density in banana ( Musa acuminata) as revealed by genomic sequencing of BAC clones.Theor Appl Genet. 2004 Jun;109(1):129-39. doi: 10.1007/s00122-004-1603-2. Epub 2004 Feb 18. Theor Appl Genet. 2004. PMID: 14985976
-
Organization and structural evolution of four multigene families in Arabidopsis thaliana: AtLCAD, AtLGT, AtMYST and AtHD-GL2.Plant Mol Biol. 2000 Mar;42(5):703-17. doi: 10.1023/a:1006368316413. Plant Mol Biol. 2000. PMID: 10809443
-
Insights from the GC content analysis of 76genome survey sequences (GSS) from Elaeisoleifera.Bioinformation. 2010 Sep 20;5(4):141-5. doi: 10.6026/97320630005141. Bioinformation. 2010. PMID: 21364775 Free PMC article.
References
-
- Salinas J, Matassi G, Montero L M, Bernardi G. Nucleic Acids Res. 1988;19:5561–5567. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous