Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2002 Oct;12(10):1549-55.
doi: 10.1101/gr.268302.

Mosaic organization of orthologous sequences in grass genomes

Affiliations
Comparative Study

Mosaic organization of orthologous sequences in grass genomes

Rentao Song et al. Genome Res. 2002 Oct.

Abstract

Although comparative genetic mapping studies show extensive genome conservation among grasses, recent data provide many exceptions to gene collinearity at the DNA sequence level. Rice, sorghum, and maize are closely related grass species, once sharing a common ancestor. Because they diverged at different times during evolution, they provide an excellent model to investigate sequence divergence. We isolated, sequenced, and compared orthologous regions from two rice subspecies, sorghum, and maize to investigate the nature of their sequence differences. This study represents the most extensive sequence comparison among grasses, including the largest contiguous genomic sequences from sorghum (425 kb) and maize (435 kb) to date. Our results reveal a mosaic organization of the orthologous regions, with conserved sequences interspersed with nonconserved sequences. Gene amplification, gene movement, and retrotransposition account for the majority of the nonconserved sequences. Our analysis also shows that gene amplification is frequently linked with gene movement. Analyzing an additional 2.9 Mb of genomic sequence from rice not only corroborates our observations, but also suggests that a significant portion of grass genomes may consist of paralogous sequences derived from gene amplification. We propose that sequence divergence started from hotspots along chromosomes and expanded by accumulating small-scale genomic changes during evolution.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sequence comparison of orthologous regions from rice, sorghum, and maize. Sequences of genomic regions from rice (subspecies japonica and indica), sorghum, and maize have been vertically arranged with a size ruler (in kilobases) for each sequence. Only orthologous regions (CR1–CR4) are highlighted. Light-blue areas indicate the homologous regions. The boxed sections between CR2 and CR3 in sorghum and maize represent an insertion relative to rice. Pink boxes along the rulers indicate conserved sequence regions across species (CR1–CR4, CRa, CRb). In maize, CR4 was divided into CR4-1 and CR4-2, reflecting an insertion relative to sorghum and rice. In sorghum, CRb was split into two pieces because of an insertion relative to maize. Green bars indicate RFLP marker php200725 and its orthologs. The second green bar in the sorghum region marks the duplication of php200725. Small triangles indicate amplified storage-protein gene copies and their orientation (22-kD zein in maize and 22-kD kafirin in sorghum, respectively). Violet arrows indicate Mla1-like disease-resistance genes and their orientation. All other genes are indicated by small red bars. Genes without a pink outer box indicate nonconserved genes. Gray bars indicate areas of retrotransposons, and blue bars DNA transposons. Two hatched boxes in sorghum indicate two MITE-rich regions (5′ region, <1 kb/MITE; 3′ region, <2 kb/MITE). All kafirins and zeins showed homology to each other, but only the oldest copies are indicated. A summary of predicted genes and their positions is given in Table 1 for convenience.

References

    1. Albagli O, Dhordain P, Deweindt C, Lecicq G, Leprince D. The BTB/POZ domain: A new protein–protein interaction motif common to DNA- and action-binding proteins. Cell Growth Differ. 1995;6:1193–1198. - PubMed
    1. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. - PMC - PubMed
    1. The Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408:796–815. - PubMed
    1. Arumunganathan K, Earle ED. Nuclear DNA content of some important plant species. Plant Mol Biol Rep. 1991;9:208–219.
    1. Bancroft I. Insights into the structural and functional evolution of plant genomes afforded by the nucleotide sequences of Chromosomes 2 and 4 of Arabidoposis thaliana. Yeast. 2000;17:1–5. - PMC - PubMed

Publication types

Associated data

LinkOut - more resources