Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Nov;5(11):e1000728.
doi: 10.1371/journal.pgen.1000728. Epub 2009 Nov 20.

Detailed analysis of a contiguous 22-Mb region of the maize genome

Affiliations

Detailed analysis of a contiguous 22-Mb region of the maize genome

Fusheng Wei et al. PLoS Genet. 2009 Nov.

Abstract

Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These results, performed on approximately 1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Physical and genetic features of AR182.
Genetic (center), Optical (left), and physical and synteny maps (right) of AR182 are shown. (A) Magnified view of the AR182 pseudomolecule in silico restriction map (blue track) and Optical Map (gold track) alignment after finishing (each box = 1 restriction fragment); turquoise box demarcates zoomed region on (B) showing the entire alignment ∼22 Mb (∼1,000 ordered restriction fragments); (C) A simplified maize Chr4 genetic map; (D) The maize Chr4 physical map; (E) Overall syntenic relationships among maize, sorghum and rice with respect to AR182. This three-way synteny map was built by SyMap from pseudomolecule to pseudomolecule comparisons.
Figure 2
Figure 2. TE and gene distribution along AR182.
The distribution was constructed based on nucleotide length of the related TE in 100-kb sliding windows. The numbers at the left vertical axis represent the nucleotide length of related TE classifications. The numbers in the right axis are the gene number counts.
Figure 3
Figure 3. Comparative mapping of protein-coding and miRNA genes in orthologous segments of the rice, sorghum, and maize genomes.
Abbreviations: Osj2 = rice (japonica) chromosome 2; Sb4 = sorghum chromosome 4; Zm4 = maize chromosome 4 (AR182); Zm5 = maize chromosome 5 (homeologous region). All mappings are drawn relative to rice as a common reference. Genes are shown as tick marks in the outer radius of correspondence lines. Inversions are indicated with yellow highlighting. For Zm4 the density of repetitive sequence is shown in gray. Zm5 mappings are to individual BACs (boxes) projected onto the FPC map. (A) Mappings of protein-coding genes based on reciprocal best hit. (B) Mappings of miRNA genes based on family membership.
Figure 4
Figure 4. Distribution of truncated genes among syntenic and non-syntenic maize loci.
Protein coding length ratio (length of maize/length of sorghum or rice) between highest scoring homologs is used as a measure of truncation. Non-syntenic loci contrast with syntenic loci in showing a bimodal distribution.
Figure 5
Figure 5. Box-plots showing divergence rates among syntenic (SYN) and non-syntenic (nSYN) maize genes relative to their best scoring homolog in sorghum.
(A) Ks. (B) Ka. Genes were categorized by CDS length ratio using a threshold of 0.8 (maize CDS length / sorghum CDS length). Sample sizes: nSYN(<0.8) n = 68; nSYN(≥0.8) n = 32; SYN(<0.8) n = 54; SYN(≥0.8) n = 340.
Figure 6
Figure 6. Distributions of DNA transposons and their related small RNAs.
(A) Number of distinct DNA transposon-related small RNAs under different genetic backgrounds. (B) The total number of DNA transposon-related small RNAs under different genetic backgrounds. (C) Small RNAs discrepancies between B73 and K55-wt backgrounds.

Similar articles

Cited by

References

    1. Duvick DN. Genetic contributions to yield gains of U.S. hybrid maize, 1930–1980. In: Fehr WR, editor. Genetic contributions to yield gains of five major crop plants. Madison, WI: Crop Science Soc of America Special Publ No 7; 1984.
    1. Bennetzen JL, Hake S. The maize handbook; Volumes 1 and II. New York: Springer; 2009.
    1. McClintock B. Mutable loci in maize. Carnegie Inst Wash Year Book. 1948;47:155–169. - PubMed
    1. SanMiguel P, Tikhonov A, Jin Y-K, Motchoulskaia N, Zakharov D, et al. Nested retrotransposons in the intergenic regions of the maize genome. Science. 1996;274:765–768. - PubMed
    1. SanMiguel P, Bennetzen JL. Evidence that a recent increase in maize genome size was caused by the massive amplification of intergene retrotransposons. Annals Bot. 1998;82:37–44.

Publication types

MeSH terms