Quality assessment of maize assembled genomic islands (MAGIs) and large-scale experimental verification of predicted genes
- PMID: 16103354
- PMCID: PMC1186025
- DOI: 10.1073/pnas.0503394102
Abstract
Recent sequencing efforts have targeted the gene-rich regions of the maize (Zea mays L.) genome. We report the release of an improved assembly of maize assembled genomic islands (MAGIs). The 114,173 resulting contigs have been subjected to computational and physical quality assessments. Comparisons to the sequences of maize bacterial artificial chromosomes suggest that at least 97% (160 of 165) of MAGIs are correctly assembled. Because the rates at which junction-testing PCR primers for genomic survey sequences (90-92%) amplify genomic DNA are not significantly different from those of control primers (approximately 91%), we conclude that a very high percentage of genic MAGIs accurately reflect the structure of the maize genome. EST alignments, ab initio gene prediction, and sequence similarity searches of the MAGIs are available at the Iowa State University MAGI web site. This assembly contains 46,688 ab initio predicted genes. The expression of almost half (628 of 1,369) of a sample of the predicted genes that lack expression evidence was validated by RT-PCR. Our analyses suggest that the maize genome contains between approximately 33,000 and approximately 54,000 expressed genes. Approximately 5% (32 of 628) of the maize transcripts discovered do not have detectable paralogs among maize ESTs or detectable homologs from other species in the GenBank NR nucleotide/protein database. Analyses therefore suggest that this assembly of the maize genome contains approximately 350 previously uncharacterized expressed genes. We hypothesize that these "orphans" evolved quickly during maize evolution and/or domestication.
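The proportions quoted in the abstract can be checked directly from the counts given in parentheses. The short Python sketch below only recomputes those three percentages; the dictionary labels and the `percent` helper are illustrative names, not taken from the paper, and the sketch does not attempt to reproduce the genome-wide estimates (33,000 to 54,000 expressed genes, or approximately 350 orphans), whose derivation the abstract does not describe.

```python
# Counts quoted in the abstract, as (numerator, denominator).
# The labels are paraphrases chosen for this sketch.
reported = {
    "MAGIs judged correctly assembled against BAC sequences": (160, 165),
    "sampled predicted genes validated by RT-PCR": (628, 1369),
    "validated transcripts with no detectable paralog or homolog": (32, 628),
}


def percent(numerator, denominator):
    """Return numerator/denominator as a percentage, rounded to one decimal."""
    return round(100 * numerator / denominator, 1)


# Recompute each proportion from its raw counts.
percentages = {label: percent(n, d) for label, (n, d) in reported.items()}

for label, value in percentages.items():
    print(f"{label}: {value}%")
```

The recomputed values agree with the abstract's rounded wording: about 97%, "almost half", and approximately 5%.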