Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2007 Feb;17(2):175-83.
doi: 10.1101/gr.5509507. Epub 2007 Jan 8.

Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana

Rice Annotation Project  1 Takeshi ItohTsuyoshi TanakaRoberto A BarreroChisato YamasakiYasuyuki FujiiPhillip B HiltonBaltazar A AntonioHideo AonoRolf ApweilerRichard BruskiewichThomas BureauFrances BurrAntonio Costa de OliveiraGalina FuksTakuya HabaraGeorg HabererBin HanErimi HaradaAiko T HirakiHirohiko HirochikaDouglas HoenHiroki HokariSatomi HosokawaYue-ie HsingHiroshi IkawaKazuho IkeoTadashi ImanishiYukiyo ItoPankaj JaiswalMasako KannoYoshihiro KawaharaToshiyuki KawamuraHiroaki KawashimaJitendra P KhuranaShoshi KikuchiSetsuko KomatsuKanako O KoyanagiHiromi KubookaDamien LieberherrYao-Cheng LinDavid LonsdaleTakashi MatsumotoAkihiro MatsuyaW Richard McCombieJoachim MessingAkio MiyaoNicola MulderYoshiaki NagamuraJongmin NamNobukazu NamikiHisataka NumaShin NurimotoClaire O'DonovanHajime OhyanagiToshihisa OkidoSatoshi OotaNaoki OsatoLance E PalmerFrancis QuetierSaurabh RaghuvanshiNaomi SaichiHiroaki SakaiYasumichi SakaiKatsumi SakataTetsuya SakuraiFumihiko SatoYoshiharu SatoHeiko SchoofMotoaki SekiMichie ShibataYuji ShimizuKazuo ShinozakiYuji ShinsoNagendra K SinghBrian Smith-WhiteJun-ichi TakedaMotohiko TaninoTatiana TatusovaSupat ThongjueaFusano TodokoroMika TsuganeAkhilesh K TyagiApichart VanavichitAihui WangRod A WingKaori YamaguchiMayu YamamotoNaoyuki YamamotoYeisoo YuHao ZhangQiang ZhaoKenichi HigoBenjamin BurrTakashi GojoboriTakuji Sasaki
Affiliations
Comparative Study

Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana

Rice Annotation Project et al. Genome Res. 2007 Feb.

Abstract

We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is approximately 32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Correlation between the number of tRNA gene copies and occurrence frequency of amino acids in (A) O. sativa and (B) A. thaliana. The horizontal axis indicates the occurrence frequency (%), and the vertical axis indicates the copy number. R2 values were 0.41 and 0.16 in O. sativa and A. thaliana, respectively.
Figure 2.
Figure 2.
Number of orthologous clusters created by duplication after the speciation of O. sativa and A. thaliana. The vertical axis indicates the log-scaled number of orthologs (clusters), and the horizontal axis indicates the number of paralog members in a cluster. Black bars represent O. sativa, and white bars represent A. thaliana.
Figure 3.
Figure 3.
Number of functional domains based on InterProScan and Gene Ontology. Note that the numbers do not correspond to the numbers of the proteins because a single protein can contain multiple domains.
Figure 4.
Figure 4.
Result of similarity search of the O. sativa and A. thaliana proteins against the protein databases. The proteins were classified into five groups: homologous to nonplant, (nonflowering) plant, (nonmonocot/eudicot) flowering plant, (non-Oryzeae/Arabidopsis) monocot/eudicot, and Oryzeae/Arabidopsis-specific proteins.

References

    1. The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408:796–815. - PubMed
    1. Bennetzen J.L., Coleman C., Liu R., Ma J., Ramakrishna W., Coleman C., Liu R., Ma J., Ramakrishna W., Liu R., Ma J., Ramakrishna W., Ma J., Ramakrishna W., Ramakrishna W. Consistent over-estimation of gene number in complex plant genomes. Curr. Opin. Plant Biol. 2004;7:732–736. - PubMed
    1. Camon E., Magrane M., Barrell D., Binns D., Fleischmann W., Kersey P., Mulder N., Oinn T., Maslen J., Cox A., Magrane M., Barrell D., Binns D., Fleischmann W., Kersey P., Mulder N., Oinn T., Maslen J., Cox A., Barrell D., Binns D., Fleischmann W., Kersey P., Mulder N., Oinn T., Maslen J., Cox A., Binns D., Fleischmann W., Kersey P., Mulder N., Oinn T., Maslen J., Cox A., Fleischmann W., Kersey P., Mulder N., Oinn T., Maslen J., Cox A., Kersey P., Mulder N., Oinn T., Maslen J., Cox A., Mulder N., Oinn T., Maslen J., Cox A., Oinn T., Maslen J., Cox A., Maslen J., Cox A., Cox A., et al. The Gene Ontology Annotation (GOA) Project: Implementation of GO in SWISS-PROT, TrEMBL, and InterPro. Genome Res. 2003;13:662–672. - PMC - PubMed
    1. Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Oyama R., Ravasi T., Lenhard B., Wells C., Ravasi T., Lenhard B., Wells C., Lenhard B., Wells C., Wells C., et al. The transcriptional landscape of the mammalian genome. Science. 2005;309:1559–1563. - PubMed
    1. Chaw S.M., Chang C.C., Chen H.L., Li W.H., Chang C.C., Chen H.L., Li W.H., Chen H.L., Li W.H., Li W.H. Dating the monocot–dicot divergence and the origin of core eudicots using whole chloroplast genomes. J. Mol. Evol. 2004;58:424–441. - PubMed

Publication types

LinkOut - more resources