Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 May;138(1):38-46.
doi: 10.1104/pp.104.059204.

Databases and information integration for the Medicago truncatula genome and transcriptome

Affiliations

Databases and information integration for the Medicago truncatula genome and transcriptome

Steven B Cannon et al. Plant Physiol. 2005 May.

Abstract

An international consortium is sequencing the euchromatic genespace of Medicago truncatula. Extensive bioinformatic and database resources support the marker-anchored bacterial artificial chromosome (BAC) sequencing strategy. Existing physical and genetic maps and deep BAC-end sequencing help to guide the sequencing effort, while EST databases provide essential resources for genome annotation as well as transcriptome characterization and microarray design. Finished BAC sequences are joined into overlapping sequence assemblies and undergo an automated annotation process that integrates ab initio predictions with EST, protein, and other recognizable features. Because of the sequencing project's international and collaborative nature, data production, storage, and visualization tools are broadly distributed. This paper describes databases and Web resources for the project, which provide support for physical and genetic maps, genome sequence assembly, gene prediction, and integration of EST data. A central project Web site at medicago.org/genome provides access to genome viewers and other resources project-wide, including an Ensembl implementation at medicago.org, physical map and marker resources at mtgenome.ucdavis.edu, and genome viewers at the University of Oklahoma (www.genome.ou.edu), the Institute for Genomic Research (www.tigr.org), and Munich Information for Protein Sequences Center (mips.gsf.de).

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
BAC sequence assembly viewer. The database and Web interface at http://medicago.org/genome includes a variety of visualization tools and data formats, including this sequence assembly viewer. The viewer accesses database information for each BAC, shown in the popup menu. Links from the popup menu lead to other database reports and directly to other data sources and views: GenBank records, contig and marker reports, contig images (such as one shown in Fig. 2), and genome viewers.
Figure 2.
Figure 2.
Image of FPC contig. The database and Web interface at http://mtgenome.ucdavis.edu provides access and visualizations for the Medicago physical and genetic map. This image is typical of FPC contigs in the project, showing fingerprint-based BAC overlaps, BACs being sequenced, BACs with and without BAC end sequence, markers, and map position. BES, BAC end sequence; MTGS, M. truncatula genome sequence.
Figure 3.
Figure 3.
Ensembl and MtDB EST pipelines and information resources. The EnsEBML annotation and the MtDB and Nimbus EST processing pipelines are supported by an automated system for downloading and maintaining up-to-date consistent target information (such as BLAST, HMM, and other targets on the right side of figure), and associated metadata. Core database resources, including BAC and EST sequences, BLAST hits, gene predictions, and other features (left and center of figure) are generally maintained in a relational format, and are accessible to all aspects of the projects. Software resources are generally deployed across compute clusters, and are managed by a scheduling system that directs loading and submission of work to cluster queues.

Comment in

  • Biological databases for plant research.
    Rhee SY, Crosby B. Rhee SY, et al. Plant Physiol. 2005 May;138(1):1-3. doi: 10.1104/pp.104.900158. Plant Physiol. 2005. PMID: 15888672 Free PMC article. No abstract available.

References

    1. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402 - PMC - PubMed
    1. Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, et al (2004) UniProt: the Universal Protein knowledgebase. Nucleic Acids Res 32: D115–D119 - PMC - PubMed
    1. Arumuganathan K, Earle ED (1991) Nuclear DNA content of some important plant species. Plant Mol Biol Rep 9: 208–218
    1. Bell CJ, Dixon RA, Farmer AD, Flores R, Inman J, Gonzales RA, Harrison MJ, Paiva NL, Scott AD, Weller JW, et al (2001) The Medicago Genome Initiative: a model legume database. Nucleic Acids Res 29: 114–117 - PMC - PubMed
    1. Birney E, Andrews D, Bevan P, Caccamo M, Cameron G, Chen Y, Clarke L, Coates G, Cox T, Cuff J, et al (2004) An overview of Ensembl. Nucleic Acids Res 32: D468–D470 - PMC - PubMed

Publication types