Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug;53(8):486-97.
doi: 10.1002/dvg.22873. Epub 2015 Jul 16.

Xenbase: Core features, data acquisition, and data processing

Affiliations

Xenbase: Core features, data acquisition, and data processing

Christina James-Zorn et al. Genesis. 2015 Aug.

Abstract

Xenbase, the Xenopus model organism database (www.xenbase.org), is a cloud-based, web-accessible resource that integrates the diverse genomic and biological data from Xenopus research. Xenopus frogs are one of the major vertebrate animal models used for biomedical research, and Xenbase is the central repository for the enormous amount of data generated using this model tetrapod. The goal of Xenbase is to accelerate discovery by enabling investigators to make novel connections between molecular pathways in Xenopus and human disease. Our relational database and user-friendly interface make these data easy to query and allows investigators to quickly interrogate and link different data types in ways that would otherwise be difficult, time consuming, or impossible. Xenbase also enhances the value of these data through high-quality gene expression curation and data integration, by providing bioinformatics tools optimized for Xenopus experiments, and by linking Xenopus data to other model organisms and to human data. Xenbase draws in data via pipelines that download data, parse the content, and save them into appropriate files and database tables. Furthermore, Xenbase makes these data accessible to the broader biomedical community by continually providing annotated data updates to organizations such as NCBI, UniProtKB, and Ensembl. Here, we describe our bioinformatics, genome-browsing tools, data acquisition and sharing, our community submitted and literature curation pipelines, text-mining support, gene page features, and the curation of gene nomenclature and gene models.

Keywords: Xenopus; biocuration; gene expression; genomics; human disease modeling; model organism database; phenotypes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Xenbase content. Schematic representation of the major data types stored within Xenbase, and their relationships to each other.
Figure 2
Figure 2
An example of a Xenbase Gene Page. In this example data for the pax6 gene is displayed. Under the top navigation bar a set of ‘tabs’ are visible, each of which will load specialized content when selected. Within the body of the Summary tab (the default), information is organized into rows containing the same type of data, and columns that contain data from the same species or sub-genome. Most tabs indicate if they contain content with either an icon or number, to avoid non-productive selection of empty tabs.
Figure 3
Figure 3
Xenbase GBrowse. The Xenbase browser can load three different builds of both X. tropicalis (4.1, 7.1 and 8.0) and X. laevis (6.0, 7.2 [both J strain] and WT1.0). Multiple gene models, their annotations, and a variety of epigenomic and ancillary data are available for most genome builds. Many tracks launch pop-up windows with further information. In this view a user has clicked on the morpholino feature labelled “cyp26a1 MO1”. This reagent was designed against X. laevis cyp26a, so a number of mismatches are present in the alignment to the active view, which is X. tropicalis build 7.1. The stacked ChIP-seq tracks for p300 are displayed using Topoview (http://flybase.org/static_pages/docs/software/topoview.html).
Figure 4
Figure 4
Curation pipeline. The curation pipelines incorporate an integrated set of automatic, semi-automatic and manual processes, where as much data as possible from each publication is curated. A final step is when the paper is entered into the “Xenbase Article Curation Tracker” (XACT) an automatic set of Ruby and Perl scripts that generate a Google fusion table containing all the information on papers imported each week. Curators access this table to monitor the status, content and priority of all papers in the database from the time of its first entry to its complete curation.
Figure 5
Figure 5. Literature Page
In addition to features such as title, authors and abstract Xenbase publications also display lists of genes cited, antibodies and morpholinos referenced (with hyperlinks to these separate resource pages). All anatomical features (e.g. neural tube) identified by text-mining for XAO terms are hyperlinked, as are gene symbols (e.g. Rfx2). Author names are also linked to pages in our community module and through this, to the author’s other publications.

References

    1. Amberger J, Bocchini CA, Scott AF, Hamosh A. McKusick’s Online Mendelian Inheritance in Man (OMIM) Nucleic Acids Res. 2009;37:D793–796. - PMC - PubMed
    1. Bell GW, Yatskievych TA, Antin PB. GEISHA, a whole-mount in situ hybridization gene expression screen in chicken embryos. Dev Dyn. 2004;229:677–687. - PubMed
    1. Bowes JB, Snyder KA, Segerdell E, Jarabek CJ, Azam K, Zorn AM, Vize PD. Xenbase: Gene Expression and Improved Integration. Nucleic Acids Res. 2010;38:D607–612. - PMC - PubMed
    1. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421-2105-10-421. - PMC - PubMed
    1. Eppig JT, Blake JA, Bult CJ, Kadin JA, Richardson JE Mouse Genome Database Group. The Mouse Genome Database (MGD): Facilitating mouse as a model for human biology and disease. Nucleic Acids Res. 2015;43:D726–736. - PMC - PubMed

Publication types

LinkOut - more resources