EnsMart: a generic system for fast and flexible access to biological data
- PMID: 14707178
- PMCID: PMC314293
- DOI: 10.1101/gr.1645104
Abstract
The EnsMart system (www.ensembl.org/EnsMart) provides a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools. The system consists of a query-optimized database and interactive, user-friendly interfaces. EnsMart has been applied to Ensembl, where it extends the genome browser's capabilities, facilitating rapid retrieval of customized data sets. It supports a wide variety of complex queries on various types of annotation for numerous species. These queries can be applied to many research problems, ranging from SNP selection for candidate gene screening, through cross-species evolutionary comparisons, to microarray annotation. Users can group and refine biological data according to many criteria, including cross-species analyses, disease links, sequence variations, and expression patterns. Both tabulated list data and biological sequence output can be generated dynamically, in HTML, text, Microsoft Excel, and compressed formats. A wide range of sequence types, such as cDNA, peptides, coding regions, UTRs, and exons, with additional upstream and downstream regions, can be retrieved. The EnsMart database can be accessed via a public Web site or through a Java application suite. Both implementations and the database are freely available for local installation, and can be extended or adapted to 'non-Ensembl' data sets.
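The idea of a query-optimized warehouse with filters and tabulated text output can be illustrated with a minimal sketch. It is not the EnsMart schema or API: the table name `gene_main`, its columns, the helper functions, and the three data rows are all hypothetical, chosen only to show a denormalized "one row per gene" table queried by selecting attributes and applying filters.

```python
import sqlite3

# Hypothetical column names and hand-made rows; not real Ensembl data.
COLUMNS = ["gene_id", "chromosome", "start_bp", "end_bp",
           "snp_count", "has_disease_link"]
ROWS = [
    ("GENE_A", "1", 1000, 5000, 3, 1),
    ("GENE_B", "1", 8000, 9000, 0, 0),
    ("GENE_C", "2", 200, 700, 5, 1),
]


def build_mart():
    """Create an in-memory, denormalized table with one row per gene."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE gene_main ("
        "gene_id TEXT, chromosome TEXT, start_bp INTEGER, end_bp INTEGER, "
        "snp_count INTEGER, has_disease_link INTEGER)"
    )
    conn.executemany("INSERT INTO gene_main VALUES (?, ?, ?, ?, ?, ?)", ROWS)
    return conn


def query(conn, attributes, filters):
    """Return the requested columns for rows matching every equality filter."""
    unknown = (set(attributes) | set(filters)) - set(COLUMNS)
    if unknown:
        raise ValueError(f"unknown columns: {sorted(unknown)}")
    # Column names are checked above; filter values are bound as parameters.
    where = " AND ".join(f"{col} = ?" for col in filters) or "1"
    sql = (f"SELECT {', '.join(attributes)} FROM gene_main "
           f"WHERE {where} ORDER BY gene_id")
    return conn.execute(sql, list(filters.values())).fetchall()


def to_tsv(attributes, rows):
    """Render a header line plus one tab-separated line per result row."""
    lines = ["\t".join(attributes)]
    lines += ["\t".join(str(value) for value in row) for row in rows]
    return "\n".join(lines)


if __name__ == "__main__":
    mart = build_mart()
    wanted = ["gene_id", "snp_count"]
    hits = query(mart, wanted, {"chromosome": "1", "has_disease_link": 1})
    print(to_tsv(wanted, hits))
```

Because every filterable property sits in a single wide table, each query in this sketch is one `SELECT` with no joins, which is the usual trade-off of a warehouse layout: redundant storage in exchange for simple, fast reads.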
Similar articles
- PhyloPat: phylogenetic pattern analysis of eukaryotic genes. BMC Bioinformatics. 2006 Sep 1;7:398. doi: 10.1186/1471-2105-7-398. PMID: 16948844. Free PMC article.
- GeneTools--application for functional annotation and statistical hypothesis testing. BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470. PMID: 17062145. Free PMC article.
- Retrieve-ensembl-seq: user-friendly and large-scale retrieval of single or multi-genome sequences from Ensembl. Bioinformatics. 2009 Oct 15;25(20):2739-40. doi: 10.1093/bioinformatics/btp519. Epub 2009 Aug 31. PMID: 19720677.
- Genome information resources - developments at Ensembl. Trends Genet. 2004 Jun;20(6):268-72. doi: 10.1016/j.tig.2004.04.002. PMID: 15145580. Review.
- UCSC genome browser: deep support for molecular biomedical research. Biotechnol Annu Rev. 2008;14:63-108. doi: 10.1016/S1387-2656(08)00003-3. PMID: 18606360. Review.
Cited by
- JBioWH: an open-source Java framework for bioinformatics data integration. Database (Oxford). 2013 Jul 11;2013:bat051. doi: 10.1093/database/bat051. Print 2013. PMID: 23846595. Free PMC article.
- "Guilt by association" is the exception rather than the rule in gene networks. PLoS Comput Biol. 2012;8(3):e1002444. doi: 10.1371/journal.pcbi.1002444. Epub 2012 Mar 29. PMID: 22479173. Free PMC article.
- The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species. BMC Genomics. 2009 Dec 14;10:604. doi: 10.1186/1471-2164-10-604. PMID: 20003425. Free PMC article.
- Integrative approaches to the prediction of protein functions based on the feature selection. BMC Bioinformatics. 2009 Dec 31;10:455. doi: 10.1186/1471-2105-10-455. PMID: 20043848. Free PMC article.
- T1DBase: integration and presentation of complex data for type 1 diabetes research. Nucleic Acids Res. 2007 Jan;35(Database issue):D742-6. doi: 10.1093/nar/gkl933. Epub 2006 Dec 14. PMID: 17169983. Free PMC article.
References
- Devlin, B. 1997. Data warehouse: From architecture to implementation, chapter 2. Addison Wesley Longman, Inc., Reading, MA.
WEB SITE REFERENCES
- www.ebi.ac.uk/miamexpress; MIAMExpress.
- www.rzpd.de/colBox/html/; RZPD's Genome-Matrix.
- www.ncbi.nlm.nih.gov; MapViewer at NCBI.
- www.ensembl.org/EnsMart; EnsMart.
- www.sanger.ac.uk; The Vertebrate Genome Annotation database.