Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Jan 1;33(Database issue):D466-70.
doi: 10.1093/nar/gki045.

Improvements to GALA and dbERGE II: databases featuring genomic sequence alignment, annotation and experimental results

Affiliations

Improvements to GALA and dbERGE II: databases featuring genomic sequence alignment, annotation and experimental results

Laura Elnitski et al. Nucleic Acids Res. .

Abstract

We describe improvements to two databases that give access to information on genomic sequence similarities, functional elements in DNA and experimental results that demonstrate those functions. GALA, the database of Genome ALignments and Annotations, is now a set of interlinked relational databases for five vertebrate species, human, chimpanzee, mouse, rat and chicken. For each species, GALA records pairwise and multiple sequence alignments, scores derived from those alignments that reflect the likelihood of being under purifying selection or being a regulatory element, and extensive annotations such as genes, gene expression patterns and transcription factor binding sites. The user interface supports simple and complex queries, including operations such as subtraction and intersections as well as clustering and finding elements in proximity to features. dbERGE II, the database of Experimental Results on Gene Expression, contains experimental data from a variety of functional assays. Both databases are now run on the DB2 database management system. Improved hardware and tuning has reduced response times and increased querying capacity, while simplified query interfaces will help direct new users through the querying process. Links are available at http://www.bx.psu.edu/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Use of GALA to identify a known regulatory element using features of alignments and annotations. The results of four queries on GALA are displayed as custom tracks in the UCSC Genome Browser (1,2), along with genes and the phyloHMM conservation track (21). The distal major regulatory element for the alpha-like globin genes is marked by an arrow. The track ‘phyloHMMcons>=0.4 no exons’ shows all the DNA segments with a phyloHMMcons score of at least 0.4, which is a rather stringent threshold, after subtracting all the segments exceeding the threshold that overlapped an exon. The track ‘conserved GATA-1 binding site’ shows the matches to GATA-1 binding site weight matrices in TRANSFAC (24) that are also conserved in human, mouse and rat. These match the weight matrices in those species as well. The track ‘cluster of consvd GATA-1 binding sites’ shows the conserved GATA-1 binding sites that have another one within 100 bp. The track ‘Hi phyloHMMcons, no exons, cluster consvd GATA-1 binding sites’ shows all the segments from the first data track that include one of the clusters on the third data track.

Similar articles

Cited by

References

    1. Kent W.J., Sugnet,C.W., Furey,T.S., Roskin,K.M., Pringle,T.H., Zahler,A.M. and Haussler,D. (2002) The human genome browser at UCSC. Genome Res., 12, 996–1006. - PMC - PubMed
    1. Karolchik D., Baertsch,R., Diekhans,M., Furey,T.S., Hinrichs,A., Lu,Y.T., Roskin,K.M., Schwartz,M., Sugnet,C.W., Thomas,D.J. et al. (2003) The UCSC Genome Browser Database. Nucleic Acids Res., 31, 51–54. - PMC - PubMed
    1. Birney E., Andrews,D., Bevan,P., Caccamo,M., Cameron,G., Chen,Y., Clarke,L., Coates,G., Cox,T., Cuff,J., Curwen,V., Cutts,T., Down,T., Durbin,R., Eyras,E., Fernandez-Suarez,X.M., Gane,P., Gibbins,B., Gilbert,J., Hammond,M., Hotz,H., Iyer,V., Kahari,A., Jekosch,K., Kasprzyk,A., Keefe,D., Keenan,S., Lehvaslaiho,H., McVicker,G., Melsopp,C., Meidl,P., Mongin,E., Pettett,R., Potter,S., Proctor,G., Rae,M., Searle,S., Slater,G., Smedley,D., Smith,J., Spooner,W., Stabenau,A., Stalker,J., Storey,R., Ureta-Vidal,A., Woodwark,C., Clamp,M. and Hubbard,T. (2004) Ensembl 2004. Nucleic Acids Res., 32 (Database issue), D468–D470. - PMC - PubMed
    1. Wheeler D.L., Church,D.M., Edgar,R., Federhen,S., Helmberg,W., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Sequeira,E., Suzek,T.O., Tatusova,T.A. and Wagner,L. (2004) Database resources of the National Center for Biotechnology Information: update. Nucleic Acids Res., 32 (Database issue), D35–D40. - PMC - PubMed
    1. Giardine B.M., Elnitski,L., Riemer,C., Makalowska,I., Schwartz,S., Miller,W. and Hardison,R.C. (2003) GALA, a database for genomic sequence alignments and annotations. Genome Res., 13, 732–741. - PMC - PubMed