Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Jan 1;32(Database issue):D59-63.
doi: 10.1093/nar/gkh051.

Xpro: database of eukaryotic protein-encoding genes

Affiliations

Xpro: database of eukaryotic protein-encoding genes

Vivek Gopalan et al. Nucleic Acids Res. .

Abstract

Xpro is a relational database that contains all the eukaryotic protein-encoding DNA sequences contained in GenBank with associated data required for the analysis of eukaryotic gene architecture. In addition to the information found in the GenBank records, which includes properties such as sequence, position, length and description about introns, exons and protein-coding regions, Xpro provides annotations on the splice sites and intron phases. Furthermore, Xpro validates intron positions using alignment information between the record's sequence and EST sequences found in dbEST. In the process of validation, alternative splicing information is also obtained and can be found in the database. The intron-containing genes in the Xpro are also classified as experimental or predicted based on the intron position validation and specific keywords in the GenBank records that are present in predicted genes. An Entrez-like query system, which is familiar to most biologists, is provided for accessing the information present in the database system. A non-redundant set of Xpro database contents is also obtained by cross-referencing to the Swiss-Prot/TrEMBL and Pfam databases. The database currently contains information for 493,983 genes--351,918 intron- containing genes and 142,065 intron-less genes. Xpro is updated for each new GenBank release and is freely available via the internet at http://origin.bic. nus.edu.sg/xpro.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Xpro sample results page for the GenBank protein sequence accession number CAAD2447.1.
Figure 2
Figure 2
Graphical representation of various alternative splicing variants based on the BLAT alignment of the protein-coding region of mRNA sequence derived from the Xpro database with the EST sequences in the dbEST database. Exon sequences in the alignment are represented in different colors so that the type of alternative splicing can be identified.

References

    1. Gilbert W. and Glynias,M. (1993) On the ancient nature of introns. Gene, 135, 137–144. - PubMed
    1. Gilbert W. (1987) The exon theory of genes. Cold Spring Harbor Symp. Quant. Biol., 52, 901–905. - PubMed
    1. Kriventseva E.V. and Gelfand,M.S. (1999) Statistical analysis of the exon–intron structure of higher and lower eukaryote genes. J. Biomol. Struct. Dyn., 17, 281–288. - PubMed
    1. Roy S.W., Fedorov,A. and Gilbert,W. (2002) The signal of ancient introns is obscured by intron density and homolog number. Proc. Natl Acad. Sci. USA, 99, 15513–15517. - PMC - PubMed
    1. Roy S.W., Fedorov,A. and Gilbert,W. (2003) Large-scale comparison of intron positions in mammalian genes shows intron loss but no gain. Proc. Natl Acad. Sci. USA, 100, 7158–7162. - PMC - PubMed

Publication types