Xpro: database of eukaryotic protein-encoding genes
- PMID: 14681359
- PMCID: PMC308785
- DOI: 10.1093/nar/gkh051
Xpro: database of eukaryotic protein-encoding genes
Abstract
Xpro is a relational database that contains all the eukaryotic protein-encoding DNA sequences contained in GenBank with associated data required for the analysis of eukaryotic gene architecture. In addition to the information found in the GenBank records, which includes properties such as sequence, position, length and description about introns, exons and protein-coding regions, Xpro provides annotations on the splice sites and intron phases. Furthermore, Xpro validates intron positions using alignment information between the record's sequence and EST sequences found in dbEST. In the process of validation, alternative splicing information is also obtained and can be found in the database. The intron-containing genes in the Xpro are also classified as experimental or predicted based on the intron position validation and specific keywords in the GenBank records that are present in predicted genes. An Entrez-like query system, which is familiar to most biologists, is provided for accessing the information present in the database system. A non-redundant set of Xpro database contents is also obtained by cross-referencing to the Swiss-Prot/TrEMBL and Pfam databases. The database currently contains information for 493,983 genes--351,918 intron- containing genes and 142,065 intron-less genes. Xpro is updated for each new GenBank release and is freely available via the internet at http://origin.bic. nus.edu.sg/xpro.
Figures


References
-
- Gilbert W. and Glynias,M. (1993) On the ancient nature of introns. Gene, 135, 137–144. - PubMed
-
- Gilbert W. (1987) The exon theory of genes. Cold Spring Harbor Symp. Quant. Biol., 52, 901–905. - PubMed
-
- Kriventseva E.V. and Gelfand,M.S. (1999) Statistical analysis of the exon–intron structure of higher and lower eukaryote genes. J. Biomol. Struct. Dyn., 17, 281–288. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials