Genome analysis with gene-indexing databases
- PMID: 11728605
- DOI: 10.1016/s0163-7258(01)00151-6
Genome analysis with gene-indexing databases
Abstract
The recent release of the draft sequence and the eventual completion of the human genome present the scientific community with a rich source of data to mine. Yet, these data are content poor in the absence of additional correlative information. Expressed sequence tag (EST) datasets and their associated gene indices have existed for many years, and represent the first attempt at understanding the complexity of the genome. These datasets remain extremely important as information sources and, in particular, as tools for analyzing the completed genomes. Here, we discuss the nature of ESTs and their associated tools and gene-indexing databases. In particular, we will compare three EST gene indices (UNIGENE, Merck Gene Index Version 2.0 and Doubletwist CAT), discuss how these gene indices are applied for both genome analysis and drug discovery, and demonstrate their importance as a complementary dataset to the annotated human genome.
Similar articles
-
A comprehensive approach to clustering of expressed human gene sequence: the sequence tag alignment and consensus knowledge base.Genome Res. 1999 Nov;9(11):1143-55. doi: 10.1101/gr.9.11.1143. Genome Res. 1999. PMID: 10568754 Free PMC article.
-
High-throughput identification, database storage and analysis of SNPs in EST sequences.Genome Inform. 2001;12:194-203. Genome Inform. 2001. PMID: 11791238
-
The TIGR gene indices: reconstruction and representation of expressed gene sequences.Nucleic Acids Res. 2000 Jan 1;28(1):141-5. doi: 10.1093/nar/28.1.141. Nucleic Acids Res. 2000. PMID: 10592205 Free PMC article.
-
Rapid in silico cloning of genes using expressed sequence tags (ESTs).Biotechnol Annu Rev. 2000;5:25-44. doi: 10.1016/s1387-2656(00)05031-6. Biotechnol Annu Rev. 2000. PMID: 10874996 Review.
-
A practical guide to orient yourself in the labyrinth of genome databases.Hum Mol Genet. 1998;7(10):1641-8. doi: 10.1093/hmg/7.10.1641. Hum Mol Genet. 1998. PMID: 9735386 Review.
Cited by
-
PipeOnline 2.0: automated EST processing and functional data sorting.Nucleic Acids Res. 2002 Nov 1;30(21):4761-9. doi: 10.1093/nar/gkf585. Nucleic Acids Res. 2002. PMID: 12409467 Free PMC article.
-
Bioinformatic approaches to augment study of epithelial-to-mesenchymal transition in lung cancer.Physiol Genomics. 2014 Oct 1;46(19):699-724. doi: 10.1152/physiolgenomics.00062.2014. Epub 2014 Aug 5. Physiol Genomics. 2014. PMID: 25096367 Free PMC article. Review.
-
ESAP plus: a web-based server for EST-SSR marker development.BMC Genomics. 2016 Dec 22;17(Suppl 13):1035. doi: 10.1186/s12864-016-3328-4. BMC Genomics. 2016. PMID: 28155670 Free PMC article.
-
The hepatic transcriptome in human liver disease.Comp Hepatol. 2006 Nov 7;5:6. doi: 10.1186/1476-5926-5-6. Comp Hepatol. 2006. PMID: 17090326 Free PMC article.
-
Integrative analysis of intraerythrocytic differentially expressed transcripts yields novel insights into the biology of Plasmodium falciparum.Malar J. 2003 Nov 14;2(1):38. doi: 10.1186/1475-2875-2-38. Malar J. 2003. PMID: 14617379 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous