Mining the human genome using microarrays of open reading frames
- PMID: 11062470
- DOI: 10.1038/81613
Abstract
To test the hypothesis that the human genome project will uncover many genes not previously discovered by sequencing of expressed sequence tags (ESTs), we designed and produced a set of microarrays using probes based on open reading frames (ORFs) in 350 Mb of finished and draft human sequence. Our approach aims to identify all genes directly from genomic sequence by querying gene expression. We analysed genomic sequence with a suite of ORF prediction programs, selected approximately one ORF per gene, amplified the ORFs from genomic DNA and arrayed the amplicons onto treated glass slides. Of the first 10,000 arrayed ORFs, 31% are completely novel and 29% are similar, but not identical, to sequences in public databases. Approximately one-half of these are expressed in the tissues we queried by microarray. Subsequent verification by other techniques confirmed expression of several of the novel genes. Expressed sequence tags (ESTs) have yielded vast amounts of data, but our results indicate that many genes in the human genome will only be found by genomic sequencing.
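As a rough illustration of what scanning genomic sequence for ORFs involves, the sketch below looks for ATG-to-stop stretches in all six reading frames. It is a deliberately naive assumption-laden example: the abstract does not name the ORF prediction programs used, and real gene prediction software models exons, splice sites and coding statistics rather than simple start and stop codons. The function names `find_orfs` and `reverse_complement` and the `min_codons` parameter are hypothetical.

```python
from typing import List, Tuple

# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 3) -> List[Tuple[str, int, str]]:
    """Naive six-frame scan (illustrative only, not the authors' method).

    Returns (strand, frame, orf_sequence) for every stretch that begins
    with ATG and ends at the first in-frame stop codon, keeping only
    those with at least `min_codons` codons (stop codon included).
    Frame offsets on the "-" strand are relative to the reverse complement.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # index of the currently open ATG, if any
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, frame, s[start:i + 3]))
                    start = None
    return orfs
```

In a workflow like the one described, each candidate returned by such a scan would still need filtering (for example, choosing roughly one ORF per gene) before primers are designed to amplify it from genomic DNA.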