C. elegans ORFeome version 3.1: increasing the coverage of ORFeome resources with improved gene predictions
- PMID: 15489327
- PMCID: PMC528921
- DOI: 10.1101/gr.2496804
C. elegans ORFeome version 3.1: increasing the coverage of ORFeome resources with improved gene predictions
Abstract
The first version of the Caenorhabditis elegans ORFeome cloning project, based on release WS9 of Wormbase (August 1999), provided experimental verifications for approximately 55% of predicted protein-encoding open reading frames (ORFs). The remaining 45% of predicted ORFs could not be cloned, possibly as a result of mispredicted gene boundaries. Since the release of WS9, gene predictions have improved continuously. To test the accuracy of evolving predictions, we attempted to PCR-amplify from a highly representative worm cDNA library and Gateway-clone approximately 4200 ORFs missed earlier and for which new predictions are available in WS100 (May 2003). In this set we successfully cloned 63% of ORFs with supporting experimental data ("touched" ORFs), and 42% of ORFs with no supporting experimental evidence ("untouched" ORFs). Approximately 2000 full-length ORFs were cloned in-frame, 13% of which were corrected in their exon/intron structure relative to WS100 predictions. In total, approximately 12,500 C. elegans ORFs are now available as Gateway Entry clones for various reverse proteomics (ORFeome v3.1). This work illustrates why the cloning of a complete C. elegans ORFeome, and likely the ORFeomes of other multicellular organisms, needs to be an iterative process that requires multiple rounds of experimental validation together with gradually improving gene predictions.
Figures





Similar articles
-
C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression.Nat Genet. 2003 May;34(1):35-41. doi: 10.1038/ng1140. Nat Genet. 2003. PMID: 12679813
-
Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions.Genome Res. 2005 Apr;15(4):577-82. doi: 10.1101/gr.3329005. Genome Res. 2005. PMID: 15805498 Free PMC article.
-
WormBase as an integrated platform for the C. elegans ORFeome.Genome Res. 2004 Oct;14(10B):2155-61. doi: 10.1101/gr.2521304. Genome Res. 2004. PMID: 15489338 Free PMC article.
-
ORFeome projects: gateway between genomics and omics.Curr Opin Chem Biol. 2004 Feb;8(1):20-5. doi: 10.1016/j.cbpa.2003.12.002. Curr Opin Chem Biol. 2004. PMID: 15036152 Review.
-
ORFeome cloning and systems biology: standardized mass production of the parts from the parts-list.Genome Res. 2004 Oct;14(10B):2001-9. doi: 10.1101/gr.2769804. Genome Res. 2004. PMID: 15489318 Review.
Cited by
-
SPOP loss of function protects against tauopathy.Proc Natl Acad Sci U S A. 2023 Jan 3;120(1):e2207250120. doi: 10.1073/pnas.2207250120. Epub 2022 Dec 27. Proc Natl Acad Sci U S A. 2023. PMID: 36574656 Free PMC article.
-
Mapping the Protein-Protein Interactome Networks Using Yeast Two-Hybrid Screens.Adv Exp Med Biol. 2015;883:187-214. doi: 10.1007/978-3-319-23603-2_11. Adv Exp Med Biol. 2015. PMID: 26621469 Free PMC article. Review.
-
Computer-Assisted Transgenesis of Caenorhabditis elegans for Deep Phenotyping.Genetics. 2015 Sep;201(1):39-46. doi: 10.1534/genetics.115.179648. Epub 2015 Jul 10. Genetics. 2015. PMID: 26163188 Free PMC article.
-
Global identification of protein kinase substrates by protein microarray analysis.Nat Protoc. 2009;4(12):1820-7. doi: 10.1038/nprot.2009.194. Nat Protoc. 2009. PMID: 20010933 Free PMC article.
-
Proximity labeling identifies LOTUS domain proteins that promote the formation of perinuclear germ granules in C. elegans.Elife. 2021 Nov 3;10:e72276. doi: 10.7554/eLife.72276. Elife. 2021. PMID: 34730513 Free PMC article.
References
-
- Blumenthal, T., Evans, D., Link, C.D., Guffanti, A., Lawson, D., Thierry-Mieg, J., Thierry-Mieg, D., Chiu, W.L., Duke, K., Kiraly, M., et al. 2002. A global analysis of Caenorhabditis elegans operons. Nature 417: 797-798. - PubMed
-
- Burset, M. and Guigo, R. 1996. Evaluation of gene structure prediction programs. Genomics 34: 353-367. - PubMed
-
- The C. elegans Sequencing Consortium. 1998. Genome sequence of the nematode C. elegans: A platform for investigating biology. Science 282: 2012-2018. - PubMed
-
- Cliften, P.F., Hillier, L.W., Fulton, L., Graves, T., Miner, T., Gish, W.R., Waterston, R.H., and Johnston, M. 2001. Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res. 11: 1143-1144. - PubMed
-
- Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston, R., Cohen, B.A., and Johnston, M. 2003. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 301: 71-76. - PubMed
WEB SITE REFERENCES
-
- http://elegans.swmed.edu/Announcements/genome_complete.html; The Caenorhabditis elegans WWW server.
-
- http://ftp.genome.washington.edu/cgi-bin/genefinder_req.pl; GeneFinder Web Server.
-
- http://worfdb.dfci.harvard.edu; WorfDB, the central repository of the C. elegans ORFeome.
-
- http://ws100.Wormbase.org; frozen release WS100 of Wormbase.
-
- http://www.ddbj.nig.ac.jp/; DNA Data Bank of Japan.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases