Biological information: making it accessible and integrated (and trying to make sense of it)
- PMID: 12385995
- DOI: 10.1093/bioinformatics/18.suppl_2.s140
Biological information: making it accessible and integrated (and trying to make sense of it)
Abstract
The availability of the genome sequences of human and mouse, human sequence variation data and other large genetic data sets will lead to a revolution in understanding of the human machine and the treatment of its diseases. The success of the international genome sequencing consortiums shows what can be achieved by well coordinated large scale public domain projects and the benefits of data access to all. It is already clear that the availability of this sequence is having a huge impact on research worldwide. Complete genome sequences provide a framework to pull all biological data together such that each piece has the potential to say something about biology as a whole. Biology is too complex for any organisation to have a monopoly of ideas or data, so the collection, analysis and access to this data can be contributed to by research institutes around the world. However, although it is possible for all this data to be accessible to all through the internet, the more organisations provide data or analysis separately, the harder it becomes for anyone to collect and integrate the results. To address these problems of intergration of data, open standards for biological data exchange, such as the 'Distributed Annotation System' (DAS) are being developed and bioinformatics (Dowell et al., 2001) as a whole is now being strongly driven by the open source software (OSS) model for collaborative software development (Hubbard and Birney, 1999). The leading provider of human genome annotation, the Ensembl project (http://www.ensembl.org), is entirely an OSS project and has been widely adopted by academic and commerical organisations alike (Hubbard et al., 2002). Accurate automatic annotation of features such as genes in vertebrate genomes currently relies on supporting evidence in the form of homologies to mRNAs, ESTs or protein. However, it appears that sufficient high quality experimentally curated annotation now exists to be used as a substrate for machine learning algorithms to create effective models of biological signal sequences (Down and Hubbard, 2002). Is there hope for ab initio prediction methods after all?
Similar articles
-
The Ensembl genome database project.Nucleic Acids Res. 2002 Jan 1;30(1):38-41. doi: 10.1093/nar/30.1.38. Nucleic Acids Res. 2002. PMID: 11752248 Free PMC article.
-
Ensembl 2004.Nucleic Acids Res. 2004 Jan 1;32(Database issue):D468-70. doi: 10.1093/nar/gkh038. Nucleic Acids Res. 2004. PMID: 14681459 Free PMC article.
-
Integrating sequence and structural biology with DAS.BMC Bioinformatics. 2007 Sep 12;8:333. doi: 10.1186/1471-2105-8-333. BMC Bioinformatics. 2007. PMID: 17850653 Free PMC article.
-
Interoperability with Moby 1.0--it's better than sharing your toothbrush!Brief Bioinform. 2008 May;9(3):220-31. doi: 10.1093/bib/bbn003. Epub 2008 Jan 31. Brief Bioinform. 2008. PMID: 18238804 Review.
-
Extensible open source content management systems and frameworks: a solution for many needs of a bioinformatics group.Brief Bioinform. 2008 Jan;9(1):69-74. doi: 10.1093/bib/bbm057. Epub 2007 Dec 5. Brief Bioinform. 2008. PMID: 18057072 Review.
Cited by
-
Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser.BMC Bioinformatics. 2005 Apr 12;6:95. doi: 10.1186/1471-2105-6-95. BMC Bioinformatics. 2005. PMID: 15826305 Free PMC article.
-
Systematic recovery and analysis of full-ORF human cDNA clones.Genome Res. 2004 Oct;14(10B):2083-92. doi: 10.1101/gr.2473704. Genome Res. 2004. PMID: 15489330 Free PMC article.
-
Web-based physician order entry: an open source solution with broad physician involvement.AMIA Annu Symp Proc. 2003;2003:724-7. AMIA Annu Symp Proc. 2003. PMID: 14728268 Free PMC article.
-
SKINOMICS: Transcriptional Profiling in Dermatology and Skin Biology.Curr Genomics. 2012 Aug;13(5):363-8. doi: 10.2174/138920212801619241. Curr Genomics. 2012. PMID: 23372422 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources