A web services choreography scenario for interoperating bioinformatics applications
- PMID: 15113410
- PMCID: PMC394315
- DOI: 10.1186/1471-2105-5-25
A web services choreography scenario for interoperating bioinformatics applications
Abstract
Background: Very often genome-wide data analysis requires the interoperation of multiple databases and analytic tools. A large number of genome databases and bioinformatics applications are available through the web, but it is difficult to automate interoperation because: 1) the platforms on which the applications run are heterogeneous, 2) their web interface is not machine-friendly, 3) they use a non-standard format for data input and output, 4) they do not exploit standards to define application interface and message exchange, and 5) existing protocols for remote messaging are often not firewall-friendly. To overcome these issues, web services have emerged as a standard XML-based model for message exchange between heterogeneous applications. Web services engines have been developed to manage the configuration and execution of a web services workflow.
Results: To demonstrate the benefit of using web services over traditional web interfaces, we compare the two implementations of HAPI, a gene expression analysis utility developed by the University of California San Diego (UCSD) that allows visual characterization of groups or clusters of genes based on the biomedical literature. This utility takes a set of microarray spot IDs as input and outputs a hierarchy of MeSH Keywords that correlates to the input and is grouped by Medical Subject Heading (MeSH) category. While the HTML output is easy for humans to visualize, it is difficult for computer applications to interpret semantically. To facilitate the capability of machine processing, we have created a workflow of three web services that replicates the HAPI functionality. These web services use document-style messages, which means that messages are encoded in an XML-based format. We compared three approaches to the implementation of an XML-based workflow: a hard coded Java application, Collaxa BPEL Server and Taverna Workbench. The Java program functions as a web services engine and interoperates with these web services using a web services choreography language (BPEL4WS).
Conclusion: While it is relatively straightforward to implement and publish web services, the use of web services choreography engines is still in its infancy. However, industry-wide support and push for web services standards is quickly increasing the chance of success in using web services to unify heterogeneous bioinformatics applications. Due to the immaturity of currently available web services engines, it is still most practical to implement a simple, ad-hoc XML-based workflow by hard coding the workflow as a Java application. For advanced web service users the Collaxa BPEL engine facilitates a configuration and management environment that can fully handle XML-based workflow.
Figures





Similar articles
-
Biowep: a workflow enactment portal for bioinformatics applications.BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S19. doi: 10.1186/1471-2105-8-S1-S19. BMC Bioinformatics. 2007. PMID: 17430563 Free PMC article.
-
Seahawk: moving beyond HTML in Web-based bioinformatics analysis.BMC Bioinformatics. 2007 Jun 18;8:208. doi: 10.1186/1471-2105-8-208. BMC Bioinformatics. 2007. PMID: 17577405 Free PMC article.
-
AnaBench: a Web/CORBA-based workbench for biomolecular sequence analysis.BMC Bioinformatics. 2003 Dec 16;4:63. doi: 10.1186/1471-2105-4-63. BMC Bioinformatics. 2003. PMID: 14678565 Free PMC article.
-
Web tools for predictive toxicology model building.Expert Opin Drug Metab Toxicol. 2012 Jul;8(7):791-801. doi: 10.1517/17425255.2012.685158. Epub 2012 May 12. Expert Opin Drug Metab Toxicol. 2012. PMID: 22577953 Review.
-
A survey and evaluation of Web-based tools/databases for variant analysis of TCGA data.Brief Bioinform. 2019 Jul 19;20(4):1524-1541. doi: 10.1093/bib/bby023. Brief Bioinform. 2019. PMID: 29617727 Free PMC article. Review.
Cited by
-
MOWServ: a web client for integration of bioinformatic resources.Nucleic Acids Res. 2010 Jul;38(Web Server issue):W671-6. doi: 10.1093/nar/gkq497. Epub 2010 Jun 4. Nucleic Acids Res. 2010. PMID: 20525794 Free PMC article.
References
-
- van Someren EP, Wessels LF, Backer E, Reinders MJ. Genetic network modeling. Pharmacogenomics. 2002;3:507–25. - PubMed
-
- Wilkinson M, Links M. BioMoby: An open source biological web services proposal. Brief Bioinform. 2002;3:331–341. - PubMed
-
- myGrid BioServices http://www.mygrid.org.uk/myGrid/web/components/BioServices/
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources