Tavaxy: integrating Taverna and Galaxy workflows with cloud computing support
- PMID: 22559942
- PMCID: PMC3583125
- DOI: 10.1186/1471-2105-13-77
Abstract
Background: Over the past decade the workflow system paradigm has evolved as an efficient and user-friendly approach for developing complex bioinformatics applications. Two popular workflow systems that have gained acceptance by the bioinformatics community are Taverna and Galaxy. Each system has a large user-base and supports an ever-growing repository of application workflows. However, workflows developed for one system cannot be imported and executed easily on the other. The lack of interoperability is due to differences in the models of computation, workflow languages, and architectures of both systems. This lack of interoperability limits sharing of workflows between the user communities and leads to duplication of development efforts.
Results: In this paper, we present Tavaxy, a stand-alone system for creating and executing workflows based on an extensible set of re-usable workflow patterns. Tavaxy offers a set of new features that simplify and enhance the development of sequence analysis applications: it allows the integration of existing Taverna and Galaxy workflows in a single environment, and it supports the use of cloud computing capabilities. The integration of existing Taverna and Galaxy workflows is supported seamlessly at both run-time and design-time levels, based on the concepts of hierarchical workflows and workflow patterns. The use of cloud computing in Tavaxy is flexible: users can either instantiate the whole system on the cloud, or delegate the execution of certain sub-workflows to the cloud infrastructure.
Conclusions: Tavaxy shortens the workflow development cycle by introducing workflow patterns that simplify workflow creation. It enables the re-use and integration of existing (sub-)workflows from Taverna and Galaxy, and allows the creation of hybrid workflows. Its additional features exploit recent advances in high-performance cloud computing to cope with the increasing data size and complexity of analysis. The system can be accessed either through a cloud-enabled web interface or downloaded and installed to run within the user's local environment. All resources related to Tavaxy are available at http://www.tavaxy.org.
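The abstract's ideas of hierarchical workflows, hybrid composition, and delegating a sub-workflow to the cloud can be illustrated with a small sketch. This is not Tavaxy's implementation or API: the class names `Task` and `SubWorkflow`, the `origin` label, and the `delegate_to_cloud` flag are all hypothetical, the only pattern modeled is a plain sequence, and "cloud" execution is simulated by recording a location tag rather than contacting any real infrastructure.

```python
from dataclasses import dataclass
from typing import Callable, List, Union


@dataclass
class Task:
    """A leaf node: one tool invocation (hypothetical stand-in)."""
    name: str
    func: Callable[[str], str]

    def run(self, data: str, trace: List[str], location: str = "local") -> str:
        # Record where this task would execute, then apply it.
        trace.append(f"{location}:{self.name}")
        return self.func(data)


@dataclass
class SubWorkflow:
    """A composite node: a sequence of tasks and/or nested sub-workflows."""
    name: str
    origin: str  # illustrative label only, e.g. "taverna", "galaxy", "native"
    steps: List[Union[Task, "SubWorkflow"]]
    delegate_to_cloud: bool = False  # assumption: delegation is a per-node flag

    def run(self, data: str, trace: List[str], location: str = "local") -> str:
        # A delegated sub-workflow runs all of its children "on the cloud";
        # otherwise children inherit the location of the enclosing workflow.
        if self.delegate_to_cloud:
            location = "cloud"
        for step in self.steps:
            data = step.run(data, trace, location)
        return data


# Two sub-workflows standing in for imported Galaxy and Taverna workflows.
galaxy_part = SubWorkflow(
    "galaxy_part", "galaxy",
    [Task("clean", str.strip), Task("upper", str.upper)],
)
taverna_part = SubWorkflow(
    "taverna_part", "taverna",
    [Task("reverse", lambda s: s[::-1])],
    delegate_to_cloud=True,
)

# A hybrid workflow nests both parts in one hierarchy.
hybrid = SubWorkflow("hybrid", "native", [galaxy_part, taverna_part])

trace: List[str] = []
result = hybrid.run("  acgt ", trace)
print(result)
print(trace)
```

Because a sub-workflow exposes the same `run` interface as a single task, the enclosing workflow does not need to know which system a nested part came from or where it executes; that uniformity is the point the sketch is meant to convey.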