ElasticBLAST: accelerating sequence search via cloud computing
- PMID: 36967390
- PMCID: PMC10040096
- DOI: 10.1186/s12859-023-05245-9
ElasticBLAST: accelerating sequence search via cloud computing
Abstract
Background: Biomedical researchers use alignments produced by BLAST (Basic Local Alignment Search Tool) to categorize their query sequences. Producing such alignments is an essential bioinformatics task that is well suited for the cloud. The cloud can perform many calculations quickly as well as store and access large volumes of data. Bioinformaticians can also use it to collaborate with other researchers, sharing their results, datasets and even their pipelines on a common platform.
Results: We present ElasticBLAST, a cloud native application to perform BLAST alignments in the cloud. ElasticBLAST can handle anywhere from a few to many thousands of queries and run the searches on thousands of virtual CPUs (if desired), deleting resources when it is done. It uses cloud native tools for orchestration and can request discounted instances, lowering cloud costs for users. It is supported on Amazon Web Services and Google Cloud Platform. It can search BLAST databases that are user provided or from the National Center for Biotechnology Information.
Conclusion: We show that ElasticBLAST is a useful application that can efficiently perform BLAST searches for the user in the cloud, demonstrating that with two examples. At the same time, it hides much of the complexity of working in the cloud, lowering the threshold to move work to the cloud.
Keywords: AWS Batch; Alignment; BLAST; Cloud computing; Kubernetes.
© 2023. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.
Conflict of interest statement
The authors declare they have no competing interests.
Figures
Update of
-
ElasticBLAST: Accelerating Sequence Search via Cloud Computing.bioRxiv [Preprint]. 2023 Jan 4:2023.01.04.522777. doi: 10.1101/2023.01.04.522777. bioRxiv. 2023. Update in: BMC Bioinformatics. 2023 Mar 26;24(1):117. doi: 10.1186/s12859-023-05245-9. PMID: 36789435 Free PMC article. Updated. Preprint.
References
-
- BLAST Databases [https://github.com/ncbi/blast_plus_docs/#blast-databases] Accessed 16 November 2022.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
