Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Mar;14(1):1-14.
doi: 10.1007/s12539-021-00473-0. Epub 2021 Sep 6.

A Review of Parallel Implementations for the Smith-Waterman Algorithm

Affiliations
Review

A Review of Parallel Implementations for the Smith-Waterman Algorithm

Zeyu Xia et al. Interdiscip Sci. 2022 Mar.

Abstract

The rapid advances in sequencing technology have led to an explosion of sequence data. Sequence alignment is the central and fundamental problem in many sequence analysis procedure, while local alignment is often the kernel of these algorithms. Usually, Smith-Waterman algorithm is used to find the best subsequence match between given sequences. However, the high time complexity makes the algorithm time-consuming. A lot of approaches have been developed to accelerate and parallelize it, such as vector-level parallelization, thread-level parallelization, process-level parallelization, and heterogeneous acceleration, but the current researches seem unsystematic, which hinders the further research of parallelizing the algorithm. In this paper, we summarize the current research status of parallel local alignments and describe the data layout in these work. Based on the research status, we emphasize large-scale genomic comparisons. By surveying some typical alignment tools' performance, we discuss some possible directions in the future. We hope our work will provide the developers of the alignment tool with technical principle support, and help researchers choose proper alignment tools.

Keywords: Inter-sequence alignment; Intra-sequence alignment; Smith–Waterman algorithm; Vector-level parallelization.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
Comparison between scalar operation and vector operation
Fig. 2
Fig. 2
Distributed and shared memory system model
Fig. 3
Fig. 3
Data dependencies in the alignment matrix
Fig. 4
Fig. 4
Three intra-sequence alignment approaches
Fig. 5
Fig. 5
Sequential layout and striped layout
Fig. 6
Fig. 6
Data dependencies of matrix H and E in striped layout
Fig. 7
Fig. 7
Data dependencies of the first and last H vectors between the adjacent columns
Fig. 8
Fig. 8
Data dependencies of the F vectors on each column
Fig. 9
Fig. 9
Inter-sequence alignment
Fig. 10
Fig. 10
Blocks of target sequence computed simultaneously

References

    1. Khan MI, Kamal MS, Chowdhury L. Msupda: a memory efficient algorithm for sequence alignment. Interdiscip Sci Comput Life Sci. 2016;8(1):84–94. doi: 10.1007/s12539-015-0275-8. - DOI - PubMed
    1. Kirkness EF, Bafna V, Halpern AL, Levy S, Remington K, Rusch DB, Delcher AL, Pop M, Wang W, Fraser CM, et al. The dog genome: survey sequencing and comparative analysis. Science. 2003;301(5641):1898–1903. doi: 10.1126/science.1086432. - DOI - PubMed
    1. Issa M, Elaziz MA. Analyzing COVID-19 virus based on enhanced fragmented biological local aligner using improved ions motion optimization algorithm. Appl Soft Comput. 2020;96:106683. doi: 10.1016/j.asoc.2020.106683. - DOI - PMC - PubMed
    1. Liu Y, Schmidt B. Gswabe: faster gpu-accelerated sequence alignment with optimal alignment retrieval for short dna sequences. Concurr Comput Pract Exp. 2015;27(4):958–972. doi: 10.1002/cpe.3371. - DOI
    1. Needleman SB, Wunsch CD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970;48(3):443–453. doi: 10.1016/b978-0-12-131200-8.50031-9. - DOI - PubMed

LinkOut - more resources