Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Aug 3;18(8):e0288371.
doi: 10.1371/journal.pone.0288371. eCollection 2023.

Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools

Affiliations

Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools

Nofe Alganmi et al. PLoS One. .

Abstract

The next-generation sequencing (NGS) technology represents a significant advance in genomics and medical diagnosis. Nevertheless, the time it takes to perform sequencing, data analysis, and variant interpretation is a bottleneck in using next-generation sequencing in precision medicine. For accurate and efficient performance in clinical diagnostic lab practice, a consistent data analysis pipeline is necessary to avoid false variant calls and achieve optimum accuracy. This study aims to compare the performance of two NGS data analysis pipeline compartments, including short-read mapping (BWA-MEM and BWA-MEM2) and variant calling (GATK-HaplotypeCaller and DRAGEN-GATK). On Whole Exome Sequencing (WES) data, computational performance was assessed using several criteria, including mapping efficiency, variant calling performance, false positive calls rate, and time. We examined four gold-standard WES data sets: Ashkenazim father (NA24149), Ashkenazim mother (NA24143), Ashkenazim son (NA24385), and Asian son (NA25631). In addition, eighteen exome samples were analyzed based on different read counts, and coverage was used precisely in the run-time assessment. By using BWA-MEM 2 and Dragen-GATK, this study achieved faster and more accurate detection for SNVs and indels than the standard GATK Best Practices workflow. This systematic comparison will enable the bioinformatics community to develop a more efficient and faster solution for analyzing NGS data.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Bioinformatics pipelines.
Fig 2
Fig 2. Performance comparison: Indels false positive calls.
Fig 3
Fig 3. Venn diagrams showing the intersection of variants called among replicate runs using BWA-MEM2+Dragen-GATK pipeline on NA24149, NA24143, NA24385,and NA24631 samples.
Fig 4
Fig 4. Run time performance of the two aligners.
Fig 5A
Fig 5A
A. Run time performance of the two aligners. B. Mapping efficiency metrics (total targeted reads, total targeted bases, and mean target coverage) between BWA-mem and BWA-mem2 aligners.
Fig 6
Fig 6. Run time performance of the two variant callers.

References

    1. Musich R, Cadle-Davidson L, Osier MV. Comparison of Short-Read Sequence Aligners Indicates Strengths and Weaknesses for Biologists to Consider. Frontiers in Plant Science. 2021;12. doi: 10.3389/fpls.2021.657240 - DOI - PMC - PubMed
    1. Harrath Y, Mahjoub A, AbuBakr F, Azhar M. Comparative Evaluation of Short Read Alignment Tools for next Generation DNA Sequencing. In: 2019 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT). IEEE; 2019.
    1. Lee H, Lee KW, Lee T, Park D, Chung J, Lee C, et al. Performance evaluation method for read mapping tool in clinical panel sequencing. Genes & Genomics. 2017;40(2):189–197. doi: 10.1007/s13258-017-0621-9 - DOI - PMC - PubMed
    1. Hatem A, Bozdağ D, Toland AE, Çatalyürek Ümit V. Benchmarking short sequence mapping tools. BMC Bioinformatics. 2013;14(1). doi: 10.1186/1471-2105-14-184 - DOI - PMC - PubMed
    1. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–1760. doi: 10.1093/bioinformatics/btp324 - DOI - PMC - PubMed