Sharing genetic variants with the NGS pipeline is essential for effective genomic data sharing and reproducibility in health information exchange
- PMID: 33500538
- PMCID: PMC7838410
- DOI: 10.1038/s41598-021-82006-9
Sharing genetic variants with the NGS pipeline is essential for effective genomic data sharing and reproducibility in health information exchange
Abstract
Genetic variants causing underlying pharmacogenetic and disease phenotypes have been used as the basis for clinical decision-making. However, due to the lack of standards for next-generation sequencing (NGS) pipelines, reproducing genetic variants among institutions is still difficult. The aim of this study is to show how many important variants for clinical decisions can be individually detected using different pipelines. Genetic variants were derived from 105 breast cancer patient target DNA sequences via three different variant-calling pipelines. HaplotypeCaller, Mutect2 tumor-only mode in the Genome Analysis ToolKit (GATK), and VarScan were used in variant calling from the sequence read data processed by the same NGS preprocessing tools using Variant Effect Predictor. GATK HaplotypeCaller, VarScan, and MuTect2 found 25,130, 16,972, and 4232 variants, comprising 1491, 1400, and 321 annotated variants with ClinVar significance, respectively. The average number of ClinVar significant variants in the patients was 769.43, 16.50% of the variants were detected by only one variant caller. Despite variants with significant impact on clinical decision-making, the detected variants are different for each algorithm. To utilize genetic variants in the clinical field, a strict standard for NGS pipelines is essential.
Conflict of interest statement
The authors declare no competing interests.
Figures



Similar articles
-
Performance evaluation of pipelines for mapping, variant calling and interval padding, for the analysis of NGS germline panels.BMC Bioinformatics. 2021 Apr 28;22(1):218. doi: 10.1186/s12859-021-04144-1. BMC Bioinformatics. 2021. PMID: 33910496 Free PMC article.
-
Benchmarking variant callers in next-generation and third-generation sequencing analysis.Brief Bioinform. 2021 May 20;22(3):bbaa148. doi: 10.1093/bib/bbaa148. Brief Bioinform. 2021. PMID: 32698196
-
Validation and assessment of variant calling pipelines for next-generation sequencing.Hum Genomics. 2014 Jul 30;8(1):14. doi: 10.1186/1479-7364-8-14. Hum Genomics. 2014. PMID: 25078893 Free PMC article.
-
Comprehensive fundamental somatic variant calling and quality management strategies for human cancer genomes.Brief Bioinform. 2021 May 20;22(3):bbaa083. doi: 10.1093/bib/bbaa083. Brief Bioinform. 2021. PMID: 32510555 Review.
-
Gene and Variant Annotation for Mendelian Disorders in the Era of Advanced Sequencing Technologies.Annu Rev Genomics Hum Genet. 2017 Aug 31;18:229-256. doi: 10.1146/annurev-genom-083115-022545. Epub 2017 Apr 17. Annu Rev Genomics Hum Genet. 2017. PMID: 28415856 Review.
Cited by
-
The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species.Sci Rep. 2022 Jul 5;12(1):11331. doi: 10.1038/s41598-022-15563-2. Sci Rep. 2022. PMID: 35790846 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources