High-frequency, low-coverage "false positives" mutations may be true in GS Junior sequencing studies
- PMID: 29062110
- PMCID: PMC5653793
- DOI: 10.1038/s41598-017-13116-6
High-frequency, low-coverage "false positives" mutations may be true in GS Junior sequencing studies
Abstract
The GS Junior sequencer provides simplified procedures for library preparation and data processing. Errors in pyrosequencing generate some biases during library construction and emulsion PCR amplification. False-positive mutations are identified by related characteristics described in the manufacturer's manual, and some detected mutations may have 'borderline' characteristics when they are detected in few reads or at low frequency. Among these mutations, however, some may be true positives. This study aimed to improve the accuracy of identifying true positives among mutations with borderline false-positive characteristics detected with GS Junior sequencing. Mutations with the borderline features were tested for validity with Sanger sequencing. We examined 10 mutations detected in coverages <20-fold at frequencies >30% (group A) and 16 mutations detected in coverages >20-fold at frequencies < 30% (group B). In group A, two mutations were not confirmed, and two mutations with 100% frequency were confirmed as heterozygous alleles. No mutation in group B was confirmed. The two groups had significantly different false-positive prevalences (p = 0.001). These results suggest that mutations detected at frequencies less than 30% can be confidently identified as false-positives but that mutations detected at frequencies over 30%, despite coverages less than 20-fold, should be verified with Sanger sequencing.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures



Similar articles
-
Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions.PLoS One. 2017 Jul 24;12(7):e0181304. doi: 10.1371/journal.pone.0181304. eCollection 2017. PLoS One. 2017. PMID: 28742110 Free PMC article.
-
Clinical Next-Generation Sequencing Pipeline Outperforms a Combined Approach Using Sanger Sequencing and Multiplex Ligation-Dependent Probe Amplification in Targeted Gene Panel Analysis.J Mol Diagn. 2016 Sep;18(5):657-667. doi: 10.1016/j.jmoldx.2016.04.002. Epub 2016 Jul 2. J Mol Diagn. 2016. PMID: 27376475
-
A Workflow to Improve Variant Calling Accuracy in Molecular Barcoded Sequencing Reads.J Comput Biol. 2019 Jan;26(1):96-103. doi: 10.1089/cmb.2018.0110. Epub 2018 Aug 17. J Comput Biol. 2019. PMID: 30117742
-
High-specificity detection of rare alleles with Paired-End Low Error Sequencing (PELE-Seq).BMC Genomics. 2016 Jun 14;17:464. doi: 10.1186/s12864-016-2669-3. BMC Genomics. 2016. PMID: 27301885 Free PMC article.
-
Fundamentals of pyrosequencing.Arch Pathol Lab Med. 2013 Sep;137(9):1296-303. doi: 10.5858/arpa.2012-0463-RA. Arch Pathol Lab Med. 2013. PMID: 23991743 Review.
Cited by
-
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease.J Vis Exp. 2018 Apr 4;(134):57266. doi: 10.3791/57266. J Vis Exp. 2018. PMID: 29683450 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical