Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Dec 14;18(1):554.
doi: 10.1186/s12859-017-1985-1.

pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data

Affiliations

pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data

Matthias Beyens et al. BMC Bioinformatics. .

Abstract

Background: Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon enrichment related systematic errors, in the form of an all-in-one package, exists to our knowledge.

Results: We present pyAmpli, a platform independent parallelized Python package that implements an amplicon-based germline and somatic variant filtering strategy for Haloplex data. pyAmpli can filter variants for systematic errors by user pre-defined criteria. We show that pyAmpli significantly increases specificity, without reducing sensitivity, essential for reporting true positive clinical relevant mutations in gene panel data.

Conclusions: pyAmpli is an easy-to-use software tool which increases the true positive variant call rate in targeted resequencing data. It specifically reduces errors related to PCR-based enrichment of targeted regions.

Keywords: Germline; Next-generation sequencing; Somatic; Targeted resequencing; Variant filtering.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Sashimi target enrichment plots. mTOR Exon 54 coverage for two different target enrichment methods is represented by a Sashimi plot: 1) hybridization capture-based technique, showing more bell-shaped enrichment of random fragments (red histogram) and 2) amplicon- or multiplex PCR-based approach, showing vertical enrichment blocks of identical fragments (blue histogram)
Fig. 2
Fig. 2
Vertical read blocks and variant calling bias visualization. Typical vertically enriched read blocks are illustrated in (2a and 2b). Aspecific fragments, not corresponding to predicted amplicons are shown in (2a) and variants restricted to read ends are shown in (2b). Called variants are indicated by a red dashed rectangular. Reads are given in blue and pink colored horizontal bars, indicating read orientation. Theoretical manufacturer designed Haloplex probes are presented in green colored horizontal bars below their corresponding enriched reads

References

    1. Sommen M, Schrauwen I, Vandeweyer G, Boeckx N, Corneveaux JJ, van den Ende J, Boudewyns A, De Leenheer E, Janssens S, Claes K, Verstreken M, Strenzke N, Predöhl F, Wuyts W, Mortier G, Bitner-Glindzicz M, Moser T, Coucke P, Huentelman MJ, Van Camp G. DNA diagnostics of hereditary hearing loss: a targeted resequencing approach combined with a mutation classification system. Hum Mutat. 2016;37:812–819. doi: 10.1002/humu.22999. - DOI - PubMed
    1. Stamatopoulos B, Timbs A, Bruce D, Smith T, Clifford R, Robbe P, Burns A, Vavoulis DV, Lopez L, Antoniou P, Mason J, Dreau H, Schuh A. Targeted deep sequencing reveals relevant subclonal IgHV rearrangements in chronic lymphocytic leukemia. Leukemia. 2017;31:837–845. doi: 10.1038/leu.2016.307. - DOI - PubMed
    1. Samorodnitsky E, Datta J, Jewell BM, Hagopian R, Miya J, Wing MR, Damodaran S, Lippus JM, Reeser JW, Bhatt D, Timmers CD, Roychowdhury S. Comparison of custom capture for targeted next-generation DNA sequencing. J Mol Diagn. 2015;17:64–75. doi: 10.1016/j.jmoldx.2014.09.009. - DOI - PMC - PubMed
    1. Leanne de Kock Y, Wang C, Revil T, Badescu D, Rivera B, Sabbaghian N, Wu M, Weber E, Sandoval C, Hopman SMJ, Merks JHM, van Hagen JM, Bouts AHM, Plager DA, Ramasubramanian A, Forsmark L, Doyle KL, Toler T, Callahan J, Engelenberg C, Soglio DB-D, Priest JR, Ragoussis J, Foulkes WD. High-sensitiviy sequencing reveals multi-organ somatic mosaicism causing DICER1 syndrome. J Med Genet. 2016;53:43–52. doi: 10.1136/jmedgenet-2015-103428. - DOI - PubMed
    1. van Rossum G, de Boer J. Interactively testing remote servers using the python programming language. CWI Q. 1991;4:283–303.