Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan;21(1):53-61.
doi: 10.1038/s41436-018-0016-6. Epub 2018 Aug 13.

Development of an evidence-based algorithm that optimizes sensitivity and specificity in ES-based diagnostics of a clinically heterogeneous patient population

Affiliations

Development of an evidence-based algorithm that optimizes sensitivity and specificity in ES-based diagnostics of a clinically heterogeneous patient population

Peter Bauer et al. Genet Med. 2019 Jan.

Abstract

Purpose: Next-generation sequencing (NGS) is rapidly replacing Sanger sequencing in genetic diagnostics. Sensitivity and specificity of NGS approaches are not well-defined, but can be estimated from applying NGS and Sanger sequencing in parallel. Utilizing this strategy, we aimed at optimizing exome sequencing (ES)-based diagnostics of a clinically diverse patient population.

Methods: Consecutive DNA samples from unrelated patients with suspected genetic disease were exome-sequenced; comparatively nonstringent criteria were applied in variant calling. One thousand forty-eight variants in genes compatible with the clinical diagnosis were followed up by Sanger sequencing. Based on a set of variant-specific features, predictors for true positives and true negatives were developed.

Results: Sanger sequencing confirmed 81.9% of ES-derived variants. Calls from the lower end of stringency accounted for the majority of the false positives, but also contained ~5% of the true positives. A predictor incorporating three variant-specific features classified 91.7% of variants with 100% specificity and 99.75% sensitivity. Confirmation status of the remaining variants (8.3%) was not predictable.

Conclusions: Criteria for variant calling in ES-based diagnostics impact on specificity and sensitivity. Confirmatory sequencing for a proportion of variants, therefore, remains a necessity. Our study exemplifies how these variants can be defined on an empirical basis.

Keywords: Genetic testing; Laboratory standards; Sensitivity; Specificity; exome sequencing.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Fig. 1
Fig. 1. Performance of filtering criteria commonly used during variant calling in next-generation sequencing (NGS).
a Distribution of Sanger-confirmed vs. Sanger-rejected variants in our data set. b Modeling of the effects of standard panel- and exome sequencing (ES)-associated filtering cutoffs. c Contribution of individual parameters to the drop in sensitivity (stippled and dotted gray lines: common cutoffs for panel-based and ES-based approaches, respectively; solid gray lines: cutoffs initially used by the present study). The asterisk denotes that sensitivity as indicated here is not equivalent to the (unknown) overall sensitivity of the assay, but refers to our set of 1048 Sanger followed-up candidate variants
Fig. 2
Fig. 2. Features of exome sequencing (ES)-derived candidate variants, and association with Sanger confirmation status.
Stippled gray lines indicate the corresponding values for all 1,048 variants. a Analogous features. Significance of differences between means (+/- SEM) was calculated using the two-sided Student’s t-test. b Digital features. Significance of differences was calculated using Fisher’s exact test
Fig. 3
Fig. 3. Iterative analyses aiming at the definition of classifiers that can predict true positive or false positive ES-derived candidate variants while maintaining high specificity and sensitivity.
Analogous features are depicted as receiver operating characteristic curves. Binary classifications according to digital features are indicated by filled symbols. a First round of analysis on all 1,048 variants. Arrow: “quality”-based binary classifier that correctly predicts status “confirmed” for 813 variants; 100% specificity is thus retained. b Second round of analysis on the 235 variants that remained after round 1. Arrow: “variant reads”–based binary classifier that correctly predicts status “not confirmed” for 87 of 88 variants. Overall sensitivity decreases to 99.9%. c Third round of analysis on the 147 variants that remained after round 2. Arrow: “frequency”-based binary classifier that correctly predicts status “not confirmed” for 59 of 60 variants. Overall sensitivity decreases to 99.8%. d A fourth round of analysis on the 87 variants remaining after round 3 does not define additional classifiers
Fig. 4
Fig. 4. Characteristics of the three groups created by our classifiers.
a Group-specific means (+/- SEM) for analogous features (upper panel), and intragroup distribution of states for digital features (lower panel). Note that for the majority of features, the variants that require Sanger sequencing occupy an intermediate position. b Performance of filtering criteria in the set of variants that require Sanger-based confirmation (compare Fig. 1a, b)
Fig. 5
Fig. 5. Workflow for following up on identification of a candidate variant by exome sequencing (ES).
Note that this workflow is based on the data set presented in the present study, and is also specific to it (corresponding numbers of variants are indicated in parentheses). Gray sectors and corresponding percentages in pie charts indicate the remaining Sanger sequencing load

References

    1. Boycott KM, Vanstone MR, Bulman DE, et al. Rare-disease genetics in the era of next-generation sequencing: discovery to translation. Nat Rev Genet. 2013;14:681–91. doi: 10.1038/nrg3555. - DOI - PubMed
    1. Pinto AM, Ariani F, Bianciardi L, et al. Exploiting the potential of next-generation sequencing in genomic medicine. Expert Rev Mol Diagn. 2016;16:1037–47. doi: 10.1080/14737159.2016.1224181. - DOI - PubMed
    1. Tewhey R, Nakano M, Wang X, et al. Enrichment of sequencing targets from the human genome by solution hybridization. Genome Biol. 2009;10:R116. doi: 10.1186/gb-2009-10-10-r116. - DOI - PMC - PubMed
    1. Rehm HL, Bale SJ, Bayrak-Toydemir P, et al. ACMG clinical laboratory standards for next-generation sequencing. Genet Med. 2013;15:733–47. doi: 10.1038/gim.2013.92. - DOI - PMC - PubMed
    1. Shearer AE, DeLuca AP, Hildebrand MS, et al. Comprehensive genetic testing for hereditary hearing loss using massively parallel sequencing. Proc Natl Acad Sci USA. 2010;107:21104–9. doi: 10.1073/pnas.1012989107. - DOI - PMC - PubMed

MeSH terms