Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Feb 21;15(2):e0229063.
doi: 10.1371/journal.pone.0229063. eCollection 2020.

The stochastic nature of errors in next-generation sequencing of circulating cell-free DNA

Affiliations

The stochastic nature of errors in next-generation sequencing of circulating cell-free DNA

David A Nix et al. PLoS One. .

Abstract

Challenges with distinguishing circulating tumor DNA (ctDNA) from next-generation sequencing (NGS) artifacts limits variant searches to established solid tumor mutations. Here we show early and random PCR errors are a principal source of NGS noise that persist despite duplex molecular barcoding, removal of artifacts due to clonal hematopoiesis of indeterminate potential, and suppression of patterned errors. We also demonstrate sample duplicates are necessary to eliminate the stochastic noise associated with NGS. Integration of sample duplicates into NGS analytics may broaden ctDNA applications by removing NGS-related errors that confound identification of true very low frequency variants during searches for ctDNA without a priori knowledge of specific mutations to target.

PubMed Disclaimer

Conflict of interest statement

Author SH is an employee of ARUP Laboratories, a commercial company. This does not alter our adherence to PLOS ONE policies on sharing data and materials. All other authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Comparison of sequencing metrics between adapter types.
The total number of reads where both read 1 and read 2 were present was similar between the singleton and duplex adapter groups (a). However, after consensus sequence determination, there were significantly fewer consensus sequences (b) and larger overall family sizes (c) in the duplex adapter group. Read depth (d) was greater in the duplex adapters at larger family sizes. Bar and whiskers represent mean±SD. Data points shown in (d) represent the mean value from the seven control samples.
Fig 2
Fig 2. Use of singleton and duplex adapters to reduce noise.
Compared to singleton adapters, consensus sequences derived from duplex adapters provided greater error correction (a) and lower error (b) at family size (FS) ≥2. The gain in error correction using duplex adapters was similar regardless of family size and increments in family size reduced error regardless of adapter type (c). Although errors most commonly occurred at an allele frequency <0.1%, a substantial portion of errors had an allele frequency >0.1% (d). Bar and whiskers represent mean±SD. Data points shown in (c) represent the mean value from the seven control samples.
Fig 3
Fig 3. Sources of error and effects of error correction in ccfDNA.
Removal of potential CHIP-related artifacts had a relatively small impact on error (a), particularly when compared to removal of highly patterned error (i.e., positions with errors in all seven controls; b). The greatest reduction in error occurred with application of sample duplicates (c). The effects on error from using duplex adapters (X), accounting for CHIP-related artifacts (C), removing positions with highly patterned error (P), and applying data from samples duplicates (D) are shown individually in (d). Accounting for all of the different sources of noise yielded the lowest error (d, pink), which continually decreased with increments in family size. Data points shown in (d) represent the mean value from the seven control samples. FS = family size; CHIP = clonal hematopoiesis of indeterminate potential.
Fig 4
Fig 4. Distribution of NRA types associated with ccfDNA.
All data are shown for family size (FS) ≥2. The number of counts for each type of the twelve possible base pair changes is shown in (a) for all observed NRAs. In (b), the percent of the total NRAs for each base pair change is shown. The distribution for base pair changes associated with CHIP-related artifacts and highly patterned error (NRAs common to all seven samples) is shown in (c) and (d), respectively. For CHIP-related artifacts, only results from duplex adapters are shown because buffy coat DNA was sequenced only with duplex adapters. In (d), error bars are absent because the NRAs were present in all seven samples for each adapter type. In (e), the number of counts associated with each type of base pair change present in both of the full library duplex adapter duplicates is depicted. After using full library duplex adapter duplicates to reduce error and removing CHIP-related artifacts and highly patterned error, the distribution of each type of base pair change is shown in (f). Error bars represent SD. CHIP = clonal hematopoiesis of indeterminate potential.

References

    1. Siravegna G, Marsoni S, Siena S, Bardelli A. Integrating liquid biopsies into the management of cancer. Nat Rev Clin Oncol. 2017;14(9):531–48. Epub 2017/03/03. 10.1038/nrclinonc.2017.14 . - DOI - PubMed
    1. Sun K, Jiang P, Chan KC, Wong J, Cheng YK, Liang RH, et al. Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments. Proc Natl Acad Sci U S A. 2015;112(40):E5503–12. 10.1073/pnas.1508736112 . - DOI - PMC - PubMed
    1. Dawson SJ, Tsui DW, Murtaza M, Biggs H, Rueda OM, Chin SF, et al. Analysis of circulating tumor DNA to monitor metastatic breast cancer. N Engl J Med. 2013;368(13):1199–209. 10.1056/NEJMoa1213261 . - DOI - PubMed
    1. Diehl F, Li M, Dressman D, He Y, Shen D, Szabo S, et al. Detection and quantification of mutations in the plasma of patients with colorectal tumors. Proc Natl Acad Sci U S A. 2005;102(45):16368–73. 10.1073/pnas.0507904102 . - DOI - PMC - PubMed
    1. Bettegowda C, Sausen M, Leary RJ, Kinde I, Wang Y, Agrawal N, et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci Transl Med. 2014;6(224):224ra24 Epub 2014/02/21. 10.1126/scitranslmed.3007094 . - DOI - PMC - PubMed

Publication types

Substances