Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comment
. 2018 Sep 28;361(6409):eaas9824.
doi: 10.1126/science.aas9824.

Comment on "DNA damage is a pervasive cause of sequencing errors, directly confounding variant identification"

Affiliations
Comment

Comment on "DNA damage is a pervasive cause of sequencing errors, directly confounding variant identification"

Chip Stewart et al. Science. .

Abstract

Chen et al (Reports, 17 February 2017, p. 752) highlight an important problem of sequencing artifacts caused by DNA damage at the time of sample processing. However, their manuscript contains several errors that led the authors to incorrect conclusions. Moreover, the same sequencing artifacts were previously described and mitigated in The Cancer Genome Atlas and other published sequencing projects.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) Picard oxoQ score vs. GIVG_T for &1,900 TCGA tumor exomes. Lack of agreement between oxoQ and GIVG_T scores is apparent in samples with high DNA damage (oxoQ < 30) highlighted in the gray box. GIVG_T is calculated with Chen et al.’s code (estimate_damage.pl) with default parameters; oxoQ was calculated with Picard CollectSequencingArtifactMetrics (11) output. (B) OxoQ vs. GIVG_T corrected (using base quality Q > 20 and coverage depth ≥ 20, comparable to Picard defaults) showing excellent agreement. (C) Histogram of individual base qualities in a typical TCGA 8-oxoG damaged sample (oxoQ = 24.2) compared to a low-damage sample (oxoQ=48.9). Most of the bases in the damaged sample have Q < 30 and hence are ignored by default GIVG_T. This explains both the lack of agreement between the scores seen in (A) and authors missing the sequence context associated with 8-oxoG damage (D vs E) –– the Q > 30 filtering removed bases with the type of damage that is being quantified. “Lego” plot showing the distribution of errors in different 5′ and 3′ sequence contexts (plotted in reverse complement by convention), with Chen et al.’s (D) and our corrected (E) filtering criteria, showing distortion from the characteristic 8-oxoG context pattern arising from data selection criteria. (F) The fraction of bases that remain after applying Chen et al.’s filters show that most bases are removed in highly damaged samples.
Figure 2.
Figure 2.
Comparison of 8-oxoG related error rates and other error modes. Samples (A-C) with corrected GIVG_T damage levels below 5 (equivalent to oxoQ & 35) have fewer 8-oxoG–related base errors (red) compared to other sequencing error modes (blue). (A) Count of tumors with a specific corrected GIVG_T score (majority of TCGA samples have very low damage levels). (B) Percentage of additional base mismatches caused by 8-oxoG in relation to GIVG_T score. (C) 8-oxoG–related error rates (red) and other error modes (blue). Other modes dominate until corrected GIVG_T & 5. (D) Copy of Fig 4E from Chen et al. (1). (E) Our estimated FDR vs. corrected GIVG_T for &1,900 TCGA tumors from MuTect (14) before 8-oxoG filtering (GIVG_T < 7.5). (F, G) Estimated FDR vs. corrected GIVG_T for &1,900 TCGA tumors from MuTect before (F) and after (G) 8-oxoG filtering (as described in (2)). Results demonstrate no inflation in number of mutation calls after applying the 8-oxoG software filter.

Comment in

Comment on

References

    1. Chen L, Liu P, Evans TC, Ettwiller LM, DNA damage is a pervasive cause of sequencing errors, directly confounding variant identification., Science 355, 752–756 (2017). - PubMed
    1. Costello M, Pugh TJ, Fennell TJ, Stewart C, Lichtenstein L, Meldrim JC, Fostel JL, Friedrich DC, Perrin D, Dionne D, Kim S, Gabriel SB, Lander ES, Fisher S, Getz G, Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation., Nucleic Acids Res. 41, e67 (2013). - PMC - PubMed
    1. Pugh TJ, Morozova O, Attiyeh EF, Asgharzadeh S, Wei JS, Auclair D, Carter SL, Cibulskis K, Hanna M, Kiezun A, Kim J, Lawrence MS, Lichenstein L, McKenna A, Pedamallu CS, Ramos AH, Shefler E, Sivachenko A, Sougnez C, Stewart C, Ally A, Birol I, Chiu R, Corbett RD, Hirst M, Jackman SD, Kamoh B, Khodabakshi AH, Krzywinski M, Lo A, Moore RA, Mungall KL, Qian J, Tam A, Thiessen N, Zhao Y, Cole KA, Diamond M, Diskin SJ, Mosse YP, Wood AC, Ji L, Sposto R, Badgett T, London WB, Moyer Y, Gastier-Foster JM, Smith MA, Guidry Auvil JM, Gerhard DS, Hogarty MD, Jones SJ, Lander ES, Gabriel SB, Getz G, Seeger RC, Khan J, Marra MA, Meyerson M, Maris JM, The genetic landscape of high-risk neuroblastoma., Nat. Genet. 45, 279–84 (2013). - PMC - PubMed
    1. Crompton BD, Stewart C, Taylor-Weiner A, Alexe G, Kurek KC, Calicchio ML, Kiezun A, Carter SL, Shukla SA, Mehta SS, Thorner AR, de Torres C, Lavarino C, Suñol M, McKenna A, Sivachenko A, Cibulskis K, Lawrence MS, Stojanov P, Rosenberg M, Ambrogio L, Auclair D, Seepo S, Blumenstiel B, DeFelice M, Imaz-Rosshandler I, Schwarz-Cruz Y Celis A, Rivera MN, Rodriguez-Galindo C, Fleming MD, Golub TR, Getz G, Mora J, Stegmaier K, The genomic landscape of pediatric Ewing sarcoma., Cancer Discov. 4, 1326–41 (2014). - PubMed
    1. Integrated genomic characterization of papillary thyroid carcinoma., Cell 159, 676–90 (2014). - PMC - PubMed

LinkOut - more resources