Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2015 Dec 7;10(12):e0144162.
doi: 10.1371/journal.pone.0144162. eCollection 2015.

Comparison of Accuracy of Whole-Exome Sequencing with Formalin-Fixed Paraffin-Embedded and Fresh Frozen Tissue Samples

Affiliations
Comparative Study

Comparison of Accuracy of Whole-Exome Sequencing with Formalin-Fixed Paraffin-Embedded and Fresh Frozen Tissue Samples

Ensel Oh et al. PLoS One. .

Abstract

Formalin fixing with paraffin embedding (FFPE) has been a standard sample preparation method for decades, and archival FFPE samples are still very useful resources. Nonetheless, the use of FFPE samples in cancer genome analysis using next-generation sequencing, which is a powerful technique for the identification of genomic alterations at the nucleotide level, has been challenging due to poor DNA quality and artificial sequence alterations. In this study, we performed whole-exome sequencing of matched frozen samples and FFPE samples of tissues from 4 cancer patients and compared the next-generation sequencing data obtained from these samples. The major differences between data obtained from the 2 types of sample were the shorter insert size and artificial base alterations in the FFPE samples. A high proportion of short inserts in the FFPE samples resulted in overlapping paired reads, which could lead to overestimation of certain variants; >20% of the inserts in the FFPE samples were double sequenced. A large number of soft clipped reads was found in the sequencing data of the FFPE samples, and about 30% of total bases were soft clipped. The artificial base alterations, C>T and G>A, were observed in FFPE samples only, and the alteration rate ranged from 200 to 1,200 per 1M bases when sequencing errors were removed. Although high-confidence mutation calls in the FFPE samples were compatible to that in the frozen samples, caution should be exercised in terms of the artifacts, especially for low-confidence calls. Despite the clearly observed artifacts, archival FFPE samples can be a good resource for discovery or validation of biomarkers in cancer research based on whole-exome sequencing.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Distribution of insert sizes and frequencies of double-sequenced regions.
The distribution of insert sizes was calculated from properly mapped paired reads. The distributions of formalin-fixed paraffin-embedded (FFPE) samples were skewed to the left because of a large number of short inserts (A). The short inserts generated abundant overlapping paired ends in FFPE samples (B), and soft clipped bases (C).
Fig 2
Fig 2. Distribution of nucleotide quality according to mapping status.
The nucleotide quality scores were analyzed according to mapping status. In the mapped reads, there was no significant difference between the distributions of the 2 types of sample. In the unmapped reads, the blood and frozen samples showed a higher percentage of low-quality bases (≤ 20 on the Phred scale, black arrow) compared to the FFPE samples.
Fig 3
Fig 3. Frequency of base transition in formalin-fixed paraffin-embedded (FFPE) samples.
A: A strategy for estimation of rates of sequencing errors/background DNA damage and overall base alteration rates. The discrepant bases at homozygous sites in control samples are likely to be sequencing errors or background DNA mutation. Conversely, discrepant bases at homozygous sites in frozen or FFPE tissue samples could be either sequencing errors/background DNA damage or base alteration caused by preservation methods. B: The rate of base alterations caused by formalin fixation. High frequencies of C>T and G>A were observed in FFPE tissue samples only.
Fig 4
Fig 4. Comparison of somatic single nucleotide variant (SNV) calls.
The overall concordance of somatic SNV calls between FFPE-frozen paired samples. A. The overlapping fraction of somatic mutation calls. B. The overlapping fraction of somatic mutation calls in FFPE samples when the matched frozen samples have at least one supporting read at the mutation.

References

    1. Srinivasan M, Sedmak D, Jewell S. Effect of fixatives and tissue processing on the content and integrity of nucleic acids. The American journal of pathology. 2002;161(6):1961–71. 10.1016/S0002-9440(10)64472-0 - DOI - PMC - PubMed
    1. Williams C, Ponten F, Moberg C, Soderkvist P, Uhlen M, Ponten J, et al. A high frequency of sequence alterations is due to formalin fixation of archival specimens. The American journal of pathology. 1999;155(5):1467–71. 10.1016/S0002-9440(10)65461-2 - DOI - PMC - PubMed
    1. Paik S, Shak S, Tang G, Kim C, Baker J, Cronin M, et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. The New England journal of medicine. 2004;351(27):2817–26. 10.1056/NEJMoa041588 . - DOI - PubMed
    1. Choi YL, Oh E, Park S, Kim Y, Park YH, Song K, et al. Triple-negative, basal-like, and quintuple-negative breast cancers: better prediction model for survival. BMC cancer. 2010;10:507 10.1186/1471-2407-10-507 - DOI - PMC - PubMed
    1. Espinosa E, Sanchez-Navarro I, Gamez-Pozo A, Marin AP, Hardisson D, Madero R, et al. Comparison of prognostic gene profiles using qRT-PCR in paraffin samples: a retrospective study in patients with early breast cancer. PloS one. 2009;4(6):e5911 10.1371/journal.pone.0005911 - DOI - PMC - PubMed

Publication types