A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal
- PMID: 29415044
- PMCID: PMC5832436
- DOI: 10.1371/journal.pcbi.1005965
A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal
Abstract
A key constraint in genomic testing in oncology is that matched normal specimens are not commonly obtained in clinical practice. Thus, while well-characterized genomic alterations do not require normal tissue for interpretation, a significant number of alterations will be unknown in whether they are germline or somatic, in the absence of a matched normal control. We introduce SGZ (somatic-germline-zygosity), a computational method for predicting somatic vs. germline origin and homozygous vs. heterozygous or sub-clonal state of variants identified from deep massively parallel sequencing (MPS) of cancer specimens. The method does not require a patient matched normal control, enabling broad application in clinical research. SGZ predicts the somatic vs. germline status of each alteration identified by modeling the alteration's allele frequency (AF), taking into account the tumor content, tumor ploidy, and the local copy number. Accuracy of the prediction depends on the depth of sequencing and copy number model fit, which are achieved in our clinical assay by sequencing to high depth (>500x) using MPS, covering 394 cancer-related genes and over 3,500 genome-wide single nucleotide polymorphisms (SNPs). Calls are made using a statistic based on read depth and local variability of SNP AF. To validate the method, we first evaluated performance on samples from 30 lung and colon cancer patients, where we sequenced tumors and matched normal tissue. We examined predictions for 17 somatic hotspot mutations and 20 common germline SNPs in 20,182 clinical cancer specimens. To assess the impact of stromal admixture, we examined three cell lines, which were titrated with their matched normal to six levels (10-75%). Overall, predictions were made in 85% of cases, with 95-99% of variants predicted correctly, a significantly superior performance compared to a basic approach based on AF alone. We then applied the SGZ method to the COSMIC database of known somatic variants in cancer and found >50 that are in fact more likely to be germline.
Conflict of interest statement
I have read the journal's policy and the authors of this manuscript have the following competing interests: JXS, YH, MM, GMF, JSR, VAM, PJS, and DL are paid employees of Foundation Medicine. JXS, YH, ES, GMF, JSR, VAM, PJS, DL, and RY are shareholders of Foundation Medicine. ES and RY are former employees of Foundation Medicine, but were employees when the studies in this paper were conducted.
Figures



Similar articles
-
A method to reduce ancestry related germline false positives in tumor only somatic variant calling.BMC Med Genomics. 2017 Oct 19;10(1):61. doi: 10.1186/s12920-017-0296-8. BMC Med Genomics. 2017. PMID: 29052513 Free PMC article.
-
Discrimination of Germline EGFR T790M Mutations in Plasma Cell-Free DNA Allows Study of Prevalence Across 31,414 Cancer Patients.Clin Cancer Res. 2017 Dec 1;23(23):7351-7359. doi: 10.1158/1078-0432.CCR-17-1745. Epub 2017 Sep 25. Clin Cancer Res. 2017. PMID: 28947568 Free PMC article.
-
Personalized genomic analyses for cancer mutation discovery and interpretation.Sci Transl Med. 2015 Apr 15;7(283):283ra53. doi: 10.1126/scitranslmed.aaa7161. Sci Transl Med. 2015. PMID: 25877891 Free PMC article.
-
Clonal evolution of acute myeloid leukemia highlighted by latest genome sequencing studies.Oncotarget. 2016 Sep 6;7(36):58586-58594. doi: 10.18632/oncotarget.10850. Oncotarget. 2016. PMID: 27474172 Free PMC article. Review.
-
Somatic mutation panels: Time to clear their names.Cancer Genet. 2019 Jun;235-236:84-92. doi: 10.1016/j.cancergen.2019.04.065. Epub 2019 Apr 26. Cancer Genet. 2019. PMID: 31101556 Review.
Cited by
-
Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms.Cancer Res. 2023 Jan 4;83(1):49-58. doi: 10.1158/0008-5472.CAN-22-0682. Cancer Res. 2023. PMID: 36351074 Free PMC article.
-
Liquid Profiling for Cancer Patient Stratification in Precision Medicine-Current Status and Challenges for Successful Implementation in Standard Care.Diagnostics (Basel). 2022 Mar 19;12(3):748. doi: 10.3390/diagnostics12030748. Diagnostics (Basel). 2022. PMID: 35328301 Free PMC article. Review.
-
LACE 2.0: an interactive R tool for the inference and visualization of longitudinal cancer evolution.BMC Bioinformatics. 2023 Mar 17;24(1):99. doi: 10.1186/s12859-023-05221-3. BMC Bioinformatics. 2023. PMID: 36932333 Free PMC article.
-
Deletions on 9p21 are associated with worse outcomes after anti-PD-1/PD-L1 monotherapy but not chemoimmunotherapy.NPJ Precis Oncol. 2022 Jun 23;6(1):44. doi: 10.1038/s41698-022-00286-4. NPJ Precis Oncol. 2022. PMID: 35739333 Free PMC article.
-
Implementing tumor mutational burden (TMB) analysis in routine diagnostics-a primer for molecular pathologists and clinicians.Transl Lung Cancer Res. 2018 Dec;7(6):703-715. doi: 10.21037/tlcr.2018.08.14. Transl Lung Cancer Res. 2018. PMID: 30505715 Free PMC article. Review.
References
-
- Frampton GM, Fichtenholtz A, Otto GA, Wang K, Downing SR, He J, et al. Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing. Nat Biotechnol. 2013;31(11):1023–31. doi: 10.1038/nbt.2696 - DOI - PMC - PubMed
-
- Stephens PJ, Tarpey PS, Davies H, Van Loo P, Greenman C, Wedge DC, et al. The landscape of cancer genes and mutational processes in breast cancer. Nature. 2012;486(7403):400–4. doi: 10.1038/nature11017 - DOI - PMC - PubMed
-
- Cancer Genome Atlas Research N. Comprehensive molecular characterization of gastric adenocarcinoma. Nature. 2014;513(7517):202–9. doi: 10.1038/nature13480 - DOI - PMC - PubMed
-
- Kanchi KL, Johnson KJ, Lu C, McLellan MD, Leiserson MD, Wendl MC, et al. Integrated analysis of germline and somatic variants in ovarian cancer. Nat Commun. 2014;5:3156 doi: 10.1038/ncomms4156 - DOI - PMC - PubMed
-
- Koboldt DC, Steinberg KM, Larson DE, Wilson RK, Mardis ER. The next-generation sequencing revolution and its impact on genomics. Cell. 2013;155(1):27–38. doi: 10.1016/j.cell.2013.09.006 - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources